This is a basic guide to understand the fundamentals of how AI models work. It lays the foundation for deeper concepts explained in this guide.
Artificial Intelligence
, the model assigns much higher probability to related terms like machine learning
than unrelated ones like banana cake
.
Hi
from the user lets the model probably answer with a greeting. It answers with Hello
.
Then, it generates the next most likely word based on Hi
and Hello
. This process is repeated until the model decides the request was sufficiently answered.
The generation process works token by token:
Hi
Hello
Hi Hello