Chapter 1: The Entrance (Embeddings & Positional Encodings) Our story begins with raw text—the sentence "The car is blue"—entering the machine. To be understood by the model, these words must shed their human form. They are converted into Embeddings, transforming them into vector representations of meaning. However, meaning isn't enough; order matters. So, Positional Encodings are added to these vectors, giving each word a unique address in the sequence so the model knows that "The" comes before "car."
Chapter 2: The Three Personalities (Queries, Keys, and Values) As the vectors move deeper (0:00-0:02), they undergo a linear transformation. Each input vector splits into three distinct roles, known as the Query, the Key, and the Value.
- The Query is the distinct representation of the word asking questions (e.g., "What am I related to?").
- The Key is the label or identifier for every other word.
- The Value is the actual content or substance of the word.
Chapter 3: The Great Conversation (Self-Attent