Large Language Models
Large Language Models (LLMs) such as ChatGPT, DeepSeek, Gemini, and Claude have taken the world by storm.
In this course, you’ll learn the foundations of how LLMs work: the architecture of transformer models, tokenizers, positional encoding, and, in depth, the attention mechanism at the core of transformers.
- Chapter 1: Attention Mechanisms
- Chapter 2: Transformer Architecture