Understanding the Differences in Language Models - Transformers vs. Markov Models
2023-10-07
data:image/s3,"s3://crabby-images/9fc83/9fc83347a17e6a9d098ce5ca5ae8388cee23560c" alt=""
This article explores distinguishing details of Markov Models and Transformer-based models like GPT, focusing on how they predict the next character in a sequence. It explores the fundamental differences between these models, with a particular emphasis on how the self-attention mechanism in Transformer models makes a difference compared to the fixed context length in Markov models.
Continue reading