2025-03-30
Transformers lack recurrence (unlike RNNs)
No inherent sense of word order
Solution: Add positional info to token embeddings