‹ Back to glossary
Transformer
advanced
The neural network architecture behind all modern large language models. Introduced by Google in 2017, the transformer uses "attention mechanisms" to understand relationships between words regardless of their distance in a text. This breakthrough enabled the creation of GPT, BERT, Claude, and all other modern AI models.