glossary_term · Angestrom
Transformer
The neural network architecture behind most modern language models.
The neural network architecture behind most modern language models. The neural network architecture behind most modern language models.
Read it here, in full.View original →
repoengineering87/llm-atlasmodelsentence-transformers/all-MiniLM-L6-v2paperWhen are likely answers right? On Sequence Probability and Correctness in LLMspaperHow Surprising Is Historical Italian to Language Models? Tokenization Tax, Comprehension Tax, and a Simple Mitigationrepominimal-diffusion-lmrepovlm-starter
modelsentence-transformers/all-MiniLM-L6-v2newsWhat exactly does word2vec learn?paperHow Surprising Is Historical Italian to Language Models? Tokenization Tax, Comprehension Tax, and a Simple MitigationpaperWhen are likely answers right? On Sequence Probability and Correctness in LLMsnewsIdentifying Interactions at Scale for LLMsrepoengineering87/llm-atlasrepominimal-diffusion-lmrepovlm-starter