Topic cluster · 2 items

attention

paper

Sparse attention at million-token context

A linear-cost attention variant that holds quality past a million tokens.

repo

attention-zoo

Implementations of many attention variants, benchmarked.

Related topics