A linear-cost attention variant that holds quality past a million tokens.
Implementations of many attention variants, benchmarked.