Applying Linearly Scalable Transformers to Model Longer Protein Sequences

Researchers have proposed "Performer," a new Transformer architecture that scales linearly with sequence length, based on what they call fast attention via orthogonal random features (FAVOR).
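The core idea behind FAVOR is to approximate the softmax attention kernel with random feature maps, so attention can be computed without ever materializing the quadratic L x L attention matrix. The NumPy sketch below illustrates that general idea under stated assumptions: it uses a positive exponential feature map with plain Gaussian (not orthogonalized) random projections, and all function names and parameters here are illustrative, not the paper's exact construction.

```python
import numpy as np

def positive_random_features(x, w):
    # phi(x) = exp(w @ x - ||x||^2 / 2) / sqrt(m): positive random
    # features whose inner products approximate the softmax kernel
    # exp(q . k) in expectation. (Illustrative; the paper additionally
    # orthogonalizes the projections w to reduce variance.)
    m = w.shape[0]
    return np.exp(x @ w.T - 0.5 * np.sum(x ** 2, axis=-1, keepdims=True)) / np.sqrt(m)

def linear_attention(Q, K, V, num_features=256, seed=0):
    """Approximate softmax(Q K^T / sqrt(d)) V in O(L * m * d) time
    instead of O(L^2 * d). Q, K, V have shape (L, d)."""
    d = Q.shape[-1]
    rng = np.random.default_rng(seed)
    w = rng.standard_normal((num_features, d))
    # Absorb the 1/sqrt(d) temperature into the inputs.
    q_p = positive_random_features(Q / d ** 0.25, w)   # (L, m)
    k_p = positive_random_features(K / d ** 0.25, w)   # (L, m)
    # Key trick: associativity. Compute phi(K)^T V first (m x d),
    # so the L x L attention matrix is never formed.
    kv = k_p.T @ V                      # (m, d)
    z = q_p @ k_p.sum(axis=0)           # (L,) softmax normalizer
    return (q_p @ kv) / z[:, None]      # (L, d)

if __name__ == "__main__":
    L, d = 1024, 64
    rng = np.random.default_rng(1)
    Q, K, V = (rng.standard_normal((L, d)) for _ in range(3))
    print(linear_attention(Q, K, V).shape)  # (1024, 64)
```

Because phi(K)^T V is computed before multiplying by phi(Q), the cost grows linearly rather than quadratically with sequence length, which is what makes this family of models attractive for long protein sequences.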

Source: syncedreview.com
