Read original ↗
paperarXivTrust 82 · PrimaryPublished 4d agoLive · 3d ago

Scaling the Horizon, Not the Parameters: Reaching Trillion-Parameter Performance with a 35B Agent

We introduce Agents-A1, a 35B Mixture-of-Experts Agentic Model that reaches trillion-parameter-level performance by scaling the agent horizon. We investigate agent-horizon scaling from two perspectives: scaling long-horizon trajectories and scaling heterogeneous agent abilities. To support this goal, we build a long-horizon knowledge-action infrastructure that connects external knowledge, actions, observations, and verifier outcomes, producing agentic trajectories with an average length of 45K tokens. Based on this, we train Agents-A1 with a three-stage recipe. First, we perform full-domain su

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Has model

Covers

Implements

Covers (incoming)

Implements (incoming)

Related across the graph

Topics