Read original ↗

paperarXivTrust 82 · PrimaryPublished 4d agoLive · 3d ago

Scaling the Horizon, Not the Parameters: Reaching Trillion-Parameter Performance with a 35B Agent

We introduce Agents-A1, a 35B Mixture-of-Experts Agentic Model that reaches trillion-parameter-level performance by scaling the agent horizon. We investigate agent-horizon scaling from two perspectives: scaling long-horizon trajectories and scaling heterogeneous agent abilities. To support this goal, we build a long-horizon knowledge-action infrastructure that connects external knowledge, actions, observations, and verifier outcomes, producing agentic trajectories with an average length of 45K tokens. Based on this, we train Agents-A1 with a three-stage recipe. First, we perform full-domain su

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Has model

modelAgentCore-8B

Covers

newsGeneral Intuition’s $2.3B bet that video games can train AI agents for the real world newsAlibaba's model never trained as an agent — and improved agent performance across seven benchmarks newsGoogle DeepMind is worried about what happens when millions of agents start to interact newsMonitor and debug generative AI inference with SageMaker detailed metrics and Insights dashboard on CloudWatch

Implements

repoFabiojvv/ai-cortex-hub

Covers (incoming)

newsDeepSeek open sources DSpark, a new framework to speed up LLM inference by up to 85%newsClaude Meets Blackwell Ultra: Anthropic’s Models Now Run on NVIDIA GB300 in Azure newsInto the Omniverse: Three Workflows for Improving Vision AI Agent Accuracy With Synthetic Data and Fine-Tuning

Implements (incoming)

reporeunios2024/cortex-sentinel-trading-nexus repotensorflow/serving repodeepset-ai/haystack-core-integrations

Related across the graph

newsInto the Omniverse: Three Workflows for Improving Vision AI Agent Accuracy With Synthetic Data and Fine-Tuning newsDeepSeek open sources DSpark, a new framework to speed up LLM inference by up to 85%newsGeneral Intuition’s $2.3B bet that video games can train AI agents for the real world repotensorflow/serving newsClaude Meets Blackwell Ultra: Anthropic’s Models Now Run on NVIDIA GB300 in Azure repoFabiojvv/ai-cortex-hub newsMonitor and debug generative AI inference with SageMaker detailed metrics and Insights dashboard on CloudWatch newsGoogle DeepMind is worried about what happens when millions of agents start to interact repodeepset-ai/haystack-core-integrations newsAlibaba's model never trained as an agent — and improved agent performance across seven benchmarks modelAgentCore-8B reporeunios2024/cortex-sentinel-trading-nexus

Topics