paperarXivTrust 82 · PrimaryPublished 4d agoLive · 3d ago
TraceLab: Characterizing Coding Agent Workloads for LLM Serving
Coding agents are rapidly becoming a major application of agentic LLMs, but serving them efficiently remains challenging. Progress on this challenge requires understanding real workload patterns, yet the data needed for such analysis is largely absent. Existing public traces and benchmarks do not capture real, day-to-day coding-agent usage across multiple agents and model families for serving-system analysis. To help fill this gap, we collect and release a trace of roughly 4,300 coding-agent sessions, containing about 350,000 LLM steps and 430,000 tool calls from our own day-to-day use of Clau
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
Implements
Related to
Covers
Covers (incoming)
newsScarfBench: Benchmarking AI Agents for Enterprise Java Framework MigrationnewsREAP: Automatic Curation of Coding Agent Benchmarks from Interactive Production Usage [R]newsStructured memory filtering with metadata in AgentCore MemorynewsSentryCode: Real-time Auditor + Honeytokens for AI Coding Agents [P]newsI spent ~4.5 months building a free, self-hosted AI gateway: one endpoint for 237 providers (90+ free), auto-fallback, and a token-compression pipeline (MIT)
Implements (incoming)
reposipyourdrink-ltd/bernsteinrepoautomagik-dev/genierepoagentforce314/clawcodexrepoJKHeadley/instarrepoBerriAI/litellmrepoactiveloopai/hivemindrepomm7894215/TokenTrackerrepogeneralaction/emdashrepoSakshxm1/hermes-agency-orchestratorrepobytechefhq/bytechefrepocomet-ml/opikrepoGreen-PT/honey-for-devsrepothedotmack/claude-memrepoheadroomlabs-ai/headroomrepoGitlawb/zerorepoJingbiaoMei/Tokdashrepolotus-data/lotusrepogolobokov.misha/llm-review-agentsreposquirrelscan/squirrelscanrepomjason/longrepoGiskard-AI/giskard-ossrepojuyterman1000/entrolyrepotarunlnmiit/autopilot-jobhuntrepoliaohch3/claude-taprepoLazyAGI/LazyLLMrepoprasenjeet-symon/ogcode
Related across the graph
repoSakshxm1/hermes-agency-orchestratorrepoLazyAGI/LazyLLMrepoGiskard-AI/giskard-ossrepoJKHeadley/instarrepojuyterman1000/entrolyrepotarunlnmiit/autopilot-jobhuntrepogeneralaction/emdashreposipyourdrink-ltd/bernsteinrepoactiveloopai/hivemindnewsSentryCode: Real-time Auditor + Honeytokens for AI Coding Agents [P]repoGitlawb/zerorepomjason/longreposquirrelscan/squirrelscannewsStructured memory filtering with metadata in AgentCore MemorynewsDebugging production agents with Amazon Bedrock AgentCore Observabilityrepomm7894215/TokenTrackerrepoJingbiaoMei/Tokdashrepoliaohch3/claude-tapnewsAgentic Resource Discovery: Let agents searchrepocomet-ml/opikrepothedotmack/claude-memnewsScarfBench: Benchmarking AI Agents for Enterprise Java Framework MigrationrepoGreen-PT/honey-for-devsrepoheadroomlabs-ai/headroomnewsI spent ~4.5 months building a free, self-hosted AI gateway: one endpoint for 237 providers (90+ free), auto-fallback, and a token-compression pipeline (MIT)repolotus-data/lotusnewsWhat's one local AI workflow you wish you'd discovered sooner?repobytechefhq/bytechefrepogolobokov.misha/llm-review-agentsrepoagentforce314/clawcodexrepoautomagik-dev/genierepoBerriAI/litellmrepoagent-toolsrepoprasenjeet-symon/ogcodenewsREAP: Automatic Curation of Coding Agent Benchmarks from Interactive Production Usage [R]toolAgentTrace
