repoGitHubTrust 82 · PrimaryPublished yesterdayLive · yesterday
patrick-toulme/harnessgym
Iterative agent harness improvement: run a coding agent on a hard task, generate the reusable tooling it was missing, qualify it, and replay fresh sessions with it activated. Works with Codex and Claude Code.
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
Related to
Covers
Implements
Implements (incoming)
Related across the graph
newsI built an agent Harness for Small Models. I got Qwen 3.5 4b managing servers.paperLearning from Failure: Inference-Time Self-Improvement for Computer-Use AgentspaperReasoning effort, not tool access, buys first-try reliability in agentic code generation: an observational studymodelAgentCore-8BpaperAutoTrainess: Teaching Language Models to Improve Language Models AutonomouslypaperAgentic Hardware Design as Repository-Level Code Evolution
