paper · arXiv · Trust 82 · Primary · Published 2d ago · Live · yesterday

SWE-Doctor: Guiding Software Engineering Agents with Runtime Diagnosis from Multi-Faceted Bug Reproduction Tests

Large language model (LLM)-based software engineering agents are increasingly developed to resolve software issues by generating patches from issue reports and code repositories. Bug reproduction tests (BRTs) are an important building block for such agents and have been shown useful for patch validation. However, it remains unclear whether BRTs can also help the more central stage of patch generation. We first conduct a preliminary study and find that directly using advanced BRT generators to guide patch generation is not beneficial: fail-to-fail BRTs can mislead agents, while even fail-to-pas

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Covers

news · A system-level approach to prompt injection: separating instruction and data channels in LLM agents [P]
news · Patch the Planet: a Daybreak initiative to support open source maintainers
news · Debugging production agents with Amazon Bedrock AgentCore Observability

Implements

repo · agent-tools

Related to

Implements (incoming)

repo · bug-ops/zeph
repo · mozilla/bugbug
repo · promptfoo/promptfoo
repo · golobokov.misha/llm-review-agents
repo · huhusmang/Awesome-LLMs-for-Vulnerability-Detection
repo · Nayjest/Gito

Related across the graph

repo · bug-ops/zeph
repo · mozilla/bugbug
news · Debugging production agents with Amazon Bedrock AgentCore Observability
repo · Nayjest/Gito
news · Patch the Planet: a Daybreak initiative to support open source maintainers
repo · huhusmang/Awesome-LLMs-for-Vulnerability-Detection
news · A system-level approach to prompt injection: separating instruction and data channels in LLM agents [P]
repo · promptfoo/promptfoo
repo · golobokov.misha/llm-review-agents
repo · agent-tools
tool · AgentTrace

Topics