paperarXivTrust 82 · PrimaryPublished 2d agoLive · yesterday
SWE-Doctor: Guiding Software Engineering Agents with Runtime Diagnosis from Multi-Faceted Bug Reproduction Tests
Large language model (LLM)-based software engineering agents are increasingly developed to resolve software issues by generating patches from issue reports and code repositories. Bug reproduction tests (BRTs) are an important building block for such agents and have been shown useful for patch validation. However, it remains unclear whether BRTs can also help the more central stage of patch generation. We first conduct a preliminary study and find that directly using advanced BRT generators to guide patch generation is not beneficial: fail-to-fail BRTs can mislead agents, while even fail-to-pas
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
Covers
Implements
Related to
Implements (incoming)
Related across the graph
repobug-ops/zephrepomozilla/bugbugnewsDebugging production agents with Amazon Bedrock AgentCore ObservabilityrepoNayjest/GitonewsPatch the Planet: a Daybreak initiative to support open source maintainersrepohuhusmang/Awesome-LLMs-for-Vulnerability-DetectionnewsA system-level approach to prompt injection: separating instruction and data channels in LLM agents [P]repopromptfoo/promptfoorepogolobokov.misha/llm-review-agentsrepoagent-toolstoolAgentTrace
