paperarXivTrust 82 · PrimaryPublished 2d agoLive · yesterday
GMO-E$^2$DIT: Grounded Multi-Operation Editing for E-Commerce Images
Real-world e-commerce image editing often requires multiple, localized, and auditable operations rather than global restyling. This compositional nature poses a dual challenge: models must precisely apply all requested edits to the correct regions while preserving unmodified content, even under ambiguous instructions. Existing one-shot editors conflate intent resolution, spatial grounding, and synthesis into a single step, frequently resulting in partial execution failures, which is unacceptable for commercial scenarios. To address this, we introduce GMO-E$^2$DIT, an agentic editing framework
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
