Read original ↗
paperarXivTrust 82 · PrimaryPublished 2d agoLive · yesterday

GMO-E$^2$DIT: Grounded Multi-Operation Editing for E-Commerce Images

Real-world e-commerce image editing often requires multiple, localized, and auditable operations rather than global restyling. This compositional nature poses a dual challenge: models must precisely apply all requested edits to the correct regions while preserving unmodified content, even under ambiguous instructions. Existing one-shot editors conflate intent resolution, spatial grounding, and synthesis into a single step, frequently resulting in partial execution failures, which is unacceptable for commercial scenarios. To address this, we introduce GMO-E$^2$DIT, an agentic editing framework

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Covers

Implements

Related across the graph

Topics