Learning to Evolve Scenes: Reasoning about Human Activities with Scene Graphs
Understanding human behavior while interacting with the surrounding world is crucial for many applications of embodied AI. First-person videos are particularly informative for this problem, as they well capture how activities reshape the scene over time. However, existing approaches often rely on implicit visual or language-aligned representations, disregarding structured reasoning over the scene dynamic. We argue that explicit, compositional and editable representations of human-environment interactions can play a crucial role for rich grounded activity understanding. To this end, we introduc
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
Why these links exist
- Linked via arxiv authorFrancesca Pistilli →
Learning to Evolve Scenes: Reasoning about Human Activities with Scene Graphs
- Linked via arxiv authorSimone Alberto Peirone →
Learning to Evolve Scenes: Reasoning about Human Activities with Scene Graphs
- Linked via arxiv authorGiuseppe Averta →
Learning to Evolve Scenes: Reasoning about Human Activities with Scene Graphs
