paperarXivTrust 82 · PrimaryPublished 2d agoLive · 21h ago

FurnitureVLA: Learning Long-Horizon Bimanual Furniture Assembly with Vision-Language-Action Model

Current work on robot furniture assembly mostly focuses on toy-scale settings or single-arm manipulation. We introduce FurnitureVLA, the first systematic study of real-scale bimanual furniture assembly using Vision-Language-Action models (VLAs). We formalize the task, develop a scalable simulation pipeline for expert data generation and evaluation, and build a VR teleoperation system for single-operator bimanual control to collect high-quality real-world demonstrations. To address extreme long-horizon assembly with up to 7 subtasks and 1550 control steps, we propose a progress-enhanced VLA, fi

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Why these links exist

Linked via arxiv authorChenyang Ma →
FurnitureVLA: Learning Long-Horizon Bimanual Furniture Assembly with Vision-Language-Action Model
Linked via arxiv authorYue Yang →
FurnitureVLA: Learning Long-Horizon Bimanual Furniture Assembly with Vision-Language-Action Model
Linked via arxiv authorRadu Corcodel →
FurnitureVLA: Learning Long-Horizon Bimanual Furniture Assembly with Vision-Language-Action Model
Linked via arxiv authorSiddarth Jain →
FurnitureVLA: Learning Long-Horizon Bimanual Furniture Assembly with Vision-Language-Action Model
Linked via arxiv authorAndrew Wu →
FurnitureVLA: Learning Long-Horizon Bimanual Furniture Assembly with Vision-Language-Action Model
Linked via arxiv authorChiori Hori →
FurnitureVLA: Learning Long-Horizon Bimanual Furniture Assembly with Vision-Language-Action Model
Linked via arxiv authorDiego Romeres →
FurnitureVLA: Learning Long-Horizon Bimanual Furniture Assembly with Vision-Language-Action Model

Has model

modelAgentCore-8B modelVioletVision-3B

authored (incoming)

personChenyang Ma personYue Yang personRadu Corcodel personSiddarth Jain personAndrew Wu personChiori Hori personDiego Romeres

Related across the graph

personChiori Hori personSiddarth Jain personChenyang Ma personAndrew Wu modelVioletVision-3B modelAgentCore-8B personYue Yang personDiego Romeres personRadu Corcodel

Topics

cs.AI