paperarXivTrust 82 · PrimaryPublished 5d agoLive · 3d ago

SurgVLA-Bench: Towards Evaluating Vision-Language-Action Models for Laparoscopic Surgical Robotics

Vision-Language-Action (VLA) models represent a promising direction for embodied intelligence in surgical robotics. Despite the prevalence of VLA benchmarks for general robotics, standardized evaluation platforms specifically designed for surgical contexts remain absent. To address this limitation, we present SurgVLA-Bench, the first comprehensive benchmark for evaluating VLA models in laparoscopic surgical robotics. Leveraging the SurRoL simulation platform, we construct a hierarchical task taxonomy ranging from atomic actions to complete surgical procedures, complemented by a multi-dimension

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Has model

modelVioletVision-3B

Implements

repovlm-starter

Implements (incoming)

repoVG-GUI-TASKER/VG-GUI-TASKER reposou350121/VLA-Handbook repolucidrains/mimic-video

Related across the graph

repoVG-GUI-TASKER/VG-GUI-TASKER repolucidrains/mimic-video modelVioletVision-3B reposou350121/VLA-Handbook repovlm-starter

Topics

cs.AI