repoGitHubTrust 82 · PrimaryPublished 13h agoLive · 9h ago
Somnusochi/VLM-AutoYOLO
AI Auto Annotation & YOLO Training Pipeline, End-to-end object detection auto-labeling and YOLO training platform. VLM-powered annotation with NVIDIA LocateAnything-3B, manual refinement, one-click YOLO training, video keyframe extraction, and model validation. Supports image and video.
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
Covers
Implements
paperAnyGroundBench: A Specialized-Domain Benchmark for Video Grounding in Vision-Language ModelspaperObject-centric LeJEPApaperOctoSense: Self-Supervised Learning for Multimodal Robot PerceptionpaperAdaCount: Training-Free Similarity-Guided Spatial and Feature Adaptation for Zero-Shot Object Counting
Related across the graph
paperObject-centric LeJEPAnewsInto the Omniverse: Three Workflows for Improving Vision AI Agent Accuracy With Synthetic Data and Fine-TuningpaperAdaCount: Training-Free Similarity-Guided Spatial and Feature Adaptation for Zero-Shot Object CountingpaperAnyGroundBench: A Specialized-Domain Benchmark for Video Grounding in Vision-Language ModelspaperOctoSense: Self-Supervised Learning for Multimodal Robot Perception
