paper · arXiv · Trust 82 · Primary · Published yesterday · Live · 19h ago

Efficient PEFT Methods with Adaptive Checkpointing for Vision Models and VLMs on Resource Constrained Consumer-GPUs

Modern pretrained vision models achieve strong accuracy but demand substantial GPU memory for fine-tuning, making edge deployment impractical. This paper compares five fine-tuning methods: full fine-tuning (Full FT) and four parameter-efficient fine-tuning (PEFT) methods (LoRA, AdaLoRA, QLoRA, BitFit). The comparison covers Transformer-based (ViT-Small, TinyViT) and Mamba-based (Vim-Small, MambaVision-T) vision backbones under an on-device VRAM budget (e.g., 2 GB), together with three gradient-checkpointing strategies (none, static, and a proposed memory-budget-aware adaptive algorithm). It also evaluates three families of foundation-model baselines: zero-shot contr
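To make the idea of memory-budget-aware checkpointing concrete, here is a minimal Python sketch. It is not the paper's algorithm, which this summary does not describe. It is a simple greedy heuristic under stated assumptions: the `Block` fields, the crude peak-memory model in `estimate_peak_mb`, and all the megabyte figures are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Block:
    """One backbone block with hypothetical, pre-measured memory costs."""
    name: str
    activation_mb: float  # held for backward if the block is NOT checkpointed
    boundary_mb: float    # held (block input only) if the block IS checkpointed


def estimate_peak_mb(blocks: List[Block], flags: List[bool], fixed_mb: float) -> float:
    """Crude peak estimate (an assumption, not a profiler measurement).

    fixed_mb covers weights, gradients, and optimizer state. Checkpointed
    blocks keep only their input, but recomputing one during backward
    briefly restores its full activations, so the largest such transient
    is added on top.
    """
    stored = sum(b.boundary_mb if f else b.activation_mb for b, f in zip(blocks, flags))
    transient = max(
        (b.activation_mb - b.boundary_mb for b, f in zip(blocks, flags) if f),
        default=0.0,
    )
    return fixed_mb + stored + transient


def plan_checkpoints(
    blocks: List[Block], fixed_mb: float, budget_mb: float
) -> Tuple[List[bool], float]:
    """Greedily checkpoint the blocks with the largest savings until the
    estimate fits the budget, or every block is checkpointed."""
    flags = [False] * len(blocks)
    order = sorted(
        range(len(blocks)),
        key=lambda i: blocks[i].activation_mb - blocks[i].boundary_mb,
        reverse=True,
    )
    peak = estimate_peak_mb(blocks, flags, fixed_mb)
    for i in order:
        if peak <= budget_mb:
            break
        flags[i] = True
        peak = estimate_peak_mb(blocks, flags, fixed_mb)
    return flags, peak


if __name__ == "__main__":
    # Four hypothetical blocks and a 2 GB (2048 MB) budget.
    demo = [Block(f"block{i}", activation_mb=400.0, boundary_mb=20.0) for i in range(4)]
    flags, peak = plan_checkpoints(demo, fixed_mb=800.0, budget_mb=2048.0)
    print(flags, peak)
```

In this framing, "none" corresponds to all flags being `False` and a static strategy to a fixed pattern chosen in advance, while the adaptive variant picks the flags from the budget. In a PyTorch training loop, the flagged blocks could then be wrapped with `torch.utils.checkpoint.checkpoint`. The returned estimate can still exceed the budget when even full checkpointing does not fit, so a caller would need to check it.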

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Why these links exist

Linked via arXiv author Altay Toktassyn → Efficient PEFT Methods with Adaptive Checkpointing for Vision Models and VLMs on Resource Constrained Consumer-GPUs
Linked via arXiv author Jurn-Gyu Park → Efficient PEFT Methods with Adaptive Checkpointing for Vision Models and VLMs on Resource Constrained Consumer-GPUs

Implements

repo: open-edge-platform/geti · repo: DoubangoTelecom/compv · repo: NVIDIA-NeMo/Curator

Covers

news: Going from single GPU to dual GPU is nice but not in the way I expected

Implements (incoming)

repo: NVIDIA/TransformerEngine

Authored (incoming)

person: Altay Toktassyn · person: Jurn-Gyu Park

Related across the graph

repo: DoubangoTelecom/compv · repo: NVIDIA/TransformerEngine · person: Altay Toktassyn · repo: open-edge-platform/geti · person: Jurn-Gyu Park · news: Going from single GPU to dual GPU is nice but not in the way I expected · repo: NVIDIA-NeMo/Curator

Topics

cs.CV