repoGitHubTrust 82 · PrimaryPublished 19h agoLive · 18h ago
quic/efficient-transformers
This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transformers library) into inference-ready formats that run efficiently on Qualcomm Cloud AI 100 accelerators.
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
Implements
Covers
newsDesigning the hf CLI as an agent-optimized way to work with the HubnewsHardware startup unveils inference acceleratornewsMonitor and debug generative AI inference with SageMaker detailed metrics and Insights dashboard on CloudWatchnewsA barebones CPU-only inference engine for Qwen 3, written from scratch in pure C
Related across the graph
newsDesigning the hf CLI as an agent-optimized way to work with the HubpaperFlexViT: A Flexible FPGA-based Accelerator for Edge Vision TransformersnewsMonitor and debug generative AI inference with SageMaker detailed metrics and Insights dashboard on CloudWatchnewsHardware startup unveils inference acceleratornewsA barebones CPU-only inference engine for Qwen 3, written from scratch in pure C
