repoGitHubTrust 82 · PrimaryPublished 19h agoLive · 18h ago

quic/efficient-transformers

This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transformers library) into inference-ready formats that run efficiently on Qualcomm Cloud AI 100 accelerators.

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Implements

paperFlexViT: A Flexible FPGA-based Accelerator for Edge Vision Transformers

Covers

newsDesigning the hf CLI as an agent-optimized way to work with the Hub newsHardware startup unveils inference accelerator newsMonitor and debug generative AI inference with SageMaker detailed metrics and Insights dashboard on CloudWatch newsA barebones CPU-only inference engine for Qwen 3, written from scratch in pure C

Related across the graph

newsDesigning the hf CLI as an agent-optimized way to work with the Hub paperFlexViT: A Flexible FPGA-based Accelerator for Edge Vision Transformers newsMonitor and debug generative AI inference with SageMaker detailed metrics and Insights dashboard on CloudWatch newsHardware startup unveils inference accelerator newsA barebones CPU-only inference engine for Qwen 3, written from scratch in pure C

Topics