Topic

Ffi

15 items across the graph · 1 news story, tagged with Ffi.

Latest news

News · AI News · Live · 1mo ago

Breakthrough in long-context efficiency announced

A new attention scheme cuts memory use for very long inputs.


From the graph · 14

repo
PaddlePaddle/Paddle

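PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle core framework: high-performance single-machine and distributed training, plus cross-platform deployment, for deep learning and machine learning)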

repo
zjunlp/EasyEdit

[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.

repo
FB208/OpenBidKit_Yibiao

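An out-of-the-box AI tool for writing bid documents: AI bid generation, a bidding toolbox, a knowledge base, bid duplicate checking, and checks for disqualifying items. Fully open source and free to use.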

repo
apache/tvm-ffi

Open ABI and FFI for Machine Learning Systems

repo
Picovoice/picollm

On-device LLM Inference Powered by X-Bit Quantization

repo
Apil-Shrestha/token-efficiency-lex

Token Cost Parity: Multilingual LLM Efficiency Analysis 2026

repo
charanpreetSingh123/MediScanAI

🏥 AI-powered multi-disease early detection platform | Brain Tumor · Diabetic Retinopathy · Skin Cancer | PyTorch · FastAPI · React · Grad-CAM | B.Tech Major Pr…

model
Nano-Refuse-0.4B

A tiny safety classifier for fast content filtering.

paper
Speculative decoding with draft models

Accelerating generation by drafting tokens with a small model.

repo
quant-kit

Post-training quantization tools for transformers.

glossary term
Quantization

Shrinking a model by storing its weights at lower precision.

paper
Quantization at 1.58 bits

Ternary-weight models that retain most of full-precision quality.

model
Whisper-Lite

A compact speech-to-text model for on-device use.

tool
QuantBench

A one-click quantization and benchmarking tool.

Related topics