Fp4
4 items across the graph — tagged with Fp4.
NVIDIA/TransformerEngine
→ repo: A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwel…
Tencent/AngelSlim
→ repo: Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.
AEON-7/Qwen3.6-27B-AEON-Ultimate-Uncensored-DFlash
→ repo: Fully uncensored, capability-enhanced abliteration of Qwen3.6-27B. NVFP4 + z-lab DFlash speculative decoding (n=12) on the unified ghcr.io/aeon-7/aeon-vllm-ulti…
kekzl/imp
→ From-scratch C++/CUDA inference engine for the NVIDIA RTX 5090 (sm_120a), the best single-GPU backend for agentic AI: tool calling, long-context loops, reasoni…
