gpt-oss
9 items across the graph, tagged with gpt-oss.
From the graph · 9
Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
A high-throughput and memory-efficient inference and serving engine for LLMs.
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
SGLang is a high-performance serving framework for large language models and multimodal models.
[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.
Run frontier LLMs and VLMs locally on Qualcomm devices across NPU, GPU, and CPU with a few lines of code.
TokenSpeed is a speed-of-light LLM inference engine.
Local AI app and inference engine for agents. Run open-weight LLMs locally — private, 100% offline on your computer.
🚀 PyTorch Distributed-native training library for LLMs/VLMs with out-of-the-box (OOTB) Hugging Face support.
