Kimi
12 items across the graph — tagged with Kimi.
From the graph · 12
A high-throughput and memory-efficient inference and serving engine for LLMs
Hugging Face model with 2828 likes. Tags: transformers, safetensors, kimi_k25, feature-extraction, compressed-tensors, image-text-to-text, conversational, custo…
TokenSpeed is a speed-of-light LLM inference engine.
Open Source Continuous Inference Benchmark Research Platform — Kimi K2.7-Code, MiniMax M3, DeepSeekv4, GLM5 - GB200 NVL72 vs MI355X vs B200 vs GB300 NVL72 & soo…
♾️ Private Agent Fleet with Spec Coding. Each agent gets their own GPU-accelerated desktop. Run Claude, Codex, Gemini and open models on a full private AI Stack…
🚀 Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support
Pure Rust + CUDA LLM inference engine — no PyTorch, OpenAI-compatible, serves Qwen3 to Kimi-K2
Desktop app + OpenAI-compatible API that proxies LLM web UIs for unofficial integration of LLMs into SillyTavern and other clients.
A powerful CLI tool that brings AI assistance directly to your terminal. Gokin understands your codebase and helps with file operations, code search, shell comm…
✨ 让经典名言焕发新生!基于LLM模型动态生成创意故事,用AI重新诠释金山每日一句的智慧结晶
Litefuse - Agent Observability and Evaluation Platform
Deterministic multi-provider AI-agent orchestrator — ODA loops, GPT/Claude/Gemini/Kimi adapters, safety controls, event log
