Kimi

12 items across the graph are tagged with Kimi.

From the graph · 12

A high-throughput and memory-efficient inference and serving engine for LLMs

Hugging Face model with 2828 likes. Tags: transformers, safetensors, kimi_k25, feature-extraction, compressed-tensors, image-text-to-text, conversational, custo…

lightseekorg/tokenspeed

TokenSpeed is a speed-of-light LLM inference engine.

SemiAnalysisAI/InferenceX

Open Source Continuous Inference Benchmark Research Platform — Kimi K2.7-Code, MiniMax M3, DeepSeekv4, GLM5 - GB200 NVL72 vs MI355X vs B200 vs GB300 NVL72 & soo…

helixml/helix

♾️ Private Agent Fleet with Spec Coding. Each agent gets their own GPU-accelerated desktop. Run Claude, Codex, Gemini and open models on a full private AI Stack…

NVIDIA-NeMo/Automodel

🚀 PyTorch Distributed native training library for LLMs/VLMs with out-of-the-box Hugging Face support

openinfer-project/openinfer

Pure Rust + CUDA LLM inference engine — no PyTorch, OpenAI-compatible, serves Qwen3 to Kimi-K2

LyubomirT/intense-rp-next

Desktop app + OpenAI-compatible API that proxies LLM web UIs for unofficial integration of LLMs into SillyTavern and other clients.

ginkida/gokin

A powerful CLI tool that brings AI assistance directly to your terminal. Gokin understands your codebase and helps with file operations, code search, shell comm…

Liao-Ke/everyday

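✨ Breathe new life into classic quotes! Dynamically generates creative stories with LLMs, using AI to reinterpret the wisdom of Kingsoft's Daily Sentence (金山每日一句).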

litefuse/litefuse

Litefuse - Agent Observability and Evaluation Platform

Francis1998/multi-bot-agentic

Deterministic multi-provider AI-agent orchestrator — ODA loops, GPT/Claude/Gemini/Kimi adapters, safety controls, event log
