repoGitHubTrust 82 · PrimaryPublished yesterdayLive · 21h ago
raketenkater/ggrun
Auto-tuned launcher for GGUF models on llama.cpp / ik_llama.cpp — OpenAI-compatible server with multi-GPU tensor-split, MoE expert placement, measured flag tuning (AI Tune), hardware-matched HuggingFace downloads, and crash recovery. An Ollama alternative for multi-GPU rigs.
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
Covers
Related to
Covers (incoming)
Related across the graph
newsOpenAI unveils its first custom chip, built by BroadcomnewsOpenAI’s Jalapeño chip is Big Tech’s spiciest move away from NvidianewsBuild real agentic apps using CUGA: two dozen working examples on a lightweight harnessnewsOpenAI and Broadcom unveil LLM-optimized inference chipnewsGPT-5.6 launches, but OpenAI is taking it slow - IBMmodelopenai/gpt-oss-120b
