repoGitHubTrust 82 · PrimaryPublished 13h agoLive · 12h ago
notwitcheer/llm-bench-rig
Dual-engine (llama.cpp + vLLM) LLM benchmarking pipeline for GGUF & safetensors on NVIDIA GPUs — speed, quality, live dashboard, publishable cards.
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
Implements
Covers
newsBolt Graphics GPU will have 2 DDR5 laptop DIMM slotsnewsNous Research's NousCoder-14B is an open-source coding model landing right in the Claude Code momentnewsGoing from single GPU to dual GPU is nice but not in the way I expectednewsAre there good closed vs open LLM rankings? Also, are 70B–350B models actually worth it?
Related across the graph
newsNous Research's NousCoder-14B is an open-source coding model landing right in the Claude Code momentnewsAre there good closed vs open LLM rankings? Also, are 70B–350B models actually worth it?paperOne-Step Gradient Delay is Not a Barrier for Large-Scale Asynchronous Pipeline Parallel LLM PretrainingnewsBolt Graphics GPU will have 2 DDR5 laptop DIMM slotsnewsGoing from single GPU to dual GPU is nice but not in the way I expected
