Read original ↗
newsReddit r/LocalLLaMATrust 58 · CommunityPublished 5d agoLive · 4d ago

Ornith-1.0-35B GGUF update: native MTP speculative-decode graft + full serving/TTFT/long-context numbers (llama.cpp, tp=1)