repoGitHubTrust 82 · PrimaryPublished 18h agoLive · 15h ago
raullenchai/Rapid-MLX
The fastest local AI engine for Apple Silicon. 4.2x faster than Ollama, 0.08s cached TTFT, 100% tool calling. 17 tool parsers, prompt cache, reasoning separation, cloud routing. Drop-in OpenAI replacement. Works with Claude Code, Cursor, Aider.
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
Covers
Covers (incoming)
Related across the graph
newsFrom Materials Simulation to Experimental Astronomy, New NVIDIA AI Software Unlocks Scientific DiscoveriesnewsFollow-up: DeepSeek V4 Flash on 2x RTX PRO 6000 finishes real coding tasks faster than Sonnet and Opus, at about Sonnet qualitynewsOpenAI and Broadcom unveil LLM-optimized inference chipnewsHolo3.1: Fast & Local Computer Use AgentsnewsGPT-5.6 launches, but OpenAI is taking it slow - IBMnewsOpenAI reveals its first AI processor: JalapeñonewsOpenAI to acquire Ona
