repoGitHubTrust 82 · PrimaryPublished 15h agoLive · 15h ago
sgl-project/rbg
A workload for deploying LLM inference services on Kubernetes
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
Covers
Implements
Covers (incoming)
Related across the graph
paperMCP Server Architecture Patterns for LLM-Integrated ApplicationsnewsUnderstanding dynamic resource allocation in KubernetesnewsSelf-hosted GitHub Actions runners on Lambda MicroVMsnewsImplementing resilience patterns with Amazon Bedrock and LLM gatewaynewsHow're you deploying LLMs in production now-a-days? What's the best and most affordable way? [D]
