Topic

Kubernetes Operator

1 items across the graph — tagged with Kubernetes Operator.

From the graph · 1

Kubernetes operator for self-hosted LLM inference across a heterogeneous GPU fleet: NVIDIA CUDA, AMD Vulkan, and Apple Silicon Metal. Runtimes: llama.cpp, vLLM,…

→

From the graph · 1

Related topics