Autoscaling
2 items across the graph are tagged with Autoscaling.
From the graph · 2
repo
NexusGPU/tensor-fusion
→Tensor Fusion is a state-of-the-art GPU virtualization and pooling solution designed to optimize GPU cluster utilization to its fullest potential.
defilantech/LLMKube
→Kubernetes operator for self-hosted LLM inference across a heterogeneous GPU fleet: NVIDIA CUDA, AMD Vulkan, and Apple Silicon Metal. Runtimes: llama.cpp, vLLM,…
