Topic

Amd

16 items across the graph — tagged with Amd.

From the graph · 16

repo
vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

repo
LMCache/LMCache

LMCache: Supercharge Your LLM with the Fastest KV Cache Layer

repo
lemonade-sdk/lemonade

Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs. Join our discord: https://discord.gg/5xXzkMu8Z…

repo
dstackai/dstack

Vendor-agnostic orchestration for training, inference and agentic workloads across NVIDIA, AMD, TPU, and Tenstorrent on clouds, Kubernetes, and bare metal.

repo
FastFlowLM/FastFlowLM

Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama - but purpose-built and deeply optimized for the AMD NPUs.

repo
uccl-project/uccl

UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)

repo
SemiAnalysisAI/InferenceX

Open Source Continuous Inference Benchmark Research Platform — Kimi K2.7-Code, MiniMax M3, DeepSeekv4, GLM5 - GB200 NVL72 vs MI355X vs B200 vs GB300 NVL72 & soo…

repo
Kaden-Schutt/hipfire

RDNA-native LLM inference engine in Rust.

repo
ROCm/MIVisionX

AMD MIVisionX is a computer vision toolkit built around a highly optimized, conformant open-source implementation of the Khronos OpenVX™ 1.3 specification. As o…

repo
NexusGPU/tensor-fusion

Tensor Fusion is a state-of-the-art GPU virtualization and pooling solution designed to optimize GPU cluster utilization to its fullest potential.

repo
engeldlgado/toshllm

Run large language models locally on Intel Macs with AMD GPUs — native macOS app with Metal acceleration

repo
b-data/jupyterlab-python-docker-stack

(GPU accelerated) Multi-arch (linux/amd64, linux/arm64/v8) JupyterLab Python docker images. Please submit Pull Requests to the GitLab repository. Mirror of

repo
b-data/data-science-devcontainers

(GPU accelerated) Multi-arch (linux/amd64, linux/arm64/v8) Data Science dev containers for R, Python, Julia and MAX/Mojo

repo
SemiAnalysisAI/InferenceX-app

Dashboard for InferenceX™, Open Source Continuous Inference

repo
Hal0ai/hal0

Open-source self-hosted home AI inference platform for AMD Strix Halo — multi-backend slots, OpenAI-compatible gateway, Vue 3 + FastAPI + systemd.

tool
@streamdown/math

Dependency/tool package detected from repository manifests (@streamdown/math).

Related topics