Topic

Amd

16 items across the graph — tagged with Amd.

From the graph · 16

repo

vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

→repo

LMCache/LMCache

LMCache: Supercharge Your LLM with the Fastest KV Cache Layer

→repo

lemonade-sdk/lemonade

Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs. Join our discord: https://discord.gg/5xXzkMu8Z…

→repo

dstackai/dstack

Vendor-agnostic orchestration for training, inference and agentic workloads across NVIDIA, AMD, TPU, and Tenstorrent on clouds, Kubernetes, and bare metal.

→repo

FastFlowLM/FastFlowLM

Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama - but purpose-built and deeply optimized for the AMD NPUs.

→repo

uccl-project/uccl

UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)

→repo

SemiAnalysisAI/InferenceX

Open Source Continuous Inference Benchmark Research Platform — Kimi K2.7-Code, MiniMax M3, DeepSeekv4, GLM5 - GB200 NVL72 vs MI355X vs B200 vs GB300 NVL72 & soo…

→repo

Kaden-Schutt/hipfire

RDNA-native LLM inference engine in Rust.

→repo

ROCm/MIVisionX

AMD MIVisionX is a computer vision toolkit built around a highly optimized, conformant open-source implementation of the Khronos OpenVX™ 1.3 specification. As o…

→repo

NexusGPU/tensor-fusion

Tensor Fusion is a state-of-the-art GPU virtualization and pooling solution designed to optimize GPU cluster utilization to its fullest potential.

→repo

engeldlgado/toshllm

Run large language models locally on Intel Macs with AMD GPUs — native macOS app with Metal acceleration

→repo

b-data/jupyterlab-python-docker-stack

(GPU accelerated) Multi-arch (linux/amd64, linux/arm64/v8) JupyterLab Python docker images. Please submit Pull Requests to the GitLab repository. Mirror of

→repo

b-data/data-science-devcontainers

(GPU accelerated) Multi-arch (linux/amd64, linux/arm64/v8) Data Science dev containers for R, Python, Julia and MAX/Mojo

→repo

SemiAnalysisAI/InferenceX-app

Dashboard for InferenceX™, Open Source Continuous Inference

→repo

Hal0ai/hal0

Open-source self-hosted home AI inference platform for AMD Strix Halo — multi-backend slots, OpenAI-compatible gateway, Vue 3 + FastAPI + systemd.

→tool

@streamdown/math

Dependency/tool package detected from repository manifests (@streamdown/math).