Topic

Distributed

34 items across the graph — tagged with Distributed.

From the graph · 34

repo
tensorflow/tensorflow

An Open Source Machine Learning Framework for Everyone

repo
mudler/LocalAI

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

repo
milvus-io/milvus

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

repo
ray-project/ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

repo
PaddlePaddle/Paddle

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

repo
skypilot-org/skypilot

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clouds, on-prem).

repo
Netflix/metaflow

Build, Manage and Deploy AI/ML Systems

repo
rllm-org/rllm

Democratizing Reinforcement Learning for LLMs

repo
Eventual-Inc/Daft

High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale

repo
llm-d/llm-d

Achieve state of the art inference performance with modern accelerators on Kubernetes

repo
astroautomata/PySR

High-Performance Symbolic Regression in Python and Julia

repo
lakehq/sail

Drop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads.

repo
kubeflow/trainer

Distributed AI Model Training and LLM Fine-Tuning on Kubernetes

repo
hyperspaceai/agi

The first distributed AGI system. Thousands of autonomous AI agents collaboratively train models, share experiments via P2P gossip, and push breakthroughs here.…

repo
study8677/awesome-architecture

🧭 Architecture-first system design: 26 bilingual tutorials, 25 architecture templates, and 6 end-to-end cases covering distributed systems, AI-native systems,…

repo
beam-cloud/beta9

Ultrafast serverless GPU inference, sandboxes, and background jobs

repo
google/vizier

Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.

repo
Mesh-LLM/mesh-llm

Distributed AI/LLM for the people. Share compute privately or publicly to power your agents and chat.

repo
google/yggdrasil-decision-forests

A library to train, evaluate, interpret, and productionize decision forest models such as Random Forest and Gradient Boosted Decision Trees.

repo
modal-labs/modal-client

SDK libraries for Modal

repo
redai-infra/Relax

An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale

repo
jaylfc/taOS

Self-hosted AI agent OS. Your memory, chat, agents, and files stay on hardware you own, offline by default, cloud by choice. Offline AI memory (taOSmd), self-ho…

repo
softmata/horus

Fastest Robotics Runtime System. If phones have Android, robots deserve HORUS.

repo
bodo-ai/Bodo

High Performance Data Processing in Python

repo
Ikalus1988/MisakaNet

📚 A zero-dependency, git-backed micro-lesson library for AI Agents to asynchronously share and search verified debugging experience. Python stdlib only. | http…

repo
traceopt-ai/traceml

A lightweight runtime health check for PyTorch training runs.

repo
aws/sagemaker-xgboost-container

This is the Docker container based on open source framework XGBoost (https://xgboost.readthedocs.io/en/latest/) to allow customers use their own XGBoost scripts…

repo
kubeflow/sdk

Universal Python SDK to run AI workloads on Kubernetes

repo
professorpalmer/Puppetmaster

Provider-neutral control plane for durable-state agent swarms: subprocess workers, leases, artifacts, memory, and deterministic stitching.

repo
NoteDance/Note

Machine learning library, Distributed training, Deep learning, Reinforcement learning, Models, TensorFlow, PyTorch

repo
thoughtbot/opentelemetry-instrumentation-ruby_llm

OpenTelemetry instrumentation for RubyLLM. 💬🔭

repo
ruimalheiro/gradient-garden

Research platform for model training, evaluation, and experimentation across architectures, benchmarks, and recipes.

repo
truvaagents/truva-g3

A Go framework for the microagents architecture with dynamic discovery — specialized agents and tools that register their capabilities with a shared registry, f…

repo
jameslamb/lightgbm-dask-testing

Test LightGBM's Dask integration on different cluster types

Related topics