Topic

Distributed

34 items across the graph — tagged with Distributed.

From the graph · 34

An Open Source Machine Learning Framework for Everyone

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

→repo

milvus-io/milvus

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

→repo

ray-project/ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

→repo

PaddlePaddle/Paddle

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice （『飞桨』核心框架，深度学习&机器学习高性能单机、分布式训练和跨平台部署）

→repo

skypilot-org/skypilot

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clouds, on-prem).

→repo

Netflix/metaflow

Build, Manage and Deploy AI/ML Systems

→repo

rllm-org/rllm

Democratizing Reinforcement Learning for LLMs

→repo

Eventual-Inc/Daft

High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale

→repo

llm-d/llm-d

Achieve state of the art inference performance with modern accelerators on Kubernetes

→repo

astroautomata/PySR

High-Performance Symbolic Regression in Python and Julia

→repo

lakehq/sail

Drop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads.

→repo

kubeflow/trainer

Distributed AI Model Training and LLM Fine-Tuning on Kubernetes

→repo

hyperspaceai/agi

The first distributed AGI system. Thousands of autonomous AI agents collaboratively train models, share experiments via P2P gossip, and push breakthroughs here.…

→repo

study8677/awesome-architecture

🧭 Architecture-first system design: 26 bilingual tutorials, 25 architecture templates, and 6 end-to-end cases covering distributed systems, AI-native systems,…

→repo

beam-cloud/beta9

Ultrafast serverless GPU inference, sandboxes, and background jobs

→repo

google/vizier

Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.

→repo

Mesh-LLM/mesh-llm

Distributed AI/LLM for the people. Share compute privately or publicly to power your agents and chat.

→repo

google/yggdrasil-decision-forests

A library to train, evaluate, interpret, and productionize decision forest models such as Random Forest and Gradient Boosted Decision Trees.

→repo

modal-labs/modal-client

SDK libraries for Modal

→repo

redai-infra/Relax

An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale

→repo

jaylfc/taOS

Self-hosted AI agent OS. Your memory, chat, agents, and files stay on hardware you own, offline by default, cloud by choice. Offline AI memory (taOSmd), self-ho…

→repo

softmata/horus

Fastest Robotics Runtime System. If phones have Android, robots deserve HORUS.

→repo

bodo-ai/Bodo

High Performance Data Processing in Python

→repo

Ikalus1988/MisakaNet

📚 A zero-dependency, git-backed micro-lesson library for AI Agents to asynchronously share and search verified debugging experience. Python stdlib only. | http…

→repo

traceopt-ai/traceml

A lightweight runtime health check for PyTorch training runs.

→repo

aws/sagemaker-xgboost-container

This is the Docker container based on open source framework XGBoost (https://xgboost.readthedocs.io/en/latest/) to allow customers use their own XGBoost scripts…

→repo

kubeflow/sdk

Universal Python SDK to run AI workloads on Kubernetes

→repo

professorpalmer/Puppetmaster

Provider-neutral control plane for durable-state agent swarms: subprocess workers, leases, artifacts, memory, and deterministic stitching.

→repo

NoteDance/Note

Machine learning library, Distributed training, Deep learning, Reinforcement learning, Models, TensorFlow, PyTorch

→repo

thoughtbot/opentelemetry-instrumentation-ruby_llm

OpenTelemetry instrumentation for RubyLLM. 💬🔭

→repo

ruimalheiro/gradient-garden

Research platform for model training, evaluation, and experimentation across architectures, benchmarks, and recipes.

→repo

truvaagents/truva-g3

A Go framework for the microagents architecture with dynamic discovery — specialized agents and tools that register their capabilities with a shared registry, f…

→repo

jameslamb/lightgbm-dask-testing

Test LightGBM's Dask integration on different cluster types

→

From the graph · 34

Related topics