Topic

Reinforcemen

14 items across the graph — tagged with Reinforcemen.

From the graph · 14

repo
aws/amazon-sagemaker-examples

Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.

repo
kvcache-ai/Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

repo
rllm-org/rllm

Democratizing Reinforcement Learning for LLMs

repo
areal-project/AReaL

The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.

repo
google-deepmind/dm_control

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

repo
walkinglabs/hands-on-modern-rl

🚀 An open-source, hands-on curriculum bridging the gap from basic RL concepts to LLM alignment, RLVR, and advanced Agentic systems.

repo
NVIDIA-NeMo/Gym

Evaluate and improve models and agents using environments

repo
hud-evals/hud-python

RL environments + evals for AI agents. Define once, train anything.

repo
lucidrains/dreamer4

Implementation of Danijar's latest iteration for his Dreamer line of work

repo
airbus/scikit-decide

AI framework for Reinforcement Learning, Automated Planning and Scheduling

repo
NoteDance/Note

Machine learning library, Distributed training, Deep learning, Reinforcement learning, Models, TensorFlow, PyTorch

repo
xlang-ai/CUA-Gym-Hub

CUA-Gym-Hub: mock web apps as reproducible RL training environments for computer-use agents

repo
Mungeryang/CS336-From-Scratch-Spring2026

The NoteBook and Assignments implemention via Learning CS336 Spring 2026😛

repo
benjaminzwhite/reasoning-models

Experiments with reasoning models, training techniques, papers

Related topics