Topic

Reinforcemen

14 items across the graph — tagged with Reinforcemen.

From the graph · 14

Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

Democratizing Reinforcement Learning for LLMs

The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

🚀 An open-source, hands-on curriculum bridging the gap from basic RL concepts to LLM alignment, RLVR, and advanced Agentic systems.

Evaluate and improve models and agents using environments

RL environments + evals for AI agents. Define once, train anything.

Implementation of Danijar's latest iteration for his Dreamer line of work

AI framework for Reinforcement Learning, Automated Planning and Scheduling

Machine learning library, Distributed training, Deep learning, Reinforcement learning, Models, TensorFlow, PyTorch

CUA-Gym-Hub: mock web apps as reproducible RL training environments for computer-use agents

The NoteBook and Assignments implemention via Learning CS336 Spring 2026😛

Experiments with reasoning models, training techniques, papers