Reinforcemen
14 items across the graph — tagged with Reinforcemen.
From the graph · 14
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
Democratizing Reinforcement Learning for LLMs
The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
🚀 An open-source, hands-on curriculum bridging the gap from basic RL concepts to LLM alignment, RLVR, and advanced Agentic systems.
Evaluate and improve models and agents using environments
RL environments + evals for AI agents. Define once, train anything.
Implementation of Danijar's latest iteration for his Dreamer line of work
AI framework for Reinforcement Learning, Automated Planning and Scheduling
Machine learning library, Distributed training, Deep learning, Reinforcement learning, Models, TensorFlow, PyTorch
CUA-Gym-Hub: mock web apps as reproducible RL training environments for computer-use agents
The NoteBook and Assignments implemention via Learning CS336 Spring 2026😛
Experiments with reasoning models, training techniques, papers
