repoGitHubTrust 82 · PrimaryPublished 14h agoLive · 14h ago
redai-infra/Relax
An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
Implements
paperJoint Learning of Experiential Rules and Policies for Large Language Model AgentspaperZ-1: Efficient Reinforcement Learning for Vision-Language-Action ModelspaperIs One Layer Enough? Training A Single Transformer Layer Can Match Full-Parameter RL TrainingpaperAsk, Solve, Generate: Self-Evolving Unified Multimodal Understanding and Generation via Self-Consistency Rewards
Covers
Related across the graph
newsRL without TD learningpaperJoint Learning of Experiential Rules and Policies for Large Language Model AgentspaperIs One Layer Enough? Training A Single Transformer Layer Can Match Full-Parameter RL TrainingpaperZ-1: Efficient Reinforcement Learning for Vision-Language-Action ModelspaperAsk, Solve, Generate: Self-Evolving Unified Multimodal Understanding and Generation via Self-Consistency Rewards
