paperarXivTrust 82 · PrimaryPublished 2d agoLive · yesterday

Graph-Native Reinforcement Learning Enables Traceable Scientific Hypothesis Generation through Conceptual Recombination

Accelerating materials discovery requires AI systems that can generate scientifically valid hypotheses through multi-step, domain-grounded reasoning. Standard large language models often produce fluent but weakly traceable responses to open-ended materials design problems, making it difficult to determine whether final answers are supported by coherent intermediate reasoning. We develop Graph-PRefLexOR, a family of graph-native reasoning models fine-tuned with Group Relative Policy Optimization (GRPO) to organize reasoning into explicit phases for mechanism exploration, graph construction, pat

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Related to

companyNorthwind AI

Covers (incoming)

newsEmpowering biomedical evidence exploration and synthesis with deep knowledge graph research newsLearning Structured Reasoning via Tractable Trajectory Control - Apple Machine Learning Research

Implements (incoming)

reposileod/reasoning-core

Related across the graph

companyNorthwind AI newsEmpowering biomedical evidence exploration and synthesis with deep knowledge graph research reposileod/reasoning-core newsLearning Structured Reasoning via Tractable Trajectory Control - Apple Machine Learning Research

Topics

cs.AI