Read original ↗
paperarXivTrust 82 · PrimaryPublished 2d agoLive · yesterday

Graph-Native Reinforcement Learning Enables Traceable Scientific Hypothesis Generation through Conceptual Recombination

Accelerating materials discovery requires AI systems that can generate scientifically valid hypotheses through multi-step, domain-grounded reasoning. Standard large language models often produce fluent but weakly traceable responses to open-ended materials design problems, making it difficult to determine whether final answers are supported by coherent intermediate reasoning. We develop Graph-PRefLexOR, a family of graph-native reasoning models fine-tuned with Group Relative Policy Optimization (GRPO) to organize reasoning into explicit phases for mechanism exploration, graph construction, pat

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Related to

Covers (incoming)

Implements (incoming)

Related across the graph

Topics