Read original ↗
paperarXivTrust 82 · PrimaryPublished 5d agoLive · 3d ago

Evidence-Informed LLM Beliefs for Continual Scientific Discovery

Open-ended scientific discovery with large language models (LLMs) increasingly operates as a long-horizon loop of hypothesis search and verification, where a reward signal guides which hypotheses to test next. A notable recent example is AutoDiscovery, which uses "Bayesian surprise" - the belief shift an LLM undergoes after observing evidence for a hypothesis - as both a discovery metric and a reward for search. We first observe that AutoDiscovery treats surprisal as a static quantity, while surprisal in human reasoning is non-stationary - it is defined relative to beliefs that evolve with exp

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Related to

Has model

Covers (incoming)

Related across the graph

Topics