paperarXivTrust 82 · PrimaryPublished 5d agoLive · 3d ago
Sample Complexity of Scientific Discovery: PAC Learnability of Compositional Function Trees
Scientific discovery via symbolic regression is often viewed as statistically and computationally intractable because the hypothesis space of expressions grows combinatorially with depth. This paper revisits the statistical side through the lens of PAC learning, focusing on compositional function trees built from a finite vocabulary of smooth operators (e.g., $\{+,\times,\sin,\exp\}$ and affine maps). We prove that the relevant generalization quantity, Rademacher complexity, hence the excess risk, does not necessarily blow up exponentially with the number of distinct symbolic structures, but i
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
