person profile

Michael Y. Li

Michael Y. Li — researcher or builder tracked in the Angestrom contributor network.

5Connections

1Papers

0Models

0Repos

0News

Papers · 1

QuasiMoTTo: Quasi-Monte Carlo Test-Time Scaling

Scaling inference compute, by generating many parallel attempts per problem, is a costly but reliable lever for improving language model capabilities. By default these attempts are generated independently, wasting inference compute on redundant solutions. This waste seems unavoidable. After all, independence is what makes parallel sampling trivial to scale. However, this tradeoff is not fundamental: there is a rich design space of samplers that generate correlated but exact samples entirely in parallel. We explore this design space as an avenue for improving sample efficiency in scaling infere