s1: simple test time scaling

Best AI papers explained - A podcast by Enoch H. Kang

Categories:

Test-time scaling improves language model performance using extra computeA dataset of 1,000 questions was curated for validationBudget forcing controls compute by managing the model's reasoning process The model outperformed o1-preview by up to 27% on math questions The model and data are open-source for public access