Reasoning 🧠
updated
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep
Thinking
Paper
•
2501.04519
•
Published
•
287
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta
Chain-of-Though
Paper
•
2501.04682
•
Published
•
99
Scaling LLM Test-Time Compute Optimally can be More Effective than
Scaling Model Parameters
Paper
•
2408.03314
•
Published
•
63
Training Large Language Models to Reason in a Continuous Latent Space
Paper
•
2412.06769
•
Published
•
92
Test-time Computing: from System-1 Thinking to System-2 Thinking
Paper
•
2501.02497
•
Published
•
45
The Lessons of Developing Process Reward Models in Mathematical
Reasoning
Paper
•
2501.07301
•
Published
•
99
Evolving Deeper LLM Thinking
Paper
•
2501.09891
•
Published
•
115
Hallucinations Can Improve Large Language Models in Drug Discovery
Paper
•
2501.13824
•
Published
•
10
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model
Post-training
Paper
•
2501.17161
•
Published
•
123
LIMO: Less is More for Reasoning
Paper
•
2502.03387
•
Published
•
62
s1: Simple test-time scaling
Paper
•
2501.19393
•
Published
•
124
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of
Physical Concept Understanding
Paper
•
2502.08946
•
Published
•
191