Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering Paper • 2505.23604 • Published May 29, 2025 • 23
RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning Paper • 2505.15034 • Published May 21, 2025 • 5