NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 6 items • Updated about 16 hours ago • 107
Language Server CLI Empowers Language Agents with Process Rewards Paper • 2510.22907 • Published Oct 27, 2025 • 4
view article Article Lanser-CLI: Language Server CLI Empowers Language Agents with Process Rewards 🛠️🏆 Oct 27, 2025 • 1
On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning Paper • 2505.17508 • Published May 23, 2025 • 8
FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models Paper • 2505.02735 • Published May 5, 2025 • 33
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published Jan 14, 2025 • 300
Scaling Image Tokenizers with Grouped Spherical Quantization Paper • 2412.02632 • Published Dec 3, 2024 • 10
Training and Evaluating Language Models with Template-based Data Generation Paper • 2411.18104 • Published Nov 27, 2024 • 3
view article Article Revisiting TemplateGSM: Advancing Mathematical Reasoning in Language Models with Template-based Data Generation Nov 14, 2024 • 3