TERMINATOR: Learning Optimal Exit Points for Early Stopping in Chain-of-Thought Reasoning Paper • 2603.12529 • Published Mar 13 • 19
DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset Paper • 2403.12945 • Published Mar 19, 2024
Mini-BEHAVIOR: A Procedurally Generated Benchmark for Long-horizon Decision-Making in Embodied AI Paper • 2310.01824 • Published Oct 3, 2023 • 1
Simple Recipe Works: Vision-Language-Action Models are Natural Continual Learners with Reinforcement Learning Paper • 2603.11653 • Published Mar 12 • 2
EntRGi: Entropy Aware Reward Guidance for Diffusion Language Models Paper • 2602.05000 • Published Feb 4 • 2
view post Post 1641 Are you familiar with reverse residual connections or looping in language models?Excited to share my Looped-GPT blog post and codebase 🚀https://github.com/sanyalsunny111/Looped-GPTTL;DR: looping during pre-training improves generalization.Plot shows GPT2 LMs pre-trained with 15.73B OWT tokensP.S. This is my first post here — I have ~4 followers and zero expectations for reach 😄 See translation 3 replies · 🧠 6 6 👍 3 3 + Reply
Mitigating Catastrophic Forgetting in Mathematical Reasoning Finetuning through Mixed Training Paper • 2512.13706 • Published Dec 5, 2025 • 1
Concentration of Measure for Distributions Generated via Diffusion Models Paper • 2501.07741 • Published Jan 13, 2025
Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations Paper • 2410.10792 • Published Oct 14, 2024 • 31
Beyond First-Order Tweedie: Solving Inverse Problems using Latent Diffusion Paper • 2312.00852 • Published Dec 1, 2023
RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control Paper • 2405.17401 • Published May 27, 2024 • 5
Solving Linear Inverse Problems Provably via Posterior Sampling with Latent Diffusion Models Paper • 2307.00619 • Published Jul 2, 2023 • 1
TIPS: Topologically Important Path Sampling for Anytime Neural Networks Paper • 2305.08021 • Published May 13, 2023