TrajSelector: Harnessing Latent Representations for Efficient and Effective Best-of-N in Large Reasoning Model Paper • 2510.16449 • Published Oct 18, 2025 • 34
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published Feb 13, 2025 • 191
Driving Everywhere with Large Language Model Policy Adaptation Paper • 2402.05932 • Published Feb 8, 2024 • 5
SpiRit-LM: Interleaved Spoken and Written Language Model Paper • 2402.05755 • Published Feb 8, 2024 • 15
Question Aware Vision Transformer for Multimodal Reasoning Paper • 2402.05472 • Published Feb 8, 2024 • 10
InstaGen: Enhancing Object Detection by Training on Synthetic Dataset Paper • 2402.05937 • Published Feb 8, 2024 • 14
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models Paper • 2402.05935 • Published Feb 8, 2024 • 17
Suppressing Pink Elephants with Direct Principle Feedback Paper • 2402.07896 • Published Feb 12, 2024 • 11
Premier-TACO: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss Paper • 2402.06187 • Published Feb 9, 2024 • 11
Animated Stickers: Bringing Stickers to Life with Video Diffusion Paper • 2402.06088 • Published Feb 8, 2024 • 11
GALA3D: Towards Text-to-3D Complex Scene Generation via Layout-guided Generative Gaussian Splatting Paper • 2402.07207 • Published Feb 11, 2024 • 12
Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models Paper • 2402.07865 • Published Feb 12, 2024 • 15