MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head Paper • 2601.07832 • Published 5 days ago • 43
SpotEdit: Selective Region Editing in Diffusion Transformers Paper • 2512.22323 • Published 22 days ago • 38
WorldWarp: Propagating 3D Geometry with Asynchronous Video Diffusion Paper • 2512.19678 • Published 26 days ago • 29
LongVideoAgent: Multi-Agent Reasoning with Long Videos Paper • 2512.20618 • Published 25 days ago • 53
DeContext as Defense: Safe Image Editing in Diffusion Transformers Paper • 2512.16625 • Published about 1 month ago • 24
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling Paper • 2512.14614 • Published Dec 16, 2025 • 69
In-Video Instructions: Visual Signals as Generative Control Paper • 2511.19401 • Published Nov 24, 2025 • 31
Artificial Hippocampus Networks for Efficient Long-Context Modeling Paper • 2510.07318 • Published Oct 8, 2025 • 30
SparseD: Sparse Attention for Diffusion Language Models Paper • 2509.24014 • Published Sep 28, 2025 • 30