Native Active Perception as Reasoning for Omni-Modal Understanding Paper • 2606.19341 • Published 4 days ago • 14
Native Active Perception as Reasoning for Omni-Modal Understanding Paper • 2606.19341 • Published 4 days ago • 14
Native Active Perception as Reasoning for Omni-Modal Understanding Paper • 2606.19341 • Published 4 days ago • 14
yuxinlu1/gemma-4-12B-coder-fable5-composer2.5-v1-GGUF Text Generation • 12B • Updated 2 days ago • 312k • 1.99k
CoInteract: Physically-Consistent Human-Object Interaction Video Synthesis via Spatially-Structured Co-Generation Paper • 2604.19636 • Published Apr 21 • 88
FORGE:Fine-grained Multimodal Evaluation for Manufacturing Scenarios Paper • 2604.07413 • Published Apr 8 • 97
The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping Paper • 2604.11297 • Published Apr 13 • 144
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation Paper • 2604.10098 • Published Apr 11 • 82
Omni-Captioner: Data Pipeline, Models, and Benchmark for Omni Detailed Perception Paper • 2510.12720 • Published Oct 14, 2025 • 2
Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO Paper • 2505.17017 • Published May 22, 2025