5 9 7

Runsen Xu

RunsenXu

https://runsenxu.com/

AI & ML interests

Large Language Models, Multi-modal Learning, 3D Perception and Understanding, Self-supervised Learning

Recent Activity

upvoted a paper 8 days ago

MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence

authored a paper 10 days ago

MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations

authored a paper 10 days ago

Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models

View all activity

Organizations

None yet

upvoted a paper 8 days ago

MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence

Paper • 2512.10863 • Published 14 days ago • 21

authored 7 papers 10 days ago

MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations

Paper • 2406.09401 • Published Jun 13, 2024

Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models

Paper • 2505.17015 • Published May 22 • 9

MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence

Paper • 2505.23764 • Published May 29 • 3

OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding

Paper • 2507.07984 • Published Jul 10 • 42

VFlowOpt: A Token Pruning Framework for LMMs with Visual Information Flow-Guided Optimization

Paper • 2508.05211 • Published Aug 7 • 1

ChangingGrounding: 3D Visual Grounding in Changing Scenes

Paper • 2510.14965 • Published Oct 16

G$^2$VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning

Paper • 2511.21688 • Published 29 days ago • 8

liked a dataset 10 days ago

rbler/MMSI-Video-Bench

Updated 3 days ago • 631 • 3

updated a dataset 18 days ago

RunsenXu/VSR

Viewer • Updated 18 days ago • 1.22k • 29

published a dataset 18 days ago

RunsenXu/VSR

Viewer • Updated 18 days ago • 1.22k • 29

updated a dataset 2 months ago

RunsenXu/MMSI-Bench

Viewer • Updated Oct 23 • 1k • 677 • 9

upvoted a paper 3 months ago

Seedream 4.0: Toward Next-generation Multimodal Image Generation

Paper • 2509.20427 • Published Sep 24 • 81

updated 3 datasets 3 months ago

Runsen Xu

AI & ML interests

Recent Activity

Organizations

RunsenXu's activity