Yuseung "Phillip" Lee's picture

Yuseung "Phillip" Lee

phillipinseoul

·

https://phillipinseoul.github.io/

phillipinseoul

AI & ML interests

Computer Vision

Recent Activity

liked a model about 17 hours ago

Qwen/Qwen3-VL-4B-Thinking

upvoted a paper 1 day ago

AdaTooler-V: Adaptive Tool-Use for Images and Videos

upvoted a paper 1 day ago

Next-Embedding Prediction Makes Strong Vision Learners

View all activity

Organizations

liked a model about 17 hours ago

Qwen/Qwen3-VL-4B-Thinking

Image-Text-to-Text • 4B • Updated Oct 15 • 55.1k • 90

upvoted 3 papers 1 day ago

AdaTooler-V: Adaptive Tool-Use for Images and Videos

Paper • 2512.16918 • Published 2 days ago • 10

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published 2 days ago • 54

N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models

Paper • 2512.16561 • Published 2 days ago • 17

upvoted a paper 2 days ago

Adaptation of Agentic AI

Paper • 2512.16301 • Published 3 days ago • 71

upvoted 2 papers 3 days ago

DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models

Paper • 2512.15713 • Published 3 days ago • 13

Fast and Accurate Causal Parallel Decoding using Jacobi Forcing

Paper • 2512.14681 • Published 4 days ago • 39

upvoted 4 papers 4 days ago

Vector Prism: Animating Vector Graphics by Stratifying Semantic Structure

Paper • 2512.14336 • Published 4 days ago • 27

WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling

Paper • 2512.14614 • Published 4 days ago • 60

RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics

Paper • 2512.13660 • Published 5 days ago • 36

MMGR: Multi-Modal Generative Reasoning

Paper • 2512.14691 • Published 4 days ago • 112

authored a paper 5 days ago

Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection

Paper • 2512.13250 • Published 5 days ago • 9

upvoted 3 papers 5 days ago

Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection

Paper • 2512.13250 • Published 5 days ago • 9

Memory in the Age of AI Agents

Paper • 2512.13564 • Published 5 days ago • 98

EgoX: Egocentric Video Generation from a Single Exocentric Video

Paper • 2512.08269 • Published 12 days ago • 100

upvoted a paper 6 days ago

X-Humanoid: Robotize Human Videos to Generate Humanoid Videos at Scale

Paper • 2512.04537 • Published 17 days ago • 6

upvoted 2 papers 9 days ago

Stronger Normalization-Free Transformers

Paper • 2512.10938 • Published 9 days ago • 18

Evaluating Gemini Robotics Policies in a Veo World Simulator

Paper • 2512.10675 • Published 9 days ago • 15

liked 2 models 10 days ago

lmms-lab/LLaVA-Video-7B-Qwen2

Video-Text-to-Text • 8B • Updated Oct 25, 2024 • 23.6k • 120

google/siglip-so400m-patch14-384

Zero-Shot Image Classification • 0.9B • Updated Sep 26, 2024 • 6.48M • 632