3 32 17

Cong Wei PRO

CongWei1230

https://congwei1230.github.io/

AI & ML interests

Generative Models; Reasoning

Recent Activity

updated a collection 7 days ago

MoCha

liked a model 7 days ago

CongWei1230/MoCha-Demo

updated a model 7 days ago

CongWei1230/MoCha-Demo

View all activity

Organizations

updated a collection 7 days ago

MoCha

Collection

The pioneering work in Dialogue-driven Movie Shot Generation • 4 items • Updated 7 days ago • 2

liked a model 7 days ago

CongWei1230/MoCha-Demo

Text-to-Video • Updated 7 days ago • 2

updated a model 7 days ago

CongWei1230/MoCha-Demo

Text-to-Video • Updated 7 days ago • 2

updated a collection 7 days ago

MoCha

Collection

Dialogue-driven Movie Shot Generation • 4 items • Updated 7 days ago • 1

upvoted a paper 10 days ago

SemanticGen: Video Generation in Semantic Space

Paper • 2512.20619 • Published 11 days ago • 88

published a model 19 days ago

CongWei1230/MoCha-Demo

Text-to-Video • Updated 7 days ago • 2

upvoted a paper 20 days ago

SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder

Paper • 2512.11749 • Published 22 days ago • 36

upvoted 3 papers about 1 month ago

MultiShotMaster: A Controllable Multi-Shot Video Generation Framework

Paper • 2512.03041 • Published Dec 2, 2025 • 62

Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout

Paper • 2511.20649 • Published Nov 25, 2025 • 46

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published Dec 1, 2025 • 70

upvoted a paper 2 months ago

VisCoder2: Building Multi-Language Visualization Coding Agents

Paper • 2510.23642 • Published Oct 24, 2025 • 21

upvoted a paper 3 months ago

Latent Diffusion Model without Variational Autoencoder

Paper • 2510.15301 • Published Oct 17, 2025 • 49

commented a paper 3 months ago

UniVideo: Unified Understanding, Generation, and Editing for Videos

Paper • 2510.08377 • Published Oct 9, 2025 • 71 •

upvoted a paper 3 months ago

BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions

Paper • 2510.10666 • Published Oct 12, 2025 • 27

authored a paper 3 months ago

UniVideo: Unified Understanding, Generation, and Editing for Videos

Paper • 2510.08377 • Published Oct 9, 2025 • 71

upvoted 2 papers 3 months ago

UniVideo: Unified Understanding, Generation, and Editing for Videos

Paper • 2510.08377 • Published Oct 9, 2025 • 71

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 270

upvoted 2 papers 4 months ago

OpenVision 2: A Family of Generative Pretrained Visual Encoders for Multimodal Learning

Paper • 2509.01644 • Published Sep 1, 2025 • 33

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published Sep 1, 2025 • 76

liked a dataset 6 months ago

APRIL-AIGC/UltraVideo-Long

Viewer • Updated Jul 14, 2025 • 16.6k • 589 • 6

Cong Wei PRO

AI & ML interests

Recent Activity

Organizations

CongWei1230's activity