From Pixels to Words -- Towards Native Vision-Language Primitives at Scale
Haiwen Diao
Paranioar
AI & ML interests
Vision-and-Language, Parameter-efficient Transfer Learning, Multi-modal Large Language Model
Recent Activity
liked
a model
about 9 hours ago
Paranioar/NEO1_0-9B-SFT
liked
a model
about 9 hours ago
Paranioar/NEO1_0-2B-SFT
updated
a dataset
about 15 hours ago
Paranioar/datacomp