LMMs-Lab-Encoder

community

EvolvingLMMs-Lab

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

xiangan updated a model 4 days ago

lmms-lab-encoder/onevision-encoder-large

xiangan published a model 7 days ago

lmms-lab-encoder/onevision-encoder-large-si

xiangan updated a model 8 days ago

lmms-lab-encoder/onevision-encoder-large-si

View all activity

xiangan

updated a model 4 days ago

lmms-lab-encoder/onevision-encoder-large

0.3B • Updated 4 days ago • 306 • 8

xiangan

published a model 7 days ago

lmms-lab-encoder/onevision-encoder-large-si

0.3B • Updated 8 days ago • 2

xiangan

updated a model 8 days ago

lmms-lab-encoder/onevision-encoder-large-si

0.3B • Updated 8 days ago • 2

xiangan

published a model 12 days ago

lmms-lab-encoder/onevision-encoder-large

0.3B • Updated 4 days ago • 306 • 8

xiangan

authored 4 papers 3 months ago

ForCenNet: Foreground-Centric Network for Document Image Rectification

Paper • 2507.19804 • Published Jul 26, 2025 • 11

Gradient-Attention Guided Dual-Masking Synergetic Framework for Robust Text-based Person Retrieval

Paper • 2509.09118 • Published Sep 11, 2025 • 8

UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning

Paper • 2510.13515 • Published Oct 15, 2025 • 11

ORID: Organ-Regional Information Driven Framework for Radiology Report Generation

Paper • 2411.13025 • Published Nov 20, 2024 • 2

luodian

authored 2 papers 3 months ago

MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs

Paper • 2411.15296 • Published Nov 22, 2024 • 21

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published Jan 23, 2025 • 23

xiangan

authored a paper 3 months ago

LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training

Paper • 2509.23661 • Published Sep 28, 2025 • 47

luodian

authored 2 papers 3 months ago

LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training

Paper • 2509.23661 • Published Sep 28, 2025 • 47

Visual Jigsaw Post-Training Improves MLLMs

Paper • 2509.25190 • Published Sep 29, 2025 • 36

xiangan

authored a paper 5 months ago

Region-based Cluster Discrimination for Visual Representation Learning

Paper • 2507.20025 • Published Jul 26, 2025 • 19

luodian

authored 2 papers 6 months ago

EgoLife: Towards Egocentric Life Assistant

Paper • 2503.03803 • Published Mar 5, 2025 • 46

MMSearch-R1: Incentivizing LMMs to Search

Paper • 2506.20670 • Published Jun 25, 2025 • 64

xiangan

authored a paper 10 months ago

RealSyn: An Effective and Scalable Multimodal Interleaved Document Transformation Paradigm

Paper • 2502.12513 • Published Feb 18, 2025 • 16

luodian

authored 2 papers about 1 year ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published Dec 6, 2024 • 46

Large Multi-modal Models Can Interpret Features in Large Multi-modal Models

Paper • 2411.14982 • Published Nov 22, 2024 • 19

xiangan

authored a paper about 1 year ago

Croc: Pretraining Large Multimodal Models with Cross-Modal Comprehension

Paper • 2410.14332 • Published Oct 18, 2024 • 1

AI & ML interests

Recent Activity

Team members 2

lmms-lab-encoder's activity