Baifeng Shi

bfshi

https://bfshi.github.io

AI & ML interests

computer vision

Recent Activity

updated a model about 8 hours ago

bfshi/VideoMAE_AutoGaze

updated a model about 8 hours ago

bfshi/AutoGaze

published a model about 8 hours ago

bfshi/VideoMAE_AutoGaze

upvoted a paper 24 days ago

ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models

Paper • 2512.07843 • Published Nov 24, 2025 • 21

upvoted a collection 2 months ago

NVILA (HuggingFace)

Collection

HuggingFace Transformers can load us. • 5 items • Updated Sep 13, 2025 • 5

upvoted 2 papers 3 months ago

Learning to Grasp Anything by Playing with Random Toys

Paper • 2510.12866 • Published Oct 14, 2025 • 5

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 176

upvoted a paper 6 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 159

upvoted 7 papers 9 months ago

Learning Adaptive Parallel Reasoning with Language Models

Paper • 2504.15466 • Published Apr 21, 2025 • 44

Describe Anything: Detailed Localized Image and Video Captioning

Paper • 2504.16072 • Published Apr 22, 2025 • 63

Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling

Paper • 2504.13169 • Published Apr 17, 2025 • 39

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17, 2025 • 93

upvoted a paper 11 months ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28, 2025 • 123

upvoted a paper 12 months ago

An Empirical Study of Autoregressive Pre-training from Videos

Paper • 2501.05453 • Published Jan 9, 2025 • 41

upvoted a collection about 1 year ago

NVILA

Collection

11 items • Updated Sep 13, 2025 • 16

upvoted 4 papers about 1 year ago

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 59

Robots Pre-train Robots: Manipulation-Centric Robotic Representation from Large-Scale Robot Dataset

Paper • 2410.22325 • Published Oct 29, 2024 • 10

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Paper • 2410.16268 • Published Oct 21, 2024 • 69

PHI-S: Distribution Balancing for Label-Free Multi-Teacher Distillation

Paper • 2410.01680 • Published Oct 2, 2024 • 34

upvoted a paper over 1 year ago

MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Paper • 2408.13257 • Published Aug 23, 2024 • 26
