Baifeng Shi's picture

4 23 3

Baifeng Shi

bfshi

·

https://bfshi.github.io

AI & ML interests

computer vision

Recent Activity

upvoted a paper about 2 hours ago

Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling

upvoted a paper about 18 hours ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

upvoted a paper 10 days ago

One-Minute Video Generation with Test-Time Training

View all activity

Organizations

bfshi's activity

commented a paper 24 days ago

Scaling Vision Pre-Training to 4K Resolution

Paper • 2503.19903 • Published 24 days ago • 39 •

New activity in Efficient-Large-Model/NVILA-8B-Video 2 months ago

What is the difference between the nvila 8b base model and video model?

#1 opened 2 months ago by

New activity in Efficient-Large-Model/NVILA-15B 3 months ago

Ask about demo

#1 opened 4 months ago by

New activity in laion/CLIP-ViT-B-16-laion2B-s34B-b88K about 1 year ago

there are no config for transformers

#1 opened over 1 year ago by