Hugging Face
Baifeng Shi
bfshi
4 followers · 4 following
https://bfshi.github.io
baifeng_shi
bfshi
AI & ML interests
computer vision
Recent Activity
upvoted a paper about 2 hours ago
Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling
upvoted a paper about 18 hours ago
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
upvoted a paper 10 days ago
One-Minute Video Generation with Test-Time Training
bfshi's activity
commented a paper 24 days ago
Scaling Vision Pre-Training to 4K Resolution
Paper • 2503.19903 • Published 24 days ago • 39 • 2
New activity in Efficient-Large-Model/NVILA-8B-Video 2 months ago
What is the difference between the nvila 8b base model and video model?
1
#1 opened 2 months ago by YoungjaeDev
New activity in Efficient-Large-Model/NVILA-15B 3 months ago
Ask about demo
5
#1 opened 4 months ago by Lanbai44
New activity in laion/CLIP-ViT-B-16-laion2B-s34B-b88K about 1 year ago
there are no config for transformers
1
#1 opened over 1 year ago by forty-lock