wuziheng

AI & ML interests

CV/SSL/MultiMedia

Recent Activity

updated a model 2 days ago
bytedance-research/Valley2-DPO
published a model 5 days ago
bytedance-research/Valley2-DPO
updated a model 20 days ago
bytedance-research/Valley-Eagle-7B

Organizations

Alibaba-PAI, bytedance-research

wuziheng's activity

reacted to tianchez's post with 🚀 about 2 months ago
Introducing VLM-R1!

GRPO helped DeepSeek R1 learn to reason. Can it also help VLMs perform better on general computer vision tasks?

The answer is YES, and it generalizes better than SFT. We trained Qwen 2.5 VL 3B on RefCOCO (a visual grounding task) and evaluated it on RefCOCO Val and RefGTA (an OOD task).

https://github.com/om-ai-lab/VLM-R1
New activity in bytedance-research/Valley-Eagle-7B 3 months ago

Update README.md

#3 opened 3 months ago by Hyggge