1 6

wuziheng

wuziheng

AI & ML interests

CV/SSL/MultiMedia

Recent Activity

updated a model 2 days ago

bytedance-research/Valley2-DPO

published a model 5 days ago

bytedance-research/Valley2-DPO

updated a model 20 days ago

bytedance-research/Valley-Eagle-7B

View all activity

Organizations

wuziheng's activity

updated a model 2 days ago

bytedance-research/Valley2-DPO

Updated 2 days ago • 5 • 2

published a model 5 days ago

bytedance-research/Valley2-DPO

Updated 2 days ago • 5 • 2

updated a model 20 days ago

bytedance-research/Valley-Eagle-7B

Updated 20 days ago • 229 • 37

reacted to tianchez's post with 🚀 about 2 months ago

Post

4312

Introducing VLM-R1!

GRPO has helped DeepSeek R1 to learn reasoning. Can it also help VLMs perform stronger for general computer vision tasks?

The answer is YES and it generalizes better than SFT. We trained Qwen 2.5 VL 3B on RefCOCO (a visual grounding task) and eval on RefCOCO Val and RefGTA (an OOD task).

https://github.com/om-ai-lab/VLM-R1