
stepfun-ai/GOT-OCR2_0
Image-Text-to-Text
β’
0.7B
β’
Updated
β’
70.1k
β’
1.51k
Generate MIDI music from prompts
Segment and track objects in videos
Demo for multimodal understanding and generation