Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
johannhartmann
's Collections
Music
GUI Intelligence
Document & UI Intelligence
Multimodal Models
Medical MultiModal
GUI Intelligence
updated
4 days ago
Upvote
1
ByteDance-Seed/UI-TARS-72B-DPO
Image-Text-to-Text
•
73B
•
Updated
Jan 25
•
1.79k
•
134
ByteDance-Seed/UI-TARS-7B-DPO
Image-Text-to-Text
•
8B
•
Updated
Jan 25
•
3.26k
•
219
microsoft/OmniParser
Image-Text-to-Text
•
Updated
Dec 2, 2024
•
1.18k
•
1.68k
jadechoghari/Ferret-UI-Llama8b
Image-Text-to-Text
•
8B
•
Updated
Jan 8
•
391
•
68
microsoft/GUI-Actor-7B-Qwen2.5-VL
Image-Text-to-Text
•
8B
•
Updated
Jun 10
•
502
•
16
Upvote
1
Share collection
View history
Collection guide
Browse collections