Multimodal Models Collection Multimodal models with leading performance. • 17 items • Updated Jan 17 • 34
Orpheus Multilingual Research Release Collection Beta Release of multilingual models. • 12 items • Updated 10 days ago • 76
view post Post 1669 A new OPEN Omni model just dropped by @Alibaba_Qwen on the hub🔥🤯Qwen2.5-Omni: a 7B end-to-end multimodal model Qwen/Qwen2.5-Omni-7B✨ Thinker-Talker architecture✨ Real-time voice & video chat✨ Natural speech generation✨ Handles text, image, audio & video See translation 1 reply · 🤗 12 12 🔥 8 8 + Reply
Running on Zero 754 754 Florence 2 📉 Analyze images to generate captions, detect objects, or perform OCR