Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
adhisetiawan 's Collections
Papers
Multimodal Models
SLMs
LLMs
Audio
Multimodal Papers

Multimodal Models

updated May 27, 2024
Upvote
-

  • microsoft/kosmos-2-patch14-224

    Image-to-Text • 2B • Updated Nov 28, 2023 • 172k • 168

  • Tyrannosaurus/TinyGPT-V

    Updated Jan 19, 2024 • 50

  • naver-clova-ix/donut-base

    Image-to-Text • Updated Aug 13, 2022 • 114k • 227

  • llava-hf/llava-v1.6-34b-hf

    Image-Text-to-Text • 35B • Updated Jan 27 • 4.86k • 85

  • deepseek-ai/deepseek-vl-7b-base

    7B • Updated Mar 15, 2024 • 908 • 61

  • deepseek-ai/deepseek-vl-7b-chat

    Image-Text-to-Text • 7B • Updated Mar 15, 2024 • 53.1k • 259

  • vikhyatk/moondream2

    Image-Text-to-Text • 2B • Updated 25 days ago • 523k • 1.23k

  • zai-org/cogvlm-chat-hf

    Text Generation • 18B • Updated Dec 19, 2023 • 5.27k • 198

  • Qwen/Qwen-VL-Chat

    Text Generation • Updated Jan 25, 2024 • 61.2k • 368

  • Qwen/Qwen-VL

    Text Generation • Updated Jan 25, 2024 • 30.3k • 247

  • microsoft/git-base

    Image-to-Text • 0.2B • Updated Apr 24, 2023 • 122k • 101
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs