Article Fast LoRA inference for Flux with Diffusers and PEFT By sayakpaul and 1 other • 7 days ago • 34
Article Back to The Future: Evaluating AI Agents on Predicting Future Events By vinid and 6 others • 13 days ago • 27
Article Five Big Improvements to Gradio MCP Servers By freddyaboulton • 13 days ago • 18
Article Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders By orionweller and 5 others • 14 days ago • 50
Article Migrating the Hub from Git LFS to Xet By jsulz and 2 others • 15 days ago • 23
google/medsiglip-448 Zero-Shot Image Classification • 0.9B • Updated 20 days ago • 10.3k • 53
Article Upskill your LLMs with Gradio MCP Servers By freddyaboulton • 21 days ago • 17
Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • 22 days ago • 592
Article Efficient MultiModal Data Pipeline By ariG23498 and 4 others • 22 days ago • 50
Changelog Organization and User profiles now include repository listing pages Jun 20 • 123
Post 1687 A bunch of comparable demos for multimodal VLMs (excelling in OCR, cinematography understanding, spatial reasoning, etc.) is now up on the Hub, covering the most recent models through Jun'25.…
Demo Spaces:
> [Nanonets-OCR-s, MonkeyOCR, Typhoon-OCR-7B, SmolDocling] : prithivMLmods/Multimodal-OCR2
> [GLM-4.1v, docscopeOCR-7B, MonkeyOCR, coreOCR-7B] : prithivMLmods/core-OCR
> [Camel-Doc-OCR, ViLaSR-7B, OCRFlux-3B, ShotVL-7B] : https://huggingface.co/spaces/prithivMLmods/Doc-VLMs-v2-Localization
> [SkyCaptioner-V1, SpaceThinker-3B, coreOCR-7B, SpaceOm-3B] : prithivMLmods/VisionScope-R2
> [RolmOCR-7B, Qwen2-VL-OCR-2B, Aya-Vision-8B, Nanonets-OCR-s] : prithivMLmods/Multimodal-OCR
> [DREX-062225-7B, Typhoon-OCR-3B, olmOCR-7B-0225, VIREX-062225-7B] : prithivMLmods/Doc-VLMs-OCR
> [Cosmos-Reason1-7B, docscopeOCR-7B, Captioner-7B, visionOCR-3B] : prithivMLmods/DocScope-R1…
Space Collection : prithivMLmods/multimodal-implementations-67c9982ea04b39f0608badb0...
To learn more, visit the model card of the respective model.