SmolVLM: Redefining small and efficient multimodal models • Paper • 2504.05299 • Published 15 days ago • 168
Vision Language Models Quantization • Collection • Vision Language Models (VLMs) quantized by Neural Magic • 20 items • Updated Mar 4 • 6
MambaVision • Collection • MambaVision: A Hybrid Mamba-Transformer Vision Backbone, including both 1K and 21K pretrained models • 13 items • Updated 8 days ago • 31
MoshiVis v0.1 • Collection • MoshiVis is a Vision Speech Model built as a perceptually augmented version of Moshi v0.1 for conversing about image inputs • 8 items • Updated Mar 21 • 22
Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM • Article • Mar 12 • 398