Omar Sanseviero's picture

Omar Sanseviero

osanseviero

·

https://osanseviero.github.io/hackerllama/

AI & ML interests

Llamas, model merging, massive ASR for data collection, 3D ML, on-device ML, quantization, model judging, ML in browser, healthcare applications, education, intersection of art and ML.🦙

Recent Activity

liked a model about 12 hours ago

CraneAILabs/swahili-gemma-1b

liked a model 5 days ago

google/gemma-3-270m-it

liked a model 5 days ago

google/gemma-3-270m

View all activity

Organizations

upvoted a collection about 1 month ago

T5Gemma

32 items • Updated Jul 10 • 64

upvoted an article about 2 months ago

Article

Gemma 3n fully available in the open-source ecosystem!

By

and 7 others •

Jun 26

• 115

upvoted a paper about 2 months ago

VideoPrism: A Foundational Visual Encoder for Video Understanding

Paper • 2402.13217 • Published Feb 20, 2024 • 37

upvoted a changelog 3 months ago

Changelog

New Inference Providers Dashboard

Jun 5

• 62

upvoted a collection 3 months ago

GRMR V3 Models

An improved set of models for grammar correction. (Chat template should work, no "responding as an LLM" anymore, that kind of stuff). • 6 items • Updated Jun 4 • 10

upvoted a paper 3 months ago

One RL to See Them All: Visual Triple Unified Reinforcement Learning

Paper • 2505.18129 • Published May 23 • 60

upvoted an article 3 months ago

Article

The Transformers Library: standardizing model definitions

By

and 3 others •

May 15

• 117

upvoted 2 collections 3 months ago

MedGemma Release

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 7 items • Updated Jul 11 • 282

Gemma 3n Preview

4 items • Updated Jul 10 • 169

upvoted an article 4 months ago

Article

17 Reasons Why Gradio Isn't Just Another UI Library

By

and 1 other •

Apr 16

• 42

upvoted a collection 4 months ago

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory. • 29 items • Updated 5 days ago • 31

upvoted a paper 4 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 280

upvoted an article 4 months ago

Article

The Large Language Model Course

By

•

Jan 16

• 199

upvoted a collection 5 months ago

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated Jul 10 • 208

upvoted an article 5 months ago

Article

Custom Vibe Coding Quest Part 2: 🚙 Fine-Tuning Gemma 3 for Code Reasoning

By

•

Apr 1

• 25

upvoted a paper 5 months ago

Gemma 3 Technical Report

Paper • 2503.19786 • Published Mar 25 • 55

upvoted a collection 5 months ago

Gemma 3 Release

28 items • Updated 9 days ago • 462

upvoted an article 5 months ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

By

and 3 others •

Mar 12

• 453

upvoted a collection 5 months ago

Cohere Labs Aya Vision

Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 19 days ago • 69

upvoted an article 6 months ago

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

By

and 3 others •

Mar 4

• 75