21 59 22

Sergio Paniego PRO

sergiopaniego

https://sergiopaniego.github.io/

AI & ML interests

None yet

Recent Activity

updated a Space about 1 hour ago

agents-course/Unit4-Final-Certificate

updated a model about 2 hours ago

sergiopaniego/g3-od-lora

published a model about 2 hours ago

sergiopaniego/g3-od-lora

View all activity

Organizations

sergiopaniego's activity

updated a Space about 1 hour ago

Unit4 Final Certificate

🎓

Generate Certificate of Excellence from Agents Course

updated a model about 2 hours ago

sergiopaniego/g3-od-lora

Image-Text-to-Text • Updated about 2 hours ago

published a model about 2 hours ago

sergiopaniego/g3-od-lora

Image-Text-to-Text • Updated about 2 hours ago

updated a model about 21 hours ago

sergiopaniego/distilbert-base-uncased-example

Text Classification • Updated about 21 hours ago

published a model about 21 hours ago

sergiopaniego/distilbert-base-uncased-example

Text Classification • Updated about 21 hours ago

New activity in agents-course/course-images 1 day ago

Upload 2 files

#12 opened 1 day ago by

sergiopaniego

New activity in agents-course/notebooks 8 days ago

fix-dependencies-issue

#74 opened 8 days ago by

Se7en258

updated a model 9 days ago

sergiopaniego/gemma3_license_plate_detection

Image-Text-to-Text • Updated 9 days ago • 4

published a model 9 days ago

sergiopaniego/gemma3_license_plate_detection

Image-Text-to-Text • Updated 9 days ago • 4

reacted to merve's post with 🔥 10 days ago

Post

4170

sooo many open AI releases past week, let's summarize! 🤗
merve/april-11-releases-67fcd78be33d241c0977b9d2

multimodal
> Moonshot AI released Kimi VL Thinking, first working open-source multimodal reasoning model and Kimi VL Instruct, both 16B MoEs with 3B active params (OS)
> InternVL3 released based on Qwen2.5VL, 7 ckpts with various sizes (1B to 78B)

LLMs
> NVIDIA released Llama-3_1-Nemotron-Ultra-253B-v1 an LLM built on Llama 405B for reasoning, chat and tool use
> Agentica released DeepCoder-14B-Preview, fine-tuned version of DeepSeek-R1-Distilled-Qwen-14B on problem-test pairs, along with the compiled dataset
> Zyphra/ZR1-1.5B is a new small reasoning LLM built on R1-Distill-1.5B (OS)
> Skywork-OR1-32B-Preview is a new reasoning model by Skywork

Image Generation
> HiDream releases three new models, HiDream I1 Dev, I1 Full, and I1 fast for image generation (OS)

*OS ones have Apache 2.0 or MIT licenses