Sergio Paniego's picture

Sergio Paniego PRO

sergiopaniego

AI & ML interests

None yet

Recent Activity

updated a Space about 1 hour ago
agents-course/Unit4-Final-Certificate
updated a model about 2 hours ago
sergiopaniego/g3-od-lora
published a model about 2 hours ago
sergiopaniego/g3-od-lora
View all activity

Organizations

Hugging Face's profile picture The LLM Course's profile picture TRL's profile picture Blog-explorers's profile picture ZeroGPU Explorers's profile picture Hugging Face Discord Community's profile picture Cookbook Authors's profile picture open/ acc's profile picture RoboticsLabURJC's profile picture Hugging Face Agents Course's profile picture

sergiopaniego's activity

New activity in agents-course/course-images 1 day ago

Upload 2 files

#12 opened 1 day ago by
sergiopaniego
New activity in agents-course/notebooks 8 days ago

fix-dependencies-issue

1
#74 opened 8 days ago by
Se7en258
reacted to merve's post with ๐Ÿ”ฅ 10 days ago
view post
Post
4170
sooo many open AI releases past week, let's summarize! ๐Ÿค—
merve/april-11-releases-67fcd78be33d241c0977b9d2

multimodal
> Moonshot AI released Kimi VL Thinking, first working open-source multimodal reasoning model and Kimi VL Instruct, both 16B MoEs with 3B active params (OS)
> InternVL3 released based on Qwen2.5VL, 7 ckpts with various sizes (1B to 78B)

LLMs
> NVIDIA released Llama-3_1-Nemotron-Ultra-253B-v1 an LLM built on Llama 405B for reasoning, chat and tool use
> Agentica released DeepCoder-14B-Preview, fine-tuned version of DeepSeek-R1-Distilled-Qwen-14B on problem-test pairs, along with the compiled dataset
> Zyphra/ZR1-1.5B is a new small reasoning LLM built on R1-Distill-1.5B (OS)
> Skywork-OR1-32B-Preview is a new reasoning model by Skywork

Image Generation
> HiDream releases three new models, HiDream I1 Dev, I1 Full, and I1 fast for image generation (OS)

*OS ones have Apache 2.0 or MIT licenses
ยท
view reply

We've created a complete Fine-tuning a Multimodal Model Using SFT (Single or Multi-Image Dataset)
guide using Gemma 3, in case you're intereseted!

New activity in agents-course/course-images 13 days ago
upvoted an article 16 days ago
view article
Article

Mixture of Experts Explained

โ€ข 574