Salim Belhaddad's picture

Salim Belhaddad

salym

·

https://salym.me

salym

AI & ML interests

Knowledge Graphs

Recent Activity

replied to philschmid's post 25 days ago

Gemini 2.5 Pro, thinking by default! We excited launch our best Gemini model for reasoning, multimodal and coding yet! #1 on LMSYS, Humanity’s Last Exam, AIME and GPQA and more! TL;DR: - 💻 Best Gemini coding model yet, particularly for web development (excels on LiveCodeBench). - 🧠 Default "Thinking" with up to 64k token output - 🌌 1 Million multimodal input context for text, image, video, audio, and pdf - 🛠️ Function calling, structured output, google search & code execution. - 🏆 #1 on LMArena & sota on AIME, GPQA, Humanity's Last Exam - 💡 Knowledge cut of January 2025 - 🤗 Available for free as Experimental in AI Studio, Gemini API & Gemini APP - 🚀 Rate limits - Free 2 RPM 50 req/day Try it ⬇️ https://aistudio.google.com/?model=gemini-2.5-pro-exp-03-25

updated a model about 1 month ago

salym/ppo-LunarLander-v2

reacted to mlabonne's post with 🚀 about 1 month ago

✂️ Gemma 3 Abliterated I noticed that Gemma 3 was much more resilient to refusal removal than other models like Qwen 2.5. I experimented with different recipes and improved the abliteration technique I wrote about last year. It's still experimental but the refusal rate is super low in my tests. Enjoy! https://huggingface.co/mlabonne/gemma-3-4b-it-abliterated https://huggingface.co/mlabonne/gemma-3-12b-it-abliterated https://huggingface.co/mlabonne/gemma-3-27b-it-abliterated

View all activity

Organizations

salym's activity

replied to philschmid's post 25 days ago

We need to invent a new way to benchmark models. The actual practice is fishy.

updated a model about 1 month ago

salym/ppo-LunarLander-v2

Reinforcement Learning • Updated about 1 month ago • 14

reacted to mlabonne's post with 🚀 about 1 month ago

Post

6123

✂️ Gemma 3 Abliterated

I noticed that Gemma 3 was much more resilient to refusal removal than other models like Qwen 2.5.

I experimented with different recipes and improved the abliteration technique I wrote about last year.

It's still experimental but the refusal rate is super low in my tests. Enjoy!

mlabonne/gemma-3-4b-it-abliterated
mlabonne/gemma-3-12b-it-abliterated
mlabonne/gemma-3-27b-it-abliterated

1 reply

·

reacted to mlabonne's post with 🔥 about 1 month ago

Post

8959

✂️ AutoAbliteration

I made a Colab notebook to automatically abliterate models.

It's quite general, so you can do interesting stuff like blocking a given language in the model outputs.

💻 Colab: https://colab.research.google.com/drive/1RmLv-pCMBBsQGXQIM8yF-OdCNyoylUR1?usp=sharing

updated a model about 1 month ago

salym/rl_course_vizdoom_health_gathering_supreme

Reinforcement Learning • Updated Mar 21

published a model about 1 month ago

salym/rl_course_vizdoom_health_gathering_supreme

Reinforcement Learning • Updated Mar 21

updated 2 models about 1 month ago

salym/PPO-CleanRL-LunarLander-v2

Reinforcement Learning • Updated Mar 20

salym/a2c-PandaPickAndPlace-v3

Reinforcement Learning • Updated Mar 20 • 3

published a model about 1 month ago

salym/PPO-CleanRL-LunarLander-v2

Reinforcement Learning • Updated Mar 20

updated a model about 1 month ago

salym/poca-SoccerTwos

Reinforcement Learning • Updated Mar 20 • 17

published 2 models about 1 month ago

salym/poca-SoccerTwos

Reinforcement Learning • Updated Mar 20 • 17

salym/a2c-PandaPickAndPlace-v3

Reinforcement Learning • Updated Mar 20 • 3

updated a model about 1 month ago

salym/a2c-PandaReachDense-v3

Reinforcement Learning • Updated Mar 20 • 2

published a model about 1 month ago

salym/a2c-PandaReachDense-v3

Reinforcement Learning • Updated Mar 20 • 2

updated a model about 1 month ago

salym/ppo-Pyramids

Reinforcement Learning • Updated Mar 19 • 47

published a model about 1 month ago

salym/ppo-Pyramids

Reinforcement Learning • Updated Mar 19 • 47

updated a model about 1 month ago

salym/ppo-SnowballTarget

Reinforcement Learning • Updated Mar 19 • 40

published a model about 1 month ago

salym/ppo-SnowballTarget

Reinforcement Learning • Updated Mar 19 • 40

updated a model about 1 month ago

salym/Reinforce-Pixelcopter-PLE-v0

Reinforcement Learning • Updated Mar 19

published a model about 1 month ago

salym/Reinforce-Pixelcopter-PLE-v0

Reinforcement Learning • Updated Mar 19