Derry Pratama
ibndias
AI & ML interests
None yet
Recent Activity
liked a dataset 6 days ago: exp-models/dolphin-r1-deepseek-toolcalls
liked a dataset 6 days ago: glaiveai/glaive-function-calling-v2
liked a dataset 6 days ago: interstellarninja/tool-calls-multiturn
Organizations
Collections 2
Papers 2
Models 16

ibndias/gemma-3-1b-reasoning-grpo • Text Generation • Updated • 1
ibndias/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • Text Generation • Updated • 11
ibndias/Qwen-2.5-7B-Simple-RL • Updated
ibndias/Qwen2.5-1.5B-Open-R1-GRPO • Text Generation • Updated • 2
ibndias/Qwen-2.5-7B_Base_Math_smalllr • Updated
ibndias/Qwen2.5-1.5B-Open-R1-GRPO1st • Text Generation • Updated
ibndias/Qwen2.5-1.5B-Open-R1-Distill • Text Generation • Updated • 2
ibndias/taxi-v3 • Reinforcement Learning • Updated
ibndias/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated
ibndias/ppo-LunarLander-v2 • Reinforcement Learning • Updated • 3