Kevin King PRO
NeoCodes-dev
AI & ML interests
Deep RL, RL for LLMs
Recent Activity
Updated a model 1 day ago: NeoCodes-dev/Qwen2-0.5B-GRPO-test
Published a model 1 day ago: NeoCodes-dev/Qwen2-0.5B-GRPO-test
Updated a collection 1 day ago: Research Papers
Organizations
Collections: 17
Models: 20
NeoCodes-dev/Qwen2-0.5B-GRPO-test
NeoCodes-dev/SmolLM_135M_GRPO • Text Generation • 1
NeoCodes-dev/Qwen2_7B-GRPO-test
NeoCodes-dev/Qwen2.5_3B-GRPO-test
NeoCodes-dev/codeparrot-ds • 1
NeoCodes-dev/gemma-2-2B-it-thinking-function_calling-V0
NeoCodes-dev/Unit8_part1_V1 • Reinforcement Learning
NeoCodes-dev/rl_course_vizdoom_health_gathering_supreme • Reinforcement Learning
NeoCodes-dev/poca-SoccerTwos • Reinforcement Learning • 12
NeoCodes-dev/a2c-PandaReachDense-v2 • Reinforcement Learning
Datasets: none public yet