Kevin King PRO

NeoCodes-dev

king112

AI & ML interests

Deep RL, RL for LLMs

Recent Activity

updated a model 1 day ago

NeoCodes-dev/Qwen2-0.5B-GRPO-test

published a model 1 day ago

NeoCodes-dev/Qwen2-0.5B-GRPO-test

updated a collection 1 day ago

Research Papers

Organizations

Collections 17

spaces 1

Sleeping

First Agent Template

⚡

Find the current time in any timezone

models 20

datasets

None public yet

Collections 17

state-spaces/mamba2attn-2.7b

watt-ai/watt-tool-70B

agent-husky/husky-v1-action-llama2-13b

all-hands/openhands-lm-32b-v0.1

models 20

NeoCodes-dev/Qwen2-0.5B-GRPO-test

NeoCodes-dev/SmolLM_135M_GRPO

NeoCodes-dev/Qwen2_7B-GRPO-test

NeoCodes-dev/Qwen2.5_3B-GRPO-test

NeoCodes-dev/codeparrot-ds

NeoCodes-dev/gemma-2-2B-it-thinking-function_calling-V0

NeoCodes-dev/Unit8_part1_V1

NeoCodes-dev/rl_course_vizdoom_health_gathering_supreme

NeoCodes-dev/poca-SoccerTwos

NeoCodes-dev/a2c-PandaReachDense-v2
