Yuzhen Huang

yuzhen17

https://hyz17.github.io

HYZ17

AI & ML interests

None yet

yuzhen17's activity

upvoted a paper 5 days ago

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published 6 days ago • 56

updated a model 14 days ago

RL4Reasoning/verl-grpo-lr-deepscaler-bsz128-16384-rtl-dynamic-m-e-cliphigh-hf-1.5B-4_deepscaler_step590

Updated 14 days ago • 4

published a model 14 days ago

RL4Reasoning/verl-grpo-lr-deepscaler-bsz128-16384-rtl-dynamic-m-e-cliphigh-hf-1.5B-4_deepscaler_step590

Updated 14 days ago • 4

upvoted a paper 19 days ago

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published 25 days ago • 78

upvoted a paper 20 days ago

Expanding RL with Verifiable Rewards Across Diverse Domains

Paper • 2503.23829 • Published 22 days ago • 18

upvoted a paper 21 days ago

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published 25 days ago • 43

New activity in ceval/ceval-exam 24 days ago

[bot] Conversion to Parquet

#7 opened 27 days ago by parquet-converter

updated a collection 27 days ago

SimpleRL-Zoo

The collection for the Paper "SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild" • 12 items • Updated 20 days ago • 6

New activity in ceval/ceval-exam 27 days ago

Convert dataset to Parquet

#6 opened 28 days ago by

authored a paper 28 days ago

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published 28 days ago • 30

upvoted 2 papers 28 days ago

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published 28 days ago • 30

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published Mar 20 • 46

updated a collection 28 days ago

SimpleRL-Zoo

The collection for the Paper "SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild" • 12 items • Updated 20 days ago • 6

updated a model 28 days ago

hkust-nlp/Llama-3.1-8B-SimpleRL-Zoo

Updated 28 days ago • 44

published a model 28 days ago

hkust-nlp/Llama-3.1-8B-SimpleRL-Zoo

Updated 28 days ago • 44

updated a collection 28 days ago

SimpleRL-Zoo

The collection for the Paper "SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild" • 12 items • Updated 20 days ago • 6

updated a collection 29 days ago

SimpleRL-Zoo

The collection for the Paper "SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild" • 12 items • Updated 20 days ago • 6

updated 3 models 29 days ago

hkust-nlp/Qwen-2.5-32B-SimpleRL-Zoo

Updated 29 days ago • 199

hkust-nlp/Qwen-2.5-7B-SimpleRL-Zoo

Updated 29 days ago • 881

hkust-nlp/DeepSeek-Math-7B-SimpleRL-Zoo

Updated 29 days ago • 74