Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
10
minju
iaminju
Follow
saytes's profile picture
KangsanKim71's profile picture
gmlwns5176's profile picture
5 followers
·
3 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
13 days ago
T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models
updated
a model
25 days ago
iaminju/rlpvr_pref_only
published
a model
25 days ago
iaminju/rlpvr_pref_only
View all activity
Organizations
models
13
Sort: Recently updated
iaminju/rlpvr_pref_only
Updated
25 days ago
•
2
iaminju/rlpvr_math_only
Updated
25 days ago
•
18
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_83k_3
Updated
Feb 28
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_83k_2
Updated
Feb 28
•
1
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_83k
Updated
Feb 27
•
1
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_10k
Text Generation
•
Updated
Feb 26
•
7
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_1k
Text Generation
•
Updated
Feb 26
•
7
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_nq_s_pref
Text Generation
•
Updated
Feb 25
•
3
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_pref
Text Generation
•
Updated
Feb 25
•
2
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_math_nq_s
Updated
Feb 25
Expand 13 models
datasets
None public yet