Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Shizhe Diao's picture
18 21 26

Shizhe Diao

shizhediao2
bunyaminergen's profile picture 21world's profile picture research4pan's profile picture
·
https://shizhediao.github.io/
  • shizhediao
  • shizhediao
  • shizhediao

AI & ML interests

LLM pre-training and reasoning

Recent Activity

upvoted a paper 9 days ago
Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL
updated a model 10 days ago
nvidia/Nemotron-Research-Reasoning-Qwen-1.5B
new activity 10 days ago
nvidia/Nemotron-Research-Reasoning-Qwen-1.5B:Can you open source your training dataset on STEM (after selection)? Thanks! :)
View all activity

Organizations

NVIDIA's profile picture temp_math_data's profile picture UGPhysics's profile picture Data Filtering Challenge for Training Edge Language Models's profile picture

models 1

shizhediao2/Nemotron-Research-Reasoning-Qwen-1.5B

Updated May 14

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs