Shizhe Diao

shizhediao2

AI & ML interests

LLM pre-training and reasoning

Recent Activity

updated a model about 10 hours ago
data4elm/Llama-400M-12L
published a model about 10 hours ago
data4elm/Llama-400M-12L
updated a dataset about 16 hours ago
nvidia/ClimbLab
View all activity

Organizations

NVIDIA's profile picture temp_math_data's profile picture UGPhysics's profile picture Data Filtering Challenge for Training Edge Language Models's profile picture

models

None public yet

datasets

None public yet