Shizhe Diao
shizhediao2
AI & ML interests
LLM pre-training and reasoning
Recent Activity
updated
a model
about 10 hours ago
data4elm/Llama-400M-12L
published
a model
about 10 hours ago
data4elm/Llama-400M-12L
updated
a dataset
about 16 hours ago
nvidia/ClimbLab
Organizations
models
None public yet
datasets
None public yet