Jiahao Xu's picture

3 5 3

Jiahao Xu

Jiahao004

·

Jiahao004

AI & ML interests

Sentence Emebddings; Neural Machine Translation

Recent Activity

updated a dataset 5 days ago

Jiahao004/agentllm_trainingset

updated a dataset 6 days ago

Jiahao004/agentllm

published a dataset 6 days ago

Jiahao004/agentllm

View all activity

Organizations

Jiahao004 's models 13

Jiahao004/agentllm_SFT-template-3_1_qwen-train-Qwen3-8B-1e-5LR_best

Text Generation • 0.0B • Updated Jul 10 • 8

Jiahao004/SFT-agentllm-template2-train4-Qwen3-0.6B-1e-6LR-3Epochs-32768Tokens-1BS-think-step-by-step

0.6B • Updated Jun 27 • 9

Jiahao004/SFT-agentllm-template2-train3-1example-Qwen3-0.6B-1e-5LR-50Epochs-checkpoint-50

0.6B • Updated Jun 27 • 8

Jiahao004/SFT-agentllm-template1-train2-Qwen3-0.6B-1e-5LR-50Epochs-32768Tokens-1BS-think-step-by-step

Text Generation • 0.6B • Updated Jun 25 • 20

Jiahao004/SFT-agentllm-template1-Qwen3-0.6B-5e-5LR-3Epochs-32768Tokens-1BS-think-step-by-step

0.6B • Updated Jun 25 • 9

Jiahao004/agentllm-SFT-baseline-Qwen3-8B-5e-5LR-3Epochs

0.0B • Updated Jun 25 • 8

Jiahao004/SFT-agentllm-template1-Qwen3-8B-5e-5LR-3Epochs-32768Tokens-1BS-think-step-by-step

8B • Updated Jun 25 • 8

Jiahao004/SFT-agentllm-template1-Qwen3-8B-5e-5LR-3Epochs-32768Tokens

8B • Updated Jun 24 • 7

Jiahao004/test

Jiahao004/SFT-agentllm-template1-Qwen3-0.6B-5e-5LR-3Epochs-32768Tokens-1BS-1GA-flash-attn2-8GPUs-1Nodes

0.6B • Updated Jun 23 • 7

Jiahao004/DeepTheorem-qwen-7b-rl

8B • Updated May 26 • 6 • 3

Jiahao004/DeepTheorem-qwen-3b-rl

3B • Updated May 26 • 5

Jiahao004/DeepTheorem-qwen-1.5b-rl

2B • Updated May 26 • 5 • 1