zijie tian
zijie-tian
AI & ML interests
Storage for AI
Recent Activity
upvoted
a
paper
7 days ago
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
upvoted
a
paper
7 days ago
Chain-of-Model Learning for Language Model
new activity
19 days ago
deepseek-ai/DeepSeek-R1-0528:求一个swe bench verified 跑分