Xiaohan Fu
x5fu
AI & ML interests
Security and Safety
Recent Activity
upvoted a paper about 2 months ago
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
authored a paper 3 months ago
Training Language Models to Generate Quality Code with Program Analysis Feedback
upvoted a paper 3 months ago
Training Language Models to Generate Quality Code with Program Analysis Feedback
Organizations
None yet