pangpangxuan

pangxuan

AI & ML interests

None yet

Organizations

None yet

pangxuan's activity

upvoted a paper 1 day ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published 1 day ago • 68

upvoted a paper 14 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 385

upvoted 2 papers about 2 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 111

Optimal Brain Apoptosis

Paper • 2502.17941 • Published Feb 25 • 10

upvoted 2 papers 3 months ago

GuardReasoner: Towards Reasoning-based LLM Safeguards

Paper • 2501.18492 • Published Jan 30 • 87

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

Paper • 2501.18362 • Published Jan 30 • 22