2 18 22

Junnan Liu

jnanliu

jnanliu

AI & ML interests

NLP, LLM, LLM Reasoning, Agentic AI

Recent Activity

authored a paper 1 day ago

Dissecting Tool-Integrated Reasoning: An Empirical Study and Analysis

authored a paper 1 day ago

Intern-S1: A Scientific Multimodal Foundation Model

upvoted a paper 2 days ago

Intern-S1: A Scientific Multimodal Foundation Model

View all activity

Organizations

upvoted 2 papers 2 days ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published 2 days ago • 186

Dissecting Tool-Integrated Reasoning: An Empirical Study and Analysis

Paper • 2508.15754 • Published 2 days ago • 2

upvoted a paper 18 days ago

CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward

Paper • 2508.03686 • Published 18 days ago • 33

upvoted 2 papers about 1 month ago

CompassJudger-2: Towards Generalist Judge Model via Verifiable Rewards

Paper • 2507.09104 • Published Jul 12 • 17

Rethinking Verification for LLM Code Generation: From Generation to Testing

Paper • 2507.06920 • Published Jul 9 • 28

upvoted a paper about 2 months ago

Coding Triangle: How Does Large Language Model Understand Code?

Paper • 2507.06138 • Published Jul 8 • 20

upvoted 2 papers 2 months ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 263

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 256

upvoted 3 papers 3 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 177

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published May 30 • 69

Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective

Paper • 2505.19815 • Published May 26 • 37

upvoted 2 collections 4 months ago

Qwen3

Collection

84 items • Updated 17 days ago • 1.13k

GLM-4-0414

Collection

GLM-4-0414 series model • 8 items • Updated Jun 30 • 130

upvoted an article 5 months ago

Article

Open R1: Update #3

and 9 others •

Mar 11

• 295

upvoted 2 papers 6 months ago

OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Paper • 2502.18411 • Published Feb 25 • 75

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published Feb 10 • 60

upvoted a paper 7 months ago

Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement

Paper • 2501.12273 • Published Jan 21 • 14

upvoted a paper 8 months ago

Are Your LLMs Capable of Stable Reasoning?

Paper • 2412.13147 • Published Dec 17, 2024 • 95

Junnan Liu

AI & ML interests

Recent Activity

Organizations

jnanliu's activity

Open R1: Update #3