Jiaheng Liu's picture

Jiaheng Liu

CheeryLJH

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 7 hours ago

IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs

upvoted a paper 2 days ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

upvoted a paper 6 days ago

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

View all activity

Organizations

CheeryLJH's activity

commented 2 papers about 2 months ago

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Paper • 2502.19361 • Published Feb 26 • 28 •

CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models

Paper • 2502.16614 • Published Feb 23 • 27 •

commented a paper 5 months ago

Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models

Paper • 2411.07140 • Published Nov 11, 2024 • 35 •