Hao Li's picture

13

Hao Li

Richardleee

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 21 days ago

Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1

upvoted a paper 21 days ago

Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation

upvoted a paper 21 days ago

Z1: Efficient Test-time Scaling with Code

View all activity

Organizations

Richardleee's activity

upvoted 3 papers 21 days ago

Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1

Paper • 2503.24376 • Published 23 days ago • 38

Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation

Paper • 2503.24379 • Published 23 days ago • 75

Z1: Efficient Test-time Scaling with Code

Paper • 2504.00810 • Published 22 days ago • 26

upvoted 2 papers 24 days ago

Think Before Recommend: Unleashing the Latent Reasoning Power for Sequential Recommendation

Paper • 2503.22675 • Published 26 days ago • 34

PHYSICS: Benchmarking Foundation Models on University-Level Physics Problem Solving

Paper • 2503.21821 • Published 28 days ago • 17

upvoted a paper about 1 month ago

Survey on Evaluation of LLM-based Agents

Paper • 2503.16416 • Published Mar 20 • 88

updated a Space about 1 month ago

Paper

Crawl and summarize papers from Google Scholar profiles

published a Space about 1 month ago

Paper

Crawl and summarize papers from Google Scholar profiles

upvoted 3 papers about 2 months ago

EgoLife: Towards Egocentric Life Assistant

Paper • 2503.03803 • Published Mar 5 • 42

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Paper • 2503.04724 • Published Mar 6 • 70

IFIR: A Comprehensive Benchmark for Evaluating Instruction-Following in Expert-Domain Information Retrieval

Paper • 2503.04644 • Published Mar 6 • 21

upvoted 4 papers 3 months ago

GPS as a Control Signal for Image Generation

Paper • 2501.12390 • Published Jan 21 • 13

Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

Paper • 2501.12375 • Published Jan 21 • 22

InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model

Paper • 2501.12368 • Published Jan 21 • 46

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published Jan 21 • 86