3 18 1

zhu

xuekai

AI & ML interests

None yet

Recent Activity

authored a paper about 6 hours ago

TTRL: Test-Time Reinforcement Learning

upvoted a paper about 16 hours ago

TTRL: Test-Time Reinforcement Learning

upvoted a paper 29 days ago

Video-T1: Test-Time Scaling for Video Generation

View all activity

Organizations

xuekai's activity

authored a paper about 6 hours ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published 1 day ago • 55

upvoted a paper about 16 hours ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published 1 day ago • 55

upvoted a paper 29 days ago

Video-T1: Test-Time Scaling for Video Generation

Paper • 2503.18942 • Published about 1 month ago • 88

authored a paper about 1 month ago

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Paper • 2503.11224 • Published Mar 14 • 27

upvoted a paper about 1 month ago

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Paper • 2503.11224 • Published Mar 14 • 27

upvoted a paper 3 months ago

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

Paper • 2501.18362 • Published Jan 30 • 22

upvoted an article 3 months ago

Article

Putting RL back in RLHF

Jun 12, 2024

• 87

upvoted 2 papers 4 months ago

Free Process Rewards without Process Labels

Paper • 2412.01981 • Published Dec 2, 2024 • 35

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Paper • 2412.17739 • Published Dec 23, 2024 • 42

commented a paper 4 months ago

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published Dec 19, 2024 • 53 •

authored a paper 4 months ago

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published Dec 19, 2024 • 53

upvoted a paper 4 months ago

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published Dec 19, 2024 • 53

upvoted 2 papers 5 months ago

On Domain-Specific Post-Training for Multimodal Large Language Models

Paper • 2411.19930 • Published Nov 29, 2024 • 29

Yi-Lightning Technical Report

Paper • 2412.01253 • Published Dec 2, 2024 • 29

upvoted a paper 8 months ago

VideoLLaMB: Long-context Video Understanding with Recurrent Memory Bridges

Paper • 2409.01071 • Published Sep 2, 2024 • 28

upvoted a collection 9 months ago

Awesome SFT datasets

Collection

A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12, 2024 • 131

upvoted a paper 9 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 94