zhongwei666

bruce360568

11612201@mail.sustech.edu.cn

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper about 21 hours ago

Electrocardiogram Instruction Tuning for Report Generation

Paper • 2403.04945 • Published Mar 7, 2024 • 2

updated a dataset 4 days ago

bruce360568/SRPO_RL_datasets

Preview • Updated 4 days ago • 50

published a dataset 4 days ago

bruce360568/SRPO_RL_datasets

Preview • Updated 4 days ago • 50

upvoted a paper 7 days ago

SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression

Paper • 2403.07378 • Published Mar 12, 2024 • 4

upvoted 2 papers about 1 month ago

MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent

Paper • 2507.02259 • Published Jul 3 • 3

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10 • 156

upvoted 2 papers about 2 months ago

Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens

Paper • 2506.17218 • Published Jun 20 • 27

PAG: Multi-Turn Reinforced LLM Self-Correction with Policy as Generative Verifier

Paper • 2506.10406 • Published Jun 12 • 2

upvoted a collection about 2 months ago

DyCodeEval

Collection

DyCodeEval (ICML 2025) enables dynamic benchmarking for code LLMs. This collection features dynamic HumanEval and MBPP sets generated with Claude 3.5. • 3 items • Updated Jun 27 • 4

upvoted 5 papers 2 months ago

Interleaved Reasoning for Large Language Models via Reinforcement Learning

Paper • 2505.19640 • Published May 26 • 13

Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL

Paper • 2505.17952 • Published May 23 • 20

SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning

Paper • 2506.01713 • Published Jun 2 • 47

Efficient Large Language Models: A Survey

Paper • 2312.03863 • Published Dec 6, 2023 • 4

TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation

Paper • 2505.05422 • Published May 8 • 8

upvoted 6 papers 3 months ago

Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models

Paper • 2505.17225 • Published May 22 • 65

TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations

Paper • 2505.18125 • Published May 23 • 113

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23 • 89
