Zhuoran Yang

zhuoranyang

AI & ML interests

reinforcement learning, game theory, AGI

zhuoranyang's activity

authored a paper about 2 months ago

Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation

Paper • 2502.16707 • Published Feb 23 • 13

updated 2 collections 2 months ago

LLM Agents (Prompting)

Collection

2 items • Updated Feb 16

LLM-Reasoning (training)

Collection

LLM reasoning papers, with RL and long CoT. (Post-)training of LLMs is involved. • 6 items • Updated Feb 16

liked a dataset 2 months ago

open-r1/OpenR1-Math-220k

Viewer • Updated Feb 18 • 450k • 36.5k • 557

upvoted a paper 2 months ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 24

authored a paper 5 months ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 24

updated 2 collections over 1 year ago

in-context learning & chain of thought

Collection

1 item • Updated Jan 21, 2024

Control

Collection

1 item • Updated Oct 26, 2023

authored a paper over 1 year ago

Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning

Paper • 2207.14800 • Published Jul 29, 2022

upvoted a paper almost 2 years ago

Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control

Paper • 2307.00117 • Published Jun 30, 2023 • 6