5 8 14

Boyuan Zheng

boyuanzheng010

https://boyuanzheng010.github.io/

AI & ML interests

Language Agents, Multilinguality

Recent Activity

upvoted a paper 7 days ago

AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories

liked a Space 7 days ago

McGill-NLP/agent-reward-bench-demo

upvoted a paper 12 days ago

SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills

View all activity

Organizations

boyuanzheng010's activity

upvoted a paper 7 days ago

AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories

Paper • 2504.08942 • Published 11 days ago • 27

liked a Space 7 days ago

Agent Reward Bench Demo

💻

Visualize agent interactions with WebArena tasks

upvoted a paper 12 days ago

SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills

Paper • 2504.07079 • Published 13 days ago • 11

commented a paper 13 days ago

SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills

Paper • 2504.07079 • Published 13 days ago • 11 •

published a model 17 days ago

boyuanzheng010/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated 17 days ago

updated a model 20 days ago

boyuanzheng010/Qwen2.5-1.5B-Open-R1-Distill

Text Generation • Updated 20 days ago • 3

published a model 20 days ago

boyuanzheng010/Qwen2.5-1.5B-Open-R1-Distill

Text Generation • Updated 20 days ago • 3

liked a Space 28 days ago

Online-Mind2Web Leaderboard

🏆

Display and visualize evaluation results for human and automated agents

liked a Space about 1 month ago

Safearena Leaderboard

🏃

SafeArena Leaderboard

authored a paper 4 months ago

Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents

Paper • 2411.06559 • Published Nov 10, 2024 • 14

liked a dataset 4 months ago

xlangai/aguvis-stage2

Preview • Updated Dec 20, 2024 • 613 • 18

upvoted a paper 5 months ago

Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents

Paper • 2411.06559 • Published Nov 10, 2024 • 14

authored a paper 6 months ago

Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents

Paper • 2410.05243 • Published Oct 7, 2024 • 19

liked a model 7 months ago

osunlp/UGround

Image-Text-to-Text • Updated 6 days ago • 225 • 23

upvoted a paper 7 months ago

Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents

Paper • 2410.05243 • Published Oct 7, 2024 • 19

updated a dataset 11 months ago

osunlp/Multimodal-Mind2Web

Viewer • Updated Jun 5, 2024 • 14.2k • 3.63k • 66

upvoted a paper about 1 year ago

MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions

Paper • 2403.19651 • Published Mar 28, 2024 • 22

liked 2 datasets about 1 year ago

osunlp/Mind2Web

Viewer • Updated Jul 19, 2023 • 253 • 696 • 100

osunlp/TravelPlanner

Viewer • Updated Jul 14, 2024 • 1.23k • 3.96k • 56