Micheal Tian

StarBurger

hctian713

AI & ML interests

self-driving, computer vision, self-supervised learning

Organizations

None yet

StarBurger's activity

liked a model 3 days ago

OpenGVLab/InternVL3-78B

Image-Text-to-Text • Updated 6 days ago • 12.8k • 125

upvoted an article 3 days ago

LeRobot goes to driving school: World’s largest open-source self-driving dataset

Article • Mar 11 • 76

upvoted a paper 8 days ago

Mavors: Multi-granularity Video Representation for Multimodal Large Language Model

Paper • 2504.10068 • Published 8 days ago • 30

upvoted a paper 15 days ago

MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models

Paper • 2504.03641 • Published 18 days ago • 14

authored a paper 2 months ago

MM-RLHF: The Next Step Forward in Multimodal LLM Alignment

Paper • 2502.10391 • Published Feb 14 • 35

upvoted a paper 4 months ago

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Paper • 2501.01957 • Published Jan 3 • 46

upvoted a paper 5 months ago

MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs

Paper • 2411.15296 • Published Nov 22, 2024 • 22

authored a paper 5 months ago

MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Paper • 2408.13257 • Published Aug 23, 2024 • 27

commented on a paper 8 months ago

MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Paper • 2408.13257 • Published Aug 23, 2024 • 27

upvoted a paper 8 months ago

MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Paper • 2408.13257 • Published Aug 23, 2024 • 27

liked a dataset 8 months ago

yifanzhang114/MME-RealWorld

Preview • Updated Nov 14, 2024 • 425 • 15

liked a Space about 1 year ago

Driving with Language 2024

liked a dataset about 1 year ago

OpenDriveLab/DriveLM

Updated Mar 4 • 131 • 22

liked 2 models about 1 year ago

llava-hf/llava-1.5-7b-hf

Image-Text-to-Text • Updated Jan 27 • 584k • 253

liuhaotian/llava-v1.6-34b

Image-Text-to-Text • Updated May 9, 2024 • 9.91k • 350