1 10 3

Shiyu Zhu

ShiyuZhu

AI & ML interests

Multimodal

Recent Activity

upvoted a paper about 1 month ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

new activity about 1 month ago

Qwen/QwQ-32B-AWQ:有没有在3090上部署这个awq版本的，速度只有6tokens/s，正常吗

liked a Space 5 months ago

GanymedeNil/Qwen2-VL-7B

View all activity

Organizations

None yet

ShiyuZhu's activity

upvoted a paper about 1 month ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 155

New activity in Qwen/QwQ-32B-AWQ about 1 month ago

有没有在3090上部署这个awq版本的，速度只有6tokens/s，正常吗

#4 opened about 1 month ago by

Jsoooooo

liked a Space 5 months ago

253

Qwen2-VL-7B

🔥

Generate text by combining an image and a question

liked 2 models 6 months ago

meta-llama/Llama-3.1-8B-Instruct

Text Generation • Updated Sep 25, 2024 • 6.07M • • 3.87k

meta-llama/Meta-Llama-3-8B

Text Generation • Updated Sep 27, 2024 • 547k • 6.15k

upvoted 8 papers 7 months ago

MuCodec: Ultra Low-Bitrate Music Codec

Paper • 2409.13216 • Published Sep 20, 2024 • 24

Portrait Video Editing Empowered by Multimodal Generative Priors

Paper • 2409.13591 • Published Sep 20, 2024 • 17

Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

Paper • 2409.11406 • Published Sep 17, 2024 • 28

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17, 2024 • 75

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 115

A Comprehensive Evaluation of Quantized Instruction-Tuned Large Language Models: An Experimental Analysis up to 405B

Paper • 2409.11055 • Published Sep 17, 2024 • 17

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 148

DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?

Paper • 2409.07703 • Published Sep 12, 2024 • 69