-
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Paper • 2409.18124 • Published • 34 -
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness
Paper • 2409.18125 • Published • 35 -
Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices
Paper • 2410.11795 • Published • 18 -
An Introduction to Vision-Language Modeling
Paper • 2405.17247 • Published • 90
Xuejian Rong
xrong
·
AI & ML interests
None yet
Recent Activity
liked
a model
15 days ago
microsoft/phi-4
liked
a model
5 months ago
HuggingFaceTB/SmolVLM-Instruct-DPO
upvoted
a
collection
5 months ago
SmolVLM
Organizations
None yet
Collections
11
models
None public yet
datasets
None public yet