flow2023's Collections

3D

updated Sep 29, 2024

  • GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation

    Paper • 2401.04092 • Published Jan 8, 2024 • 22

  • AToM: Amortized Text-to-Mesh using 2D Diffusion

    Paper • 2402.00867 • Published Feb 1, 2024 • 11

  • Advances in 3D Generation: A Survey

    Paper • 2401.17807 • Published Jan 31, 2024 • 19

  • SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding

    Paper • 2401.09340 • Published Jan 17, 2024 • 22

  • PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation

    Paper • 2404.13026 • Published Apr 19, 2024 • 25

  • Probing the 3D Awareness of Visual Foundation Models

    Paper • 2404.08636 • Published Apr 12, 2024 • 14

  • 3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination

    Paper • 2406.05132 • Published Jun 7, 2024 • 31

  • LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness

    Paper • 2409.18125 • Published Sep 26, 2024 • 35