YOLO World
Detect objects in images or videos
Convert text to speech with adjustable settings
Transform Your Face Into Legendary Characters!
Find matching images based on uploaded samples
Voice Clone Multilingual TTS
Generate depth video from input video
Generate videos from text or images
https://huggingface.co/papers/2501.03006
3D generation from sketches with TRELLIS & SDXL