Convert images of screens to structured elements
Classify audio to identify sound types
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Efficient, fast, and natural text-to-speech with StyleTTS 2!
Generate a short video from an image
Generate music from text prompts