Running on Zero 46 46 IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System 🎙 Generate speech from text using reference audio
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated 13 days ago • 618k • 1.31k