PodAgent: A Comprehensive Framework for Podcast Generation Paper • 2503.00455 • Published Mar 1 • 6
1-800-BAD-CODE/xlm-roberta_punctuation_fullstop_truecase Text2Text Generation • Updated Jul 15, 2023 • 43.1k • 54
1-800-BAD-CODE/punctuation_fullstop_truecase_english Text2Text Generation • Updated Mar 19, 2023 • 30.4k • 8
Towards Robust Speech Representation Learning for Thousands of Languages Paper • 2407.00837 • Published Jun 30, 2024 • 11
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data Paper • 2402.08093 • Published Feb 12, 2024 • 62
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data Paper • 2402.08093 • Published Feb 12, 2024 • 62
simonl0909/whisper-large-v2-cantonese Automatic Speech Recognition • Updated Sep 30, 2023 • 141 • 12