parler-tts
/

parler_tts_mini_v0.1

text2text-generation

Model card Files Files and versions Community

ylacombe commited on Apr 9, 2024

Commit

7458eda

·

verified ·

1 Parent(s): 76739bd

Update README.md

Files changed (1) hide show

README.md +4 -3

README.md CHANGED Viewed

@@ -12,12 +12,10 @@ pipeline_tag: text-to-speech
 # Parler-TTS v0.1
-Parler-TTS v0.1 is a lightweight text-to-speech (TTS) model, trained on 10.5K hours of audio data, that can generate high-quality, natural sounding speech with features that can be controlled using a simple text prompt (e.g. gender, background noise, speaking rate, pitch and reverberation)
 ## Usage
-**NOTE:** You can directly try it out in an interactive demo [here](https://huggingface.co/spaces/parler-tts/parler_tts_mini)!
 Using Parler-TTS is as simple as "bonjour". Simply install the library once:
 ```sh
 pip install git+https://github.com/huggingface/parler-tts.git
@@ -44,6 +42,9 @@ audio_arr = generation.cpu().numpy().squeeze()
 sf.write("parler_tts_out.wav", audio_arr, model.config.sampling_rate)
 ```
 **Tips**:
 * Include the term "very clear audio" to generate the highest quality audio, and "very noisy audio" for high levels of background noise
 * * Punctuation can be used to control the prosody of the generations, e.g. use commas to add small breaks in speech

 # Parler-TTS v0.1
+**Parler-TTS v0.1** is a lightweight text-to-speech (TTS) model, trained on 10.5K hours of audio data, that can generate high-quality, natural sounding speech with features that can be controlled using a simple text prompt (e.g. gender, background noise, speaking rate, pitch and reverberation)
 ## Usage
 Using Parler-TTS is as simple as "bonjour". Simply install the library once:
 ```sh
 pip install git+https://github.com/huggingface/parler-tts.git
 sf.write("parler_tts_out.wav", audio_arr, model.config.sampling_rate)
 ```
+**NOTE:** You can directly try it out in an interactive demo [here](https://huggingface.co/spaces/parler-tts/parler_tts_mini)!
 **Tips**:
 * Include the term "very clear audio" to generate the highest quality audio, and "very noisy audio" for high levels of background noise
 * * Punctuation can be used to control the prosody of the generations, e.g. use commas to add small breaks in speech