havok2 commited on
Commit
80ac382
·
verified ·
1 Parent(s): d5be189

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -6
README.md CHANGED
@@ -16,19 +16,19 @@ tags:
16
 
17
  ## Model Description
18
 
19
- This repository contains an experimental, **standalone** German Text-to-Speech model based on the [Chatterbox-TTS](https://github.com/anotherjesse/Chatterbox-TTS) framework.
20
 
21
  This model is a **hybrid** created by merging two fine-tuned models:
22
  1. The well-known German TTS "patch" [SebastianBodza/Kartoffelbox-v0.1](https://huggingface.co/SebastianBodza/Kartoffelbox-v0.1).
23
- 2. A custom model extensively fine-tuned on a specific [male/female] German voice.
24
 
25
- The goal was to combine the natural German prosody of `Kartoffelbox` with the unique vocal identity and robustness of the custom-trained model. The final weights are a **65/35 merge**, favoring the custom model. Unlike patch-based models, this is a complete, self-contained model that can be loaded directly.
26
 
27
  **Key Features:**
28
  - **Language:** German
29
- - **Type:** Standalone, Merged Hybrid Model
30
- - **Capabilities:** High-quality speech synthesis and Zero-Shot Voice Cloning.
31
- - **Vocal Characteristics:** [Describe what you hear here. E.g., A clear, male voice with a very natural German intonation, sounding less robotic than many standard models.]
32
 
33
  ## How to Use the Model
34
 
 
16
 
17
  ## Model Description
18
 
19
+ This repository contains an experimental, **standalone** German Text-to-Speech model based on the [Chatterbox](https://github.com/resemble-ai/chatterbox) framework.
20
 
21
  This model is a **hybrid** created by merging two fine-tuned models:
22
  1. The well-known German TTS "patch" [SebastianBodza/Kartoffelbox-v0.1](https://huggingface.co/SebastianBodza/Kartoffelbox-v0.1).
23
+ 2. A custom model extensively fine-tuned on a large, diverse dataset of German voices (~12.000 samples).
24
 
25
+ The goal was to create a robust, general-purpose German TTS model by combining the natural prosody of `Kartoffelbox` with a model trained on a wide variety of voices and data types. The final weights are a **65/35 merge**, favoring the custom-trained, multi-speaker model. Unlike patch-based models, this is a complete, self-contained model that can be loaded directly.
26
 
27
  **Key Features:**
28
  - **Language:** German
29
+ - **Type:** Standalone, Multi-Speaker, Merged Hybrid Model
30
+ - **Capabilities:** High-quality speech synthesis and Zero-Shot Voice Cloning for **variable German voices**.
31
+ - **Robustness:** Specifically trained to handle numbers, dates, and other complex data formats. (which work some times :D)
32
 
33
  ## How to Use the Model
34