---
license: apache-2.0
language: de
library_name: transformers
tags:
- text-to-speech
- tts
- german
- chatterbox
- voice-cloning
- zero-shot
- merged-model
---

# Kartoffelbox-v0.1_0.65h2: A Merged German Chatterbox-TTS Model

## Model Description

This repository contains an experimental, **standalone** German Text-to-Speech model based on the [Chatterbox](https://github.com/resemble-ai/chatterbox) framework.

This model is a **hybrid** created by merging two fine-tuned models:
1.  The well-known German TTS "patch" [SebastianBodza/Kartoffelbox-v0.1](https://huggingface.co/SebastianBodza/Kartoffelbox-v0.1).
2.  A custom model extensively fine-tuned on a large, diverse dataset of German voices (~12.000 samples).

The goal was to create a robust, general-purpose German TTS model by combining the natural prosody of `Kartoffelbox` with a model trained on a wide variety of voices and data types. The final weights are a **65/35 merge**, favoring the custom-trained, multi-speaker model. Unlike patch-based models, this is a complete, self-contained model that can be loaded directly.

**Key Features:**
- **Language:** German
- **Type:** Standalone, Multi-Speaker, Merged Hybrid Model
- **Capabilities:** High-quality speech synthesis and Zero-Shot Voice Cloning for **variable German voices**.
- **Robustness:** Specifically trained to handle numbers, dates, and other complex data formats. (which work some times :D)

## How to Use the Model

This is a complete model and does not require manual patching. You will need the `chatterbox` library from Resemble AI to run it.

**1. Installation**

```bash
# Clone the official Chatterbox repository and install its dependencies
git clone https://github.com/resemble-ai/chatterbox.git
cd chatterbox
pip install -e .