HiFiGAN Arabic Vocoder
A standalone implementation of the HiFiGAN vocoder for Arabic text-to-speech, based on the paper "HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis" (https://arxiv.org/pdf/2010.05646.pdf).
Usage
from hifigan_ar_v2 import HiFiGANArabicGenerator
import torch
# Load the model
model = HiFiGANArabicGenerator.from_pretrained("generator.ckpt", "config.json")
# Generate audio from a mel spectrogram shaped [batch, mel_channels, frames]
mel = torch.rand(1, 80, 122) # Random placeholder standing in for a real mel spectrogram
audio = model(mel) # Shape: [1, 1, 8448]
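To listen to the result, the generated waveform has to be written to an audio file. The following is a minimal sketch using only the Python standard library. It assumes the generator output lies in the range [-1, 1] (the reference HiFi-GAN generator ends in a tanh, but this is not confirmed for this checkpoint) and uses the 22050 Hz sample rate listed under Model Details. The helper name `save_wav` is hypothetical and not part of `hifigan_ar_v2`.

```python
import struct
import wave

SAMPLE_RATE = 22050  # sample rate listed under Model Details


def save_wav(samples, path, sample_rate=SAMPLE_RATE):
    """Write mono float samples to a 16-bit PCM WAV file.

    Assumption: samples are floats nominally in [-1, 1]; anything
    outside that range is clipped before conversion.
    """
    pcm = []
    for sample in samples:
        clipped = max(-1.0, min(1.0, float(sample)))
        pcm.append(int(round(clipped * 32767)))

    with wave.open(path, "wb") as wav_file:
        wav_file.setnchannels(1)       # single output channel
        wav_file.setsampwidth(2)       # 2 bytes per sample = 16-bit PCM
        wav_file.setframerate(sample_rate)
        # WAV stores PCM data as little-endian signed integers.
        wav_file.writeframes(struct.pack("<%dh" % len(pcm), *pcm))
```

With the tensor from the usage example, a call could look like `save_wav(audio.squeeze().detach().cpu().tolist(), "output.wav")`, which flattens the `[1, 1, N]` output into a plain list of floats first.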
Model Details
- Sample Rate: 22050 Hz
- Input: Mel spectrogram (80 channels)
- Output: Audio waveform (1 channel)
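The details above are enough to sanity-check inputs and estimate output length before running the model. The sketch below is illustrative only: the helper names are hypothetical, and the hop length of 256 is an assumption taken from the 22.05 kHz configurations in the HiFi-GAN paper's reference implementation (upsampling rates 8, 8, 2, 2). The actual factor for this checkpoint is whatever its `config.json` specifies.

```python
SAMPLE_RATE = 22050       # from Model Details
N_MELS = 80               # from Model Details
ASSUMED_HOP_LENGTH = 256  # assumption: check config.json for the real value


def check_mel_shape(shape, n_mels=N_MELS):
    """Validate a [batch, n_mels, frames] shape and return the frame count."""
    if len(shape) != 3:
        raise ValueError("expected 3 dimensions [batch, n_mels, frames], got %r" % (shape,))
    if shape[1] != n_mels:
        raise ValueError("expected %d mel channels, got %d" % (n_mels, shape[1]))
    return shape[2]


def expected_num_samples(num_frames, hop_length=ASSUMED_HOP_LENGTH):
    """Each mel frame is upsampled to hop_length waveform samples."""
    return num_frames * hop_length


def duration_seconds(num_samples, sample_rate=SAMPLE_RATE):
    """Convert a sample count into seconds of audio."""
    return num_samples / sample_rate
```

Under the assumed hop length, 33 frames map to 8448 samples, which is about 0.38 seconds of audio at 22050 Hz. In practice, `check_mel_shape(tuple(mel.shape))` can be called right before `model(mel)` to catch a transposed or wrongly sized input early.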