HiFiGAN Arabic Vocoder

A standalone implementation of HiFiGAN vocoder for Arabic text-to-speech, based on the paper "HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis" (https://arxiv.org/pdf/2010.05646.pdf).

Usage

from hifigan_ar_v2 import HiFiGANArabicGenerator
import torch

# Load the model
model = HiFiGANArabicGenerator.from_pretrained("generator.ckpt", "config.json")

# Generate audio from mel spectrogram
mel = torch.rand(1, 80, 122)  # Example mel spectrogram
audio = model(mel)  # Shape: [1, 1, 8448]

Model Details

  • Sample Rate: 22050 Hz
  • Input: Mel spectrogram (80 channels)
  • Output: Audio waveform (1 channel)
Downloads last month
13
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support