Slow Audio Generation – Output Voice Slower Than Input

by Vicky15 - opened Jun 9

Jun 9

The generated audio from the PlayDiffusion model is significantly slower than expected. I tested it both locally and on the Hugging Face Space, and in both cases, the output voice duration is longer than the original input with the same transcript. This results in a slow, unnatural-sounding voice. Please let me know if this is a known issue or if there’s a way to adjust the speed.

yavorr

PlayAI org Jun 9

Answered in github. Let's keep the discussion there - https://github.com/playht/PlayDiffusion/issues/11

yavorr changed discussion status to closed Jun 9

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment