Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
raw
history blame contribute delete
184 Bytes
Create a prepare_dataset function that takes in a
single example and uses the SpeechT5Processor object to tokenize the input text and load the target audio into a log-mel spectrogram.