Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
raw
history blame contribute delete
241 Bytes
Take a look at the sequence length of these two audio samples:
dataset[0]["audio"]["array"].shape
(173398,)
dataset[1]["audio"]["array"].shape
(106496,)
Create a function to preprocess the dataset so the audio samples are the same lengths.