The Wav2Vec2 tokenizer is only trained on uppercase characters so you'll need to make sure the text matches the tokenizer's vocabulary: def uppercase(example): return {"transcription": example["transcription"].upper()} minds = minds.map(uppercase) Now create a preprocessing function that: Calls the audio column to load and resample the audio file.