Load a feature extractor with [AutoFeatureExtractor.from_pretrained]:

from transformers import AutoFeatureExtractor
feature_extractor = AutoFeatureExtractor.from_pretrained(
     "ehcalabres/wav2vec2-lg-xlsr-en-speech-emotion-recognition"
 )

AutoProcessor
Multimodal tasks require a processor that combines two types of preprocessing tools.