AutoModelForAudioClassification
[[autodoc]] AutoModelForAudioClassification
TFAutoModelForAudioClassification
[[autodoc]] TFAutoModelForAudioClassification
AutoModelForAudioFrameClassification
[[autodoc]] AutoModelForAudioFrameClassification
AutoModelForCTC
[[autodoc]] AutoModelForCTC
AutoModelForSpeechSeq2Seq
[[autodoc]] AutoModelForSpeechSeq2Seq
TFAutoModelForSpeechSeq2Seq
[[autodoc]] TFAutoModelForSpeechSeq2Seq
FlaxAutoModelForSpeechSeq2Seq
[[autodoc]] FlaxAutoModelForSpeechSeq2Seq
AutoModelForAudioXVector
[[autodoc]] AutoModelForAudioXVector
AutoModelForTextToSpectrogram
[[autodoc]] AutoModelForTextToSpectrogram
AutoModelForTextToWaveform
[[autodoc]] AutoModelForTextToWaveform
Multimodal
The following auto classes are available for multimodal tasks.