The AudioConditioner upsamples the outputs of the previous prior to raw tokens at a given audio frames-per-second resolution.
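As a rough illustration of what upsampling a token sequence to a finer time resolution means, here is a minimal sketch that repeats each coarse token a fixed number of times (nearest-neighbour upsampling). It is an assumption-laden toy, not the AudioConditioner's actual implementation, which may use learned layers; the function name `upsample_tokens` and the `factor` parameter are hypothetical and chosen only for this example.

```python
from typing import List


def upsample_tokens(tokens: List[int], factor: int) -> List[int]:
    """Repeat each coarse token `factor` times (nearest-neighbour upsampling).

    Hypothetical helper for illustration only; it is not part of any library.
    """
    if factor < 1:
        raise ValueError("factor must be >= 1")
    # Each coarse token covers `factor` positions at the finer resolution.
    return [tok for tok in tokens for _ in range(factor)]


# Three tokens from a coarser level, stretched to a 4x finer resolution.
coarse = [7, 3, 9]
fine = upsample_tokens(coarse, factor=4)
print(len(coarse), "->", len(fine))
```

The point of the sketch is only the change in sequence length: the conditioning signal ends up with one entry per position at the target resolution, so it can be aligned with the tokens generated at that level.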