File size: 110 Bytes
5fa1a76
1
CLAP (Contrastive Language-Audio Pretraining) is a neural network trained on a variety of (audio, text) pairs.