Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
In this paper, we propose a pipeline of contrastive language-audio pretraining to develop an audio representation by combining audio data with natural language descriptions.