5fa1a76
1
The dot product between the projected image and text features is then used as a similarity score.