Many image captioning datasets contain multiple captions per image.