A [CLS] token is added to serve as a representation of the entire image.
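
As a minimal sketch of this idea, the snippet below assumes the common Vision Transformer convention in which the [CLS] embedding is placed at position 0 of the patch-embedding sequence; the text above does not specify the position. The function name `prepend_cls_token` and the sizes used are hypothetical and chosen only for illustration.

```python
import random


def prepend_cls_token(patch_embeddings, cls_token):
    """Return a new sequence with the [CLS] embedding at position 0.

    Assumption: the token is prepended (ViT-style); other placements are possible.
    """
    dim = len(cls_token)
    if any(len(p) != dim for p in patch_embeddings):
        raise ValueError("all embeddings must share the same dimension")
    # Copy the vectors so the caller's lists are not mutated later.
    return [list(cls_token)] + [list(p) for p in patch_embeddings]


random.seed(0)
num_patches, dim = 4, 3  # hypothetical sizes for illustration
patches = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(num_patches)]
cls_token = [0.0] * dim  # in a trained model this would typically be a learned vector

sequence = prepend_cls_token(patches, cls_token)
# The sequence is now one element longer than the number of patches.
# After the encoder, the output at index 0 would be read as the image representation.
image_repr_slot = sequence[0]
```

In a typical setup, the encoder output at the [CLS] position is then passed to a classification head, so that a single vector summarizes the whole image.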