The position embeddings are also learnable and have the same size as the patch embeddings.