These functions convert the images into pixel_values and annotations to labels.