python generated_ids = model.generate(pixel_values=pixel_values, max_length=50) generated_caption = processor.batch_decode(generated_ids, skip_special_tokens=True)[0] print(generated_caption) a drawing of a pink and blue pokemon Looks like the fine-tuned model generated a pretty good caption!