File size: 201 Bytes
5fa1a76
 
 
1
2
3
Despite training on low-resolution ImageNet without labels,
we find that a GPT-2 scale model learns strong image representations as measured by linear probing, fine-tuning, and
low-data classification.