Spaces:

Ahmadzei
/

RAG

Runtime error

RAG

File size: 222 Bytes

5fa1a76

Rather than pre-training the model to predict the class
of an image (as done in the original ViT paper), BEiT models are pre-trained to
predict visual tokens from the codebook of OpenAI's DALL-E model given masked
patches.