Spaces:

Ahmadzei
/

RAG

Runtime error

RAG

File size: 190 Bytes

5fa1a76

The main change ViT introduced was in how images are fed to a Transformer:

An image is split into square non-overlapping patches, each of which gets turned into a vector or patch embedding.