Spaces:

Ahmadzei
/

RAG

Runtime error

RAG

File size: 254 Bytes

5fa1a76

Unlike the recently proposed Vision 
Transformer (ViT) that was designed for image classification specifically, we introduce the Pyramid Vision Transformer 
(PVT), which overcomes the difficulties of porting Transformer to various dense prediction tasks.