File size: 114 Bytes
5fa1a76
 
1
2
Transfer performance in downstream
tasks outperforms supervised pre-training and shows promising scaling behavior.