5fa1a76
1
2
Transfer performance in downstream tasks outperforms supervised pre-training and shows promising scaling behavior.