Usage tips
BiT models are equivalent to ResNetv2 in terms of architecture, except that: 1) all batch normalization layers are replaced by group normalization,
2) weight standardization is used for convolutional layers.
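
To make the two differences concrete, here is a minimal, dependency-free sketch of what each operation computes. It is an illustration only, not the library's implementation: the function names (`standardize`, `weight_standardization`, `group_norm`) are hypothetical, tensors are represented as plain nested lists, and the learnable scale and shift that group normalization layers normally carry are omitted.

```python
import math


def standardize(values, eps=1e-5):
    """Shift values to zero mean and scale them to (roughly) unit variance."""
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    return [(v - mean) / math.sqrt(var + eps) for v in values]


def weight_standardization(filters, eps=1e-5):
    """Standardize each convolution filter independently.

    `filters` holds one flattened filter (in_channels * kH * kW weights)
    per output channel. Statistics are computed over the weights, not
    over activations.
    """
    return [standardize(f, eps) for f in filters]


def group_norm(channels, num_groups, eps=1e-5):
    """Group-normalize the activations of a single sample.

    `channels` holds one flattened feature map per channel (all the same
    length). Channels are split into `num_groups` contiguous groups and
    each group is normalized with its own mean and variance, so the
    result does not depend on the batch size.
    """
    if len(channels) % num_groups != 0:
        raise ValueError("number of channels must be divisible by num_groups")
    group_size = len(channels) // num_groups
    width = len(channels[0])
    out = []
    for start in range(0, len(channels), group_size):
        group = channels[start:start + group_size]
        flat = [v for ch in group for v in ch]
        normed = standardize(flat, eps)
        # Split the normalized values back into per-channel lists.
        for i in range(group_size):
            out.append(normed[i * width:(i + 1) * width])
    return out
```

Because group normalization computes its statistics per sample rather than per batch, it behaves the same way at any batch size, which is one reason it is a common substitute for batch normalization.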