Usage tips | |
BiT models are equivalent to ResNetv2 in terms of architecture, except that: 1) all batch normalization layers are replaced by group normalization, | |
2) weight standardization is used for convolutional layers. |
Usage tips | |
BiT models are equivalent to ResNetv2 in terms of architecture, except that: 1) all batch normalization layers are replaced by group normalization, | |
2) weight standardization is used for convolutional layers. |