In this paper, we systematically study model scaling and identify that carefully balancing network depth, width, and resolution can lead to better performance.
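One way to make "balancing" concrete is a compound-scaling rule, in which a single coefficient grows depth, width, and resolution together rather than one at a time. The sketch below assumes that form. The function name `compound_scale`, the default constants, and the baseline values are illustrative assumptions, not details stated in the text above.

```python
def compound_scale(phi, alpha=1.2, beta=1.1, gamma=1.15,
                   base_depth=1.0, base_width=1.0, base_resolution=224):
    """Scale depth, width, and resolution jointly with one coefficient phi.

    alpha, beta, gamma are per-dimension growth rates (assumed values).
    The base_* arguments are hypothetical baseline multipliers and input size.
    """
    depth = base_depth * alpha ** phi          # multiplier on number of layers
    width = base_width * beta ** phi           # multiplier on channels per layer
    resolution = int(round(base_resolution * gamma ** phi))  # input image side

    # Convolution cost grows roughly linearly with depth and quadratically
    # with width and resolution, so the approximate compute multiplier is:
    flops_factor = (alpha * beta ** 2 * gamma ** 2) ** phi

    return {
        "depth": depth,
        "width": width,
        "resolution": resolution,
        "flops_factor": flops_factor,
    }


if __name__ == "__main__":
    for phi in range(4):
        print(phi, compound_scale(phi))
```

With growth rates chosen so that `alpha * beta**2 * gamma**2` is close to 2, each unit increase in `phi` roughly doubles the compute budget while keeping the three dimensions in proportion.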