The original code has not been released; this implementation is based on the Kakao Brain implementation, which in turn follows the original paper.