Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
The original implementation can be found
here: https://github.com/TsinghuaAI/CPM-Generate
CPM's architecture is the same as GPT-2, except for tokenization method.