Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
We further propose a variant of Mega that offers linear time and space complexity yet yields only minimal quality loss, by efficiently splitting the whole sequence into multiple chunks with fixed length.