Empirically, with comparable or fewer FLOPs, Funnel-Transformer outperforms the standard Transformer on | |
a wide variety of sequence-level prediction tasks, including text classification, language understanding, and reading | |
comprehension. |
Empirically, with comparable or fewer FLOPs, Funnel-Transformer outperforms the standard Transformer on | |
a wide variety of sequence-level prediction tasks, including text classification, language understanding, and reading | |
comprehension. |