Following prior work on long-sequence transformers, we evaluate Longformer on character-level language modeling and achieve state-of-the-art results on text8 and enwik8.