File size: 129 Bytes
5fa1a76
 
1
2
In this paper, we show that a standard Transformer architecture can be used with
minimal modifications to process byte sequences.