File size: 201 Bytes
5fa1a76
 
 
 
 
 
1
2
3
4
5
6
Before you begin, install 🤗 Datasets so you can load some datasets to experiment with:

pip install datasets
Natural Language Processing

The main tool for preprocessing textual data is a tokenizer.