File size: 441 Bytes
5fa1a76 |
1 2 3 4 5 6 7 8 9 10 11 12 |
from datasets import load_dataset squad = load_dataset("squad", split="train[:5000]") Split the dataset's train split into a train and test set with the [~datasets.Dataset.train_test_split] method: squad = squad.train_test_split(test_size=0.2) Then take a look at an example: squad["train"][0] {'answers': {'answer_start': [515], 'text': ['Saint Bernadette Soubirous']}, 'context': 'Architecturally, the school has a Catholic character. |