Spaces:

Ahmadzei
/

RAG

Runtime error

added 3 more tables for large emb model

5fa1a76 over 1 year ago

441 Bytes

	from datasets import load_dataset
	squad = load_dataset("squad", split="train[:5000]")

	Split the dataset's train split into a train and test set with the [~datasets.Dataset.train_test_split] method:

	squad = squad.train_test_split(test_size=0.2)

	Then take a look at an example:

	squad["train"][0]
	{'answers': {'answer_start': [515], 'text': ['Saint Bernadette Soubirous']},
	'context': 'Architecturally, the school has a Catholic character.