Spaces:

Ahmadzei
/

RAG

Runtime error

added 3 more tables for large emb model

5fa1a76 over 1 year ago

243 Bytes

	You can speed up the map function by setting batched=True to process multiple elements of the dataset at once:

	tokenized_wnut = wnut.map(tokenize_and_align_labels, batched=True)

	Now create a batch of examples using [DataCollatorWithPadding].