Spaces:

Ahmadzei
/

RAG

Runtime error

added 3 more tables for large emb model

5fa1a76 over 1 year ago

504 Bytes

	source_lang = "en"
	target_lang = "fr"
	prefix = "translate English to French: "
	def preprocess_function(examples):
	inputs = [prefix + example[source_lang] for example in examples["translation"]]
	targets = [example[target_lang] for example in examples["translation"]]
	model_inputs = tokenizer(inputs, text_target=targets, max_length=128, truncation=True)
	return model_inputs

	To apply the preprocessing function over the entire dataset, use 🤗 Datasets [~datasets.Dataset.map] method.