GameScribes
/

stella_en_400M_v5

Model card Files Files and versions Community

stella_en_400M_v5 / README.md

devve1's picture

Update README.md

5604e1b verified 6 months ago

|

582 Bytes

	Re-Upload of https://huggingface.co/dunzhang/stella_en_400M_v5 with :
	- Only 1024 embeddings dimensions folder
	- Max tokens lenght to 512 ( model has been trained on this sequence lenght )
	- Padding strategy set to "BatchLongest" to avoid batching every line to a fixed length of 512 tokens in case of size less than 512. Remove unecessary padding.



	Parameters at the end of the file "config.json" has been set manually to false for CPU usage:
	```"unpad_inputs": false, "use_memory_efficient_attention": false```

	You can turn them back to "true" to enable GPU usage back again