Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
As mentioned above, the dtype of the storage weights is mostly irrelevant unless you are using torch_dtype="auto" when initializing a model using.