Spaces:

Ahmadzei
/

RAG

Runtime error

RAG

File size: 266 Bytes

5fa1a76

Usage tips

The Persimmon models were trained using bfloat16, but the original inference uses float16 The checkpoints uploaded on the hub use torch_dtype = 'float16' which will be
used by the AutoModel API to cast the checkpoints from torch.float32 to torch.float16.