Text Generation
Transformers
Safetensors
English
ddllama
conversational
custom_code
xuan-luo's picture
Update evals/2-hellaswag.out
ebeebc6 verified
hf (pretrained=../,trust_remote_code=True), gen_kwargs: (None), limit: None, num_fewshot: 5, batch_size: 16
| Tasks |Version|Filter|n-shot| Metric | |Value | |Stderr|
|---------|------:|------|-----:|--------|---|-----:|---|-----:|
|hellaswag| 1|none | 5|acc |↑ |0.5624|± |0.0050|
| | |none | 5|acc_norm|↑ |0.7430|± |0.0044|