hf (pretrained=../,trust_remote_code=True), gen_kwargs: (None), limit: None, num_fewshot: 5, batch_size: 16 | |
| Tasks |Version|Filter|n-shot| Metric | |Value | |Stderr| | |
|---------|------:|------|-----:|--------|---|-----:|---|-----:| | |
|hellaswag| 1|none | 5|acc |↑ |0.5624|± |0.0050| | |
| | |none | 5|acc_norm|↑ |0.7430|± |0.0044| |