hf (pretrained=../,trust_remote_code=True), gen_kwargs: (None), limit: None, num_fewshot: 5, batch_size: 16 |Tasks|Version| Filter |n-shot| Metric | |Value | |Stderr| |-----|------:|----------------|-----:|-----------|---|-----:|---|-----:| |gsm8k| 3|flexible-extract| 5|exact_match|↑ |0.6831|± |0.0128| | | |strict-match | 5|exact_match|↑ |0.6573|± |0.0131|