hf (pretrained=../,trust_remote_code=True), gen_kwargs: (None), limit: None, num_fewshot: None, batch_size: 16 |Tasks|Version|Filter|n-shot|Metric| |Value | |Stderr| |-----|------:|------|-----:|------|---|-----:|---|-----:| |coqa | 3|none | 0|em |↑ |0.6575|± |0.0183| | | |none | 0|f1 |↑ |0.8028|± |0.0130|