Commit History
add benchmark descriptions and links to About page
67a665c
Increase floating point number in benchmark metrics
7fcf611
add winogrande and arc-challenge
56926f2
show private models by default
2bd1158
skip model detail validation for OAI/Anthropic models
4ec9008
fix typo in metric name
b1416b0
remove debug prints
9e6a3bf
fix metric name
a0ee03a
add debug prints
105e1f2
revert to correct usage of ModelDetails (without api)
24c8d00
remove swp
1e9c5dd
debug print
ee4b341
debug print
a5c094b
verified
debug print
decb818
verified
debug print
6a989eb
verified
debug print
427f12d
verified
debug print
ea10299
verified
Added empty default for api in ModelDetails
e8f05cc
verified
Added model API to submission screen
20fd601
verified
add Icelandic evals
9ef7f1a
verified
switch to mideind's fork of Eval Harness
da87917
verified
Change metric string
96f9cbe
verified
Comment out winogrande for debugging
ab6318a
verified
Add task
839d7dc
verified
Change title
4d276e3
verified
Change title
2a3757e
verified
Change title
72a1baf
verified
Make name for HF token explicit
bd503b0
verified
Fix repo names
c9a0e12
verified
Update src/envs.py
d7e7ffd
verified
Update requirements.txt
bcc83eb
verified
Update README.md
d0f181a
verified
Update app.py
84582a1
verified
doc
c1b8a96
Clémentine
commited on
more info README
910a08e
Clémentine
commited on
simplified the template
24622c4
Clémentine
commited on
CPU, TOKEN, env variables (#4)
55cc480
verified
Update app.py
4879b93
verified
Update src/submission/check_validity.py
6eb8bfd
removed last restart
daf60ae
Clémentine
commited on
simplified calls
50df158
Clémentine
commited on
made token a requirement
f982b8e
Clémentine
commited on
test
f0298e1
Clémentine
commited on
Update requirements.txt
069df83
fix
c15e77e
Clémentine
commited on
removed quantization to simplify
b899767
Clémentine
commited on
now with a functionning backend
1ffc326
Clémentine
commited on
update read
943f952
Clémentine
commited on
fixs
314f91a
Clémentine
commited on