Chaew00n/test-policy-optimization-query-expansion-keyword
Transformers
Safetensors
Generated from Trainer
trl
grpo
arxiv:2402.03300
main
Commit History
79c6414 | Jun 11 | Chaew00n (verified) | Model save
dd6402b | Jun 11 | Chaew00n (verified) | Training in progress, step 5000, checkpoint
3edd003 | Jun 11 | Chaew00n (verified) | Training in progress, step 5000
8f447d8 | Jun 11 | Chaew00n (verified) | Training in progress, step 4000, checkpoint
8399bce | Jun 11 | Chaew00n (verified) | Training in progress, step 4000
77e2727 | Jun 11 | Chaew00n (verified) | Training in progress, step 3000, checkpoint
a7a4470 | Jun 11 | Chaew00n (verified) | Training in progress, step 3000
c9e3438 | Jun 10 | Chaew00n (verified) | Training in progress, step 2000, checkpoint
d1e40e3 | Jun 10 | Chaew00n (verified) | Training in progress, step 2000
b65d26d | Jun 10 | Chaew00n (verified) | Training in progress, step 1000, checkpoint
317bc99 | Jun 10 | Chaew00n (verified) | Training in progress, step 1000
c5da396 | Jun 10 | Chaew00n (verified) | initial commit