Yuhan123/reading-level-pairwise-reward-chosen-12th-grade-rejected-preschool-1-steps-1000
Text Generation
Safetensors
English
olmo2
language-model
fine-tuned
ppo
rlhf
conversational
License: apache-2.0
tokenizer.json
Yuhan123
Upload best model checkpoint from data/olmo_reading_level_pairwise_reward_chosen_12th_grade_rejected_preschool_-1_steps_1000/best_model
472376b
verified
about 1 month ago
Safe
7.14 MB
File too large to display; check the raw version instead.