license: mit datasets: - trl-lib/ultrafeedback_binarized base_model: - meta-llama/Meta-Llama-3-8B-Instruct