Holarissun/SFT_gemma2b_hh-rlhf-helpful-gpt4_lr5e-06_epoch2-subset-1
