vwxyzjn/ppo_zephyr_vllm_1e-6_kl_0.05
