Training time
#10, opened by iHaag
I’m curious how long it took to train this model. How can Reinforcement Learning from Human Feedback (RLHF), Supervised Fine-Tuning (SFT), and reasoning work with diffusion-based models? Looking forward to the progress. Amazing work! I'm very impressed and so happy you have made it semi-open source.