Training time

#10

by iHaag - opened Mar 19

I’m curious how long it took to train this model. How can Reinforcement Learning from Human Feedback (RLHF), Supervised Fine-Tuning (SFT), and reasoning work with diffusion-based models? Looking forward to the progress. Amazing work; I’m very impressed and so happy you have made it semi-open source.
