Branches step1200000-tokens5033B and fp32 do not exist
As you say in the model card, step1200000-tokens5033B should be the pre-annealing base model branch, however, it is not present in this repo. fp32 is also not there.
Yes I think the branch information is copied from the older model card and does not fully apply here maybe @soldni knows what needs to be changed in this model card?
I see - it's also not really documented how -0125 differs from -0924, at least, not here or anywhere I can find much detail - could you explain that, or is it just somewhere I didn't see?
oh, thanks! I didn't find the updated version
so, the pre-train section is the same, just annealed / mid-trained on different data?
Hi, thanks again for the inquiry! We’re currently working on closing out old tickets, so we’re closing this out for now, but if you require a follow-up response, please re-open and we will get back to you!