A wider Baby Berta model trained using curriculum learning and layer stacking for the BabyLM Challenge Strict-Small track.
