Do Construction Distributions Shape Formal Language Learning In German BabyLMs? Paper • 2503.11593 • Published Mar 14 • 1
Subword models struggle with word learning, but surprisal hides it Paper • 2502.12835 • Published Feb 18
Small Language Models Also Work With Small Vocabularies: Probing the Linguistic Abilities of Grapheme- and Phoneme-Based Baby Llamas Paper • 2410.01487 • Published Oct 2, 2024