BabyLM-community/babylm-multimodal-baseline-git Image-to-Text • 0.2B • Updated about 3 hours ago • 66
BabyLM-community/babylm-multimodal-baseline-flamingo Text Generation • 0.3B • Updated 3 days ago • 107
Instructing Large Language Models for Low-Resource Languages: A Systematic Study for Basque Paper • 2506.07597 • Published Jun 9
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper • 2506.20920 • Published Jun 26 • 64
Lessons from the Trenches on Reproducible Evaluation of Language Models Paper • 2405.14782 • Published May 23, 2024
Truth Knows No Language: Evaluating Truthfulness Beyond English Paper • 2502.09387 • Published Feb 13 • 1
MUG-Eval: A Proxy Evaluation Framework for Multilingual Generation Capabilities in Any Language Paper • 2505.14395 • Published May 20 • 6
When Does Classical Chinese Help? Quantifying Cross-Lingual Transfer in Hanja and Kanbun Paper • 2411.04822 • Published Nov 7, 2024
LLM-C3MOD: A Human-LLM Collaborative System for Cross-Cultural Hate Speech Moderation Paper • 2503.07237 • Published Mar 10
HERITAGE: An End-to-End Web Platform for Processing Korean Historical Documents in Hanja Paper • 2501.11951 • Published Jan 21
Pretraining Language Models for Diachronic Linguistic Change Discovery Paper • 2504.05523 • Published Apr 7 • 6
Less is More: Pre-Training Cross-Lingual Small-Scale Language Models with Cognitively-Plausible Curriculum Learning Strategies Paper • 2410.22886 • Published Oct 30, 2024 • 2
Do Construction Distributions Shape Formal Language Learning In German BabyLMs? Paper • 2503.11593 • Published Mar 14 • 1
Subword models struggle with word learning, but surprisal hides it Paper • 2502.12835 • Published Feb 18
Small Language Models Also Work With Small Vocabularies: Probing the Linguistic Abilities of Grapheme- and Phoneme-Based Baby Llamas Paper • 2410.01487 • Published Oct 2, 2024
SemEval 2019 Shared Task: Cross-lingual Semantic Parsing with UCCA - Call for Participation Paper • 1805.12386 • Published May 31, 2018