SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper ⢠2502.02737 ⢠Published Feb 4 ⢠226
view post Post 1456 Hey š I'm helping out on some community research to learn about the AI community. If you want to join in the conversation, head over here where I started a community discussion on the most influential model since BERT. OSAIResearchCommunity/README#2 See translation š 2 2 + Reply
view post Post 4383 I have just released a new blogpost about kv caching and its role in inference speedup šš https://huggingface.co/blog/not-lain/kv-caching/some takeaways : See translation 4 replies Ā· š„ 8 8 š¤ 4 4 + Reply