Lun Zima PRO

Lunzima

AI & ML interests

Merge & fine-tune models for personal use

Recent Activity

updated a model about 3 hours ago
Lunzima/NQLSG-Qwen2-VL-2B-v2
updated a model about 3 hours ago
Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v9.4-Coder
updated a collection about 3 hours ago
NQLSG and its friends
View all activity

Organizations

None yet

Posts 3

view post
Post
1319
I'm currently experimenting with the SFT dataset Lunzima/alpaca_like_dataset to further boost the performance of NQLSG-Qwen2.5-14B-MegaFusion-v9.x. This includes data sourced from DeepSeek-R1 or other cleaned results (excluding CoTs). Additionally, datasets that could potentially enhance the model's performance in math and programming/code, as well as those dedicated to specific uses like Swahili, are part of the mix.
@sometimesanotion @sthenno @wanlige
view post
Post
656
🚀 Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v5 now excels in reasoning and coding, built on top of v4 which improved Chinese capabilities through SFT.