33 16 124

Yixin Song

yixinsong

AI & ML interests

None yet

Recent Activity

liked a dataset 12 days ago

OmniSVG/MMSVG-Icon

liked a dataset 13 days ago

starriver030515/FUSION-Pretrain-10M

liked a dataset 14 days ago

starriver030515/FUSION-Finetune-12M

View all activity

Organizations

yixinsong's activity

New activity in PowerInfer/SmallThinker-3B-Preview 3 months ago

Eval script

#9 opened 3 months ago by

rawsh

New activity in PowerInfer/SmallThinker-3B-Preview 4 months ago

About the training details

#5 opened 4 months ago by

hiyouga

How to Pair with Larger Models

#7 opened 4 months ago by

windkkk

Prompt/token adjust to stop "Overthinking" in unnescissary cases

#6 opened 4 months ago by

fuzzy-mittenz

example use colab?

#3 opened 4 months ago by

NickyNicky

Update README.md

#4 opened 4 months ago by

AISafety

Training: Second Phase

#2 opened 4 months ago by

tugstugi

New activity in PowerInfer/QWQ-LONGCOT-500K 4 months ago

[bot] Conversion to Parquet

#1 opened 4 months ago by

parquet-converter

New activity in PowerInfer/LONGCOT-Refine-500K 4 months ago

[bot] Conversion to Parquet

#1 opened 4 months ago by

parquet-converter

Librarian Bot: Add language metadata for dataset

#2 opened 4 months ago by

librarian-bot

New activity in PowerInfer/SmallThinker-3B-Preview 4 months ago

Evaluation

#1 opened 4 months ago by

tugstugi

New activity in PowerInfer/TurboSparse-Mistral-Instruct 7 months ago

problems about sample strategies

#1 opened 7 months ago by

thuzhizhi

New activity in yixinsong/persona 8 months ago

[bot] Conversion to Parquet

#1 opened 8 months ago by

parquet-converter

New activity in HuggingFaceTB/SmolLM-1.7B 9 months ago

MMLU doesn't match on lm-evaluation-harness

#2 opened 9 months ago by

yixinsong

New activity in SparseLLM/relu2-5B 10 months ago

Inference API not working properly. Lack of proper modeling file?

#1 opened 10 months ago by

xunkai55

New activity in SparseLLM/relu-5B 10 months ago

Difference between SparseLLM/relu and SparseLLM/reglu - lack of modeling file?

#1 opened 10 months ago by

xunkai55

commented 3 papers 10 months ago

commented a paper about 1 year ago

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 615 •

143