xianfeng

xianf

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

zhibinlan/LLaVE-2B

new activity 3 months ago

bigcode/the-stack-v2-train-smol-ids:script to download the data

new activity 11 months ago

Qwen/Qwen2-1.5B:lm_eval results is weird

Organizations

None yet

xianf's activity

New activity in bigcode/the-stack-v2-train-smol-ids 3 months ago

script to download the data

#7 opened 5 months ago by

New activity in Qwen/Qwen2-1.5B 11 months ago

lm_eval results is weird

#2 opened 11 months ago by

New activity in THUDM/glm-4-9b 11 months ago

Got an error when testing with lm_eval

#1 opened 11 months ago by

New activity in Qwen/Qwen-1_8B 11 months ago

After fine-tuning QWEN-1.8B, the output is all repeated tokens

#1 opened over 1 year ago by

New activity in allenai/dolma 12 months ago

Please provide a list of file hashes in order to check integrity of downloads

#24 opened about 1 year ago by

New activity in edugp/kenlm over 1 year ago

Add korean kenLMs

#6 opened almost 2 years ago by

New activity in mosaicml/mpt-30b-chat almost 2 years ago

The model keeps generating up to the maximum length but no EOS token.

#13 opened almost 2 years ago by

How many memory for GPU are needed?

#12 opened almost 2 years ago by

New activity in bigscience/mt0-base about 2 years ago

Why the vocab_size of tokenizer is different from model?

#2 opened about 2 years ago by

New activity in SirNeural/flan_v2 about 2 years ago

Is the missing data repaired?

#1 opened about 2 years ago by