xianfeng
xianf
·
AI & ML interests
None yet
Recent Activity
liked
a model
about 1 month ago
zhibinlan/LLaVE-2B
new activity
3 months ago
bigcode/the-stack-v2-train-smol-ids:script to download the data
new activity
11 months ago
Qwen/Qwen2-1.5B:lm_eval results is weird
Organizations
None yet
xianf's activity
script to download the data
1
1
#7 opened 5 months ago
by
eminorhan

lm_eval results is weird
5
#2 opened 11 months ago
by
xianf
使用 lm_eval 测试时报错了
2
#1 opened 11 months ago
by
xianf
QWEN-1.8B finetune 之后输出全是重复的 token
5
#1 opened over 1 year ago
by
xianf
Please provide a list of file hashes in order to check integrity of downloads
2
#24 opened about 1 year ago
by
markusheimerl

Add korean kenLMs
1
#6 opened almost 2 years ago
by
hac541309
The model keeps generating up to the maximum length but no EOS token.
1
#13 opened almost 2 years ago
by
xianf
How many memory for GPU are needed?
#12 opened almost 2 years ago
by
xianf
Why the vocab_size of tokenizer is different from model?
1
#2 opened about 2 years ago
by
xianf
Is the missing data repaired?
5
#1 opened about 2 years ago
by
xianf