World's Largest Dataset
#67 opened 5 days ago
by
UJJAWAL-TYAGI

Is it possible to reduce the number of llama4 expert models to use less memory?
#65 opened 7 days ago
by
gukui

Does LLama4 have chunked attention in generation phase ?
#64 opened 9 days ago
by
vanshils
The "force_words_ids" does not seem to be available on llama4
#63 opened 11 days ago
by
nlp-g
Access Rejected
3
#62 opened 13 days ago
by
ansenang

Less Knowledge Than Llama 3.3 70b?
2
5
#60 opened 13 days ago
by
phil111
No attribute `sliding_window`?
#59 opened 14 days ago
by
farzadab

Any luck doing inference in 8xA100?
5
#57 opened 15 days ago
by
taytun
Fine-tuning with BitsAndBytes
#56 opened 15 days ago
by
arnavgrg

Update config.json -- important default parameters were left out from the config
1
#55 opened 15 days ago
by
mdabbah-nvidia

VLLM not loading meta-llama/Llama-4-Scout-17B-16E-Instruct
1
3
#53 opened 16 days ago
by
alokkrsahu
13 B and34 B Pleeease!!! Most people cannot even run this.
4
4
#52 opened 16 days ago
by
UniversalLove333
🍭Llama4 SFT Training Script
2
#47 opened 17 days ago
by
study-hjt

Max Output Tokens of Llama-4
#46 opened 17 days ago
by
MengboZhou
[Issue report] missing keys in the json files
9
3
#45 opened 17 days ago
by
ShervinGhasemlou

access denied
8
1
#44 opened 17 days ago
by
qulong
FP8 weights
4
#41 opened 18 days ago
by
getfit

Update README.md
#40 opened 18 days ago
by
mfarre

Update README.md
1
#39 opened 18 days ago
by
mfarre

torch compile compatibility issue
6
#38 opened 18 days ago
by
axiomlab
Sagemaker - How to test the image multimodal?
#37 opened 18 days ago
by
TheSuperAgent
Access request got denied
6
13
#35 opened 18 days ago
by
migtissera

Deploying production ready Llama-4 models on your AWS with vLLM
3
#34 opened 18 days ago
by
agam30

Unethical comparisons with Deepseek replacing chinese languages by thai/vietnamese only
9
5
#32 opened 19 days ago
by
krustik
Request: DOI
#31 opened 19 days ago
by
ylx2ai
Couldn't connect
#30 opened 19 days ago
by
Wouze

Ridicolous demands for model gate.
9
2
#29 opened 19 days ago
by
marksverdhei

Llama 4 - open-source fine-tuning script
5
#27 opened 19 days ago
by
hiyouga

Bug in AutoModel
1
3
#26 opened 19 days ago
by
random-checkin

pad error
7
8
#25 opened 19 days ago
by
bobber
AWQ version?
14
#24 opened 19 days ago
by
devops724
Object Detection?
5
#23 opened 19 days ago
by
buckeye17-bah
Thanks zuck
3
#22 opened 19 days ago
by
WyattTheSkid

converstational?
1
#21 opened 19 days ago
by
HassanStar
No one with a consumer grade GPU (< 32 vram) can run the lower L4 model... 😓
10
13
#20 opened 19 days ago
by
UniversalLove333
Request denied llama4 - lost access to whole meta-llama repos which I already had access
7
#19 opened 19 days ago
by
doaonduty

Thank you!, Is it possible to run this with vLLM or sglang ?
5
#18 opened 19 days ago
by
getfit

Thanks! Request to access the model rejected due to typo
#17 opened 19 days ago
by
Xinxinli
[request for feedback] Faster downloads with Xet
12
18
#16 opened 19 days ago
by
clem

License
6
8
#14 opened 19 days ago
by
mrfakename
