Timon

KeyboardMasher

AI & ML interests

None yet

Recent Activity

new activity 1 day ago

bartowski/inclusionAI_Ling-lite-0415-GGUF:Update README.md

new activity 3 days ago

bartowski/google_gemma-3-27b-it-qat-GGUF:Other Imatrix quants (IQ3_XS) ?

reacted to bartowski's post with 👍 8 days ago

Access requests enabled for latest GLM models

While a fix is being implemented (https://github.com/ggml-org/llama.cpp/pull/12957), I want to leave the models up for visibility and continued discussion, but want to prevent accidental downloads of known broken models (even though there are settings that could fix it at runtime for now).

With this goal, I've enabled access requests. I don't really want your data, so I'm sorry that I don't think there's a way around that? But that's what I'm gonna do for now, and I'll remove the gate when a fix is up and verified and I have a chance to re-convert and quantize!

Hope you don't mind in the meantime :D

Organizations

None yet

KeyboardMasher's activity

New activity in bartowski/inclusionAI_Ling-lite-0415-GGUF 1 day ago

Update README.md

#1 opened 1 day ago by

KeyboardMasher

New activity in bartowski/google_gemma-3-27b-it-qat-GGUF 3 days ago

Other Imatrix quants (IQ3_XS) ?

#1 opened 4 days ago by

notmebug

New activity in bartowski/QVQ-72B-Preview-GGUF 3 months ago

llama.cpp inference too slow?

#6 opened 4 months ago by

ygsun

New activity in unsloth/DeepSeek-R1-GGUF 3 months ago

Over 2 tok/sec agg backed by NVMe SSD on 96GB RAM + 24GB VRAM AM5 rig with llama.cpp

#13 opened 3 months ago by

ubergarm

New activity in unsloth/DeepSeek-V3-GGUF 3 months ago

Issue with --n-gpu-layers 5 Parameter: Model Only Running on CPU

#10 opened 3 months ago by

vuk123

Advice on running llama-server with Q2_K_L quant

#6 opened 3 months ago by

vmajor

I loaded DeepSeek-V3-Q5_K_M up on my 10yrs old Tesla M40 (Dell C4130)

#8 opened 3 months ago by

gng2info

New activity in bartowski/Llama-3_1-Nemotron-51B-Instruct-GGUF 4 months ago

Model will need to be requantized, rope issues for long context

#2 opened 4 months ago by

treehugg3

New activity in allenai/OLMo-2-1124-7B-GGUF 5 months ago

Instruct version?

#1 opened 5 months ago by

KeyboardMasher

New activity in Nexusflow/Athene-70B 5 months ago

we need llama athene 3.1 70b

#5 opened 9 months ago by

gopi87

New activity in bartowski/Qwen2.5.1-Coder-7B-Instruct-GGUF 6 months ago

Change the 'Original model' link to tree/9092a8a, which contains the updated weights.

#2 opened 6 months ago by

AaronFeng753

New activity in bartowski/Reflection-Llama-3.1-70B-GGUF 6 months ago

Remove this model from Recent highlights collection

#9 opened 6 months ago by

KeyboardMasher

New activity in bartowski/granite-3.0-8b-instruct-GGUF 6 months ago

Continuous output

#1 opened 6 months ago by

kth8

New activity in pabloce/dolphin-2.8-gemma-7b-GGUF about 1 year ago

Q8_0 file is damaged.

#1 opened about 1 year ago by

KeyboardMasher