Jeff Wadsworth
jeffwadsworth
AI & ML interests
Machine Learning
Recent Activity
liked
a Space
7 days ago
enzostvs/deepsite
liked
a Space
7 days ago
victor/deepsite-gallery
liked
a model
17 days ago
meta-llama/Llama-4-Maverick-17B-128E-Instruct
Organizations
None yet
jeffwadsworth's activity
Temperature Setting of Different Tasks
2
#36 opened 28 days ago
by
sanbingyouyong

When will you fix the model replies missing </think>\n start tags
17
#19 opened about 2 months ago
by
xldistance
Doesn't Generate `<think>` tags
3
#25 opened about 2 months ago
by
bingw5
When using the web version of DeepSeek v3, it keeps repeating responses without stopping.
1
#12 opened 4 months ago
by
Nydaym
Works very well, but I had an issue with the following prompt timing out.
2
#3 opened 8 months ago
by
jeffwadsworth
9.9 vs 9.11 example
5
11
#19 opened 8 months ago
by
IlyaGusev

a few edits for your model card (sorry I'm a grammar/writing nerd)
1
2
#16 opened 12 months ago
by
luke-data-leader

Model
3
25
#5 opened about 1 year ago
by
mrfakename

How to combine split files?
3
#1 opened about 1 year ago
by
deleted
model missing
7
#1 opened over 1 year ago
by
barius

This is a very impressive model. Using the 8bit version.
2
#2 opened over 1 year ago
by
jeffwadsworth
Tokenizer issue?
5
27
#1 opened over 1 year ago
by
sleepyjoecheated
Unzip problem
1
#15 opened over 1 year ago
by
ShubhangiV
This model looks insanely good for coding ( 73.2 for humanEval )!
2
18
#1 opened over 1 year ago
by
mirek190
Are the weights updated?
3
#1 opened over 1 year ago
by
krao
Performance of quantified models
1
#3 opened over 1 year ago
by
danielus

Prompts
16
#2 opened almost 2 years ago
by
spirilis
🚩 Report - Legal Issue: Clarity Needed for Commercial Licensing
20
#6 opened about 2 years ago
by
alexjc

Can we run this model on CPU?
13
#3 opened about 2 years ago
by
gustavomr
