Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
gmongaras
/
Cosine_Attention_GPT_300M
like
0
Feature Extraction
PyTorch
gptj
arxiv:
2409.18747
License:
mit
Model card
Files
Files and versions
xet
Community
2
main
Cosine_Attention_GPT_300M
Ctrl+K
Ctrl+K
2 contributors
History:
4 commits
gmongaras
Update README.md
c63c9e2
verified
11 months ago
.gitattributes
Safe
1.52 kB
initial commit
11 months ago
README.md
Safe
214 Bytes
Update README.md
11 months ago
added_tokens.json
Safe
4.35 kB
Upload 11 files
11 months ago
config.json
Safe
995 Bytes
Upload 11 files
11 months ago
config.pt
Safe
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
1.12 kB
xet
Upload 11 files
11 months ago
generation_config.json
Safe
119 Bytes
Upload 11 files
11 months ago
merges.txt
Safe
456 kB
Upload 11 files
11 months ago
pytorch_model.bin
1.25 GB
xet
Upload 11 files
11 months ago
scheduler.pt
Safe
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
1 kB
xet
Upload 11 files
11 months ago
special_tokens_map.json
Safe
131 Bytes
Upload 11 files
11 months ago
tokenizer.pt
pickle
Detected Pickle imports (6)
"_codecs.encode"
,
"regex._regex.compile"
,
"tokenizers.AddedToken"
,
"transformers.models.gpt2.tokenization_gpt2.GPT2Tokenizer"
,
"transformers.tokenization_utils.Trie"
,
"__builtin__.set"
How to fix it?
3.15 MB
xet
Upload 11 files
11 months ago
tokenizer_config.json
Safe
26.6 kB
Upload 11 files
11 months ago
vocab.json
Safe
999 kB
Upload 11 files
11 months ago