Caiyun-AI/DCFormer-2.8B
Tags: Text Generation, Transformers, PyTorch, English, dcformer, causal-lm, dcmha, custom_code
arXiv: 2405.08553
License: mit
Files and versions (branch: main)
3 contributors, History: 10 commits
Latest commit: Hilbertmeng, "fix k_mask" (51d254e), 11 months ago
File                        Size       Last commit             Updated
.gitattributes              1.52 kB    initial commit          12 months ago
README.md                   2.42 kB    add paper link          12 months ago
config.json                 751 Bytes  upload model and code   12 months ago
configuration_dcformer.py   2.51 kB    upload model and code   12 months ago
generation_demo.py          1.31 kB    update readme           12 months ago
modeling_dcformer.py        32.7 kB    fix k_mask              11 months ago
pytorch_model.bin           5.81 GB    upload model and code   12 months ago
tokenizer.json              2.11 MB    upload model and code   12 months ago
tokenizer_config.json       264 Bytes  upload model and code   12 months ago

All files are marked Safe. pytorch_model.bin is stored via LFS in pickle format. Detected Pickle imports (3): "collections.OrderedDict", "torch.HalfStorage", "torch._utils._rebuild_tensor_v2"
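
The pickle imports reported for pytorch_model.bin are the module-level names the pickle stream asks Python to import when it is loaded. As a rough illustration of how such a list can be produced without unpickling anything, the sketch below walks the opcodes of a pickle stream with the standard-library `pickletools` module. The helper name `list_pickle_imports` is hypothetical, the STACK_GLOBAL handling is a simple heuristic, and this is not the Hub's actual scanner. It runs on a small in-memory pickle rather than on the 5.81 GB checkpoint.

```python
import collections
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> list:
    """Return sorted 'module.name' strings referenced by a pickle stream.

    Hypothetical helper: it only reads opcodes and never unpickles the data.
    """
    found = set()
    recent_strings = []  # string arguments seen so far, used for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: the argument is "module name", separated by a space.
            module, name = arg.split(" ", 1)
            found.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: heuristic, assume the two most recently pushed
            # strings are the module and the attribute name.
            if len(recent_strings) >= 2:
                module, name = recent_strings[-2:]
                found.add(f"{module}.{name}")
        elif isinstance(arg, str):
            recent_strings.append(arg)
    return sorted(found)


# Small demonstration payload; a real checkpoint would be read from disk instead.
payload = pickle.dumps(collections.OrderedDict(a=1), protocol=2)
imports = list_pickle_imports(payload)
print(imports)
```

Reading opcodes instead of calling `pickle.load` matters here because loading an untrusted pickle can execute arbitrary code, whereas inspecting the stream does not.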