Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

JINIAC
/
JINIAC-5B-culturex-code0-9-lr-5e-5-ja_hq-5e-5-sft_configuration-3_prod-checkpoint-500

Text Generation
Transformers
Safetensors
deepseek
custom_code
Model card Files Files and versions Community
JINIAC-5B-culturex-code0-9-lr-5e-5-ja_hq-5e-5-sft_configuration-3_prod-checkpoint-500
Ctrl+K
Ctrl+K
  • 1 contributor
History: 15 commits
OsakanaTeishoku's picture
OsakanaTeishoku
Upload generation_config.json
84c8f44 verified 12 months ago
  • .gitattributes
    1.52 kB
    initial commit 12 months ago
  • added_tokens.json
    22 Bytes
    Upload added_tokens.json 12 months ago
  • config.json
    1.26 kB
    Upload config.json 12 months ago
  • configuration_deepseek.py
    10.2 kB
    Upload configuration_deepseek.py 12 months ago
  • generation_config.json
    111 Bytes
    Upload generation_config.json 12 months ago
  • model-00001-of-00003.safetensors
    5 GB
    LFS
    Upload model-00001-of-00003.safetensors 12 months ago
  • model-00002-of-00003.safetensors
    4.82 GB
    LFS
    Upload model-00002-of-00003.safetensors 12 months ago
  • model-00003-of-00003.safetensors
    244 MB
    LFS
    Upload model-00003-of-00003.safetensors 12 months ago
  • model.safetensors.index.json
    145 kB
    Upload model.safetensors.index.json 12 months ago
  • modeling_deepseek.py
    72.7 kB
    Upload modeling_deepseek.py 12 months ago
  • special_tokens_map.json
    968 Bytes
    Upload special_tokens_map.json 12 months ago
  • spiece.model
    1.07 MB
    LFS
    Upload spiece.model 12 months ago
  • tokenizer_config.json
    1.8 kB
    Upload tokenizer_config.json 12 months ago
  • trainer_state.json
    9.11 kB
    Upload trainer_state.json 12 months ago
  • zero_to_fp32.py
    24.2 kB
    Upload zero_to_fp32.py 12 months ago