Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

hotchpotch
/
fineweb-2-japanese-text-cleaner

Safetensors
Japanese
xlm-roberta
Model card Files Files and versions
xet
Community
fineweb-2-japanese-text-cleaner / scripts
Ctrl+K
Ctrl+K
  • 1 contributor
History: 1 commit
hotchpotch's picture
hotchpotch
Upload 2 files
92c0372 verified 6 months ago
  • noise_detecter.py
    7.03 kB
    Upload 2 files 6 months ago
  • trainer-fineweb-2-japanese-text-cleaner.py
    10.2 kB
    Upload 2 files 6 months ago