Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Dataset-Tools 's Collections
Dataset transformation, preparation and edition
Models for dataset curation
Dataset Exploration
Synthetic Dataset Creation
Dataset Creation

Models for dataset curation

updated Dec 5, 2024
Upvote
17

  • HuggingFaceFW/fineweb-edu-classifier

    Text Classification • 0.1B • Updated Nov 17, 2024 • 2.69k • • 188

    Note Classify texts based on their educational quality


  • minishlab/potion-base-8M

    0.0B • Updated Jan 21 • 103k • 68

    Note A blazing-fast embedding generator


  • nvidia/domain-classifier

    0.2B • Updated Jan 24 • 7.87k • 86

    Note A model to classify text according to different domains


  • nvidia/quality-classifier-deberta

    0.2B • Updated Jan 31 • 6.8k • 65

    Note Classify texts based on their general quality


  • urchade/gliner_multi_pii-v1

    Token Classification • Updated Apr 20, 2024 • 45.3k • 123

    Note Identify and classify personal identifiable information PII


  • giacomoarienti/nsfw-classifier

    Image Classification • 0.1B • Updated Mar 26 • 50.3k • • 40

  • Falconsai/nsfw_image_detection

    Image Classification • 0.1B • Updated Apr 6 • 112M • • 740

  • PleIAs/celadon

    Text Classification • 0.1B • Updated Jun 12 • 94 • 32
Upvote
17
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs