Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
BEE-spoke-data 's Collections
Survivor Library Books - OCR
smol llama
finetuned smol 220M
Pretrained Encoders
Bee Models 🍯
book genre classifiers
tokenizers
FineWeb Concept Datasets

Survivor Library Books - OCR

updated Jul 14

Books from the Survivor Library (mostly ~1920s & earlier) OCR'd with recent VLMs

Upvote
5

  • BEE-spoke-data/SurvivorLib-Nanonets-OCR-s

    Viewer • Updated Jul 14 • 11.7k • 142 • 3

    Note .md format, higher variance


  • BEE-spoke-data/SurvivorLib-rolmOCR

    Viewer • Updated Jul 8 • 13.3k • 61 • 2

    Note "plaintext" format, lower (but not zero) variance

Upvote
5
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs