No description provided.

Instead of returning just a .txt file path for download,

It returns the actual extracted text as a string (which is what gradio_client needs).

drewThomasson changed pull request status to merged

Hi! Thanks for merging the changes β€” really appreciate it.

However, after the merge, I’m getting a 500 Internal Server Error when uploading a PDF. Could you please check that on your end?

Your idea was great, and I actually created my own Space inspired by yours:
πŸ”— https://huggingface.co/spaces/habibahmad/Extract_any_type_Data_from_pdf

Would be great if you could fix your Space β€” I’d love to test it again properly.

Thanks in advance!

Hm tried modifying it to also export a searchable PDF after the OCR step tell me what your thoughts on it are

Sign up or log in to comment