Skip to content

Issues: jlsutherland/doc2text

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

What is wrong with this ? Can someone please explain ?
#11 opened Sep 1, 2016 by iamvc7 updated Sep 1, 2016
Error on doc.process()
#14 opened Sep 2, 2016 by rsteca updated Sep 5, 2016
Support for non scanned documents (.doc, .docx, regular pdf)
#15 opened Sep 5, 2016 by rcatajar updated Sep 9, 2016
Question: Support for Windows
#21 opened Nov 11, 2016 by modulexcite updated Nov 11, 2016
Unable to process
#22 opened Nov 12, 2016 by alonecoder1337 updated Nov 12, 2016
text extraction from png files does not seem to work
#23 opened Dec 24, 2016 by vsriram28 updated Dec 24, 2016
FileNotFoundError
#29 opened Nov 27, 2017 by jashuRc updated Nov 27, 2017
ModuleNotFoundError: No module named 'PyPDF2'
#28 opened Nov 6, 2017 by alexauvray updated Sep 14, 2018
Python 3.5 compatibility
#24 opened Feb 1, 2017 by andjelx updated Dec 19, 2018
Does is support stream data ?
#32 opened Apr 24, 2020 by multinucliated updated Apr 24, 2020
No module name PythonMagick
#34 opened Jul 21, 2021 by atul219 updated Jan 7, 2023
Image not cropped accurately
#35 opened Feb 15, 2023 by tekurkaa updated Feb 15, 2023
it'd be nice if this could produce text-overlaid PDFs
#10 opened Aug 31, 2016 by jbothma updated Dec 24, 2023
ProTip! Mix and match filters to narrow down what you’re looking for.