Stars: 8 results for source starred repositories written in Python
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Convert PDF to markdown + JSON quickly with high accuracy
pdfrw is a pure Python library that reads and writes PDFs
💉 Stuff which works in Chrome and maybe Acrobat and Foxit.
A more complete example of programming with PDFMiner, which continues where the default documentation stops
Materials for a course on open data at UC Berkeley I School (2013)
Cheshire3 Search Engine and Information Framework