Committer, Apache PDFBox and Apache Tika
- Berlin, Germany
- http://www.xenu.de/
Stars
The Apache PdfBox project ported to work on Android
The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).
THausherr / pdfbox-docs
Forked from apache/pdfbox-docs
Mirror of Apache PDFBox Docs
Tabula is a tool for liberating data tables trapped inside PDF files
veraPDF test corpus for ISO 19005 (PDF/A) and ISO 14289 (PDF/UA)
TwelveMonkeys ImageIO: Additional plug-ins and extensions for Java's ImageIO