You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is JAX-RS Tika server for Tika
(https://issues.apache.org/jira/browse/TIKA-593)
Running
-------
java -jar target/tikaserver-1.0-SNAPSHOT.jar
Usage
-----
Usage examples from command line with curl utility:
1) Extract plain text:
curl -T price.xls http://localhost:9998/tika
2) Extract text with mime-type hint:
curl -v -H "Content-type: application/vnd.openxmlformats-officedocument.wordprocessingml.document" -T document.docx http://localhost:9998/tika
3) Get all document attachments as ZIP-file:
curl -v -T Doc1_ole.doc http://localhost:9998/unpacker > /var/tmp/x.zip
4) Extract metadata to CSV format:
curl -T price.xls http://localhost:9998/meta
HTTP Codes
----------
200 - Ok
204 - No content (for example when we are unpacking file without attachments)
415 - Unknown file type
422 - Unparsable document of known type (password protected documents and unsupported versions like Biff5 Excel)
500 - Internal error