Skip to content

laura-xy-lee/table-to-markdown

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PDF Table to Markdown Converter

This is a wrapper around the Python Camelot library which converts tables in PDF to markdown.

Please install Ghostscript, a Camelot dependency.


Convert one PDF file to markdown

> python table_to_markdown.py '<FILEPATH>/<FILENAME>.pdf'

Markdown tables will be saved to <FILEPATH>/<FILENAME>_table_<TABLENUMBER>.md


Convert all PDF files in directory to markdown

> python table_to_markdown.py '<FILEPATH>/<DIRECTORY_NAME>'

Markdown tables will be saved to <FILEPATH>/<DIRECTORY_NAME>/<FILENAME>_table_<TABLENUMBER>.md


Convert all PDF files in directory to markdown and save to specific location:

> python table_to_markdown.py '<FILEPATH>/<DIRECTORY_NAME>' '<SECOND_FILEPATH>/<DIRECTORY_NAME>'

Markdown tables will be saved to <SECOND_FILEPATH>/<DIRECTORY_NAME>/<FILENAME>_table_<TABLENUMBER>.md

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages