Skip to content

Simple script to extract tables in pdf and store in csv file

Notifications You must be signed in to change notification settings

khushpanchal/pdf_to_excel

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pdf_to_excel

PPP.csv contains the extracted table in excel

pdf_file --> Contains pdf file

pdf_to_excel.py --> Simple script to extract tables in pdf and store in csv file Used Tabula, an open source tool --> pip install tabula-py

output.csv --> Converted csv file using pdf_to_excel.py

Note:- Table can be cleaned afterwards using excel itself or by using pandas.

PPP.csv --> Manually cleaned version of output.csv using excel.

About

Simple script to extract tables in pdf and store in csv file

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages