This repository uses two Java libraries (PDFBox and iText) to extract data from particular bank statements. The extracted data is written to an XML file with a third-party library, which is one of the easiest ways to produce it, and a Python library is also used for data extraction from PDF. The main purpose of this repo is to find out which approach is the most accurate and readable, needs the least boilerplate code, and is the most convenient for extracting data from PDFs and producing XML files.
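As a rough illustration of the "extracted text to XML" step, here is a minimal Python sketch using only the standard library. It is not the code from this repo: the line layout (`DD/MM/YYYY description amount`), the function names `lines_to_xml` and `to_xml_string`, and the XML element names are all assumptions made for the example. It assumes the text lines have already been pulled out of the PDF by whichever library is being compared.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical line layout: "DD/MM/YYYY <description> <amount>".
# Real bank statements differ per bank; adjust the pattern accordingly.
LINE_PATTERN = re.compile(r"^(\d{2}/\d{2}/\d{4})\s+(.+?)\s+(-?\d[\d,]*\.\d{2})$")


def lines_to_xml(lines):
    """Build a <statement> element from already-extracted text lines."""
    root = ET.Element("statement")
    for line in lines:
        match = LINE_PATTERN.match(line.strip())
        if match is None:
            # Skip headers, footers, and anything that is not a transaction row.
            continue
        date, description, amount = match.groups()
        tx = ET.SubElement(root, "transaction")
        ET.SubElement(tx, "date").text = date
        ET.SubElement(tx, "description").text = description
        # Drop thousands separators so the amount is easy to parse later.
        ET.SubElement(tx, "amount").text = amount.replace(",", "")
    return root


def to_xml_string(root):
    """Serialize the element tree to a plain string."""
    return ET.tostring(root, encoding="unicode")
```

Keeping the XML step separate from the PDF step means the same function can be fed text from any of the extraction libraries, so the comparison is about extraction quality rather than XML handling.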
regain001/Data-Extraction-From-PDF