TableRecognition

Recognize tables in images and restore them as Word documents.

How to run

1. python server.py
   Load the U-Net model that extracts table lines from the input image.
2. python test.py
   Feed in the input image.
3. python image2word.py
   Restore the table using OpenCV and python-docx.
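
The restore step can be pictured as turning detected line positions into a grid of cell boxes. The sketch below is a minimal illustration under assumptions, not the code in image2word.py: the names merge_close, cells_from_lines, and tol are hypothetical, and it assumes the line mask has already been reduced to a list of y coordinates (horizontal lines) and a list of x coordinates (vertical lines).

```python
def merge_close(positions, tol=3):
    """Collapse coordinates that lie within `tol` pixels of their neighbor.

    A thick ruling line usually shows up as several adjacent pixel rows or
    columns, so each run is replaced by its (floored) average position.
    """
    merged = []
    group = []
    for p in sorted(positions):
        if group and p - group[-1] > tol:
            merged.append(sum(group) // len(group))
            group = []
        group.append(p)
    if group:
        merged.append(sum(group) // len(group))
    return merged


def cells_from_lines(h_lines, v_lines, tol=3):
    """Build a row-major grid of (left, top, right, bottom) cell boxes.

    Assumes a regular table: every horizontal line spans all columns and
    every vertical line spans all rows (no merged cells).
    """
    ys = merge_close(h_lines, tol)
    xs = merge_close(v_lines, tol)
    grid = []
    for top, bottom in zip(ys, ys[1:]):
        row = [(left, top, right, bottom) for left, right in zip(xs, xs[1:])]
        grid.append(row)
    return grid


# Hypothetical line positions: two rows and two columns of cells.
grid = cells_from_lines([0, 1, 40, 80], [0, 100, 101, 200])
print(len(grid), "rows x", len(grid[0]), "columns")
```

Each box could then be cropped for text recognition and written into a table created with python-docx's Document().add_table(rows, cols). Merged cells would need extra handling that this sketch does not attempt.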

Tips

Steps 1 and 2 are not necessary if you have clean PDF images. This project cannot yet handle complex samples such as warped or colorful receipts; I am still working on it.
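
For clean scans, where the model steps can be skipped, ruling lines can often be found with a simple projection profile. The following is a sketch under assumptions and not this repository's method: it assumes the image has already been binarized into a list of rows with 1 for ink and 0 for background, and find_line_rows, find_line_cols, and min_fill are hypothetical names.

```python
def find_line_rows(binary, min_fill=0.8):
    """Return indices of rows whose share of ink pixels is at least min_fill.

    A horizontal ruling line covers most of the table width, so its row is
    almost entirely ink, while a row of text is mostly background.
    """
    rows = []
    for y, row in enumerate(binary):
        if row and sum(row) / len(row) >= min_fill:
            rows.append(y)
    return rows


def find_line_cols(binary, min_fill=0.8):
    """Same test for vertical lines, done by transposing the image."""
    transposed = [list(col) for col in zip(*binary)]
    return find_line_rows(transposed, min_fill)


# Tiny hypothetical image: a single cell outlined by a one-pixel border.
image = [
    [1, 1, 1, 1, 1],
    [1, 0, 0, 0, 1],
    [1, 0, 0, 0, 1],
    [1, 0, 0, 0, 1],
    [1, 1, 1, 1, 1],
]
print(find_line_rows(image))
print(find_line_cols(image))
```

This only works when lines are straight and axis-aligned, which is why warped or noisy samples still call for the model-based steps.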

To do

I am working on table recognition for samples like the one in this image and am struggling with the dataset. Optimistically, there could be a major change within weeks. If you are researching page layout or table recognition, please contact me: [email protected]

