Xia-gx

Xia-gx

Stars

PaddlePaddle / PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 45,649 7,925 Updated Jan 9, 2025

abcAnonymous / EDSL

EDSL code

Python 19 2 Updated Mar 19, 2022

lukas-blecher / LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 13,243 1,058 Updated Dec 5, 2024

pymupdf / PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python 6,190 556 Updated Jan 10, 2025

RFCNLP / RFCNLP

Open-source code for RFCNLP paper.

Promela 53 9 Updated Nov 9, 2022

sanjaykariyappa / MAZE

Implementation of the paper "MAZE: Data-Free Model Stealing Attack Using Zeroth-Order Gradient Estimation".

Python 29 6 Updated Dec 12, 2021

note286 / xduts

Xidian University TeX Suite 西安电子科技大学LaTeX套装

TeX 787 77 Updated Jan 10, 2025

Layout-Parser / layout-parser

A Unified Toolkit for Deep Learning Based Document Image Analysis

Python 5,008 477 Updated Aug 15, 2024

writecrow / ocr2text

Convert a PDF via OCR to a TXT file in UTF-8 encoding

Python 140 30 Updated Oct 3, 2023

xiaofengShi / CHINESE-OCR

[python3.6] 运用tf实现自然场景文字检测,keras/pytorch实现ctpn+crnn+ctc实现不定长场景文字OCR识别

Python 2,920 965 Updated Aug 13, 2019

xiaofengShi / Image2Katex

公式图片ocr，输入图片输出对应的latex表达式

HTML 289 76 Updated Apr 11, 2020

opensemanticsearch / open-semantic-etl

Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipe…

Python 265 72 Updated Oct 9, 2022

lucab85 / PDFtoTXT

Python code to read text from a PDF file (OCR).

Python 66 20 Updated May 26, 2020

jlsutherland / doc2text

Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.

Python 1,273 99 Updated Dec 1, 2020

jungomi / math-formula-recognition

Math formula recognition (Images to LaTeX strings)

Jupyter Notebook 295 65 Updated Oct 3, 2023

Joshua-li-yi / img2latex

Call mathpix API to make Mathpix snipping tool.

Python 34 15 Updated Apr 30, 2021

cseas / ocr-table

Extract tables from scanned image PDFs using Optical Character Recognition.

Python 271 66 Updated Jun 9, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly