Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra-lightweight OCR system, supports recognition of 80+ languages, provides data annotation and synthesis tools, supports training and…

Python 45,649 7,925 Updated Jan 9, 2025

EDSL code

Python 19 2 Updated Mar 19, 2022

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 13,243 1,058 Updated Dec 5, 2024

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python 6,190 556 Updated Jan 10, 2025

Open-source code for RFCNLP paper.

Promela 53 9 Updated Nov 9, 2022

Implementation of the paper "MAZE: Data-Free Model Stealing Attack Using Zeroth-Order Gradient Estimation".

Python 29 6 Updated Dec 12, 2021

Xidian University TeX Suite 西安电子科技大学LaTeX套装

TeX 787 77 Updated Jan 10, 2025

A Unified Toolkit for Deep Learning Based Document Image Analysis

Python 5,008 477 Updated Aug 15, 2024

Convert a PDF via OCR to a TXT file in UTF-8 encoding

Python 140 30 Updated Oct 3, 2023

[python3.6] Natural-scene text detection implemented with TensorFlow; variable-length scene-text OCR recognition implemented with CTPN + CRNN + CTC in Keras/PyTorch

Python 2,920 965 Updated Aug 13, 2019

Formula image OCR: takes an image as input and outputs the corresponding LaTeX expression

HTML 289 76 Updated Apr 11, 2020

Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipe…

Python 265 72 Updated Oct 9, 2022

Python code to read text from a PDF file (OCR).

Python 66 20 Updated May 26, 2020

Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.

Python 1,273 99 Updated Dec 1, 2020

Math formula recognition (Images to LaTeX strings)

Jupyter Notebook 295 65 Updated Oct 3, 2023

Call mathpix API to make Mathpix snipping tool.

Python 34 15 Updated Apr 30, 2021

Extract tables from scanned image PDFs using Optical Character Recognition.

Python 271 66 Updated Jun 9, 2020