Arabic-OCR

Arabic Image-to-text Converter

Link to Web App: https://bit.ly/Arabic-image-to-text

A user can upload a whole PDF of Arabic text images, and the app outputs two files containing the digitized Arabic text, which can then be copied, searched, analyzed, and so on. One file is a PDF of the output and the other is a Word file of the output.

This was done using the Tesseract Python library. Its accuracy is far from 100%, but an example image is shown below.
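
As a rough illustration of the OCR step described above, the sketch below runs Tesseract over each page of a scanned PDF. It is not taken from SL_ArabicOCR.py: the choice of the pytesseract wrapper, the pdf2image renderer (which needs Poppler), the "ara" language data, the 300 dpi setting, and the helper names ocr_pdf and join_pages are all assumptions made for the example.

```python
from typing import Iterable, List


def ocr_pdf(pdf_path: str, lang: str = "ara", dpi: int = 300) -> List[str]:
    """Return the recognized text of each page of a scanned PDF."""
    # Imported lazily so this module still loads where the OCR stack is missing.
    # Assumed dependencies: pytesseract (plus the Tesseract binary and the
    # Arabic language data) and pdf2image (plus Poppler).
    import pytesseract
    from pdf2image import convert_from_path

    pages = convert_from_path(pdf_path, dpi=dpi)  # one PIL image per page
    return [pytesseract.image_to_string(page, lang=lang) for page in pages]


def join_pages(page_texts: Iterable[str]) -> str:
    """Combine per-page text into one string with a labeled break per page."""
    parts = []
    for number, text in enumerate(page_texts, start=1):
        parts.append(f"--- Page {number} ---\n{text.strip()}")
    return "\n\n".join(parts)


# Hypothetical usage with one of the sample PDFs in this repository:
#   text = join_pages(ocr_pdf("balaghah_43_46.pdf"))
```

Keeping the OCR call separate from the text assembly means the second function can be checked without Tesseract installed, and the per-page list can be reused when writing the PDF and Word outputs.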

Usage

Install Python and Streamlit
Install all required libraries and packages (see requirements.txt and packages.txt)
Run "streamlit run SL_ArabicOCR.py"

Hosted by Streamlit Cloud

