A powerful command-line OCR tool built with Apple's Vision framework, supporting single image and batch processing with detailed positional information output.

Swift 34 5 Updated Nov 29, 2024

wjbmattingly / gliner-finetune

A package for generating synthetic data and fine-tuning a gliner model.

Jupyter Notebook 8 4 Updated Jun 5, 2024

judaicadh / wikibaseopenrefine

Tutorial for creating a reconciliation service for wikibase.cloud for OpenRefine

Python 2 2 Updated Sep 17, 2024

aourednik / historical-basemaps

Collection of georeferenced boundaries of world countries and cultural regions for use in mapping historical data on global or continental scale

JavaScript 478 86 Updated Sep 12, 2024

litchiar / ShotClassification

The Implement of "A Lightweight Weak Semantic Framework for Cinematographic Shot Classification"

Python 5 Updated Dec 18, 2023

urchade / ATG

Official code for our paper "An Autoregressive Text-to-Graph Framework for Joint Entity and Relation Extraction" which will be published at AAAI 2024.

Python 47 5 Updated Jan 3, 2024

rsomani95 / shot-type-classifier

Detecting cinema shot types using a ResNet-50

Jupyter Notebook 191 39 Updated Dec 15, 2022

TheScienceMuseum / collectionsonline

Science Museum Group Collection Online

JavaScript 47 3 Updated Dec 17, 2024

ListfulAl / gpl

Gato Prompt Language (GPL): A system for generating focused instructions and short-form outputs.

20 Updated Oct 1, 2024

davanstrien / awesome-synthetic-datasets

awesome synthetic (text) datasets

Jupyter Notebook 250 11 Updated Oct 29, 2024

getomni-ai / zerox

PDF to Markdown with vision models

Python 7,454 441 Updated Dec 18, 2024

wizenheimer / cyyrus

Transform Unstructured Data into Synthetic Datasets

Python 22 3 Updated Sep 3, 2024

freedmand / textra

A command-line application to convert images, PDFs, and audio files to text using Apple's APIs

Swift 715 25 Updated Apr 14, 2023

allenai / pawls

Software that makes labeling PDFs easy.

Python 399 74 Updated May 13, 2024

JosefAlbers / Phi-3-Vision-MLX

Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon

Jupyter Notebook 249 16 Updated Sep 7, 2024

microsoft / Phi-3CookBook

This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open sourced AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SL…

Jupyter Notebook 2,610 289 Updated Dec 12, 2024

kba / awesome-ocr

Links to awesome OCR projects

2,847 351 Updated Jul 6, 2024

wkentaro / labelme

Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).

Python 13,780 3,429 Updated Dec 22, 2024

CtrHellenicStudies / OpenVideoAnnotation

Open Video Annotation Project

JavaScript 111 36 Updated Sep 7, 2017

adjaba / video-annotation-tool

JavaScript 24 5 Updated Jan 26, 2023

Smithsonian / smithsonian-openaccess

Python module to query the Smithsonian Institution Open Access API

Python 2 Updated Feb 1, 2024

mindee / doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 4,085 459 Updated Dec 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

maaxlong

Achievements

Achievements

Block or report maaxlong

Stars

palewire / savepagenow

peq10 / job_scraping

AndEsterson / gpt_job_monitor

congruence-engine / experimenting-with-optical-character-recognition

DS4SD / docling

OAK-WJR / Swift_Vision_OCR

sfomuseum / swift-text-emboss-cli

straussmaximilian / ocrmac

bytefer / macos-vision-ocr