Skip to content
View maaxlong's full-sized avatar

Block or report maaxlong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service

Python 169 23 Updated Oct 15, 2024

A repo for scraping jobs from jobs.ac.uk

Python 2 Updated Apr 29, 2022

A configurable jobs.ac.uk job searcher, web scraping then filtering the results with chatgpt

Python 1 Updated May 12, 2024

Repository on a series of Experimentations with Optical Character Recognition

Jupyter Notebook 1 Updated Dec 20, 2024

Get your documents ready for gen AI

Python 16,836 870 Updated Dec 19, 2024

OCR by Vision framework

Swift 4 Updated Mar 27, 2024

Command line tool for extracting text from images using Apple's Vision framework.

Swift 18 4 Updated Dec 6, 2023

A python wrapper to extract text from images on a mac system. Uses the vision framework from Apple.

Jupyter Notebook 291 24 Updated Nov 7, 2024

A powerful command-line OCR tool built with Apple's Vision framework, supporting single image and batch processing with detailed positional information output.

Swift 34 5 Updated Nov 29, 2024

A package for generating synthetic data and fine-tuning a gliner model.

Jupyter Notebook 8 4 Updated Jun 5, 2024

Tutorial for creating a reconciliation service for wikibase.cloud for OpenRefine

Python 2 2 Updated Sep 17, 2024

Collection of georeferenced boundaries of world countries and cultural regions for use in mapping historical data on global or continental scale

JavaScript 478 86 Updated Sep 12, 2024

The Implement of "A Lightweight Weak Semantic Framework for Cinematographic Shot Classification"

Python 5 Updated Dec 18, 2023

Official code for our paper "An Autoregressive Text-to-Graph Framework for Joint Entity and Relation Extraction" which will be published at AAAI 2024.

Python 47 5 Updated Jan 3, 2024

Detecting cinema shot types using a ResNet-50

Jupyter Notebook 191 39 Updated Dec 15, 2022

Science Museum Group Collection Online

JavaScript 47 3 Updated Dec 17, 2024

Gato Prompt Language (GPL): A system for generating focused instructions and short-form outputs.

20 Updated Oct 1, 2024

awesome synthetic (text) datasets

Jupyter Notebook 250 11 Updated Oct 29, 2024

PDF to Markdown with vision models

Python 7,454 441 Updated Dec 18, 2024

Transform Unstructured Data into Synthetic Datasets

Python 22 3 Updated Sep 3, 2024

A command-line application to convert images, PDFs, and audio files to text using Apple's APIs

Swift 715 25 Updated Apr 14, 2023

Software that makes labeling PDFs easy.

Python 399 74 Updated May 13, 2024

Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon

Jupyter Notebook 249 16 Updated Sep 7, 2024

This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open sourced AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SL…

Jupyter Notebook 2,610 289 Updated Dec 12, 2024

Links to awesome OCR projects

2,847 351 Updated Jul 6, 2024

Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).

Python 13,780 3,429 Updated Dec 22, 2024

Open Video Annotation Project

JavaScript 111 36 Updated Sep 7, 2017
JavaScript 24 5 Updated Jan 26, 2023

Python module to query the Smithsonian Institution Open Access API

Python 2 Updated Feb 1, 2024

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 4,085 459 Updated Dec 20, 2024
Next