Stars
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".
CRAFT(Baek et al., 2019) model training code
Official implementation of Character Region Awareness for Text Detection (CRAFT)
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
Robust Speech Recognition via Large-Scale Weak Supervision
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Code for the paper "Language Models are Unsupervised Multitask Learners"
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Google AI 2018 BERT pytorch implementation
Simple Online Realtime Tracking with a Deep Association Metric
🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥
PyTorch Tutorial for Deep Learning Researchers
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …
DeepFashion2 Dataset https://arxiv.org/pdf/1901.07973.pdf
😎 Awesome lists about all kinds of interesting topics