Starred repositories
This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"
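Deep-learning-assisted manga translation tool with one-click machine translation and simple image/text editing | Yet another computer-aided comic/manga translation tool powered by deep learning
Translate manga/images: one-click translation of text inside all kinds of images https://cotrans.touhou.ai/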
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.
A synthetic data generator for text recognition
Optical character recognition for Japanese text, with the main focus being Japanese manga
Headless chrome/chromium automation library (unofficial port of puppeteer)
Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing. Implemented in Python.
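Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
OCR software, free and offline. Open-source, free, offline OCR software. Supports screenshot capture and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Includes built-in multi-language libraries.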
Contains some simple and commonly used WPF controls
WPF UI provides the Fluent experience in your known and loved WPF framework. Intuitive design, themes, navigation and new immersive controls. All natively and effortlessly.
State-of-the-art 2D and 3D Face Analysis Project
NSFW detection on the client-side via TensorFlow.js
Collection of NSFW image URLs for the purposes of training an NSFW Image Classifier
Collection of scripts to aggregate image data for the purposes of training an NSFW Image Classifier
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more