Stars
The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multi…
PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition
CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows, CVPR 2022
Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.
Two simple and effective vision transformer designs that are on par with the Swin Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
This is a PyTorch implementation of "Context AutoEncoder for Self-Supervised Representation Learning"
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra-lightweight OCR system, supports recognition of 80+ languages, provides data annotation and synthesis tools, supports training and…
Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)
Open Source Neural Machine Translation and (Large) Language Models in PyTorch
This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"
ncnn is a high-performance neural network inference framework optimized for the mobile platform
NanoDet-Plus ⚡ Super fast and lightweight anchor-free object detection model. 🔥 Only 980 KB (int8) / 1.8 MB (fp16), and runs at 97 FPS on a cellphone 🔥
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
DocBank: A Benchmark Dataset for Document Layout Analysis
Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.
KDD Cup 2020 Challenges for Modern E-Commerce Platform: Multimodalities Recall
The 2nd place solution for KDD Cup AutoGraph2020
Chinese text classification with TextCNN, TextRNN, FastText, TextRCNN, BiLSTM_Attention, DPCNN, and Transformer, based on PyTorch and ready to use out of the box.
A Toolbox for Adversarial Robustness Research
PyTorch implementation of "Efficient Neural Architecture Search via Parameters Sharing"
[ICLR 2020]: 'AtomNAS: Fine-Grained End-to-End Neural Architecture Search'
FairNAS: Rethinking Evaluation Fairness of Weight Sharing Neural Architecture Search
Bridging the Gap Between Stability and Scalability in Neural Architecture Search
Video Representation Learning by Dense Predictive Coding. Tengda Han, Weidi Xie, Andrew Zisserman.