Stars
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
World's fastest ANPR / ALPR implementation for CPUs, GPUs, VPUs and NPUs using deep learning (Tensorflow, Tensorflow lite, TensorRT, OpenVX, OpenVINO). Multi-Charset (Latin, Korean, Chinese) & Mult…
DeepPrivacy: A Generative Adversarial Network for Face Anonymization
**ARCHIVED** An anonymizer to obfuscate faces and license plates.
This is a collection of our NAS and Vision Transformer work.
PyTorch implementation of MoCo v3 https//arxiv.org/abs/2104.02057
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
ncnn is a high-performance neural network inference framework optimized for the mobile platform
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
XavierCHEN34 / ClickSEG
Forked from alibaba/ClickSEGA code base for interactive segmentation
ICCV2021 (Oral) - Exploring Cross-Image Pixel Contrast for Semantic Segmentation
CVPR2022 (Oral) - Rethinking Semantic Segmentation: A Prototype View
CVPR2022 - Deep Hierarchical Semantic Segmentation - A structured, pixel-wise description of visual scenes in terms of the class hierarchy.
[CVPR 2022] Official CoTTA Code for our paper Continual Test-Time Domain Adaptation
A very simple framework for state-of-the-art Natural Language Processing (NLP)
A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.
scikit-learn cross validators for iterative stratification of multilabel data
Test Time Augmentation (TTA) wrapper for computer vision tasks: segmentation, classification, super-resolution, ... etc.
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Apache Superset is a Data Visualization and Data Exploration Platform
PyTorch implementation of multi-task learning architectures, incl. MTI-Net (ECCV2020).
Human Pose Estimation Related Publication
It is a simple python tool to extract key-frames from a video file using peak estimation from frame difference.
Lucid Data Dreaming for Multiple Object Tracking
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network (ECCV 2018)
mixup: Beyond Empirical Risk Minimization