Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instances, with several downstream tasks, e.g., Text Removal and Text Inpainting.

Python 520 37 Updated Jan 30, 2024

facebookresearch / dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 8,942 786 Updated Aug 7, 2024

frank-xwang / debiased-pseudo-labeling

[CVPR 2022] PyTorch implementation of “Debiased Learning from Naturally Imbalanced Pseudo-Labels”

Python 96 5 Updated Oct 3, 2022

huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 31,771 4,718 Updated Oct 9, 2024

baaivision / Painter

Painter & SegGPT Series: Vision Foundation Models from BAAI

Python 2,507 167 Updated Oct 31, 2023

facebookresearch / segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 47,065 5,568 Updated Sep 18, 2024

cvlab-columbia / viper

Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"

Jupyter Notebook 1,653 118 Updated Jan 29, 2024

cvjena / semantic-embeddings

Hierarchy-based Image Embeddings for Semantic Image Retrieval

Python 263 50 Updated Apr 26, 2021

JaidedAI / EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

Python 24,016 3,129 Updated Sep 24, 2024

Elvis Shi ElvishElvis

Stars

rerun-io / rerun

315386775 / DeepLearing-Interview-Awesome-2024

nv-tlabs / lift-splat-shoot

RuixiangJiang / blog-source

Duboshi / OI

wzzheng / TPVFormer

zhulf0804 / 3D-PointCloud

peiyunh / ff

tarashakhurana / emergent-occ-forecasting

patrick-llgc / Learning-Deep-Learning

OpenDriveLab / End-to-end-Autonomous-Driving

allenai / visprog

TheAlgorithms / C-Plus-Plus

Light-City / CPlusPlusThings

openai / whisper

saurabhgarg1996 / ATC_code

mlfoundations / wise-ft

open-mmlab / playground

tranluan / Nonlinear_Face_3DMM

AutoHDR / HD-Net

yuezunli / celeb-deepfakeforensics

yeungchenwa / OCR-SAM