Starred repositories
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
Industry leading face manipulation platform
A new one shot face swap approach for image and video domains
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, …
👮♂️The sensitive word tool for java.(敏感词/违禁词/违法词/脏词。基于 DFA 算法实现的高性能 java 敏感词过滤工具框架。内置支持单词标签分类分级。请勿发布涉及政治、广告、营销、翻墙、违反国家法律法规等内容。高性能敏感词检测过滤组件,附带繁体简体互换,支持全角半角互换,汉字转拼音,模糊搜索等功能。)
一款高性能敏感词(非法词/脏字)检测过滤组件,附带繁体简体互换,支持全角半角互换,汉字转拼音,模糊搜索等功能。
Image-to-Image Translation in PyTorch
InspireFace is a cross-platform face recognition SDK developed in C/C++, supporting multiple operating systems and various backend types for inference, such as CPU, GPU, and NPU.
This is the official project website of RealFace: 3D Shape and Semantic Prior Guided High Fidelity Face Swapping
A high resolution face dataset for face editing purpose
Summary of publicly available ressources such as code, datasets, and scientific papers for the FLAME 3D head model
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
A PyTorch3D walkthrough and a Medium article 👋 on how to render 3D .obj meshes from various viewpoints to create 2D images.
Concurrently chat with ChatGPT, Bing Chat, Bard, Alpaca, Vicuna, Claude, ChatGLM, MOSS, 讯飞星火, 文心一言 and more, discover the best answers
A research project for natural language generation, containing the official implementations by MSRA NLC team.
real time face swap and one-click video deepfake with only a single image
Audio generation using diffusion models, in PyTorch.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
[NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation
Official PyTorch Implementation of EDGE (CVPR 2023)
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Release for Improved Denoising Diffusion Probabilistic Models
The simplest, fastest repository for training/finetuning medium-sized GPTs.
A simple implementation of classifier-free guidance DDIM on MNIST