Highlights
- Pro
Lists (6)
Sort Name ascending (A-Z)
Stars
Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
Large Concept Models: Language modeling in a sentence representation space
A much modified lyric editor with fancy (maybe) UI based on WPF
一个输入法状态(中文/英文/大写锁定)实时提示工具。An input method status (Chinese/English/CapsLock) tip tool.
Kandinsky 2 — multilingual text2image latent diffusion model
OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.
Windows动态修改鼠标指针光标颜色图标样式, 根据键盘布局/语言/输入法的切换而变化. Change cursor/pointer style/color dynamically base on Keyboard Layout/Input Locale/IME/Language on Windows.
PyTorch 1.11 reimplementation of multi task gradient adaptation ideas: Gradient Surgery (PCGrad) and Gradient Vaccine
A dictionary for Montreal-Forced-Aligner users to align mandarin data labeled in pinyin form using the mfa pretrained model v2.0.
An all-purpose window upscaler for Windows 10/11.
Command line utility for forced alignment using Kaldi
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Sprites video data used in the ICML 2018 paper
Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Python audio and music signal processing library
[CVPR2019]Learning Not to Learn : An adversarial method to train deep neural networks with biased data
SVHN dataset preprocessing and annotation file reading and converting python library
Stable Diffusion web UI
sketch + style = paints 🎨 (TOG2018/SIGGRAPH2018ASIA)
Research code for pixel-based encoders of language (PIXEL)
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…