Zhejiang University, Hangzhou
Stars
Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification
Code for "Animatable Implicit Neural Representations for Creating Realistic Avatars from Videos" TPAMI 2024, ICCV 2021
Implementation of NÜWA, a state-of-the-art attention network for text-to-video synthesis, in PyTorch
A curated list of awesome 3D generation papers
State-of-the-art deep learning library for time series and sequences in PyTorch / fastai
PyTorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)
[ICLR 2022] Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners
A curated list of multimodal-related research.
CogDL: A Comprehensive Library for Graph Deep Learning (WWW 2023)
v objective diffusion inference code for PyTorch.
Implementation of the papers on dual learning of natural language understanding and generation (ACL 2019, 2020; Findings of EMNLP 2020).
A curated list of different papers and datasets in various areas of audio-visual processing
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
MLNLP: This repository is a collection of papers from top AI conferences (e.g. ACL, EMNLP, NAACL, COLING, AAAI, IJCAI, ICLR, NeurIPS, and ICML) with open-source code
A non-autoregressive Transformer-based text-to-speech system, supporting a family of SOTA transformers with supervised and unsupervised duration modeling. This project grows with the research community, …
📚 A collection of papers about Referring Image Segmentation.
A unified 3D Transformer Pipeline for visual synthesis
⏰ Collaboratively track deadlines of conferences recommended by CCF (website, Python CLI, WeChat applet) / If you find it useful, please star this project, thanks~
A curated list of papers, code and resources pertaining to weak-shot classification, detection, and segmentation.
VQVAEs, GumbelSoftmaxes and friends
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute
This repository contains the source code for the paper "First Order Motion Model for Image Animation"