-
Nanyang Technological University
- Singapore
- https://scholar.google.com/citations?user=iAT_5-kAAAAJ&hl=en
Stars
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Robust Speech Recognition via Large-Scale Weak Supervision
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
PyTorch Tutorial for Deep Learning Researchers
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Image-to-Image Translation in PyTorch
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Google AI 2018 BERT pytorch implementation
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Code examples in pyTorch and Tensorflow for CS230
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Self-Supervised Speech Pre-training and Representation Learning Toolkit
A PyTorch Library for Multi-Task Learning
A PyTorch implementation for exploring deep and shallow knowledge distillation (KD) experiments with flexibility
TensorFlow template application for deep learning
Minimal, clean example of lstm neural network training in python, for learning purposes.
A quickstart and benchmark for pytorch distributed training.
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
PyTorch implementation of multi-task learning architectures, incl. MTI-Net (ECCV2020).
SoundFile is an audio library based on libsndfile, CFFI, and NumPy
An Open Source Tools for Speaker Recognition
AcademiCodec: An Open Source Audio Codec Model for Academic Research
Sublinear memory optimization for deep learning. https://arxiv.org/abs/1604.06174