Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

Python 24,333 4,412 Updated Aug 18, 2024

junyanz / pytorch-CycleGAN-and-pix2pix

Image-to-Image Translation in PyTorch

Python 23,320 6,342 Updated May 14, 2024

jindongwang / transferlearning

Transfer learning / domain adaptation / domain generalization / multi-task learning, etc. Papers, code, datasets, applications, tutorials.

Python 13,576 3,820 Updated Dec 19, 2024

openai / tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 12,781 878 Updated Oct 3, 2024

AIGC-Audio / AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,070 868 Updated Jul 6, 2024

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python 9,102 1,413 Updated Dec 20, 2024

jadore801120 / attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 8,950 1,985 Updated Apr 16, 2024

espnet / espnet

End-to-End Speech Processing Toolkit

Python 8,619 2,198 Updated Dec 23, 2024

codertimo / BERT-pytorch

PyTorch implementation of Google AI's 2018 BERT

Python 6,258 1,315 Updated Sep 15, 2023

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,603 445 Updated Nov 25, 2024

cs230-stanford / cs230-code-examples

Code examples in PyTorch and TensorFlow for CS230

Python 3,966 999 Updated Mar 24, 2023

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,547 310 Updated Jan 4, 2024

s3prl / s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,297 486 Updated Dec 22, 2024

median-research-group / LibMTL

A PyTorch Library for Multi-Task Learning

Python 2,114 196 Updated Oct 18, 2024

haitongli / knowledge-distillation-pytorch

A flexible PyTorch implementation for exploring deep and shallow knowledge distillation (KD) experiments

Python 1,883 346 Updated Mar 25, 2023

tobegit3hub / tensorflow_template_application

TensorFlow template application for deep learning

Python 1,872 714 Updated Jul 5, 2023

nicodjimenez / lstm

Minimal, clean example of LSTM neural network training in Python, for learning purposes.

Python 1,777 653 Updated Jul 9, 2021

tczhangzhi / pytorch-distributed

A quickstart and benchmark for PyTorch distributed training.

Python 1,647 298 Updated Jul 25, 2024

google / uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper "Fully Supervised Speaker Diarization".

Python 1,565 320 Updated Sep 25, 2024

descriptinc / descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1 kHz, 24 kHz, and 16 kHz mono/stereo audio.

Python 1,238 117 Updated Jul 11, 2024

sooftware / conformer

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Python 973 178 Updated Dec 22, 2023

SimonVandenhende / Multi-Task-Learning-PyTorch

PyTorch implementation of multi-task learning architectures, incl. MTI-Net (ECCV2020).

Python 784 112 Updated Jan 13, 2022

bastibe / python-soundfile

SoundFile is an audio library based on libsndfile, CFFI, and NumPy

Python 725 112 Updated Dec 4, 2024

Snowdar / asv-subtools

Open-source tools for speaker recognition

Python 605 131 Updated Aug 5, 2024

yangdongchao / AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Python 599 80 Updated Dec 27, 2023

Lyken17 / pytorch-memonger

Sublinear memory optimization for deep learning. https://arxiv.org/abs/1604.06174

Python 591 54 Updated Dec 27, 2019


Hexin Liu Lhx94As
