-
CUHK&PKU&SHU
- http://dongchaoyang.top
-
RSTnet Public
Real-time Speech-Text Foundation Model Toolkit (wip)
-
Codec-SUPERB Public
Forked from voidful/Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
-
ChatGPT Public
Forked from PawanOsman/ChatGPT
OpenAI API Free Reverse Proxy
TypeScript GNU Affero General Public License v3.0 Updated Apr 6, 2024
-
vit-vqgan-jax Public
Forked from jiasenlu/vit-vqgan-jax
Jax implementation of VIT-VQGAN
Python Updated Jan 25, 2024
-
AcademiCodec Public
AcademiCodec: An Open Source Audio Codec Model for Academic Research
-
seamless_communication Public
Forked from facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Jupyter Notebook Other Updated Dec 26, 2023
-
EdVAE Public
Forked from ituvisionlab/EdVAE
Official PyTorch implementation of "EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders"
Python Updated Dec 8, 2023
-
Text-to-sound-Synthesis Public
The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"
-
TranSpeech Public
Forked from Rongjiehuang/TranSpeech
PyTorch Implementation of TranSpeech (ICLR'23): Textless NAR Speech-to-Speech Translation with Bilateral Perturbation
Python MIT License Updated Mar 29, 2023
-
audiolm-pytorch Public
Forked from lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Python MIT License Updated Feb 13, 2023
-
imagen-pytorch Public
Forked from lucidrains/imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
Python MIT License Updated Nov 19, 2022
-
DALLE2-pytorch Public
Forked from lucidrains/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
Python MIT License Updated Nov 11, 2022
-
text-to-sound-synthesis-demo Public
Forked from ZhaZhaFon/demo-confusion
This is a demo webpage for our paper 'text-to-sound synthesis'
-
A Two-student Learning Framework for Mixed Supervised Target Sound Detection
-
beautiful-jekyll Public
Forked from daattali/beautiful-jekyll
✨ Build a beautiful and simple website in literally minutes. Demo at https://beautifuljekyll.com
HTML MIT License Updated May 19, 2022