Speech-related labs, companies, resources, internships, and more; recommendations and self-nominations are welcome

542 68 Updated Nov 13, 2024

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 20,908 4,199 Updated Feb 8, 2025

Official PyTorch implementation of the TIP paper "Generating Visually Aligned Sound from Videos" and the corresponding Visually Aligned Sound (VAS) dataset.

Python 52 12 Updated Dec 15, 2020

A curated list of different papers and datasets in various areas of audio-visual processing

690 69 Updated Jan 30, 2024

Python 92 17 Updated May 23, 2017

IPA Pronunciation Dictionaries in DSL format

39 6 Updated Jan 13, 2017

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Python 778 195 Updated Apr 6, 2023

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 53,450 8,886 Updated Aug 14, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 13,054 2,664 Updated Feb 9, 2025

A collection of implementations of adversarial domain adaptation algorithms

Python 623 108 Updated Sep 21, 2021

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 5 4 Updated Mar 11, 2020

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 51 6 Updated Nov 1, 2019

WeUI icon

JavaScript 62 9 Updated Aug 7, 2024

[ECCV 2020] XingGAN for Person Image Generation

Python 227 36 Updated Feb 17, 2023

[BMVC 2020 Oral] Bipartite Graph Reasoning GANs for Person Image Generation

Python 130 16 Updated Jan 18, 2023

Given a portrait face, move the gaze up (ACM MM 2020)

Python 155 15 Updated Mar 25, 2023

[CVPR2022 oral] A Simple and Effective Baseline for Text-to-Image Synthesis

Python 313 70 Updated Mar 4, 2023

Tensorflow implementation of the Gradient Reversal layer from https://arxiv.org/abs/1505.07818

Python 13 8 Updated Jun 19, 2018

Forked from NVIDIA/tacotron2 and merged with Rayhane-mamah/Tacotron-2

Python 81 38 Updated Nov 22, 2020

The Python implementation of the IJCAI-2019 paper "Towards Discriminative Representation Learning for Speech Emotion Recognition"

Python 23 5 Updated Aug 12, 2019

DeepMind's Tacotron-2 Tensorflow implementation

Python 2,296 910 Updated Jul 6, 2023