Jiaxin-Ye

Follow

🏪

Working in the laboratory

Jiaxin Ye Jiaxin-Ye

🏪

Working in the laboratory

Follow

A second-year Ph.D. student at Fudan University.

29 followers · 20 following

Shanghai, China
https://jiaxin-ye.github.io/

Achievements

Achievements

Lists (8)

Sort

Affective Computing 🤓

11 repositories

AIGC 🫨

18 repositories

Diffusion-based Method 🫡

14 repositories

FaceTTS 😊🎙️

Mamba🐍

Speech Generation 🎤

Talking Head Generation 🤖️

40 repositories

Toolkit 👍

30 repositories

Beta Lists are currently in beta. Share feedback and report bugs.

Stars

CyberAgentAILab / layout-dm

LayoutDM: Discrete Diffusion Model for Controllable Layout Generation [Inoue+, CVPR2023]

Python 215 23 Updated Oct 24, 2023

OpenSource-O1 / Open-O1

378 7 Updated Oct 8, 2024

All-Hands-AI / OpenHands

🙌 OpenHands: Code Less, Make More

Python 32,784 3,754 Updated Oct 8, 2024

TinyLLaVA / TinyLLaVA_Factory

A Framework of Small-scale Large Multimodal Models

Python 599 53 Updated Sep 10, 2024

modelscope / FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

Python 353 30 Updated Jan 25, 2024

baaivision / Emu3

Next-Token Prediction is All You Need

Python 866 25 Updated Oct 8, 2024

google-research / byt5

Python 484 30 Updated Feb 13, 2024

resemble-ai / resemble-enhance

AI powered speech denoising and enhancement

Python 1,330 135 Updated Jun 21, 2024

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 6,204 659 Updated Sep 30, 2024

Rikorose / DeepFilterNet

Noise supression using deep filtering

Python 2,404 223 Updated Jul 31, 2024

nubcico / EAV

EEG-Audio-Video Dataset for Emotion Recognition in Conversations

Python 8 2 Updated Oct 7, 2024

knowledgetechnologyuhh / MELD-FAIR

Python 6 Updated Oct 10, 2023

PolyAI-LDN / pheme

Python 248 23 Updated Mar 15, 2024

ZhangXInFD / SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Python 437 39 Updated Jun 9, 2024

enhuiz / vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,944 417 Updated May 10, 2023

naver-ai / facetts

Python 48 6 Updated May 17, 2023

xuanyuzhang21 / EditGuard

[CVPR 2024🔥] EditGuard: Versatile Image Watermarking for Tamper Localization and Copyright Protection

Python 144 9 Updated Oct 7, 2024

DNLINYJ / Bilibili-Downloader-Python

B站视频/弹幕下载器

Python 11 3 Updated Jun 23, 2024

Nemo2011 / bilibili-api

哔哩哔哩常用API调用。支持视频、番剧、用户、频道、音频等功能。原仓库地址：https://github.com/MoyuScript/bilibili-api

Python 2,138 203 Updated Oct 7, 2024

alexmercerind / youtube-search-python

🔎 Search for YouTube videos, channels & playlists. Get 🎞 video & 📑 playlist info using link. Get search suggestions. WITHOUT YouTube Data API v3.

Python 734 161 Updated Jun 30, 2022

corralm / ted-scraper

🎙️ TED Talks web scraper

Python 25 7 Updated May 21, 2024

wanglin2 / mind-map

一个还算强大的Web思维导图。A relatively powerful web mind map.

JavaScript 6,189 870 Updated Sep 30, 2024

xmindflow / Awesome_Mamba

Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis

186 12 Updated Sep 25, 2024

cosmaadrian / multimodal-depression-from-video

Official source code for the paper: "Reading Between the Frames Multi-Modal Non-Verbal Depression Detection in Videos"

Python 40 6 Updated May 16, 2024

ondyari / FaceForensics

Github of the FaceForensics dataset

Python 2,355 533 Updated Dec 8, 2022

shivammehta25 / Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook 653 81 Updated Oct 8, 2024

innnky / emotional-vits

无需情感标注的情感可控语音合成模型，基于VITS

Jupyter Notebook 1,316 167 Updated Mar 30, 2023

HLTSingapore / Emotional-Speech-Data

This is the GitHub page for publicly available emotional speech data.

316 22 Updated Jan 6, 2022

NX-AI / xlstm

Official repository of the xLSTM.

Python 1,279 92 Updated Sep 7, 2024

speechandlanguageprocessing / ICASSP2022-Depression

Automatic Depression Detection: a GRU/ BiLSTM-based Model and An Emotional Audio-Textual Corpus

Python 128 31 Updated Jul 10, 2023