Code for "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose" (Arxiv 2020) and "Predicting Personalized Head Movement From Short Video and Speech Signal" (TMM 2022)

Python 750 147 Updated Dec 15, 2023

ASR-project / Multilingual-PR

Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three different self-supervised models, Wav2vec (2019, 2020), HuBERT (2021…

Python 216 18 Updated May 9, 2022

facebookresearch / CPC_audio

An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.

Python 355 63 Updated Oct 12, 2021

ziwenhahaha / Code-of-RL-Beginning

Jupyter Notebook 115 11 Updated Jan 5, 2025

MathFoundationRL / Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 4,361 575 Updated Dec 26, 2024

worldveil / dejavu

Audio fingerprinting and recognition in Python

Python 6,480 1,438 Updated Apr 22, 2024

JorenSix / Olaf

Olaf: Overly Lightweight Acoustic Fingerprinting is a portable acoustic fingerprinting system.

C 328 33 Updated May 16, 2024

ChrisNick92 / deep-audio-fingerprinting

A repository for my MSc thesis in Data Science & Machine Learning @ NTUA. A deep learning approach to audio fingerprinting for recognizing songs on real time through the microphone.

Jupyter Notebook 25 2 Updated Nov 12, 2024

MathewSachin / Captura

Capture Screen, Audio, Cursor, Mouse Clicks and Keystrokes

C# 9,984 1,877 Updated Apr 9, 2023

nicfit / eyeD3

eyeD3 is a Python module and command line program for processing ID3 tags. Information about mp3 files (i.e bit rate, sample frequency, play time, etc.) is also provided. The formats supported are …

Python 558 59 Updated Sep 4, 2024

dennisvdang / chorus-detection

A deep learning project for automated chorus detection in songs, featuring a command-line interface (CLI) tool that allows users to input a YouTube link and utilize a pre-trained CRNN model to dete…

Jupyter Notebook 16 4 Updated Oct 27, 2024

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 38,198 4,990 Updated Jan 17, 2025

hesamsheikh / ml-retreat

Machine Learning Journal for Intermediate to Advanced Topics.

Jupyter Notebook 1,456 136 Updated Dec 14, 2024

chinue / FFmpeg4Win

FFmpeg for windows with x264

C 4 1 Updated Jun 9, 2020

modelscope / 3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 1,523 126 Updated Jan 17, 2025

yangkang2021 / I_am_a_person

实时互动的GPT数字人

Python 361 76 Updated Dec 26, 2024

Freedium-cfd / web

THIS REPOSITORY IS JUST MIRROR! Main development repository is https://codeberg.org/Freedium-cfd/web

Python 846 68 Updated Jan 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TY Bian tybian

Block or report tybian

Starred repositories

ddlBoJack / Awesome-Speech-Language-Model

minzwon / musicfm

X-LANCE / SLAM-LLM

davisking / dlib

tybian / riff_wave

Qqi-HE / DeepChorus

SJTU-Lucy / EmoFace

lucidrains / linear-attention-transformer

RVC-Boss / GPT-SoVITS

TencentGameMate / chinese_speech_pretrain

postech-ami / 3d-talking-head-av-guidance

0nutation / SpeechGPT

ZJU-LLMs / Foundations-of-LLMs

yiranran / Audio-driven-TalkingFace-HeadPose