This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For an HD commercial model, please try out Sync Labs.

Python 11,522 2,430 Updated Feb 10, 2025

YUANZHUO-BNU / metahuman_overview

A curated collection of digital human resources

741 84 Updated Jan 8, 2025

facebookresearch / generative-recommenders

Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 929 173 Updated Mar 13, 2025

WujiangXu / AMID

The code for the WWW 2024 paper "Rethinking Cross-Domain Sequential Recommendation under Open-World Assumptions".

Python 29 2 Updated Aug 12, 2024

Doragd / Algorithm-Practice-in-Industry

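A collection of industry practice articles on search, recommendation, advertising, user growth, and more (sources: Zhihu, Datafuntalk, technical WeChat official accounts)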

Python 3,109 371 Updated Mar 13, 2025

modelscope / FunClip

Open-source, accurate and easy-to-use video speech recognition & clipping tool, with LLM-based AI clipping integrated.

Python 4,272 479 Updated Mar 11, 2025

ChunhanLi / 2nd-kaggle-LEAP

Python 8 2 Updated Jul 28, 2024

state-spaces / mamba

Mamba SSM architecture

Python 14,219 1,238 Updated Jan 18, 2025

dunovank / jupyter-themes

Custom Jupyter Notebook Themes

CSS 9,816 1,053 Updated Oct 18, 2023

lukan217 / kaggle_otto_rec_sys

Kaggle Otto recommender system code; single model, LB 0.596, rank about 22

Python 21 1 Updated Feb 9, 2023

iawia002 / lux

👾 Fast and simple video download library and CLI tool written in Go

Go 28,835 3,085 Updated Mar 13, 2025

JoungheeKim / Non-Attentive-Tacotron

This is a PyTorch implementation of Google's Non-attentive Tacotron.

Jupyter Notebook 58 12 Updated Dec 21, 2022

QwenLM / Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,586 124 Updated Aug 13, 2024

makotu1208 / Otto-kaggle-solution-makotupart

kaggle:otto competition

Python 18 3 Updated Feb 13, 2023

bschifferer / Kaggle-Otto-Comp

I share my solution for the Otto Competition, scoring LB 0.601, using Reranker, Transformers and GRU

Jupyter Notebook 26 4 Updated Mar 22, 2023

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 35,063 3,787 Updated Feb 18, 2025

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing full-stack inference, training and deployment capabilities.

Python 11,873 1,181 Updated Mar 13, 2025

metavoiceio / metavoice-src

Foundational model for human-like, expressive TTS

Python 4,060 678 Updated Jul 30, 2024

shenweichen / AlgoNotes

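[Qianmeng Study Notes (浅梦学习笔记)] Article collection covering ranking & CXR prediction, recall and matching, user profiling & feature engineering, recommendation, search and computational advertising, big data, graph algorithms, NLP & CV, job interviews, and more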

1,665 232 Updated Dec 24, 2022

xiaomingnio / kantts

TTS application based on ModelScope KAN-TTS

Python 43 6 Updated Apr 11, 2024

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,675 675 Updated Mar 3, 2025

UlionTse / mlgb

MLGB (「妙计包」) is a library of 50+ deep models for CTR prediction and recommender systems, written in TensorFlow and PyTorch.

Python 520 26 Updated Mar 2, 2025

Shuigs18

Lists (6)

AIGC

NLP

tools

Recommendation

Algorithm competitions

Speech synthesis
