Lists (6)
Sort Name ascending (A-Z)
Stars
Faster Whisper transcription with CTranslate2
JerryWu-code / TinyZero
Forked from Jiayi-Pan/TinyZeroDeepseek R1 zero tiny version own reproduce on two A100s.
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Official inference repo for FLUX.1 models
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).
The code for WWW2024 paper "Rethinking Cross-Domain Sequential Recommendation under Open-World Assumptions".
搜索、推荐、广告、用增等工业界实践文章收集(来源:知乎、Datafuntalk、技术公众号)
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
kaggle Otto Recommender system code, single model LB0.596, about rank 22
👾 Fast and simple video download library and CLI tool written in Go
This is Pytorch Implementation of Google's Non-attentive Tacotron.
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
kaggle:otto competition
I share my solution for the Otto Competition, scoring LB 0.601, using Reranker, Transformers and GRU
A generative speech model for daily dialogue.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Foundational model for human-like, expressive TTS
【浅梦学习笔记】文章汇总:包含 排序&CXR预估,召回匹配,用户画像&特征工程,推荐搜索综合 计算广告,大数据,图算法,NLP&CV,求职面试 等内容
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
MLGB is a library that includes many models of CTR Prediction & Recommender System by TensorFlow & PyTorch. 「妙计包」是一个包含50+点击率预估和推荐系统深度模型的、通过TensorFlow和PyTorch撰写的库。