
Faster Whisper transcription with CTranslate2

Python 14,746 1,246 Updated Jan 1, 2025

A tiny reproduction of DeepSeek-R1-Zero, run on two A100s.

Python 49 19 Updated Feb 1, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 5,594 549 Updated Mar 12, 2025

Taming Stable Diffusion for Lip Sync!

Python 2,928 437 Updated Jan 19, 2025

Official inference repo for FLUX.1 models

Python 20,781 1,462 Updated Feb 6, 2025

[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 3,219 381 Updated Feb 27, 2025

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For an HD commercial model, please try out Sync Labs

Python 11,522 2,430 Updated Feb 10, 2025

A curated collection of digital human resources

741 84 Updated Jan 8, 2025

Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 929 173 Updated Mar 13, 2025

The code for WWW2024 paper "Rethinking Cross-Domain Sequential Recommendation under Open-World Assumptions".

Python 29 2 Updated Aug 12, 2024

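A collection of industry practice articles on search, recommendation, advertising, user growth, and more (sources: Zhihu, Datafuntalk, technical WeChat official accounts)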

Python 3,109 371 Updated Mar 13, 2025

Open-source, accurate, and easy-to-use video speech recognition & clipping tool, with integrated LLM-based AI clipping.

Python 4,272 479 Updated Mar 11, 2025
Python 8 2 Updated Jul 28, 2024

Mamba SSM architecture

Python 14,219 1,238 Updated Jan 18, 2025

Custom Jupyter Notebook Themes

CSS 9,816 1,053 Updated Oct 18, 2023

Kaggle OTTO recommender system code; single model, LB 0.596, about rank 22

Python 21 1 Updated Feb 9, 2023

👾 Fast and simple video download library and CLI tool written in Go

Go 28,835 3,085 Updated Mar 13, 2025

This is a PyTorch implementation of Google's Non-Attentive Tacotron.

Jupyter Notebook 58 12 Updated Dec 21, 2022

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,586 124 Updated Aug 13, 2024

Kaggle: OTTO competition

Python 18 3 Updated Feb 13, 2023

I share my solution for the Otto Competition, scoring LB 0.601, using Reranker, Transformers and GRU

Jupyter Notebook 26 4 Updated Mar 22, 2023

A generative speech model for daily dialogue.

Python 35,063 3,787 Updated Feb 18, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 11,873 1,181 Updated Mar 13, 2025

Foundational model for human-like, expressive TTS

Python 4,060 678 Updated Jul 30, 2024

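[Qianmeng (浅梦) Study Notes] Article index, covering ranking & CXR prediction, recall & matching, user profiling & feature engineering, recommendation and search in general, computational advertising, big data, graph algorithms, NLP & CV, job hunting and interviews, and more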

1,665 232 Updated Dec 24, 2022

TTS application based on ModelScope KAN-TTS

Python 43 6 Updated Apr 11, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,675 675 Updated Mar 3, 2025

MLGB (「妙计包」) is a library of 50+ deep models for CTR prediction and recommender systems, written in TensorFlow and PyTorch.

Python 520 26 Updated Mar 2, 2025