moviewang's starred repositories

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 4,218 387 Updated Dec 18, 2024

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,703 231 Updated Oct 16, 2024

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 1,733 144 Updated Jan 17, 2025
1 Updated Dec 18, 2024

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,351 176 Updated Feb 14, 2025

Unofficial PyTorch implementation of Google AI's VoiceFilter system

Python 1,122 227 Updated Jul 25, 2024

A PyTorch-based Speech Toolkit

Python 9,459 1,446 Updated Mar 6, 2025

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,592 1,892 Updated Mar 5, 2025

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 6,986 841 Updated Mar 6, 2025

How voiceprint recognition works in a WeChat page

JavaScript 42 9 Updated Jun 5, 2018

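Extracts hard-coded subtitles from videos and generates srt files. No third-party API is required; text recognition runs locally. A deep-learning-based video subtitle extraction framework that covers subtitle region detection and subtitle content extraction. A GUI tool for extracting hard-coded subtitles (hardsubs) from videos and generating srt files.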

Python 6,759 727 Updated Feb 25, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 8,622 891 Updated Mar 5, 2025

The official gpt4free repository | a varied collection of powerful language models | o3 and deepseek r1, gpt-4.5

Python 63,763 13,567 Updated Mar 6, 2025

Open-source, accurate and easy-to-use video speech recognition & clipping tool, with LLM-based AI clipping integrated.

Python 4,241 475 Updated Aug 22, 2024

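Batch-download WeChat Official Account articles online, with support for read counts, comments, and embedded audio and video. No environment setup is needed, article styling is reproduced 100%, and private deployment is supported.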

Vue 3,347 477 Updated Mar 4, 2025

Simple Online Realtime Tracking with a Deep Association Metric

Python 5,551 1,507 Updated Mar 2, 2025

One minute of voice data can also be used to train a good TTS model! (few-shot voice cloning)

Python 41,819 4,667 Updated Mar 5, 2025

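🎉🎉🎉 Technology stack for senior Java architects == Any skill can be mastered through "deliberate practice", just like cooking. Here is a Java development handbook; all you need to do is practice more. 🏃🏃🏃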

Java 957 296 Updated Feb 22, 2023

A reading list of video generation

509 35 Updated Mar 3, 2025

Blendshape and kinematics calculator for Mediapipe/Tensorflow.js Face, Eyes, Pose, and Finger tracking models.

TypeScript 5,380 666 Updated Jul 19, 2023

A real-time motion capture system for 3D virtual character animating.

JavaScript 2,603 420 Updated Jul 18, 2024

OpenMMLab Pose Estimation Toolbox and Benchmark.

Python 6,188 1,303 Updated Aug 7, 2024

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

Python 4,435 404 Updated Mar 7, 2025

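Sz-Admin: an open-source RBAC admin (middle- and back-office) framework designed for modern applications. It combines a current technology stack, including Spring Boot 3, JDK 21, Mybatis Flex, Sa-Token, Knife4j, and Flyway on the back end, and Vue 3, Vite5, TypeScript, and Element Plus on the front end, and aims to provide an intuitive, smooth, and powerful development experience.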

Java 210 52 Updated Mar 6, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 10,093 1,379 Updated Feb 24, 2025

forest: a modern knowledge-community back-end project built with SpringBoot + Shiro + MyBatis + JWT + Redis

Java 762 138 Updated Jan 9, 2025

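The fuint membership marketing system provides member management, a points mall, and marketing tools for brick-and-mortar stores. Built on Java SpringBoot, Vue, and Uniapp, it includes a customer-facing WeChat mini program, an H5 site, and a back-office management and cashier client. Its marketing features include coupons, prepaid cards, physical cards, punch cards (per-use cards), stored-value cards, e-vouchers, a member points system, and member tiers. It suits physical stores of all kinds that pair with an online e-commerce system, such as retail supermarkets, car 4S dealerships, flower shops, dessert shops, and restaurants. The system can be used as a cashier system and connects the offline cashier system with online…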

Java 1,114 261 Updated Mar 6, 2025

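🌮 Taco Mall (塔可商城): an open-source, cross-platform project comprising a mini program, an admin back office, and back-end services, built on the springboot3 + uniapp + vue3 stack. It is a new-retail e-commerce system with built-in features such as member distribution, regional agents, and product retail.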

Java 633 147 Updated Mar 29, 2024

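Ruhua Mall system (如花商城), thinkphp6 + uniapp: a mini program and app mall with live streaming, group buying, limited-time offers, and distribution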

JavaScript 245 92 Updated Aug 3, 2023

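An enterprise-grade system for quickly integrating LLM APIs. It supports OpenAI, Azure, ERNIE Bot (文心一言), iFLYTEK Spark (讯飞星火), Tongyi Qianwen (通义千问), Zhipu GLM, Gemini, DeepSeek, Anthropic Claude, and OpenAI-format models. It has a clean page style, is lightweight, efficient, and stable, and supports one-click Docker deployment.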

Go 241 26 Updated Mar 3, 2025