Skip to content
View pango99's full-sized avatar

Block or report pango99

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 1,907 163 Updated Mar 19, 2025
Python 232 32 Updated Apr 11, 2025

Official repository for the paper, "Towards Real-world Event-guided Low-light Video Enhancement and Deblurring", ECCV 2024.

Python 33 1 Updated Feb 19, 2025

Single Image Deraining: A Comprehensive Benchmark Analysis

173 23 Updated Jan 5, 2021

Keep track of makes, misses, and total shooting percentage using YOLOv8 and openCV in Python.

Jupyter Notebook 4 Updated Aug 14, 2024

[CVPR 2025 Highlight] X-Dyna: Expressive Dynamic Human Image Animation

Python 234 21 Updated Jan 30, 2025
Python 137 18 Updated Apr 8, 2025

Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"

Python 2,383 197 Updated Mar 14, 2025

[CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation

Python 937 51 Updated Apr 7, 2025

Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。

Python 664 111 Updated Apr 10, 2025
Python 76 8 Updated Jan 24, 2025

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,572 198 Updated Apr 14, 2025

Unofficial PyTorch implementation of Google AI's VoiceFilter system

Python 1,134 228 Updated Jul 25, 2024

You can find the speech algorithms you want here

C 796 248 Updated Jan 1, 2025

Noise supression using deep filtering

Python 27 4 Updated May 23, 2024

SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech denoising using an ONNX model. This repository contains everythi…

Python 72 12 Updated Aug 16, 2024

c# library for decoding paraformer, sensevoice Models,used in speech recognition (ASR)

C# 49 4 Updated Aug 23, 2024

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,446 1,130 Updated Mar 29, 2025

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS…

C++ 5,606 629 Updated Apr 10, 2025
Python 78 14 Updated Jan 14, 2025

Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything

Python 1,261 83 Updated Nov 7, 2024

Open and efficient video watermarking

Python 349 42 Updated Apr 4, 2025

使用YOLOv5+DeepLabV3Plus实现仪表的检测、指针表盘分割和刻度读数识别

C++ 44 10 Updated Oct 14, 2021

使用YoloX+DeepLabV3Plus实现仪表的检测、指针表盘分割和刻度读数识别(借助ncnn框架)

C++ 27 5 Updated Oct 12, 2024

[Pedestron] Generalizable Pedestrian Detection: The Elephant In The Room. @ CVPR2021

Python 701 157 Updated Dec 4, 2024

文档方向分类

Python 216 15 Updated Nov 20, 2024

修正文档扭曲/模糊/阴影等情况,使用onnx模型简单轻量部署,未来持续跟进最新最好的文档矫正方案和模型,Correct document distortion using a lightweight ONNX model for easy deployment. We will continue to follow and integrate the latest and best docu…

Python 50 8 Updated Dec 15, 2024

Memory-Guided Diffusion for Expressive Talking Video Generation

Python 780 88 Updated Jan 24, 2025
Next