Lists (4)
Sort Name ascending (A-Z)
Stars
Multilingual Voice Understanding Model
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Excel file(*.xlsx) reader/writer library using Qt 5 or 6. Descendant of QtXlsxWriter.
Animal identification using face recognition based methods
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
深挖b站如何控评-对阿瓦隆系统探究
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
A resource for learning about Machine learning & Deep Learning
The world's simplest facial recognition api for Python and the command line
🎉 一个简约的音乐播放器,支持逐字歌词,下载歌曲,展示评论区,音乐云盘及歌单管理,音乐频谱,移动端基础适配 | 网易云音乐 | A minimalist music player
An open-source user mode debugger for Windows. Optimized for reverse engineering and malware analysis.
ASCII generator (image to text, image to image, video to video)
[CVPR 2024] SinSR: Diffusion-Based Image Super-Resolution in a Single Step
[NeurIPS 2024] Generalizable Implicit Motion Modeling for Video Frame Interpolation
本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型,同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法
MSVC's implementation of the C++ Standard Library.
📚 Modern C++ Tutorial: C++11/14/17/20 On the Fly | https://changkun.de/modern-cpp/