- Shanghai, China
- https://jiaxin-ye.github.io/
Toolkit 👍
Turn your ideas into emojis in seconds. Generate your favorite Slack emojis with just one click.
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Phoneme alignment representation compatible with multiple forced aligners
🔊 Text-Prompted Generative Audio Model
Now much bigger than, and different from, the original idea: a collection of premium software in various categories.
Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
NeMo text processing for ASR and TTS
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
"Hung-yi Lee Deep Learning Tutorial" (recommended by Prof. Hung-yi Lee 👍, known as the "Apple Book" 🍎). PDF download: https://github.com/datawhalechina/leedl-tutorial/releases
Comprehensive quantitative comparison of lossless and lossy audio codecs
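Extracts WeChat chat history and exports it to HTML, Word, and Excel documents for permanent storage; analyzes the chat history to generate an annual chat report; and uses the chat data to train a personal AI chat assistant.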
Let us democratise high-resolution generation! (CVPR 2024)
Simple text to phones converter for multiple languages
TensorFlow-based spoken language identification
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Fast and memory-efficient exact attention
A speaker embedding network in PyTorch that is very quick to set up and use for whatever purpose.
ECCV18 Workshops - Enhanced SRGAN. Champion of the PIRM Challenge on Perceptual Super-Resolution. The training code is in BasicSR.
Localized watermarking for AI-generated speech audio, with SOTA robustness and a very fast detector
🌏🌍🌎Translators🌎🌍🌏 is a library that aims to bring free, multiple, enjoyable translations to individuals and students in Python.
Realtime human head pose estimation with ONNXRuntime and OpenCV.