Lists (1)
Sort Name ascending (A-Z)
Stars
A simple python script that turns text from an input.txt file into speech. Great for converting blog content into audio.
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
📖 Uncle小说,PC版,一个全网小说下载器及阅读器,目录解析与书源结合,支持有声小说与文本小说,可下载mobi、epub、txt格式文本小说。
【国内梯子排行】最好用的VPN梯子推荐与科学上网测评 -梯子、科学上网、翻墙、机场、v2ray、trojan、shadowsock
RetinaFace: Deep Face Detection Library for Python
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
ChatTTS 2000条音色稳定性打分🥇+区分男女年龄👧+在线试听🔈 ChatTTS 2K Speaker Stability Score & Categorized by Gender and Age & Audio Preview
官方推荐的 ChatTTS 资源汇总项目,整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project
一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
国际中文朗读语料评分模型建构与优化实践
A simple project to visualize kernels and features of a trained U-Net model
Diffusion Explainer: Visual Explanation for Text-to-image Stable Diffusion
Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition"
C0untFloyd / roop-unleashed
Forked from s0md3v/roopEvolved Fork of roop with Web Server and lots of additions
Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)
清华大学计算机系课程攻略 Guidance for courses in Department of Computer Science and Technology, Tsinghua University
CityU ML tutorials for your baseline experiments
哈尔滨工业大学(深圳)计算机专业课程攻略 | Guidance for courses in Department of Computer Science, Harbin Institute of Technology (Shenzhen)
哈尔滨工业大学(深圳)数据科学与大数据技术专业课程攻略 | Guidance for courses in Department of Data Science and Big Data Technology, Harbin Institute of Technology (Shenzhen)
A minimal yet resourceful implementation of diffusion models (along with pretrained models + synthetic images for nine datasets)
Windows version of NVIDIA's NCCL ('Nickel') for multi-GPU training - please use https://github.com/NVIDIA/nccl for changes.
The official implementation of "Intellectual Property Protection of Diffusion Models via the Watermark Diffusion Process"