Stars
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Stable Diffusion web UI
A feature-rich command-line audio/video downloader
Share interesting, entry-level open source projects on GitHub.
Robust Speech Recognition via Large-Scale Weak Supervision
A natural language interface for computers
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Even 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
A high-throughput and memory-efficient inference and serving engine for LLMs
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real-time
Official Code for DragGAN (SIGGRAPH 2023)
Real-time face swap for PC streaming or video calls
Generative Models by Stability AI
Run macOS on QEMU/KVM, now with OpenCore + Monterey + Ventura + Sonoma support! Only commercial (paid) support is currently available, to avoid spammy issues. No Mac system is required.
GUI for a Vocal Remover that uses Deep Neural Networks.
⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡
Avatars for Zoom, Skype and other video-conferencing apps.
Provides multiple Shadowrocket rule sets with ad filtering, for selective automatic firewall circumvention on non-jailbroken iOS devices.
Bringing Old Photo Back to Life (CVPR 2020 oral)
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photo tool.
ChatGLM3 series: Open Bilingual Chat LLMs
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
Translate a video from one language to another and add dubbing. Also supports speech recognition transcription, speech synthesis, and subtitle translation.
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs.