Stars
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io/vallex/
An unofficial PyTorch implementation of the audio LM VALL-E
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html
Vector (and Scalar) Quantization, in Pytorch
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / Qwen / DeepSeek), Knowledge Base (file upload / knowledge managemen…
sentence-transformers to ONNX: makes SBERT model inference faster
A demo PDF viewer implemented with Vue and PDF.js
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
One-Stop Solution to encode sentence to fixed length vectors from various embedding techniques
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
As little as 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
Implementation of Analyzing and Improving the Image Quality of StyleGAN (StyleGAN 2) in PyTorch
StyleGAN2-ADA - Official PyTorch implementation
Unlock the Power of LLM: Explore These Datasets to Train Your Own ChatGPT!
Official Code for DragGAN (SIGGRAPH 2023)
Robust Speech Recognition via Large-Scale Weak Supervision
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
SoftVC VITS Singing Voice Conversion
Tiled Diffusion and VAE optimization, licensed under CC BY-NC-SA 4.0
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
🔊 Text-Prompted Generative Audio Model
An enterprise-class package of Flutter components for mobile applications. (Bruno is a Flutter component library built on a complete design system.)
An open-source tool-augmented conversational language model from Fudan University
Inpaint images with ControlNet
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX
Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge.