- Beijing, China
- https://liaoliaojun.com
Stars
Clone a voice in 5 seconds to generate arbitrary speech in real-time
A modular graph-based Retrieval-Augmented Generation (RAG) system
DSPy: The framework for programming—not prompting—language models
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用
a state-of-the-art-level open visual language model | 多模态预训练模型
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
SD-Trainer. LoRA & Dreambooth training scripts & GUI use kohya-ss's trainer, for diffusion model.
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Official implementations for paper: Anydoor: zero-shot object-level image customization
Various AI scripts. Mostly Stable Diffusion stuff.
Understand Human Behavior to Align True Needs
All of the Civitai models inside Automatic 1111 Stable Diffusion Web UI
Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.
Lumina-T2X is a unified framework for Text to Any Modality Generation
An all in one solution for adding Temporal Stability to a Stable Diffusion Render via an automatic1111 extension
A realtime sketch to image demo using LCM and the gradio library.
set prompt to divided region
🎯 Task-oriented embedding tuning for BERT, CLIP, etc.
[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.
A UI made in Pyside6 to make training LoRA/LoCon and other LoRA type models in sd-scripts easy