- Montréal, Canada
- http://solipsist.studio
Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Stable Diffusion web UI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
A Gradio web UI for Large Language Models with support for multiple inference backends.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
The simplest, fastest repository for training/finetuning medium-sized GPTs.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Easily train a good VC model with voice data <= 10 mins!
Generative Models by Stability AI
State-of-the-art 2D and 3D Face Analysis Project
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Industry leading face manipulation platform
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
WebUI extension for ControlNet
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Generate 3D objects conditioned on text or images
Large Language Model Text Generation Inference
High-Resolution 3D Human Digitization from A Single Image.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.