Lists (8)
Sort Name ascending (A-Z)
Stars
Autosvg is tracing tool, which can convert image format like (jpg,png,gif) into vector
📜 Klipper Preprocessor script for Cura
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement
TripoSR custom node for comfyui
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
A system for agentic LLM-powered data processing and ETL
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
sketch + style = paints 🎨 (TOG2018/SIGGRAPH2018ASIA)
[AAAI 2025] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
[CVPR 2020] 3D Photography using Context-aware Layered Depth Inpainting
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
SD-Trainer. LoRA & Dreambooth training scripts & GUI use kohya-ss's trainer, for diffusion model.
Turns Data and AI algorithms into production-ready web applications in no time.
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)
This repo contains code and a pre-trained model for clothes segmentation.
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis".