- Hong Kong
- https://shihaozhaozsh.github.io/
Stars
[ECCV 2024 Oral] ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction
Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility.
Open-Sora: Democratizing Efficient Video Production for All
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Pytorch Lightning入门中文教程,转载请注明来源。(当初是写着玩的,建议看完MNIST这个例子再上手)
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
[TMLR] CiPR: An Efficient Framework with Cross-instance Positive Relations for Generalized Category Discovery
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
LaVIT: Empower the Large Language Model to Understand and Generate Visual Content
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
collection of diffusion model papers categorized by their subareas
[ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation
High-fidelity performance metrics for generative models in PyTorch
CLIP-based aesthetics predictor inspired by the interface of 🤗 huggingface transformers.
[CVPR2023] Blind Video Deflickering by Neural Filtering with a Flawed Atlas
animatediff prompt travel
Using Low-rank adaptation to quickly fine-tune diffusion models.
Github for my ICCV 2017 paper: "Localizing Moments in Video with Natural Language"
Auto1111 extension implementing text2video diffusion models (like ModelScope or VideoCrafter) using only Auto1111 webui dependencies
Pytorch implementation for Semantic Segmentation/Scene Parsing on MIT ADE20K dataset
pytorch structural similarity (SSIM) loss
Finetune ModelScope's Text To Video model using Diffusers 🧨