Shaojie Li SJLeo

20 followers · 7 following

Xiamen University
https://shaojieli.github.io

Stars

kohya-ss / sd-scripts

Python 5,527 910 Updated Jan 5, 2025

showlab / Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

3,760 212 Updated Jan 9, 2025

THUDM / CogVideo

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,208 953 Updated Jan 8, 2025

NVlabs / Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 1,839 94 Updated Jan 8, 2025

gnobitab / InstaFlow

⚡ InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)

Python 1,243 41 Updated Jun 7, 2024

luosiallen / latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Python 4,417 230 Updated Jun 14, 2024

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 19,278 1,362 Updated Dec 31, 2024

instantX-research / InstantID

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,289 826 Updated Jul 18, 2024

ToTheBeginning / PuLID

[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Python 2,943 208 Updated Nov 27, 2024

tencent-ailab / IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.

Jupyter Notebook 5,525 350 Updated Jun 28, 2024

Stability-AI / stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Python 39,719 5,103 Updated Oct 10, 2024

comfyanonymous / ComfyUI

The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.

Python 62,933 6,722 Updated Jan 9, 2025

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,039 5,546 Updated Jan 9, 2025

modelscope / facechain

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Jupyter Notebook 9,215 864 Updated Dec 10, 2024

lllyasviel / ControlNet-v1-1-nightly

Nightly release of ControlNet 1.1

Python 4,849 384 Updated Aug 8, 2024

Stability-AI / generative-models

Generative Models by Stability AI

Python 25,041 2,776 Updated Sep 4, 2024

dvlab-research / ControlNeXt

Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA

Python 1,466 72 Updated Sep 25, 2024

lllyasviel / ControlNet

Let us control diffusion models!

Python 31,159 2,791 Updated Feb 25, 2024

CompVis / stable-diffusion

A latent text-to-image diffusion model

Jupyter Notebook 69,080 10,252 Updated Jun 18, 2024

facebookresearch / DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,680 597 Updated May 31, 2024

dbolya / tomesd

Speed up Stable Diffusion with this one simple trick!

Python 1,309 81 Updated Nov 29, 2023

ymcui / Chinese-LLaMA-Alpaca-2

Chinese LLaMA-2 & Alpaca-2 LLMs (phase two of the project) with 64K long-context models

Python 7,137 581 Updated Sep 23, 2024

THUDM / CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,197 147 Updated Sep 3, 2024

ge25nab / Awesome-VLM-AD-ITS

This repository collects research papers of large Vision Language Models in Autonomous driving and Intelligent Transportation System. The repository will be continuously updated to track the lates…

183 14 Updated Sep 15, 2024

radarFudan / Awesome-state-space-models

Collection of papers on state-space models

567 20 Updated Dec 26, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

13,440 853 Updated Jan 6, 2025

Thinklab-SJTU / Awesome-LLM4AD

A curated list of awesome LLM for Autonomous Driving resources (continually updated)

1,105 55 Updated Sep 25, 2024

Timothyxxx / Chain-of-ThoughtsPapers

A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".

1,979 132 Updated Oct 5, 2023

MCZhi / Driving-IRL-NGSIM

[T-ITS] Driving Behavior Modeling using Naturalistic Human Driving Data with Inverse Reinforcement Learning

Python 223 40 Updated May 13, 2022

PaddlePaddle / Paddle3D

A 3D computer vision development toolkit based on PaddlePaddle. It supports point-cloud object detection, segmentation, and monocular 3D object detection models.

Python 583 142 Updated Jan 9, 2025