Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra-lightweight OCR system, supports recognition of 80+ languages, provides data annotation and synthesis tools, supports training and…

Python 46,172 7,975 Updated Feb 6, 2025

tencent-ailab / IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images from an image prompt.

Jupyter Notebook 5,594 356 Updated Jun 28, 2024

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 19,958 1,393 Updated Feb 6, 2025

Kwai-Kolors / Kolors

Kolors Team

Python 4,141 309 Updated Nov 13, 2024

mulanai / MuLan

MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (adds multilingual support to any diffusion model without additional training)

Python 130 3 Updated Jan 24, 2025

Tencent / HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Jupyter Notebook 3,861 325 Updated Jan 13, 2025

salaniz / pycocoevalcap

Forked from tylin/coco-caption

Python 3 support for the MS COCO caption evaluation tools

Python 310 85 Updated Aug 1, 2024

Alpha-VLLM / Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,139 89 Updated Aug 6, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 23,241 2,290 Updated Jan 22, 2025

mini-sora / minisora

MiniSora: A community that aims to explore the implementation path and future development direction of Sora.

Python 1,251 152 Updated Dec 19, 2024

NVlabs / DiffiT

[ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation

483 17 Updated Oct 31, 2024

YangLing0818 / Diffusion-Models-Papers-Survey-Taxonomy

Diffusion model papers, survey, and taxonomy

3,080 255 Updated Dec 2, 2024

NUS-HPC-AI-Lab / VideoSys

VideoSys: An easy and efficient system for video generation

Python 1,901 129 Updated Jan 1, 2025

labuladong / fucking-algorithm

Solving algorithm problems is all about patterns, and labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.

Markdown 126,706 23,298 Updated Jan 31, 2025

mosaicml / diffusion

Python 689 73 Updated Jan 10, 2025

huggingface / diffusion-models-class

Materials for the Hugging Face Diffusion Models Course

Jupyter Notebook 3,853 419 Updated Aug 19, 2024

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,398 5,623 Updated Feb 6, 2025

YangLing0818 / RPG-DiffusionMaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)

Jupyter Notebook 1,741 102 Updated Feb 1, 2025

leeguandong / Awesome-Chinese-Stable-Diffusion

A collection of Chinese text-to-image Stable Diffusion models

286 17 Updated Jul 8, 2024

awesome-stable-diffusion / awesome-stable-diffusion

Curated list of awesome resources for the Stable Diffusion AI Model.

1,526 74 Updated Apr 9, 2024

SingleZombie / DL-Demos

Demos for deep learning

Python 497 115 Updated Dec 4, 2024

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance

Python 6,940 529 Updated Dec 25, 2024

THUDM / CogVLM

A state-of-the-art open visual language model | multimodal pretrained model

Python 6,321 426 Updated May 29, 2024

Stability-AI / generative-models

Generative Models by Stability AI

Python 25,211 2,789 Updated Sep 4, 2024

Starred topics

fine-grained-classification

Kang Zhao Miracle2333

Lists (7)

3D Vision

Detection

NLP

Others

PaddlePaddle

SSL

SSOD
