Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek3, ...) and 150+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Inter…

Python 5,144 445 Updated Jan 24, 2025

CLAY-3D / OpenCLAY

CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets

877 11 Updated Jun 21, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 34,604 5,290 Updated Jan 24, 2025

santisoler / cc-licenses

Creative Commons Licenses for Github

566 304 Updated Dec 10, 2024

PyAV-Org / PyAV

Pythonic bindings for FFmpeg's libraries.

Cython 2,621 375 Updated Jan 22, 2025

JourneyDB / JourneyDB

163 5 Updated Jul 18, 2023

3DTopia / 3DTopia

Text-to-3D Generation within 5 Minutes

Python 686 48 Updated Mar 10, 2024

deepseek-ai / DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 2,387 227 Updated Apr 24, 2024

lllyasviel / sd-forge-layerdiffuse

[WIP] Layer Diffusion for WebUI (via Forge)

Python 3,941 338 Updated Aug 30, 2024

mit-han-lab / distrifuser

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Python 645 26 Updated Dec 2, 2024

lllyasviel / LayerDiffuse

Transparent Image Layer Diffusion using Latent Transparency

2,058 29 Updated Jun 16, 2024

PixArt-alpha / PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 2,941 184 Updated Oct 31, 2024

apple / ml-mgie

Python 3,873 253 Updated Mar 15, 2024

TencentARC / MotionCtrl

Official Code for MotionCtrl [SIGGRAPH 2024]

Python 1,376 75 Updated Sep 20, 2024

YingqingHe / Awesome-LLMs-meet-Multimodal-Generation

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

HTML 412 23 Updated Jan 18, 2025

duyguceylan / pix2video

Code for the paper "Pix2Video: Video Editing using Image Diffusion"

Python 68 5 Updated Oct 2, 2023

lllyasviel / Fooocus

Focus on prompting and generating

Python 42,755 6,268 Updated Jan 14, 2025

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,194 2,327 Updated Aug 12, 2024

yt-dlp / yt-dlp

A feature-rich command-line audio/video downloader

Python 97,660 7,655 Updated Jan 23, 2025

necla-ml / Diff-JPEG

Official and maintained implementation of the paper "Differentiable JPEG: The Devil is in the Details" [WACV 2024].

Python 91 5 Updated Dec 30, 2023

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,568 4,599 Updated Jan 23, 2025