✨✨Latest Advances on Multimodal Large Language Models

13,353 847 Updated Jan 2, 2025

Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Models (MLLM). It covers datasets, tuning techniques, in-contex…

282 19 Updated Dec 23, 2024

Source code for paper: "AltDiffusion: A multilingual Text-to-Image diffusion model"

Python 37 2 Updated Feb 15, 2024

Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)

Python 536 40 Updated Apr 23, 2024

Reading notes on a wide range of research on domain generalization, domain adaptation, causality, robustness, prompt, optimization, and generative models

1,185 102 Updated Dec 14, 2023

[CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition

Python 125 3 Updated Aug 23, 2024
87 Updated Jan 25, 2024

A curated list of papers, code and resources pertaining to few-shot image generation.

370 46 Updated Jun 3, 2023

Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting

Python 11 Updated Jul 16, 2024

A collection of resources on controllable generation with text-to-image diffusion models.

959 27 Updated Dec 31, 2024

[CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"

Python 61 10 Updated May 1, 2024

ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)

Python 519 30 Updated Jan 8, 2024

[TMLR] Official PyTorch implementation of "λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space"

Python 48 6 Updated Nov 29, 2024

Code for paper LAFITE: Towards Language-Free Training for Text-to-Image Generation (CVPR 2022)

Python 181 26 Updated Mar 23, 2023

Official codebase for the Paper “Retrieval-Augmented Diffusion Models”

Jupyter Notebook 119 8 Updated Apr 5, 2023

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

2,233 192 Updated Nov 7, 2024

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

1,359 92 Updated Aug 20, 2024

Retrieval augmented diffusion from CompVis.

Jupyter Notebook 51 7 Updated Aug 20, 2022

Source code and dataset for ACL 2019 paper "ERNIE: Enhanced Language Representation with Informative Entities"

Python 1,413 268 Updated Jan 10, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 32,203 3,690 Updated Dec 28, 2024

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python 30,931 7,542 Updated Dec 19, 2024

Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

Python 12,961 2,910 Updated Jan 3, 2025

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 15,506 1,427 Updated Sep 5, 2024

OpenMMLab Detection Toolbox and Benchmark

Python 29,960 9,501 Updated Aug 21, 2024

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | A Chinese-English bilingual multimodal large model series based on the CPM foundation models

Python 1,103 91 Updated Jun 13, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 11,477 694 Updated Dec 24, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 15,162 1,223 Updated Dec 12, 2024

This is the first Chinese chat model specifically fine-tuned for Chinese through ORPO based on the Meta-Llama-3-8B-Instruct model.

318 18 Updated May 6, 2024

Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.

Go 105,706 8,453 Updated Jan 4, 2025