thaoshibe
🐾
Why are you looking at me?

A webapp to visualize relationships among Chinese characters and to see example sentences that illustrate their use. Also available for Japanese learners.

JavaScript 65 Updated Jan 20, 2025

[CVPR 2024] Wired Perspectives: Multi-View Wire Art Embraces Generative AI

80 2 Updated Feb 27, 2024

Computer Science Conference Statistics

HTML 11 3 Updated Jan 22, 2025

Personalized Representation from Personalized Generation

Python 49 Updated Dec 23, 2024

A curated list of Awesome Personalized Large Multimodal Models resources

6 Updated Jan 10, 2025

Python 7 Updated Oct 2, 2024

EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python 3,482 396 Updated Dec 10, 2024

Bring portraits to life!

Python 13,743 1,470 Updated Jan 1, 2025

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Python 17,320 2,403 Updated Jan 20, 2025

PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 907 41 Updated Jan 16, 2025

[ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,142 49 Updated Jan 23, 2025

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 716 38 Updated Aug 5, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 13,811 1,378 Updated Dec 25, 2024

Utilities intended for use with Llama models.

Python 5,675 950 Updated Jan 24, 2025

PyTorch implementation of "Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model"

Python 145 16 Updated Jul 14, 2024

Easily compute CLIP embeddings and build a CLIP retrieval system with them

Jupyter Notebook 2,475 218 Updated Apr 15, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,913 113 Updated Jul 29, 2024

🌋👵🏻 Yo'LLaVA: Your Personalized Language and Vision Assistant

Python 79 6 Updated Oct 28, 2024

Your image is almost there!

Python 7,481 428 Updated Jul 26, 2024

🌸 A collection of Vietnamese women who are currently working in the field of Computer Science.

CSS 10 Updated Jan 9, 2025

Jupyter Notebook 1 Updated Dec 31, 2023

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

Python 308 22 Updated Jul 17, 2024

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 11,996 11,498 Updated Jan 27, 2025

A curated list of Awesome Makeup Transfer resources

228 37 Updated Nov 25, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 23,178 2,281 Updated Jan 22, 2025

[WACV 2024] An implementation of MEGANet for polyp segmentation with multi-scale edge-guided attention

Python 67 5 Updated Feb 5, 2024

✨✨Latest Advances on Multimodal Large Language Models

13,678 876 Updated Jan 28, 2025

[ICLR'24] GTA: A Geometry-Aware Attention Mechanism for Multi-view Transformers

Python 121 2 Updated Jun 14, 2024