View thaoshibe's full-sized avatar
🐾
Why are you looking at me?


A webapp to visualize relationships among Chinese characters and to see example sentences that illustrate their use. Also available for Japanese learners.

JavaScript 66 Updated Jan 20, 2025

[CVPR 2024] Wired Perspectives: Multi-View Wire Art Embraces Generative AI

80 2 Updated Feb 27, 2024

Computer Science Conference Statistics

HTML 11 3 Updated Jan 22, 2025

Personalized Representation from Personalized Generation

Python 49 Updated Dec 23, 2024

A curated list of Awesome Personalized Large Multimodal Models resources

6 Updated Jan 10, 2025

Python 7 Updated Oct 2, 2024

EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python 3,485 396 Updated Dec 10, 2024

Bring portraits to life!

Python 13,757 1,472 Updated Jan 1, 2025

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Python 17,351 2,412 Updated Jan 20, 2025

PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 907 41 Updated Jan 16, 2025

[ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,143 49 Updated Jan 23, 2025

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 716 38 Updated Aug 5, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 13,825 1,379 Updated Dec 25, 2024

Utilities intended for use with Llama models.

Python 5,696 953 Updated Jan 29, 2025

PyTorch implementation of "Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model"

Python 145 16 Updated Jul 14, 2024

Easily compute CLIP embeddings and build a CLIP retrieval system with them

Jupyter Notebook 2,476 218 Updated Apr 15, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,913 113 Updated Jul 29, 2024

🌋👵🏻 Yo'LLaVA: Your Personalized Language and Vision Assistant

Python 79 6 Updated Oct 28, 2024

Your image is almost there!

Python 7,479 428 Updated Jul 26, 2024

🌸 A collection of Vietnamese women who are currently working in the field of Computer Science.

CSS 10 Updated Jan 9, 2025

Jupyter Notebook 1 Updated Dec 31, 2023

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

Python 308 22 Updated Jul 17, 2024

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 12,013 11,501 Updated Jan 28, 2025

A curated list of Awesome Makeup Transfer resources

229 37 Updated Nov 25, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 23,187 2,283 Updated Jan 22, 2025

[WACV 2024] An implementation of MEGANet for polyp segmentation with multi-scale edge-guided attention

Python 67 5 Updated Feb 5, 2024

✨✨Latest Advances on Multimodal Large Language Models

13,687 876 Updated Jan 28, 2025

[ICLR'24] GTA: A Geometry-Aware Attention Mechanism for Multi-view Transformers

Python 121 2 Updated Jun 14, 2024