A list of awesome papers and resources on recommender systems with large language models (LLMs).

1,706 136 Updated Mar 17, 2025

SAM with text prompt

Python 2,060 229 Updated Feb 16, 2025

JAX implementation ViT-VQGAN

Python 82 11 Updated Sep 21, 2022

An unofficial implementation of both ViT-VQGAN and RQ-VAE in PyTorch

Python 298 33 Updated May 23, 2023

PyTorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors

Python 334 19 Updated Aug 9, 2022

An open-source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", a multimodal model that uses just a decoder to generate both text and images

Python 359 18 Updated Dec 15, 2023

Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of DeepMind, in PyTorch

Python 1,235 62 Updated Oct 18, 2022

We want to create a repo to illustrate the usage of transformers in Chinese

Shell 2,753 466 Updated Aug 18, 2024

Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard

Python 526 35 Updated Mar 16, 2025

When do we not need larger vision models?

Python 380 12 Updated Feb 8, 2025

[CVPR2024 Highlight] Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical Images

Python 180 22 Updated Apr 6, 2024

Experiment on combining CLIP with SAM to do open-vocabulary image segmentation.

Jupyter Notebook 362 33 Updated Apr 5, 2023

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".

Python 949 33 Updated Jul 31, 2024

[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Jupyter Notebook 792 55 Updated Jul 30, 2024

Pathology Language and Image Pre-Training (PLIP) is the first vision and language foundation model for Pathology AI (Nature Medicine). PLIP is a large-scale pre-trained model that can be used to ex…

Python 315 30 Updated Sep 20, 2023

A collection of resources on applications of multi-modal learning in medical imaging.

698 65 Updated Feb 27, 2025

Segment Anything in Medical Images

Jupyter Notebook 3,378 469 Updated Oct 10, 2024

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 2,093 145 Updated Feb 16, 2025

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

Python 315 23 Updated Jul 17, 2024

An official implementation of "Incomplete Multimodality-Diffused Emotion Recognition" in PyTorch. (NeurIPS 2023)

Python 47 6 Updated Dec 5, 2023

Uniformaly: Towards Task-Agnostic Unified Anomaly Detection

Python 13 1 Updated Sep 15, 2023

Official implementation of "ADGym: Design Choices for Deep Anomaly Detection", NeurIPS 2023

Python 29 6 Updated Aug 23, 2023
Python 171 17 Updated Jan 31, 2024

Anomaly detection with diffusion models

Python 131 28 Updated Jun 5, 2023
Python 4 Updated Mar 7, 2023

My attempt at reproducing the paper "Deep Autoencoding Gaussian Mixture Model for Unsupervised Anomaly Detection"

Jupyter Notebook 407 107 Updated Dec 24, 2022
Python 186 34 Updated Jun 30, 2024

HVTSurv: Hierarchical Vision Transformer for Patient-level Survival Prediction from Whole Slide Image (AAAI 2023)

Python 27 1 Updated Jul 2, 2023

A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques

Python 8,959 1,399 Updated Mar 24, 2025