Stars
Code for TensorFlow Machine Learning Cookbook
Official TensorFlow and PyTorch implementation of "Generalized Sum Pooling for Metric Learning"
Approaching (Almost) Any Machine Learning Problem
Open-source, deep-learning-based unsupervised image retrieval toolbox built on PyTorch 🔥
Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…
Boosting vision transformers for image retrieval with the proposed Deep Token Pooling (DToP) design
📚 A collection of papers about Referring Image Segmentation.
Official PyTorch implementation of "Multi-modal Queried Object Detection in the Wild" (accepted by NeurIPS 2023)
An open source implementation of CLIP.
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | A Chinese-English bilingual multimodal large model series based on the CPM foundation models
awesome grounding: A curated list of research papers in visual grounding
⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python CLI, WeChat Applet) / If you find it useful, please star this project, thanks~
[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"
Scenic: A JAX Library for Computer Vision Research and Beyond
GAMA: Generative Adversarial Multi-Object Scene Attacks (NeurIPS'22)
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
Deep functional residue identification
Accurate ADMET Prediction with XGBoost