Lists (2)
Sort Name ascending (A-Z)
Stars
A latent text-to-image diffusion model
Google Research
Instruct-tune LLaMA on consumer hardware
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
LAVIS - A One-stop Library for Language-Vision Intelligence
PyTorch code and models for the DINOv2 self-supervised learning method.
Best Practices, code samples, and documentation for Computer Vision.
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
A suite of image and video neural tokenizers
Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion
OmniXAI: A Library for eXplainable AI
Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)
Implementation of "SVDiff: Compact Parameter Space for Diffusion Fine-Tuning"
DiffSeg is an unsupervised zero-shot segmentation method using attention information from a stable-diffusion model. This repo implements the main DiffSeg algorithm and additionally includes an expe…
Official Implementation of paper "A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence"
This is the repository for the Photorealistic Unreal Graphics (PUG) datasets for representation learning.
B-cos Networks: Alignment is All we Need for Interpretability
Official repository for R2Former: Unified Retrieval and Reranking Transformer for Place Recognition
A simple q learning game played using openAI gym
A collection of IPython notebooks covering various topics.
A basic adaptation of email spam filter using naive bayes and svm approach
Notebook for DL for foreign currencies
An End to End system which combines speech recognition,summarization and sentiment analysis using a variety of machine and deep learning approaches