Stars
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.
An open source implementation of CLIP.
ImageBind: One Embedding Space to Bind Them All
A curated list of foundation models for vision and language tasks
Collection of AWESOME vision-language models for vision tasks
🦜🔗 Build context-aware reasoning applications
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
CLIP inference in plain C/C++ with no extra dependencies
A deep homography estimation network in PyTorch
An unofficial implementation of the paper Deep Image Homography Estimation
Light-weight library to perform homography estimation with RANSAC from point, line or point-line correspondences
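The point-correspondence case these homography libraries handle can be sketched with the classic Direct Linear Transform (DLT); this is a minimal numpy illustration, not code from any of the repos above, and it omits the RANSAC outlier loop that a real library wraps around it:

```python
import numpy as np

def estimate_homography(src, dst):
    """DLT: estimate the 3x3 homography H mapping src -> dst
    from at least 4 point correspondences."""
    A = []
    for (x, y), (u, v) in zip(src, dst):
        A.append([-x, -y, -1, 0, 0, 0, u * x, u * y, u])
        A.append([0, 0, 0, -x, -y, -1, v * x, v * y, v])
    # H is the null vector of A: the right singular vector
    # associated with the smallest singular value.
    _, _, Vt = np.linalg.svd(np.asarray(A, dtype=float))
    H = Vt[-1].reshape(3, 3)
    return H / H[2, 2]  # fix the scale ambiguity so H[2,2] == 1

def apply_homography(H, pts):
    """Map 2D points through H using homogeneous coordinates."""
    pts_h = np.hstack([pts, np.ones((len(pts), 1))])
    mapped = pts_h @ H.T
    return mapped[:, :2] / mapped[:, 2:]

# Recover a known homography from 4 exact correspondences.
H_true = np.array([[1.2, 0.1, 5.0],
                   [-0.05, 0.9, -3.0],
                   [5e-4, 2e-4, 1.0]])
src = np.array([[0, 0], [100, 0], [100, 100], [0, 100]], dtype=float)
dst = apply_homography(H_true, src)
H_est = estimate_homography(src, dst)
```

With noisy or mismatched correspondences, a RANSAC loop repeatedly runs this solver on random 4-point subsets and keeps the H with the most inliers.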
A PyTorch implementation of scene change detection
GPT4V-level open-source multi-modal model based on Llama3-8B
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
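The "most relevant text snippet" step CLIP performs at inference can be sketched with toy embeddings: cosine similarity between L2-normalized image and text vectors, scaled by a temperature and softmaxed. The embeddings below are placeholders standing in for encoder outputs (real CLIP produces them with a ViT/ResNet image tower and a Transformer text tower):

```python
import numpy as np

def clip_style_scores(image_emb, text_embs, temperature=0.07):
    """Score captions against one image: cosine similarity of
    L2-normalized embeddings, temperature-scaled, softmaxed."""
    img = image_emb / np.linalg.norm(image_emb)
    txt = text_embs / np.linalg.norm(text_embs, axis=1, keepdims=True)
    logits = txt @ img / temperature
    exp = np.exp(logits - logits.max())  # stable softmax
    return exp / exp.sum()

# Toy unit-ish embeddings; in practice these come from the encoders.
image_emb = np.array([0.9, 0.1, 0.0])
text_embs = np.array([
    [1.0, 0.0, 0.0],   # e.g. "a photo of a dog"
    [0.0, 1.0, 0.0],   # e.g. "a photo of a cat"
    [0.0, 0.0, 1.0],   # e.g. "a diagram"
])
probs = clip_style_scores(image_emb, text_embs)
best = int(np.argmax(probs))
```

The same similarity matrix, computed over a batch in both directions, is what CLIP's contrastive training objective operates on.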
A multimodal Chinese-English bilingual conversational language model
A state-of-the-art open visual language model (multimodal pre-trained model)
A collection of datasets, papers, and code on vehicle re-identification
An object tracking project with YOLOv8 and ByteTrack, accelerated with C++ and TensorRT.
Supports DeepSORT and ByteTrack multi-object tracking (MOT) using YOLOv5, in C++
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
4DS, BetterDepth, Buffer Anytime, ChronoDepth, Depth Any Video, Depth Anything, Depth Pro, DepthCrafter, DINOv2, FutureDepth, GenPercept, GeoWizard, LightedDepth, Marigold, Metric3D, MiDaS, MoGe, MonST3R, NeWCRFs, NV…