Skip to content
View ederev's full-sized avatar

Block or report ederev

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Python sample codes and textbook for robotics algorithms.

Python 24,285 6,670 Updated Feb 21, 2025

Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24 Oral), PPT (CVPR'24), OA-CNNs (CVPR'24), MSC (CVPR'23)

Python 1,859 204 Updated Jan 16, 2025

This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…

Jupyter Notebook 6,108 843 Updated Feb 10, 2025

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

Python 1,096 86 Updated Oct 21, 2024

DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.

Python 3,009 375 Updated May 25, 2024

Official Implementation of CVPR24 highligt paper: Matching Anything by Segmenting Anything

Python 1,209 80 Updated Nov 7, 2024

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 7,453 752 Updated Aug 12, 2024

[ECCV 2024] Official implementation of "LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction"

Python 55 4 Updated Jul 28, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 14,183 1,450 Updated Dec 25, 2024

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,758 167 Updated Jan 22, 2025

Famous Vision Language Models and Their Architectures

Markdown 651 34 Updated Feb 23, 2025

[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception

Python 545 40 Updated May 8, 2024

Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future

146 7 Updated Dec 11, 2024

SSSegmentation: An Open Source Supervised Semantic Segmentation Toolbox Based on PyTorch.

Python 827 106 Updated Jan 27, 2025

(TPAMI 2024) A Survey on Open Vocabulary Learning

887 51 Updated Dec 10, 2024

Awesome-LLM: a curated list of Large Language Model

21,632 1,773 Updated Feb 2, 2025

Mixture-of-Experts for Large Vision-Language Models

Python 2,089 130 Updated Dec 3, 2024

[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Python 2,395 159 Updated Oct 21, 2024

[ECCV 2024] Tokenize Anything via Prompting

Jupyter Notebook 559 24 Updated Dec 11, 2024

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Python 5,193 160 Updated Feb 22, 2025

An open source implementation of CLIP.

Python 11,046 1,045 Updated Feb 23, 2025

Efficient vision foundation models for high-resolution generation and perception.

Python 2,655 213 Updated Jan 24, 2025

A simple python implemented frame-by-frame visual odometry with SuperPoint feature detector and SuperGlue feature matcher.

Python 340 41 Updated Dec 25, 2020

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,552 2,368 Updated Aug 12, 2024

Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).

Python 2,195 140 Updated Jun 7, 2023

This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).

888 58 Updated Feb 23, 2025

Strong and Open Vision Language Assistant for Mobile Devices

Python 1,144 75 Updated Apr 15, 2024

Open-source and strong foundation image recognition models.

Jupyter Notebook 3,086 288 Updated Feb 18, 2025

General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX

1,761 97 Updated Nov 15, 2023

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 5,075 486 Updated Nov 5, 2024
Next