Skip to content
View Kratos-Wen's full-sized avatar
  • Karlsruhe Institute of Technology (KIT)

Highlights

  • Pro

Block or report Kratos-Wen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official Implementation for ICCV'23 paper Coarse-to-Fine Amodal Segmentation with Shape Prior (C2F-Seg).

Python 49 1 Updated Jan 7, 2024

Constraint Satisfaction Visual Grounding

Python 9 Updated Nov 29, 2024

[CVPR 2024] Official implementation of the paper "Visual In-context Learning"

Python 430 19 Updated Apr 8, 2024

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

Python 559 38 Updated Jan 7, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 4,298 264 Updated Jan 21, 2025

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 907 92 Updated Jan 16, 2025

Collection of 2D Datasets for Anatomy Segmentation

Jupyter Notebook 2 Updated Nov 26, 2024

Python scripts to download Assembly101 from Google Drive

Python 36 3 Updated Oct 10, 2024
Python 13 1 Updated Apr 1, 2023

MASS: Multi-Attentional Semantic Segmentation ofLiDAR Data for Dense Top-View Understanding

Python 24 2 Updated Aug 18, 2022
Python 29 3 Updated Mar 7, 2023
Python 4 Updated Oct 3, 2024

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 7,240 735 Updated Aug 12, 2024

Code for OrthDNNs: Orthogonal Deep Neural Networks

Python 13 1 Updated Jan 9, 2020

Map (deep learning) model weights between different model implementations.

Python 16 1 Updated Jan 20, 2024

A framework for annotating 3D meshes using the predictions of a 2D semantic segmentation model.

Python 54 10 Updated Feb 18, 2022

Universal Tensor Operations in Einstein-Inspired Notation for Python.

Python 342 10 Updated Nov 29, 2024

Statewide Visual Geolocalization in the Wild (ECCV 2024)

Python 61 4 Updated Dec 2, 2024

The implementation of NoiseEraSAR in "Skeleton-Based Human Action Recognition with Noisy Labels"

Python 7 Updated Aug 5, 2024

Github repo for referring atomic video action recognition

18 Updated Oct 2, 2024

[ICCV 2023] RLIPv2: Fast Scaling of Relational Language-Image Pre-training

Python 122 3 Updated May 28, 2024

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Python 3,506 414 Updated Jan 21, 2025
Python 28 4 Updated Jun 2, 2023

VMamba: Visual State Space Models,code is based on mamba

Python 2,365 155 Updated Oct 28, 2024

RoDLA: Benchmarking the Robustness of Document Layout Analysis Models

Python 30 2 Updated Jan 23, 2025
Python 15 4 Updated May 14, 2024

A curated paper list of awesome skeleton-based action recognition.

469 65 Updated Jan 9, 2025

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,177 212 Updated Nov 22, 2024

Implementation of Vision Mamba from the paper: "Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model" It's 2.8x faster than DeiT and saves 86.8% GPU memory wh…

Python 422 21 Updated Jan 20, 2025
Next