Kratos-Wen

Follow

Di Wen Kratos-Wen

Follow

4 followers · 10 following

Karlsruhe Institute of Technology (KIT)

Highlights

Pro

Lists (1)

Sort

🔮 Future ideas

Stars

amazon-science / c2f-seg

Official Implementation for ICCV'23 paper Coarse-to-Fine Amodal Segmentation with Shape Prior (C2F-Seg).

Python 49 1 Updated Jan 7, 2024

sunsleaf / CSVG

Constraint Satisfaction Visual Grounding

Python 9 Updated Nov 29, 2024

UX-Decoder / DINOv

[CVPR 2024] Official implementation of the paper "Visual In-context Learning"

Python 430 19 Updated Apr 8, 2024

penghao-wu / vstar

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

Python 559 38 Updated Jan 7, 2024

QwenLM / Qwen2-VL

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 4,298 264 Updated Jan 21, 2025

deepseek-ai / DeepSeek-VL2

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 907 92 Updated Jan 16, 2025

JieHu1996 / DeformableMamba

12 Updated Nov 26, 2024

ConstantinSeibold / 2DAnatomyDatasets

Collection of 2D Datasets for Anatomy Segmentation

Jupyter Notebook 2 Updated Nov 26, 2024

assembly-101 / assembly101-download-scripts

Python scripts to download Assembly101 from Google Drive

Python 36 3 Updated Oct 10, 2024

KPeng9510 / Trans4SOAR

Python 13 1 Updated Apr 1, 2023

KPeng9510 / MASS

MASS: Multi-Attentional Semantic Segmentation ofLiDAR Data for Dense Top-View Understanding

Python 24 2 Updated Aug 18, 2022

KPeng9510 / MuscleMap

Python 29 3 Updated Mar 7, 2023

KPeng9510 / EBiL-HaDS

Python 4 Updated Oct 3, 2024

IDEA-Research / GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 7,240 735 Updated Aug 12, 2024

Gorilla-Lab-SCUT / OrthDNNs

Code for OrthDNNs: Orthogonal Deep Neural Networks

Python 13 1 Updated Jan 9, 2020

fferflo / weightbridge

Map (deep learning) model weights between different model implementations.

Python 16 1 Updated Jan 20, 2024

fferflo / semantic-meshes

A framework for annotating 3D meshes using the predictions of a 2D semantic segmentation model.

Python 54 10 Updated Feb 18, 2022

fferflo / einx

Universal Tensor Operations in Einstein-Inspired Notation for Python.

Python 342 10 Updated Nov 29, 2024

fferflo / statewide-visual-geolocalization

Statewide Visual Geolocalization in the Wild (ECCV 2024)

Python 61 4 Updated Dec 2, 2024

xuyizdby / NoiseEraSAR

The implementation of NoiseEraSAR in "Skeleton-Based Human Action Recognition with Noisy Labels"

Python 7 Updated Aug 5, 2024

KPeng9510 / RAVAR

Github repo for referring atomic video action recognition

18 Updated Oct 2, 2024

JacobYuan7 / RLIPv2

[ICCV 2023] RLIPv2: Fast Scaling of Relational Language-Image Pre-training

Python 122 3 Updated May 28, 2024

Breakthrough / PySceneDetect

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Python 3,506 414 Updated Jan 21, 2025

md-mohaiminul / TranS4mer

Python 28 4 Updated Jun 2, 2023

MzeroMiko / VMamba

VMamba: Visual State Space Models，code is based on mamba

Python 2,365 155 Updated Oct 28, 2024

yufanchen96 / RoDLA

RoDLA: Benchmarking the Robustness of Document Layout Analysis Models

Python 30 2 Updated Jan 23, 2025

KPeng9510 / OS-SAR

Python 15 4 Updated May 14, 2024

firework8 / Awesome-Skeleton-based-Action-Recognition

A curated paper list of awesome skeleton-based action recognition.

469 65 Updated Jan 9, 2025

hustvl / Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,177 212 Updated Nov 22, 2024

kyegomez / VisionMamba

Implementation of Vision Mamba from the paper: "Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model" It's 2.8x faster than DeiT and saves 86.8% GPU memory wh…

Python 422 21 Updated Jan 20, 2025