  • Tianjin University


[Arxiv 2025: MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation]

Python 26 1 Updated Apr 7, 2025

From Handcrafted to Deep Features for Pedestrian Detection: A Survey (TPAMI 2021)

173 28 Updated Aug 18, 2023

Official PyTorch Implementation of Unified Video Action Model (RSS 2025)

Python 164 7 Updated Mar 20, 2025

[CVPR 2025] Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation

Python 135 8 Updated Mar 6, 2025

[CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"

Jupyter Notebook 70 4 Updated Sep 23, 2024

[CVPR 2025] Official implementation of the paper "Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practices" (by Junyan Lin).

Python 14 2 Updated Mar 7, 2025
Python 61 2 Updated Apr 6, 2025

The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 1,927 121 Updated Mar 28, 2025
Python 69 3 Updated Apr 14, 2025

DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping

Python 200 14 Updated Apr 1, 2025

Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence

Python 393 16 Updated Apr 17, 2025

Conditional Convolutions for Instance Segmentation; achieves 37.1 mAP on COCO val

Python 141 15 Updated Jan 23, 2021

[ICLR 2025] Glad: A Streaming Scene Generator for Autonomous Driving

4 Updated Feb 9, 2025

A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation

Python 235 19 Updated Feb 11, 2025

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 2,581 324 Updated Mar 23, 2025

FCOS: Fully Convolutional One-Stage Object Detection (ICCV'19)

Python 3,304 628 Updated Dec 9, 2023

Open-vocabulary Semantic Segmentation

Python 171 16 Updated Mar 28, 2023

[NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning

Python 187 11 Updated Dec 3, 2024

A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!

Python 1,041 117 Updated Jan 30, 2025

[ICLR 2024 Spotlight] "Negative Label Guided OOD Detection with Pretrained Vision-Language Models"

Python 20 1 Updated Oct 23, 2024

ECCV2024, LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models

17 1 Updated Aug 9, 2024

The official implementation of the CVPR'24 paper "Learning Transferable Negative Prompts for Out-of-Distribution Detection"

Python 53 6 Updated Apr 8, 2024

ICCV 2023: CLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say No

Python 135 13 Updated Dec 2, 2023

Official Implementation of "Test-Time Optimization for Domain Adaptive Open Vocabulary Segmentation"

Python 10 Updated Mar 11, 2025

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

Jupyter Notebook 7,919 507 Updated Apr 2, 2025
Python 175 13 Updated Jan 2, 2025

Code base of the BEVDet series.

Python 1,558 272 Updated Jul 4, 2024

AcadHomepage: A Modern and Responsive Academic Personal Homepage

SCSS 1,878 3,687 Updated Apr 11, 2025

[CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"

Python 84 1 Updated Feb 27, 2025