Skip to content
View yanweifu's full-sized avatar

Block or report yanweifu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This package contains the original 2012 AlexNet code.

Cuda 1,638 185 Updated Mar 12, 2025

[ICRA 2024]: Train your parkour robot in less than 20 hours.

Python 693 118 Updated Nov 28, 2023

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention

Python 2,396 175 Updated Mar 18, 2025

Fast and memory-efficient exact attention

Python 16,491 1,561 Updated Mar 22, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 7,328 565 Updated Mar 20, 2025

[T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection

Python 35 1 Updated Aug 29, 2023

[ICCV 2023] "TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition" (Official Implementation)

Python 812 104 Updated Mar 6, 2025

Refine high-quality datasets and visual AI models

Python 9,303 608 Updated Mar 24, 2025

Oracle Character Recognition Dataset - Oracle-50K

12 Updated May 5, 2022

[ICCV2023] GO-SLAM: Global Optimization for Consistent 3D Instant Reconstruction

Python 397 36 Updated Feb 8, 2024

[CoRL 2023 Oral] GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields

Python 130 11 Updated Dec 28, 2023

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python 31,559 7,606 Updated Jan 14, 2025

Repository to train and evaluate RoboAgent

Python 332 26 Updated Apr 2, 2024

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,897 3,448 Updated May 18, 2024

Code for "PourIt!: Weakly-supervised Liquid Perception from a Single Image for Visual Closed-Loop Robotic Pouring" ICCV2023

Python 28 5 Updated Dec 6, 2023

✨✨Latest Advances on Multimodal Large Language Models

14,427 927 Updated Mar 21, 2025
Python 106 6 Updated Feb 20, 2024

This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!

Jupyter Notebook 5,083 524 Updated Nov 20, 2024

CLIP+MLP Aesthetic Score Predictor

Python 1,020 95 Updated Jul 1, 2024

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Python 13,432 3,175 Updated Mar 24, 2025

Code for "SAR-Net: Shape Alignment and Recovery Network for Category-level 6D Object Pose and Size Estimation" CVPR2022

Python 73 7 Updated Feb 6, 2024

Split screen video comparison tool using FFmpeg and SDL2

C++ 1,178 47 Updated Feb 23, 2025

ZITS++: Image Inpainting by Improving the Incremental Transformer on Structural Priors (TPAMI2023)

Python 70 7 Updated Dec 17, 2023

An English-language shell for any OS, powered by LLMs

Python 2,183 190 Updated Dec 2, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 49,402 5,834 Updated Sep 18, 2024

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Python 3,894 319 Updated Jun 12, 2024

List of Camera Calibration Tools + Patterns

171 23 Updated Dec 28, 2023

Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images. In ECCV2018.

Python 1,702 300 Updated Dec 15, 2021
Next