Stars
《代码随想录》 LeetCode problem-solving guide: a recommended order for 200 classic problems, 600,000+ words of detailed illustrated explanations, video breakdowns of the hard parts, 50+ mind maps, with solutions in C++, Java, Python, Go, JavaScript, and other languages, so algorithm study is no longer confusing! 🔥🔥 Take a look, you'll wish you had found it sooner! 🚀
A PyTorch implementation of EfficientNet
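A minimal usage sketch, assuming this refers to the widely used `efficientnet_pytorch` package; the package name and model variant below are assumptions, not stated above.

```python
# Sketch assuming the `efficientnet_pytorch` package; the model variant is an assumption.
import torch
from efficientnet_pytorch import EfficientNet

# Load an ImageNet-pretrained EfficientNet-B0 and run a single forward pass.
model = EfficientNet.from_pretrained('efficientnet-b0')
model.eval()

dummy = torch.randn(1, 3, 224, 224)   # one RGB image at the B0 input resolution
with torch.no_grad():
    logits = model(dummy)             # shape (1, 1000): ImageNet class scores
```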
Code of paper 'Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training'
Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text matching/retrieval models.
The official PyTorch implementation of “Mamba-YOLO: SSMs-Based YOLO for Object Detection”
This repository contains the code for the paper 'DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark'.
MambaOut: Do We Really Need Mamba for Vision?
VMamba: Visual State Space Models; the code is based on Mamba
PyTorch implementation of adversarial attacks [torchattacks]
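A short, self-contained sketch of how torchattacks is typically driven; the toy classifier and random batch below are placeholders standing in for a real model and data.

```python
# Sketch of crafting adversarial examples with torchattacks; model and data are toy placeholders.
import torch
import torch.nn as nn
import torchattacks

model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 10)).eval()  # stand-in classifier
images = torch.rand(4, 3, 32, 32)        # batch of inputs scaled to [0, 1]
labels = torch.randint(0, 10, (4,))      # ground-truth class indices

# PGD attack: L-inf budget eps, step size alpha, number of iterations steps.
atk = torchattacks.PGD(model, eps=8/255, alpha=2/255, steps=10)
adv_images = atk(images, labels)         # adversarially perturbed copies of `images`
```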
The code of the paper "NExT-Chat: An LMM for Chat, Detection and Segmentation".
This repository includes two datasets used in the downstream tasks for evaluating UIBert: App Similar Element Retrieval data and Visual Item Selection (VIS) data. Both datasets are stored as TFRecords.
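A generic sketch of inspecting one of these TFRecord files with TensorFlow; the file name is a placeholder, and the feature keys are whatever the dataset actually defines (not listed above).

```python
import tensorflow as tf

# Placeholder path: point this at one of the downloaded TFRecord shards.
dataset = tf.data.TFRecordDataset(["uibert_downstream_task.tfrecord"])

# Parse one record as a generic tf.train.Example to discover its feature keys,
# since the exact schema is defined by the dataset files themselves.
for raw_record in dataset.take(1):
    example = tf.train.Example()
    example.ParseFromString(raw_record.numpy())
    print(sorted(example.features.feature.keys()))
```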
Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
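A condensed sketch of SAM's prompted-inference flow; the checkpoint path, image path, and point prompt below are placeholders.

```python
# Sketch of SAM point-prompted inference; paths and the prompt coordinate are placeholders.
import numpy as np
import cv2
from segment_anything import sam_model_registry, SamPredictor

# Load a ViT-H checkpoint downloaded via the links in the repository.
sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_checkpoint.pth")
predictor = SamPredictor(sam)

# Embed the image once, then prompt with a single foreground point.
image = cv2.cvtColor(cv2.imread("example.jpg"), cv2.COLOR_BGR2RGB)
predictor.set_image(image)
masks, scores, logits = predictor.predict(
    point_coords=np.array([[500, 375]]),   # (x, y) pixel coordinate of the prompt
    point_labels=np.array([1]),            # 1 marks a foreground point
    multimask_output=True,                 # return several candidate masks
)
```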
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
OpenMMLab Pre-training Toolbox and Benchmark
A bilingual (Chinese and English) multimodal conversational language model
[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval
YOLOX is a high-performance anchor-free YOLO, exceeding YOLOv3 through YOLOv5, with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO deployment support. Documentation: https://yolox.readthedocs.io/
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
[ICCV 2023] Official implementation of the paper: "DIRE for Diffusion-Generated Image Detection"
Grounded Segment Anything: From Objects to Parts
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
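A brief sketch of text-prompted detection in the spirit of the Grounding DINO demo; the config path, checkpoint path, image, and caption are placeholders, and the helper names assume the repo's `groundingdino.util.inference` module.

```python
# Sketch of open-set detection with Grounding DINO; all paths and the caption are placeholders.
from groundingdino.util.inference import load_model, load_image, predict

model = load_model(
    "groundingdino/config/GroundingDINO_SwinT_OGC.py",   # model config shipped with the repo
    "weights/groundingdino_swint_ogc.pth",                # downloaded checkpoint
)
image_source, image = load_image("example.jpg")

# Detect whatever the free-form caption names, keeping confident box/phrase matches.
boxes, logits, phrases = predict(
    model=model,
    image=image,
    caption="a dog . a chair .",
    box_threshold=0.35,
    text_threshold=0.25,
)
```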