Skip to content
View stupidZZ's full-sized avatar

Organizations

@msracver

Block or report stupidZZ

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"

Python 108 4 Updated Jan 31, 2025

[CVPR 2024] Code release for TransNeXt model

Python 495 20 Updated Jun 13, 2024

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 775 42 Updated Mar 20, 2025

TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.

Python 2,300 192 Updated Mar 19, 2025

MambaOut: Do We Really Need Mamba for Vision? (CVPR 2025)

Python 2,262 40 Updated Mar 9, 2025

[ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxing Yang, Nanyi Fei, Yuqi Huo, Zhiwu Lu, Ping Luo, Mingyu Ding.

Jupyter Notebook 230 14 Updated May 5, 2024
Python 358 15 Updated Oct 21, 2024

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Python 3,003 600 Updated Jul 19, 2024

Open reproduction of MUSE for fast text2image generation.

Python 347 29 Updated Jun 1, 2024

Open-vocabulary Semantic Segmentation

Python 337 33 Updated Oct 16, 2024
Python 56 6 Updated Jan 17, 2022

This is an official implementation of our CVPR 2023 paper "Human Pose as Compositional Tokens" (https://arxiv.org/pdf/2303.11638.pdf)

Python 360 25 Updated Jun 12, 2023

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 11,028 1,057 Updated Mar 24, 2025

Code release for "Omni3D A Large Benchmark and Model for 3D Object Detection in the Wild"

Python 759 73 Updated Apr 7, 2024

This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".

Python 964 93 Updated Sep 29, 2022

Semi-Supervised Learning, Object Detection, ICCV2021

Python 913 122 Updated Jun 9, 2024

EsViT: Efficient self-supervised Vision Transformers

Python 410 44 Updated Aug 28, 2023

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.

Python 1,854 381 Updated Apr 9, 2023

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Semantic Segmentation.

Python 1,208 225 Updated Aug 24, 2022

This is an official implementation for "Self-Supervised Learning with Swin Transformers".

Python 647 70 Updated May 13, 2021

Group-Free 3D Object Detection via Transformers

Python 247 33 Updated Jun 2, 2021

Spatially Adaptive Inference with Stochastic Feature Sampling and Interpolation, ECCV 2020 Oral

Python 71 6 Updated Aug 26, 2020

在我们开发项目的时候,无论项目规模大小,在所难免会写一些工具型函数来解决一些问题,随着项目开发和维护的时间越来越长,这些工具型函数会越来越多,同时还会穿插在各个项目的各模块或者文件当中,使得项目变的越来越臃肿,也不方便复用和维护。这时我们就会提取出一个类似的工具库或者基础库作为项目基础依赖,在项目中重复利用起来。 为了这样的工具库或类库更易扩展、易维护、易复用和更加稳定,我们就需要更好的去管…

JavaScript 40 8 Updated Mar 2, 2023

Parallel Faster R-CNN implementation with MXNet.

Python 670 287 Updated Aug 25, 2018
Lua 1,347 955 Updated Jan 12, 2021