Skip to content
View sportzhang's full-sized avatar

Block or report sportzhang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

CV

Computer Vision.
25 repositories

图片向量检索服务,包含Numpy、Faiss、ES、Milvus多种计算引擎

Python 131 32 Updated Jan 5, 2023

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 53,126 16,787 Updated Mar 27, 2025

Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

Python 13,236 2,923 Updated Mar 17, 2025

Video editing with Python

Python 13,205 1,711 Updated Feb 6, 2025

[AAAI 2025] Official codes of "ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models".

Python 741 25 Updated Mar 9, 2025

[CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Python 1,522 155 Updated Dec 2, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,689 432 Updated Aug 7, 2024

Stable Diffusion web UI

Python 150,005 27,954 Updated Mar 4, 2025

Retrieval and Retrieval-augmented LLMs

Python 9,104 655 Updated Mar 20, 2025

😜 表情包视觉数据集,使用glm-4v、step-1v的图像解析能力标注。

118 10 Updated Apr 27, 2024

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Python 2,664 281 Updated Jun 28, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 7,352 567 Updated Mar 20, 2025

A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".

Jupyter Notebook 986 69 Updated Mar 25, 2023

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Jupyter Notebook 9,339 873 Updated Dec 10, 2024

MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation

Python 2,494 181 Updated Mar 5, 2025

Let us control diffusion models!

Python 31,843 2,844 Updated Feb 25, 2024

[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution

Python 1,111 73 Updated Sep 17, 2024

Kolors Team

Python 4,298 323 Updated Nov 13, 2024

Bringing Old Photo Back to Life (CVPR 2020 oral)

Python 15,402 2,042 Updated Oct 26, 2023

TransNet V2: Shot Boundary Detection Neural Network

Python 588 100 Updated Dec 4, 2023

TensorRT Extension for Stable Diffusion Web UI

Python 1,967 157 Updated Jun 14, 2024

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Python 18,377 2,538 Updated Mar 26, 2025

Face recognition using Tensorflow

Python 14,011 4,810 Updated Jul 24, 2023

[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System

Python 3,208 325 Updated Feb 27, 2025

End-to-End Object Detection with Transformers

Python 14,158 2,543 Updated Mar 12, 2024