Stars
Open-Sora: Democratizing Efficient Video Production for All
VideoSys: An easy and efficient system for video generation
Official repo for VGen: a holistic video generation ecosystem built on diffusion models
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Using large AI models to automatically provide commentary and edit videos with a single click.
Your most handy video processing software
[CSUR] A Survey on Video Diffusion Models
[CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation
AIGC-interview/CV-interview/LLMs-interview: a collection of interview questions and answers, together with new ideas, problems, resources, and projects from work and research
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
A text editor for Windows/Linux/Mac that aims to be a home-grown editor for Chinese users, made in China.
[Embodied-AI-Survey-2024] Paper list and projects for Embodied AI
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
YOLOv6: a single-stage object detection framework dedicated to industrial applications.
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
[NeurIPS 2022] Official code for "Focal Modulation Networks"
[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
This is a PyTorch implementation of deep image blending
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Curated tutorials and resources for Large Language Models, AI Painting, and more.
Official code for "A Normalized Gaussian Wasserstein Distance for Tiny Object Detection"
🕶 A curated list of Tiny Object Detection papers and related resources.
USB: Universal-Scale Object Detection Benchmark (BMVC 2022)