Stars
Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
OpenMMLab Detection Toolbox and Benchmark
This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Effortless data labeling with AI support from Segment Anything and other awesome models.
.NET is a cross-platform runtime for cloud, mobile, desktop, and IoT apps.
CodeGeeX4-ALL-9B, a versatile model for all AI software development scenarios, including code completion, code interpreter, web search, function calling, repository-level Q&A and much more.
An annotated implementation of the Transformer paper.
Source code for Twitter's Recommendation Algorithm
UnityChanToonShaderVer2 Project / v.2.0.9 Release
Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Grounding"
AI PC starter app for doing AI image creation, image stylizing, and chatbot on a PC powered by an Intel® Arc™ GPU.
A curated list of awesome resources for design and implement RESTful API's.
A collaborative list of public APIs for developers
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Single Image to 3D using Cross-Domain Diffusion for 3D Generation
A very simple toon lit shader example, for you to learn writing custom lit shader in Unity URP
Official pytorch repository for "QD-DETR : Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 Paper)
Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"