Stars
This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.
A comprehensive video analysis tool that combines computer vision, audio transcription, and natural language processing to generate detailed descriptions of video content. This tool extracts key fr…
Utilities intended for use with Llama models.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
VILA - a multi-image visual language model with training, inference, and evaluation recipes, deployable from cloud to edge (Jetson Orin and laptops)
A large scale video database for violence detection, which has 2,000 video clips containing violent or non-violent behaviours.
Original reference implementation of "GES: Generalized Exponential Splatting for Efficient Radiance Field Rendering" [CVPR 2024]
Aligning pretrained language models with instruction data generated by themselves.
Official repository for Splatt3R: Zero-shot Gaussian Splatting from Uncalibrated Image Pairs
Parallel Inversion of Neural Radiance Fields for Robust Pose Estimation (ICRA 2023)
Simple, unified interface to multiple Generative AI providers
Open standard for machine learning interoperability
Official implementation code of the paper "AnyText: Multilingual Visual Text Generation And Editing"
RetinaFace achieves 80.99% on the WIDER FACE hard validation set using MobileNet0.25.
Pretrained PyTorch face detection (MTCNN) and facial recognition (InceptionResnet) models
RetinaFace: Deep Face Detection Library for Python
Python Framework for Saliency Modeling and Evaluation
Code for CVPR 2019 paper. BASNet: Boundary-Aware Salient Object Detection
RGB-D Salient Object Detection: A Survey
Graph Neural Network Library for PyTorch
Graph Transformer Architecture. Source code for "A Generalization of Transformer Networks to Graphs", DLG-AAAI'21.
Python package built to ease deep learning on graphs, on top of existing DL frameworks.
Code that'll help you kickstart a personal website that showcases your work as a software developer.
Torchreid: Deep learning person re-identification in PyTorch.