Stars
Multimodal Prompting with Missing Modalities for Visual Recognition, CVPR'23
A PyTorch implementation of Faster R-CNN that can be trained on data in the VOC dataset format.
A Clash For Linux backup repository based on Clash Core
Caffe implementation of multiple popular object detection frameworks
Faster R-CNN (Python implementation) -- see https://github.com/ShaoqingRen/faster_rcnn for the official MATLAB version
[AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.
The code for the paper "Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval" (WWW'22, Oral).
Python client for Baidu Yun / Baidu Netdisk (Personal Cloud Storage)
A collection of recent video understanding datasets, under construction!
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
[AAAI 2023] Contrastive Masked Autoencoders for Self-Supervised Video Hashing
The official implementation of Achieving Cross Modal Generalization with Multimodal Unified Representation (NeurIPS '23)
TransferAttack is a PyTorch framework for boosting adversarial transferability in image classification.
Stable Diffusion web UI
New home for my Stardew Valley mod source code
Beihang University (BUAA) course assignment and materials sharing project
[ACM MM 2021] Visible Watermark Removal via Self-calibrated Localization and Background Refinement
AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or pictures, producing output files at the original resolution. Runs locally, with no third-party API required.
Convolutional neural network model for video classification trained on the Kinetics dataset.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
(TIP'2023) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image