-
GOT-OCR2.0 Public
Forked from Ucas-HaoranWei/GOT-OCR2.0Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Python UpdatedSep 13, 2024 -
Awesome-Image-Quality-Assessment Public
Forked from chaofengc/Awesome-Image-Quality-AssessmentA comprehensive collection of IQA papers
TeX MIT License UpdatedMar 11, 2024 -
ImageBind Public
Forked from facebookresearch/ImageBindImageBind One Embedding Space to Bind Them All
Python Other UpdatedFeb 21, 2024 -
polite_flamingo Public
Forked from ChenDelong1999/polite-flamingoVisual Instruction Tuning with Polite Flamingo🦩
Python UpdatedJul 10, 2023 -
baichuan-7B Public
Forked from baichuan-inc/Baichuan-7BA large-scale 7B pretraining language model developed by Baichuan
Python Apache License 2.0 UpdatedJun 15, 2023 -
DynMM Public
Forked from zihuixue/DynMMCode for the paper 'Dynamic Multimodal Fusion'
Python UpdatedApr 6, 2023 -
cross_modal_adaptation Public
Forked from linzhiqiu/cross_modal_adaptationCross-modal few-shot adaptation with CLIP
Python MIT License UpdatedMar 9, 2023 -
ffmpeg_beginner Public
Forked from JackeyLea/ffmpeg_beginner食铁兽(feater.top)ffmpeg4入门系列教程代码
C++ MIT License UpdatedJul 16, 2022 -
Bert-Chinese-Text-Classification-Pytorch Public
Forked from 649453932/Bert-Chinese-Text-Classification-Pytorch使用Bert,ERNIE,进行中文文本分类
-
VRT Public
Forked from JingyunLiang/VRTVRT: A Video Restoration Transformer (official repository)
Python Other UpdatedFeb 20, 2022 -
RepVGG Public
Forked from DingXiaoH/RepVGGRepVGG: Making VGG-style ConvNets Great Again
Python MIT License UpdatedAug 22, 2021 -
Video-Swin-Transformer Public
Forked from SwinTransformer/Video-Swin-TransformerThis is an official implementation for "Video Swin Transformers".
Python Apache License 2.0 UpdatedJul 28, 2021 -
ASL Public
Forked from Alibaba-MIIL/ASLOfficial Pytorch Implementation of: "Asymmetric Loss For Multi-Label Classification"(2020) paper
Python MIT License UpdatedNov 27, 2020 -
VILLA Public
Forked from zhegan27/VILLAResearch Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": UNITER adversarial training part
Python MIT License UpdatedOct 20, 2020 -
HERO Public
Forked from linjieli222/HEROResearch code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
Python MIT License UpdatedOct 20, 2020 -
vilbert-multi-task Public
Forked from facebookresearch/vilbert-multi-taskMulti Task Vision and Language
Jupyter Notebook MIT License UpdatedOct 5, 2020 -
VL-BERT Public
Forked from jackroos/VL-BERTCode for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".
Jupyter Notebook MIT License UpdatedSep 23, 2020 -
recurrent-transformer Public
Forked from jayleicn/recurrent-transformer[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
-
VLP Public
Forked from LuoweiZhou/VLPVision-Language Pre-training for Image Captioning and Question Answering
Python Apache License 2.0 UpdatedSep 15, 2020 -
Silent-Face-Anti-Spoofing Public
Forked from minivision-ai/Silent-Face-Anti-Spoofing静默活体检测(Silent-Face-Anti-Spoofing)
Python Apache License 2.0 UpdatedJul 15, 2020 -
FaceImageQuality Public
Forked from pterhoer/FaceImageQualityCode and information for face image quality assessment with SER-FIQ
Python UpdatedJul 3, 2020 -
M3P Public
Forked from microsoft/M3PMultitask Multilingual Multimodal Pre-training
Python MIT License UpdatedJun 29, 2020 -
CVPR2020-Code Public
Forked from amusi/CVPR2025-Papers-with-CodeCVPR 2020 论文开源项目合集
UpdatedJun 15, 2020 -
MasterProject Public
Forked from MDSKUL/MasterProjectCode voor mijn Master project omtrent VideoBERT
Python UpdatedJun 5, 2020 -
Cross_Modality_Relevance Public
Forked from HLR/Cross_Modality_RelevanceThe source code of ACL 2020 paper: "Cross-Modality Relevance for Reasoning on Language and Vision"
Python UpdatedMay 14, 2020 -
Pytorch_Retinaface Public
Forked from biubug6/Pytorch_RetinafaceRetinaface get 80.99% in widerface hard val using mobilenet0.25.
Python MIT License UpdatedApr 20, 2020 -
ATSS Public
Forked from sfzhang15/ATSSBridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection, CVPR, Oral, 2020
Python Other UpdatedMar 15, 2020 -
light-LPR Public
Forked from lqian/light-LPRLight-LPR是一个瞄准可以在嵌入式设备、手机端和普通的x86平台上运行的车牌识别开源项目,旨在支持各种场景的车牌识别,车牌字符识别准确率超99.95%,综合识别准确率超过99%,支持目前国内所有的车牌识别,觉得好用的一定要加星哦。
C UpdatedJan 18, 2020 -
EasyPR Public
Forked from liuruoze/EasyPRAn easy, flexible, and accurate plate recognition project for Chinese licenses in unconstrained situations.
C++ Apache License 2.0 UpdatedDec 11, 2019 -
Face-Detector-1MB-with-landmark Public
Forked from biubug6/Face-Detector-1MB-with-landmark1M人脸检测模型(含关键点)
Python MIT License UpdatedDec 11, 2019