Stars
Contrastive Code Representation Learning: functionality-based JavaScript embeddings through self-supervised learning
MixGen: A New Multi-Modal Data Augmentation
MixGen: A New Multi-Modal Data Augmentation
Baseline model of EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge
[ECCV2022] Contrastive Vision-Language Pre-training with Limited Resources
A library of transformer models for computer vision and multi-modality research
Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning