-
Facebook AI
- California
- http://lichengunc.github.io/
Stars
TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability, enabling it to cover most usage scenarios.
BrandonHanx / mmf
Forked from facebookresearch/mmf[ECCV 2022] FashionViL: Fashion-Focused V+L Representation Learning
CVPR 2022 (Oral) Pytorch Code for Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment
Code release for the paper “Worldsheet Wrapping the World in a 3D Sheet for View Synthesis from a Single Image”, ICCV 2021.
Official pytorch implementation of StyleMapGAN (CVPR 2021)
Code and data for "Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning" (EMNLP 2021).
A Collection of Variational Autoencoders (VAE) in PyTorch.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
A unified framework to jointly model images, text, and human attention traces.
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT
Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO
[EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction
A video retrieval dataset How2R and a video QA dataset How2QA
Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
A repository to curate and summarise research papers related to fashion and e-commerce
PyTorch implementation of Contrastive Learning methods
lichengunc / detectron2
Forked from facebookresearch/detectron2Detectron2 is FAIR's next-generation research platform for object detection and segmentation.
[ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset
[ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"