This is the PyTorch Implementation of our paper "Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering".
@article{peng2021,
title={Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering},
author={Peng Min, Wang Chongyang, Gao Yuan, Shi Yu, Zhou Xiang-Dong},
year={2021}}