The ERNIE-ViL (including pre-trained models and VCR task-pretrained models) has been released at here.