Commit

Merge branch 'main' of github.com:Vision-CAIR/VisualGPT into main
junchen14 committed Apr 7, 2021
2 parents d4419be + afda1a4 commit 1d91237
Showing 2 changed files with 18 additions and 1 deletion.
19 changes: 18 additions & 1 deletion README.md
@@ -3,5 +3,22 @@
Our Paper [VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning](https://arxiv.org/abs/2102.10407)

## Main Architecture of Our VisualGPT
Our model is shown ![here!](images/architecture.pdf)
![image](images/final_architecture.jpg)

## Environment setup
Clone the repository and create the `visualgpt` conda environment




Please cite our paper using the following BibTeX entry:


```
@article{chen2021visualgpt,
title={VisualGPT: Data-efficient Image Captioning by Balancing Visual Input and Linguistic Knowledge from Pretraining},
author={Chen, Jun and Guo, Han and Yi, Kai and Li, Boyang and Elhoseiny, Mohamed},
journal={arXiv preprint arXiv:2102.10407},
year={2021}
}
```
Binary file added images/final_architecture.jpg
