Multimodal Label Relevance Ranking via Reinforcement Learning (ECCV2024)

This is an official PyTorch implementation of LR²PPO, the ECCV2024 paper is available at here. The introduction video can be found at here.

Getting Started

Data Preparation

Download the proposed LRMovieNet Dataset from here.

[Optional] You can also download MovieNet Dataset from its Official Website.

Initialization Weights Preparation

Download roberta_base_en_model and vit_base_patch16_224_model weights from this link or from its official repositories, and save it in ./pretrained_models/ folder.

Prerequisites

Install Requirements

pip3 install -r requirements.txt

Hardware

4 GPUs

Usage

Before running the following commands, make sure the data path is correct and the GPUs are sufficient (e.g., 4 GPUs).

Stage 1. Label Relevance Ranking Base Model.

sh pointwise.sh <your_stage1>

Stage 2. Reward Model.

sh reward_pair_dataloader.sh <your_stage2>

Stage 3. LR²PPO.

sh ppo.sh <your_stage3>

Evaluation.

sh ppo_eval.sh <your_eval>

Models

We provide logs and checkpoints for the LRMovieNet dataset in the logs/ folder and through this link, respectively.

License

For more details, please refer to the LICENSE file.

Acknowledgments

Part of our code is borrowed from the following repositories:

We are grateful for these excellent works and repositories.

Citation

If you find our work helpful for your research, please consider citing it.

@inproceedings{guo2024multimodal,
  title={Multimodal Label Relevance Ranking via Reinforcement Learning},
  author={Guo, Taian and Zhang, Taolin and Wu, Haoqian and Li, Hanjun and Qiao, Ruizhi and Sun, Xing},
  booktitle={European Conference on Computer Vision},
  pages={391--408},
  year={2024},
  organization={Springer}
}

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
finetune		finetune
licenses		licenses
logs		logs
models		models
tencentpretrain		tencentpretrain
.gitignore		.gitignore
LICENSE		LICENSE
NOTICE		NOTICE
README.md		README.md
ndcg.py		ndcg.py
pointwise.sh		pointwise.sh
ppo.sh		ppo.sh
ppo_eval.sh		ppo_eval.sh
preprocess.py		preprocess.py
requirements.txt		requirements.txt
reward_pair_dataloader.sh		reward_pair_dataloader.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multimodal Label Relevance Ranking via Reinforcement Learning (ECCV2024)

Getting Started

Data Preparation

Initialization Weights Preparation

Prerequisites

Install Requirements

Hardware

Usage

Stage 1. Label Relevance Ranking Base Model.

Stage 2. Reward Model.

Stage 3. LR²PPO.

Evaluation.

Models

License

Acknowledgments

Citation

About

Releases

Packages

Languages

License

ChazzyGordon/LR2PPO

Folders and files

Latest commit

History

Repository files navigation

Multimodal Label Relevance Ranking via Reinforcement Learning (ECCV2024)

Getting Started

Data Preparation

Initialization Weights Preparation

Prerequisites

Install Requirements

Hardware

Usage

Stage 1. Label Relevance Ranking Base Model.

Stage 2. Reward Model.

Stage 3. LR2PPO.

Evaluation.

Models

License

Acknowledgments

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Stage 3. LR²PPO.

Packages