CIRS-codes/reproduce_results_of_our_paper at main · chongminggao/CIRS-codes

History

Name		Name	Last commit message	Last commit date
parent directory ..
figures		figures
results_all_methods		results_all_methods
results_alpha_beta		results_alpha_beta
results_leave		results_leave
scripts		scripts
README.md		README.md
item_cnt_train.json		item_cnt_train.json
user_cnt_train.json		user_cnt_train.json
visual_ab.py		visual_ab.py
visual_leave_threshold.py		visual_leave_threshold.py
visual_main_figure.py		visual_main_figure.py
visual_main_table.py		visual_main_table.py
visual_utils.py		visual_utils.py
visualize_main_results.ipynb		visualize_main_results.ipynb

README.md

A guide to reproduce the main results of the paper.

There are three steps to reproduce the results.

Run the scripts to run the experiments to produce the log files.
Collect the results by copying the log files and pasting them to certain directories.
Visualize the results and reproduce the main figures and tables in the paper.

We have finished the first two steps and prepared all logs as required. You can visualize the results by only running the third step.

Step 1. Re-run the experiments.

All parameters are all set for reproducing the results. We fix the random seed so that the results will not change each time on the same machine. However, the results can still vary when running on different machines.

Different methods are running in a parallel way by adding & at the end of each shell command. Please revise the parameter --cuda for each line and distribute them according to the GPU memories of your server. A server with 8 GPUs is recommended.

Make sure you have installed the required environment and downloaded the datasets (see here).

The following scripts are used for reproducing Figure 5 in the paper.

Train the user model on two datasets.

# Make sure you are at the root of the project!

# KuaiEnv, Length == 100 and 30
python CIRS-UserModel-kuaishou.py --cuda 0 --leave_threshold 0 --num_leave_compute 1 --tau 1000 --is_ab --epoch 20 --message "Pair11" & # User model for CIRS.
python CIRS-UserModel-kuaishou.py --cuda 1 --leave_threshold 0 --num_leave_compute 1 --tau    0 --no_ab --epoch 20 --message "Pair1" & # User model for CIRS w/o CI.


# VirtualTaobao, Length == 50
python CIRS-UserModel-taobao.py    --cuda 2 --epoch 10 --max_turn 50 --tau 0.01 --leave_threshold 3 --num_leave_compute 5 --message "taobao tau 0.01" & 
python CIRS-UserModel-taobao.py    --cuda 3 --epoch 10 --max_turn 50 --tau 0 --leave_threshold 3 --num_leave_compute 5 --message "taobao tau 0" & 

# VirtualTaobao, Length == 10
python CIRS-UserModel-taobao.py    --cuda 4 --epoch 10 --tau 1 --leave_threshold 1 --num_leave_compute 5 --message "taobao tau 1" &

The logs and saved models are saved in /saved_models. Please wait until all processes terminate before learning the RL policies.

Learn the RL policies.

The baseline models (PD, IPS, DICE, DeepFM, $$\epsilon$$-greedy, UCB, Random, MLP) are also run in this lump.

## KuaiEnv, Length == 30
python "CIRS-RL-kuaishou.py"   --cuda 1 --tau 10  --seed 0  --gamma_exposure 10  --leave_threshold 0  --num_leave_compute 1 --max_turn 30 --is_ab --epoch 1000 --top_rate 0.8 --read_message "Pair11" --message "K_CIRS_len30" &
python "CIRS-RL-kuaishou.py"   --cuda 0 --tau 0   --seed 0  --leave_threshold 0  --num_leave_compute 1 --max_turn 30 --no_ab --epoch 1000 --top_rate 0.8 --read_message "Pair1" --message "K_CIRSwoCI_len30" &

## KuaiEnv, Length == 100
python "CIRS-RL-kuaishou.py"   --cuda 0 --tau 100 --seed 0 --gamma_exposure 10  --leave_threshold 0  --num_leave_compute 1 --max_turn 100 --is_ab --epoch 200 --top_rate 0.8 --read_message "Pair11"  --message "K_CIRS_len100" &
python "CIRS-RL-kuaishou.py"   --cuda 1 --tau 0   --seed 0   --leave_threshold 0  --num_leave_compute 1 --max_turn 100 --no_ab --epoch 200 --top_rate 0.8 --read_message "Pair1" --message "K_CIRSwoCI_len100" &

python PD-pairwise.py             --cuda 0 --leave_threshold 0 --num_leave_compute 1 --top_rate 0.8 --epoch 200 --message "PD"    &
python DeepFM-IPS-pairwise.py     --cuda 1 --leave_threshold 0 --num_leave_compute 1 --top_rate 0.8 --epoch 200 --message "IPS"   &
python DICE.py                    --cuda 2 --leave_threshold 0 --num_leave_compute 1 --top_rate 0.8 --epoch 200 --message "DICE"  &
python CIRS-UserModel-kuaishou.py --cuda 3 --leave_threshold 0 --num_leave_compute 1 --top_rate 0.8 --tau 0 --no_ab --epoch 200 --is_softmax --message "DeepFM+Softmax" &
python CIRS-UserModel-kuaishou.py --cuda 4 --leave_threshold 0 --num_leave_compute 1 --top_rate 0.8 --tau 0 --no_ab --epoch 200  --epsilon 0.1 --not_softmax --message "K_epsilon-greedy" &
python CIRS-UserModel-kuaishou.py --cuda 5 --leave_threshold 0 --num_leave_compute 1 --top_rate 0.8 --tau 0 --no_ab --epoch 200  --epsilon 1.0 --not_softmax --message "K_Random" &
python CIRS-UserModel-kuaishou.py --cuda 6 --leave_threshold 0 --num_leave_compute 1 --top_rate 0.8 --tau 0 --no_ab --epoch 200 --not_softmax --is_ucb --message "UCB" &

## Taobao, Length == 50
python CIRS-RL-taobao.py --cuda 0  --max_turn 50 --epoch 100 --tau 10 --leave_threshold 3 --num_leave_compute 5 --read_message "taobao tau 0.01" --message "T_CIRS_len50" &
python CIRS-RL-taobao.py --cuda 0  --max_turn 50 --epoch 100 --tau 0  --leave_threshold 3 --num_leave_compute 5 --read_message "taobao tau 0" --message "T_CIRSwoCI_len50" &

## Taobao, Length == 10
python CIRS-RL-taobao.py --cuda 0  --max_turn 10 --epoch 200 --tau 0.1 --leave_threshold 1 --num_leave_compute 5 --read_message "taobao tau 1" --message "T_CIRS_len10" &
python CIRS-RL-taobao.py --cuda 0  --max_turn 10 --epoch 200 --tau 0 --leave_threshold 1 --num_leave_compute 5 --read_message "taobao tau 0" --message "T_CIRSwoCI_len10" &

Note: Learning on VirtualTaobao is slow and the GPU utilization is very low (near 0). The bottleneck is the VirtualTaobao environment, which computes the rewards on the fly. By contrast, experiments running on KuaiEnv are very efficient.

Wait until all processes terminate. You can inspect the performance in the logs under /saved_models.

Step 2. Collect all results by copying the log files.

All log files for visualizing Figure 5 and Table 2 should be saved as shown in the following structure.

We have prepared all files for you, so you can directly conduct step 3 without conducting steps 1 and 2. If you want to run it yourself, you should copy your logs from /saved_models and replace the logs of the corresponding methods.

CIRS-codes
│   ├── results_all_methods
│   │   ├── kuaishou_len100
│   │   │   ├── [DICE]_2022_12_24-16_43_59.log
│   │   │   ├── [DeepFM+Softmax]_2022_12_24-16_43_59.log
│   │   │   ├── [IPS]_2022_12_25-13_34_03.log
│   │   │   ├── [K_CIRS_len100_r08]_2022_12_24-16_43_58.log
│   │   │   ├── [K_CIRSwoCI_len100_r08]_2022_12_24-08_35_42.log
│   │   │   ├── [K_Random]_2022_12_24-16_43_59.log
│   │   │   ├── [K_epsilon-greedy]_2022_12_24-16_43_59.log
│   │   │   ├── [PD]_2022_12_24-16_43_59.log
│   │   │   └── [UCB]_2022_12_24-16_43_59.log
│   │   ├── kuaishou_len30
│   │   │   ├── [K_CIRS_len30_r08]_2022_12_24-08_35_42.log
│   │   │   └── [K_CIRSwoCI_len30_r08]_2022_12_24-16_43_58.log
│   │   ├── taobao_len10
│   │   │   ├── [T_CIRS_len10]_2022_12_18-00_07_19.log
│   │   │   └── [T_CIRSwoCI_len10]_2022_12_18-00_07_19.log
│   │   └── taobao_len50
│   │       ├── [T_CIRS_len50]_2022_12_17-07_01_33.log
│   │       ├── [T_CIRSwoCI_len50]_2022_12_17-07_01_33.log
│   │       ├── [T_MLP]_2022_12_16-15_14_14.log
│   │       ├── [T_Random]_2022_12_16-15_26_18.log
│   │       └── [T_epsilon-greedy]_2022_12_16-15_14_14.log
│   ├── results_alpha_beta
│   │   ├── DeepFM_Pair11.pt
│   │   ├── DeepFM_params_Pair11.pickle
│   │   └── normed_mat-Pair11.pickle

Step 3. Visualize the results.

We have already prepared those logs in this repository. So you can directly visualize the results.

We illustrate the results via the Jupyter Notebook file: visualize_main_results.ipynb.

All generated figures and tables are saved under figures.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

reproduce_results_of_our_paper

reproduce_results_of_our_paper

README.md

A guide to reproduce the main results of the paper.

Step 1. Re-run the experiments.

Train the user model on two datasets.

Learn the RL policies.

Step 2. Collect all results by copying the log files.

Step 3. Visualize the results.

Files

reproduce_results_of_our_paper

Directory actions

More options

Directory actions

More options

Latest commit

History

reproduce_results_of_our_paper

Folders and files

parent directory

README.md

A guide to reproduce the main results of the paper.

Step 1. Re-run the experiments.

Train the user model on two datasets.

Learn the RL policies.

Step 2. Collect all results by copying the log files.

Step 3. Visualize the results.