MineCraftGPT

Implementation of course project for PKU 2023 NLPDL.

Requirements

conda install --file requirements.txt

And install the trl package by

git clone https://github.com/huggingface/trl.git
cd trl/
pip install -e .

Datasets

We use the datasets provided by MineDojo.

To get the reddit dataset, you first need to follow instructions on PRAW to get your own reddit client_id, client_secret and usr_agent and fill them in get_reddit_data and preprocess_reddit_data in utils.py.

Then run utils.py to get the wiki and reddit dataset.

Training

To train the models for wiki generation, run

python wiki_train.py

after changing the wandb project name and model path in it.

To train the models for reddit reply, run

python reddit_train.py

If you want to try RLHF on the reddit dataset, first run

python reward_model.py

to train the reward model, then run

python PPO.py

to train the RLHF model.

Evaluation

Access to the Google Gemini model is required to run the evaluation. Please follow the instructions on Gemini.

Run python elo_rating.py to see the result.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MineCraftGPT

Requirements

Datasets

Training

Evaluation

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.gitignore		.gitignore
PPO.py		PPO.py
README.md		README.md
elo_rating.py		elo_rating.py
extract_knowledge.py		extract_knowledge.py
reddit_train.py		reddit_train.py
requirements.txt		requirements.txt
reward_model.py		reward_model.py
utils.py		utils.py
wiki_train.py		wiki_train.py

muzhancun/MineCraftGPT

Folders and files

Latest commit

History

Repository files navigation

MineCraftGPT

Requirements

Datasets

Training

Evaluation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages