GitHub - mufaka/EASDRL: Forking to attempt update for running with Tensorflow 2.x

EASDRL

Code for the IJCAI-18 paper 'Extracting Action Sequences from Texts Based on Deep Reinforcement Learning'

Requirements:

tensorflow / keras

wxpython

gensim

ipdb

...

PS: The code expects a folder named 'weights', but GitHub removed it automatically because it was empty. Remember to create it if you want to train a new model.
```
$ mkdir weights
```

Running

Training: All arguments are preset in main.py, so you can start training by:

$ python main.py

To train the Argument Extractor, run:

$ python main.py --agent_mode arg

If you want to change the domain from 'cooking' to 'win2k' or 'wikihow', try:

$ python main.py --domain win2k

Training may take 2-4 hours for "win2k", 10-15 hours for "cooking" and 20-30 hours for "wikihow" on our machine with a TITAN Xp GPU. Adjust the replay memory size, GPU fraction or number of epochs to suit your server.
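The commands above can also be assembled from a small script, which helps when queuing several long runs. The following is a minimal sketch, not part of the repository: `build_command` and `run_all` are hypothetical helpers, and it assumes that `--agent_mode` and `--domain` can be combined in one call (the README only shows them separately).

```python
import subprocess

# Domains named in this README
DOMAINS = ("cooking", "win2k", "wikihow")


def build_command(domain=None, agent_mode=None):
    """Build the argument list for one training run.

    Flags left as None are omitted, so main.py falls back to its presets.
    """
    cmd = ["python", "main.py"]
    if agent_mode is not None:
        cmd += ["--agent_mode", agent_mode]
    if domain is not None:
        cmd += ["--domain", domain]
    return cmd


def run_all(agent_mode=None, dry_run=True):
    """Print (and optionally execute) one training command per domain."""
    for domain in DOMAINS:
        cmd = build_command(domain, agent_mode)
        print(" ".join(cmd))
        if not dry_run:
            # check=True stops the loop if a run exits with an error
            subprocess.run(cmd, check=True)
```

Calling `run_all(agent_mode="arg")` only prints the three commands; pass `dry_run=False` to actually launch them one after another.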

Human-agent Interaction: To use the interactive environment, make sure wxpython is installed, then run:

$ python gui.py

This is the initial version, which is simple and may have some bugs. A later version adopts Active Learning for labeling data. It can be run with:

$ python guiActiveLearning.py

About the data

In the file names below, {domain} can be one of "cooking", "wikihow" or "win2k".

Labeled data

{domain}_labeled_text_data.pkl is the labeled data for action name extractor
refined_{domain}_data.pkl is the labeled data for action argument extractor

POS data

{domain}_dependency.pkl contains the part-of-speech data for action name extractor
{domain}_arg_pos.pkl contains the part-of-speech data for action argument extractor
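The naming scheme above can be expanded per domain with a few lines of Python. This is an illustrative sketch only: the dictionary keys and the `domain_files` helper are hypothetical, and the default `data_dir="data"` is an assumption about where the files live.

```python
import os

# Domains and file name templates as listed in this README
DOMAINS = ("cooking", "wikihow", "win2k")
TEMPLATES = {
    "act_labeled": "{domain}_labeled_text_data.pkl",  # action name extractor labels
    "arg_labeled": "refined_{domain}_data.pkl",       # action argument extractor labels
    "act_pos": "{domain}_dependency.pkl",             # POS data, action name extractor
    "arg_pos": "{domain}_arg_pos.pkl",                # POS data, action argument extractor
}


def domain_files(domain, data_dir="data"):
    """Return the expected data file paths for one domain."""
    if domain not in DOMAINS:
        raise ValueError("unknown domain: %r" % (domain,))
    return {
        key: os.path.join(data_dir, template.format(domain=domain))
        for key, template in TEMPLATES.items()
    }
```

For example, `domain_files("win2k")["arg_labeled"]` yields the path ending in `refined_win2k_data.pkl`.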

Unlabeled data

home_and_garden_500_words_with_tile.pkl contains more than 15k unlabeled texts from WikiHow Home and Garden.
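To check what a given .pkl file holds before training, something like the following may help. It is a sketch under assumptions: the internal structure of the files is not documented here, so it only reports the top-level type and length, and the `latin1` fallback assumes some files may have been written by Python 2. `load_pickle` and `describe` are hypothetical helper names.

```python
import pickle


def load_pickle(path):
    """Load a pickle file, retrying with latin1 for Python 2 era pickles."""
    with open(path, "rb") as f:
        try:
            return pickle.load(f)
        except UnicodeDecodeError:
            # Python 2 byte strings may not decode as ASCII; rewind and retry
            f.seek(0)
            return pickle.load(f, encoding="latin1")


def describe(obj):
    """Return the type name of obj and its length (None if it has no length)."""
    size = len(obj) if hasattr(obj, "__len__") else None
    return type(obj).__name__, size


# Example (path is an assumption about the local layout):
# data = load_pickle("data/home_and_garden_500_words_with_tile.pkl")
# print(describe(data))
```

Only unpickle files from a source you trust, since loading a pickle can execute arbitrary code.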

Others

wordvec_dim* holds the pre-trained word2vec embeddings.
There are some simple texts in ./data/online_test/. They were originally used for the online interaction test.

