Reinforcement Learning with Tree-LSTM for Join Order Selection(RTOS) is an optimizer which focous on Join order Selection(JOS) problem. RTOS learn from previous queries to build plan for further queries with the help of DQN and TreeLSTM.
This repository contains a very early version of RTOS.It is written in python and built on PostgreSQL. It now contains the training and testing part of our system.
It has been fully tested on join order benchmark(JOB). You can train RTOS and see its' performance on JOB.
This code is still under development and not stable, add issues if you have any questions. We will polish our code and provide further tools.
Here we have listed the most important parameters you need to configure to run RTOS on a new database.
- schemafile
- sytheticDir
- Directory contains the sytheic workload(queries), more sythetic queries will help RTOS make better plans.
- It is nesscery when you want apply RTOS for a new database.
- JOBDir
- Directory contains all JOB queries.
- Configure of PostgreSQL
- dbName : the name of database
- userName : the user name of database
- password : your password of userName
- ip : ip address of PostgreSQL
- port : port of PostgreSQL
- Pytorch 1.0
- Python 3.7
- torchfold
- psqlparse
-
Follow the https://github.com/gregrahn/join-order-benchmark to configure the database
-
Add JOB queries in JOBDir.
-
Add sythetic queries in sytheticDir, more sythetic queries will help RTOS generate better results. E.g. You can generate queries by using templates.
-
Cost Training, It will store the model which optimize on cost in PostgreSQL
python3 CostTraining.py
-
Latency Tuning, It will load the cost training model and generate the latency training model
python3 LatencyTuning.py
Contact Xiang Yu ([email protected]) if you have any questions.