RLRep

RLRep is a project to automatically generate program repair recommendation in the field of smart contracts for given code snippets with their contexts. The source code and dataset are opened.

Introduction

multistep_RLRep.py: the model framework for reinforcement learning and its implementation

src/, config/, utils/ and smartBugs.py: integrated the smartBugs tool (Ferreira et al.).

similarity_compute.py and FastText/(download from Zenodo because of Github upload size limit): by FastText library, compute the similarity (proposed by Gao et al.) between the generated contract and the buggy contract. (one of the modules that make up the reward function)

entropy_compute.py and entropy_compute/(download from Zenodo because of Github upload size limit): by the concept of entropy proposed by Ray et al., compute the entropy of the generated contract. (one of the modules that make up the reward function as well)

utils2.py: some useful methods of reward function and reinforcement learning.

top300_identifier_dict.txt: top-300 most frequent tokens in the source code.

solidityparser/: the Solidity lexer and parser built on top of ANTLR.

code2ast.js and node_modules/: convert source code to a preorder traversal sequence of AST.

genetic.py: the implement of the search-and-genetic-algorithm-based smart contract repair approach proposed by Yu et al.

dataset_vul.tar.gz: unzip it to get the folder dataset_vul/. It includes full_contract_dataset/ (853 vulnerable smart contracts), contract/(the source code of the buggy contract labeled with fault location), ast/(the preorder sequence of the abstract syntax tree of the buggy function), threelines-tokenseq/(the previous line, the next line and the buggy line) and repair_contract/ (the correct generated patches).

requirements.txt: a file listing all the dependencies for RLRepair

Usage

Install packages needed using pip:

pip install -r requirements.txt

Unzip dataset_vul.tar.gz
make sure that all input files are ready: (you can refer to the format of our input files in dataset_vul/newALLBUGS/)

mapping (map source token to index): code_w2i.pkl, code_i2w.pkl, ast_w2i.pkl and ast_i2w.pkl
first input: threelines-tokenseq/
second input: ast/
data for pretraining: pretrain/ and pretrain_label/
data for validation: validation/

training and validation

python main.py [model_name] [dataset_path]
# for example:
python main.py multistep_RLRep dataset_vul/newALLBUGS
# or
python main.py mutation dataset_vul/newALLBUGS

result

At last, result can be got in dataset_vul/newALLBUGS/validation/result/.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RLRep

Introduction

Usage

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 53 Commits
config		config
node_modules		node_modules
solidityparser		solidityparser
src		src
utils		utils
README.md		README.md
code2ast.js		code2ast.js
dataset_vul.tar.gz		dataset_vul.tar.gz
entropy_compute.py		entropy_compute.py
genetic.py		genetic.py
main.py		main.py
multistep_RLRep.py		multistep_RLRep.py
requirements.txt		requirements.txt
similarity_compute.py		similarity_compute.py
smartBugs.py		smartBugs.py
top300_identifier_dict.txt		top300_identifier_dict.txt
utils2.py		utils2.py

mokita-j/RLRep

Folders and files

Latest commit

History

Repository files navigation

RLRep

Introduction

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages