SimNet

SimNet is a machine learning (ML)-based computer architecture simulator that evaluates program performance by predicting the latencies of executed instructions. First, an ML-based instruction latency prediction framework that accounts for both static instruction properties and dynamic processor states is constructed. Then, a GPU-accelerated parallel simulator is implemented based on the proposed instruction latency predictor. More details can be found in our paper at https://dl.acm.org/doi/10.1145/3530891. If you are interested in SimNet, we encourage you to check out our latest work PerfVec (paper, code), which can be considered as a more general version of SimNet.

Requirements:

Python
Pytorch
CUDA (for GPU support)
Libtorch

Folders

dp: contains scripts for data pre-processing and running the simulator.

ml: contains scripts for machine learning training & testing.

sim: contains scripts for building the simulator.

data: link to the data folder.

Each folder contains the instructions.

Building and running the simulator

SimNet requires instruction traces from gem5 for training and simulation. The modified gem5 can be downloaded using

git clone -b simnet https://github.com/lingda-li/gem5.git

It generates a full instruction trace and a store queue trace when simulating a program execution.

Step 1: Train the machine learning model.

The first step is to train a model which can predict latency of a given instruction.

Preparing training data

Follow the instructions in section 1 (Preprocessing training data) dp folder to prepare the training data.

Generate a.qq: ./dp/buildQ a.txt a.sq.txt
Deduplicate qq file: python dp/1_unique.py a.qq
Make npy file: python dp/2_Q-to-mmap.py --total-rows=<entry num> a.qqu

Training model

Use the script ml/train_qq_ddp_mul folder to train the model. The trained module would be used by simulator. CNN3_F and E1DNet are the default models used in the paper.

Step 2: Build the simulator

Follow the instruction in sim folder to build the simulator.

Step 3: Running for simulator

For running the simulator, we nedd to first process the instruction trace from gem5 to the simulator format. Follow the instructions in section 2 (Building simulator input) dp folder for processing input for simulator. Place all the required files and simulator file build in step 2 in the same directory. Note: Use the model converted to the libtorch version.

./simulator_qq <trace> <aux trace> <lat module> <class module> <variances (optional)>

or

./simulator_qq <trace> <aux trace> <lat module> <variances (optional)>

Change log

082820, v1.0: a stable version that works with datasets from O3CPU output that includes instruction, register, data access, instruction access, and page table walking information.

051420: Fix the simulator bug that didn't generate context instruction relationship at runtime.

050820: Init the repo.

Name		Name	Last commit message	Last commit date
Latest commit History 363 Commits
an		an
bak_models		bak_models
cuda_inference		cuda_inference
dp		dp
ml		ml
pipeline		pipeline
sim		sim
sim_q_GPU		sim_q_GPU
sim_qq_GPU		sim_qq_GPU
sim_qq_GPU_dynamic		sim_qq_GPU_dynamic
upload		upload
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
__init__.py		__init__.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SimNet

Requirements:

Folders

Building and running the simulator

Step 1: Train the machine learning model.

Preparing training data

Training model

Step 2: Build the simulator

Step 3: Running for simulator

Change log

About

Releases

Packages

Contributors 2

Languages

lingda-li/simnet

Folders and files

Latest commit

History

Repository files navigation

SimNet

Requirements:

Folders

Building and running the simulator

Step 1: Train the machine learning model.

Preparing training data

Training model

Step 2: Build the simulator

Step 3: Running for simulator

Change log

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages