Kairos: Building Cost-Efficient Machine Learning Inference Systems with Heterogeneous Cloud Resources
The 32nd ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC'23)
Install the required Python packages:

pip install -r requirements.txt
Decompress the data.tar.gz file, which contains cloud operation data that can be used to run Kairos locally:

tar -xf data.tar.gz
Kairos performs optimal query distribution given a fixed heterogeneous configuration, which is set in config.json. The config.json file configures the number of instances to use in the heterogeneous server.
Start the inference servers on a service node:

python launch_servers.py
These servers capture the response time of the ML inference service for requests of different batch sizes. For the actual implementation of the ML inference models, please refer to https://github.com/harvard-acc/DeepRecSys
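The measurement these servers perform can be approximated by the sketch below. It is only an illustration: `predict` and `make_batch` stand in for a real model inference call and input generator, and the repository's servers are the authoritative implementation.

```python
import time

def profile_batch_latency(predict, make_batch,
                          batch_sizes=(1, 4, 16, 64), repeats=10):
    """Return the mean response time (seconds) for each batch size.

    `predict` and `make_batch` are placeholders for an actual
    inference function and a batch generator.
    """
    results = {}
    for bs in batch_sizes:
        batch = make_batch(bs)
        start = time.perf_counter()
        for _ in range(repeats):
            predict(batch)
        results[bs] = (time.perf_counter() - start) / repeats
    return results
```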
From another node, run:

python kairos_query_distributor.py
The output will show that Kairos provides much higher throughput than a naive query distributor while having 99% of queries meet QoS. The arrival-rate argument in the script controls the request rate; reduce it to further increase the throughput.
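The 99% figure refers to QoS attainment, i.e., the fraction of queries whose response time falls within the QoS target. A minimal sketch of that computation over logged latencies (the names here are illustrative, not the script's actual variables):

```python
def qos_attainment(latencies_ms, qos_target_ms):
    """Fraction of queries whose response time meets the QoS target."""
    return sum(l <= qos_target_ms for l in latencies_ms) / len(latencies_ms)

# Example: 3 of 4 queries finish within a 100 ms QoS target.
print(qos_attainment([80, 95, 120, 60], qos_target_ms=100))  # 0.75
```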
Run the following script:
python kairos_resource_allocator.py
This calculates and ranks the upper bounds of all possible heterogeneous configurations without any online exploration. The results are stored in the upperbounds folder. Each result is saved in .json format; the key represents the heterogeneous configuration, and the value is a [cost, upperbound] list for the corresponding configuration.
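The saved results can be aggregated and ranked offline. Here is a sketch that assumes the layout described above (one dictionary per .json file, each value a [cost, upperbound] pair); file and field names follow the README's description and should be treated as assumptions:

```python
import json
from pathlib import Path

# Collect every result file produced by kairos_resource_allocator.py.
entries = {}
for path in Path("upperbounds").glob("*.json"):
    entries.update(json.loads(path.read_text()))

# Each value is [cost, upperbound]; rank configurations by upper bound.
ranked = sorted(entries.items(), key=lambda kv: kv[1][1], reverse=True)
for config, (cost, upperbound) in ranked[:5]:
    print(f"{config}: cost={cost}, upper bound={upperbound}")
```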
You can reach me at my email: [email protected]