Scheduler Plugins

Scheduler Plugins maintains a set of scheduler plugins that apply different scheduling strategies to different workloads.

Plugin List

ResourceFungibility Plugin

A llama2-7B model can run on a single A100 GPU, on a single A10 GPU, on a single 4090, and on a variety of other GPU types. We call this resource fungibility. In practice, a cluster is often heterogeneous, with several GPU types, and high-end GPUs frequently go out of stock. To meet the service's SLOs while keeping cost under control, we need to be able to schedule workloads on different GPU types.

With the ResourceFungibility plugin, this is straightforward: a workload can be scheduled across at most 8 alternative GPU types.
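The fallback idea can be illustrated with a minimal sketch. This is not the plugin's actual code or API: `pickGPUType`, `maxAlternatives`, `errNoCapacity`, and the free-capacity map are hypothetical names made up for this example, and it assumes the limit of 8 applies to the whole ordered list of candidate GPU types.

```go
package main

import (
	"errors"
	"fmt"
)

// maxAlternatives mirrors the limit of 8 GPU types mentioned above.
// Assumption: the limit covers the whole candidate list.
const maxAlternatives = 8

// errNoCapacity is a hypothetical error for "nothing fits right now".
var errNoCapacity = errors.New("no listed GPU type has free capacity")

// pickGPUType walks the GPU types in preference order and returns the
// first one that has at least `need` free GPUs. Hypothetical helper,
// not part of the plugin.
func pickGPUType(preferred []string, free map[string]int, need int) (string, error) {
	if len(preferred) > maxAlternatives {
		return "", fmt.Errorf("got %d GPU types, at most %d are allowed",
			len(preferred), maxAlternatives)
	}
	for _, gpuType := range preferred {
		if free[gpuType] >= need {
			return gpuType, nil
		}
	}
	return "", errNoCapacity
}

func main() {
	// Preference order: try the high-end GPU first, then fall back.
	preferred := []string{"A100", "A10", "4090"}
	// Made-up cluster state: A100 is out of stock.
	free := map[string]int{"A100": 0, "A10": 2, "4090": 4}

	gpuType, err := pickGPUType(preferred, free, 1)
	if err != nil {
		fmt.Println("unschedulable:", err)
		return
	}
	fmt.Println("schedule on:", gpuType) // prints "schedule on: A10"
}
```

Keeping the list ordered lets the workload owner express a preference (for example, fastest or cheapest first), while the scheduler only has to find the first type with capacity.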

In the future, we plan to explore choosing GPUs dynamically, taking into account not only availability and cost but also performance. See the related paper, Mélange: Cost Efficient Large Language Model Serving by Exploiting GPU Heterogeneity.

InftyAI/scheduler-plugins

Folders and files

Latest commit

History

Repository files navigation

Scheduler Plugins

Plugin List

ResourceFungibility Plugin

About

Resources

License

Code of conduct

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

Packages