Reproducible, Portable, and Distributable ML Solutions in Python

When achieved, the combination of reproducibility, portability, and distributability in machine learning (ML) solutions becomes a powerful capability that unlocks numerous operational opportunities.

While reproducibility is a well-established practice in scientific research, it often doesn't receive the same recognition in the data product industry. Similarly, portability and distributability are typically considered irrelevant for custom solutions and are pursued only when explicitly required. With modern tooling, however, these properties can be achieved without significant additional effort. In return, they offer substantial benefits: highly collaborative research and development, inherent lifecycle management, effective model troubleshooting, seamless and flexible deployment (latency- or throughput-optimal runtime modes), and even potential commoditization in the form of turnkey solutions.

In this workshop, we will examine these concepts in more depth, looking at the available technologies and reviewing existing tools. A significant portion of our time will be dedicated to working with the ForML framework, where we will implement a practical end-to-end ML solution that demonstrates all of these principles.

Slides

The introduction slides are available on Google Slides.
The workshop slides are available at formlio.github.io/mlprague23.

Article

A more in-depth accompanying article on the lifecycle patterns can be found on Medium.

Setup

Clone the workshop repository:

$ git clone git@github.com:formlio/mlprague23.git
$ cd mlprague23

Install Docker Engine along with the Docker Compose plugin (which should already be part of any recent Docker Engine version).
Spin up the workspace container from within the mlprague23 project root directory (this needs to bind ports 8888, 8000, and 4040 on your machine):

$ docker compose up -d

Load the workspace notebook interface at http://127.0.0.1:8888/lab using your browser.
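
If the container fails to start, one likely cause is that another process already occupies one of the three ports mentioned above. The following is a minimal sketch of a pre-flight check, not part of the workshop repository: the helper names `port_is_free` and `busy_ports` are hypothetical, and the check only detects ports on which something is actively accepting TCP connections on 127.0.0.1.

```python
import socket

# Ports the workspace container binds on the host, per the setup steps above.
WORKSPACE_PORTS = (8888, 8000, 4040)


def port_is_free(port: int, host: str = "127.0.0.1") -> bool:
    """Return True if nothing accepts TCP connections on host:port.

    Hypothetical helper for illustration; a port that is bound but not
    listening would still be reported as free.
    """
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.settimeout(0.5)
        # connect_ex returns 0 only when a listener accepted the connection.
        return sock.connect_ex((host, port)) != 0


def busy_ports(ports=WORKSPACE_PORTS, host: str = "127.0.0.1") -> list:
    """Return the subset of ports that already have a listener."""
    return [port for port in ports if not port_is_free(port, host)]


if __name__ == "__main__":
    taken = busy_ports()
    if taken:
        print("Ports already in use:", ", ".join(str(p) for p in taken))
    else:
        print("All workspace ports look free.")
```

Running the script before `docker compose up -d` should report all ports as free; running it afterwards should list all three, which is also a rough way to confirm the workspace is up.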
