forked from formlio/forml

Lifecycle management framework for Data science projects


hieuqtran/forml

 
 


ForML


ForML is a framework for researching, implementing and operating data science projects.

Use ForML to formally describe a data science problem as a composition of high-level operators. ForML expands your project into a task dependency graph specific to a given life-cycle phase and executes it using any of its supported runners.
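The idea of composing operators and expanding them into a phase-specific task graph can be illustrated with a small toy model. This is only a sketch under stated assumptions: the `Operator` and `Pipeline` classes, the `expand` method, and the phase names `"train"` and `"apply"` are all hypothetical and are not ForML's actual API.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Operator:
    """Hypothetical operator: a named unit that contributes tasks per phase."""

    name: str
    train_tasks: Tuple[str, ...]
    apply_tasks: Tuple[str, ...]

    def __rshift__(self, other):
        # Composing two operators yields a pipeline.
        return Pipeline((self,)) >> other


@dataclass(frozen=True)
class Pipeline:
    """Hypothetical composition of operators in a fixed order."""

    operators: Tuple[Operator, ...]

    def __rshift__(self, other):
        extra = other.operators if isinstance(other, Pipeline) else (other,)
        return Pipeline(self.operators + extra)

    def expand(self, phase: str) -> List[Tuple[str, str]]:
        """Return the (upstream, downstream) edges of a linear task graph."""
        if phase not in ("train", "apply"):
            raise ValueError(f"unknown phase: {phase}")
        tasks = []
        for op in self.operators:
            names = op.train_tasks if phase == "train" else op.apply_tasks
            tasks.extend(f"{op.name}.{task}" for task in names)
        # Chain consecutive tasks into dependency edges.
        return list(zip(tasks, tasks[1:]))


imputer = Operator("imputer", ("fit", "transform"), ("transform",))
model = Operator("model", ("fit",), ("predict",))
pipeline = imputer >> model

train_graph = pipeline.expand("train")
apply_graph = pipeline.expand("apply")
```

The same composed `pipeline` produces a different graph for each phase: the training graph includes the fitting tasks, while the apply graph contains only the transform and predict steps.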

Solutions built on ForML are naturally easy to reuse, extend, reproduce, or share and collaborate on.

Not Just Another DAG

Although a DAG (directed acyclic graph) is at the heart of ForML's operations, ForML stands out among the many other task dependency processing systems due to:

  1. Its specialization in machine learning problems, which is wired right into the flow topology.
  2. Its concept of high-level operator composition, which helps wrap complex ML techniques into simple reusable units.
  3. Its abstraction of runtime dependencies, which allows the same project to run on different technologies.
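The third point, running one task graph on interchangeable backends, can be sketched with the Python standard library alone. The `Graph` layout and the `run_sequential` and `run_threaded` functions below are hypothetical illustrations, not ForML's runners; the sketch assumes Python 3.9+ for `graphlib`.

```python
from concurrent.futures import ThreadPoolExecutor
from graphlib import TopologicalSorter
from typing import Callable, Dict, Tuple

# Hypothetical layout: task name -> (function of upstream results, upstream names)
Graph = Dict[str, Tuple[Callable[..., object], Tuple[str, ...]]]


def _dependencies(graph: Graph) -> Dict[str, Tuple[str, ...]]:
    """Extract the name -> predecessors mapping expected by TopologicalSorter."""
    return {name: deps for name, (_, deps) in graph.items()}


def run_sequential(graph: Graph) -> Dict[str, object]:
    """Execute tasks one at a time in topological order."""
    results: Dict[str, object] = {}
    for name in TopologicalSorter(_dependencies(graph)).static_order():
        func, deps = graph[name]
        results[name] = func(*(results[dep] for dep in deps))
    return results


def run_threaded(graph: Graph, workers: int = 4) -> Dict[str, object]:
    """Execute the same graph, submitting each batch of ready tasks to a thread pool."""
    sorter = TopologicalSorter(_dependencies(graph))
    sorter.prepare()
    results: Dict[str, object] = {}
    with ThreadPoolExecutor(max_workers=workers) as pool:
        while sorter.is_active():
            futures = {}
            for name in sorter.get_ready():
                func, deps = graph[name]
                futures[name] = pool.submit(func, *(results[dep] for dep in deps))
            for name, future in futures.items():
                results[name] = future.result()
                sorter.done(name)
    return results


graph: Graph = {
    "load": (lambda: [1.0, 2.0, 3.0], ()),
    "scale": (lambda xs: [x * 2 for x in xs], ("load",)),
    "total": (lambda xs: sum(xs), ("scale",)),
}

sequential = run_sequential(graph)
threaded = run_threaded(graph)
```

Because the graph description is independent of how it is executed, both functions yield the same results; only the execution technology differs.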

History

ForML started as an open-source project in response to the ever-painful transition of data science research into production. While there are other projects trying to solve this problem, they are typically either generic data processing systems that are too low-level to provide out-of-the-box ML lifecycle routines, or specialized scientific frameworks that are, at the other extreme, too high-level to allow for robust operations.
