A lightweight declarative data workflow framework built on the 🏃 Workflow package.
Warning
This framework does not allow you to customize your pipeline yet. If you want to create your own workflow, use the 🏃 Workflow package instead, which is already installed as a dependency.
pip install -U deflow
After you initialize a data framework project with Version 1, your data pipeline config files are stored in this file structure:
conf/
├─ conn/
│  ├─ c_conn_01.yml
│  ╰─ c_conn_02.yml
╰─ stream/
   ╰─ s_stream_01/
      ├─ g_group_01.tier.priority/
      │  ├─ p_proces_01.yml
      │  ╰─ p_proces_02.yml
      ├─ g_group_02.tier.priority/
      │  ├─ p_proces_01.yml
      │  ╰─ p_proces_02.yml
      ╰─ config.yml
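To make the layout concrete, here is a minimal sketch that walks one stream directory and lists its groups and process files. It is not part of deflow: the helpers `parse_group_dir`, `list_stream`, and `build_example` are hypothetical names, and the reading of a group directory name as `<group>.<tier>.<priority>` is an assumption based only on the tree shown here.

```python
from pathlib import Path
from tempfile import TemporaryDirectory


def parse_group_dir(dirname: str) -> tuple[str, str, str]:
    """Split a group directory name into (group, tier, priority).

    Assumption: the name follows `<group>.<tier>.<priority>`.
    Raises ValueError if the name has fewer than three dot-separated parts.
    """
    group, tier, priority = dirname.split(".", 2)
    return group, tier, priority


def list_stream(conf: Path, stream: str) -> dict[str, list[str]]:
    """Map each group directory of a stream to its sorted process names."""
    stream_dir = conf / "stream" / stream
    return {
        group.name: sorted(p.stem for p in group.glob("*.yml"))
        for group in sorted(stream_dir.iterdir())
        if group.is_dir()  # skips config.yml at the stream level
    }


def build_example(root: Path) -> Path:
    """Create the example stream layout under `root` and return the conf path."""
    conf = root / "conf"
    stream_dir = conf / "stream" / "s_stream_01"
    for group in ("g_group_01.tier.priority", "g_group_02.tier.priority"):
        group_dir = stream_dir / group
        group_dir.mkdir(parents=True)
        for process in ("p_proces_01.yml", "p_proces_02.yml"):
            (group_dir / process).touch()
    (stream_dir / "config.yml").touch()
    return conf


if __name__ == "__main__":
    with TemporaryDirectory() as tmp:
        conf = build_example(Path(tmp))
        for group_dir, processes in list_stream(conf, "s_stream_01").items():
            print(parse_group_dir(group_dir), processes)
```

The sketch only inspects file names. It does not read the YAML contents, whose schema is not described here.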
You can run the data flow with:
from deflow.flow import Flow
from ddeutil.workflow import Result
# Run the stream by name; the call returns a workflow Result object.
result: Result = Flow(name="s_stream_01").run(mode="N")
Supported data framework versions:
| Version | Supported | Description |
|---|---|---|
| 1 | Yes | A data framework based on stream, group, and process |
I do not expect this project to be widely adopted, because it serves a specific purpose and, as a long-term solution, you can write your own code without depending on it. For now, you can open a GitHub issue on this project 🙌 to report a bug or request a new feature.