Skip to content

Commit

Permalink
Bump Flytekit version in Great Expectations integration (flyteorg#379)
Browse files Browse the repository at this point in the history
Signed-off-by: Samhita Alla <[email protected]>
  • Loading branch information
samhita-alla authored Sep 1, 2021
1 parent 88872d9 commit 7d1fc86
Show file tree
Hide file tree
Showing 5 changed files with 35 additions and 34 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -76,24 +76,24 @@ For example, this is how the error message looks like:
Plugin Parameters
-----------------

- **data_source**: Data source, in general, is the "name" we use in the Great Expectations config file.
When combined with the data to be validated, the data source helps Great Expectations ascertain the type of data.
Moreover, data source assists in building batches out of data (for validation).
- **expectation_suite**: Defines the data validation.
- **data_connector**: Tells how the data batches have to be identified.
- **datasource_name**: Data source, in general, is the "name" we use in the Great Expectations config file.
A Datasource brings together a way of interacting with data (like a database or Spark cluster) and some specific data
(like a CSV file, or a database table). Moreover, data source assists in building batches out of data (for validation).
- **expectation_suite_name**: Defines the data validation.
- **data_connector_name**: Tells how the data batches have to be identified.

Optional Parameters
^^^^^^^^^^^^^^^^^^^

- **context_root_dir**: Sets the path of the great expectations config directory.
- **checkpoint_params**: Optional :py:class:`greatexpectations:great_expectations.checkpoint.checkpoint.SimpleCheckpoint` class parameters.
- **batchrequest_config**: Additional batch request configuration parameters.
- **batch_request_config**: Additional batch request configuration parameters.

- data_connector_query: Query to request a data batch
- runtime_parameters: Parameters to be sent at run-time
- batch_identifiers: Batch identifiers
- batch_spec_passthrough: Reader method if your file doesn’t have an extension
- **data_asset_name**: name of the data asset (to be used for ``RuntimeBatchRequest``)
- **data_asset_name**: Name of the data asset (to be used for ``RuntimeBatchRequest``)
- **local_file_path**: Helpful to download the given dataset to the user-given path

.. note::
Expand Down
Original file line number Diff line number Diff line change
@@ -1,2 +1,2 @@
-r ../../../common/requirements-common.in
flytekitplugins-great_expectations>=v0.22.0b3
flytekitplugins-great_expectations>=v0.22.0
Original file line number Diff line number Diff line change
Expand Up @@ -10,7 +10,7 @@ appnope==0.1.2
# via
# ipykernel
# ipython
argon2-cffi==20.1.0
argon2-cffi==21.1.0
# via notebook
attrs==21.2.0
# via
Expand All @@ -20,7 +20,7 @@ backcall==0.2.0
# via ipython
backports.zoneinfo==0.2.1
# via tzlocal
bleach==4.0.0
bleach==4.1.0
# via nbconvert
certifi==2021.5.30
# via requests
Expand All @@ -36,7 +36,7 @@ croniter==1.0.15
# via flytekit
cycler==0.10.0
# via matplotlib
dataclasses-json==0.5.4
dataclasses-json==0.5.5
# via flytekit
debugpy==1.4.1
# via ipykernel
Expand All @@ -50,6 +50,8 @@ deprecated==1.2.12
# via flytekit
dirhash==0.2.1
# via flytekit
diskcache==5.2.1
# via flytekit
docker-image-py==0.1.12
# via flytekit
docstring-parser==0.10
Expand All @@ -59,38 +61,39 @@ entrypoints==0.3
# altair
# jupyter-client
# nbconvert
flyteidl==0.19.25
flyteidl==0.20.1
# via flytekit
flytekit==0.21.4
flytekit==0.22.0
# via
# -r ../../../common/requirements-common.in
# flytekitplugins-great-expectations
flytekitplugins-great-expectations==0.22.0b3
flytekitplugins-great-expectations==0.22.0
# via -r requirements.in
great-expectations==0.13.28
great-expectations==0.13.31
# via flytekitplugins-great-expectations
grpcio==1.39.0
# via flytekit
idna==3.2
# via requests
importlib-metadata==4.6.4
importlib-metadata==4.8.1
# via
# great-expectations
# keyring
ipykernel==6.2.0
ipykernel==6.3.1
# via
# ipywidgets
# notebook
ipython==7.26.0
ipython==7.27.0
# via
# ipykernel
# ipywidgets
ipython-genutils==0.2.0
# via
# ipykernel
# ipywidgets
# nbformat
# notebook
# traitlets
ipywidgets==7.6.3
ipywidgets==7.6.4
# via great-expectations
jedi==0.18.0
# via ipython
Expand All @@ -100,8 +103,6 @@ jinja2==3.0.1
# great-expectations
# nbconvert
# notebook
joblib==1.0.1
# via flytekit
jsonpatch==1.32
# via great-expectations
jsonpointer==2.1
Expand All @@ -111,7 +112,7 @@ jsonschema==3.2.0
# altair
# great-expectations
# nbformat
jupyter-client==7.0.1
jupyter-client==7.0.2
# via
# ipykernel
# nbclient
Expand All @@ -124,11 +125,11 @@ jupyter-core==4.7.1
# notebook
jupyterlab-pygments==0.1.2
# via nbconvert
jupyterlab-widgets==1.0.0
jupyterlab-widgets==1.0.1
# via ipywidgets
keyring==23.1.0
# via flytekit
kiwisolver==1.3.1
kiwisolver==1.3.2
# via matplotlib
markupsafe==2.0.1
# via jinja2
Expand Down Expand Up @@ -249,7 +250,7 @@ pyzmq==22.2.1
# via
# jupyter-client
# notebook
regex==2021.8.21
regex==2021.8.28
# via docker-image-py
requests==2.26.0
# via
Expand All @@ -260,7 +261,7 @@ responses==0.13.4
# via flytekit
retry==0.9.2
# via flytekit
ruamel.yaml==0.17.13
ruamel.yaml==0.17.16
# via great-expectations
ruamel.yaml.clib==0.2.6
# via ruamel.yaml
Expand All @@ -272,7 +273,6 @@ send2trash==1.8.0
# via notebook
six==1.16.0
# via
# argon2-cffi
# bleach
# cycler
# flytekit
Expand All @@ -286,8 +286,6 @@ sortedcontainers==2.4.0
# via flytekit
statsd==3.3.0
# via flytekit
stringcase==1.2.0
# via dataclasses-json
termcolor==1.1.0
# via great-expectations
terminado==0.11.1
Expand All @@ -304,7 +302,7 @@ tornado==6.1
# terminado
tqdm==4.62.2
# via great-expectations
traitlets==5.0.5
traitlets==5.1.0
# via
# ipykernel
# ipython
Expand All @@ -316,7 +314,7 @@ traitlets==5.0.5
# nbconvert
# nbformat
# notebook
typing-extensions==3.10.0.0
typing-extensions==3.10.0.2
# via typing-inspect
typing-inspect==0.7.1
# via dataclasses-json
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -75,7 +75,7 @@ def simple_wf(dataset: str = DATASET_LOCAL) -> int:
# The ``local_file_path`` argument helps in copying the remote file to the user-given path.
#
# .. note::
# ``local_file_path``'s directory and ``base_directory`` ought to be the same.
# ``local_file_path``'s directory and ``base_directory`` in Great Expectations config ought to be the same.
file_task_object = GreatExpectationsTask(
name="great_expectations_task_flytefile",
datasource_name="data",
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -112,7 +112,10 @@ def file_wf() -> pd.DataFrame:
# FlyteSchema
# ===========
#
# We define a ``GreatExpectationsType`` to validate ``FlyteSchema``. The ``local_file_path`` is where we've our parquet file.
# We define a ``GreatExpectationsType`` to validate ``FlyteSchema``. The ``local_file_path`` is where we would have our parquet file.
#
# .. note::
# ``local_file_path``'s directory and ``base_directory`` in Great Expectations config ought to be the same.
@task(limits=Resources(mem="500Mi"))
def schema_task(
dataframe: GreatExpectationsType[
Expand Down

0 comments on commit 7d1fc86

Please sign in to comment.