Data testing, monitoring and profiling for SQL-accessible data.
Soda Core is a free, open-source, command-line tool that enables you to use the Soda Checks Language to turn user-defined input into aggregated SQL queries.
When it runs a scan on a dataset, Soda Core executes the checks to find invalid, missing, or unexpected data. When your Soda Checks fail, they surface the data that you defined as “bad”.
Connect Soda Core to your data source, then define your Soda Checks for data quality in a checks.yml file. Use Soda Core to run scans of your data to execute the checks you defined.
Soda Core currently supports PostgreSQL, Amazon Redshift, GCP BigQuery, and Snowflake. To get started, use the install command, replacing soda-postgres
with the package that matches your data source.
pip install soda-postgres
soda-core-postgres
soda-core-redshift
soda-core-bigquery
soda-core-snowflake