Skip to content

JorgeGarciaIrazabal/soda-core

Repository files navigation

Soda logo

Soda Core

Data testing, monitoring and profiling for SQL-accessible data.

License: Apache 2.0 Slack

Soda Core is a free, open-source, command-line tool that enables you to use the Soda Checks Language to turn user-defined input into aggregated SQL queries.

When it runs a scan on a dataset, Soda Core executes the checks to find invalid, missing, or unexpected data. When your Soda Checks fail, they surface the data that you defined as “bad”.

Connect Soda Core to your data source, then define your Soda Checks for data quality in a checks.yml file. Use Soda Core to run scans of your data to execute the checks you defined.

Get started

Soda Core currently supports PostgreSQL, Amazon Redshift, GCP BigQuery, and Snowflake. To get started, use the install command, replacing soda-postgres with the package that matches your data source.

pip install soda-postgres

  • soda-core-postgres
  • soda-core-redshift
  • soda-core-bigquery
  • soda-core-snowflake

Documentation

About

Data reliability tools for SQL- and Spark-accessible data

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 99.2%
  • Other 0.8%