Change the repository type filter
All
Repositories list
35 repositories
- Accelerates migrations to Databricks by automating code conversion and migration validation
- Automated migrations to Unity Catalog
- Experimental labs projects
- Databricks framework to validate Data Quality of pySpark DataFrames
- API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation
pytester
PublicPython Testing for Databrickspartner-connect-api
Public- Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines
blueprint
PublicBaseline for Databricks Labs projects written in Pythondiscoverx
PublicA Swiss-Army-knife for your Data Intelligence platform administration.- Lightweight SQL execution wrapper only on top of Databricks SDK
- Capture deep metrics on one or all assets within a Databricks workspace
pylint-plugin
PublicDatabricks Plugin for PyLinttika-ocr
Public- 🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.
geoscan
Public- Automated provisioning of an industry Lakehouse with enterprise data model
dataframe-rules-engine
Publicsplunk-integration
PublicDatabricks Add-on for Splunk- Databricks SDK for R (Experimental)
databricks-sync
Publicarcuate
PublicDelta Sharing + MLflow for ML model & experiment exchange (arcuate delta - a fan shaped river delta)- DeltaOMS is a solution that help build a centralized repository of Delta Transaction logs and associated operational metrics/statistics for your Delta Lakehouse. Unity Catalog supported in the v0.7.0-rc1 release.Documentation here - https://databrickslabs.github.io/delta-oms/v0.7.0-rc1/