Starred repositories

5 stars written in Scala
Apache Spark - A unified analytics engine for large-scale data processing

Scala · 39,494 stars · 28,254 forks · Updated Oct 16, 2024

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala · 7,521 stars · 1,688 forks · Updated Oct 15, 2024

The leader in Next-Generation Customer Data Infrastructure

Scala · 6,831 stars · 1,189 forks · Updated Sep 2, 2024

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Scala · 3,284 stars · 537 forks · Updated Oct 9, 2024

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala · 2,087 stars · 910 forks · Updated Oct 16, 2024