
Apache Polaris, the interoperable, open source catalog for Apache Iceberg

Java 1,307 172 Updated Feb 4, 2025

The Feldera Incremental Computation Engine

Rust 1,132 53 Updated Feb 4, 2025
Python 1,833 101 Updated Nov 20, 2024

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

Java 1,220 395 Updated Feb 4, 2025

Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.

Java 969 158 Updated Feb 3, 2025

The dbt-spark-livy adapter allows you to use dbt along with Apache Spark, by connecting via Apache Livy

Python 12 10 Updated Mar 30, 2023

The event stream processing platform for developers. Unified experience for real-time data ingestion, stream processing, and low-latency serving. Best-in-class performance and cost-efficiency. Supp…

Rust 7,309 601 Updated Feb 4, 2025

Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.

Java 910 307 Updated Jan 23, 2025

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …

Java 9,451 1,888 Updated Feb 4, 2025

Apache Iceberg

Java 6,852 2,353 Updated Feb 4, 2025

Uniffle is a high performance, general purpose Remote Shuffle Service.

Java 398 153 Updated Feb 3, 2025

Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.

Java 924 374 Updated Jan 23, 2025

An Extensible Data Skipping Framework

Scala 43 14 Updated Jan 15, 2025

LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.

Java 2,311 401 Updated Jan 20, 2025

Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.

Java 3,331 1,167 Updated Jan 22, 2025

Java utilities for transforming distance along N-dimensional Hilbert Curve to a point and back. Also supports range splitting queries on the Hilbert Curve.

Java 112 21 Updated Dec 9, 2024

Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shuffle data on remote servers

Java 254 72 Updated Apr 7, 2023

Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.

Java 6,452 2,814 Updated Jan 19, 2025

Performance Analysis Tool

Python 76 53 Updated Dec 8, 2022

TPC-H queries in Apache Spark SQL using native DataFrames API

C 98 82 Updated Jan 24, 2024

Star Schema Benchmark dbgen

C 121 81 Updated Mar 11, 2024

Remote shuffle service for Apache Spark to store shuffle data on remote servers.

Java 327 99 Updated Sep 29, 2023

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow

C++ 23 7 Updated Sep 25, 2024

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

C++ 26,552 8,740 Updated Feb 4, 2025

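A one-stop, cloud-native real-time streaming data platform that builds enterprise-grade Kafka services through a zero-intrusion, plugin-based approach, greatly lowering the barrier to operating, storing, and managing real-time streaming data.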

Java 7,030 1,287 Updated Oct 12, 2024

An open source ebook about the TensorFlow kernel and its implementation mechanisms.

TeX 2,894 583 Updated May 5, 2023

.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.

C# 2,038 321 Updated Jan 14, 2025

An integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes

Shell 81 29 Updated Mar 16, 2020

Production-Grade Container Scheduling and Management

Go 112,829 40,149 Updated Feb 4, 2025