Stars
Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
Uniffle is a high performance, general purpose Remote Shuffle Service.
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Apache Spark - A unified analytics engine for large-scale data processing
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | 项目已迁移至 Apache Kyuubi
Alibaba Java Coding Guidelines pmd implements and IDE plugin
The Metadata Platform for your Data and AI Stack
hsweb (haʊs wɛb) 是一个基于spring-boot 2.x开发 ,首个使用全响应式编程的企业级后台管理系统基础项目。
An easy to use, self-service open BI reporting and BI dashboard platform.
High Performance Inter-Thread Messaging Library
Web-based SQL editor. Legacy project in maintenance mode.
Benchmark comparing serialization libraries on the JVM
A Spring Framework based, pragmatic style JavaEE application reference architecture.
Ctrip Hadoop Job Scheduling System derived from https://github.com/alibaba/zeus