integration / data stack
EventMesh is a new generation serverless event middleware for building distributed event-driven applications.
Apache Superset is a Data Visualization and Data Exploration Platform
Apache Spark - A unified analytics engine for large-scale data processing
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
Apache Pulsar - distributed pub-sub messaging system
Apache Druid: a high performance real-time analytics database.
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
Apache Beam is a unified programming model for Batch and Streaming data processing.
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
Apache Camel is an open source integration framework that empowers you to quickly and easily integrate various systems consuming or producing data.
Apache Pinot - A realtime distributed OLAP datastore
🦎 a tool to build and deploy software on many servers 🦎
Privacy and Security focused Segment-alternative, in Golang and React
lakeFS - Data version control for your data lake | Git for data