Starred repositories
A blazing-fast query execution engine that speaks the Apache Spark language and has Arrow-DataFusion at its core.
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and bat…
Addax is a versatile open-source ETL tool that can seamlessly transfer data between various RDBMS and NoSQL databases, making it an ideal solution for data migration.
RobustMQ is a next-generation, high-performance, cloud-native, converged message queue that is compatible with multiple mainstream message queuing protocols and has complete Serverless capabilities.
A topic-centric list of HQ open datasets.
A low-level, versioned, embedded, ACID-compliant, key-value database for Rust
Supports agile DataOps based on Flink, DataX, Flink-CDC, and Chunjun, with a web UI
A cloud-native open source distributed time series database with high performance, high compression ratio and high availability. http://www.cnosdb.cloud
A resource to help you pass system design interviews and become good at work 👇
Focalboard is an open source, self-hosted alternative to Trello, Notion, and Asana.
Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications based on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
A tutorial of building an LSM-Tree storage engine (database) in a week.
1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java
A learning roadmap for junior programmers, covering basic to advanced technical skills
Tooling for creating your own distributed systems.
Pure Rust LSM-tree based embedded storage engine
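😮 Core Interview Questions & Answers For Experienced Java(Backend) Developers | A complete primer on advanced knowledge for Internet Java engineers, covering high concurrency, distributed systems, high availability, microservices, massive data processing, and related areas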
Actix Web is a powerful, pragmatic, and extremely fast web framework for Rust.
A Decentralized Operating System for ZK Applications
A high-performance observability data pipeline.
Secure and fast microVMs for serverless computing.
A guide to the adventurer.
A scalable, distributed, collaborative, document-graph database, for the realtime web
Learn how to design systems at scale and prepare for system design interviews