Skip to content
View maxiaoniu's full-sized avatar

Block or report maxiaoniu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

☄🌌️ The minimal, blazing-fast, and infinitely customizable prompt for any shell!

Rust 47,163 2,045 Updated Feb 21, 2025

Flock: A Low-Cost Streaming Query Engine on FaaS Platforms

Rust 268 40 Updated Dec 29, 2023

A Python Interpreter written in Rust

Rust 19,642 1,271 Updated Feb 22, 2025

Open source security data lake for threat hunting, detection & response, and cybersecurity analytics at petabyte scale on AWS

Rust 1,509 109 Updated Jan 8, 2025

Upserts, Deletes And Incremental Processing on Big Data.

Java 5,658 2,459 Updated Feb 21, 2025

955 不加班的公司名单 - 工作 955,work–life balance (工作与生活的平衡)

34,992 1,757 Updated Feb 3, 2025

A simple CLI utility that makes it easier to switch between different AWS roles

Python 38 8 Updated Apr 2, 2020

B-tree library for use with remote storage (DynamoDB, S3) in C++

C++ 22 1 Updated Sep 28, 2017

troposphere - Python library to create AWS CloudFormation descriptions

Python 4,940 1,436 Updated Feb 14, 2025

A streaming JsonPath processor in Java

Java 295 54 Updated Jun 3, 2024

JSON Stream Editor (command line utility)

Go 1,997 56 Updated Dec 16, 2023

A variety of python utilities focusing on numerical, scientific, and astrophysical computing

C++ 35 19 Updated Feb 7, 2025

DynamoDB data source for Apache Spark

Scala 95 44 Updated Sep 2, 2021

A series of DAGs/Workflows to help maintain the operation of Airflow

Python 1,703 399 Updated Jun 18, 2024

Export Redshift data and convert to Parquet for use with Redshift Spectrum or other data warehouses.

Python 116 25 Updated Dec 26, 2022

PipelineAI

Jsonnet 4,174 973 Updated Apr 17, 2024

Pentaho Data Integration ( ETL ) a.k.a Kettle

Java 7,876 3,503 Updated Feb 22, 2025

using airflow, calls adwords api for various ads performance reports, and perform basic transformation using Hive

Python 2 2 Updated Mar 31, 2017

Redshift data source for Apache Spark

Scala 606 348 Updated Aug 10, 2023

🚚 ETL for Spark and Airflow

Python 24 6 Updated Mar 19, 2018

A library that allows you to easily mock out tests based on AWS infrastructure.

Python 7,778 2,080 Updated Feb 22, 2025

Full-featured library for writing Alfred 3 & 4 workflows

Python 2,981 236 Updated Jan 10, 2023

JSON to JSON transformation library written in Java.

Java 1,590 333 Updated Jul 26, 2024

Apache Flink

Java 24,546 13,518 Updated Feb 21, 2025

Reference architecture for real-time stream processing with Apache Flink on Amazon EMR, Amazon Kinesis, and Amazon Elasticsearch Service.

Java 71 34 Updated Feb 21, 2024

Flink 官方文档中文翻译项目 🇨🇳

Ruby 380 167 Updated Nov 27, 2024

An example application using Word2Vec. Given a list of words, it finds the one which isn't 'like' the others - a typical language understanding evaluation task.

Python 289 88 Updated Oct 8, 2013

🦆 Contextually-keyed word vectors

Python 1,638 239 Updated Mar 17, 2024

A collection of design patterns/idioms in Python

Python 40,966 6,954 Updated Sep 5, 2024

Scheduled task execution on top of AWS Data Pipeline

Python 43 4 Updated Mar 9, 2015
Next