Stars
☄🌌️ The minimal, blazing-fast, and infinitely customizable prompt for any shell!
Flock: A Low-Cost Streaming Query Engine on FaaS Platforms
Open source security data lake for threat hunting, detection & response, and cybersecurity analytics at petabyte scale on AWS
Upserts, Deletes And Incremental Processing on Big Data.
955 不加班的公司名单 - 工作 955,work–life balance (工作与生活的平衡)
A simple CLI utility that makes it easier to switch between different AWS roles
B-tree library for use with remote storage (DynamoDB, S3) in C++
troposphere - Python library to create AWS CloudFormation descriptions
A variety of python utilities focusing on numerical, scientific, and astrophysical computing
DynamoDB data source for Apache Spark
A series of DAGs/Workflows to help maintain the operation of Airflow
Export Redshift data and convert to Parquet for use with Redshift Spectrum or other data warehouses.
Pentaho Data Integration ( ETL ) a.k.a Kettle
using airflow, calls adwords api for various ads performance reports, and perform basic transformation using Hive
Redshift data source for Apache Spark
A library that allows you to easily mock out tests based on AWS infrastructure.
Full-featured library for writing Alfred 3 & 4 workflows
JSON to JSON transformation library written in Java.
Reference architecture for real-time stream processing with Apache Flink on Amazon EMR, Amazon Kinesis, and Amazon Elasticsearch Service.
An example application using Word2Vec. Given a list of words, it finds the one which isn't 'like' the others - a typical language understanding evaluation task.
A collection of design patterns/idioms in Python
Scheduled task execution on top of AWS Data Pipeline