Stratio
Madrid, Spain, Europe
Stars
The AWS CloudFormation Public Coverage Roadmap
A library that allows you to easily mock out tests based on AWS infrastructure.
Includes notes on using Apache Spark in general, notes on using Spark for Physics, how to run TPCDS on PySpark, how to create histograms with Spark, tools for performance testing CPUs, Jupyter note…
Generate an IAM policy from AWS, Azure, or Google Cloud (GCP) calls using client-side monitoring (CSM) or embedded proxy
This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination…
Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow running custom code on the executors as they are initialized. This also allows extending the Spark metrics syst…
An extensible Java library for HTTP request and response logging
gsrodelgo / jvm
Forked from caarlos0-graveyard/jvm
JVM switcher with support for OpenJDK. Fork of caarlos0/jvm.
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Spark metrics related custom classes and sinks (e.g. Prometheus)
Slim(toolkit): Don't change anything in your container image and minify it by up to 30x (and for compiled languages even more) making it secure too! (free and open source)
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Curated list of resources about Apache Airflow
Embeddable, replicated and fault-tolerant SQL engine.
Learn Go with test-driven development
Distributed lock for your scheduled tasks
The Internals of Spark Structured Streaming
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
A simplified, lightweight ETL Framework based on Apache Spark
🐳 A curated list of Docker resources and projects
Testcontainers is a Java library that supports JUnit tests, providing lightweight, throwaway instances of common databases, Selenium web browsers, or anything else that can run in a Docker container.
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Cracking the Coding Interview 6th Ed. Solutions
Scala FP configuration library with a focus on runtime clarity
Apache Camel is an open source integration framework that empowers you to quickly and easily integrate various systems consuming or producing data.
Apache Spark Connector for Azure Cosmos DB