Stars
scopt / scopt
Forked from jstrachan/scopt
command line options parsing for Scala
Nested array transformation helper extensions for Apache Spark
Sample builds using the Gradle Kotlin DSL
Databricks Terraform Provider
📘 OpenAPI/Swagger-generated API Reference Documentation
Apache Ranger - To enable, monitor and manage comprehensive data security across the Hadoop platform and beyond
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
Apache Spark - A unified analytics engine for large-scale data processing
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
Web Starter Kit - a workflow for multi-device websites
A boilerplate for Node.js web applications