-
Triple20 Computer Services
- Australia
-
jupyter-pyspark Public
Jupyter Notebook (with PyPI Apache Spark)
Jupyter Notebook MIT License UpdatedJan 7, 2025 -
makester Public
Common Infrastructure commands wrapped in Makefiles
-
pyjdk Public
Gives you Python3 and OpenJDK as a primer for PySpark (or anything else ...)
Makefile UpdatedJan 5, 2025 -
-
offline-coder Public
Customised build of the Coder container image for offline deployments.
Go MIT License UpdatedDec 15, 2024 -
-
-
-
-
hadoop-pseudo Public
Hadoop pseudo-distributed container image
-
spark-pseudo Public
Apache Spark (YARN on Pseudo Distributed Hadoop) with Docker
Shell UpdatedAug 2, 2023 -
-
diffit Public
Report differences between two Apache Spark DataFrames
Python MIT License UpdatedMar 3, 2023 -
jupyter-spark-pseudo Public
Jupyter Notebook (with Apache Spark on YARN over Pseudo Distributed Hadoop)
Makefile UpdatedOct 16, 2022 -
zeppelin-spark-pseudo Public
Apache Zeppelin (with Apache Spark on YARN over Pseudo Distributed Hadoop)
-
hadoop-hive Public
Quick and easy way to get Hive running in Hadoop pseudo distributed mode using docker
-
data-kafka-connect Public
Streaming data to/from Kafka using Kafka Connect
Makefile UpdatedMar 17, 2022 -
zeppelin-hive Public
Zeppelin on Docker with minimal capability and Hive Interpreter capability
Python UpdatedOct 2, 2021 -
-
-
data-pipelines-dags Public
Airflow DAGs (task-level) component of a Data Workflow Management system
Python UpdatedJul 28, 2020 -
Infrastructure component of a Data Workflow Management system
Makefile UpdatedApr 26, 2020 -
-
pytest-docker-postgresql Public
Docker PostgreSQL plugin for pytest
-
-
-
-
-
-