Skip to content

TomLous/meetup-spark-airflow-demo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Spark & Airflow demo for Data Driven Rijnmond meetup

Contains simple example using small Spark Jobs and Airflow pipeline on Google Cloud

The pipeline consists of ETL 2 json files, matching them in 2 different ways, merging them and storing the result in Elastic Search

Try to change & run the scripts/xx-xxx.sh files in order to create yourown setup
Your experience may differ from mine. :-)

Created for Data Driven Rijnmond meetup https://www.meetup.com/nl-NL/Data-Driven-Rijnmond/events/246008610/

Please join us for other cool talks & demo's

airflow pipeline