Skip to content

rekino/data_engineering_assignment

Repository files navigation

Data engineering assignment

This is the repository of the assignment for the data engineering position.

The solution is tested with Python 3.9 and PostgreSQL 12.

The repository does not contain the csv files and images. Please extract/copy the images to 'flaskr/static/images/' and import the data into DB using the 'import_quarter.py' script.

Installation

The '.devcontainer' folder would help you set up the application in VSCODE with the help of the "Remote Containers" plugin. If you prefer to not use VSCODE, consult the the files in the '.devcontainer' folder. The application needs a PostgreSQL instance available at 'db:5432'.

Usage

Initialize the DB schema

$ flask init-db

Importing data

To import csv data:

$ python import_quarter.py -f <folder> -q <quarter>

For Example:

$ python import_quarter.py -f ./csv/4 -q 4

Run the application

$ flask run

Tests

Create a DB called "test" in your PostgreSQL instance.

$ pytest

Stress test

$ python stresstest_script.py

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published