Space Fan Art is a working API prototype created with Flask REST-Plus to showcase the OWASP Top Ten Proactive Controls in two different architectures: a monolith and a microservice. In the monolithic architecture, users have to register to get an authentication key, which allows them to download images from NASA and also contribute to our database by uploading pictures to our database of 365 images from NASA. In the microservice architecture, users will only be able to download images, which will come directly from APOD(one of the most famous NASA's Open API).
- 1. Overview
- 2. Architecture
- 3. Project Interface
- 4. Development timeline
- 5. Project Structure
- 6. Project Files
- 7. Python Dependencies
- 8. Tools
- 9. Installation
- 10. Quick Start
- 11. Authentication Details
- 12. OWASP Proactive Controls
- 13. Automated Testing
- 14. Project Roadmap
- 15. Useful Links
This API, called SFA (Space Fan Art), is an API created with Flask REST-Plus with two Prototypes: a monolithic architecture and a microservice architecture. Astronomy Picture Of The Day (APOD), which is a NASA open API that returns the picture of the day, has been used as a model thoughout the development of both Prototypes:
This project uses the Model-View-Controller (MVC) architecture framework, which can be described as an architectural pattern that separates an application into three main logical components: model (data), view (user interface), and controller (processes that handle input). In SFA-API, here is our architeture:
- Model => The type of data we are using in the application: user's data and picture data (json)
- View => Our interfaces (Html/CSS/Javascript) and Swagger
- Controller => In the file with five controllers: users_controller.py, picture_controller.py, gdpr_controller.py, demo_key_controller.py, limiters_controller.py.
🔹 Design based on a monolithic architecture, using MVC (Model-View-Controller) pattern. The images are taken from a local database. Here a small NASA dataset of images is used (365 pictures). To create this dataset we consumed APOD through insomnia.
🔸 Design based on a microservice architecture, using MVC (Model-View-Controller) pattern. SFA-API is connected to a NASA Open API APOD, which returns the picture of the day. For this, we had to sign up for a NASA developer key, which you can find here.
🔵 MONOLITHIC ARCHITECTURE ON SWAGGER INTERFACE
Swagger enabled the development across the entire API lifecycle, from design and documentation, to test and deployment. It has also been used as an interface.
🔘 We have 6 endpoints, with the following features:
▶️ 1 - user registration (generates an authentication key)▶️ 2 - download images (users can download images with their authentication key and also with a demo key)▶️ 3 - upload images (users can upload images to increment our database)▶️ 4 - demo key (the demo key is: Demo_Key_SFA_Trial and can only be used to download images)▶️ 5 - GDPR1 (to comply with GDPR, the system allows users to see their data)▶️ 6 - GDPR2 (to comply with GDPR, the system allows users to delete their data)
🔵 MICROSERVICE ARCHITECTURE ON SWAGGER INTERFACE
🔘 We have 5 endpoints, with the following features:
▶️ 1 - user registration (generates an authentication key)▶️ 2 - download images (user can download images with their authentication key and also with a demo key)▶️ 3 - demo key (the demo key is: Demo_Key_SFA_Trial and can only be used to download images)▶️ 4 - GDPR1 (to comply with GDPR, the system allows users to see their data)▶️ 5 - GDPR2 (to comply with GDPR, the system allows users to delete their data)
The file server.py contains simple instructions on how to switch the system to run the monolithic and microservice architecture. Here is a summary showing how to use them step-by-step:
📍 Monolithics Architecture Step-by-step:
- STEP 1 - USER NAVIGATION: Users can consume the API using the demo key (Demo_Key_SFA_Trial) even when they are not registered.
- STEP 2 - REGISTRATION: For user registration there are four parameters: first name, last name, email and password.
- STEP 3 - AUTHENTICATION KEY: After registration, users receives an authentication key.
- STEP 4 - CONSUME THE API: With the authentication key, the user is able to download and upload pictures.
- STEP 5 - GDPR: Registered users are able to see his or her stored personal details and delete it from our database (for GDPR compliance)
📍 Microservice Architecture Step-by-step:
- STEP 1 - USER NAVIGATION: Users can consume the API using the demo key (Demo_Key_SFA_Trial) even when they are not registered.
- STEP 2 - REGISTRATION: For user registration there are four parameters: first name, last name, email and password.
- STEP 3 - AUTHENTICATION KEY: After registration, users receive an authentication key.
- STEP 4 - CONSUME THE API: With the authentication key, the user is only able to download pictures. In this case returns JSON-formatted data.
- STEP 5 - GDPR: Registered users are able to see his or her stored personal details and delete it from our database (for GDPR compliance)
Click the links below to see the activity diagrams:
Notice that both prototypes allows users to download images, but only in the monolith users can also upload images to contribute to our database. In both, we save only basic user data in our database and comply with GDPR by allowing them to see their stored data and delete it.
📎 VERSION 1 - In the file: picture_controller.py
This is the first version of SFA-API. It contains one endpoint exclusively for the Monolithics architecture that allows you to download an image.
* Description:
* HTTP Request Type -> GET
* Response -> Download file
* URL GET Parameters -> N/A
* Request URL (temporary): -> http://127.0.0.1:5000/pictures/api/v1/download
📎VERSION 2 - In the file: picture_controller.py
This is the second version of SFA-API. It contains one more endpoint that allows you to upload an image.
* Description:
* HTTP Request Type -> POST
* Response -> Upload file
* URL GET Parameters -> file / title / explanation / date / copyright
* Request URL (temporary): -> http://127.0.0.1:5000/pictures/api/v1/upload
📎VERSION 3 - In the file: users_controller.py
This is the third version of SFA-API. It contains one more endpoint that allows you to register a new user.
* Description:
* HTTP Request Type -> POST
* Response -> Register user
* URL POST Parameters -> First name / Last name / email / password
* Request URL (temporary): -> http://127.0.0.1:5000/pictures/api/v1/register
📎VERSION 4 - In the file: demo_key_controller.py
This is the fourth version of SFA-API. It contains one more endpoint that allows you to use a demo key: Demo_Key_SFA_Trial
* Description:
* HTTP Request Type -> GET
* Response -> Get demo key
* URL GET Parameters -> N/A
* Request URL (temporary): -> http://127.0.0.1:5000/pictures/api/v1/demo_key
📎VERSION 5 - In the file: gdpr_controller.py
This is the fifth version of SFA-API. It contains the first endpoint for GDPR compliance: allows users to see their personal data.
* Description:
* HTTP Request Type -> POST
* Response -> Retrieve user data
* URL POST Parameters -> email / password
* Request URL (temporary): -> http://127.0.0.1:5000/pictures/api/v1/gdpr1
📎VERSION 6 - In the file: gdpr_controller.py
This is the sixth version of SFA-API. It contains the second endpoint for GDPR compliance: allows users to delete their personal data.
* Description:
* HTTP Request Type -> DELETE
* Response -> Delete user data
* URL POST Parameters -> email / password
* Request URL (temporary): -> http://127.0.0.1:5000/pictures/api/v1/gdpr2
📎VERSION 7 - In the file: picture_controller_apod.py
This is the seventh version of SFA-API. It contains one endpoint exclusively created for the microservice architecture. This endpoint allows users to get images directly from NASA API APOD (HTTP Request Type -> GET). Users can do it using the demo key (Demo_Key_SFA_Trial) or the authentication key.
* Description:
* HTTP Request Type -> GET
* Response -> Get url
* URL GET Parameters -> N/A
* Request URL (temporary): -> http://127.0.0.1:5000/pictures/api/v1/download
The following directory diagram was generated with the following command in the terminal: "tree /F"
app/
│ APOD_365
│ application_structure.py
│ decorators.py
│ docker-compose.yaml
│ Dockerfile
│ minimal.py
│ populate_db.py
│ README.md
│ requirements.txt
│ senhas.py
│ server.py
├───images (this folder contains 365 images)
├───controllers
│ │ demo_key_controller.py
│ │ limiters_controller.py
│ │ picture_controller.py
│ │ users_controller.py
README.md
README.md- Contains the description and documentation of the project.users_controller.py
users_controller.py - defines operations/endpoints with users (user registration).picture_controller.py
picture_controller.py - defines operations/endpoints with pictures (download and upload).picture_controller_apod.py
picture_controller.py - defines operations/endpoints with pictures (only download). Here is our microservice, so users can get pictures directly from the Nasa's open API: APOD.demo_key_controller.py
demo_key_controller.py - defines operations/endpoints with gdpr (users can see data and delete their personal data).gdpr_controller.py
gdpr_controller.py - defines operations/endpoints with gdpr (users can see data and delete their personal data).limiters_controller.py
limiters_controller.py - This file implements the counting to limit the use of the API (through user's IP address) and demo key.demo_key_controller.py
demo_key_controller.py - This file implements the demo key.decorators.py
decorators.py - This file contains the decorators for the api key and the demo key (to verify the existence and authenticity of the key).Dockerfile
Dockerfile - Docker config file is a recipe to build a Docker image running this RESTful API Server.docker-compose.yaml
docker-compose.yaml - This is a config file to deploy and configure the docker-container to run.minimal.py
minimal.py - This is a config file containing the class API.populate_db.py
populate_db.py - This file is aimed to populate the database.senhas.py
- This file contains the passwords of the database.server.py
server.py - This file imports controllers and runs the API.gitignore
gitignore- Lists files and file masks of the files which should not be added to git repository.common_passwords.py
common_passwords.py - File containing the 1000 most common passwords.application_structure.py
application_structure.py - Directory tree structure in Python.requirements.txt
requirements.txt - The list of Python (PyPi) requirements. Script: 1) pip install pipreqs; 2) pipreqs --encoding=utf8 C:\Users\Alice\PycharmProjects\SFA_DBsqlserver_db_script.sql
sqlserver_db_script.sql - File containing the script of our database created with SQL Server.automation_anywhere_testing.txt
automation_anywhere_testing.txt - File containing our automated tesing script generated by Automation Anywhere.
- Flask-RestPlus (+ flask)
- Werkzeug - for password hashing RESTful API documentation.
- Secrets - for generating cryptographically strong pseudo-random numbers for managing user authentication.
- Pyodbc - for accessing the database and carry out user registration.
- Requests - for making HTTP requests in Python.
- Flask Cors - A Flask extension for handling Cross Origin Resource Sharing (CORS), making cross-origin AJAX possible.
- Flask Limiters - Flask-Limiter provides rate limiting features to flask routes.
- Validator Collection - to validade users' inputs.
- Dependency Check - scans application dependencies and checks whether they contain any published vulnerabilities.
- Docker - for storing the database and the API in a container.
- Swagger-UI - used for documentation and to allow the development team to visualise and interact with the API's.
- SQL Server - to create the database of Prototype 1 (monolithic architecture)
- Ngrok - enabled the exposure of the a local development server to the Internet with minimal effort.
- AWS EC2 - cloud computing service used in both prototypes to host and run the applications
- Github - to document the project.
- Insomnia - used to consume APOD and retrieve the 365 images for our dataset (monolithic architecture).
- NASA APOD - In the monolithic architecture it was used as an API model. In the microservice architecture it was connected to SFA-API, so our API could therefore provide APOD's functionality which is to return the picture of the day.
- Automation Anywhere - for testing SFA-API (both prototypes).
It's easy to start exploring SFA-API using Docker:
$ docker run -it --rm --publish 5000:5000 alicevillar/sfa_api
$ git clone https://github.com/alicevillar/sfa_api
You will need invoke
package to work with everything related to this project.
$ pip install requirements.txt
Check our monolithic architecture on Swagger with the following link: http://5c2b286ddb69.ngrok.io/docs
In APOD, you do not need to authenticate in order to explore the NASA data. However, if you will be intensively using the APIs to, say, support a mobile application, then you should sign up for a ASA developer key.
The request body must follow the following structure:
🔵 USER MODEL:
{
"First Name": "Teresa",
"Last Name": "Saldanha",
"Email": "[email protected]",
"Password": "123"
}
That is it!
After registration, the system will generate the API Key.
🔵 USER REGISTRATION:
curl -X POST "http://127.0.0.1:5000/users/api/v1/register" -H "accept: application/json" -H "Content-Type: application/json" -d "{ \"First Name\": \"Teresa\", \"Last Name\": \"Saldanha\", \"Email\": \"[email protected]\", \"Password\": \"123\"}"
{
"API Key": "1665Sg0UlrAq1BhZOstyPMj9DF-9d-i2o0DcIIB9",
"Expiration Date": "2021-07-30"
}
The response body contains the API Key and the expiration date. Once the access authentication key expires, you have to create a new one.
In NASA, limits are placed on the number of API requests you may make using your API key. The defaults are ==>> Hourly Limit: 1,000 requests per hour. For each API key, these limits are applied across all api.nasa.gov API requests. Exceeding these limits will lead to your API key being temporarily blocked from making further requests.
In SFA-API we will allow the rate limits NASA uses for the DEMO_KEY, which are:
- Hourly Limit: 80 requests per IP address per hour
- Daily Limit: 80 requests per IP address per day
The OWASP Top Ten Proactive Controls is a list of security techniques that should be included in every software development project. They are ordered by order of importance, with control number 1 being the most important.
The OWASP Application Security Verification Standard (ASVS) contains categories such as authentication, access control, error handling / logging, and web services. Each category contains a collection of requirements that represent the best practices for that category drafted as verifiable statements.
✅ SFA-API security requirements:
- Protect against injection: We use parameterised queries to avoid SQL injection attacks in all the operations with the database.
- Protect Sensitive Data: protected password with hashing and api key generated securely using library Secrets.
- Broken Access Control: Restrictions on what authenticated users are allowed to do.
- Protect the system from denial-of-service attacks: SFA has rate limits. Limits are placed on the number of API requests you may make using your API key.
- Protect Security Misconfiguration: all operating systems, frameworks, libraries, and applications must be securely configured and patched/upgraded in a timely fashion. This is done on the Microservice Architecture (Beam Stalk - AWS).
- Key exchange communication - only happens in the microservice architecture, where AWS cloud services are used. We hold a private key which enabled us to access our VM in EC2.
- Protect against Cross-Site Scripting (XSS): We protect against XSS in our web page, using javascript.
- Protect against Components with Known Vulnerabilities: use Project Dependency to scan application dependencies and check if they contain any published vulnerabilities.
- Logging & Monitoring: The logging from FlasK-Rest-Plus is standardised, a request is received and then returned. In FlasK-Rest-Plus loggins are very simple, they are not very informative (thus, it is not possible to know details about each request). Using docker it is possible to query, store and analyse the loggins.
Secure frameworks and libraries can help to prevent a wide range of web application vulnerabilities.
✅ In SFA-API we did a dependency check to detect publicly disclosed vulnerabilities contained within a project's dependencies. Click here to see the result.
According to OWASP, secure access to databases consider: secure queries, secure configuration, secure communication and secure authentication.
✅ SFA-API handles secure database access with the following measures:
- Secure queries: In order to mitigate SQL injection we used ‘Query Parameterization’. However, certain locations in a database query are not parameterisable. Because of the large variation in the pattern of SQL injection attacks they are often unable to protect databases. OWASP recommends testing queries for performance, but this is not done here because the queries are all very small and therefore it is not necessary.
- Secure configuration: we run the database in a docker container, which has connectivity restrictions (can only be accessed by the administrator and only has one door open - 1433). The server which runs the database does not allow external access. All access to the database should be properly authenticated. Thus, it is not possible to directly access the database from outside the instance.
- Secure communication: we use Pyodbc, an open source Python module to communicate with the database. We apply secure (authenticated, encrypted) communications methods.
Encoding and escaping are defensive techniques meant to stop injection attacks. Here is the OWASP definition:
- Encoding (commonly called “Output Encoding”) involves translating special characters into some different but equivalent form that is no longer dangerous in the target interpreter, for example translating the < character into the < string when writing to an HTML page.
- Escaping involves adding a special character before the character/string to avoid it being misinterpreted, for example, adding a \ character before a " (double quote) character so that it is interpreted as text and not as closing a string.
✅ In SFA-API, we don't apply encoding or escaping because it was not needed.
⭕ It should be highlightened that a hash is not ‘encryption’ – it cannot be decrypted back to the original text (it is a ‘one-way’ cryptographic function, Whereas encryption is a two-way function, hashing is a one-way function. Hashing is used in conjunction with authentication to produce strong evidence that a given message has not been modified and serves the purpose of ensuring integrity, i.e. making it so that if something is changed you can know that it’s changed password is stored in hashed into the database and the authentication process uses hashing comparison. For password hashing we use the library Werkzeug.
Input validation is a programming technique that ensures only properly formatted data may enter a software system component. It validates that an input value is what you think it should be. Syntax validity means that the data is in the form that is expected.
✅ SFA-API validates inputs we use the Python library Validator Collection, which is a Python library that provides functions that can be used to validate the type and contents of an input value.
OWASP provides several recommendations for secure implementation of Digital Identity, a unique representation of a user. OWASP divide it in three levels:
- Level 1 : Passwords => It's necessary to store them securely and follow OWASP password requirements.
- Level 2 : Multi-Factor Authentication => Using passwords as a sole factor provides weak security. Multi-factor solutions provide a more robust solution by requiring an attacker to acquire more than one element to authenticate with the service.
- Level 3 : Cryptographic Based Authentication => requires authentication that is "based on proof of possession of a key through a cryptographic protocol.” This type of authentication is used to achieve the strongest level of authentication assurance.
✅ SFA-API applies digital identity, authentication and session management recommendation. We use libraries werkzeug (for password hashing) and secrets (to generate authentication key). However, we're only scratching the surface. We are maximising the security in our API. Following the above OWASP recommendations, here are the basic security measures we apply to maximise the security in our API:
- Level 1 : Passwords =>
- a) In SFA-API, passwords have at least 8 characters in length;
- b) All printing ASCII characters as well as the space character are acceptable in memorized secrets.
- c) We follow the the OWASP recommedation, which is to remove complexity requirements as these have been found to be of limited effectiveness. OWASP recommends the adoption of MFA or longer password lengths instead. We encourage the use of long passwords and passphrases by recommending this action in the interface.
- d) we ensure that passwords used are not commonly used passwords by blocking the top 1000 most common passwords; c) we securely store user credentials, so if a password is compromised, the attacker does not immediately have access to this information.
- Level 2 : Multi-Factor Authentication (MFA) => SFA-API applies 2 layers of protection: passwords and authentication key (which works as a token).
- Level 3 : Cryptographic Based Authentication => Once the initial successful user authentication has taken place, the application tracks this user (this is called Session Management) so it can store details about usage. Flask does it through encrypted cookies. This is implemented on top of cookies for you and signs the cookies cryptographically. What this means is that the user could look at the contents of your cookie but not modify it, unless they know the secret key used for signing.
Access Control functionality often spans many areas of software depending on the complexity of the access control system. In SFA-API, we apply two of the OWASP recommendations: a) all requests go through some kind of access control verification layer; a) all access control failures should be logged as these may be indicative of a malicious user probing the application for vulnerabilities.
✅ In SFA-API, we maximise the security by using multi-factor authentication in 2 layers of protection: hashing passwords and authentication key. Authentication key is given to users after their registration (to registrate, users have to give: first name, last name, email and password). The authentication key is monitored in three different ways:
- a) expiration date – SFA-API has an expiration date, which is done using pyodbc;
- b) rate limits - We implement rate limits with flask limiter. The requests per IP address have the following limits: 80 per day/80 per hour.
- c) user IP – The system creates a key that corresponds with the client’s IP. If the same client changes IP and tries to authenticate, the system does not allow it and will ask the client to authenticate again. This is done with pyodbc. We count the number of requests per key so the system can recognise when the same IP uses the API more than our rate limits allow.
Sensitive data such as passwords, credit card numbers, health records, personal information and business secrets require extra protection, particularly if that data falls under privacy laws (EU’s General Data Protection Regulation GDPR), financial data protection rules such as PCI Data Security Standard (PCI DSS) or other regulations. Here we describe how we have followed OWASP recommendations.
✅ In SFA-API, we protect data with parametrised queries to protect against SQL injection. OWASP recommends stored procedures but this is not done in SFA-API because the queries are small, so this measure is unecessary. OWASP also recommends escaping, which we didn't need to do in our API because it wasn't necessary.
According to OWASP, security logging can be used for: Feeding intrusion detection systems, Forensic analysis and investigations, Satisfying regulatory compliance requirements
✅ In SFA-API, we use the logging framework, a utility designed to standardise the process of logging in your application. Our logging framework is Flask RESTPlus, an extension for Flask that adds support and encourages best practices with minimal setup. It provides a coherent collection of decorators and tools to describe your API and expose its documentation properly (using Swagger). The logging from FlasK-Rest-Plus is standardised: a request is received and then returned, but with not much information. Thus, it is not possible to know details about each request. To have informative loggins we will use docker, which is where all the loggins will be stored. It is secure.
Exception handling is a programming concept that allows an application to respond to different error states (like network down, or database connection failed, etc) in various ways. Handling exceptions and errors correctly is critical to making your code reliable and secure. The try block lets you test a block of code for errors. The except block lets you handle the error.
✅ In SFA-API, we use the try-except statement in the following files:
- a) to check input validation. This is done in the files users_controller.py, picture_controller.py and gdpr_controller.py
- b) to check the number of requests per Key, in the file limiters_controller.py
The file automation_anywhere_testing.txt contains our automated tesing script generated by Automation Anywhere.
✔️ Scenario 1a: Trying to register a user with an invalid password (among top 1000 most common passwords: football). Expected response: HTTP - 422 Unprocessable entity
✔️ Scenario 1b: Trying to register a user with an invalid password (password with only 6 digits: python). Expected response: HTTP - 422 Unprocessable entity
✔️ Scenario 1c: Trying to register a user with a valid password (valid password with 8 digits: pythonnn).
Expected response: HTTP - 200 OK success status✔️ Scenario 2a: Testing a valid Authentication Key - trying to download an image.
Expected response: HTTP - 200 OK Success status✔️ Scenario 2b: Testing an invalid Authentication Key - trying to download an image.
Expected response: HTTP - 401 Unauthorised✔️ Scenario 3a: Testing input validation in the endpoint GDPR1 - trying to retrieve user data with an invalid email (michaelgmail.com). Expected response: HTTP - 422 Unprocessable entity
✔️ Scenario 3b: Testing input validation in the endpoint GDPR2 - trying to delete user with invalid password. Expected response: HTTP - 401 Unauthorised
✔️ Scenario 4a: Testing demo key endpoint. Expected response: HTTP - 200 OK Success status
✔️ Scenario 4b: Testing demo key - trying to download a picture without a demo key.
Expected response: HTTP - 401 Unauthorised✔️ Scenario 4c: Testing demo key - trying to download a picture with the right demo key. Expected response: HTTP - 200 OK Success status
✔️ Scenario 5a: Testing SQL Injection with the following malicious SQL query "SELECT Reg_Name FROM [SFA_DB].[dbo].[TB_SFA_Registration]". This will be inserted as being the name of a user registration. Expected response: HTTP - 200 OK Success status
✔️ Scenario 5b: Retriving SQL injection to check if the API is succefully protecting against injection. Expected response: HTTP - 200 OK Success status
To take this project further:
- WEB INTERFACE: Create a webpage for SAF-API using HTML/CSS and Javascript. The Web Interface will be built with HTML/CSS and Javascript. The goal is to give it an additional feature: users will be able to automaticaly change users' wallpaper with Python
- AWS: Host the project in AWS, which offers reliable, scalable, and inexpensive cloud computing services.
Best Practices for a pragmatic restful API