Revolut Fraud Detection

This project aims to build a model to detect fraudsters on a Revolut Dataset.

The dataset was downloaded from Kaggle, and it contains three different CSV files.

One called transactions.csv with information about each transaction, user_id, timestamp, etc. Another is called users.csv, which, as the name says, has information about the user: country, age, creation date, etc. And finally, the fraudsters.csv, which contains only the user_id of the fraudsters.

The project comprehends the following phases:

Merging and cleaning the CSV files;
Check if the data is balanced. In this case, it was not, so I applied undersampling of the majority class;
Econding using Target Encoding and One Hot Encoding;
Feature selection, for this I used Pearson Correlation;
Try different Regression models. In the end, I chose the Random Forest Regressor Model
Model Evaluation.

The full description of the project can be followed on this Medium post:

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
__pycache__		__pycache__
.gitignore		.gitignore
README.md		README.md
model_functions.py		model_functions.py
preprocessing.py		preprocessing.py
run_model.ipynb		run_model.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Revolut Fraud Detection

Medium Blog Post

About

Releases

Packages

Languages

macrodrigues/revolut_fraud_detection

Folders and files

Latest commit

History

Repository files navigation

Revolut Fraud Detection

Medium Blog Post

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages