This repository contains a deep learning project aimed at detecting aneurysms in patients with diabetic retinopathy. The project utilizes 1200 eye fundus images in .tif file format and employs a deep learning algorithm implemented using PyTorch. The model achieves an accuracy of 80% using a Convolutional Neural Network (CNN) with a ResNet18 architecture.
The main folder structure of the repository is as follows:
- Notebooks
-- 02_data_wrangling.ipynb
-- 03_EDA.ipynb
-- 04_preprocessing_modeling.ipynb
-- helper_functions.py
This notebook focuses on the initial data wrangling tasks, including data loading, cleaning, and preprocessing. It covers tasks such as handling missing values, data normalization, and data augmentation if applicable.
In this notebook, Exploratory Data Analysis (EDA) is performed on the dataset. It includes statistical analysis, visualizations, and insights into the distribution of classes and features within the dataset. EDA helps in understanding the underlying patterns and characteristics of the data.
This notebook combines preprocessing tasks with the model building process. It includes steps such as feature extraction, data transformation, model training, evaluation, and hyperparameter tuning. The implementation of the deep learning algorithm using PyTorch with a ResNet18 architecture is detailed in this notebook.
This Python script contains helper functions used throughout the project. These functions may include data preprocessing functions, model evaluation metrics, image processing utilities, or any other reusable code snippets.
To get started with the project, follow these steps:
- Clone the repository to your local machine.
- Navigate to the
Notebooks
directory. - Open and execute the notebooks in sequential order (
02_data_wrangling.ipynb
,03_EDA.ipynb
,04_preprocessing_modeling.ipynb
). - Ensure you have the necessary dependencies installed. You can install them using the provided
requirements.txt
file or by manually installing the required packages usingpip
orconda
.
- Python 3.8
- PyTorch
- NumPy
- Pandas
- Matplotlib
- scikit-learn
- Other dependencies as mentioned in the notebooks.
[TODO]
Happy to contribute!
MIT