This project implements an advanced image segmentation system for surgical instruments using synthetic data generation, segmentation model development, and domain adaptation techniques. It simulates a real-world scenario where annotated medical data is unavailable due to the high cost of expert annotation.
The project is divided into three phases:
- Synthetic Data Generation: Creating a synthetic dataset of surgical instrument images and segmentation masks.
- Segmentation Model Development: Training and fine-tuning a DeepLabV3+ model on the synthetic dataset.
- Domain Adaptation: Enhancing the model's performance on real surgical images by bridging the domain gap.
Additionally, the project provides two utility scripts:
- `predict.py`: For running segmentation on single images.
- `video.py`: For applying segmentation on video frames.
A link to the final model weights: DeepLab v3 model
## Project Structure

```
project/
│
├── data_generation/     # Phase 1: Synthetic Data Generation
├── model_development/   # Phase 2: Segmentation Model Development
├── domain_adaptation/   # Phase 3: Domain Adaptation
├── predict.py           # Utility script for single-image predictions
├── video.py             # Utility script for video segmentation
└── requirements/        # Dependency requirements for each phase
```
## `predict.py`

This script predicts segmentation masks for a single image using a trained DeepLabV3+ model.
Arguments:

- `-i` or `--image_idx` (optional): Index of an image from the default path `./data_generation/output/jpg_format/`. Example: `-i 0` for the image `0_color.jpg`. Cannot be used with `--image_path`.
- `-p` or `--image_path` (optional): Full path to the image file. Example: `-p ./path/to/image.jpg`.
- `-m` or `--model_path`: Path to the trained model. Default: `./model_development/deeplabv3_model.pth`.
- `-o` or `--output_dir`: Directory to save the results. Must be specified unless `--dev` or `--test` is used.
- `--dev`: Use the development directory (`./model_development/predictions/`) as the output location.
- `--test`: Use the test directory (`./domain_adaptation/predictions/`) as the output location.

Usage notes:

- Provide either `--image_idx` or `--image_path` (not both).
- Specify one output option: `-o`, `--dev`, or `--test`.
Example:

```bash
python predict.py -i 0 --dev
python predict.py -p ./image.jpg -o ./results/
```
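The option contract above maps naturally onto `argparse` mutually exclusive groups. The snippet below is a minimal, hypothetical sketch of that contract (flag names and defaults are taken from the list above; the actual `predict.py` may be organized differently):

```python
import argparse

parser = argparse.ArgumentParser(description="Predict a segmentation mask for a single image.")

# Input source: exactly one of --image_idx / --image_path.
source = parser.add_mutually_exclusive_group(required=True)
source.add_argument("-i", "--image_idx", type=int,
                    help="Index of an image under ./data_generation/output/jpg_format/")
source.add_argument("-p", "--image_path", type=str,
                    help="Full path to an image file")

parser.add_argument("-m", "--model_path", type=str,
                    default="./model_development/deeplabv3_model.pth",
                    help="Path to the trained model")

# Output location: exactly one of -o / --dev / --test.
out = parser.add_mutually_exclusive_group(required=True)
out.add_argument("-o", "--output_dir", type=str,
                 help="Directory to save the results")
out.add_argument("--dev", action="store_true",
                 help="Write to ./model_development/predictions/")
out.add_argument("--test", action="store_true",
                 help="Write to ./domain_adaptation/predictions/")

args = parser.parse_args()
output_dir = ("./model_development/predictions/" if args.dev
              else "./domain_adaptation/predictions/" if args.test
              else args.output_dir)
```

Mutually exclusive groups let `argparse` enforce both usage rules automatically, so invalid flag combinations fail with a clear error before any model is loaded.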
## `video.py`

This script processes a video and applies segmentation frame-by-frame.
Arguments:

- `-v` or `--input_video_path` (required): Path to the input video. Default: `/datashare/project/vids_test/4_2_24_A_1.mp4`.
- `-m` or `--model_path`: Path to the trained model. Default: `./model_development/deeplabv3_model.pth`.
- `-o` or `--output_dir`: Directory to save the segmented video. Must be specified unless `--dev` or `--test` is used.
- `--dev`: Use the development directory (`./model_development/predictions/`) as the output location.
- `--test`: Use the test directory (`./domain_adaptation/predictions/`) as the output location.

Usage notes:

- Specify one output option: `-o`, `--dev`, or `--test`.
Example:

```bash
python video.py -v ./input.mp4 --test
python video.py -o ./output/
```
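As a rough illustration of what frame-by-frame segmentation involves, here is a hedged sketch using OpenCV and a torchvision-style DeepLabV3 model. The `segment_video` helper, the red overlay, and the omission of input normalization are illustrative assumptions, not the actual `video.py` implementation:

```python
import cv2
import numpy as np
import torch

def segment_video(input_path, output_path, model, device="cuda"):
    """Run a segmentation model on every frame and write an overlay video."""
    cap = cv2.VideoCapture(input_path)
    fps = cap.get(cv2.CAP_PROP_FPS)
    w = int(cap.get(cv2.CAP_PROP_FRAME_WIDTH))
    h = int(cap.get(cv2.CAP_PROP_FRAME_HEIGHT))
    writer = cv2.VideoWriter(output_path, cv2.VideoWriter_fourcc(*"mp4v"), fps, (w, h))

    model.eval().to(device)
    with torch.no_grad():
        while True:
            ok, frame = cap.read()
            if not ok:
                break
            # BGR uint8 frame -> float tensor of shape (1, 3, H, W) in [0, 1]
            rgb = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)
            x = torch.from_numpy(rgb).permute(2, 0, 1).float().div(255).unsqueeze(0).to(device)
            out = model(x)["out"]  # torchvision DeepLabV3 returns a dict of logits
            mask = out.argmax(1).squeeze(0).cpu().numpy().astype(np.uint8)
            # Tint predicted instrument pixels red on top of the original frame
            overlay = frame.copy()
            overlay[mask > 0] = (0, 0, 255)
            writer.write(cv2.addWeighted(frame, 0.6, overlay, 0.4, 0))
    cap.release()
    writer.release()
```

Keeping the writer at the source resolution and FPS keeps the output aligned with the input; a real implementation would also apply whatever normalization the model was trained with.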
## Installation

- Clone the repository:

  ```bash
  git clone https://github.com/AmitShirazi1/CV_for_Surgical_Applications_Project.git
  cd CV_for_Surgical_Applications_Project/
  ```

- Install basic dependencies:

  ```bash
  pip install -r requirements/basic.txt
  ```

- Install dependencies for the desired phase:

  ```bash
  pip install -r requirements/<phase>.txt
  ```

  Replace `<phase>` with `data_generation`, `model_development`, or `domain_adaptation`.

- Install Blender for Phase 1 (Synthetic Data Generation): download and install Blender.
## Phase 1: Synthetic Data Generation

- Generate synthetic images and segmentation masks:

  ```bash
  blenderproc run ./data_generation/synthetic_data_generator.py -b -n 1000
  ```

  Replace `1000` with the desired number of images.

- Review and inspect the generated dataset: open `investigate_hdf5_output.ipynb` in Jupyter Notebook to visualize or convert the dataset (a standalone inspection sketch follows below).

Refer to the Phase 1 README for details.
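For a quick look at the generated files outside the notebook, a minimal inspection snippet might resemble the following. It assumes BlenderProc's typical HDF5 layout (one `N.hdf5` per render with keys such as `colors` and `category_id_segmaps`); the generator's actual keys may differ, so print them first:

```python
import h5py
import matplotlib.pyplot as plt

# Inspect one rendered sample; the path and key names are assumptions
# based on BlenderProc's usual output conventions.
with h5py.File("./data_generation/output/0.hdf5", "r") as f:
    print(list(f.keys()))                # confirm what the generator actually wrote
    color = f["colors"][()]              # (H, W, 3) uint8 rendered image
    mask = f["category_id_segmaps"][()]  # (H, W) integer class labels

fig, (ax1, ax2) = plt.subplots(1, 2)
ax1.imshow(color); ax1.set_title("color")
ax2.imshow(mask);  ax2.set_title("segmentation mask")
plt.show()
```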
## Phase 2: Segmentation Model Development

- Train the model using synthetic data:

  ```bash
  python ./model_development/train.py --images_path <train_data> --val_images_path <val_data>
  ```

  Replace `<train_data>` and `<val_data>` with the paths to the training and validation datasets.

- Fine-tune the trained model:

  ```bash
  python ./model_development/finetuning.py --images_path <train_data>
  ```

Refer to the Phase 2 README for details.
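The repository does not state which DeepLabV3 implementation backs `train.py`. As a stand-in sketch, here is how a torchvision DeepLabV3 could be constructed and restored from `deeplabv3_model.pth`, assuming the checkpoint stores a plain state dict and `NUM_CLASSES` covers background plus instrument:

```python
import torch
from torchvision.models.segmentation import deeplabv3_resnet50

NUM_CLASSES = 2  # assumption: background + surgical instrument

# Build an untrained DeepLabV3 head on a ResNet-50 backbone (torchvision >= 0.13 API).
model = deeplabv3_resnet50(weights=None, num_classes=NUM_CLASSES)

# Restore saved weights; if the checkpoint holds a full pickled model rather
# than a state dict, torch.load() alone would return it directly.
state = torch.load("./model_development/deeplabv3_model.pth", map_location="cpu")
model.load_state_dict(state)
model.eval()
```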
## Phase 3: Domain Adaptation

- Prepare and organize datasets:

  ```bash
  python ./domain_adaptation/adapting_with_cyclegan.py
  ```

- Train a CycleGAN model for translating synthetic images into real-style images:

  ```bash
  python ./pytorch-CycleGAN-and-pix2pix/train.py --dataroot ./domain_adaptation/data --name synthetic2real
  ```

- Fine-tune the segmentation model using the domain-adapted dataset (see the sketch after this list):

  ```bash
  python ./domain_adaptation/finetuning.py -i <images_dir> -m <masks_dir>
  ```

Refer to the Phase 3 README for details.
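To illustrate what this fine-tuning step involves, here is a hedged sketch of a training loop over domain-adapted `(image, mask)` pairs. The `finetune` helper, optimizer, batch size, and the torchvision-style `"out"` output dict are assumptions rather than the actual `finetuning.py` logic:

```python
import torch
from torch.utils.data import DataLoader

def finetune(model, dataset, epochs=10, lr=1e-4, device="cuda"):
    """Fine-tune a segmentation model on a dataset of (image, mask) pairs."""
    loader = DataLoader(dataset, batch_size=8, shuffle=True)
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    loss_fn = torch.nn.CrossEntropyLoss()
    model.train().to(device)
    for _ in range(epochs):
        for images, masks in loader:
            images, masks = images.to(device), masks.to(device).long()
            logits = model(images)["out"]  # (N, C, H, W) logits, torchvision-style
            loss = loss_fn(logits, masks)  # masks: (N, H, W) integer class labels
            opt.zero_grad()
            loss.backward()
            opt.step()
    return model
```

A low learning rate relative to the initial training run is the usual choice here, since the goal is to adapt the already-trained weights to the translated domain rather than retrain from scratch.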
## Outputs

- Phase 1: Synthetic dataset in HDF5 and JPEG formats.
- Phase 2: Trained segmentation model (`deeplabv3_model.pth`) and fine-tuned model (`deeplabv3_model_tuned.pth`).
- Phase 3: Domain-adapted model (`deeplabv3_model_adapted.pth`).

Predicted results are saved in the specified `output_dir` during inference.
## Notes

- Use a GPU for training and fine-tuning to improve performance.
- Phase-specific READMEs provide additional implementation details, insights, and challenges encountered.
- Further refinements could include experimenting with additional domain adaptation techniques or expanding the synthetic dataset's diversity.