Avatar paper

Description

Avatar paper is an analysis project that lets researchers view, reproduce, or explore the figures and data of the paper "Patient-centric synthetic data generation, no reason to risk re-identification in biomedical data analysis". The paper presents the avatarization method.

We strongly recommend reading the scientific paper before exploring the repository.

Prerequisites

Install Poetry (a Python package and dependency management tool).

This repo uses Git LFS (Large File Storage) to handle large datasets. Make sure to install it for your platform, following the instructions at https://git-lfs.github.com/

Install

Run:

make install

R packages

All required R packages will be installed using the librarian R package.

How to use it

This repository consists mainly of Jupyter notebooks and datasets.

It is written in Python and R.

1. Run git lfs pull to download the large files with Git LFS. This can take around 30 minutes.
2. Run the following command in a terminal: make notebook
3. Open a notebook, such as 0.main.ipynb.
4. Run the cells.
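
Before running the notebooks, it can help to confirm that git lfs pull actually replaced the pointer files with the real data. The sketch below is not part of the repository. It relies only on the fact that a Git LFS pointer file is a small text file whose first line is "version https://git-lfs.github.com/spec/v1". The function names and the choice to scan datasets/ are illustrative assumptions.

```python
from pathlib import Path

# First bytes of every Git LFS pointer file (Git LFS pointer format, v1).
LFS_POINTER_PREFIX = b"version https://git-lfs.github.com/spec/v1"


def is_lfs_pointer(path: Path) -> bool:
    """Return True if `path` still holds an LFS pointer instead of real data."""
    with path.open("rb") as handle:
        return handle.read(len(LFS_POINTER_PREFIX)) == LFS_POINTER_PREFIX


def unpulled_files(root: Path) -> list[Path]:
    """List files under `root` that git lfs pull has not downloaded yet."""
    return sorted(p for p in root.rglob("*") if p.is_file() and is_lfs_pointer(p))


if __name__ == "__main__":
    # "datasets" is the folder named in the Structure section (assumed location).
    missing = unpulled_files(Path("datasets"))
    if missing:
        print(f"{len(missing)} file(s) are still LFS pointers; run: git lfs pull")
    else:
        print("No LFS pointer files found under datasets/.")
```

If the script reports pointer files, the notebooks will most likely fail when reading the data, so rerun git lfs pull first.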

Structure

datasets/
   AIDS/          # original and avatarized AIDS datasets
   WBCD/          # original and avatarized WBCD datasets
   results_df/    # computationally expensive analysis results.

notebooks/        # analysis and graph generation

metrics/
   privacy_metrics/       # functions used to compute avatarization metrics

figures/           # figures presented in the article
color.csv         # colors for the figures
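
To see which files each dataset folder contains once the data has been pulled, a short directory walk is enough. This is a minimal sketch assuming the layout shown above. The helper name files_by_folder is hypothetical, and no particular file names or formats are assumed.

```python
from pathlib import Path


def files_by_folder(root: Path) -> dict[str, list[str]]:
    """Map each immediate subfolder of `root` to the sorted files it contains."""
    listing: dict[str, list[str]] = {}
    for folder in sorted(p for p in root.iterdir() if p.is_dir()):
        listing[folder.name] = sorted(
            str(f.relative_to(folder)) for f in folder.rglob("*") if f.is_file()
        )
    return listing


if __name__ == "__main__":
    root = Path("datasets")  # folder named in the structure above
    if root.is_dir():
        for name, files in files_by_folder(root).items():
            print(f"{name}/: {len(files)} file(s)")
            for file_name in files:
                print(f"   {file_name}")
```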

Contributing

Feel free to rerun the analysis or explore the avatarized datasets.

License

This code is licensed under the Apache-2.0 license.

Contributors

Morgan Guillaudeux
Olivia Rousseau
Julien Petot
Pierre-Antoine Gourraud, corresponding author.

Cite this article

Guillaudeux, M., Rousseau, O., Petot, J. et al. Patient-centric synthetic data generation, no reason to risk re-identification in biomedical data analysis. npj Digit. Med. 6, 37 (2023). https://doi.org/10.1038/s41746-023-00771-5
