tidytcells is a lightweight Python package that cleans and standardizes T cell receptor (TR) and major histocompatibility (MH) data to be IMGT-compliant.
The main purpose of the package is to make it easier to parse and collate non-standardized TR datasets.
Compiling TR data from multiple sources is often difficult because each dataset encodes TR and MH gene names in a slightly different format or nomenclature, and some datasets are not even internally consistent.
tidytcells can ameliorate this issue by auto-correcting and auto-standardizing your data.
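To make the problem concrete, here is a minimal sketch of what collating differently formatted TR gene names involves. The helper `toy_standardize` and its regular expression are hypothetical and written only for this illustration. They are not part of tidytcells and do not reflect how it works internally. In particular, this toy version only reshapes strings and never checks them against the IMGT reference.

```python
import re
from typing import Optional

# Hypothetical pattern, for illustration only (NOT tidytcells' algorithm).
# Accepts a "TR" or "TCR" prefix, a chain letter, a segment letter, then
# optionally zero-padded family, member, and allele numbers.
_GENE_PATTERN = re.compile(
    r"^(?:TCR|TR)([ABGD])([VDJC])0*(\d+)(?:-0*(\d+))?(?:\*0*(\d+))?$"
)


def toy_standardize(symbol: str) -> Optional[str]:
    """Reshape a TR gene symbol into an IMGT-like form, or return None."""
    # Drop whitespace and underscores, and ignore letter case.
    cleaned = re.sub(r"[\s_]", "", symbol).upper()
    match = _GENE_PATTERN.match(cleaned)
    if match is None:
        return None  # Not recognised by this toy pattern.
    chain, segment, family, member, allele = match.groups()
    name = f"TR{chain}{segment}{family}"
    if member is not None:
        name += f"-{member}"
    if allele is not None:
        name += f"*{int(allele):02d}"  # Alleles are written with two digits.
    return name


raw_symbols = ["TRAV1-2*01", "TCRBV05-01*01", "trbv 20-1", "unknown"]
print([toy_standardize(s) for s in raw_symbols])
# ['TRAV1-2*01', 'TRBV5-1*01', 'TRBV20-1', None]
```

A pattern like this breaks down quickly on real data, for example on symbols it has never seen or on names that look well formed but do not exist in the reference, which is why a maintained package is preferable to ad hoc string handling.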
Check out the documentation page.
Via PyPI (recommended)
tidytcells can be installed using pip:
$ pip install tidytcells
From source
The source code for the package is available on GitHub. To install from source, clone the Git repository and run:
$ pip install .
from inside the project root directory.
To cite tidytcells, please refer to this manuscript.