GitHub - jbelyeu/XPRESSpipe: An alignment and analysis pipeline for RNAseq data

An alignment and analysis pipeline for RNAseq data

Please refer to the documentation for more in-depth details.

Development Notes:

XPRESSpipe is still in beta
The current release, XPRESSpipe-v0.1.4b2, runs relatively stably on macOS and Linux (including HPCs)
- Meta-analysis plotting appears to work on a local macOS machine but not on a Linux HPC; the kinks will hopefully be worked out in the next couple of weeks
- UMI handling, representative gene housekeeping, and some other features are not yet incorporated but are planned for the near future

Citation:

Berg, JA, et al. (2019). XPRESSyourself: Automating and Democratizing High-Throughput Sequencing. https://github.com/XPRESSyourself.

Installation:

Installing from source

Installation requires Python (distributed with most operating systems automatically) and setuptools. If you have a more current version of Python, you can install setuptools as follows:

$ pip install setuptools

If this does not work, please refer to this site for more information

Get XPRESSpipe by downloading and unpacking the most recent archive found here

Unzip the folder and navigate to the appropriate directory in the command line

$ cd /path/to/XPRESSpipe

Install XPRESSpipe

$ python setup.py install

Test the installation

$ xpresspipe -h

If the help menu is not displayed, try adding the path where you installed XPRESSpipe to the system PATH

$ echo 'export PATH=$PATH:/path/to/xpresspipe' >> ~/.bash_profile

If you do not have a file named ~/.bash_profile, try looking for one called ~/.profile
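Before editing a profile file, it can help to confirm whether the shell already finds the command. The following is a minimal Python sketch using only the standard library. The helper name `locate_command` is hypothetical and is not part of XPRESSpipe.

```python
import shutil


def locate_command(name="xpresspipe"):
    """Return the full path of `name` if it is on PATH, otherwise None."""
    # shutil.which searches the directories listed in the PATH variable.
    return shutil.which(name)


if __name__ == "__main__":
    path = locate_command()
    if path is None:
        print("xpresspipe was not found on PATH; the install directory may need to be added")
    else:
        print("xpresspipe found at", path)
```

If the command is reported as missing, open a new terminal after changing the profile file, since the updated PATH only applies to shells started afterwards.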

Using a Docker container

Install Docker
Download the XPRESSpipe Docker container

$ docker pull jordanberg/xpresspipe:latest

Run the Docker container

$ docker run jordanberg/xpresspipe --help

QuickStart:

$ xpresspipe riboprof -i /path/to/raw/data/ -o /path/to/output/ -r /path/to/reference/ ...
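When the QuickStart call is launched from a script, assembling it as an argument list avoids quoting problems with paths. The sketch below covers only the three flags shown above (`-i`, `-o`, `-r`); the trailing `...` in the command means further arguments may be required, and those are deliberately not filled in here. The function name `build_riboprof_command` is hypothetical.

```python
import shlex


def build_riboprof_command(input_dir, output_dir, reference_dir):
    """Assemble the QuickStart call as an argument list (only -i, -o, -r)."""
    return [
        "xpresspipe", "riboprof",
        "-i", input_dir,      # directory with raw sequence data
        "-o", output_dir,     # empty output directory
        "-r", reference_dir,  # reference directory
    ]


if __name__ == "__main__":
    cmd = build_riboprof_command(
        "/path/to/raw/data/", "/path/to/output/", "/path/to/reference/"
    )
    # Print the command for review; it is not executed here.
    print(shlex.join(cmd))
```

The resulting list could be passed to `subprocess.run(cmd, check=True)` once XPRESSpipe is installed and any additional required arguments have been appended.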

Important Notes:

Basic Starting Input

An input directory with raw sequence data
- Sequence data files should be in FASTQ format, end in .fastq or .fq, and may be .zip or .gz compressed
An empty output directory
A reference directory (see documentation for curateReference for more details)
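The input requirements above can be checked before starting a run. This is a rough sketch, not part of XPRESSpipe: the helper names are hypothetical, and it assumes compressed files keep the FASTQ extension followed by `.gz` or `.zip` (for example `sample.fastq.gz`).

```python
from pathlib import Path

# Assumed suffixes: .fastq or .fq, optionally followed by .gz or .zip.
FASTQ_SUFFIXES = (
    ".fastq", ".fq",
    ".fastq.gz", ".fq.gz",
    ".fastq.zip", ".fq.zip",
)


def split_input_files(input_dir):
    """Return (fastq_files, other_entries) as sorted lists of names."""
    fastq, other = [], []
    for entry in sorted(Path(input_dir).iterdir()):
        if entry.is_file() and entry.name.endswith(FASTQ_SUFFIXES):
            fastq.append(entry.name)
        else:
            other.append(entry.name)
    return fastq, other


def output_dir_is_empty(output_dir):
    """Return True if the output directory contains no entries."""
    return not any(Path(output_dir).iterdir())
```

A non-empty `other_entries` list points to files or subfolders that do not look like sequence data and may need to be moved out of the input directory.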

Naming Conventions

For output to be ordered after alignment (except for generation of a raw counts table), the recommended file naming conventions should be followed.

Download your raw sequence data and place it in a folder. This folder should contain all the sequence data and nothing else.
Make sure files follow a consistent naming pattern. For example, if you had 3 genetic backgrounds of ribosome profiling data, the naming scheme would go as follows:

ExperimentName_BackgroundA_FP.fastq(.gz)
ExperimentName_BackgroundA_RNA.fastq(.gz)
ExperimentName_BackgroundB_FP.fastq(.gz)
ExperimentName_BackgroundB_RNA.fastq(.gz)
ExperimentName_BackgroundC_FP.fastq(.gz)
ExperimentName_BackgroundC_RNA.fastq(.gz)

If samples are replicates, the replicate number needs to be indicated in the file name.
If you want the final count table to be in a particular order and that order is not alphabetical, prepend a letter to the sample name to force this ordering.

ExperimentName_a_WT.fastq(.gz)
ExperimentName_a_WT.fastq(.gz)
ExperimentName_b_exType.fastq(.gz)
ExperimentName_b_exType.fastq(.gz)

If you have replicates:

ExperimentName_a_WT_1.fastq(.gz)
ExperimentName_a_WT_1.fastq(.gz)
ExperimentName_a_WT_2.fastq(.gz)
ExperimentName_a_WT_2.fastq(.gz)
ExperimentName_b_exType_1.fastq(.gz)
ExperimentName_b_exType_1.fastq(.gz)
ExperimentName_b_exType_2.fastq(.gz)
ExperimentName_b_exType_2.fastq(.gz)
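To preview how a set of file names would be ordered, the names can be sorted before running the pipeline. The sketch below assumes the ordering is a plain alphabetical sort of file names; that is an assumption drawn from the note above, not a confirmed detail of how XPRESSpipe orders its count table. The function name `preview_order` is hypothetical.

```python
def preview_order(file_names):
    """Return file names in plain alphabetical order (assumed ordering)."""
    return sorted(file_names)


if __name__ == "__main__":
    names = [
        "ExperimentName_b_exType_2.fastq.gz",
        "ExperimentName_a_WT_2.fastq.gz",
        "ExperimentName_b_exType_1.fastq.gz",
        "ExperimentName_a_WT_1.fastq.gz",
    ]
    # The a_/b_ prefixes decide the order regardless of the sample labels.
    for name in preview_order(names):
        print(name)
```

The letter prefix makes the order independent of how upper and lower case letters compare: without it, `WT` and `exType` could sort differently depending on whether the comparison is case sensitive.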

