Skip to content

cebile-lekh/cps_nf

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CPS Extractor Pipeline

This pipeline is in the early stages of development and is not fully tested!

Nextflow run with docker run with singularity Launch on Nextflow Tower

The CPS extractor pipeline is a Nextflow pipeline designed for processing Streptococcus pneumoniae FASTA sequences to extract the capsular locus sequence (CPS) and check for disruptive mutations

The pipeline is designed to be easy to set up and use, and is suitable for use on local machines and high-performance computing (HPC) clusters alike. Once you have downloaded the necessary docker/singularity images the pipeline can be used offline unless you have changed the selection of any database or container image.

The development of this pipeline is part of the GPS Project (Global Pneumococcal Sequencing Project).

 

Table of contents

 

Usage

Requirement

Accepted Inputs

  • Only Illumina paired-end short reads are supported
  • Each sample is expected to be a pair of raw reads following this file name pattern:
    • *.{fa,fasta}

Profile

  • By default, Docker is used as the container engine and all the processes are executed by the local machine. To change this, you could use Nextflow's built-in -profile option to switch to other available profiles

    ℹ️ -profile is a built-in Nextflow option, it only has one leading -

    nextflow run . -profile [profile name]
    
  • Available profiles:
    Profile Name Details
    standard
    (Default)
    Docker is used as the container engine.
    Processes are executed locally.
    singularity Singularity is used as the container engine.
    Processes are executed locally.
    lsf The pipeline should be launched from a LSF cluster head node with this profile.
    Singularity is used as the container engine.
    Processes are submitted to your LSF cluster via bsub by the pipeline.
    (Tested on Wellcome Sanger Institute farm5 LSF cluster only)

Options

|Usage:
|nextflow run . [option] [value]
|
|--bakta_threads [INT]           Threads used for bakta. Default: 4
|--bakta_db [PATH]               Path to bakta database. Default: /data/pam/software/bakta/v5
|--blastdb [PATH]                Path to blast database. Default: cps_blastdb
|--input [PATH]                  Path to the input directory that contains the sequences to be processed {.fa,.fasta}. Default: input
|--output [PATH]                 Path to the output directory that save the results. Default: output
|--prodigal-training-file [PATH] Path to prodigal training file used in annotation. Default: all.trn
|--version                       Alternative workflow for getting versions of pipeline, container images, tools and databases
|--help                          Print this help message

Releases

No releases published

Packages

No packages published

Languages

  • Nextflow 65.5%
  • Python 30.8%
  • Shell 3.7%