Tool to align a genome sequence to a reference genome sequence using the Parasail library and Edlib Library.
C++ Implementation:
Dependencies:
- Edlib
- Parasail
- g++
- OpenMP
- C++14
Usage: ./align [OPTIONS] --reads reads.fastq --control control_oligos.txt --output_file output_file.txt --parasail
-p, --print_alignment display best alignment for every read --version display version --help display this help and exit -r, --reads=FILE the ONT reads are in fastq FILE -c, --control=FILE the control oligos that are in a text FILE -f, --output_file=FILE the file to which the score will be output to --table_out the output to file will be in a tabular format including only the score and names of the read and the control --align_out the output to file will be in an alignment format which will include the read, control oligo and the SAM cigar --SAM_out the output to file will be a SAM file -t, --threads=NUM use NUM threads (default: 1) --parasail use the parasail library to generate the alignment sequence --edlib use the edlib library to produce the alignment sequence
Mandatory Fields:
- --reads
- --control
- --parasail or --edlib
Compilation:
- Clone/donwload the repository into your drive.
- Change the working directory to the one you just cloned.
- Make sure you have g++ and all dependencies installed.
- Run "make"
- Run the tool using the usage statement provided above.
Python Implementation:
Dependencies:
- Parasail
- pysam
- python2.7
Compilation:
- Run the command python "python align.py
Coded in Python and C++.