Skip to content

Flavia95/VGpop

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

VGpop

The aim is drive standard population genetic analyses using pangenomic data models. Typically represented in the Graphical Fragment Assembly (GFA) format, these models can represent whole genome alignments in a compact graphical structure.

I have focused on algorithms for bubble detection that allow us to generate Variant Call Format (VCF) files from graphs.

I use this projection to drive standard population genetic analyzes, and as a mechanism to validation results that I obtain from pangenome graph based population genetic analyzes which I am designing.

Component Required

ODGI

  • Building from GitHub odgi

  • Download from Bioconda odgi

Download

Clone the source locally:

git clone https://github.com/Flavia95/VGpop.git
cd VGpop

SCRIPT

Before run the script GfatoVcf.py and bubblepop.py, you need to:

  1. Convert the GFA format to the ODGI format

By example:

odgi build -g samplePath3.gfa -o samplePath3.odgi

The odgi format as input to the script.

  1. With -path specify the path of the odgi library and with -input specify the input file as odgi format
python GfatoVcf.py -path /../odgi/lib/ -input /../samplePath3.odgi

FUNCTIONS

These functions are here:

1. num_sequences
    arg1: matrix
2. num_segregating_sites
    arg1: matrix
    arg1: matrix
    arg2: collections
    arg1: matrix
5. avg_num_pairwise_differences
    arg1: matrix

These functions are here:

    arg1: num_sequences
    arg2: num_segregating_sites
    arg3: avg_num_pairwise_differences 
7. gfa2fst

Author

Flavia Villani

License

MIT

About

Population genetics analysis on VG

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published