You Tang([email protected])
Yao Zhou
JDK1.8 or above. It can be downloaded at:
http://www.oracle.com/technetwork/java/javase/downloads/jdk8-downloads-2133151.html
back to top
LDkit is free of installation.
GUI package is under the GUI folder, please double-click the LDkit_GUI.jar to start.
Executable file LDkit.jar for command line users is on the executable folder
Both PLINK ped and map format and VCF format are supported. VCF format could be compressed or uncompressed.
Subgroup should be formatted as:
[subgroup1Name]:sample1,sample2,sample3…
[subgroup2Name]:sample1,sample2,sample3…
GUI of LDkit is very easy to use. The main interface is like below:
1 Genotype file could be dragged into the window;
2 Multiple genotype files could be put in the same folder, and then choose the folder as input
3 Other files must choose from disk.
Window size: max distance between two variants (kb) for LD decay.
Missing rate: max ration of missing allele in the population.
Thread num: number of threads. Default is half of the available resources.
MAF: minor allele frequency
Output file: save output.
Three types of LD analysis are supported by LDkit.
LD site refer to the LD between a given site and a given region.
-
This step could be skipped if you want to plot with other software.
-
If you want to plot with previous results, you could just input the previous results and adjust the parameters here. You needn't to run step1-step3 again.
InFile: none or previous results generated by LDkit.
Merge: if your input is a folder with multiple files, you could merge them all together as one population.
Mergechr: If your input is a file with multiple chromosomes, you could plot each chromosome by choosing no.
Bin: the size of bin for calculating mean r² or D’.
ResultName: file name for output.
r² or D’ could be chose here.
back to top
Before you click start, you could check your parameters at right bottom.
back to top
After clicking the start button, the dynamic DNA strand shown above will run.
-
If your input is a folder, you should make sure there is only file format. If more than one format in the folder, only the first appeared one will be used;
-
Do not support multiple files input for PLINK format;
-
PLINK format must be .ped and .map file suffix;
java -jar LDkit.jar --infile [input files] --output [output file] [parameters]
Parameters:
--infile: input file or folder
--out: output file
--ws: max distance between two variants (kb) for LD decay. Default is 100 Kb.
--subpop: input of subgroup files;
--chr: chromosome name if you just want to calculate one or some of them. Multiple chromosomes should be separated by comma. Default is all.
--maf: minor allele frequency filter. Default is 0.005;
--threads: number of threads, default is 1.
--type: measurements of LD. 1 for r-sqaure, 2 for D prime. Default is 1.
--Intermediate save the LD file for LD block or not. Default is no.
--block: chr:start-end. Region for LD block or LD site. For example: chr1:1000-20000;
--site: chr:start-end chr:site. Given site for LD site. For example: chr1:1000-2000 chr1:24556--h: help
java -jar LDkit.jar --plot --inp [input files] [parameters]
Parameters:
--inp: input file for plot
--merge: plot all subgroups in one figure or not. Default is yes.
--mergechr: plot all chromosomes or not. Default is yes.
--bin: the size of bin for calculating mean r² or D'.
- LD decay for one population
- LD decay for partial chromosomes in a population
- LD decay for multiple subpopulations
- LD block analysis
- LD site analysis