Software to analyze multi-locus genotype data for entire populations is useful for estimating haplotype frequencies, deviation from Hardy-Weinberg equilibrium and patterns of linkage disequilibrium. These statistical results are important to both those interested in human genome variation and disease predisposition as well as evolutionary genetics. As part of the 13th International Histocompatibility and Immunogenetics Working Group (IHWG), we have developed a software framework (PyPop). The primary novelty of this package is that it allows integration of statistics across large numbers of data-sets by heavily utilizing the XML file format and the R statistical package to view graphical output, while retaining the ability to inter-operate with existing software. Largely developed to address human population data, it can, however, be used for population based data for any organism. We tested our software on the data from the 13th IHWG which involved data sets from at least 50 laboratories each of up to 1000 individuals with 9 MHC loci (both class I and class 11) and found that it scales to large numbers of data sets well.
References in zbMATH (referenced in 1 article , 1 standard article )
Showing result 1 of 1.
- Lancaster, Alex; Nelson, Mark P.; Meyer, Diogo; Thomson, Glenys; Single, Richard M.: PyPop: a software framework for population genomic: analyzing large-scale multi-locus genotype data (2002)