Malaria haplotype frequency estimation. 2013

Leonore Wigger, and Julia E Vogt, and Volker Roth
Swiss Institute of Bioinformatics, University of Lausanne, Lausanne, Switzerland. leonore.wigger@unil.ch

We present a Bayesian approach for estimating the relative frequencies of multi-single nucleotide polymorphism (SNP) haplotypes in populations of the malaria parasite Plasmodium falciparum by using microarray SNP data from human blood samples. Each sample comes from a malaria patient and contains one or several parasite clones that may genetically differ. Samples containing multiple parasite clones with different genetic markers pose a special challenge. The situation is comparable with a polyploid organism. The data from each blood sample indicates whether the parasites in the blood carry a mutant or a wildtype allele at various selected genomic positions. If both mutant and wildtype alleles are detected at a given position in a multiply infected sample, the data indicates the presence of both alleles, but the ratio is unknown. Thus, the data only partially reveals which specific combinations of genetic markers (i.e. haplotypes across the examined SNPs) occur in distinct parasite clones. In addition, SNP data may contain errors at non-negligible rates. We use a multinomial mixture model with partially missing observations to represent this data and a Markov chain Monte Carlo method to estimate the haplotype frequencies in a population. Our approach addresses both challenges, multiple infections and data errors.

UI MeSH Term Description Entries
D008390 Markov Chains A stochastic process such that the conditional probability distribution for a state at any future instant, given the present state, is unaffected by any additional knowledge of the past history of the system. Markov Process,Markov Chain,Chain, Markov,Chains, Markov,Markov Processes,Process, Markov,Processes, Markov
D009010 Monte Carlo Method In statistics, a technique for numerically approximating the solution of a mathematical problem by studying the distribution of some random variable, often generated by a computer. The name alludes to the randomness characteristic of the games of chance played at the gambling casinos in Monte Carlo. (From Random House Unabridged Dictionary, 2d ed, 1993) Method, Monte Carlo
D010219 Papua New Guinea A country consisting of the eastern half of the island of New Guinea and adjacent islands, including New Britain, New Ireland, the Admiralty Islands, and New Hanover in the Bismarck Archipelago; Bougainville and Buka in the northern Solomon Islands; the D'Entrecasteaux and Trobriand Islands; Woodlark (Murua) Island; and the Louisiade Archipelago. It became independent on September 16, 1975. Formerly, the southern part was the Australian Territory of Papua, and the northern part was the UN Trust Territory of New Guinea, administered by Australia. They were administratively merged in 1949 and named Papua and New Guinea, and renamed Papua New Guinea in 1971. New Guinea, East,New Guinea, Papua
D010963 Plasmodium falciparum A species of protozoa that is the causal agent of falciparum malaria (MALARIA, FALCIPARUM). It is most prevalent in the tropics and subtropics. Plasmodium falciparums,falciparums, Plasmodium
D003627 Data Interpretation, Statistical Application of statistical procedures to analyze specific observed or assumed facts from a particular study. Data Analysis, Statistical,Data Interpretations, Statistical,Interpretation, Statistical Data,Statistical Data Analysis,Statistical Data Interpretation,Analyses, Statistical Data,Analysis, Statistical Data,Data Analyses, Statistical,Interpretations, Statistical Data,Statistical Data Analyses,Statistical Data Interpretations
D006239 Haplotypes The genetic constitution of individuals with respect to one member of a pair of allelic genes, or sets of genes that are closely linked and tend to be inherited together such as those of the MAJOR HISTOCOMPATIBILITY COMPLEX. Haplotype
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm
D000818 Animals Unicellular or multicellular, heterotrophic organisms, that have sensation and the power of voluntary movement. Under the older five kingdom paradigm, Animalia was one of the kingdoms. Under the modern three domain model, Animalia represents one of the many groups in the domain EUKARYOTA. Animal,Metazoa,Animalia
D014644 Genetic Variation Genotypic differences observed among individuals in a population. Genetic Diversity,Variation, Genetic,Diversity, Genetic,Diversities, Genetic,Genetic Diversities,Genetic Variations,Variations, Genetic

Related Publications

Leonore Wigger, and Julia E Vogt, and Volker Roth
August 1998, Human immunology,
Leonore Wigger, and Julia E Vogt, and Volker Roth
January 2002, Human heredity,
Leonore Wigger, and Julia E Vogt, and Volker Roth
November 2003, Annals of human genetics,
Leonore Wigger, and Julia E Vogt, and Volker Roth
January 2003, Human heredity,
Leonore Wigger, and Julia E Vogt, and Volker Roth
August 2019, Bioinformatics (Oxford, England),
Leonore Wigger, and Julia E Vogt, and Volker Roth
October 2001, American journal of human genetics,
Leonore Wigger, and Julia E Vogt, and Volker Roth
October 2012, BMC genetics,
Leonore Wigger, and Julia E Vogt, and Volker Roth
March 2010, Journal of computational biology : a journal of computational molecular cell biology,
Leonore Wigger, and Julia E Vogt, and Volker Roth
January 2007, Human heredity,
Leonore Wigger, and Julia E Vogt, and Volker Roth
January 2002, Human heredity,
Copied contents to your clipboard!