IPED2X: a robust pedigree reconstruction algorithm for complicated pedigrees. 2014

Dan He, and Eleazar Eskin
IBM T.J. Watson Research, Yorktown Heights, USA.

Reconstruction of family trees, or pedigree reconstruction, for a group of individuals is a fundamental problem in genetics. Some recent methods have been developed to reconstruct pedigrees using genotype data only. These methods are accurate and efficient for simple pedigrees which contain only siblings, where two individuals share the same pair of parents. A most recent method IPED2 is able to handle complicated pedigrees with half-sibling relationships, where two individuals share only one parent. However, the method is shown to miss many true positive half-sibling relationships as it removes all suspicious half-sibling relationships during the parent construction process. In this work, we propose a novel method IPED2X, which deploys a more robust algorithm for parent construction in the pedigrees by considering more possible operations rather than simple deletion. We convert the parent construction problem into a graph labeling problem and propose a more effective labeling algorithm. We show in our experiments that IPED2X is more powerful on capturing the true half-sibling relationships, which further leads to better reconstruction accuracy.

UI MeSH Term Description Entries
D008957 Models, Genetic Theoretical representations that simulate the behavior or activity of genetic processes or phenomena. They include the use of mathematical equations, computers, and other electronic equipment. Genetic Models,Genetic Model,Model, Genetic
D010375 Pedigree The record of descent or ancestry, particularly of a particular condition or trait, indicating individual family members, their relationships, and their status with respect to the trait or condition. Family Tree,Genealogical Tree,Genealogic Tree,Genetic Identity,Identity, Genetic,Family Trees,Genealogic Trees,Genealogical Trees,Genetic Identities,Identities, Genetic,Tree, Family,Tree, Genealogic,Tree, Genealogical,Trees, Family,Trees, Genealogic,Trees, Genealogical
D005838 Genotype The genetic constitution of the individual, comprising the ALLELES present at each GENETIC LOCUS. Genogroup,Genogroups,Genotypes
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm
D012984 Software Sequential operating programs and data which instruct the functioning of a digital computer. Computer Programs,Computer Software,Open Source Software,Software Engineering,Software Tools,Computer Applications Software,Computer Programs and Programming,Computer Software Applications,Application, Computer Software,Applications Software, Computer,Applications Softwares, Computer,Applications, Computer Software,Computer Applications Softwares,Computer Program,Computer Software Application,Engineering, Software,Open Source Softwares,Program, Computer,Programs, Computer,Software Application, Computer,Software Applications, Computer,Software Tool,Software, Computer,Software, Computer Applications,Software, Open Source,Softwares, Computer Applications,Softwares, Open Source,Source Software, Open,Source Softwares, Open,Tool, Software,Tools, Software

Related Publications

Dan He, and Eleazar Eskin
March 2003, Theoretical population biology,
Dan He, and Eleazar Eskin
October 1987, Clinical genetics,
Dan He, and Eleazar Eskin
November 2003, Journal of forensic sciences,
Dan He, and Eleazar Eskin
February 2013, Theoretical population biology,
Dan He, and Eleazar Eskin
March 1987, IEEE transactions on pattern analysis and machine intelligence,
Dan He, and Eleazar Eskin
January 1992, Cytogenetics and cell genetics,
Dan He, and Eleazar Eskin
May 2015, Bioinformatics (Oxford, England),
Dan He, and Eleazar Eskin
November 2021, Bioinformatics (Oxford, England),
Copied contents to your clipboard!