Efficient Identification of Null-Allele Single Nucleotide Polymorphism Markers. 2015

Umut Özbek, and Eleanor Feingold, and Daniel E Weeks
Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, N.Y., USA.

OBJECTIVE At the beginning of a genome-wide association study, many markers are discarded because they fail to meet standard quality control criteria. Some of these markers are out of Hardy-Weinberg equilibrium (HWE) because they have 'null alleles' (which may be deletions or third alleles that do not hybridize to standard probes). It may be useful to identify null-allele markers so that they can be analyzed under different models or in order to explore regions of copy number variation. METHODS We present a model for the chip-based genotype data that are produced when a null-allele single nucleotide polymorphism (SNP) is genotyped under standard (2-allele) assumptions. We show that this model can be combined with the standard HWE model to develop classification procedures based on the supervised learning algorithms Support Vector Machines (SVM), Classification and Regression Trees (CART) or Random Forests for identifying null-allele SNPs. RESULTS We report a list of null-allele SNPs we identified on the Illumina 660W-Quad chip and provide suggestions for applying our CART model to other SNP sets. CONCLUSIONS Properly identified null-allele SNPs can be used to test for genotype-phenotype associations or to identify regions which may contain copy number variants.

UI MeSH Term Description Entries
D008957 Models, Genetic Theoretical representations that simulate the behavior or activity of genetic processes or phenomena. They include the use of mathematical equations, computers, and other electronic equipment. Genetic Models,Genetic Model,Model, Genetic
D005819 Genetic Markers A phenotypically recognizable genetic trait which can be used to identify a genetic locus, a linkage group, or a recombination event. Chromosome Markers,DNA Markers,Markers, DNA,Markers, Genetic,Genetic Marker,Marker, Genetic,Chromosome Marker,DNA Marker,Marker, Chromosome,Marker, DNA,Markers, Chromosome
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D000483 Alleles Variant forms of the same gene, occupying the same locus on homologous CHROMOSOMES, and governing the variants in production of the same gene product. Allelomorphs,Allele,Allelomorph
D015233 Models, Statistical Statistical formulations or analyses which, when applied to data and found to fit the data, are then used to verify the assumptions and parameters used in the analysis. Examples of statistical models are the linear model, binomial model, polynomial model, two-parameter model, etc. Probabilistic Models,Statistical Models,Two-Parameter Models,Model, Statistical,Models, Binomial,Models, Polynomial,Statistical Model,Binomial Model,Binomial Models,Model, Binomial,Model, Polynomial,Model, Probabilistic,Model, Two-Parameter,Models, Probabilistic,Models, Two-Parameter,Polynomial Model,Polynomial Models,Probabilistic Model,Two Parameter Models,Two-Parameter Model
D055106 Genome-Wide Association Study An analysis comparing the allele frequencies of all available (or a whole GENOME representative set of) polymorphic markers to identify gene candidates or quantitative trait loci associated with a specific organism trait or specific disease or condition. Genome Wide Association Analysis,Genome Wide Association Study,GWA Study,Genome Wide Association Scan,Genome Wide Association Studies,Whole Genome Association Analysis,Whole Genome Association Study,Association Studies, Genome-Wide,Association Study, Genome-Wide,GWA Studies,Genome-Wide Association Studies,Studies, GWA,Studies, Genome-Wide Association,Study, GWA,Study, Genome-Wide Association
D020641 Polymorphism, Single Nucleotide A single nucleotide variation in a genetic sequence that occurs at appreciable frequency in the population. SNPs,Single Nucleotide Polymorphism,Nucleotide Polymorphism, Single,Nucleotide Polymorphisms, Single,Polymorphisms, Single Nucleotide,Single Nucleotide Polymorphisms

Related Publications

Umut Özbek, and Eleanor Feingold, and Daniel E Weeks
December 2011, Anticancer research,
Umut Özbek, and Eleanor Feingold, and Daniel E Weeks
April 2010, Cancer science,
Umut Özbek, and Eleanor Feingold, and Daniel E Weeks
June 2002, Tissue antigens,
Umut Özbek, and Eleanor Feingold, and Daniel E Weeks
January 2015, Horticulture research,
Umut Özbek, and Eleanor Feingold, and Daniel E Weeks
May 1999, Genome research,
Umut Özbek, and Eleanor Feingold, and Daniel E Weeks
September 2003, Drug metabolism and disposition: the biological fate of chemicals,
Umut Özbek, and Eleanor Feingold, and Daniel E Weeks
March 2005, Journal of biotechnology,
Umut Özbek, and Eleanor Feingold, and Daniel E Weeks
January 2014, Horticulture research,
Copied contents to your clipboard!