A tool for selecting SNPs for association studies based on observed linkage disequilibrium patterns. 2006

Francisco M De La Vega, and Hadar I Isaac, and Charles R Scafe
Applied Biosystems, 850 Lincoln Centre Dr., Foster City, CA 94404, USA. delavefm@appliedbiosystems.com

The design of genetic association studies using single-nucleotide polymorphisms (SNPs) requires the selection of subsets of the variants providing high statistical power at a reasonable cost. SNPs must be selected to maximize the probability that a causative mutation is in linkage disequilibrium (LD) with at least one marker genotyped in the study. The HapMap project performed a genome-wide survey of genetic variation with about a million SNPs typed in four populations, providing a rich resource to inform the design of association studies. A number of strategies have been proposed for the selection of SNPs based on observed LD, including construction of metric LD maps and the selection of haplotype tagging SNPs. Power calculations are important at the study design stage to ensure successful results. Integrating these methods and annotations can be challenging: the algorithms required to implement these methods are complex to deploy, and all the necessary data and annotations are deposited in disparate databases. Here, we present the SNPbrowser Software, a freely available tool to assist in the LD-based selection of markers for association studies. This stand-alone application provides fast query capabilities and swift visualization of SNPs, gene annotations, power, haplotype blocks, and LD map coordinates. Wizards implement several common SNP selection workflows including the selection of optimal subsets of SNPs (e.g. tagging SNPs). Selected SNPs are screened for their conversion potential to either TaqMan SNP Genotyping Assays or the SNPlex Genotyping System, two commercially available genotyping platforms, expediting the set-up of genetic studies with an increased probability of success.

UI MeSH Term Description Entries
D003196 Computer Graphics The process of pictorial communication, between human and computers, in which the computer input and output have the form of charts, drawings, or other appropriate pictorial representation. Computer Graphic,Graphic, Computer,Graphics, Computer
D005838 Genotype The genetic constitution of the individual, comprising the ALLELES present at each GENETIC LOCUS. Genogroup,Genogroups,Genotypes
D006239 Haplotypes The genetic constitution of individuals with respect to one member of a pair of allelic genes, or sets of genes that are closely linked and tend to be inherited together such as those of the MAJOR HISTOCOMPATIBILITY COMPLEX. Haplotype
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D012984 Software Sequential operating programs and data which instruct the functioning of a digital computer. Computer Programs,Computer Software,Open Source Software,Software Engineering,Software Tools,Computer Applications Software,Computer Programs and Programming,Computer Software Applications,Application, Computer Software,Applications Software, Computer,Applications Softwares, Computer,Applications, Computer Software,Computer Applications Softwares,Computer Program,Computer Software Application,Engineering, Software,Open Source Softwares,Program, Computer,Programs, Computer,Software Application, Computer,Software Applications, Computer,Software Tool,Software, Computer,Software, Computer Applications,Software, Open Source,Softwares, Computer Applications,Softwares, Open Source,Source Software, Open,Source Softwares, Open,Tool, Software,Tools, Software
D015810 Linkage Disequilibrium Nonrandom association of linked genes. This is the tendency of the alleles of two separate but already linked loci to be found together more frequently than would be expected by chance alone. Disequilibrium, Linkage,Disequilibriums, Linkage,Linkage Disequilibriums
D019295 Computational Biology A field of biology concerned with the development of techniques for the collection and manipulation of biological data, and the use of such data to make biological discoveries or predictions. This field encompasses all computational methods and theories for solving biological problems including manipulation of models and datasets. Bioinformatics,Molecular Biology, Computational,Bio-Informatics,Biology, Computational,Computational Molecular Biology,Bio Informatics,Bio-Informatic,Bioinformatic,Biologies, Computational Molecular,Biology, Computational Molecular,Computational Molecular Biologies,Molecular Biologies, Computational
D020641 Polymorphism, Single Nucleotide A single nucleotide variation in a genetic sequence that occurs at appreciable frequency in the population. SNPs,Single Nucleotide Polymorphism,Nucleotide Polymorphism, Single,Nucleotide Polymorphisms, Single,Polymorphisms, Single Nucleotide,Single Nucleotide Polymorphisms
D030561 Databases, Nucleic Acid Databases containing information about NUCLEIC ACIDS such as BASE SEQUENCE; SNPS; NUCLEIC ACID CONFORMATION; and other properties. Information about the DNA fragments kept in a GENE LIBRARY or GENOMIC LIBRARY is often maintained in DNA databases. DDBJ,DNA Data Bank of Japan,DNA Data Banks,DNA Databases,Databases, DNA,Databases, DNA Sequence,Databases, Nucleic Acid Sequence,Databases, RNA,Databases, RNA Sequence,EMBL Nucleotide Sequence Database,GenBank,Nucleic Acid Databases,RNA Databases,DNA Databanks,DNA Sequence Databases,European Molecular Biology Laboratory Nucleotide Sequence Database,Nucleic Acid Sequence Databases,RNA Sequence Databases,Bank, DNA Data,Banks, DNA Data,DNA Data Bank,DNA Databank,DNA Database,DNA Sequence Database,Data Bank, DNA,Data Banks, DNA,Databank, DNA,Databanks, DNA,Database, DNA,Database, DNA Sequence,Database, Nucleic Acid,Database, RNA,Database, RNA Sequence,Nucleic Acid Database,RNA Database,RNA Sequence Database,Sequence Database, DNA,Sequence Database, RNA,Sequence Databases, DNA,Sequence Databases, RNA

Related Publications

Francisco M De La Vega, and Hadar I Isaac, and Charles R Scafe
January 2003, Genetic epidemiology,
Francisco M De La Vega, and Hadar I Isaac, and Charles R Scafe
November 2005, Bioinformatics (Oxford, England),
Francisco M De La Vega, and Hadar I Isaac, and Charles R Scafe
January 2008, Human heredity,
Francisco M De La Vega, and Hadar I Isaac, and Charles R Scafe
January 2004, In silico biology,
Francisco M De La Vega, and Hadar I Isaac, and Charles R Scafe
August 2004, Human molecular genetics,
Francisco M De La Vega, and Hadar I Isaac, and Charles R Scafe
April 2003, Nature genetics,
Francisco M De La Vega, and Hadar I Isaac, and Charles R Scafe
September 2009, Genetic epidemiology,
Francisco M De La Vega, and Hadar I Isaac, and Charles R Scafe
April 2001, Human mutation,
Francisco M De La Vega, and Hadar I Isaac, and Charles R Scafe
January 2013, PloS one,
Francisco M De La Vega, and Hadar I Isaac, and Charles R Scafe
January 2013, PloS one,
Copied contents to your clipboard!