Unifying generative and discriminative learning principles. 2010

Jens Keilwagen, and Jan Grau, and Stefan Posch, and Marc Strickert, and Ivo Grosse
Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, Germany. Jens.Keilwagen@ipk-gatersleben.de

BACKGROUND The recognition of functional binding sites in genomic DNA remains one of the fundamental challenges of genome research. During the last decades, a plethora of different and well-adapted models has been developed, but only little attention has been payed to the development of different and similarly well-adapted learning principles. Only recently it was noticed that discriminative learning principles can be superior over generative ones in diverse bioinformatics applications, too. RESULTS Here, we propose a generalization of generative and discriminative learning principles containing the maximum likelihood, maximum a posteriori, maximum conditional likelihood, maximum supervised posterior, generative-discriminative trade-off, and penalized generative-discriminative trade-off learning principles as special cases, and we illustrate its efficacy for the recognition of vertebrate transcription factor binding sites. CONCLUSIONS We find that the proposed learning principle helps to improve the recognition of transcription factor binding sites, enabling better computational approaches for extracting as much information as possible from valuable wet-lab data. We make all implementations available in the open-source library Jstacs so that this learning principle can be easily applied to other classification problems in the field of genome and epigenome analysis.

UI MeSH Term Description Entries
D010363 Pattern Recognition, Automated In INFORMATION RETRIEVAL, machine-sensing or identification of visible patterns (shapes, forms, and configurations). (Harrod's Librarians' Glossary, 7th ed) Automated Pattern Recognition,Pattern Recognition System,Pattern Recognition Systems
D004247 DNA A deoxyribonucleotide polymer that is the primary genetic material of all cells. Eukaryotic and prokaryotic organisms normally contain DNA in a double-stranded state, yet several important biological processes transiently involve single-stranded regions. DNA, which consists of a polysugar-phosphate backbone possessing projections of purines (adenine and guanine) and pyrimidines (thymine and cytosine), forms a double helix that is held together by hydrogen bonds between these purines and pyrimidines (adenine to thymine and guanine to cytosine). DNA, Double-Stranded,Deoxyribonucleic Acid,ds-DNA,DNA, Double Stranded,Double-Stranded DNA,ds DNA
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm
D016002 Discriminant Analysis A statistical analytic technique used with discrete dependent variables, concerned with separating sets of observed values and allocating new values. It is sometimes used instead of regression analysis. Analyses, Discriminant,Analysis, Discriminant,Discriminant Analyses
D016013 Likelihood Functions Functions constructed from a statistical model and a set of observed data which give the probability of that data for various values of the unknown model parameters. Those parameter values that maximize the probability are the maximum likelihood estimates of the parameters. Likelihood Ratio Test,Maximum Likelihood Estimates,Estimate, Maximum Likelihood,Estimates, Maximum Likelihood,Function, Likelihood,Functions, Likelihood,Likelihood Function,Maximum Likelihood Estimate,Test, Likelihood Ratio
D016247 Information Storage and Retrieval Organized activities related to the storage, location, search, and retrieval of information. Information Retrieval,Data Files,Data Linkage,Data Retrieval,Data Storage,Data Storage and Retrieval,Information Extraction,Information Storage,Machine-Readable Data Files,Data File,Data File, Machine-Readable,Data Files, Machine-Readable,Extraction, Information,Files, Machine-Readable Data,Information Extractions,Machine Readable Data Files,Machine-Readable Data File,Retrieval, Data,Storage, Data
D016678 Genome The genetic complement of an organism, including all of its GENES, as represented in its DNA, or in some cases, its RNA. Genomes
D023281 Genomics The systematic study of the complete DNA sequences (GENOME) of organisms. Included is construction of complete genetic, physical, and transcript maps, and the analysis of this structural genomic information on a global scale such as in GENOME WIDE ASSOCIATION STUDIES. Functional Genomics,Structural Genomics,Comparative Genomics,Genomics, Comparative,Genomics, Functional,Genomics, Structural

Related Publications

Jens Keilwagen, and Jan Grau, and Stefan Posch, and Marc Strickert, and Ivo Grosse
February 2020, Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence,
Jens Keilwagen, and Jan Grau, and Stefan Posch, and Marc Strickert, and Ivo Grosse
August 2007, IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society,
Jens Keilwagen, and Jan Grau, and Stefan Posch, and Marc Strickert, and Ivo Grosse
January 2012, IEEE transactions on medical imaging,
Jens Keilwagen, and Jan Grau, and Stefan Posch, and Marc Strickert, and Ivo Grosse
December 2008, IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society,
Jens Keilwagen, and Jan Grau, and Stefan Posch, and Marc Strickert, and Ivo Grosse
December 2022, IEEE transactions on image processing : a publication of the IEEE Signal Processing Society,
Jens Keilwagen, and Jan Grau, and Stefan Posch, and Marc Strickert, and Ivo Grosse
January 2024, Neural networks : the official journal of the International Neural Network Society,
Jens Keilwagen, and Jan Grau, and Stefan Posch, and Marc Strickert, and Ivo Grosse
October 2019, Machine learning in medical imaging. MLMI (Workshop),
Jens Keilwagen, and Jan Grau, and Stefan Posch, and Marc Strickert, and Ivo Grosse
September 2022, IEEE transactions on medical imaging,
Jens Keilwagen, and Jan Grau, and Stefan Posch, and Marc Strickert, and Ivo Grosse
December 2023, IEEE journal of biomedical and health informatics,
Jens Keilwagen, and Jan Grau, and Stefan Posch, and Marc Strickert, and Ivo Grosse
May 2020, IEEE transactions on cybernetics,
Copied contents to your clipboard!