Syntactic recognition of regulatory regions in Escherichia coli. 1996

D A Rosenblueth, and D Thieffry, and A M Huerta, and H Salgado, and J Collado-Vides
Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Universidad Nacional Autónoma de México, Ciudad Universitaria, México D.F.

BACKGROUND One of the most common methodologies to identify cis-regulatory sites in regulatory regions in the DNA is that of weight matrices, as testified by several articles in this issue. An alternative to strengthen the computational predictions in regulatory regions is to develop methods that incorporate more biological properties present in such DNA regions. The grammatical implementation presented in this paper provides a concrete example in this direction. RESULTS On the basis of the analysis of an exhaustive collection of regulatory regions in Escherichia coli, a grammatical model for the regulatory regions of sigma 70 promoters has been developed. The terminal symbols of the grammar represent individual sites for the binding of activator and repressor proteins, and include the precise position of sites in relation to transcription initiation. Combining these symbols, the grammar generates a large number of different sentences, each of which can be searched for matching against a collection of regulatory regions by means of weight matrices specific for each set of sites for individual proteins. On the basis of this grammatical model, a Prolog syntactic recognizer is presented here. Specific subgrammars for ArgR, LexA and TyrR were implemented. When parsing a collection of 128 sigma 70 promoter regions, the syntactic recognizer produces a much lower number of false-positive sites than the standard search using weight matrices.

UI MeSH Term Description Entries
D011381 Programming Languages Specific languages used to prepare computer programs. Language, Programming,Languages, Programming,Programming Language
D011401 Promoter Regions, Genetic DNA sequences which are recognized (directly or indirectly) and bound by a DNA-dependent RNA polymerase during the initiation of transcription. Highly conserved sequences within the promoter include the Pribnow box in bacteria and the TATA BOX in eukaryotes. rRNA Promoter,Early Promoters, Genetic,Late Promoters, Genetic,Middle Promoters, Genetic,Promoter Regions,Promoter, Genetic,Promotor Regions,Promotor, Genetic,Pseudopromoter, Genetic,Early Promoter, Genetic,Genetic Late Promoter,Genetic Middle Promoters,Genetic Promoter,Genetic Promoter Region,Genetic Promoter Regions,Genetic Promoters,Genetic Promotor,Genetic Promotors,Genetic Pseudopromoter,Genetic Pseudopromoters,Late Promoter, Genetic,Middle Promoter, Genetic,Promoter Region,Promoter Region, Genetic,Promoter, Genetic Early,Promoter, rRNA,Promoters, Genetic,Promoters, Genetic Middle,Promoters, rRNA,Promotor Region,Promotors, Genetic,Pseudopromoters, Genetic,Region, Genetic Promoter,Region, Promoter,Region, Promotor,Regions, Genetic Promoter,Regions, Promoter,Regions, Promotor,rRNA Promoters
D012045 Regulatory Sequences, Nucleic Acid Nucleic acid sequences involved in regulating the expression of genes. Nucleic Acid Regulatory Sequences,Regulatory Regions, Nucleic Acid (Genetics),Region, Regulatory,Regions, Regulatory,Regulator Regions, Nucleic Acid,Regulatory Region,Regulatory Regions
D004269 DNA, Bacterial Deoxyribonucleic acid that makes up the genetic material of bacteria. Bacterial DNA
D004926 Escherichia coli A species of gram-negative, facultatively anaerobic, rod-shaped bacteria (GRAM-NEGATIVE FACULTATIVELY ANAEROBIC RODS) commonly found in the lower part of the intestine of warm-blooded animals. It is usually nonpathogenic, but some strains are known to produce DIARRHEA and pyogenic infections. Pathogenic strains (virotypes) are classified by their specific pathogenic mechanisms such as toxins (ENTEROTOXIGENIC ESCHERICHIA COLI), etc. Alkalescens-Dispar Group,Bacillus coli,Bacterium coli,Bacterium coli commune,Diffusely Adherent Escherichia coli,E coli,EAggEC,Enteroaggregative Escherichia coli,Enterococcus coli,Diffusely Adherent E. coli,Enteroaggregative E. coli,Enteroinvasive E. coli,Enteroinvasive Escherichia coli
D005189 False Positive Reactions Positive test results in subjects who do not possess the attribute for which the test is conducted. The labeling of healthy persons as diseased when screening in the detection of disease. (Last, A Dictionary of Epidemiology, 2d ed) False Positive Reaction,Positive Reaction, False,Positive Reactions, False,Reaction, False Positive,Reactions, False Positive
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm
D001483 Base Sequence The sequence of PURINES and PYRIMIDINES in nucleic acids and polynucleotides. It is also called nucleotide sequence. DNA Sequence,Nucleotide Sequence,RNA Sequence,DNA Sequences,Base Sequences,Nucleotide Sequences,RNA Sequences,Sequence, Base,Sequence, DNA,Sequence, Nucleotide,Sequence, RNA,Sequences, Base,Sequences, DNA,Sequences, Nucleotide,Sequences, RNA

Related Publications

D A Rosenblueth, and D Thieffry, and A M Huerta, and H Salgado, and J Collado-Vides
June 2024, NAR genomics and bioinformatics,
D A Rosenblueth, and D Thieffry, and A M Huerta, and H Salgado, and J Collado-Vides
June 1994, Journal of biotechnology,
D A Rosenblueth, and D Thieffry, and A M Huerta, and H Salgado, and J Collado-Vides
January 1982, The EMBO journal,
D A Rosenblueth, and D Thieffry, and A M Huerta, and H Salgado, and J Collado-Vides
May 1998, Journal of bacteriology,
D A Rosenblueth, and D Thieffry, and A M Huerta, and H Salgado, and J Collado-Vides
November 2009, Cell research,
D A Rosenblueth, and D Thieffry, and A M Huerta, and H Salgado, and J Collado-Vides
November 2014, ACS chemical biology,
D A Rosenblueth, and D Thieffry, and A M Huerta, and H Salgado, and J Collado-Vides
January 1988, Annual review of biochemistry,
D A Rosenblueth, and D Thieffry, and A M Huerta, and H Salgado, and J Collado-Vides
March 1969, Journal of biochemistry,
D A Rosenblueth, and D Thieffry, and A M Huerta, and H Salgado, and J Collado-Vides
June 2011, The Biochemical journal,
D A Rosenblueth, and D Thieffry, and A M Huerta, and H Salgado, and J Collado-Vides
January 1986, Molekuliarnaia biologiia,
Copied contents to your clipboard!