Development of an expert system for amino acid sequence identification. 1996

L Hu, and E F Saulinskas, and P Johnson, and P B Harrington
Center for Intelligent Chemical Instrumentation, Clippinger Laboratories, Ohio University, Athens 45701-2979, USA.

An expert system for amino acid sequence identification has been developed. The algorithm uses heuristic rules developed by human experts in protein sequencing. The system is applied to the chromatographic data of phenylthiohydantoin-amino acids acquired from an automated sequencer. The peak intensities in the current cycle are compared with those in the previous cycle, while the calibration and succeeding cycles are used as ancillary identification criteria when necessary. The retention time for each chromatographic peak in each cycle is corrected by the corresponding peak in the calibration cycle at the same run. The main improvement of our system compared with the onboard software used by the Applied Biosystems 477A Protein/Peptide Sequencer is that each peak in each cycle is assigned an identification name according to the corrected retention time to be used for the comparison with different cycles. The system was developed from analyses of ribonuclease A and evaluated by runs of four other protein samples that were not used in rule development. This paper demonstrates that rules developed by human experts can be automatically applied to sequence assignment. The expert system performed more accurately than the onboard software of the protein sequencer, in that the misidentification rates for the expert system were around 7%, whereas those for the onboard software were between 13 and 21%.

UI MeSH Term Description Entries
D008969 Molecular Sequence Data Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories. Sequence Data, Molecular,Molecular Sequencing Data,Data, Molecular Sequence,Data, Molecular Sequencing,Sequencing Data, Molecular
D010446 Peptide Fragments Partial proteins formed by partial hydrolysis of complete proteins or generated through PROTEIN ENGINEERING techniques. Peptide Fragment,Fragment, Peptide,Fragments, Peptide
D010669 Phenylthiohydantoin Thiohydantoin benzene derivative.
D011506 Proteins Linear POLYPEPTIDES that are synthesized on RIBOSOMES and may be further modified, crosslinked, cleaved, or assembled into complex proteins with several subunits. The specific sequence of AMINO ACIDS determines the shape the polypeptide will take, during PROTEIN FOLDING, and the function of the protein. Gene Products, Protein,Gene Proteins,Protein,Protein Gene Products,Proteins, Gene
D011817 Rabbits A burrowing plant-eating mammal with hind limbs that are longer than its fore limbs. It belongs to the family Leporidae of the order Lagomorpha, and in contrast to hares, possesses 22 instead of 24 pairs of chromosomes. Belgian Hare,New Zealand Rabbit,New Zealand Rabbits,New Zealand White Rabbit,Rabbit,Rabbit, Domestic,Chinchilla Rabbits,NZW Rabbits,New Zealand White Rabbits,Oryctolagus cuniculus,Chinchilla Rabbit,Domestic Rabbit,Domestic Rabbits,Hare, Belgian,NZW Rabbit,Rabbit, Chinchilla,Rabbit, NZW,Rabbit, New Zealand,Rabbits, Chinchilla,Rabbits, Domestic,Rabbits, NZW,Rabbits, New Zealand,Zealand Rabbit, New,Zealand Rabbits, New,cuniculus, Oryctolagus
D005103 Expert Systems Computer programs based on knowledge developed from consultation with experts on a problem, and the processing and/or formalizing of this knowledge using these programs in such a manner that the problems may be solved. Expert System,System, Expert,Systems, Expert
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm
D000595 Amino Acid Sequence The order of amino acids as they occur in a polypeptide chain. This is referred to as the primary structure of proteins. It is of fundamental importance in determining PROTEIN CONFORMATION. Protein Structure, Primary,Amino Acid Sequences,Sequence, Amino Acid,Sequences, Amino Acid,Primary Protein Structure,Primary Protein Structures,Protein Structures, Primary,Structure, Primary Protein,Structures, Primary Protein
D000818 Animals Unicellular or multicellular, heterotrophic organisms, that have sensation and the power of voluntary movement. Under the older five kingdom paradigm, Animalia was one of the kingdoms. Under the modern three domain model, Animalia represents one of the many groups in the domain EUKARYOTA. Animal,Metazoa,Animalia

Related Publications

L Hu, and E F Saulinskas, and P Johnson, and P B Harrington
October 2000, Journal of photochemistry and photobiology. B, Biology,
L Hu, and E F Saulinskas, and P Johnson, and P B Harrington
January 1994, Proceedings. International Conference on Intelligent Systems for Molecular Biology,
L Hu, and E F Saulinskas, and P Johnson, and P B Harrington
January 2005, Revista brasileira de enfermagem,
L Hu, and E F Saulinskas, and P Johnson, and P B Harrington
September 1996, The Journal of biological chemistry,
L Hu, and E F Saulinskas, and P Johnson, and P B Harrington
April 1992, Computer methods and programs in biomedicine,
L Hu, and E F Saulinskas, and P Johnson, and P B Harrington
January 2001, Journal of rehabilitation research and development,
L Hu, and E F Saulinskas, and P Johnson, and P B Harrington
April 2019, Current opinion in chemical biology,
L Hu, and E F Saulinskas, and P Johnson, and P B Harrington
June 2002, Biochemical and biophysical research communications,
L Hu, and E F Saulinskas, and P Johnson, and P B Harrington
January 1990, Progress in clinical and biological research,
L Hu, and E F Saulinskas, and P Johnson, and P B Harrington
August 2022, The Analyst,
Copied contents to your clipboard!