Prediction of O-glycosylation of mammalian proteins: specificity patterns of UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase. 1995

J E Hansen, and O Lund, and J Engelbrecht, and H Bohr, and J O Nielsen, and J E Hansen
Laboratory for Infectious Diseases, Hvidovre Hospital, University of Copenhagen, Denmark.

The specificity of the enzyme(s) catalysing the covalent link between the hydroxyl side chains of serine or threonine and the sugar moiety N-acetylgalactosamine (GalNAc) is unknown. Pattern recognition by artificial neural networks and weight matrix algorithms was performed to determine the exact position of in vivo O-linked GalNAc-glycosylated serine and threonine residues from the primary sequence exclusively. The acceptor sequence context for O-glycosylation of serine was found to differ from that of threonine and the two types were therefore treated separately. The context of the sites showed a high abundance of proline, serine and threonine extending far beyond the previously reported region covering positions -4 through +4 relative to the glycosylated residue. The O-glycosylation sites were found to cluster and to have a high abundance in the N-terminal part of the protein. The sites were also found to have an increased preference for three different classes of beta-turns. No simple consensus-like rule could be deduced for the complex glycosylation sequence acceptor patterns. The neural networks were trained on the hitherto largest data material consisting of 48 carefully examined mammalian glycoproteins comprising 264 O-glycosylation sites. For detection neural network algorithms were much more reliable than weight matrices. The networks correctly found 60-95% of the O-glycosylated serine/threonine residues and 88-97% of the non-glycosylated residues in two independent test sets of known glycoproteins. A computer server using E-mail for prediction of O-glycosylation sites has been implemented and made publicly available. The Internet address is NetOglyc@cbs.dtu.dk.

UI MeSH Term Description Entries
D007256 Information Systems Integrated set of files, procedures, and equipment for the storage, manipulation, and retrieval of information. Ancillary Information Systems,Emergency Care Information Systems,Information Retrieval Systems,Perinatal Information System,Ancillary Information System,Information Retrieval System,Information System,Information System, Ancillary,Information System, Perinatal,Perinatal Information Systems,Systems, Information Retrieval
D008322 Mammals Warm-blooded vertebrate animals belonging to the class Mammalia, including all that possess hair and suckle their young. Mammalia,Mammal
D008969 Molecular Sequence Data Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories. Sequence Data, Molecular,Molecular Sequencing Data,Data, Molecular Sequence,Data, Molecular Sequencing,Sequencing Data, Molecular
D003195 Computer Communication Networks A system containing any combination of computers, computer terminals, printers, audio or visual display devices, or telephones interconnected by telecommunications equipment or cables: used to transmit or receive information. (Random House Unabridged Dictionary, 2d ed) Cognitive Radio,Computer Network Management,Databases, Distributed,Distributed Systems,Extranets,Intranets,Network Communication Protocols,Telecommunication Networks,Cognitive Radios,Communication Network, Computer,Communication Networks, Computer,Communication Protocol, Network,Communication Protocols, Network,Computer Communication Network,Database, Distributed,Distributed Database,Distributed Databases,Distributed System,Extranet,Intranet,Management, Computer Network,Network Communication Protocol,Network Management, Computer,Network, Computer Communication,Network, Telecommunication,Protocol, Network Communication,Radio, Cognitive,System, Distributed,Telecommunication Network
D006023 Glycoproteins Conjugated protein-carbohydrate compounds including MUCINS; mucoid, and AMYLOID glycoproteins. C-Glycosylated Proteins,Glycosylated Protein,Glycosylated Proteins,N-Glycosylated Proteins,O-Glycosylated Proteins,Glycoprotein,Neoglycoproteins,Protein, Glycosylated,Proteins, C-Glycosylated,Proteins, Glycosylated,Proteins, N-Glycosylated,Proteins, O-Glycosylated
D006031 Glycosylation The synthetic chemistry reaction or enzymatic reaction of adding carbohydrate or glycosyl groups. GLYCOSYLTRANSFERASES carry out the enzymatic glycosylation reactions. The spontaneous, non-enzymatic attachment of reducing sugars to free amino groups in proteins, lipids, or nucleic acids is called GLYCATION (see MAILLARD REACTION). Protein Glycosylation,Glycosylation, Protein
D000097763 Polypeptide N-acetylgalactosaminyltransferase Family of enzymes that catalyze the formation of GalNAcAlpha1-serine/threonine linkages in glycoproteins. Galactosylgalactosylglucosylceramide beta-D-acetylgalactosaminyltransferase,Globoside Synthase,Globoside beta GalNAc Transferase,Protein-UDPacetylgalactosaminyltransferase,(1-3)-N-acetyl-beta-galactosaminyltransferase,(1-4)-N-acetyl-beta-D-galactosaminyltransferase,4-GalNActransferase,GalNAc-T1,GalNAc-T10,GalNAc-T2,GalNAc-T3,GalNAc-T4,GalNAc-T5,GalNAc-T8,GalNAc-transferase,GalNAcT-1,GalNAcT-2,GalNAcT-4,GalNAcT-8,UDP-GPAGAT,UDP-GalNAc-beta-galactose beta 1,4-N-acetylgalactosaminyltransferase,UDP-GalNAc-polypeptide N-acetylgalactosaminyltransferase,UDP-N-acetyl-D-galactosamine polypeptide N-acetylgalactosaminyltransferase-T4,UDP-N-acetylgalactosamine mucin transferase,UDP-N-acetylgalactosamine-beta-galactose beta 1,4-N-acetylgalactosaminyltransferase,UDP-N-acetylgalactosamine-globoside beta-N-acetylgalactosaminyltransferase,UDP-N-acetylgalactosamine-globosidetriaosylceramide beta-3-N-acetylgalactosaminyltransferase,UDP-N-acetylgalactosamine-polypeptide N-acetylgalactosamine transferase,UDPacetylgalactosamine-galactosyl-galactosyl-glucosylceramide beta-N-acetyl-D-galactosaminyltransferase,UDPacetylgalactosamine-protein acetylgalactosaminyltransferase,beta-1,4-N-acetylgalactosaminyltransferase,beta-N-acetylgalactosaminyltransferase,beta1,6N-acetylgalactosaminyltransferase,polypeptide N-acetylgalactosaminyltransferase 1,polypeptide N-acetylgalactosaminyltransferase 10,polypeptide N-acetylgalactosaminyltransferase 2,polypeptide N-acetylgalactosaminyltransferase 3,polypeptide N-acetylgalactosaminyltransferase 4,polypeptide N-acetylgalactosaminyltransferase 5,polypeptide N-acetylgalactosaminyltransferase 8,pp-GalNAc-T10,ppGalNAc-T,Synthase, Globoside
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm
D000818 Animals Unicellular or multicellular, heterotrophic organisms, that have sensation and the power of voluntary movement. Under the older five kingdom paradigm, Animalia was one of the kingdoms. Under the modern three domain model, Animalia represents one of the many groups in the domain EUKARYOTA. Animal,Metazoa,Animalia
D012694 Serine A non-essential amino acid occurring in natural form as the L-isomer. It is synthesized from GLYCINE or THREONINE. It is involved in the biosynthesis of PURINES; PYRIMIDINES; and other amino acids. L-Serine,L Serine

Related Publications

J E Hansen, and O Lund, and J Engelbrecht, and H Bohr, and J O Nielsen, and J E Hansen
December 1993, Journal of dental research,
J E Hansen, and O Lund, and J Engelbrecht, and H Bohr, and J O Nielsen, and J E Hansen
April 1998, Glycobiology,
J E Hansen, and O Lund, and J Engelbrecht, and H Bohr, and J O Nielsen, and J E Hansen
September 1999, The Journal of biological chemistry,
J E Hansen, and O Lund, and J Engelbrecht, and H Bohr, and J O Nielsen, and J E Hansen
February 2010, Glycoconjugate journal,
J E Hansen, and O Lund, and J Engelbrecht, and H Bohr, and J O Nielsen, and J E Hansen
June 1992, The Journal of biological chemistry,
J E Hansen, and O Lund, and J Engelbrecht, and H Bohr, and J O Nielsen, and J E Hansen
July 1996, Analytical biochemistry,
J E Hansen, and O Lund, and J Engelbrecht, and H Bohr, and J O Nielsen, and J E Hansen
November 1996, Biochemical and biophysical research communications,
J E Hansen, and O Lund, and J Engelbrecht, and H Bohr, and J O Nielsen, and J E Hansen
December 2002, The Journal of biological chemistry,
J E Hansen, and O Lund, and J Engelbrecht, and H Bohr, and J O Nielsen, and J E Hansen
March 1998, Glycoconjugate journal,
J E Hansen, and O Lund, and J Engelbrecht, and H Bohr, and J O Nielsen, and J E Hansen
December 1995, Glycoconjugate journal,
Copied contents to your clipboard!