Structural and functional insights into Mimivirus ORFans. 2007

Harpreet Kaur Saini and Daniel Fischer
Computer Science and Engineering Dept., University at Buffalo, Buffalo, NY 14260-2000, USA. hksaini@cse.buffalo.edu

BACKGROUND: Mimivirus, isolated from A. polyphaga, is the largest virus discovered so far. It is unique among viruses in having genes related to translation, DNA repair and replication that bear close homology to eukaryotic genes. Nevertheless, only a small fraction (33%) of the proteins encoded in this genome has been assigned a function. Furthermore, a large fraction of the unassigned protein sequences bear no sequence similarity to proteins from other genomes. These sequences are referred to as ORFans. Because they lack sequence similarity to other proteins, they cannot be assigned putative functions using standard sequence comparison methods. As part of our genome-wide computational efforts to characterize Mimivirus ORFans, we applied fold-recognition methods to predict the structures of these ORFans, and then derived functions from the conservation of functionally important residues in sequence-template alignments.

RESULTS: Using fold recognition, we identified high-confidence computational 3D structural assignments for 21 Mimivirus ORFans. In addition, we derived high-confidence functional predictions for 6 of these ORFans by analyzing the conservation of functional motifs between the predicted structures and proteins of known function. This analysis allowed us to classify these 6 previously unannotated ORFans into specific protein families: carboxylesterase/thioesterase, metal-dependent deacetylase, P-loop kinases, 3-methyladenine DNA glycosylase, BTB domain and eukaryotic translation initiation factor eIF4E.

CONCLUSIONS: Using stringent fold-recognition criteria, we have assigned three-dimensional structures to 21 of the ORFans encoded in the Mimivirus genome. Further, based on the 3D models and an analysis of the conservation of functionally important residues and motifs, we were able to derive functional attributes for 6 of the ORFans. Our computational identification of important functional sites in these ORFans can serve as the basis for subsequent experimental verification of our predictions. Further computational and experimental studies are required to elucidate the 3D structures and functions of the remaining Mimivirus ORFans.
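The residue-conservation step described in the abstract can be illustrated with a small sketch. It is not the authors' pipeline: the function names, the toy alignment, and the site definitions below are all hypothetical, and the fold-recognition step that would produce the sequence-template alignment is assumed to have been run already. The sketch only shows the general idea of walking a gapped pairwise alignment and checking whether the query carries an acceptable residue at each functionally important template position.

```python
from typing import Dict, List, Set, Tuple

GAP = "-"


def map_template_positions(template_aln: str, query_aln: str) -> Dict[int, str]:
    """Map 1-based ungapped template positions to the aligned query character.

    Both arguments are rows of the same pairwise alignment, so they must
    have equal length. Columns where the template has a gap are skipped
    because they correspond to no template residue.
    """
    if len(template_aln) != len(query_aln):
        raise ValueError("aligned sequences must have equal length")
    mapping: Dict[int, str] = {}
    position = 0
    for template_char, query_char in zip(template_aln, query_aln):
        if template_char == GAP:
            continue  # insertion in the query relative to the template
        position += 1
        mapping[position] = query_char
    return mapping


def check_functional_sites(
    template_aln: str,
    query_aln: str,
    sites: Dict[int, Set[str]],
) -> List[Tuple[int, str, bool]]:
    """Report, for each template site, the query residue and whether it is allowed.

    `sites` maps a 1-based template position to the set of residues
    considered compatible with function at that position (an assumption
    supplied by the user, e.g. from the template's annotation).
    """
    mapping = map_template_positions(template_aln, query_aln)
    report = []
    for position, allowed in sorted(sites.items()):
        residue = mapping.get(position, GAP)
        report.append((position, residue, residue in allowed))
    return report


def fraction_conserved(report: List[Tuple[int, str, bool]]) -> float:
    """Fraction of functional sites at which the query residue is allowed."""
    if not report:
        return 0.0
    return sum(1 for _, _, ok in report if ok) / len(report)


# Toy example (made-up sequences, not Mimivirus data).
template_row = "MKG-SHDLA"
query_row = "MRGTSHE-A"
# Hypothetical functional sites on the template: positions 4, 5, 6 and 7.
toy_sites = {4: {"S"}, 5: {"H"}, 6: {"D", "E"}, 7: {"L"}}

toy_report = check_functional_sites(template_row, query_row, toy_sites)
for position, residue, ok in toy_report:
    print(position, residue, "conserved" if ok else "not conserved")
print("fraction conserved:", fraction_conserved(toy_report))
```

In this toy case three of the four sites are matched and the fourth falls on a gap in the query. In practice, any threshold on such a score, and the choice of which substitutions count as compatible, would be a judgment call made per protein family.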

UI | MeSH Term | Description | Entries
D008958 | Models, Molecular | Models used experimentally or theoretically to study molecular shape, electronic properties, or interactions; includes analogous molecules, computer-generated graphics, and mechanical structures. | Molecular Models; Model, Molecular; Molecular Model
D008969 | Molecular Sequence Data | Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories. | Sequence Data, Molecular; Molecular Sequencing Data; Data, Molecular Sequence; Data, Molecular Sequencing; Sequencing Data, Molecular
D003198 | Computer Simulation | Computer-based representation of physical systems and phenomena such as chemical processes. | Computational Modeling; Computational Modelling; Computer Models; In silico Modeling; In silico Models; In silico Simulation; Models, Computer; Computerized Models; Computer Model; Computer Simulations; Computerized Model; In silico Model; Model, Computer; Model, Computerized; Model, In silico; Modeling, Computational; Modeling, In silico; Modelling, Computational; Simulation, Computer; Simulation, In silico; Simulations, Computer
D004267 | DNA Viruses | Viruses whose nucleic acid is DNA. | DNA Virus; Virus, DNA; Viruses, DNA
D000595 | Amino Acid Sequence | The order of amino acids as they occur in a polypeptide chain. This is referred to as the primary structure of proteins. It is of fundamental importance in determining PROTEIN CONFORMATION. | Protein Structure, Primary; Amino Acid Sequences; Sequence, Amino Acid; Sequences, Amino Acid; Primary Protein Structure; Primary Protein Structures; Protein Structures, Primary; Structure, Primary Protein; Structures, Primary Protein
D013045 | Species Specificity | The restriction of a characteristic behavior, anatomical structure or physical system, such as immune response; metabolic response, or gene or gene variant to the members of one species. It refers to that property which differentiates one species from another but it is also used for phylogenetic levels higher or lower than the species. | Species Specificities; Specificities, Species; Specificity, Species
D013329 | Structure-Activity Relationship | The relationship between the chemical structure of a compound and its biological or pharmacological activity. Compounds are often classed together because they have structural characteristics in common including shape, size, stereochemical arrangement, and distribution of functional groups. | Relationship, Structure-Activity; Relationships, Structure-Activity; Structure Activity Relationship; Structure-Activity Relationships
D014764 | Viral Proteins | Proteins found in any species of virus. | Gene Products, Viral; Viral Gene Products; Viral Gene Proteins; Viral Protein; Protein, Viral; Proteins, Viral
D014780 | Viruses | Minute infectious agents whose genomes are composed of DNA or RNA, but not both. They are characterized by a lack of independent metabolism and the inability to replicate outside living host cells. | Animal Viruses; Zoophaginae; Animal Virus; Virus; Virus, Animal; Viruses, Animal
D016208 | Databases, Factual | Extensive collections, reputedly complete, of facts and data garnered from material of a specialized subject area and made available for analysis and application. The collection can be automated by various contemporary methods for retrieval. The concept should be differentiated from DATABASES, BIBLIOGRAPHIC which is restricted to collections of bibliographic references. | Databanks, Factual; Data Banks, Factual; Data Bases, Factual; Data Bank, Factual; Data Base, Factual; Databank, Factual; Database, Factual; Factual Data Bank; Factual Data Banks; Factual Data Base; Factual Data Bases; Factual Databank; Factual Databanks; Factual Database; Factual Databases
