Nematode histone H2A variant evolution reveals diverse histories of retention and loss and evidence for conserved core-like variant histone genes. 2024

Swadha Singh, and Noelle Anderson, and Diana Chu, and Scott W Roy
Quantitative & Systems Biology, University of California, Merced, Merced, California, United States of America.

Histone variants are paralogs that replace canonical histones in nucleosomes, often imparting novel functions. However, how histone variants arise and evolve is poorly understood. Reconstruction of histone protein evolution is challenging due to large differences in evolutionary rates across gene lineages and sites. Here we used intron position data from 108 nematode genomes in combination with amino acid sequence data to find disparate evolutionary histories of the three H2A variants found in Caenorhabditis elegans: the ancient H2A.ZHTZ-1, the sperm-specific HTAS-1, and HIS-35, which differs from the canonical S-phase H2A by a single glycine-to-alanine C-terminal change. Although the H2A.ZHTZ-1 protein sequence is highly conserved, its gene exhibits recurrent intron gain and loss. This pattern suggests that specific intron sequences or positions may not be important to H2A.Z functionality. For HTAS-1 and HIS-35, we find variant-specific intron positions that are conserved across species. Patterns of intron position conservation indicate that the sperm-specific variant HTAS-1 arose more recently in the ancestor of a subset of Caenorhabditis species, while HIS-35 arose in the ancestor of Caenorhabditis and its sister group, including the genus Diploscapter. HIS-35 exhibits gene retention in some descendent lineages but gene loss in others, suggesting that histone variant use or functionality can be highly flexible. Surprisingly, we find the single amino acid differentiating HIS-35 from core H2A is ancestral and common across canonical Caenorhabditis H2A sequences. Thus, we speculate that the role of HIS-35 lies not in encoding a functionally distinct protein, but instead in enabling H2A expression across the cell cycle or in distinct tissues. This work illustrates how genes encoding such partially-redundant functions may be advantageous yet relatively replaceable over evolutionary timescales, consistent with the patchwork pattern of retention and loss of both genes. Our study shows the utility of intron positions for reconstructing evolutionary histories of gene families, particularly those undergoing idiosyncratic sequence evolution.

UI MeSH Term Description Entries
D007438 Introns Sequences of DNA in the genes that are located between the EXONS. They are transcribed along with the exons but are removed from the primary gene transcript by RNA SPLICING to leave mature RNA. Some introns code for separate genes. Intervening Sequences,Sequences, Intervening,Intervening Sequence,Intron,Sequence, Intervening
D008297 Male Males
D010802 Phylogeny The relationships of groups of organisms as reflected by their genetic makeup. Community Phylogenetics,Molecular Phylogenetics,Phylogenetic Analyses,Phylogenetic Analysis,Phylogenetic Clustering,Phylogenetic Comparative Analysis,Phylogenetic Comparative Methods,Phylogenetic Distance,Phylogenetic Generalized Least Squares,Phylogenetic Groups,Phylogenetic Incongruence,Phylogenetic Inference,Phylogenetic Networks,Phylogenetic Reconstruction,Phylogenetic Relatedness,Phylogenetic Relationships,Phylogenetic Signal,Phylogenetic Structure,Phylogenetic Tree,Phylogenetic Trees,Phylogenomics,Analyse, Phylogenetic,Analysis, Phylogenetic,Analysis, Phylogenetic Comparative,Clustering, Phylogenetic,Community Phylogenetic,Comparative Analysis, Phylogenetic,Comparative Method, Phylogenetic,Distance, Phylogenetic,Group, Phylogenetic,Incongruence, Phylogenetic,Inference, Phylogenetic,Method, Phylogenetic Comparative,Molecular Phylogenetic,Network, Phylogenetic,Phylogenetic Analyse,Phylogenetic Clusterings,Phylogenetic Comparative Analyses,Phylogenetic Comparative Method,Phylogenetic Distances,Phylogenetic Group,Phylogenetic Incongruences,Phylogenetic Inferences,Phylogenetic Network,Phylogenetic Reconstructions,Phylogenetic Relatednesses,Phylogenetic Relationship,Phylogenetic Signals,Phylogenetic Structures,Phylogenetic, Community,Phylogenetic, Molecular,Phylogenies,Phylogenomic,Reconstruction, Phylogenetic,Relatedness, Phylogenetic,Relationship, Phylogenetic,Signal, Phylogenetic,Structure, Phylogenetic,Tree, Phylogenetic
D006657 Histones Small chromosomal proteins (approx 12-20 kD) possessing an open, unfolded structure and attached to the DNA in cell nuclei by ionic linkages. Classification into the various types (designated histone I, histone II, etc.) is based on the relative amounts of arginine and lysine in each. Histone,Histone H1,Histone H1(s),Histone H2a,Histone H2b,Histone H3,Histone H3.3,Histone H4,Histone H5,Histone H7
D000595 Amino Acid Sequence The order of amino acids as they occur in a polypeptide chain. This is referred to as the primary structure of proteins. It is of fundamental importance in determining PROTEIN CONFORMATION. Protein Structure, Primary,Amino Acid Sequences,Sequence, Amino Acid,Sequences, Amino Acid,Primary Protein Structure,Primary Protein Structures,Protein Structures, Primary,Structure, Primary Protein,Structures, Primary Protein
D000818 Animals Unicellular or multicellular, heterotrophic organisms, that have sensation and the power of voluntary movement. Under the older five kingdom paradigm, Animalia was one of the kingdoms. Under the modern three domain model, Animalia represents one of the many groups in the domain EUKARYOTA. Animal,Metazoa,Animalia
D017124 Conserved Sequence A sequence of amino acids in a polypeptide or of nucleotides in DNA or RNA that is similar across multiple species. A known set of conserved sequences is represented by a CONSENSUS SEQUENCE. AMINO ACID MOTIFS are often composed of conserved sequences. Conserved Sequences,Sequence, Conserved,Sequences, Conserved
D017173 Caenorhabditis elegans A species of nematode that is widely used in biological, biochemical, and genetic studies. Caenorhabditis elegan,elegan, Caenorhabditis
D019143 Evolution, Molecular The process of cumulative change at the level of DNA; RNA; and PROTEINS, over successive generations. Molecular Evolution,Genetic Evolution,Evolution, Genetic
D029742 Caenorhabditis elegans Proteins Proteins from the nematode species CAENORHABDITIS ELEGANS. The proteins from this species are the subject of scientific interest in the area of multicellular organism MORPHOGENESIS. C elegans Proteins

Related Publications

Swadha Singh, and Noelle Anderson, and Diana Chu, and Scott W Roy
April 2003, Nature cell biology,
Swadha Singh, and Noelle Anderson, and Diana Chu, and Scott W Roy
October 2017, The Journal of heredity,
Swadha Singh, and Noelle Anderson, and Diana Chu, and Scott W Roy
October 2010, Experimental cell research,
Swadha Singh, and Noelle Anderson, and Diana Chu, and Scott W Roy
December 2011, Nature structural & molecular biology,
Swadha Singh, and Noelle Anderson, and Diana Chu, and Scott W Roy
June 1983, Journal of biochemistry,
Swadha Singh, and Noelle Anderson, and Diana Chu, and Scott W Roy
October 2005, Cell,
Swadha Singh, and Noelle Anderson, and Diana Chu, and Scott W Roy
December 1979, Cell,
Swadha Singh, and Noelle Anderson, and Diana Chu, and Scott W Roy
February 1986, The Journal of biological chemistry,
Swadha Singh, and Noelle Anderson, and Diana Chu, and Scott W Roy
December 1994, Molecular & general genetics : MGG,
Copied contents to your clipboard!