Polymorphism-Aware Species Trees with Advanced Mutation Models, Bootstrap, and Rate Heterogeneity. 2019

Dominik Schrempf, and Bui Quang Minh, and Arndt von Haeseler, and Carolin Kosiol
Department of Biological Physics, Eötvös Loránd University, Budapest, Hungary.

Molecular phylogenetics has neglected polymorphisms within present and ancestral populations for a long time. Recently, multispecies coalescent based methods have increased in popularity, however, their application is limited to a small number of species and individuals. We introduced a polymorphism-aware phylogenetic model (PoMo), which overcomes this limitation and scales well with the increasing amount of sequence data whereas accounting for present and ancestral polymorphisms. PoMo circumvents handling of gene trees and directly infers species trees from allele frequency data. Here, we extend the PoMo implementation in IQ-TREE and integrate search for the statistically best-fit mutation model, the ability to infer mutation rate variation across sites, and assessment of branch support values. We exemplify an analysis of a hundred species with ten haploid individuals each, showing that PoMo can perform inference on large data sets. While PoMo is more accurate than standard substitution models applied to concatenated alignments, it is almost as fast. We also provide bmm-simulate, a software package that allows simulation of sequences evolving under PoMo. The new options consolidate the value of PoMo for phylogenetic analyses with population data.

UI MeSH Term Description Entries
D008957 Models, Genetic Theoretical representations that simulate the behavior or activity of genetic processes or phenomena. They include the use of mathematical equations, computers, and other electronic equipment. Genetic Models,Genetic Model,Model, Genetic
D010802 Phylogeny The relationships of groups of organisms as reflected by their genetic makeup. Community Phylogenetics,Molecular Phylogenetics,Phylogenetic Analyses,Phylogenetic Analysis,Phylogenetic Clustering,Phylogenetic Comparative Analysis,Phylogenetic Comparative Methods,Phylogenetic Distance,Phylogenetic Generalized Least Squares,Phylogenetic Groups,Phylogenetic Incongruence,Phylogenetic Inference,Phylogenetic Networks,Phylogenetic Reconstruction,Phylogenetic Relatedness,Phylogenetic Relationships,Phylogenetic Signal,Phylogenetic Structure,Phylogenetic Tree,Phylogenetic Trees,Phylogenomics,Analyse, Phylogenetic,Analysis, Phylogenetic,Analysis, Phylogenetic Comparative,Clustering, Phylogenetic,Community Phylogenetic,Comparative Analysis, Phylogenetic,Comparative Method, Phylogenetic,Distance, Phylogenetic,Group, Phylogenetic,Incongruence, Phylogenetic,Inference, Phylogenetic,Method, Phylogenetic Comparative,Molecular Phylogenetic,Network, Phylogenetic,Phylogenetic Analyse,Phylogenetic Clusterings,Phylogenetic Comparative Analyses,Phylogenetic Comparative Method,Phylogenetic Distances,Phylogenetic Group,Phylogenetic Incongruences,Phylogenetic Inferences,Phylogenetic Network,Phylogenetic Reconstructions,Phylogenetic Relatednesses,Phylogenetic Relationship,Phylogenetic Signals,Phylogenetic Structures,Phylogenetic, Community,Phylogenetic, Molecular,Phylogenies,Phylogenomic,Reconstruction, Phylogenetic,Relatedness, Phylogenetic,Relationship, Phylogenetic,Signal, Phylogenetic,Structure, Phylogenetic,Tree, Phylogenetic
D011110 Polymorphism, Genetic The regular and simultaneous occurrence in a single interbreeding population of two or more discontinuous genotypes. The concept includes differences in genotypes ranging in size from a single nucleotide site (POLYMORPHISM, SINGLE NUCLEOTIDE) to large nucleotide sequences visible at a chromosomal level. Gene Polymorphism,Genetic Polymorphism,Polymorphism (Genetics),Genetic Polymorphisms,Gene Polymorphisms,Polymorphism, Gene,Polymorphisms (Genetics),Polymorphisms, Gene,Polymorphisms, Genetic
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D000818 Animals Unicellular or multicellular, heterotrophic organisms, that have sensation and the power of voluntary movement. Under the older five kingdom paradigm, Animalia was one of the kingdoms. Under the modern three domain model, Animalia represents one of the many groups in the domain EUKARYOTA. Animal,Metazoa,Animalia
D012984 Software Sequential operating programs and data which instruct the functioning of a digital computer. Computer Programs,Computer Software,Open Source Software,Software Engineering,Software Tools,Computer Applications Software,Computer Programs and Programming,Computer Software Applications,Application, Computer Software,Applications Software, Computer,Applications Softwares, Computer,Applications, Computer Software,Computer Applications Softwares,Computer Program,Computer Software Application,Engineering, Software,Open Source Softwares,Program, Computer,Programs, Computer,Software Application, Computer,Software Applications, Computer,Software Tool,Software, Computer,Software, Computer Applications,Software, Open Source,Softwares, Computer Applications,Softwares, Open Source,Source Software, Open,Source Softwares, Open,Tool, Software,Tools, Software
D016013 Likelihood Functions Functions constructed from a statistical model and a set of observed data which give the probability of that data for various values of the unknown model parameters. Those parameter values that maximize the probability are the maximum likelihood estimates of the parameters. Likelihood Ratio Test,Maximum Likelihood Estimates,Estimate, Maximum Likelihood,Estimates, Maximum Likelihood,Function, Likelihood,Functions, Likelihood,Likelihood Function,Maximum Likelihood Estimate,Test, Likelihood Ratio
D059645 Mutation Rate The number of mutations that occur in a specific sequence, GENE, or GENOME over a specified period of time such as years, CELL DIVISIONS, or generations. Mutation Frequency,Frequencies, Mutation,Frequency, Mutation,Mutation Frequencies,Mutation Rates,Rate, Mutation,Rates, Mutation

Related Publications

Dominik Schrempf, and Bui Quang Minh, and Arndt von Haeseler, and Carolin Kosiol
January 2018, Algorithms for molecular biology : AMB,
Dominik Schrempf, and Bui Quang Minh, and Arndt von Haeseler, and Carolin Kosiol
March 1996, Molecular biology and evolution,
Dominik Schrempf, and Bui Quang Minh, and Arndt von Haeseler, and Carolin Kosiol
June 2011, Environmental and molecular mutagenesis,
Dominik Schrempf, and Bui Quang Minh, and Arndt von Haeseler, and Carolin Kosiol
February 2020, Journal of theoretical biology,
Dominik Schrempf, and Bui Quang Minh, and Arndt von Haeseler, and Carolin Kosiol
March 1995, Molecular biology and evolution,
Dominik Schrempf, and Bui Quang Minh, and Arndt von Haeseler, and Carolin Kosiol
December 2023, Nature genetics,
Dominik Schrempf, and Bui Quang Minh, and Arndt von Haeseler, and Carolin Kosiol
February 2024, ArXiv,
Dominik Schrempf, and Bui Quang Minh, and Arndt von Haeseler, and Carolin Kosiol
October 2009, Journal of computational biology : a journal of computational molecular cell biology,
Dominik Schrempf, and Bui Quang Minh, and Arndt von Haeseler, and Carolin Kosiol
February 2006, Journal of molecular evolution,
Dominik Schrempf, and Bui Quang Minh, and Arndt von Haeseler, and Carolin Kosiol
December 1994, Proceedings of the National Academy of Sciences of the United States of America,
Copied contents to your clipboard!