Monophyletic classification and information content. 2020

James E Hayden
Division of Plant Industry, Florida Department of Agriculture and Consumer Services, Gainesville, FL, 32608, USA.

The connection between monophyly and efficient taxonomic diagnoses is elaborated. The inefficiency of nonmonophyletic groups is shown by reconstructing data matrices from hierarchical sets of diagnoses that are derived from apomorphies and read in order from highest to lowest rank. The practice of diagnosing nonmonophyletic groups either results in omitting data, resulting in errors in reconstructed datasets, or repeating character information to make up for the implied losses. Step-by-step demonstrations with hypothetical and real data are used as guidance. Provisions are made for missing, inapplicable and polymorphic data. Slow optimization (delayed transformation) is useful for choosing a state reconstruction in order to report apomorphies completely. The diagnoses of paraphyletic groups can be expressed in different ways, including regrafting derived clades, reanalyzing data with constraints, and reading the original diagnoses in a different order--the last is the least efficient. A cladistic version of the data compression ratio is proposed to quantify the diagnostic efficiency of a cladogram.

UI MeSH Term Description Entries
D002965 Classification The systematic arrangement of entities in any field into categories classes based on common characteristics such as properties, morphology, subject matter, etc. Systematics,Taxonomy,Classifications,Taxonomies
D044962 Data Compression Information application based on a variety of coding methods to minimize the amount of data to be stored, retrieved, or transmitted. Data compression can be applied to various forms of data, such as images and signals. It is used to reduce costs and increase efficiency in the maintenance of large volumes of data. Image Compression,Compression, Data,Compression, Image

Related Publications

James E Hayden
October 1981, Circulation,
James E Hayden
August 2001, Current protocols in human genetics,
James E Hayden
February 2013, Acta crystallographica. Section D, Biological crystallography,
James E Hayden
February 2013, Acta crystallographica. Section F, Structural biology and crystallization communications,
James E Hayden
April 2018, Current protocols in human genetics,
James E Hayden
December 2013, Molecular phylogenetics and evolution,
James E Hayden
September 1984, Science (New York, N.Y.),
James E Hayden
April 2001, Physical review. E, Statistical, nonlinear, and soft matter physics,
James E Hayden
November 2013, Physical review. E, Statistical, nonlinear, and soft matter physics,
Copied contents to your clipboard!