Statistical significance approximation for local similarity analysis of dependent time series data. 2019

Fang Zhang, and Fengzhu Sun, and Yihui Luan
School of Mathematics, Shandong University, Jinan, Shandong, 250100, China.

BACKGROUND Local similarity analysis (LSA) of time series data has been extensively used to investigate the dynamics of biological systems in a wide range of environments. Recently, a theoretical method was proposed to approximately calculate the statistical significance of local similarity (LS) scores. However, the method assumes that the time series data are independent identically distributed, which can be violated in many problems. RESULTS In this paper, we develop a novel approach to accurately approximate statistical significance of LSA for dependent time series data using nonparametric kernel estimated long-run variance. We also investigate an alternative method for LSA statistical significance approximation by computing the local similarity score of the residuals based on a predefined statistical model. We show by simulations that both methods have controllable type I errors for dependent time series, while other approaches for statistical significance can be grossly oversized. We apply both methods to human and marine microbial datasets, where most of possible significant associations are captured and false positives are efficiently controlled. CONCLUSIONS Our methods provide fast and effective approaches for evaluating statistical significance of dependent time series data with controllable type I error. They can be applied to a variety of time series data to reveal inherent relationships among the different factors.

UI MeSH Term Description Entries
D008297 Male Males
D005260 Female Females
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm
D013997 Time Factors Elements of limited time intervals, contributing to particular results or situations. Time Series,Factor, Time,Time Factor
D015233 Models, Statistical Statistical formulations or analyses which, when applied to data and found to fit the data, are then used to verify the assumptions and parameters used in the analysis. Examples of statistical models are the linear model, binomial model, polynomial model, two-parameter model, etc. Probabilistic Models,Statistical Models,Two-Parameter Models,Model, Statistical,Models, Binomial,Models, Polynomial,Statistical Model,Binomial Model,Binomial Models,Model, Binomial,Model, Polynomial,Model, Probabilistic,Model, Two-Parameter,Models, Probabilistic,Models, Two-Parameter,Polynomial Model,Polynomial Models,Probabilistic Model,Two Parameter Models,Two-Parameter Model
D059001 Aquatic Organisms Organisms that live in water. Marine Organisms,Aquatic Organism,Marine Organism,Organism, Aquatic,Organism, Marine,Organisms, Aquatic,Organisms, Marine
D019992 Databases as Topic Works on organized collections of records, standardized in format and content, that are stored in any of a variety of computer-readable modes. Data Banks as Topic,Data Bases as Topic,Databanks as Topic
D064307 Microbiota The full collection of microbes (bacteria, fungi, virus, etc.) that naturally exist within a particular biological niche such as an organism, soil, a body of water, etc. Human Microbiome,Microbiome,Microbiome, Human,Microbial Community,Microbial Community Composition,Microbial Community Structure,Community Composition, Microbial,Community Structure, Microbial,Community, Microbial,Composition, Microbial Community,Human Microbiomes,Microbial Communities,Microbial Community Compositions,Microbial Community Structures,Microbiomes,Microbiotas

Related Publications

Fang Zhang, and Fengzhu Sun, and Yihui Luan
January 2013, Bioinformatics (Oxford, England),
Fang Zhang, and Fengzhu Sun, and Yihui Luan
January 2022, Frontiers in genetics,
Fang Zhang, and Fengzhu Sun, and Yihui Luan
November 2018, Statistical applications in genetics and molecular biology,
Fang Zhang, and Fengzhu Sun, and Yihui Luan
January 2006, International journal of data mining and bioinformatics,
Fang Zhang, and Fengzhu Sun, and Yihui Luan
January 2011, BMC systems biology,
Fang Zhang, and Fengzhu Sun, and Yihui Luan
August 1996, Physical review. B, Condensed matter,
Fang Zhang, and Fengzhu Sun, and Yihui Luan
June 2015, Scientific reports,
Fang Zhang, and Fengzhu Sun, and Yihui Luan
March 2006, Chaos (Woodbury, N.Y.),
Fang Zhang, and Fengzhu Sun, and Yihui Luan
January 1997, Romanian journal of internal medicine = Revue roumaine de medecine interne,
Copied contents to your clipboard!