A novel method to accurately calculate statistical significance of local similarity analysis for high-throughput time series. 2018

Fang Zhang, and Ang Shan, and Yihui Luan
School of Mathematics, Shandong University, Jinan, 250100, P.R. China.

In recent years, a large number of time series microbial community data has been produced in molecular biological studies, especially in metagenomics. Among the statistical methods for time series, local similarity analysis is used in a wide range of environments to capture potential local and time-shifted associations that cannot be distinguished by traditional correlation analysis. Initially, the permutation test is popularly applied to obtain the statistical significance of local similarity analysis. More recently, a theoretical method has also been developed to achieve this aim. However, all these methods require the assumption that the time series are independent and identically distributed. In this paper, we propose a new approach based on moving block bootstrap to approximate the statistical significance of local similarity scores for dependent time series. Simulations show that our method can control the type I error rate reasonably, while theoretical approximation and the permutation test perform less well. Finally, our method is applied to human and marine microbial community datasets, indicating that it can identify potential relationship among operational taxonomic units (OTUs) and significantly decrease the rate of false positives.

UI MeSH Term Description Entries
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm
D015233 Models, Statistical Statistical formulations or analyses which, when applied to data and found to fit the data, are then used to verify the assumptions and parameters used in the analysis. Examples of statistical models are the linear model, binomial model, polynomial model, two-parameter model, etc. Probabilistic Models,Statistical Models,Two-Parameter Models,Model, Statistical,Models, Binomial,Models, Polynomial,Statistical Model,Binomial Model,Binomial Models,Model, Binomial,Model, Polynomial,Model, Probabilistic,Model, Two-Parameter,Models, Probabilistic,Models, Two-Parameter,Polynomial Model,Polynomial Models,Probabilistic Model,Two Parameter Models,Two-Parameter Model
D056186 Metagenomics The systematic study of the GENOMES of assemblages of organisms. Community Genomics,Environmental Genomics,Population Genomics,Genomics, Community,Genomics, Environmental,Genomics, Population
D030541 Databases, Genetic Databases devoted to knowledge about specific genes and gene products. Genetic Databases,Genetic Sequence Databases,OMIM,Online Mendelian Inheritance In Man,Genetic Data Banks,Genetic Data Bases,Genetic Databanks,Genetic Information Databases,Bank, Genetic Data,Banks, Genetic Data,Data Bank, Genetic,Data Banks, Genetic,Data Base, Genetic,Data Bases, Genetic,Databank, Genetic,Databanks, Genetic,Database, Genetic,Database, Genetic Information,Database, Genetic Sequence,Databases, Genetic Information,Databases, Genetic Sequence,Genetic Data Bank,Genetic Data Base,Genetic Databank,Genetic Database,Genetic Information Database,Genetic Sequence Database,Information Database, Genetic,Information Databases, Genetic,Sequence Database, Genetic,Sequence Databases, Genetic

Related Publications

Fang Zhang, and Ang Shan, and Yihui Luan
January 2013, Bioinformatics (Oxford, England),
Fang Zhang, and Ang Shan, and Yihui Luan
January 2019, BMC bioinformatics,
Fang Zhang, and Ang Shan, and Yihui Luan
April 2014, Journal of chemical information and modeling,
Fang Zhang, and Ang Shan, and Yihui Luan
June 2021, Entropy (Basel, Switzerland),
Fang Zhang, and Ang Shan, and Yihui Luan
June 2015, Scientific reports,
Fang Zhang, and Ang Shan, and Yihui Luan
September 2023, Briefings in bioinformatics,
Fang Zhang, and Ang Shan, and Yihui Luan
April 2008, Bioresource technology,
Copied contents to your clipboard!