Review on the Application of Machine Learning Algorithms in the Sequence Data Mining of DNA. 2020

Aimin Yang, Wei Zhang, Jiahao Wang, Ke Yang, Yang Han, Limin Zhang
College of Science, North China University of Science and Technology, Tangshan, China.

Deoxyribonucleic acid (DNA) is a biological macromolecule whose main function is information storage. Advances in sequencing technology have caused DNA sequence data to grow at an explosive rate, pushing the study of DNA sequences into the era of big data. Machine learning is a powerful technique for analyzing large-scale data, learning automatically from the data to gain knowledge. It has been widely used in DNA sequence data analysis and has produced many research achievements. This review first introduces the development of sequencing technology and explains the concepts of DNA sequence data structure and sequence similarity. We then analyze the basic process of data mining, summarize several major machine learning algorithms, and set out the challenges that machine learning algorithms face in mining biological sequence data, along with possible future solutions. Next, we review four typical applications of machine learning to DNA sequence data: DNA sequence alignment, DNA sequence classification, DNA sequence clustering, and DNA pattern mining. We analyze the biological background and significance of each application and systematically summarize recent developments and potential problems in the field of DNA sequence data mining. Finally, we summarize the content of the review and look ahead to research directions for future work.
