Application of decision tree-based ensemble learning in the classification of breast cancer. 2021

Mohammad M Ghiasi, and Sohrab Zendehboudi
Faculty of Engineering and Applied Science, Memorial University, St. John's, NL A1B 3X5, Canada. Electronic address: mm.ghiasi@gmail.com.

As a common screening and diagnostic tool, Fine Needle Aspiration Biopsy (FNAB) of the suspicious breast lumps can be used to distinguish between malignant and benign breast cytology. In this study, we first review published works on the classification of breast cancer where the machine learning and data mining algorithms have been applied by using the Wisconsin Breast Cancer Database (WBCD). This work then introduces useful new tools, based on Random Forest (RF) and Extremely Randomized Trees or Extra Trees (ET) algorithms to classify breast cancer. The RF and ET strategies use the decision trees as proper classifiers to attain the ultimate classification. The RF and ET approaches include four main stages: input identification, determination of the optimal number of trees, voting analysis, and final decision. The models implemented in this research consider important factors such as uniformity of cell size, bland chromatin, mitoses, and clump thickness as the input parameters. According to the statistical analysis, the proposed methods are able to classify the type of breast cancer accurately. The error analysis results reveal that the designed RF and ET models offer easy-to-use outcomes and the highest diagnostic performance, compared to previous tools/models in the literature for the WBCD classification. The highest and lowest magnitudes of relative importance are attributed to the uniformity of cell size and mitoses among the factors. It is expected that the RF and ET algorithms play an important role in medicine and health systems for screening and diagnosis in the near future.

UI MeSH Term Description Entries
D001940 Breast In humans, one of the paired regions in the anterior portion of the THORAX. The breasts consist of the MAMMARY GLANDS, the SKIN, the MUSCLES, the ADIPOSE TISSUE, and the CONNECTIVE TISSUES. Breasts
D001943 Breast Neoplasms Tumors or cancer of the human BREAST. Breast Cancer,Breast Tumors,Cancer of Breast,Breast Carcinoma,Cancer of the Breast,Human Mammary Carcinoma,Malignant Neoplasm of Breast,Malignant Tumor of Breast,Mammary Cancer,Mammary Carcinoma, Human,Mammary Neoplasm, Human,Mammary Neoplasms, Human,Neoplasms, Breast,Tumors, Breast,Breast Carcinomas,Breast Malignant Neoplasm,Breast Malignant Neoplasms,Breast Malignant Tumor,Breast Malignant Tumors,Breast Neoplasm,Breast Tumor,Cancer, Breast,Cancer, Mammary,Cancers, Mammary,Carcinoma, Breast,Carcinoma, Human Mammary,Carcinomas, Breast,Carcinomas, Human Mammary,Human Mammary Carcinomas,Human Mammary Neoplasm,Human Mammary Neoplasms,Mammary Cancers,Mammary Carcinomas, Human,Neoplasm, Breast,Neoplasm, Human Mammary,Neoplasms, Human Mammary,Tumor, Breast
D003663 Decision Trees A graphic device used in decision analysis, series of decision options are represented as branches (hierarchical). Decision Tree,Tree, Decision,Trees, Decision
D005260 Female Females
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D000069550 Machine Learning A type of ARTIFICIAL INTELLIGENCE that enable COMPUTERS to independently initiate and execute LEARNING when exposed to new data. Transfer Learning,Learning, Machine,Learning, Transfer
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm

Related Publications

Mohammad M Ghiasi, and Sohrab Zendehboudi
July 2005, Bioinformatics (Oxford, England),
Mohammad M Ghiasi, and Sohrab Zendehboudi
January 2023, IEEE/ACM transactions on computational biology and bioinformatics,
Mohammad M Ghiasi, and Sohrab Zendehboudi
January 2022, Computational intelligence and neuroscience,
Mohammad M Ghiasi, and Sohrab Zendehboudi
January 2014, International journal of data mining and bioinformatics,
Mohammad M Ghiasi, and Sohrab Zendehboudi
January 2011, Advances in experimental medicine and biology,
Mohammad M Ghiasi, and Sohrab Zendehboudi
October 2019, Breast cancer research and treatment,
Mohammad M Ghiasi, and Sohrab Zendehboudi
November 2006, Sheng wu gong cheng xue bao = Chinese journal of biotechnology,
Mohammad M Ghiasi, and Sohrab Zendehboudi
January 2014, Asian Pacific journal of cancer prevention : APJCP,
Copied contents to your clipboard!