Computer Aided Breast Cancer Detection Using Ensembling of Texture and Statistical Image Features. 2021

Soumya Deep Roy, and Soham Das, and Devroop Kar, and Friedhelm Schwenker, and Ram Sarkar
Department of Metallurgical and Material Engineering, Jadavpur University, Kolkata 700032, India.

Breast cancer, like most forms of cancer, is a fatal disease that claims more than half a million lives every year. In 2020, breast cancer overtook lung cancer as the most commonly diagnosed form of cancer. Though extremely deadly, the survival rate and longevity increase substantially with early detection and diagnosis. The treatment protocol also varies with the stage of breast cancer. Diagnosis is typically done using histopathological slides from which it is possible to determine whether the tissue is in the Ductal Carcinoma In Situ (DCIS) stage, in which the cancerous cells have not spread into the encompassing breast tissue, or in the Invasive Ductal Carcinoma (IDC) stage, wherein the cells have penetrated into the neighboring tissues. IDC detection is extremely time-consuming and challenging for physicians. Hence, this can be modeled as an image classification task where pattern recognition and machine learning can be used to aid doctors and medical practitioners in making such crucial decisions. In the present paper, we use an IDC Breast Cancer dataset that contains 277,524 images (with 78,786 IDC positive images and 198,738 IDC negative images) to classify the images into IDC(+) and IDC(-). To that end, we use feature extractors, including textural features, such as SIFT, SURF and ORB, and statistical features, such as Haralick texture features. These features are then combined to yield a dataset of 782 features. These features are ensembled by stacking using various Machine Learning classifiers, such as Random Forest, Extra Trees, XGBoost, AdaBoost, CatBoost and Multi Layer Perceptron followed by feature selection using Pearson Correlation Coefficient to yield a dataset with four features that are then used for classification. From our experimental results, we found that CatBoost yielded the highest accuracy (92.55%), which is at par with other state-of-the-art results-most of which employ Deep Learning architectures. The source code is available in the GitHub repository.

UI MeSH Term Description Entries
D001943 Breast Neoplasms Tumors or cancer of the human BREAST. Breast Cancer,Breast Tumors,Cancer of Breast,Breast Carcinoma,Cancer of the Breast,Human Mammary Carcinoma,Malignant Neoplasm of Breast,Malignant Tumor of Breast,Mammary Cancer,Mammary Carcinoma, Human,Mammary Neoplasm, Human,Mammary Neoplasms, Human,Neoplasms, Breast,Tumors, Breast,Breast Carcinomas,Breast Malignant Neoplasm,Breast Malignant Neoplasms,Breast Malignant Tumor,Breast Malignant Tumors,Breast Neoplasm,Breast Tumor,Cancer, Breast,Cancer, Mammary,Cancers, Mammary,Carcinoma, Breast,Carcinoma, Human Mammary,Carcinomas, Breast,Carcinomas, Human Mammary,Human Mammary Carcinomas,Human Mammary Neoplasm,Human Mammary Neoplasms,Mammary Cancers,Mammary Carcinomas, Human,Neoplasm, Breast,Neoplasm, Human Mammary,Neoplasms, Human Mammary,Tumor, Breast
D002285 Carcinoma, Intraductal, Noninfiltrating A noninvasive (noninfiltrating) carcinoma of the breast characterized by a proliferation of malignant epithelial cells confined to the mammary ducts or lobules, without light-microscopy evidence of invasion through the basement membrane into the surrounding stroma. Carcinoma, Intraductal,DCIS,Ductal Carcinoma In Situ,Atypical Ductal Hyperplasia,Intraductal Carcinoma, Noninfiltrating,Atypical Ductal Hyperplasias,Carcinoma, Noninfiltrating Intraductal,Carcinomas, Intraductal,Carcinomas, Noninfiltrating Intraductal,Ductal Hyperplasia, Atypical,Ductal Hyperplasias, Atypical,Hyperplasia, Atypical Ductal,Hyperplasias, Atypical Ductal,Intraductal Carcinoma,Intraductal Carcinomas,Intraductal Carcinomas, Noninfiltrating,Noninfiltrating Intraductal Carcinoma,Noninfiltrating Intraductal Carcinomas
D003201 Computers Programmable electronic devices designed to accept data, perform prescribed mathematical and logical operations at high speed, and display the results of these operations. Calculators, Programmable,Computer Hardware,Computers, Digital,Hardware, Computer,Calculator, Programmable,Computer,Computer, Digital,Digital Computer,Digital Computers,Programmable Calculator,Programmable Calculators
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D000069550 Machine Learning A type of ARTIFICIAL INTELLIGENCE that enable COMPUTERS to independently initiate and execute LEARNING when exposed to new data. Transfer Learning,Learning, Machine,Learning, Transfer
D016571 Neural Networks, Computer A computer architecture, implementable in either hardware or software, modeled after biological neural networks. Like the biological system in which the processing capability is a result of the interconnection strengths between arrays of nonlinear processing nodes, computerized neural networks, often called perceptrons or multilayer connectionist models, consist of neuron-like units. A homogeneous group of units makes up a layer. These networks are good at pattern recognition. They are adaptive, performing tasks by example, and thus are better for decision-making than are linear learning machines or cluster analysis. They do not require explicit programming. Computational Neural Networks,Connectionist Models,Models, Neural Network,Neural Network Models,Neural Networks (Computer),Perceptrons,Computational Neural Network,Computer Neural Network,Computer Neural Networks,Connectionist Model,Model, Connectionist,Model, Neural Network,Models, Connectionist,Network Model, Neural,Network Models, Neural,Network, Computational Neural,Network, Computer Neural,Network, Neural (Computer),Networks, Computational Neural,Networks, Computer Neural,Networks, Neural (Computer),Neural Network (Computer),Neural Network Model,Neural Network, Computational,Neural Network, Computer,Neural Networks, Computational,Perceptron

Related Publications

Soumya Deep Roy, and Soham Das, and Devroop Kar, and Friedhelm Schwenker, and Ram Sarkar
October 2008, Journal of digital imaging,
Soumya Deep Roy, and Soham Das, and Devroop Kar, and Friedhelm Schwenker, and Ram Sarkar
July 2008, Academic radiology,
Soumya Deep Roy, and Soham Das, and Devroop Kar, and Friedhelm Schwenker, and Ram Sarkar
November 2004, Radiology,
Soumya Deep Roy, and Soham Das, and Devroop Kar, and Friedhelm Schwenker, and Ram Sarkar
January 2013, IEEE reviews in biomedical engineering,
Soumya Deep Roy, and Soham Das, and Devroop Kar, and Friedhelm Schwenker, and Ram Sarkar
January 2023, Current medical imaging,
Soumya Deep Roy, and Soham Das, and Devroop Kar, and Friedhelm Schwenker, and Ram Sarkar
June 1997, IEEE transactions on information technology in biomedicine : a publication of the IEEE Engineering in Medicine and Biology Society,
Soumya Deep Roy, and Soham Das, and Devroop Kar, and Friedhelm Schwenker, and Ram Sarkar
August 2006, Seminars in ultrasound, CT, and MR,
Soumya Deep Roy, and Soham Das, and Devroop Kar, and Friedhelm Schwenker, and Ram Sarkar
July 2000, Radiologic clinics of North America,
Soumya Deep Roy, and Soham Das, and Devroop Kar, and Friedhelm Schwenker, and Ram Sarkar
December 2009, Academic radiology,
Soumya Deep Roy, and Soham Das, and Devroop Kar, and Friedhelm Schwenker, and Ram Sarkar
April 2015, Medical physics,
Copied contents to your clipboard!