Prediction of Skin Disease Using Ensemble Data Mining Techniques and Feature Selection Method-a Comparative Study. 2020

Anurag Kumar Verma, and Saurabh Pal, and Surjeet Kumar
MCA Department, VBS Purvanchal University, Jaunpur, 222002, Uttar Pradesh, India.

Nowadays, skin disease is a major problem among peoples worldwide. Different machine learning techniques are applied to predict the various classes of skin disease. In this research paper, we have applied six different machine learning algorithm to categorize different classes of skin disease using three ensemble techniques and then a feature selection method to compare the results obtained from different machine learning techniques. In the proposed study, we present a new method, which applies six different data mining classification techniques and then developed an ensemble approach using bagging, AdaBoost, and gradient boosting classifiers techniques to predict the different classes of skin disease. Further, the feature importance method is used to select important 15 features which play a major role in prediction. A subset of the original dataset is obtained after selecting only 15 features to compare the results of used six machine learning techniques and ensemble approach as on the whole dataset. The ensemble method used on skin disease dataset is compared with the new subset of the original dataset obtained from feature selection method. The outcome shows that the dermatological prediction accuracy of the test dataset is increased compared with an individual classifier and a better accuracy is obtained as compared with subset obtained from feature selection method. The ensemble method and feature selection used on dermatology datasets give better performance as compared with individual classifier algorithms. Ensemble method gives more accurate and effective skin disease prediction.

UI MeSH Term Description Entries
D011237 Predictive Value of Tests In screening and diagnostic tests, the probability that a person with a positive test is a true positive (i.e., has the disease), is referred to as the predictive value of a positive test; whereas, the predictive value of a negative test is the probability that the person with a negative test does not have the disease. Predictive value is related to the sensitivity and specificity of the test. Negative Predictive Value,Positive Predictive Value,Predictive Value Of Test,Predictive Values Of Tests,Negative Predictive Values,Positive Predictive Values,Predictive Value, Negative,Predictive Value, Positive
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D000069550 Machine Learning A type of ARTIFICIAL INTELLIGENCE that enable COMPUTERS to independently initiate and execute LEARNING when exposed to new data. Transfer Learning,Learning, Machine,Learning, Transfer
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm
D001499 Bayes Theorem A theorem in probability theory named for Thomas Bayes (1702-1761). In epidemiology, it is used to obtain the probability of disease in a group of people with some characteristic on the basis of the overall rate of that disease and of the likelihood of that characteristic in healthy and diseased individuals. The most familiar application is in clinical decision analysis where it is used for estimating the probability of a particular diagnosis given the appearance of some symptoms or test result. Bayesian Analysis,Bayesian Estimation,Bayesian Forecast,Bayesian Method,Bayesian Prediction,Analysis, Bayesian,Bayesian Approach,Approach, Bayesian,Approachs, Bayesian,Bayesian Approachs,Estimation, Bayesian,Forecast, Bayesian,Method, Bayesian,Prediction, Bayesian,Theorem, Bayes
D012871 Skin Diseases Diseases involving the DERMIS or EPIDERMIS. Dermatoses,Skin and Subcutaneous Tissue Disorders,Dermatosis,Skin Disease
D057225 Data Mining Use of sophisticated analysis tools to sort through, organize, examine, and combine large sets of information. Text Mining,Mining, Data,Mining, Text

Related Publications

Anurag Kumar Verma, and Saurabh Pal, and Surjeet Kumar
September 2021, Interdisciplinary sciences, computational life sciences,
Anurag Kumar Verma, and Saurabh Pal, and Surjeet Kumar
December 2020, Medical & biological engineering & computing,
Anurag Kumar Verma, and Saurabh Pal, and Surjeet Kumar
April 2019, Asian Pacific journal of cancer prevention : APJCP,
Anurag Kumar Verma, and Saurabh Pal, and Surjeet Kumar
March 2022, Medical & biological engineering & computing,
Anurag Kumar Verma, and Saurabh Pal, and Surjeet Kumar
January 2018, International journal of nanomedicine,
Anurag Kumar Verma, and Saurabh Pal, and Surjeet Kumar
December 2023, Computational biology and chemistry,
Anurag Kumar Verma, and Saurabh Pal, and Surjeet Kumar
November 2023, Diagnostics (Basel, Switzerland),
Anurag Kumar Verma, and Saurabh Pal, and Surjeet Kumar
January 2013, PloS one,
Anurag Kumar Verma, and Saurabh Pal, and Surjeet Kumar
September 2023, Heliyon,
Copied contents to your clipboard!