Skin Cancer Classification Using Image Processing and Machine Learning
Abstract—Skin cancer is one of the most rapidly spreading cancers known to humans. Melanoma is the most dangerous type of skin cancer; it usually appears on the skin surface and then extends deeper into the layers of the skin. However, if it is diagnosed at an early stage, the survival rate of melanoma patients is 96% with simple and economical treatments. The conventional method of diagnosing melanoma involves expert dermatologists, equipment, and biopsies. To avoid this expensive diagnosis and to assist dermatologists, machine learning has been shown to provide state-of-the-art solutions for detecting skin cancer at an earlier stage with high accuracy. In this paper, a method for segmenting skin lesions and classifying them as benign or malignant is proposed using image processing and machine learning. A novel method of contrast stretching of dermoscopic images, based on the mean and standard deviation of pixel values, is proposed. The Otsu thresholding algorithm is then applied for image segmentation. After segmentation, gray-level co-occurrence matrix (GLCM) features for texture identification, histogram of oriented gradients (HOG) features for object identification, and color identification features are extracted from the segmented images. Principal component analysis (PCA) is applied to the HOG features for dimensionality reduction. The synthetic minority oversampling technique (SMOTE) is used to deal with the class imbalance problem. The feature vector is then standardized and scaled. A novel feature selection approach based on the wrapper method is applied before classification. Quadratic Discriminant, SVM (Medium Gaussian), and Random Forest classifiers are used for classification. The proposed approach is verified on the publicly accessible ISIC-ISBI 2016 dataset. The maximum classification accuracy, 93.89%, is achieved with the Random Forest classifier. The proposed contrast stretching before segmentation gives satisfactory segmentation results. Further, the proposed wrapper-based feature selection in combination with the Random Forest classifier gives promising results compared to other commonly used classifiers.

Keywords—Skin lesion segmentation, contrast stretching, features extraction, features reduction, features normalization, features scaling, wrapper method, SMOTE sampling, skin cancer classification, random forest classifier.

I. INTRODUCTION

With rapidly increasing global air pollution and damage to the ozone layer, an alarming share of the human population is diagnosed with skin cancer compared to all other types of cancer combined. Melanoma has a very high death rate compared to other types of skin cancer. The study of the science behind skin cancer reveals that melanin is present in human skin and that melanocytes are the cells in the skin layer that produce melanin. The amount and kinds of melanin produced by melanocytes vary from person to person. In addition to coloring the skin, melanin also protects it from the ultraviolet rays of the sun.

The factors contributing to skin cancer include prolonged exposure to the ultraviolet (UV) rays in direct sunlight, the presence of many or unusual moles, skin type, and a family history of melanoma. The mortality rate of melanoma is usually very high, but if it is diagnosed at an early stage the probability of survival is 99% [1], [2]. In many cases, it is difficult even for expert dermatologists to decide whether a lesion is benign or malignant because malignant lesions closely resemble benign ones. Dermatologists use techniques such as the ABCD rule (Asymmetry, Border, Color, and Diameter) to get better classification accuracy, but human expertise is still required [3].

Frequent use of biopsies is also not encouraged by dermatologists. According to the International Skin Imaging Collaboration, the number of unnecessary biopsies performed varies widely depending on parameters that include the clinical setup, the expertise of the dermatologist, and the technology applied. For illustration, among young people, in whom melanoma rates are very low, 500,000 biopsies a year have been performed to diagnose roughly 400 melanomas [4].

Computer procedures and advances in machine learning not only aid dermatologists in the early detection of melanoma but also avoid the heavy expense of melanoma detection and unnecessary biopsies. Automatic melanoma detection systems save considerable time, money, and effort. Machine learning has been shown to classify melanoma with improved accuracy.

II. RELATED WORKS

Although advances in dermatological equipment have increased the classification accuracy of melanoma, developments in machine learning and image processing have resulted in a medical breakthrough in the diagnosis, detection, and classification of melanoma with much greater accuracy and reliability.

The literature review reveals that different practices have been used to develop computer-aided automatic diagnostic systems for the classification of skin cancer which take
Fig. 1. Block Diagram of system suggested for skin lesion segmentation & classification
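The segmentation stage of the block diagram (contrast stretching followed by Otsu thresholding) can be illustrated with a minimal pure-Python sketch. The exact stretching formula is not given in this section, so the window of mean plus or minus k standard deviations, the default k = 2.0, and the function names stretch_contrast, otsu_threshold, and segment_lesion are assumptions made for illustration only. The sketch also assumes that the lesion is darker than the surrounding skin.

```python
from statistics import mean, pstdev


def stretch_contrast(gray, k=2.0):
    """Linearly map [mean - k*std, mean + k*std] onto [0, 255], clipping outside.

    ASSUMPTION: the window and k=2.0 are illustrative; the paper's exact
    stretching formula is not stated in this section.
    gray is a list of rows of integer intensities in 0..255.
    """
    pixels = [p for row in gray for p in row]
    mu, sigma = mean(pixels), pstdev(pixels)
    lo, hi = mu - k * sigma, mu + k * sigma
    if hi <= lo:  # flat image: nothing to stretch
        return [row[:] for row in gray]

    def remap(p):
        scaled = (p - lo) / (hi - lo) * 255.0
        return int(round(min(255.0, max(0.0, scaled))))

    return [[remap(p) for p in row] for row in gray]


def otsu_threshold(gray):
    """Return the threshold t in 0..255 that maximises between-class variance."""
    hist = [0] * 256
    for row in gray:
        for p in row:
            hist[p] += 1
    total = sum(hist)
    sum_all = sum(i * h for i, h in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = 0    # number of pixels with intensity <= t
    sum0 = 0  # sum of the intensities of those pixels
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        w1 = total - w0
        if w0 == 0 or w1 == 0:
            continue  # one class is empty, variance undefined
        m0, m1 = sum0 / w0, (sum_all - sum0) / w1
        var_between = w0 * w1 * (m0 - m1) ** 2
        if var_between > best_var:
            best_t, best_var = t, var_between
    return best_t


def segment_lesion(gray, k=2.0):
    """Stretch contrast, then label pixels at or below the Otsu threshold as lesion (1).

    ASSUMPTION: the lesion is darker than the surrounding skin.
    """
    stretched = stretch_contrast(gray, k)
    t = otsu_threshold(stretched)
    return [[1 if p <= t else 0 for p in row] for row in stretched]
```

On a real dermoscopic image the same steps would normally be run with an array library rather than nested lists; the list-based version is used here only to keep the sketch self-contained.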
Color features: Mean Hue, Mean Saturation, Mean Value, Variance Hue, Variance Saturation, Variance Value, Kurtosis H, Kurtosis S, Kurtosis V, Skewness H, Skewness S, Skewness V
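The color features listed above (mean, variance, kurtosis, and skewness of the hue, saturation, and value channels) can be computed along the following lines. This is a sketch under stated assumptions, not the authors' implementation: the function names channel_moments and hsv_color_features, the feature key names, the use of population (biased) moments, and the non-excess form of kurtosis are all choices made here for illustration.

```python
import colorsys
from statistics import mean


def channel_moments(values):
    """Return (mean, variance, skewness, kurtosis) of a sequence of numbers.

    ASSUMPTION: population (biased) moments and non-excess kurtosis are used;
    the paper does not say which estimator it applies.
    """
    n = len(values)
    mu = mean(values)
    var = sum((v - mu) ** 2 for v in values) / n
    if var == 0:  # constant channel: higher moments are undefined, report 0
        return mu, 0.0, 0.0, 0.0
    std = var ** 0.5
    skew = sum((v - mu) ** 3 for v in values) / (n * std ** 3)
    kurt = sum((v - mu) ** 4 for v in values) / (n * var ** 2)
    return mu, var, skew, kurt


def hsv_color_features(rgb_pixels):
    """Build a 12-entry color feature dict from lesion pixels.

    rgb_pixels: list of (r, g, b) tuples with components in 0..255,
    ASSUMED to contain only pixels inside the segmented lesion mask.
    """
    hsv = [colorsys.rgb_to_hsv(r / 255, g / 255, b / 255) for r, g, b in rgb_pixels]
    features = {}
    for idx, name in enumerate(("H", "S", "V")):
        mu, var, skew, kurt = channel_moments([p[idx] for p in hsv])
        features[f"mean_{name}"] = mu
        features[f"var_{name}"] = var
        features[f"skew_{name}"] = skew
        features[f"kurt_{name}"] = kurt
    return features
```

Hue is an angular quantity, so the plain arithmetic mean used here ignores the wrap-around between the two ends of the hue scale; whether the original work corrects for this is not stated.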
Fig. 3. Comparison of segmentation results to ground truth: a) Actual image b) Segmented skin lesion c) Segmented lesion masked on actual image d) Ground truth

Classification Algorithm    Accuracy (%)
SVM                         88.17
Quadratic Discriminant      90.84
Random Forest               93.89
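Accuracy figures such as those above, together with the sensitivity, precision, and AUC-ROC that the paper also reports, are all derived from a classifier's predictions on labeled test images. The sketch below shows one way to compute them. It assumes that malignant is the positive class (label 1), and the function names are hypothetical. It does not reproduce the paper's confusion matrix.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fn, fp, tn) for binary labels, with `positive` as the malignant class."""
    tp = fn = fp = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == positive:
            if pred == positive:
                tp += 1
            else:
                fn += 1
        elif pred == positive:
            fp += 1
        else:
            tn += 1
    return tp, fn, fp, tn


def binary_metrics(tp, fn, fp, tn):
    """Accuracy, sensitivity (recall), specificity, and precision from 2x2 counts."""
    def ratio(num, den):
        return num / den if den else 0.0  # avoid division by zero on empty classes

    return {
        "accuracy": ratio(tp + tn, tp + fn + fp + tn),
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "precision": ratio(tp, tp + fp),
    }


def auc_roc(y_true, scores, positive=1):
    """AUC-ROC as the probability that a positive sample outscores a negative one.

    Ties count as half a win. This pairwise form is quadratic in the number
    of samples, which is acceptable for a small illustrative test set.
    """
    pos = [s for y, s in zip(y_true, scores) if y == positive]
    neg = [s for y, s in zip(y_true, scores) if y != positive]
    if not pos or not neg:
        raise ValueError("AUC-ROC needs at least one sample of each class")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))
```

For example, binary_metrics(*confusion_counts(y_true, y_pred)) gives all four ratios for one classifier, so the same call can be repeated for each classifier being compared.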
Fig. 4. Confusion Matrix for ISIC-ISBI-2016

The proposed system gives remarkable results using the Random Forest classifier in terms of accuracy, sensitivity, precision, and AUC-ROC.

[1] D. Schadendorf et al., “Melanoma,” Lancet, vol. 392, no. 10151, pp.
971–984, 2018.
[2] S. Gupta and H. Tsao, “Epidemiology of melanoma,” Pathol.
Epidemiol. Cancer, pp. 591–611, 2016.
[3] F. Nachbar et al., “The ABCD rule of dermatoscopy: high
prospective value in the diagnosis of doubtful melanocytic skin
lesions,” J. Am. Acad. Dermatol., vol. 30, no. 4, pp. 551–559, 1994.
[4] “The International Skin Imaging Collaboration (ISIC).”
https://www.isic-archive.com/#!/topWithHeader/tightContentTop/about/isicArchive
(accessed Oct. 04, 2020).
[5] H. Alquran et al., “The melanoma skin cancer detection and
classification using support vector machine,” 2017 IEEE Jordan
Conf. Appl. Electr. Eng. Comput. Technol. AEECT 2017, vol. 2018-
Janua, pp. 1–5, 2017.
[6] U. B. Ansari and M. E. Student, “Skin Cancer Detection Using
Image Processing Tanuja Sarode 2,” Int. Res. J. Eng. Technol., vol.
4, no. 4, pp. 2395–56, 2017, [Online]. Available:
https://www.irjet.net/archives/V4/i4/IRJET-V4I4702.pdf.
[7] M. A. Khan et al., “An implementation of normal distribution based
segmentation and entropy controlled features selection for skin
lesion detection and classification,” BMC Cancer, vol. 18, no. 1, pp.
1–20, 2018.
[8] V. M. M, “Melanoma Skin Cancer Detection using Image
Processing and Machine Learning,” Int. J. Trend Sci. Res. Dev., vol.