Skin_Cancer_Detection_using_Deep_Learning
Skin_Cancer_Detection_using_Deep_Learning
Abstract— Identifying mel anoma at the early stages of utilized the k-means clustering algorithm as well as the
di agnosis is imperati ve as early detection can Support Vector Machine(SVM ) so as to determine a certain
exponentially increase one's chances of cure. The paper classification method for the types of melanoma. Abbas et
first proposes a literature survey of multi ple methods al. [3], and Germán C et al. [7] looked into AdaBoost MC to
used for performing skin cancer classification. Our create an image classificat ion method of classificat ion of
methodology consists of using Convolutional Neural different kinds of cancers and skin lesions image
Network (CNN) to i dentify and di agnose the skin cancer classification methods separately. Michael B et al. [23], and
using the IS IC dataset containing 2637 images. The Vicente B et al. [5] factored into the account that lesion
proposed model gi ves an accuracy of 88% for classifying texture, colour, affected area and degree of damage and
the training dataset as either benign or malignant. developed a system of support for the decision for the
differentiation of melano ma. An Android application was
Keywords—Convolutional Neural Networks, Deep also developed by Kiran et al. [20] fo r the melanoma
Learning Models, Image Processing, Support Vector determination in a clin ical approach. In this paper we have
Machine, K Means Clustering decided to employ a CNN technique on the ISIC dataset
fro m Kaggle within the Keras python library in order to
classify the given input or data as either malignant or
benign.
Melanoma is identified by the changing or unusual on S. R. Guha et al. [20] - The author started off the research
both sun-exposed and sun-protected areas of the body. with the intent of studying the existing accuracy of a certain
Melanomas usually appear on the legs of women, and the skin disease detection algorithm that used CNN and wished
most common area of occurrence on men is the trunk. to expand the accuracy which could have been obtained.
Therefore, it is important to keep in mind, that melano mas Since the goal was determined, the writers had moved on to
can arise anywhere on the skin, even in areas where the sun preprocess the input image to detect the lesion part of the
doesn’t shine. The recurring method for skin cancer skin and wrench out the features of the afo rementioned part.
classification has been variations of Convolutional Neural two of the chosen models were prepared and compared and
Networks. Deep Learning models are mo re beneficial as they claimed to obtain better accuracy with the transfer
they require no hu man intervention wh ile performing image learning technique of the model 'VGG16'. The model has
processing, feature ext raction and has a significant been compared to show accuracy against existing
advantage over other strategies like Machine Learning architecture models wh ich have been selected for reference.
algorith ms wh ich can solve well-structured problems. But they had come to the conclusion of an 11% greater accuracy
for more co mplex data applications we require Deeper rate to the advantage of the transfer learning method and
Architectures. The automated machines and systems used further developments have been said to be in development
for the classification and differentiat ion of the diseases like to fine-tune such a model.
melano mas and Anas et al. [16] The research that was built
on the specified paper published by Hitoshi et al.[9] by the A. W. Setiawan [23]- The author had taken the inspiration
classification of melanomas into various different of the effects of UV lights rays which penetrate the ozone
classification (four to be exact), Almansour et al. [ 6] had layer and attack the hu man skin, which is one of the ways
humans get skin cancer. The writer had decided to develop a of Natural Language Processing or NLP for short such that
simp le image processing tool to detect early signs of cancer. there exists a user interface with the patients and their
The technique the author chose to use included image treatment head whenever required through the internet. The
improvement using MSRCR and CLAHE as a contradiction chosen data model had yielded an accuracy rate of 75% with
to a convoluted neural network. The result shows us that the the chosen input data set and deployment of the app would
former is more suitable in color enhancement of pictures to allo w further investigations and required data through back
detect early signs using convoluted neural networks but the tested data.
margins of co mparison 0.8476 & 0.8161 for the enhanced
datasets against the normal dataset did not prove to change Keun-Kwang et al. [6] The premise of this paper is to
the accuracy of validation or training. However, it d id prove devise a highly efficient CNN for classifying skin cancer.
to make the computational load allo wing it to be easily The above study classifies ECOC SVM into 4 different
portable to a smaller device. types of skin cancer. It uses an already existing AlexNet
CNN model in order to mine the features. The results
H. K. Kondaveeti et al. [8]- Th is paper has introduced the showed a tremendous accuracy for each kind of skin cancer.
technique of transfer learning in neural networks as a The highest values for maximu m accuracy was found while
method of train ing them to detect early signs of detection of classifying the squamous cell carcino ma(95%) and lo west
cancer. The proposed models in a HAM 10000 data samp le while classifying melano ma (90.74%). This study when
were given by data models of MobileNet, Inception V3, and compared with previous research showed higher results for
such. The altercations to the class imbalance had been made sensitivity, accuracy as well as specificity.
an attempt to resolve it by the augmentation of input data
and the adjustments to its bias weights. After such Yi-Xin Cai et al. [20] suggests a new and improved
methodologies were taken and tested, the selected model version of CNN which co mbines CNN and a whale
which is the ResNet50 had yielded a mu lti-class precision optimization algorith m wh ich was conceived by Lewis and
rate of about 90% (with weighted recall and accuracy being Mirjalili. Th is technique combines the high efficiency of
90% and 89% respectively). The application of such models CNNs with an optimizat ion algorithm to fu rther enhance the
within the first layer of the networks helps us to understand efficiency of the diagnosis. The whale algorith m was
how convoluted neural networks interpret images and how inspired by the behavior of hu mpback whales when trapping
they can be modelled after to present a better yielding their prey. The performance of th is novel technique was
output. The paper also introduced a way to reduce noise and analyzed against two primary parameters i.e DermIS and
external disturbances from input data before it is processed. Dermquest. The subsequent results revealed that the
This method helped to achieve higher performance o f the proposed method achieved the best results when compared
said model and saves on computational resources. with 10 d ifferent methods. Some of the methods in the
comparison were ResNet, AlexNet, Ord inary CNN, semi-
A. A. Ibrahim et al. [10]- The primary objective of the supervised method and Spot-mole.
Presented thesis work presented by the authors was to find
and build appropriate and acceptable CNN models which
are to be used for the detection of skin cancer post-training. III.CONVOLUTIONA L NEURAL NETWORKS
A large part of the research done by the authors was on
convoluted neural networks. To obtain a trained model with
CNN with the desired arch itectures of the author, the A Multifunctional network which can be applied in many
parameters such as the batch size, dropout value and epochs, different areas, A convolutional neural network is a certain
and other variables were ad justed to meet specificat ions. type of artificial neural network wh ich is primarily used to
process pixel data and such image data. In itself, it is a
The best-observed data model(for the given dataset) with an
multilayer perceptron made in a particular manner to
accuracy of 85% was retrieved by Xception. The authors
accommodate picture and image-related info rmation
also pointed out the risk of poor computational power and classification and processing. Inclusive
quality of images wh ich hinder the working and precision of to the prior fact, the architecture of a deep neural network
the convoluted neural network and hence can cause an error includes multip le layers to process more information and do
and extend the time required to train the dataset. The authors more co mp lex functionalities at the expense of
have moved on to implement computational power. Regardless of the restriction other
such work with real-life applications but might produce data model machines face, CNN is not as constricted for
devalued accuracy. adjoining layers of the connections in-between them. They
share weights and work the feel of a part icular local area of
Y. Huo [24] -The author of the presented thesis had the image data set. There are two p rimary object ives for any
emp loyed the use of convoluted neural networks to find given convolutional neural network, which are depicted in
early signs of cancer to t reat it with ease. The application o f Fig 3.1.
the CNN by the author required a Keras CNN model
through the TensorFlow, Java Script to classify and detect (i) A data set selection process, also referred to as
the processed image. the model had a backend built on sampling. We obtain 'n' number of data points(which in our
Exp ress JavaScript and the front-end GUI was implemented case would be pixels) for each of the neighborhood through
by Swift user interface. The data set introduced as the input a series of steps known as pooling. These steps start off with
value was one of HAM10000 with integrated functionalities identifying the pixel and then adding a bias before
weighting them by scalar property 'W x + 1'. Finally, the whether or not to fire a neuron resides upon the activation
activation function comes into play to produce a narro w 'n ' function. The most frequently used ones happen to be
times feature map. Sig mo id, ReLu and Tan H. The output is restrained to values
between 0 & 1 for all sig moid function inputs which happen
(ii) We employ the use of a trainable filter for the to be real numbers. TanH on the other hand is like sig mo id,
reconstruction of the given data set to be entered after wh ich but the output for this function is between -1 and 1. ReLU is
we add our personal bias to retrieve our final convolution widely considered as the most used activation function
layer. because of its Lower co mputational load. The end of the
convoluted neural network has a co mpletely covered layer.
Since the layer consists of neurons, each interconnected
with each other in the prior layer of the network. This
henceforth mimics an approach in wh ich it is fully
connected and is used as a classifier for CNN. The final
output layer is also shown to us by this output layer. The
output layer also emp loys the use of some loss functions to
find out the percentage of predicted error in the entire
sample in the model which had been made. This erro r
represents the increasing normalization of the actual and
predicted output. After which we finally use 'Re lu' to rectify
as an activation function. It is given to add non -linearity to
the system. The final output probability of each class will be
found to be taken in the final layer. The results fro m the
models used had yielded a 86% accuracy for our data
model. For the model in which we used 50 epochs with a
Fig 3.1: Convolution Neural Networks target size of (150,150) and a batch size of 32. It had
compared the pred icted probability to the actual class output
The advantages of the employment for our
in a binary cross-entropy to show an output of
implementation include the fully connected layers, which
allo w the computation of highly co mplex problems. It can 0.86(predicted) and a 1(actual). Given that our model
also be used for image and ob ject classification, self-driving showed consistent accurate results after execution, it proved
cars, etc. And yield very h igh-performance rates and to yield high efficiency and performance.
efficiency.
VI. RESULTS
[5] Dorj, UO., Lee, KK., Choi, JY. et al., “The skin
cancer classificat ion using deep CNN”, Published:
22nd Feb, 2018.