Detection and Classification of Dental Caries in X-Ray Images Using Deep Neural Networks
Detection and Classification of Dental Caries in X-Ray Images Using Deep Neural Networks
Detection and Classification of Dental Caries in X-ray Images Using Deep Neural
Networks
Abstract—Dental caries, also known as dental cavities, is the Dental radiography is an important element in the oral
most widespread pathology in the world. Up to a very recent health follow up. It comes in addition to the visual
period, almost all individuals had the experience of this examination done by the dentist. The x-ray allows dentist to
pathology at least once in their life. Early detection of dental "see" what is happening inside of the teeth and bones, thanks
caries can help in a sharp decrease in the dental disease rate. to x-rays of low intensity which can cross these structures.
Thanks to the growing accessibility to medical imaging, the The types of radiography most common used by the dentist
clinical applications now have better impact on patient care. are: the retro-alveolar, the bite-wing, the panoramic
Recently, there has been interest in the application of machine radiograph [3].
learning strategies for classification and analysis of image data.
The machine learning is defined as the ability to make an
In this paper, we propose a new method to detect and identify
dental caries using X-ray images as dataset and deep neural
agent learn how to take a decision on the basis of observations
network as technique. This technique is based on stacked sparse [4]. In the biomedical context, the action of this agent is
auto-encoder and a softmax classifier. Those techniques, sparse reflected by additional information to assist the dentists in
auto-encoder and softmax, are used to train a deep neural making his decision. The patient management is found
network. The novelty here is to apply deep neural network to assigned to several steps, either at the level of diagnosis, of
diagnosis of dental caries. This approach was tested on a real the treatment choice, or also in the surgical intervention. In
dataset and has demonstrated a good performance of detection. the framework of this paper, the agent under focus has a role
to classify biomedical images by machine learning with the
Keywords-dental X-ray; classification; Deep Neural intention of discovering clinically pertinent pathology
Networks; Stacked sparse auto-encoder; Softmax. patterns. These classification operations are based on
decision-making tool. However, the inter patients variability
I. INTRODUCTION poses many challenges for the traditional classification
The radiographs are essential to establish a good algorithms. These have for the most part been configured and
diagnosis and identify several problems that are impossible parameterized on small data sets or on a very specific cohort.
to visualize otherwise. During the last decade, the representations learning, a
In orthodontics, dental radiography that is used frequently sub-domain of the machine learning, has experienced a huge
is the panoramic shot which offers a good "overview" of the comeback particularly in the computer vision domain. These
teeth and jaws and provides the essential information for representations algorithms have especially allowed crossing
screening and diagnosis of several conditions and problems a significant step with regards to the objects recognition [7]
which can be detected at an early age. and to speech recognition [8]. In machine learning, the model
Dental caries is an infectious disease. The enamel of the of Artificial Neural Networks (ANN) is a valuable tool.
tooth is the first affected. A cavity forms in the tooth and then Although the ANN, was invented close to sixty years ago, it
the decay spreads in depth. If the cavity is not treated, the hole still remains an area of active research. Recently, with the
expands and decay can affect the dentin (layer under the deep learning, ANN has in fact allowed to dramatic
enamel) [1]. Pain is beginning to be felt, especially with the improvements in many applications fields such as the
hot, cold or sweet things. Decay can affect the pulp of the computer vision. The increasing amount of available data and
tooth. We then speak about a toothache. Finally, a dental the computing power have made it easier to train high
abscess may appear when the bacteria attack the periodontal capacity models such as deep learning. However, the inherent
ligament, the bone or the gum. difficulties involved in training such models, as an example
Cavities are very frequent. More than nine out of ten the local minima, still have an important impact. The deep
people would have had, at least one, tooth decay. In France, learning thus aims to find solutions through adding some
more than a third of 6 year-old children and more than half of regularization or improving the optimization. Unsupervised
12 year-old children have been affected by this infection. In pre-training or dropout are examples of such solutions.
Canada, 57 percent of children aged between 6 and 12 years Our contribution is as follows: we propose a system of
have had at least one tooth decay [2]. detection and classification of dental caries in X-ray images
using deep neural network. This system can be very useful
for dentists to classify dental X-ray images into tooth decay One of the perspectives of the techniques of deep learning
or normal tooth images. A stacked sparse auto-encoder and a is the replacement of work which still is relatively laborious
softmax classifier [8] are used in our deep neural network. by algorithmic models of supervised learning, non-
This paper will be structured as follows: Section 2 supervised (i.e., not requiring specific knowledge of the
presents the recent work. In Section 3, we will describe our problem studied) or by techniques of extraction of
methodology and demonstrate how to train and classify tooth hierarchical characteristics.
images with deep neural networks with a stacked sparse auto- The idea is very simple: the training system consists of a
encoder and a softmax classifier. In Section 4, we will give series of modules, each one represent a processing step. Each
some results of testing experiments. Finally, Section 5 module can be trained, with adjustable parameters similar to
concludes this paper. the weight of the linear classifiers. The system is trained end-
to-end: to each example, all the parameters of all the modules
II. RELATED WORK are adjusted to approximate the output produced by the
Primarily, the detection of dental caries has been a visual system of the desired output. The deep qualifying term comes
process, principally based on visual-tactile examination and from the arrangement of these modules in successive layers
radiographic examination [1]. In the recent literature, several [12].
techniques have been developed for the detection of dental To be able to train the system in this way, it must be
caries. Kositbowornchai et al. [9] developed a neural network known in which direction and how much to adjust each
to detect artificial dental caries using images from a charged parameter of each module. For this, it is necessary to
coupled device (CCD) camera and intra-oral digital calculate a gradient. The calculation of this gradient is done
radiography. The main disadvantage of this method is that the by the method of back-propagation, practised since mid-
evaluation of the system was done using teeth with artificial 1980s. A deep architecture can be viewed as a multilayer
carries, which are completely different from naturally network of simple elements, similar to the linear classifiers,
affected ones. Saravanan et al. [10] developed a new method inter-connected by training weight. This is what is called a
to detect dental caries in its early stage using histogram and neural network multi-layers.
power spectral analysis. In this method, the detection of tooth The advantage of deep architectures stems from their
cavities is done based on the region of concentration of pixels capacity to learn to represent the world in a hierarchical
with regard to the histogram and based on the magnitude manner. As all layers can be trained, no need to build a
values with regard to the spectrum. The main drawback of characteristics extractor by hand. The training will do it [13].
this study is that this method depends only on the intensity of In addition, the first layers extract some simple characteristics
pixels. Berdouses et al. [11] developed a computer-aided and after that the following layers will combine to form more
automated methodology for the detection and classification and more complex concepts.
of occlusal caries from photographic color images. This
method is based on the segmentation of photographic color III. METHODOLOGY
images. In this section, we will describe and motivate how to train
Even though there are many methods for caries detection and classify tooth images with deep neural networks with
in early stage, it is still necessary to develop accurate carries multiple hidden layers. Multiple hidden layers neural
detection method to help dentist. networks can be very useful in solving classification problems
The problem of the classical approach of forms with complex data, such as images. Each layer can learn
recognition is that it is very difficult to build a good features at a different level of abstraction. We will use, in our
characteristics extractor and that it must be readjusted for deep neural, stacked sparse auto-encoders for features
each new application. The deep learning is a class of methods extraction and a softmax layer to classify the teeth images.
whose principles are known since the end of 1980s, but
whose use was really generalized since approximately 2012.
tooth decay
h1
X2 X1
h2 X1
X3 X1
....
S
h3
....
X4 X1
....
X1
....
X1
Input image Xn 1st hidden layer 2nd hidden layer 3rd hidden layer
Figure 3. Shematization of stacked auto-encoder and a softmax classfier is to classify dental X-ray images into tooth decay or normal tooth images
To measure the quality of our classification system, we result, as approved in Table 1. However, the accuracy and
perform the result by the confusion matrix (Table 1). The reliability of our results can be improved using a larger dental
classification test of our deep neural networks shows very database.
good performance with a rate that reaches 97%. The implementation of training strategies on really deep
structures with several hidden layers is a future extension of
TABLE I. QUALITATIVE RESULT OF OUR CLASSIFICATION APPROACH this work.
Accuracy Target Class
97% Tooth Normal Class ACKNOWLEDGMENT
Decay Tooth precision
The authors would like to acknowledge the financial
Tooth Decay 48 % 1% 98%
Output