0% found this document useful (0 votes)
79 views16 pages

3 Abbas2016

The document presents a machine learning approach for detecting malaria parasites in Giemsa-stained thin blood smear images. The proposed methodology uses color segmentation and adaptive Gaussian mixture modeling to identify parasites. It improves quantification of red blood cells by splitting occlusions using distance transform and local maxima. Infected and uninfected red blood cells are then classified to grade parasitemia. The method achieves 98% sensitivity and 97% specificity compared to ground truth, outperforming other state-of-the-art approaches. This provides a low-cost solution for large-scale malaria parasitemia quantification.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
79 views16 pages

3 Abbas2016

The document presents a machine learning approach for detecting malaria parasites in Giemsa-stained thin blood smear images. The proposed methodology uses color segmentation and adaptive Gaussian mixture modeling to identify parasites. It improves quantification of red blood cells by splitting occlusions using distance transform and local maxima. Infected and uninfected red blood cells are then classified to grade parasitemia. The method achieves 98% sensitivity and 97% specificity compared to ground truth, outperforming other state-of-the-art approaches. This provides a low-cost solution for large-scale malaria parasitemia quantification.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 16

Neural Comput & Applic

DOI 10.1007/s00521-016-2474-6

ORIGINAL ARTICLE

Machine aided malaria parasitemia detection in Giemsa-stained


thin blood smears
Naveed Abbas1 • Tanzila Saba2 • Dzulkifli Mohamad3 • Amjad Rehman4 •

Abdulaziz S. Almazyad4,5 • Jarallah Saleh Al-Ghamdi2

Received: 24 December 2015 / Accepted: 6 July 2016


 The Natural Computing Applications Forum 2016

Abstract Malaria parasitemia is the quantitative mea- out on image dataset with respect to ground truth data,
surement of the parasites in the blood to grade the degree of determining the degree of infection with the sensitivity of
infection. Light microscopy is the most well-known 98 % and specificity of 97 %. The accuracy and efficiency
method used to examine the blood for parasitemia quan- of the proposed scheme in the context of being automatic
tification. The visual quantification of malaria parasitemia were proved experimentally, surpassing other state-of-the-
is laborious, time-consuming and subjective. Although art schemes. In addition, this research addressed the pro-
automating the process is a good solution, the available cess with independent factors (RBCs’ morphology).
techniques are unable to evaluate the same cases such as Eventually, this can be considered as low-cost solutions for
anemia and hemoglobinopathies due to deviation from malaria parasitemia quantification in massive
normal RBCs’ morphology. The main aim of this research examinations.
is to examine the microscopic images of stained thin blood
smears using a variety of computer vision techniques, Keywords Malaria Parasitemia  Thin blood smears 
grading malaria parasitemia on independent factors (RBC’s Machine aided  Features mining
morphology). The proposed methodology is based on
inductive approach, color segmentation of malaria parasites
through adaptive algorithm of Gaussian mixture model 1 Introduction
(GMM). The quantification accuracy of RBCs is improved,
splitting the occlusions of RBCs with distance transform Malaria is a serious infectious disease caused by genus
and local maxima. Further, the classification of infected plasmodium, a blood parasite injected by female Anophe-
and non-infected RBCs has been made to properly grade les mosquito into the human body. According to the World
parasitemia. The training and evaluation have been carried Health Organization’s annual malaria report 2013 [1],
malaria takes the life of a child every 45 min. The plas-
modium attacks the RBCs, which are blood components.
& Tanzila Saba Parasitemia, the quantitative measurement of the parasites
[email protected]; [email protected]
in blood, is used to check the severity of malaria [2]. For
1
Department of Computer Science, Islamia College Peshawar, this purpose, visual quantification through light microscopy
Peshawar, KPK, Pakistan is still the most prevalent and commonly practiced method
2
College of Computer and Information Science, Prince Sultan because of its availability and economical methods of
University, Riyadh 11586, Saudi Arabia testing [3, 4]. The microscopy examination of malaria
3
Faculty of Computing, Universiti Teknologi Malaysia, involves two types of slides, i.e., thick and thin blood
81310 Skudai, Johor, Malaysia smears. The thick blood smear tests are mainly used in
4
College of Computer and Information Systems, Al-Yamamah malaria to test for the presence or absence of plasmodium
University, Riyadh 11512, Saudi Arabia in the blood. The thin blood smear tests are used for
5
College of Computer and Information Sciences, King Saud detailed examination of malaria such as quantification of
University, Riyadh 12372, Saudi Arabia parasitemia, specie identification and life cycle

123
Neural Comput & Applic

classification. According to the recommendations of WHO In contrast, inductive approach is bottom-up or the
in [3] and revised version in 2004 [5], the thin blood smear deductive approach in reverse [43–45]. Both of these
must be examined under 70–100 windows while diagnos- approaches suffered from RBCs’ morphology dependent
ing via the microscopy of malaria; the number of infected factors. However, inductive approach is better because it
RBCs will be counted among 100 RBCs in each window. has more freedom to select morphology independent fac-
Physicians frequently ask for the thin blood smear test in tors. The color, intensities, shape, size, area, radius and all
severe stages of malaria under microscope through visual other morphology-related factors of RBCs are highly
quantification, which has proven to be too laborious, time- variable factors in different patients. In the same way, we
consuming, and the results are often erroneous due to the cannot depend on the morphology of parasites except for
massive number of on-going examinations [4]. color. Moreover, in the literature, we also discovered
The rest of this paper is arranged as follows. Section 2 another serious problem of occluded RBCs. The occluded
discusses the research background and current challenges, RBCs problem was addressed by few studies on the same
Sect. 3 presents the proposed methodology framework, morphological dependent grounds, while the majority
Sect. 4 reports experimental results analysis, discussion ignore the problem altogether.
and finally Sect. 5 concludes the paper. The deductive approach adopted by [19] and [7] is
seriously affected in the presence of dense occluded RBCs.
The study in [19] is also dependent on area granulometry
2 Research background for RBC size (only constant when RBC is healthy) esti-
mation, and there is no specification for occlusions of
This section summarizes the categories of the tools and RBCs. The authors of [25, 26] trained SVM and PCA for
techniques presented in the literature for the task of classification, while for features extraction they also relied
quantification of malaria parasitemia and its grading. For on morphology dependent factors, i.e., area and radius. In
detailed study of the existing tools and techniques, inter- addition to the study of [25] was dependent on bimodal
ested readers are referred to the survey reported in [2]. histogram. Further, both mentioned studies have no clear
A noticeable number of studies addressed the automatic approach on how to address the occlusions of RBCs. The
malaria parasitemia quantification. Majority of them tried studies [6], [27] and [8] are dependent on the circularity of
to resolve problems such as image luminance, low contrast, RBCs (detection through circular hough transform). The
poor illumination and out of focus images in the prepro- circularity of RBCs is very sensitive and can be disturbed
cessing step [37–40]. However, most of these problems due to exertion of even slight pressure on the slide during
were resolved due to the advent of high-quality imaging preparation. For features extraction, the consideration of
tools. The well-known techniques employed by the fixed area, radius and edges is the cases suited to normal
majority of the studies as preprocessing steps are: His- RBCs but due to malaria and other diseases theses features
togram equalization (HE) [14–17], brightness preserving alter frequently [13]. However, these are adopted by
dynamic HE (BPDHE) [18], smallest univalue segment majority of the research studies such as [7, 10, 11].
assimilating nucleus (SUSAN) [19], smoothing the image Moreover, authors in [7] also counted the number of
through Median filter and edge preservation through infected RBCs based on the number of parasites which is
Laplacian [11, 20] and in the same way, but for edge not acceptable in medical cases, as authors in [13] stated
preservation, the authors of [21–23] used unsharp masking. that the infected RBC will be counted one, regardless of the
The underlying study considered image smoothing through number of parasites in it. The occluded RBCs problem is
median filter of kernel size [3 9 3], high kernel size will addressed in the work mentioned in [10], through the
remove the parasites particularly in their initial stages and method developed in [28], but in dense occluded RBCs the
for edges preservation of red blood cells and parasites an method will affect the accuracy. The segmentation of
unsharp masking is used. The selection of these two RBCs based on nucleic approach exposes the problem, as
methods has been made on the basis of their positive RBCs have no nuclei, and the studies considered the par-
results, experimented on 74 images of the standard dataset asites as nuclei. The studies based on nucleic approach will
obtained from [24]. be seriously affected when the RBCs become really
Further, according to the literature survey, we can nucleated, such as when the RBCs life span is near the end,
broadly divide the adapted methodologies previously for or the RBCs are highly matured. The nucleic approach is
automatic malaria diagnosis or parasitemia estimation into followed by several researches reported in [29, 30] and in
two deductive and inductive approaches [41, 42]. Deduc- [31] segmentation of RBCs. The segmentation based on
tive approach is a top-down strategy starting with the chromatin dots offers no surety that on the basis of maxi-
foreground and background separation, followed by red mum and minimum intensity levels that they will be the
blood cell segmentation. Finally, the parasites are studied. same in all images, and in addition, these studies are highly

123
Neural Comput & Applic

susceptible to noise. On the same grounds the studies of 2.1 Research challenges
[32, 33] addressed the segmentation of the parasites. In
addition, single RBCs may have noisy chromatin dots, Automatic tools and techniques introduced previously
single dots are not considered by experts as parasites, false provide better solutions for the mentioned problems, but
results will be reported and accuracy will be at risk [46]. mostly deal with dependent factors of RBC’s morphology

Fig. 1 Framework of the


proposed technique to
determine the degree of malaria
parasitemia in Giemsa-stained
thin blood smears

123
Neural Comput & Applic

[2]. For example, the circularity of RBCs is not universal which is less distributed in the image, will be the eligible
case and the majority of previous studies, reported in [6–8] color of the feature. In this regard, the most suitable prob-
considered RBCs as round or elliptical in shape and red in abilistic approach is Gaussian mixture model with expec-
color. Size, area and any other fixed geometrical factors of tation maximization to determine the mean, weight and co-
RBCs are also risk factors, true only in normal situations variance of the colors distributed in the image as presented
[9] and have been considered in [7, 10, 11] as well. A slight in the equation:
deviation from the proposed models of these studies will fWk ; lk ; CVk g; 8kcopts 2 Color ð2Þ
abruptly reduce the accuracy and efficiency and may even
generate no response in some cases. In addition, occluded where {Wk, lk, CVk} is the weight, mean and co-variance
RBCs are also a serious issue that have not been properly matrices of kth color component. The color components are
addressed in the past. The term occlusion is used because determined with the Gaussian mixture model by assigning
of clumping and overlapping RBCs [12]. Clump means to the pixel through the normal distribution probability as
glue, and RBCs glued to each other in the form of long mentioned in Eq. (3)
chains, an indication of iron deficiency (common in Wk Nðfx jlk ; CVk Þ
malaria) in the blood. Overlapped RBCs are formed due to Pðkjfx Þ ¼ P ð3Þ
k Wk Nðfx jlk ; CVk Þ
inappropriate slide preparation. The occluded RBCs affect
the accuracy in terms of malaria parasitemia [12]. Malaria where k is the color values in a group vector, Wk are the
P
parasitemia is the percentage ratio of infected RBCs to all weights given as ( Kk=1WK = 1) and fx(W1, …, WK;
RBCs present on the slide [13]. f1, …, fK).
iRs Next, the spatial variance is calculated from horizontal
% MP ¼  100 ð1Þ and vertical variances of the kth color components, which
aRs
are presented in Eqs. (4) and (5), respectively.
where iRs and aRs represent the number of infected RBCs
1 X
and the number of all RBCs in a single window, Vv ðkÞ ¼ Pðkjfx Þjyv  Mv ðkÞj2 ð4Þ
respectively. jY jk y

To cope with all these challenges, we proposed a P


where Mv ðkÞ ¼ jY1j y Pðkjfx Þyv
k
methodology that improves the accuracy and efficiency X
1
based on independent factors regarding RBCs’ morphology. Vh ðkÞ ¼ Pðkjfx Þxh  Mh ðkÞj2 ð5Þ
jXjk x

3 Proposed methodology

The proposed methodology is divided into three sections:


Features extraction, splitting occluded RBCs and malaria
parasitemia grading. Pictorial description of the proposed
methodology is shown in Fig. 1.

3.1 Features extraction

Thin blood smear images suffer from various issues


because feature selection criteria of the corresponding
patients vary from slide to slide. The aim of this study is to
discover an efficient procedure for the selection of most
informative feature descriptors that could be found easily
in all the slides of malaria parasitemia and then retaining
these features descriptors as a solid foundation in which to
determine the malaria parasitemia. Literature study and
discussion with the corresponding field experts revealed
that the selection of suitable feature descriptors based on
color information plays a vital role in parasite segmentation Fig. 2 Parasite segmentation with the proposed technique (slight
dilation is applied for clear visibility). The first two columns (left)
stained with Giemsa, overcoming the issues related to
consist of original input image and parasites segmented images with
segmentation accuracy. The criterion for the selection of the proposed technique, while the last column (right) contains ground
suitable feature descriptors based on color is that the color, truth data marked by medical experts

123
Neural Comput & Applic

1
P
where Mv ðkÞ ¼ jYjk y Pðkjfx Þyv , where yv and xh are y- The weighted feature color is also normalized to the
coordinate and x-coordinate of the pixel x, while |Y|k and range [0, 1].
P P Results with the proposed technique are presented for
|X|k are given as |Y|k = yP(k|fx) and |X|k = xP(k|fx),
respectively. visual inspection and compared with the ground truth images
The total variance of a color component k is given as: marked by medical experts as shown in Fig. 2. The verifi-
cation has been made by another panel of medical experts
V ðkÞ ¼ Vv ðkÞ þ Vh ðkÞ: ð6Þ from Saidu Medical College Swat, KPK, Pakistan. The
Further, we normalized V(k) to the range [0, 1] as, segmented features are slightly dilated for clear visibility.
As the parasite in its initial stages is in the form of
ðV ðkÞ  mink V ðkÞÞ
V ðkÞ ¼ : ð7Þ threads and can span an area of at least 50 pixels (empir-
ðmaxk V ðkÞ  minkV ðkÞÞ ically checked by experimenting on more than 45 images
Thus, the weighted sum of color spatial-distribution feature out of 74), the small areas are identified as noise and
Fs(x, f) is defined as: removed from the image. After segmentation of parasites
X both the original and the resulted image having parasites
Fs ðx; f Þ / Pðkjfx Þ  ð1  V ðkÞÞ: ð8Þ are converted to binary form for further processing.

Fig. 3 Presents the separation process, a Presents input original image, b presents the binary image of the original, c presents the single RBCs
and d presents the separated occluded RBCs

123
Neural Comput & Applic

3.2 Occluded red blood cells splitting infected). The accuracy of quantifying RBCs (infected and
non-infected) mainly suffered with occlusions (clumps and
The precise grading of malaria parasitemia depends on the overlaps of RBCs). The splitting of occlusions process
accurate quantification of RBCs (infected and non- needs to be designed in a way to save processing time on

Fig. 4 Overall process after separation of occluded RBCs from single drawing, e presents the mapping of drawn circles on the initial points
RBCs, a presents the image of occluded RBCs, b distance transform of the boundaries of the occluded RBCs and f presents the final
of image presented in a, c presents local maxima of the occluded mapped and cleaved RBCs in constituents number
RBCs, d presents the centroids of the occluded RBCs for circles

123
Neural Comput & Applic

highly accurate grounds. Following procedure, we per- jXj


X jXj
X
formed preprocessing steps to ensure the efficiency, i.e., ai xi jð8u : ai  0Þ& ai ¼ 1 ð9Þ
checking for the presence of occluded RBCs and the sep- i¼1 i¼1

aration of occluded RBCs from single RBCs in the image. where |X| = finite set of points, xi is point |X|, while ai is
weight assigned to xi, the sum of the weights must be equal
3.2.1 Checking for occluded RBCs to 1 mean normalized.
AreaRBC ¼ No: of Pels ð10Þ
We double checked for the presence of occluded RBCs,
i.e., median area check and median elongation check. First, where no. of pixels = pixels defining the convex hull
we find the convex hulls of all the RBCs present under the object of the RBCs.
current window through Eq. (9). We find the areas and LRBC
elongation of the convex hulls through Eqs. (10)–(12), ElongationRBC ¼ ð11Þ
BRBC
respectively. Using these two measures, we find a nor-
malize variance among all the RBCs. Through experi- where LRBC is the major axis and BRBC is the minor axis of
mentation, we found that if the variance is higher than 0.2 each convex hull (RBCs).
in case of area and higher than 0.5 in case of elongation ðX  lÞ2
then the occluded RBCs will exist and vice versa. r2 ¼ ð12Þ
N
Fig. 5 Occluded RBCs
splitting through the proposed
technique, a, c and e are original
images, while b, d and e are the
results obtained by drawing the
circles on the centroid positions
obtained through local maxima
from distance transform and
then mapping of the circles with
the occlusion to obtain the
actual cleaved number of RBCs

123
Neural Comput & Applic

3.2.3 Splitting the occluded RBCs


where X represents the area in one case and elongation in
the other and N is the number of terms in distribution.
After separation of single and occluded RBCs, the image of
occluded RBCs is further considered for splitting into
3.2.2 Separation of single and occluded RBCs
single cleaved RBCs. In splitting the occluded RBCs, we
first determine the distance transformed of the occluded
Once it is decided that occluded RBCs exist and then the
RBCs image and then we find the local maxima. After
next step is to separate them from single RBCs. In the same
finding the local maxima, we find the centered maxima and
way, in the separation, we again applied the double check
consider these maxima as a center points for the circles.
mentioned in Eqs. (9)–(12). We consider median among
Then, we draw the circles with mid-point circle drawing
many central tendency measures for the purpose that the
algorithm. Each circle after drawing is mapped with the
median is the best central tendency measure when the data
occluded and through slight erosion; we separate the single
values are irregular and have both small and large values.
RBC from the occlusion. In this way, the process is con-
We divide the area of every convex hull of RBC with the
tinued up to the number of central maxima and we collect
median area, and obtained results near or equal to 1 are
the resulting separated single RBCs in a separate image,
considered as single RBCs and are included in mask of
which is the output image. The whole process is depicted in
single RBC. On the other hand, the obtained results greater
Fig. 4.
than 1 are considered as multi-RBCs and are included in
The proposed technique for occluded RBCs splitting
multi-RBCs mask. Then, we pass the single RBCs mask
when applied on different images has shown good results.
into the pixel IDX_list of the input image and obtained the
The basic concept is taken from watershed transform, but
image for single RBCs. In the same way, we pass the multi-
as watershed suffers from over and under segmentation in
RBCs mask to obtain the image for occluded RBCs.
occlusions. More than four RBCs require too much pro-
Moreover, we performed the second check similarly, but
cessing time as compared to the proposed technique. More
instead of area, we used elongation here. The process of
experimentation results are depicted in Fig. 5.
separation is depicted in Fig. 3.

Fig. 6 Parasite imposition process with proposed technique on images having occluded RBCs. a, d Present original images, b, e have cleaved
occluded RBCs and imposition of parasites on them and c, f present the imposition of parasites on single RBCs

123
Neural Comput & Applic

3.3 Malaria parasitemia grading 3.3.2 Identifying infected RBCs

For malaria parasitemia grading, we have performed the As all the RBCs are separated and single, identifying the
following steps. infected RBCs is needed for counting. We used one specific
quality of infected RBCs, considering the outer boundaries
3.3.1 Imposition of segmented parasites of all the RBCs and encircling those with green that are
infected on the basis that if the parent, or outer boundary has
The parasites, which are segmented in the first step, are child boundary. RBCs having no child boundary are con-
imposed on the single RBCs after splitting the occluded sidered as non-infected RBCs. From medical literature an
RBCs into single RBCs if the occlusions existed otherwise RBC, in case of malaria, is considered infected based on the
this step will be followed directly after segmentation of presence of plasmodium in it. An infected RBC having many
parasites. The imposition of parasites is needed for the plasmodium parasites will count as one infected RBC. The
purpose of identifying the infected RBCs and counts their visual results for this phase are shown in Figs. 7, 8.
number to estimate the percentage malaria parasitemia.
The imposition of parasites process is just simply the 3.3.3 Segmentation of infected RBCs
addition of the two binary images, i.e., the one which has
single RBCs, while the other having segmented parasites as In segmentation of infected RBCs, we followed the same
they were in opposite signs to cancel the effects of noise concept as we did in the identification. We took an empty
and any other artifact. The visual results for inspection are binary image of the same size in which all RBCs (infected
presented in Fig. 6. and non-infected) are present. Then, we highlight those

Fig. 7 Parasite imposition on images having no occluded RBCs. a. c Input images, while b, d are the resultant after imposition of parasites

123
Neural Comput & Applic

Fig. 8 Identification of infected RBCs. a, c Present original images, while b, d present infected RBCs highlighted with the red boundaries

Fig. 9 Process of infected RBCs segmentation. a Is original binary image, b The empty image with areas highlighted as the infected RBCs area,
c present infected RBCs, resulted through proposed technique and finally d contains all non-infected RBCs

123
Neural Comput & Applic

Fig. 10 Segmentation process of the infected RBCs in slide images infected RBCs existed in the single RBCs, c, f are the non-infected
having occluded RBCs. a, f Represent the input images to this module RBCs in the cleaved RBCs. e, h Present the non-infected RBCs in the
while b presents the infected RBCs in the cleaved RBCs. d, g Present single RBCs

areas with (1’s) which we identified in the image having all RBCs. Having the total count of infected and non-infected
RBCs (infected and non-infected). Adding the image in RBCs, the percentage of malaria parasitemia ratio can be
which areas are highlighted to the image having both estimated by using the formula described in Eq. (1).
infected and non-infected RBCs resulted in an image
having infected RBCs. The whole process is visualized in 3.3.6 Malaria parasitemia grading
Fig. 9, while the results are presented in Fig. 10.
According to the book at [34] and to the study [35, 36], the
3.3.4 Counting infected and non-infected RBCs percentage of malaria parasitemia should be examined in
100–200 windows and can be graded to one of the fol-
Following segmentation, counting infected and non-infected lowing grades or levels listed in Table 2.
RBCs is a simple task. For automatic counting, we used Finally, malaria parasitemia is graded to the mentioned
MATLAB built-in function ‘bwlabel’. The RBCs segmenta- levels in Table 2. Further, for testing purpose, we assumed
tion and counting results are shown in Figs. 11,12 and Table 1. 40000 RBCs per window and estimated the results based
on this assumption with the result of each image because
3.3.5 Estimation of percentage malaria parasitemia each image is a single window.

Malaria parasitemia is the percentage ratio of infected


RBCs to all RBCs present on each window in a slide. 4 Results analysis and discussion
According to the recommendations of WHO [2, 3], the
percentage of malaria parasitemia ratio must be estimated We performed a quantitative analysis to validate the
on the basis of observing 80-100 windows, each with 100 effectiveness of the proposed framework.

123
Neural Comput & Applic

Fig. 11 Segmentation process of the infected RBCs in slide images having all single RBCs. a, d Represent the original input images, b, e present
the infected RBCs while c, f present the non-infected RBCs

4.1 Ground truth data preparation 4.3 Quantitative evaluation of the proposed
occluded RBCs splitting technique
The images obtained from DPDx [16] were printed as
forms and distributed among three pathologists. Each form We first check the relationship of counting red blood cells
has a single image of thin blood smear and its manually (automatically after occlusions splitting and manually
estimated statistics and marking of the parasites in the made by the experts) through Pearson’s correlation coef-
image. These forms are verified by another panel of three ficient. The relationship between the two variables is
medical experts. The data collection has been made in shown in Fig. 13. For the same purpose, we also performed
Department of Pathology, Saidu Medical College, Saidu the confusion matrix based-precision, recall and F-measure
Sharif Swat, KPK, Pakistan. with Eqs. (15), (16) and (16) through the confusion matrix
in Table 3.
4.2 Inter-rater agreement Tp
Precision ¼ ð14Þ
Tp þ Fp
The collected data are first checked for inter-rater relia- Tp
bility agreement through a variation of Cohen’s Kappa Recall ¼ ð15Þ
T p þ Fn
(Two Raters) called Fleiss’ Kappa through Eq. (13).
Precision  Recall
P0  P0e F-measure ¼ 2  ð16Þ
j¼ ð13Þ Precision þ Recall
1  P0e
P 0 P where Tp = correctly counted as red blood cells, Tn =
where P0 ¼ N1 Ni¼1 Pi and Pe ¼ Nj¼1 p2j , N = total num- correctly counted as non-red blood cells, Fp = in-correctly
ber of subjects and i, j = 1,2,3,…,N, k represents subjects counted as red blood cells and Fn = in-correctly counted
and categories, respectively. The Fleiss’ Kappa calculation as non-red blood cells
for the collected data is j ¼ 0:96, which shows strongly The achieved precision, recall and F-measure by
reliable data. counting the RBCs after splitting the occluded RBCs with

123
Neural Comput & Applic

Fig. 12 Counting process results. a, d, g Original input images, b, e, h are images labeled as infected RBCs, while images c, f, i are labeled as
non-infected RBCs

Table 1 Complete statistics after examination of single thin blood the proposed technique are 0.973766, 0.989544 and
smear image 0.985951, respectively.
Complete statistics
4.4 Statistical analysis of the overall framework
All red blood cells 32
Infected RBC 7 Moreover, in the same way, we performed the overall
Non-Infected RBC 26 results of percentage malaria parasitemia estimation
Percentage MP 100 RBC 21.212122 through Pearson’s correlation coefficient to find the
%MP 40 K RBC 8484.848485 relationship between manually and automatically esti-
Grade C mated percentage malaria parasitemia depicted in
Fig. 14. Confusion matrix based-sensitivity and

123
Neural Comput & Applic

Table 2 Percentage of malaria


Malaria parasitemia (%) Parasites/ll Remarks Grade
parasitemia grading. Source:
Compiled from [35, 36] 0.0001–0.0004 5–20 Sensitive A
0.002 100 Patient may have symptoms below this level B
0.2 10,000 Level above which immunes show symptoms C
2 100,000 Maximum Parasitemia D
2–5 [100,000 Hyper-Parasitemia/Severe Malaria E
10 =500,000 Exchange transfusion may be considered F

Fig. 13 Graph present correlation between manually and automati-


cally (by splitting occluded RBCs via distance transform and circles
drawing) counted RBCs Fig. 14 Correlation between automatic and manual malaria para-
sitemia estimation

Table 3 Confusion matrix


Table 4 Performance comparison of automatic malaria parasitemia
Confusion matrix Detected
Authors Year Dataset Sensitivity Specificity
Positive Negative
[7] 2011 DPDx 85.5 % NM
Actual
[10] 2007 DPDx R2 = 0.951
Positive Tp Fn
[11] 2011 DPDx 93.12 % 93.17 %
Negative Fp Tn
[32] 2012 DPDx 94.87 % 97.3 %
[33] 2009 DPDx 83 % 98 %
specificity are determined with Eqs. (17) and (18). We Proposed framework 2014 DPDx 98.01 % 97.10 %
noted the strength of the proposed techniques’ correct Proposed framework 2014 DPDx R2 = 0.981
acceptance of infected RBCs as infected through sensi-
tivity and correct rejection of non-infected as non-in-
fected RBCs through specificity.
4.5 Comparison of the proposed framework
Tp with other techniques
Sensitivity ¼ : ð17Þ
Tp þ Fn
Tn The proposed framework is compared with other tech-
Specificity ¼ : ð18Þ niques on the same grounds, i.e., on the same image dataset
Tn þ Fp
and number of images. Automatic malaria parasitemia
Using Eqs. (17) and (18) by following the mentioned estimation or grading is rich in the literature, but mostly the
rules the achieved sensitivity by the proposed framework is experimentations were made on in vitro slides of own
0.98013, while the specificity achieved is 0.9711. datasets and results are compared in Table 4.

123
Neural Comput & Applic

5 Conclusion 10. Sio SW et al (2007) MalariaCount: an image analysis-based


program for the accurate determination of parasitemia. J Micro-
biol Methods 68(1):11–18
We developed a framework for grading malaria para- 11. Savkare S, Narote S (2011) Automatic detection of malaria par-
sitemia in thin blood smear digital images on grounds, asites for estimating parasitemia. Int J Comput Sci Secur 5(3):310
which are more nearer to universality and improved the 12. Tek FB, Dempster AG, Kale I (2006)Malaria parasite detection in
peripheral blood images. BMVC
efficiency. The color of the parasites is the only unique
13. DPDx. Determination of malaria parasitemia. 2013 [cited 2014
property which is the same in all thin blood smear digital 7-4-2014]; Available from: http://www.cdc.gov/dpdx/resources/
images. However, the approach suffers from noise, but we pdf/benchAids/malaria/Parasitemia_and_LifeCycle.pdf
addressed it by canceling its effect in binary completely. 14. Purwar Y et al (2011) Automated and unsupervised detection of
malarial parasites in microscopic images. Malar J 10(1):364
The accuracy increase is also due to proper and indepen-
15. Sheeba F, et al (2011) Segmentation of peripheral blood smear
dent grounds addressing of the occluded RBCs splitting. images using tissue-like p systems. In: Bio-inspired computing:
The efficiency is increased due to accomplishing the sub- sixth IEEE international conference on theories and applications
steps of the framework in suitable ways and mostly in (BIC-TA)
16. Sheeba F, et al (2013) Detection of plasmodium falciparum in
binary to save the processing time. The overall framework
peripheral blood Smear images. In: Proceedings of seventh
is organized carefully because the study is dealing with the international conference on bio-inspired computing: theories and
most important entity, i.e., health. applications (BIC-TA 2012). Springer
17. Poomcokrak J, Neatpisarnvanit C (2008) Red blood cells
Acknowledgments This work was supported by the Research Center extraction and counting. In: The third international symposium on
of College of Computer and Information Sciences, King Saud biomedical engineering
University Riyadh 11372, Saudi Arabia. The authors are grateful for 18. Mandal S, et al (2010) Segmentation of blood smear images using
this support. normalized cuts for detection of malarial parasites. In: India
conference (INDICON), 2010 Annual IEEE
Authors’ contribution The main contribution in this research is to 19. Ahirwar N, Pattnaik S, Acharya B (2012) Advanced image
examine the microscopic images of stained thin blood smears using analysis based system for automatic detection and classification
a variety of computer vision techniques, grading malaria parasitemia of malarial parasite in blood images. Int J Inf Technol 5(1):59–64
on independent factors (RBC’s morphology). We proposed a novel 20. Wang H, Zhang H, Ray N (2011) Clump splitting via bottleneck
methodology based on inductive approach, color segmentation of detection. In: 18th IEEE international conference on in image
malaria parasites through adaptive algorithm of Gaussian mixture processing (ICIP)
model. The quantification accuracy of RBCs is improved by split- 21. Mohapatra S, Patra D (2010) Automated cell nucleus segmenta-
ting the occlusions of RBCs with distance transform and local tion and acute leukemia detection in blood microscopic images.
maxima. In: IEEE conference on systems in medicine and biology
(ICSMB)
Compliance with ethical standards 22. Mohapatra S, Patra D, Kumar K (2011) Blood microscopic image
segmentation using rough sets. In: IEEE conference on image
Conflict of interest The authors have no conflict of interests to information processing (ICIIP), 2011
report. 23. Mohapatra S, et al (2011) Fuzzy based blood image segmentation
for automated leukemia detection. In: IEEE international con-
ference on in devices and communications (ICDeCom)
24. DPDx, DPDx, Laboratory identification of parasites, Centers of
References diseases control and prevention, D.a.m.b.C.s.D.o.P.D.a.M.
(DPDM), Editor. 2002, Web site developed and maintained by
1. WHO (2013) Annual malaria report, W.H. Organization, Editor CDC’s Division of Parasitic Diseases and Malaria (DPDM): 1600
2. Tek FB, Dempster AG, Kale I (2009) Computer vision for Clifton Rd. Atlanta, GA 30333, USA
microscopy diagnosis of malaria. Malar J 8(1):153 25. Linder N et al (2014) A malaria diagnostic tool based on com-
3. WHO (1991) Basic malaria microscopy: part I. Learner’s guide: puter vision screening and visualization of plasmodium falci-
part II. Tutor’s guide, 1st edn. World Health Organization parum candidate areas in digitized blood smears. PLoS ONE
4. Kettelhut M et al (2003) External quality assessment schemes 9(8):e104855
raise standards: evidence from the UKNEQAS parasitology 26. Opoku-Ansah J et al (2014) Wavelength markers for Malaria
subschemes. J Clin Pathol 56(12):927–932 (Plasmodium Falciparum) infected and uninfected red blood cells
5. WHO (2004) Basic malaria microscopy part 1, 2nd edn. pp 1–88 for ring and trophozoite stages. Appl Phys Res 6(2):p47
6. Walliander M et al (2013) Automated segmentation of blood cells 27. Berge H, et al (2011) Improved red blood cell counting in thin
in Giemsa stained digitized thin blood films. Diagn Pathol blood smears. In: IEEE international symposium on biomedical
8(Suppl 1):S37 imaging: from Nano to Macro, 2011
7. Khan MI et al (2011) Content based image retrieval approaches 28. Kumar S et al (2006) A rule-based approach for robust clump
for detection of malarial parasite in blood images. Int J Biom splitting. Pattern Recogn 39(6):1088–1098
Bioinform 5(2):97 29. Kumarasamy SK, Ong S, Tan KS (2011) Robust contour recon-
8. Guan PP, Yan H (2011) Blood cell image segmentation based on struction of red blood cells and parasites in the automated iden-
the hough transform and fuzzy curve tracing. In: International tification of the stages of malarial infection. Mach Vis Appl
conference on machine learning and cybernetics (ICMLC) 22(3):461–469
9. Tek FB, Dempster AG, Kale I (2010) Parasite detection and 30. Khawaldeh BAI (2013) Developing a computer-based informa-
identification for automated thin blood film malaria diagnosis. tion system to improve the diagnosis of blood anemia. In:
Comput Vis Image Underst 114(1):21–32 Department of computer information systems faculty of

123
Neural Comput & Applic

information technology. Middle East University, Amman, Jordan, script recognition. Neural Comput Appl 25(6):1337–1347.
p 116 doi:10.1007/s00521-014-1618-9
31. Zou L-H, et al (2010) Malaria cell counting diagnosis within 40. Norouzi A, Rahim MSM, Altameem A, Saba T, Rada AE,
large field of view. In: international conference on digital image Rehman A, Uddin M (2014) Medical image segmentation
computing: techniques and applications (DICTA) methods, algorithms, and applications. IETE Tech Rev. doi:10.
32. Somasekar J (2011) Computer vision for malaria parasite clas- 1080/02564602.2014.906861
sification in erythrocytes. Int J Comput Sci Eng 3(6):2251–2256 41. Neamah K, Mohamad D, Saba T, Rehman A (2014) Discrimi-
33. Makkapati VV, Rao RM (2009) Segmentation of malaria para- native features mining for offline handwritten signature verifi-
sites in peripheral blood Smear images. In: IEEE international cation. 3D Res. doi:10.1007/s13319-013-0002-3
conference on acoustics, speech and signal processing 42. Rehman A, Saba T (2014) Features extraction for soccer video
34. Hänscheid T, Valadas E, Grobusch M (2000) Automated malaria semantic analysis: current achievements and remaining issues.
diagnosis using pigment detection. Parasitol Today 16(12):549–551 Artif Intell Rev 41(3):451–461. doi:10.1007/s10462-012-9319-1
35. Homel M, Gilles HM (1998) Malaria. In: Colliet L, Balows A, 43. Saba T, Rehman A (2012) Machine learning and script recogni-
Sussman M (eds) Microbiology and microbial, infections, 9th tion. Lambert Academic publisher, Saarbrueken, pp 56–68
edn. Topley & Wilson’s, Arnold 44. Joudaki S, Mohamad D, Saba T, Rehman A, Al-Rodhaan M, Al-
36. Iyar DRBK Malaria Diagnostics. 2013 [cited 2014 7-04-2014]; Dhelaan A (2014) Vision-based sign language classification: a
Available from: http://www.slideshare.net/iyerbk/malaria-diag directional review. IETE Tech Rev 31(5):383–391. doi:10.1080/
nostics. pp. 56–62. doi:10.1179/1743131X13Y.0000000063 02564602.2014.961576
37. Saba T, Rehman A (2012) Effects of artificially intelligent tools 45. Muhsin ZF, Rehman A, Altameem A, Saba T, Uddin M (2014)
on pattern recognition. Int J Mach Learn Cybernet 4:155–162. Improved quadtree image segmentation approach to region infor-
doi:10.1007/s13042-012-0082-z mation. Imaging Sci J 62(1):56–62. doi:10.1179/1743131X13Y.
38. Rehman A, Saba T (2014) Neural network for document image 0000000063
preprocessing. Artif Intell Rev 42(2):253–273. doi:10.1007/ 46. Saba T, Al-Zahrani S, Rehman A (2012) Expert system for offline
s10462-012-9337-z clinical guidelines and treatment. Life Sci J 9(4):2639–2658
39. Saba T, Rehman Amjad, Altameem Ayman, Uddin Mueen (2014)
Annotated comparisons of proposed preprocessing techniques for

123

You might also like