Lung Cancer Detection by Using Artificial Neural Network and Fuzzy Clustering Methods
Lung Cancer Detection by Using Artificial Neural Network and Fuzzy Clustering Methods
Lung Cancer Detection by Using Artificial Neural Network and Fuzzy Clustering Methods
296
Wqk = Wqk / ∑W
( r =1, Q )
rk , q = 1,..., Q the sensitivity of HNN, the cytoplasm regions were
represented by two clusters. These cytoplasm clusters
. (9) will be merged later if the difference in their mean values
4. Compute new centroids C(k), k = 1,….,K via is not large. Comparing the FCM segmentation result in
C (k ) = ∑W qk X ( q ) , k = 1,..., K (h) to the raw image (d), the nuclei regions are detected,
( q =1, Q )
. (10) but they present a little overlapping in the way that the
two different nuclei may be seen or considered as one
5. Update the weights {W qk } via nucleus, and this can affect the diagnosis results. The
cytoplasm regions are smoother than in the case of HNN,
Wqk =(1/ || xq −ck ||2)1/(p−1) / ∑(1/ || x −c || )
(r=1,K)
q r 2 1/(p−1)
,k =1,..K,q =1,...,Q reflecting that the FCM is less sensitive to the intensity
variation than HNN. The learning error waveforms of the
(11) above comparison and discussion are shown in Figure 2,
6. If there is change in the input, repeat from step where it can be seen that the segmentation error at
3, else terminate. convergence is smaller with HNN than with the FCM.
7. Assign each pixel to a cluster based on the However, the FCM converge fifty iterations earlier than
maximum weight. HNN. Figure 3 (a) shows a sample of sputum color image
We applied the FCM clustering algorithm with the stained with blue dyes, (b) and (c) show the segmentation
specification mentioned above to one thousand sputum results using HNN and the FCM with the RGB
color images and maintain the result for further components of the raw image (a), respectively. As is seen
processing in the following steps. Our algorithm in the segmentation results of both algorithms in (b) and
segments the images into nuclei, cytoplasm regions and (c) the nuclei have not been detected and the background
clear background, however, the FCM is not sensitive to presents a lot of intensity variation. A filter was needed to
intensity variation, therefore, the cytoplasm regions are minimize the effect of the intensity variation in the raw
detected as one cluster when we fixed the cluster number image as described in [10]. The result of this filter is
to three, four, five and six. Moreover, FCM failed in shown in (d). (e) And (f) are the segmentation results
detecting the nuclei; it detected only part of it. By obtained using HNN and FCM with RGB components of
experiment, the FCM algorithm takes less than 50 (d) with three clusters. (g) And (h) the segmentation
iterations to reach the desired results in 10 seconds on results with four clusters. Here, the nuclei have been
average. detected, however a color cluster is missing in the result
of FCM (h). The same applies for the previous case of the
4. ANALYSIS PAHSE red cells. HNN is more sensitive to intensity variation
between nuclei-nuclei or nuclei-cytoplasm regions. This
In this section, we present the result obtained with two is clear in Figure 4, which shows quantitatively, the
sample images; the first sample containing red cells learning error waveforms of HNN and FCM during the
surrounded by a lot of debris nuclei and a background segmentation process of the blue sample.
reflecting a large number of intensity variation in its pixel
values as shown in Figure 1 (a), and the second sample is 5. CONCLUSION
composed of blue stained cells shown in Figure 3 (a). In
Figure 1, (b) and (c) show the segmentation results using In this study, two segmentation processes have been used,
HNN and the FCM with RGB components of the raw the first one was Hopfield Neural Network (HNN), and
image (a), respectively. As is seen in the segmentation the second one was Fuzzy C-Mean (FCM) Clustering
results of both algorithms (b) and (c), the nuclei of the algorithm. It was found that the HNN segmentation
cells were not detected, in the case of HNN in (b), and results are more accurate and reliable than FCM
were not accurately represented in (c). For this reason we clustering in all cases. The HNN succeeded in extracting
developed a filter to extract our regions of interest, the nuclei and cytoplasm regions. However FCM failed in
described in [10], and the result is shown in (d). (e) And detecting the nuclei, instead it detected only part of it. In
(f) show the segmentation results by using HNN and addition to that, the FCM is not sensitive to intensity
FCM with the RGB components of (d). By fixing the variations as the segmentation error at convergence is
cluster numbers to three, respectively, we realized that in larger with FCM compared to that with HNN.
the case of HNN, the nuclei were detected but not The HNN will be used as a basis for a Computer Aided
precisely. In the case of FCM only part of the nuclei has Diagnosis (CAD) system for early detection of lung
been detected. We increased the cluster numbers to four cancer. In the future, we plan to consider a Bayesian
as an attempt to solve the nuclei detection problem. The decision theory for the detection of the lung cancer cells,
results are shown in (g) and (h) for both HNN and FCM, followed by developing a model based on the idea of
respectively. watershed algorithm which combined the idea of edge
Comparing the HNN segmentation result in (g) to the raw detection and region based approach to extract the
image (d), we can say that the nuclei regions were homogeneous tissues represented in the image. As soon
detected perfectly, and also their corresponding as a more extended dataset is available.
cytoplasm regions. However, due to the problem of
intensity variation in the raw image (d) and also due to
297
6. REFERENCES Figure 1. (a) Original raw image stained with red dyes, (b) and
(c) the segmentation results for the image in (a) by using HNN
[1] Dignam JJ, Huang L, Ries L, Reichman M, Mariotto A, and FCM, respectively. (d) The filtered image. (e) And (f) show
Feuer E. “Estimating cancer statistic and other-cause mortality the segmentation results for the filtered image in (d) by using
in clinical trial and population-based cancer registry cohorts”, HNN and FCM, and by fixing the cluster numbers to three,
Cancer 10, Aug 2009. respectively. (g) And (h) the results by fixing the cluster
[2] T. C. Kennedy, Y. Miller and S. Prindiville, “Screening numbers four, respectively.
for Lung Cancer Revisited and the Role of Sputum Cytology
8.00E+20
and Fluorescence Bronchoscopy in a High-Risk Group,” Chest 7.00E+20
Journal, vol. 10, pp. 72-79, 2005. 6.00E+20
Error Values
5.00E+20
[3] Z. Daniele, H. Andrew, J. Nickerson, “Nuclear Structure 4.00E+20
Fuzzy_k=4
NN_k=4
in Cancer Cells,” Nature Reviews Cancer, Medical School, vol. 3.00E+20
7.00E+16
(a) (b) (c) 6.00E+16
5.00E+16
4.00E+16
Error
Fuzzy_Mk3
3.00E+16 NN_k3
2.00E+16
1.00E+16
0.00E+00
1 13 25 37 49 61 73 85 97 109
(g) (h)
298