Ocular Disease Recognition Using Deep Learning
Abstract— Artificial intelligence has a significant impact on a variety of medical studies, including ophthalmology. Deep learning methods, in particular, have been successful in detecting clinical signs and classifying ocular conditions. Studies reveal ocular diseases to be the major contributing cause of childhood blindness worldwide. Rapid and automatic disease identification is vital and urgent for reducing the workload of ophthalmologists. Ophthalmologists use pattern recognition to identify disorders by looking at the eye and its surrounding tissues directly or indirectly. As a result, automated recognition can greatly benefit the medical field. Each disease has several severity levels that can be identified by confirming the presence of different lesions. Morphological characteristics identify each lesion, and numerous lesions from different diseases have similar characteristics. In ophthalmology, deep learning techniques have mostly been applied to eye fundus images and optical coherence tomography. In this paper, we use three models, namely CNN, Inception V3, and VGG-19, for cataract prediction.

We obtained an accuracy of 0.9587 for VGG-19, which performs best compared with the other models.

Keywords- Clinical Diagnosis, Deep Learning, Image Classification, Neural Network, Ocular disease.

I. INTRODUCTION

According to the World Health Organization (WHO), 2.2 billion people worldwide are blind or have vision loss, with at least 1 billion having a vision impairment that could be corrected [1]. Reports reveal that diabetic eye disease (DED), a chronic eye illness, can cause irreversible vision impairment if left untreated for a long time, and it is estimated to be detected in around one-third of diabetics [2].

Fundus photography can be used to acquire retinal images, which help doctors evaluate lesions or structural changes. Manual identification has been attempted in such cases, but it is slow. Automating these operations can save time and money [3]. The most serious tumors in the eye are iris cataract tumors, commonly known as 'eye tumors.' Cataracts are caused by a change in the tissue that makes up the lens of the eye as a result of age or injury. The lens proteins and fibers continue to break down, resulting in hazy or poor vision [4]. Cataracts can be aggravated by inherited genetic disorders that cause other health problems. The term ocular tumor describes a tumor associated with the eye. It can be intraocular (affecting the inside of the eye) or extraocular (affecting the outside of the eye). Cataracts, diabetic retinopathy, and redness level are the most commonly detected conditions [5]. Computer vision and deep learning have assisted ocular pathology through studies on the automatic recognition of diseases from iris, fundus, or retinal images [6].

Elloumi et al. divide such research into two categories based on the goal: pathology or severity identification, and ocular lesion segmentation. Their analysis of DL-based techniques covers fundus images of ocular diseases, and they mention works that use fundus image databases targeting specific disorders.

These properties have allowed deep learning to outperform standard methods in a variety of computer vision and image analysis tasks. Because of this success, it is now used to analyze medical images, including ophthalmic images [8].

Fundus diseases are the leading cause of blindness worldwide. Common eye illnesses include age-related macular degeneration (AMD), glaucoma, and diabetic retinopathy (DR), among other eye problems. According to projections, there will be 95.4 million people with cataract and 3.36 billion people with myopia by 2030, the latter up from 1.95 billion in 2010 [9]. Furthermore, according to various studies, those who develop myopia before the age of 20 are more prone to cataracts later in life. Although the specific cause is uncertain, some studies suggest that the increased axial length of the eyeball may hinder nutrient transfer to the rear of the lens [10].

We focus on one ocular disease, cataract. First, we extracted the images from the dataset and resized them. Next, we extracted the label indicating whether each eye is affected by cataract. We then trained deep learning models and compared their accuracies to determine the best-performing model. This is discussed in Section IV, Methodology.
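The preprocessing and model-selection steps outlined in the Introduction can be sketched as follows. This is only an illustrative sketch and not the authors' implementation: the helper names (resize_nearest, cataract_label, best_model) are hypothetical, images are represented as plain nested lists, and it is assumed that each eye comes with a free-text diagnostic keyword string from which the cataract label is derived.

```python
def resize_nearest(image, new_h, new_w):
    """Resize a 2D image (list of rows) with nearest-neighbour sampling."""
    h, w = len(image), len(image[0])
    return [
        [image[i * h // new_h][j * w // new_w] for j in range(new_w)]
        for i in range(new_h)
    ]


def cataract_label(diagnostic_keywords):
    """Return 1 if the keyword text mentions cataract, else 0.

    Assumption: labels are derived from a free-text keyword field.
    """
    return 1 if "cataract" in diagnostic_keywords.lower() else 0


def best_model(accuracies):
    """Return (name, accuracy) of the model with the highest accuracy."""
    name = max(accuracies, key=accuracies.get)
    return name, accuracies[name]
```

In practice the resizing would be done with an image library and the accuracies would come from evaluating each trained model on a held-out test set; the functions above only make the order of the steps concrete.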
horized licensed use limited to: Vignan's Foundation for Science Technology & Research (Deemed to be University). Downloaded on June 15,2024 at 04:14:28 UTC from IEEE Xplore. Restrictions ap
A. Convolutional Neural Network
A convolutional neural network (CNN, or ConvNet) has an architecture loosely inspired by the human brain. It consists of layers, each connected to the next through neurons. It uses the convolution operation, which combines two functions to produce a third function that expresses how one is modified by the other.
The task of a ConvNet is to compress images into a more manageable form while preserving the features that matter for a good prediction. Convolutional neural networks are made up of many layers of artificial neurons. Neurons are mathematical functions that compute a weighted sum of their inputs and output an activation value, which determines whether the neuron fires. When an image is fed into a ConvNet, each layer produces several activation maps that are passed on to the next layer.
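As a rough illustration of the convolution and activation steps described above, the following minimal sketch slides a small kernel over a toy image and applies ReLU. It is not the network used in this paper; the function names conv2d_valid and relu, the toy image, and the edge-detecting kernel are assumptions made for illustration. As in most deep learning frameworks, the kernel is applied without flipping (cross-correlation).

```python
def conv2d_valid(image, kernel):
    """Slide `kernel` over `image` (stride 1, no padding) and return the feature map."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    feature_map = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            # Weighted sum of the image patch currently under the kernel
            acc = 0.0
            for di in range(kh):
                for dj in range(kw):
                    acc += image[i + di][j + dj] * kernel[di][dj]
            row.append(acc)
        feature_map.append(row)
    return feature_map


def relu(feature_map):
    """Keep positive activations and set negative ones to zero."""
    return [[max(0.0, v) for v in row] for row in feature_map]


# Toy 3x4 image with a vertical edge between the second and third columns
image = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]
# Illustrative kernel that responds to dark-to-bright vertical edges
kernel = [
    [-1, 1],
    [-1, 1],
]

activation = relu(conv2d_valid(image, kernel))
print(activation)  # [[0.0, 2.0, 0.0], [0.0, 2.0, 0.0]]
```

The non-zero column in the output marks where the kernel pattern matches the image, which is the sense in which a layer passes an activation map to the next layer.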
B. VGG-19
VGG-19 has 19 weight layers: 16 convolutional layers with 3x3 receptive fields and three fully connected layers. The convolutional layers are grouped into five blocks, each followed by a 2x2 max-pooling layer. The three fully connected layers follow the final max-pooling layer, and a softmax classifier forms the final layer. All hidden layers use the ReLU activation. The drawback of VGG-19 is that it is expensive to evaluate and requires a lot of memory and parameters.
In its standard ImageNet configuration, VGG-19 contains about 143.7 million parameters. The majority of these parameters (about 123.6 million) are in the fully connected layers, which are replaced in our model by an SVM classifier, greatly lowering the number of required parameters.
Fig. 1. CNN Architecture
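The split between convolutional and fully connected parameters can be checked by counting weights and biases layer by layer. The sketch below assumes the standard VGG-19 ImageNet configuration (224x224 RGB input, 1000 output classes); the constant and function names are chosen here for illustration, and the layer list is not taken from the model trained in this paper.

```python
# (in_channels, out_channels) of the 16 convolutional layers, grouped in five blocks
VGG19_CONV_BLOCKS = [
    [(3, 64), (64, 64)],
    [(64, 128), (128, 128)],
    [(128, 256), (256, 256), (256, 256), (256, 256)],
    [(256, 512), (512, 512), (512, 512), (512, 512)],
    [(512, 512), (512, 512), (512, 512), (512, 512)],
]
# (inputs, outputs) of the three fully connected layers; after five 2x2
# poolings a 224x224 input becomes a 7x7 map with 512 channels
VGG19_FC_LAYERS = [(7 * 7 * 512, 4096), (4096, 4096), (4096, 1000)]


def conv_params(c_in, c_out, k=3):
    """Weights of a k x k convolution plus one bias per output channel."""
    return k * k * c_in * c_out + c_out


def dense_params(n_in, n_out):
    """Weights of a fully connected layer plus one bias per output unit."""
    return n_in * n_out + n_out


conv_total = sum(conv_params(i, o) for block in VGG19_CONV_BLOCKS for i, o in block)
fc_total = sum(dense_params(i, o) for i, o in VGG19_FC_LAYERS)

print(conv_total)             # 20024384
print(fc_total)               # 123642856
print(conv_total + fc_total)  # 143667240
```

Under these assumptions the fully connected layers account for roughly 86% of all parameters, which is why replacing them with a lighter classifier shrinks the model so much.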
C. Inception V3
Inception V3 is a CNN-based deep learning model from Google, published in 2015 as a successor to GoogLeNet (2014). The model is 42 layers deep and has a lower error rate than its predecessors. It uses an auxiliary classifier to propagate label information through the network, factorized 7x7 convolutions, and the RMSProp optimizer. It also includes label smoothing, a kind of regularizing term added to the loss formula that keeps the network from becoming overconfident about a class, which helps prevent over-fitting.
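Label smoothing can be made concrete with a short sketch. The version below uses the common formulation in which a one-hot target y over K classes is replaced by (1 - epsilon) * y + epsilon / K before the cross-entropy loss is computed. The function names and the value epsilon = 0.1 are illustrative assumptions, not settings reported in this paper.

```python
import math


def smooth_labels(one_hot, epsilon=0.1):
    """Mix a one-hot target with the uniform distribution over the classes."""
    k = len(one_hot)
    return [(1.0 - epsilon) * y + epsilon / k for y in one_hot]


def softmax(logits):
    """Numerically stable softmax."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(target, logits):
    """Cross-entropy between a target distribution and softmax(logits)."""
    probs = softmax(logits)
    return -sum(t * math.log(p) for t, p in zip(target, probs))


hard = [0.0, 1.0]             # e.g. [normal, cataract]
soft = smooth_labels(hard)    # approximately [0.05, 0.95]
confident_logits = [-5.0, 5.0]

# An overconfident prediction is penalized more under the smoothed target
print(cross_entropy(hard, confident_logits) < cross_entropy(soft, confident_logits))  # True
```

Because the smoothed target always keeps a little probability mass on the other class, the loss can no longer be driven towards zero by pushing the logits apart indefinitely, which is the regularizing effect described above.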
V. PROPOSED MODEL ARCHITECTURE
We propose three deep learning models, namely CNN, Inception V3, and VGG-19, for cataract prediction. The model architecture summary for each model follows.
Proposed model architecture:
The confusion matrix for VGG-19, which was the best-performing model, is as follows:
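For readers who want to relate a binary confusion matrix to the reported accuracy and to the false negative errors discussed later, the following sketch computes the usual metrics from the four cell counts. The function name binary_metrics and the counts in the usage example are hypothetical and are not the results of this study.

```python
def binary_metrics(tp, fp, fn, tn):
    """Compute common metrics from the cells of a binary confusion matrix.

    tp: cataract eyes predicted as cataract
    fp: normal eyes predicted as cataract
    fn: cataract eyes predicted as normal
    tn: normal eyes predicted as normal
    """
    def ratio(num, den):
        # Avoid division by zero when a class is absent
        return num / den if den else 0.0

    return {
        "accuracy": ratio(tp + tn, tp + fp + fn + tn),
        "precision": ratio(tp, tp + fp),
        "recall": ratio(tp, tp + fn),
        "false_negative_rate": ratio(fn, tp + fn),
    }


# Hypothetical counts, for illustration only
metrics = binary_metrics(tp=45, fp=5, fn=5, tn=45)
print(metrics["accuracy"])             # 0.9
print(metrics["false_negative_rate"])  # 0.1
```

In a screening setting the false negative rate deserves particular attention, since a missed cataract case is usually more costly than a false alarm.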
Fig. 8 shows the accuracy curve of the CNN model.
Fig. 8. Accuracy Curve
Fig. 9 shows the loss curve of the Inception V3 model.
Fig. 10 shows the accuracy curve of the Inception V3 model.
Fig. 11 shows the loss curve of the VGG-19 model, which is the best-performing model.
Fig. 12. Accuracy Curve

The VGG16 neural network has a sound but simple design: a straightforward stack of convolutional and max-pooling layers placed before the final fully connected layers. As a consequence, it cannot extract extremely complex features. Inception nets, on the other hand, have inception modules made up of 1x1 filters, also referred to as pointwise convolutions, followed by convolutional layers with various filter sizes applied in parallel. This allows Inception nets to learn more intricate features.

Both VGG-19 and Inception V3 can be used to solve complicated problems. In our case, VGG-19 performs somewhat better than Inception V3.

We tested our model on the test dataset and obtained the following results.

Fig. 13. Prediction Results

VII. SCOPE OF RESEARCH
Ophthalmologists use pattern recognition to diagnose problems by looking at the eye and its surrounding tissues directly or indirectly. This study focuses on the prediction of cataract, which is one of the ocular diseases.

VIII. LIMITATIONS AND FUTURE SCOPE
Deep learning models require a large dataset to reach and reliably assess model accuracy. Dataset collection requires equipment that may not be available in local hospitals, which prevents data collection on a broader scale. Another limitation of this study is that any model must be thoroughly tested before being deployed for medical use, as the subject is extremely sensitive when dealing with people's lives.
As future work, we aim to increase model accuracy by collecting more data and to improve the model architecture with ensembling techniques, aiming to reduce the percentage of false negative errors.

IX. CONCLUSIONS
Deep learning approaches offer distinctive ways of detecting and classifying anomalies in eye images, and they hold a lot of promise for diagnosing ocular diseases efficiently. With these deep learning algorithms, decision making in the field of ocular disease becomes more powerful.
The VGG-19 model stood out with the highest accuracy among the models, giving better accuracy than CNN and Inception V3. New models that reveal and extract characteristics aiding the prognosis, diagnosis, and follow-up of ocular disorders are required for novel medical equipment used to detect eye diseases. Creating deep learning algorithms that incorporate multi-modal input remains difficult.
Despite the excellent findings, these methodologies face significant open difficulties in terms of interpretability and the incorporation of medical professionals' feedback into the models. Furthermore, the use of deep learning techniques in medical institutions has the potential to increase the number of individuals diagnosed, resulting in an improvement in the population's quality of life. To fully realize the promise of these tools, engineers and ophthalmologists must collaborate in a coordinated, multidisciplinary, patient-centered effort to reduce medical diagnosis time and costs.

REFERENCES
[1] Liu, Chenxi, Dian Jiao, and Zhe Liu. "Artificial intelligence (AI)-aided disease prediction." BIO Integration 1.3 (2020): 130-136.
[2] Li, N., Li, T., Hu, C., Wang, K., Kang, H. (2021). A Benchmark of Ocular Disease Intelligent Recognition: One Shot for Multi-disease Detection. In: Wolf, F., Gao, W. (eds) Benchmarking, Measuring, and Optimizing. Bench 2020. Lecture Notes in Computer Science, vol 12614. Springer, Cham. https://doi.org/10.1007/978-3-030-71058-3_11
[3] Karthiyayini, R., Shenbagavadivu, N. Retinal Image Analysis for Ocular Disease Prediction Using Rule Mining Algorithms. Interdiscip Sci Comput Life Sci 13, 451–462 (2021).
[4] Parampal S. Grewal, Faraz Oloumi, Uriel Rubin, Matthew T.S. Tennant, Deep learning in ophthalmology: a review, Canadian Journal of Ophthalmology, Volume 53, Issue 4, 2018, Pages 309-313, ISSN 0008-4182, https://doi.org/10.1016/j.jcjo.2018.04.019.
[5] M. T. Islam, S. A. Imran, A. Arefeen, M. Hasan and C. Shahnaz, "Source and Camera Independent Ophthalmic Disease Recognition from Fundus Image Using Neural Network," 2019 IEEE International Conference on Signal Processing, Information, Communication & Systems (SPICSCON), 2019, pp. 59-63, doi: 10.1109/SPICSCON48833.2019.9065162.
[6] Salem, H., Negm, K.R., Shams, M.Y., Elzeki, O.M. (2022). Recognition of Ocular Disease Based Optimized VGG-Net Models. In: Hassanien, A.E., Bhatnagar, R., Snášel, V., Yasin Shams, M. (eds) Medical Informatics and Bioimaging Using Artificial Intelligence. Studies in Computational Intelligence, vol 1005. Springer, Cham. https://doi.org/10.1007/978-3-030-91103-4_6
[7] D. Wang and L. Wang, "On OCT Image Classification via Deep Learning," in IEEE Photonics Journal, vol. 11, no. 5, pp. 1-14, Oct. 2019, Art no. 3900714, doi: 10.1109/JPHOT.2019.2934484.
[8] PERDOMO CHARRY, Oscar Julián and GONZALEZ, Fabio Augusto. A Systematic Review of Deep Learning Methods Applied to Ocular Images. Cienc. Ing. Neogranad 2020.
[9] Ma, Xiaojun, et al. "Study on a prediction of P2P network loan default based on the machine learning LightGBM and XGboost algorithms according to different high dimensional data cleaning." Electronic Commerce Research and Applications 31 (2018): 24-39.
[10] Singh, Vishal, et al. "Prediction of Modernized Loan Approval System Based on Machine Learning Approach." 2021 International Conference on Intelligent Technologies (CONIT). IEEE, 2021.
[11] A. Soni and A. Rai, "A Novel Approach for the Early Recognition of Diabetic Retinopathy using Machine Learning," 2021 International Conference on Computer Communication and Informatics (ICCCI), 2021, pp. 1-5, doi: 10.1109/ICCCI50826.2021.9402566.
[12] A. Pandey, P. Patre and J. Minj, "Detection of Glaucoma Disease using Image Processing, Soft Computing and Deep Learning Approaches," 2020 Fourth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC), 2020, pp. 1-7, doi: 10.1109/I-SMAC49090.2020.9243596.
[13] M. T. Islam, S. A. Imran, A. Arefeen, M. Hasan and C. Shahnaz, "Source and Camera Independent Ophthalmic Disease Recognition from Fundus Image Using Neural Network," 2019 IEEE International Conference on Signal Processing, Information, Communication & Systems (SPICSCON), 2019, pp. 59-63, doi: 10.1109/SPICSCON48833.2019.9065162.
[14] A. Saxena, A. Vyas, L. Parashar and U. Singh, "A Glaucoma Detection using Convolutional Neural Network," 2020 International Conference on Electronics and Sustainable Communication Systems (ICESC), 2020, pp. 815-820, doi: 10.1109/ICESC48915.2020.9155930.
[15] K. Prasad, P. S. Sajith, M. Neema, L. Madhu and P. N. Priya, "Multiple eye disease detection using Deep Neural Network," TENCON 2019 - 2019 IEEE Region 10 Conference (TENCON), 2019, pp. 2148-2153, doi: 10.1109/TENCON.2019.8929666.
[16] Yaroub Elloumi, Mohamed Akil, Henda Boudegga, "Ocular diseases diagnosis in fundus images using a deep learning: approaches, tools and performance evaluation," Proc. SPIE 10996, Real-Time Image Processing and Deep Learning 2019, 109960T (14 May 2019); https://doi.org/10.1117/12.2519098.
[17] Simi Sanya, M Seetha, 2021, Detection of Ocular Diseases using Ensemble of Deep Learning Models, INTERNATIONAL JOURNAL OF ENGINEERING RESEARCH & TECHNOLOGY (IJERT) Volume 10, Issue 09 (September 2021)
[18] Tayal, A., Gupta, J., Solanki, A. et al. DL-CNN-based approach with image processing techniques for diagnosis of retinal diseases. Multimedia Systems (2021). https://doi.org/10.1007/s00530-021-00769-7
[19] Sarki, R., Ahmed, K., Wang, H., Michalska, S., & Zhang, Y. (2020). Early detection of diabetic eye disease from fundus images with deep learning. Lecture Notes in Computer Science, 234–241. https://doi.org/10.1007/978-3-030-39469-1_20