Mayur Final Merged
Mayur Final Merged
Project Report on
BACHELOR OF ENGINEERING
in
CERTIFICATE
This is to certify that the project work entitled “GLAUCOMA DETECTION USING MACHINE
External Viva
Name of the Examiners Signature with Date
1)
2)
MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING
Accredited by NAAC with A+ Grade, An ISO 9001: 2015 Certified Institution
(A Unit of Rajalaxmi Education Trust)
Affiliated to Visvesvaraya Technological University, Belagavi
Badaga Mijar, Moodabidri – 574 225 , Karnataka
DECLARATION
I MAYUR S KARKERA (4MT20IS020) student of 8th semester BE in Information Science &
Engineering, Mangalore Institute of Technology and Engineering, Moodabidri, hereby declare
that the project work entitled “GLAUCOMA DETECTION USING MACHINE LEARNING
WITH OCT”, submitted to the Visvesvaraya Technological University, Belagavi during the
academic year 2023-24, is a record of an original work done by us under the guidance of Mr. Sharath
Kumar, Assistant Professor, Department of Information Science & Engineering , Mangalore
Institute of Technology and Engineering Moodabidri. This project work is submitted in partial
fulfillment of the requirements for the award of the degree of Bachelor of Engineering in Information
Science & Engineering. The results embodied in this report have not been submitted to any other
University or Institute for the award of any degree.
Place: Moodabidri
ABSTRACT
This project is dedicated to advancing the diagnosis of glaucoma and its primary sub-conditions:
Choroidal Neovascularization (CNV), Diabetic Macular Edema (DME), and Drusen, recognizing
their significant impact on vision loss worldwide. Glaucoma, often termed the "silent thief of sight,"
leads to progressive optic nerve damage and irreversible vision loss. While elevated intraocular
pressure (IOP) stands as a primary cause, not all cases stem from high IOP, emphasizing the
multifactorial nature of its development, involving genetics, age, race, and family history. This
necessitates a comprehensive approach for early detection and management to mitigate the disease's
silent progression. To address the urgent need for early intervention, this project employs cutting-
edge machine learning and image analysis techniques, complementing traditional diagnostic
methods like tonometry and ophthalmoscopy. By leveraging these advanced technologies, the
project aims to enhance early detection capabilities, allowing for tailored approaches to diagnose
and manage glaucoma and its sub conditions. Early identification is critical due to the insidious
nature of glaucoma, often advancing unnoticed until irreversible damage occurs. Prioritizing early
intervention not only slows disease progression but also safeguards the quality of life for affected
individuals, emphasizing the importance of personalized diagnostic and treatment strategies for
various complications and subtypes of glaucoma
i
ACKNOWLEDGEMENT
The satisfaction and the successful completion of this project would be incomplete without the
mention of the people who made it possible, whose constant guidance encouragement crowned my
efforts with success.
This project is made under the guidance of Mr. Sharath Kumar, Assistant Professor, Department
of Information Science & Engineering. I would like to express my sincere gratitude to my guide
for all the helping hand and guidance in this project.
I would like to thank my project coordinator Mr. Narendra UP, Associate Professor,
Department of Information Science & Engineering for his cordial support, valuable information
and guidance, which helped me in completing this project through the various stages.
I would like to thank our Principal Dr. Prashanth C M, for encouraging me and giving me an
opportunity to accomplish the project.
I also thank our management who helped me directly or indirectly in the completion of this project.
My special thanks to faculty members and others for their constant help and support.
Above all, I extend my sincere gratitude to our parents and friends for their constant
encouragement with moral support
MAYUR S KARKERA
ii
TABLE OF CONTENTS
CONTENTS PAGE
NO.
ABSTRACT i
ACKNOWLEDGEMENT ii
TABLE OF CONTENTS iii
LIST OF TABLES vi
LIST OF FIGURES vii
iii
3.2 Specific Requirements 9
3.2.1 Hardware Requirements 9
3.2.2 Software Requirements 10
3.2.3 Functional Requirements 12
3.2.4 Non-functional Requirements 13
4 GANTT CHART 15
5 SYSTEM DESIGN 16
5.1 Architectural Diagram 16
5.2 Use Case Diagram and Description 17
5.3 Sequence Diagram 17
5.4 Activity Diagram 18
5.5 Data flow Diagram 19
6 IMPLEMENTATION 21
6.1 System implementation 21
6.2 Workflow 22
7 TESTING 25
7.1 Introduction 25
7.2 Testing Approaches 25
7.3 Test Levels 26
7.4 Evaluation Metrics 26
7.5 Test Cases 27
8 RESULTS AND SNAPSHOTS 29
9 CONCLUSION AND FUTURE WORK 32
9.1 Conclusion 32
9.2 Future Work 33
iv
REFERENCES 34
PAPERS PRESENTED 35
v
LIST OF TABLES
vi
LIST OF FIGURES
vii
Chapter – 1
INTRODUCTION
GLAUCOMA DETECTION USING MACHINE LEARNING WITH OCT
CHAPTER 1
INTRODUCTION
In this chapter, we are going to give introduction about our project titled as Lightning
Prediction and Alert System using deep learning models .
1.1 Introduction
Glaucoma, often referred to as the "silent thief of sight," is a prevalent and potentially
devastating eye disease characterized by the progressive damage to the optic nerve, leading to
irreversible vision loss. It is estimated that over 80 million individuals worldwide are affected
by this condition, making it one of the leading causes of blindness. While glaucoma primarily
affects the elderly population, it can also manifest in individuals of all ages, making its early
detection and management of paramount importance. This project is dedicated to the
development and enhancement of diagnostic methods for glaucoma, with a specific focus on
the identification of its primary sub-conditions: Choroidal Neovascularization (CNV),
Diabetic Macular Edema (DME), and Drusen. To fully comprehend the significance of these
sub-conditions and the urgency for their early detection, we must delve into the causes and
symptoms of glaucoma itself. he primary cause of glaucoma is elevated intraocular pressure
(IOP), a condition where the fluid within the eye, known as aqueous humor, is not adequately
drained. This leads to increased pressure within the eye, which, over time, damages the optic
nerve. Elevated IOP is a major risk factor, but it is essential to recognize that not all individuals
with glaucoma have high IOP, and not all individuals with high IOP develop glaucoma
1.3 Objectives
1.4 Scope
The scope of this project involves the utilization of machine learning (ML) algorithms in
conjunction with Optical Coherence Tomography (OCT) technology to innovate glaucoma
detection methods. By employing ML and deep learning techniques, including convolutional
neural networks (CNNs) and Random Forest algorithms, the project aims to enhance the
accuracy and efficiency of glaucoma diagnosis. Through the analysis of OCT images and
extraction of relevant features indicative of glaucomatous damage, the AI system seeks to
provide clinicians with precise and timely identification of the disease. Additionally, the
project emphasizes the integration of ML models into user-friendly interfaces accessible via
web and mobile platforms, enabling seamless analysis of OCT scans for real-time detection
and monitoring of glaucoma progression. Collaboration with ophthalmologists and
researchers ensures the validation and refinement of the AI models, fostering responsible
and effective utilization of technology for improved patient care and management of
glaucoma.
Chapter 1 Starts by introducing the topic, which gives an insight and in-depth analysis of
the project work, which was closely followed by the problem statement which tells about the
problems faced by users, followed by the objectives of the topic, which gave rise to the
significance of the topic, followed by the scope and crowned by the organization of
theproject.
Chapter 2 Deals with the literature review, Existing system, Limitations of existing system
and proposed system.
Chapter 5 Which has Architectural diagram, Class diagram, Use case diagram, Sequence
diagram, Dataflow diagram.
CHAPTER 2
LITERATURE REVIEW
A literature survey or a literature review in a project report is that section which shows the
various analysis and research made in the field of our interest and the results already published,
taking into account the various parameters and the extent of the project.
In a paper by Jie Wang, In this paper researchers have made significant progress in developing
a deep learning algorithm for diagnosing and segmenting choroidal neovascularization (CNV)
in OCTA images. This novel algorithm has demonstrated exceptional accuracy in both tasks,
paving the way for its potential clinical application and improving the diagnosis and treatment
of CNV. By automating the process of CNV identification and segmentation, this deep learning
method can significantly reduce the workload of ophthalmologists and enhance the quality of
patient care. Additionally, the ability of this algorithm to accurately quantify CNV lesions
provides valuable insights for treatment planning and monitoring disease progression, further
optimizing patient outcomes. While further research is warranted to refine the algorithm and
evaluate its performance in real-world clinical settings, this deep learning approach holds
immense promise for revolutionizing CNV diagnosis and management.
In a paper by Filippo Arcado,This paper is proposed to develop deep learning (DL) models for
the automatic detection of optical coherence tomography (OCT) measures of diabetic macular
thickening (MT) from color fundus photographs (CFPs). Retrospective analysis on 17,997
CFPs and their associated OCT measurements from the phase 3 RIDE/RISE diabetic macular
edema (DME) studies. DL with transfer learning cascade was applied on CFPs to predict time-
domain OCT (TD-OCT)– equivalent measures of MT, including central subfield thickness
(CST) and central foveal thickness (CFT). MT was defined by using two OCT cutoff points:250
μm and 400 μm. A DL regression model was developed to directly quantify the actual CFT and
CST from CFPs. DL is capable of predicting key quantitative TD-OCT measurements related
to MT from CFPs. The DL models presented here could enhance the efficiency of DME
diagnosis in teleophthalmology programs, promoting better visual outcomes. Future research is
needed to validate DL algorithms for MT in the real-world.
While previous efforts have explored automated glaucoma detection using OCT images,
achieving accurate classification of glaucoma subtypes remains a challenge. This project builds
upon these advancements by incorporating a transfer learning approach. We'll utilize a pre-
trained Convolutional Neural Network (CNN) called VGG-19. This powerful model, trained
Dept. of ISE, MITE, Moodabidri Page 5
GLAUCOMA DETECTION USING MACHINE LEARNING WITH OCT
on a vast dataset of general images, possesses valuable image recognition skills. By leveraging
VGG-19's pre-trained knowledge as a foundation, we aim to develop a more efficient system.
During "fine-tuning," we'll adjust VGG-19's weights to focus on recognizing specific features
within OCT scans that differentiate drusen, CNV, DME, and normal retinas.It is also important
to acknowledge the limitations associated with transfer learning. The pre-trained model might
have inherent biases based on the original dataset (ImageNet), potentially requiring extensive
fine-tuning with a large and diverse collection of OCT images to achieve optimal performance
for glaucoma subclassification
Existing VGG-19-based systems for glaucoma subtyping suffer from biases inherited from
general image datasets (like ImageNet) and limited generalizability to medical image analysis.
Imagine training a detective on fashion photos to find criminals - they might miss key details.
Similarly, VGG-19 might miss crucial features in OCT scans. Additionally, its general object
recognition skills might not translate well to the subtle variations in OCT scans needed for
glaucoma subtyping.
CHAPTER 3
This document outlines the functionalities and considerations for a machine learning system
designed to detect glaucoma using Optical Coherence Tomography (OCT) images. OCT is a
non-invasive imaging technique that captures detailed cross-sectional images of the retina. This
system will utilize machine learning algorithms to analyze these OCT images and identify
potential signs of glaucoma, a leading cause of blindness.
● Improve diagnostic accuracy: Machine learning can assist in identifying subtle glaucomatous
features in OCT scans, potentially leading to earlier and more accurate diagnoses.
● Enhance efficiency: Automated analysis of OCT images can streamline the diagnostic
workflow, allowing doctors to dedicate more time to patient care and treatment planning.
● Support objective assessment: Machine learning models can provide a more consistent and
The core functionality of this system is the automated detection of glaucoma in OCT images
and data type. This may involve resizing, normalization, and other image manipulation
techniques.
● Feature Extraction: Machine learning algorithms will extract relevant features from the
preprocessed OCT images. These features could include measurements of retinal nerve fiber
layer thickness, cup-to-disk ratio, and other parameters indicative of glaucomatous changes.
● Model Training: The system will be trained on a large dataset of labeled OCT images.
Each image will be labeled as either positive (glaucoma present) or negative (glaucoma absent).
This training process allows the machine learning model to learn the complex relationships
● Image Classification: Once trained, the system can be used to analyze new, unseen OCT
images. The model will classify each image as either positive or negative for glaucoma based
● Retinal specialists: Retinal specialists can utilize the system for a more objective
● Glaucoma researchers: Researchers can leverage the system to develop and evaluate
• Data Availability and Quality: The performance of the machine learning system heavily
relies on the quality and quantity of training data. Access to a large dataset of labeled OCT
deep learning models, require significant computational power. Access to powerful computing
ensuring the interpretability of their results is important. Understanding how the model arrives
ophthalmologists.
• Ophthalmologists using the system will have a basic understanding of machine learning
• The system will have access to a secure and reliable storage system for medical images
This section includes the detailed description about the hardware requirements, software
requirements, functional requirements and non-functional requirements.
1) Python
2) Flask
Flask is a micro web framework written in Python, designed to make web development quick
and easy while providing the flexibility to scale up to complex applications. It follows a
minimalistic approach, allowing developers to get started quickly without imposing strict
conventions. At the core of Flask are routes, which map URL patterns to Python functions,
known as views or endpoints. These views handle HTTP requests and generate responses,
allowing developers to define what happens when a user visits a specific URL. Flask uses Jinja2
templating engine to render HTML templates, separating the presentation logic from the
application logic. Request and response objects provide access to request data and allow
customization of HTTP responses. Flask applications can be configured using configuration
variables to control various aspects such as debug mode and database settings. Error handling
mechanisms handle exceptions and errors during request processing, and Flask's rich ecosystem
of extensions adds additional functionality like database integration, authentication, and form
validation. With its simplicity, flexibility, and extensibility, Flask remains a popular choice for
web development in Python.
Visual Studio includes a code editor supporting IntelliSense as well as code refactoring. The
integrated debugger works both as a source-level debugger and a machine-level debugger.
Other built-in tools include a code profiler, designer for building GUI applications, web
designer, class designer, and database schema designer. It accepts plug-ins that expand the
functionality at almost every level—including adding support for source control systems (like
Subversion and Git) and adding new toolsets like editors and visual designers for domain-
specific languages or toolsets for other aspects of the software development lifecycle (like the
Azure DevOps client: Team Explorer). The most basic edition of Visual Studio, the Community
edition, is available free of charge. The slogan for Visual Studio Community edition is "Free,
fully-featured IDE for students, open-source and individual developers". As of March 2021, the
current production-ready Visual Studio version was 2019, with older versions such as 2012 and
2013 on Extended Support, and 2015 and 2017 on Mainstream Support On April 12, 2010,
Microsoft released Visual Studio 2010, codenamed Dev10, and .NET Framework 4.
4) HTML
Hypertext Markup Language (HTML) is the standard markup language for creating web pages
and web applications. With Cascading Style Sheets (CSS) and JavaScript, it forms a triad of
cornerstone technologies for the World Wide Web.
Web browsers receive HTML documents from a web server or from local storage and render
the documents into multimedia web pages. HTML describes the structure of a web page
semantically and originally included cues for the appearance of the document.HTML elements
are the building blocks of HTML pages. With HTML constructs, images and other objects such
as interactive forms may be embedded into the rendered page. HTML provides a means to
create structured documents by denoting structural semantics for text such as headings,
paragraphs, lists, links, quotes and other items. HTML elements are delineated by tags, written
using angle brackets. Tags such as <img /> and <input /> directly introduce content into the
page. Other tags such as <p> surround and provide information about document text and may
include other tags as sub-elements. Browsers do not display the HTML tags, but use them to
interpret the content of the page. HTML can embed programs written in a scripting language
such as JavaScript, which affects the behavior and content of web pages. Inclusion of CSS
defines the look and layout of content. The World Wide Web
Consortium (W3C), maintainer of both the HTML and the CSS standards, has encouraged the
use of CSS over explicit presentational HTML since 1997.
5) JavaScript
Windows XP/7analyses the performance impact of visual effects and uses this to determine
whether to enable them, so as to prevent the new functionality from consuming excessive
additional processing overhead. Users can further customize these settings. Some newer video
cards. However, if the video card in not capable of hardware alpha blending, performance can
be substantially degraded, and Microsoft recommends the feature should be turned off
manually. Windows XP/7 added the ability for windows to use “Visual Styles” to change the
appearance of the user interface.
• Data Acquisition Module: This module facilitates the import of OCT images from
various sources, ensuring compatibility with different imaging devices and file formats.
images to ensure consistency and improve analysis accuracy. Preprocessing techniques might
• Feature Extraction Module: This module extracts relevant features from the
preprocessed OCT images. These features could include retinal nerve fiber layer thickness
• Machine Learning Module: This core module houses the machine learning model
responsible for analyzing extracted features and classifying OCT images. The system can be
designed to support various machine learning algorithms depending on project goals and data
availability.
learning model, this module classifies OCT images as either positive (glaucoma present) or
• Result Reporting Module: This module generates reports summarizing the system's
analysis of OCT images. These reports can be tailored to provide ophthalmologists with
images for glaucoma detection. This translates to minimizing both false positives (healthy eyes
identified as glaucomatous) and false negatives (glaucomatous eyes missed by the system).
• Performance: The system should exhibit efficient processing times for analyzing OCT
volume and user base. This might involve implementing modular architecture and leveraging
• Security: The system must adhere to strict security protocols to safeguard sensitive
• Usability: The user interface should be intuitive and user-friendly for ophthalmologists
CHAPTER 4
GANTT CHART
A Gantt chart is a type of bar chart, developed by Henry Gantt that illustrates a project schedule.
Gantt charts illustrate the start and finish of the terminal elements and summary elements of the
project. Terminal elements and summary elements comprise the work breakdown structure of
the project.
The following is the Gantt chart of the project “LIGHTNING PREDICTION AND ALERT
SYSTEM”
CHAPTER 5
SYSTEM DESIGN
5.1 Architectural Diagram
A use case is a set of scenarios that describing an interaction between a source and a
destination. The two main components of a use case diagram are use cases and actors. It
displays the relationship among them. A use case diagram can identify the different types of
users of a system and the different use cases and will often be accompanied by other types of
diagrams as well. The use cases are represented by either circles or ellipses. Use case diagram
of this system is as shown below.
A Sequence diagram shows how a set of objects communicate with each other to perform a
Complex task. This type of diagram allows the other developer to verify that the interaction
is correct. A Sequence diagram shows parallel vertical lines, different processes or objects
that live simultaneously and as horizontal arrows, the messages exchanged between them,
in the
order in which they occur. This allows the specification of simple runtime scenarios in a
Graphical manner.
The Fig. 5.3 represents the entire workflow of the proposed system. It shows the sequence in
which the system actually works.
Activity diagram is an important diagram in UML to describe the dynamic aspects of the
system. Activity diagram is basically a flowchart to represent the flow from one activity to
another activity. The activity can be described as an operation of the system.
The components used in this are as follows: Rounded Rectangle: It indicates the process. Arrow: It
indicates a transition line.
Rhombus: It indicates the decision.
Solid Circle: It represents the initial state of workflow.
CHAPTER 6
IMPLEMENTATION
Introduction
A crucial phase in the system development life cycle is successful implementation of new
system design. Implementations simply mean converting new system design into operation.
The term implementation has different meanings, ranging from the conversion of a basic
application to a complete replacement of computer system. Implementation is used here to
mean the process of converting a new or revised system design into an operational one.
to extract more features about the input parameters, it is necessary to obtain more complex
feature correlation values from low-level convolutional layers through multi-level cascading.
6.2 Workflow
The following steps are carried out to implement the proposed system.
6.2.1 Pre-processing
Data Pre-processing is a technique that is used to convert the raw data into a clean data set. In
other words, whenever the data is gathered from different sources it is collected in raw format
which is not feasible for the analysis.
Dataset is pre-processed to resize the dataset images. We created our own dataset from a
lightning video and created frames and images are resized to 150*150 before processing them
further.
6.2.3 Training
The process of training an ML model involves providing an ML algorithm (that is, the learning
algorithm) with training data to learn from. The term ML model refers to the model artifact
that is created by the training process. The training data must contain the correct answer, which
is known as a target or target attribute. The learning algorithm finds patterns in the training
data that map the input data attributes to the target (the answer that you want to predict), and
it outputs an ML model that captures these patterns.
Pickling is a process in which model is stored in a file for future use. Before pickling model
must be trained well and optimized to maximum extent possible because after picking model,
• Exporting Dataset
• Custom Training
CHAPTER 7
TESTING
7.1 INTRODUCTION
Testing is a crucial stage in the development of a machine learning system for glaucoma
detection using OCT images. It ensures the system functions as intended, delivers accurate
results, and meets quality standards. There are various testing methodologies employed to
● Unit Testing: Individual components of the machine learning system, such as modules
responsible for image preprocessing or feature extraction, are tested in isolation to ensure they
function as designed.
● Integration Testing: Once individual components are validated, they are integrated into
a cohesive system. Integration testing verifies if these components interact seamlessly and
● System Testing: The complete system, including the user interface and interaction with
OCT image data, is rigorously evaluated to ensure it adheres to overall system requirements
Beyond functional testing, the system's performance is assessed using specific metrics:
• Accuracy: This metric measures the system's ability to correctly classify OCT images,
• Sensitivity: This metric indicates the system's ability to correctly identify glaucoma cases
(positive predictive value). A high sensitivity minimizes false negatives (missed glaucoma
cases).
• Specificity: This metric reflects the system's ability to correctly identify healthy eyes
(negative predictive value). A high specificity minimizes false positives (healthy eyes
identified as glaucoma).
• Precision: This metric represents the proportion of positive test results that are truly
positive (true positives divided by the sum of true positives and false positives).
• Recall: This metric represents the proportion of positive cases that are correctly identified
(true positives divided by the sum of true positives and false negatives).
These test cases focus on evaluating the core functionality of the system: uploading an OCT
● Action: User uploads a valid OCT image file (e.g., JPG, PNG)
● Expected Output:
○ System processes the image and displays the glaucoma detection prediction (e.g.,
● Action: User uploads a file with an unsupported format (e.g., DOCX, PDF)
● Expected Output:
○ System displays an error message indicating the file format is not supported.
● Expected Output:
● Action: User clicks the upload button without selecting an image file
● Expected Output:
○ System displays an error message prompting the user to select an image file.
CHAPTER 8
CHAPTER 9
CONCLUSION
FUTURE WORK
This project lays a strong foundation for further advancements in glaucoma detection using
machine learning. Here are some key areas for future exploration:
● Real-time Analysis: Transitioning the system from an offline analysis tool to a real-time
system would significantly enhance its clinical utility. This would involve optimizing
algorithms and system architecture for efficient processing of OCT images during patient
consultations.
● Integration with Diagnostic Devices: Integrating the system with ophthalmic imaging
devices could streamline the diagnostic workflow. This would allow for seamless transfer of
OCT images directly from the device for immediate analysis by the system, potentially reducing
analysis time and improving overall efficiency.
REFERENCES
1) https://tvst.arvojournals.org/solr/searchresults.aspx?author=Jie+Wang
2) https://iovs.arvojournals.org/solr/searchresults.aspx?author=Filippo
3) https://www.mdpi.com/2079-9292/9/10/1617
4) https://pubmed.ncbi.nlm.nih.gov/37565126/
5) https://doi.org/10.3390/s22062342
6) https://doi.org/10.1016/j.jksuci.2021.12.002
7) https://doi.org/10.3390/healthcare11020212
8) https://rdcu.be/dCUsd
DOI IJARCCE/10.17148
Abstract: Glaucom Detection is devoted to advancing the diagnosis of glaucoma and its primary sub-conditions:
Choroidal Neovascularization (CNV), Diabetic Macular Edema (DME), and Drusen, recognizing their significant impact
on global vision loss. Glaucoma, often termed the "silent thief of sight," leads to progressive optic nerve damage and
irreversible vision loss. While elevated intraocular pressure (IOP) is a primary cause, not all cases stem from high IOP,
underscoring the multifactorial nature of its development involving genetics, age, race, and family history. This
complexity necessitates a comprehensive approach for early detection and management to counteract the disease's silent
progression.To meet the pressing need for early intervention, this project integrates cutting-edge machine learning and
image analysis techniques alongside traditional diagnostic methods like tonometry and ophthalmoscopy. By harnessing
these advanced technologies, the project aims to enhance early detection capabilities, facilitating tailored approaches to
diagnose and manage glaucoma and its sub-conditions effectively. Early identification is paramount due to the insidious
nature of glaucoma, which often advances unnoticed until irreversible damage occurs. Prioritizing early intervention not
only decelerates disease progression but also safeguards the quality of life for affected individuals, underscoring the
significance of personalized diagnostic and treatment strategies for various complications and subtypes of glaucoma.
This holistic approach seeks to revolutionize glaucoma management by integrating state-of-the-art technology with
established diagnostic methods, ultimately improving outcomes for patients worldwide..
Keywords: Glaucoma diagnosis Choroidal Neovascularization (CNV) Diabetic Macular Edema (DME) Drusen Early
detection Machine learning Image analysis.
I. INTRODUCTION
Glaucoma, often referred to as the "silent thief of sight," is a prevalent and potentially devastating eye disease
characterized by the progressive damage to the optic nerve, leading to irreversible vision loss. It is estimated that over 80
million individuals worldwide are affected by this condition, making it one of the leading causes of blindness. While
glaucoma primarily affects the elderly population, it can also manifest in individuals of all ages, making its early detection
and management of paramount importance. This project is dedicated to the development and enhancement of diagnostic
methods for glaucoma, with a specific focus on the identification of its primary sub-conditions: Choroidal
Neovascularization (CNV), Diabetic Macular Edema (DME), and Drusen.
To fully comprehend the significance of these sub-conditions and the urgency for their early detection, we must delve
into the causes and symptoms of glaucoma itself.he primary cause of glaucoma is elevated intraocular pressure (IOP), a
condition where the fluid within the eye, known as aqueous humor, is not adequately drained. This leads to increased
pressure within the eye, which, over time, damages the optic nerve.
Elevated IOP is a major risk factor, but it is essential to recognize that not all individuals with glaucoma have high IOP,
and not all individuals with high IOP develop glaucoma. Additionally, other factors such as genetics, family history, age,
and race can play a role in glaucoma's development.While traditional diagnostic methods such as tonometry and
ophthalmoscopy are valuable, this project aims to employ state-of-the-art machine learning and image analysis techniques
to enhance the early detection of glaucoma and its sub conditions, CNV, DME, and Drusen. By advancing our
understanding and diagnostic capabilities, we endeavor to contribute to improved patient care, early intervention, and the
preservation of vision The project's emphasis on early detection is crucial because glaucoma and its sub-conditions often
progress silently, causing irreversible damage before noticeable symptoms manifest.
© IJARCCE This work is licensed under a Creative Commons Attribution 4.0 International License 729
IJARCCE ISSN (O) 2278-1021, ISSN (P) 2319-5940
In [1] Jie Wang, et.al., Researchers developed a deep learning algorithm for diagnosing and segmenting choroidal
neovascularization (CNV) in OCTA images. This algorithm showed high accuracy, potentially improving CNV diagnosis
and treatment. It can automate CNV analysis, reducing workload for ophthalmologists and improving patient care.
In [2] Filippo Arcadu, et.al., This study used deep learning to analyze retinal photographs (CFPs) to predict diabetic
macular thickening (MT) normally measured with retinal scans (OCT). The models achieved high accuracy and could
improve diagnosis efficiency in telemedicine. However, real-world validation is needed.
In [3] Quang T.M.Pham, et.al., This paper proposes a new deep learning model for segmenting drusen, deposits linked
to age-related macular degeneration (AMD). Unlike prior methods using single-scale analysis, this model incorporates
both global and local image information.This research shows promise for applying deep learning in AMD diagnosis.
In [4] Rakesh Patel, et.al., This study shows a deep learning model can effectively detect drusen, CNV, and DME in OCT
images. This offers a more standardized approach to diagnosing AMD and DR. More research is needed before
widespread use, but this technology has promise to improve patient care, especially in areas with limited access to
specialists.
In [5] Ibrahim Yasser, et.al., This is an article about automated diagnosis of Optical Coherence Tomography Angiography
(OCTA). It discusses using machine learning techniques to diagnose retinal diseases. The article proposes a new method
to classify OCTA images based on convolutional neural networks (CNNs). The method involves dividing the OCTA
image into smaller pieces and cropping a specific region around the fovea.
In [6] Pawan Kumar Upadhyay, et.al.,Researchers have developed a new and powerful neural network for analyzing
retinal diseases using optical coherence tomography (OCT) images. This network can effectively classify four distinct
retinal conditions: neovascularization, diabetic macular edema, drusen, and normal.
In [7] Amit Choudhary, et.al.,This study utilizes a customized 19-layer VGG-19 deep learning model to automatically
detect retinal diseases from OCT images. Trained on a large public dataset, the model classifies four conditions: choroidal
neovascularization, drusen, diabetic macular edema, and normal.
In [8] Zhenhua Wang , et.al., This is an article about automated diagnosis of Optical Coherence Tomography
Angiography (OCTA). It discusses using machine learning techniques to diagnose retinal diseases. The article proposes
a new method to classify OCTA images based on convolutional neural networks (CNNs). The method involves dividing
the OCTA image into smaller pieces and cropping a specific region.
In [9] Nawaaz A Nathoo, et.al., This is an article about measuring drusen load to predict age-related macular degeneration
(AMD). It discusses a study that used spectral-domain optical coherence tomography (SD OCT) to measure drusen load
in patients with non-neovascular AMD. The study found that drusen load was associated with development of RPE
atrophy and neovascular AMD.
© IJARCCE This work is licensed under a Creative Commons Attribution 4.0 International License 730
IJARCCE ISSN (O) 2278-1021, ISSN (P) 2319-5940
Proposed system
Building upon advancements in automated glaucoma detection with OCT images, this project seeks a more accurate
solution for classifying glaucoma subtypes. While existing systems utilize transfer learning with models like VGG-19,
these approaches can be limited by biases inherent in the pre-trained datasets used for general image recognition. To
address this, we propose a novel approach using YOLOv8, a state-of-the-art object detection model built on the CNN
architecture. Unlike VGG-19, YOLOv8 is specifically designed for object detection tasks, making it a more natural fit
for identifying crucial features within OCT scans, such as the optic nerve and retinal layers. Additionally, by training
YOLOv8 from scratch on a specifically curated dataset of OCT images labeled for glaucoma, CNV, DME, Drusen, and
normal retinas, we aim to minimize potential biases and achieve superior accuracy in classifying these glaucoma
subtypes. This targeted approach using YOLOv8 has the potential to significantly improve early detection and
subclassification compared to previous methods that rely on transfer learning.
© IJARCCE This work is licensed under a Creative Commons Attribution 4.0 International License 731
IJARCCE ISSN (O) 2278-1021, ISSN (P) 2319-5940
A formal explanation of a system is provided by an architectural explanation, which is arranged to support reasoning
regarding the system's structure, individual components' properties that can be observed from the outside, and the
interactions between them. It also offers a framework from which systems can be developed and products acquired that
will cooperate to implement the system as a whole.
IV. CONCLUSION
In conclusion, this project ventured beyond the limitations of transfer learning approaches to explore the potential of
YOLOv8, a state-of-the-art object detection model built upon the robust CNN architecture. This shift towards a model
specifically designed for object detection tasks addresses the inherent biases that can plague pre-trained models used in
transfer learning. By leveraging YOLOv8's object detection prowess, we aim to develop a system that can meticulously
identify and differentiate the subtle features within OCT scans that distinguish between glaucoma, CNV, DME, Drusen,
and normal retinas. This targeted approach, fueled by a meticulously curated training dataset of labeled OCT images, has
the potential to significantly improve early detection and accurate classification of glaucoma subtypes compared to
existing methods. The potential benefits of this project extend far beyond improved classification accuracy. Earlier
diagnosis of glaucoma and its subtypes, enabled by this system, can lead to timely intervention and potentially vision
preservation for patients. Additionally, YOLOv8's ability to provide a more consistent and objective assessment
compared to traditional methods holds promise for enhanced diagnostic accuracy. Furthermore, streamlined workflows
for ophthalmologists achieved through automated analysis can translate into more time dedicated to patient care and
treatment planning. Ultimately, this project aspires to empower patients by enabling earlier detection, which allows them
to take a proactive stance in managing their eye health. By harnessing the power of deep learning and artificial
intelligence, this project lays the groundwork for a transformative future in glaucoma detection. A future where the
process becomes more efficient, accurate, and ultimately leads to better patient outcomes for those facing the challenges
of glaucoma. This project represents a significant step towards a brighter future for countless individuals whose vision is
threatened by this disease.
REFERENCES
[1] https://tvst.arvojournals.org/solr/searchresults.aspx?author=Jie+Wang
[2] https://iovs.arvojournals.org/solr/searchresults.aspx?author=Filippo+ Arcadu
[3] https://www.mdpi.com/2079-9292/9/10/1617
[4] https://pubmed.ncbi.nlm.nih.gov/37565126/
[5] https://doi.org/10.3390/s22062342
[6]https://doi.org/10.1016/j.jksuci.2021.12.002
[7]https://doi.org/10.3390/healthcare11020212
[8]https://rdcu.be/dCUsd
[9]10.1016/j.ajo.2014.06.021
© IJARCCE This work is licensed under a Creative Commons Attribution 4.0 International License 732