Proposal

This document provides a table of contents for a research paper on diabetes diagnosis using machine learning. The table of contents outlines the introduction, background, problem statement, literature review discussing previous work and research gaps, the research aim and objectives, and proposed methodology which will involve data pre-processing, classification, and evaluation. Key points covered include the global burden of diabetes, complications from diabetes, using machine learning for predictive analysis and decision support in healthcare, and applying techniques like neural networks and deep learning to identify diabetes from complex data sources.

Uploaded by

abunishan


Contents

1.0 Introduction
2.0 Background of the study
3.0 Problem statement
4.0 Literature review
    4.1 Research gap
5.0 Research Aim and objectives
6.0 Proposed methodology
    6.1 Implementation of proposed methodology
        6.1.1 Pre-processing
        6.1.2 Classification
        6.1.3 Evaluation
References

1.0 Introduction:

Diabetes, also known as diabetes mellitus, is a group of metabolic disorders characterized by high blood sugar levels over a prolonged period. Its direct sign is high blood sugar, accompanied by symptoms that include frequent urination, increased thirst, increased hunger, and weight loss. Patients with diabetes usually need continuous treatment; otherwise, the disease can lead to many life-threatening complications (Wei et al., 2018). Recent progress in biological and medical technologies has produced huge volumes of biological and physiological data. Learning from these data advances the understanding of human health and disease.

Earlier studies have approached the problem of diabetes diagnosis using machine learning. Diabetes results from either inadequate insulin production by the pancreas or ineffective use of insulin by the body's cells, but its specific cause is still unknown. Machine learning is especially useful for finding patterns in large volumes of data and for predictive analysis. It also extends decision support tools for risk management and alerts, which can improve patient safety and healthcare quality. To reduce healthcare expenditure and move toward personalized treatment, the healthcare industry must overcome challenges in electronic record management, information integration, computer-aided analysis, and early diagnosis of disease. Machine learning offers a variety of accurate tools, approaches, and systems to address these challenges.

We therefore conduct a thorough analysis of the most widely used artificial neural network and deep learning algorithms for identifying diabetes, together with data preprocessing techniques applied to genomic databases, which show great promise in extracting features and learning patterns from complex data.

2.0 Background of the study:

The global diabetes burden is projected to increase from 380 million people in 2013
to 590 million by 2035. Patients living with diabetes have a higher risk for acute and long-
term complications, such as hyperglycemia, nervous system damage, kidney disease, eye
damage, and cardiovascular events, than the general population. Furthermore, treatments for
diabetes complications are a major contributor to the healthcare costs attributable to diabetes,
particularly due to hospitalizations and emergency department visits (Ravaut et al., 2021).

Diabetes Mellitus (DM), sometimes simply called diabetes, is a term for a group of disorders that affect how the body converts food into energy. When food is consumed, the body converts it into a sugar called glucose and releases it into the bloodstream. The pancreas produces insulin, a hormone that helps move glucose from the blood into the cells, which use it for energy (Chaki et al., 2020).

In addition, diabetes mellitus has become an increasing concern owing to its high morbidity, and the average age of individuals affected by the disease has now decreased to the mid-twenties. Given its high prevalence, this problem must be addressed effectively (Sharma et al., 2021).

Diabetes mellitus is now a global public health issue. In 2019, the International Diabetes Federation estimated that 463 million people worldwide were living with diabetes and projected growth of 51% by 2045. Moreover, it is estimated that there is one undiagnosed person for each person diagnosed with diabetes (Fregoso-Aparicio et al., 2021).

Machine learning has demonstrated powerful predictive capabilities and parallel processing capabilities for handling large numbers of variables. Furthermore, machine learning offers variable screening mechanisms that can detect and interpret complex relationships between variables (Qin et al., 2022). Machine learning algorithms have also been embedded into data mining pipelines, which combine them with classical statistical strategies to extract knowledge from data (Dagliati et al., 2018).

Data mining is one of the most essential components of medical research, which inevitably generates enormous volumes of data because of the significant societal impact of the disease. Applying machine learning and data mining techniques may be a crucial strategy for making use of the vast amounts of diabetes-related data that are already accessible. For decision-making, administration, and other related aspects of healthcare organizations, a combination of machine learning and data mining methods may be highly relevant.

3.0 Problem statement:

Diabetes is one of the most common diseases leading to disability and death worldwide, and its incidence is increasing, particularly in developing countries. A large number of people in Bangladesh now experience the detrimental effects of diabetes, and the number of cases is expected to double by 2025 (Pronab et al., 2021).

Diabetes can cause numerous health complications, including cardiovascular disease, stroke, chronic kidney disease, foot ulcers, nerve damage, eye damage, cognitive impairment, and death. Because of these severe complications, the detection of diabetes is of great importance. Many research studies have addressed diabetes identification, and many of them are based on the Pima Indian diabetes data set. This data set comes from a study of women in the Pima Indian population that began in 1965, a population in which the onset rate of diabetes is comparatively high. Most previous studies focused on one or two particular complex techniques for testing the data, while a comprehensive study covering many common techniques is missing (Wei et al., 2018).

One of the most significant research applications is the prognosis and treatment of life-threatening diseases such as Diabetes Mellitus (DM). DM is among the most widespread diseases among the elderly (World Health Organization, 2020). In 2017, 451 million individuals globally were diabetic, according to the International Diabetes Federation. This figure is expected to rise to 693 million over the next 26 years. The primary cause of DM remains unclear, but researchers believe that both environmental and genetic factors play an important role in DM (Chaki et al., 2020).

Diabetes increases the risk of kidney disease, loss of sight, nerve injury, and blood vessel damage, and it contributes to heart disease. The cause of diabetes remains uncertain, although both genetics and environmental factors such as obesity and lack of exercise appear to play roles (Chitra et al., 2015).

The prevalence of diabetes is increasing worldwide. The International Diabetes Federation (IDF) estimates that 536.6 million people were living with diabetes (diagnosed or undiagnosed) in 2021, and this number is projected to increase by 46%, reaching 783.2 million by 2045. As previous IDF estimates and other studies have shown, approximately 50% of all individuals with diabetes are unaware of their condition. In 2021, the global prevalence of undiagnosed diabetes remained high: almost half of all people with diabetes (44.7%; 239.7 million) were unaware that they had the condition (Katherine et al., 2022).

4.0 Literature Review:

The following are discussions of earlier work on data mining and machine learning
techniques:

Tomar et al. (2013) surveyed the data mining methods used in healthcare, including classification, clustering, association, and regression. The authors provided a brief explanation of the benefits and drawbacks of the various data mining approaches. According to the authors, it is important to identify redundant and incorrect features before using classification techniques, since these attributes behave as noise and outliers and slow down processing. The authors advised adopting a cross-validation technique to improve classifier performance. They also examined how clustering might be useful when a data set has missing or limited information, and they recommended hybrid data mining techniques such as association with clustering or classification, fusion of various classifiers, or clustering with classification.

According to Sharma et al. (2021), several scientists and medical professionals have now created artificial intelligence-based detection tools to address more effectively the issues that are overlooked as a result of human error. Examples of the data mining techniques and algorithms implemented by different practitioners include machine vision systems that learn from facial images to obtain better features for model training, and diagnosis based on the presentation of iridocyclitis, which detects the disease through iris patterns.

According to Ahmad et al. (2015), data mining has huge potential to help healthcare systems use data more effectively and efficiently. The authors found that classification is one of the most often used data mining techniques in the healthcare industry. They cite data mining as crucial in the identification of fraud and abuse, the provision of better medical care at affordable cost, intelligent healthcare decision support systems, and the early detection of diseases such as diabetes, heart disease, lung cancer, thyroid disorders, dengue, Alzheimer's disease, and others. The authors also discussed the difficulties researchers have encountered when using healthcare data for data mining, difficulties that can pose severe obstacles to drawing the right conclusions. They recommended combining several data mining approaches in order to improve survival rates for conditions with high mortality and to increase the accuracy of disease prognosis.

According to Mehrbakhsh et al. (2017), data mining techniques have been extensively employed in constructing decision support systems for disease prediction using collections of medical datasets. The authors suggested integrating clustering, noise reduction, and prediction approaches to create a new knowledge-based system for disease prediction. They recommended the Classification and Regression Trees (CART) method for creating the fuzzy rules to be employed in the knowledge-based system.

According to Pronab et al. (2021), a sizable share of people worldwide is now experiencing the negative effects of diabetes, and many of them are not diagnosed at an early stage. The authors noted that this may eventually lead to major health problems such as blindness and renal failure. They recommended using a variety of machine learning (ML) techniques to classify the condition correctly.

Chitra et al. (2015) performed a comprehensive review with the goal of assisting researchers in creating ensemble learning approaches for the early identification and diagnosis of diabetes. According to the authors, Support Vector Machines (SVMs) have demonstrated strong performance in a variety of application domains. The authors' method focuses on identifying people who are at risk of pre-diabetes or undetected diabetes and helping them decide whether to visit a doctor for additional testing. According to the authors, the proposed system can learn and evaluate the diabetes diagnosis with high accuracy, indicating the model's capacity for diagnosis.

Zou et al. (2018) report that, given the rising morbidity of recent years, there will be 642 million diabetic patients worldwide in 2040, or one in every ten people. Diabetes can cause chronic damage to and malfunction of many tissues, including the eyes, kidneys, heart, blood vessels, and nerves. The authors distinguish two kinds of diabetes: type 1 diabetes (T1D) and type 2 diabetes (T2D). Type 1 diabetes patients are typically under 30 years old. To predict diabetes mellitus, the authors built machine learning models using decision trees, random forests, and neural networks. According to the authors, random forests outperform other classifiers in several applications.

Shafi et al. (2022) describe diabetes as a chronic metabolic disease characterized by high blood sugar levels brought on by inadequate insulin synthesis. It is one of the major health issues and affects millions of individuals throughout the world. Diabetes that is not under control increases the risk of cancer, renal damage, heart attack, blindness, and other diseases. The authors created an approach for predicting the risk of developing diabetes in North Kashmir using machine learning algorithms. According to estimates cited by the authors, 285 million individuals worldwide (6.4 percent of adults) had diabetes in 2010, and the number is estimated to reach 552 million by 2030. Based on the disease's estimated growth rate, one in ten people is expected to have diabetes by 2040. In order to predict a patient's diabetes status at the earliest practical stage, the authors proposed a variety of classification models based on machine learning techniques.

According to research by Katherine et al. (2021) and the IDF chart book published in 2017, there are approximately 424.9 million diabetes patients worldwide between the ages of 20 and 79. Of them, 95% have Type 2 Diabetes Mellitus (T2DM). By 2045, the number is expected to reach 628.6 million. Diabetes can bring on numerous complications, including nephropathy, cardiovascular disease, retinal disease, neuropathy, and many more. The study explains how machine learning may be applied to clinical diagnostics to create frameworks that use patient-specific data to predict the likelihood of complications caused by diabetes.

Rahman et al. (2018) describe machine learning as an ever-expanding area of artificial intelligence that uses vast amounts of data to create algorithms that can spot patterns. Their study compares several machine learning algorithms and their results in predicting the health problems associated with diabetes mellitus. Diabetes mellitus is a pancreas-related condition in which the body's ability to produce or respond to the hormone insulin is reduced. In a controlled setting, methods such as Logistic Regression, SVM, Naive Bayes, Decision Trees, and Random Forest have been used to predict the likelihood of cardiovascular disease and diabetes-induced nephropathy.

4.1 Research Gap:

According to the reviewed literature, the proposed technique may be relevant to other disease classification problems, including datasets of the same kind as those used in this study. However, much remains to be done in order to fully exploit the promise and utility of disease detection systems based on fuzzy rules, noise reduction, and clustering.

In the future, the datasets for disease classification and prediction using incremental machine learning algorithms need to be given greater attention. It is therefore necessary to evaluate this technique on more datasets, especially very large ones, in order to determine its suitability for large-scale data processing. In addition, the suggested approach may be extended to make it suitable for various types of medical datasets.

5.0 Research Aim and objectives:

The aim of this research work is to develop and use a novel system based on clustering and classification that relies on a hybrid machine learning approach for the diagnosis of diabetes using genomic databases. The main aspects (disease, strategy, outcome, accuracy) of diverse research efforts, as well as how they applied their techniques or methods, will be emphasized.

The research objectives are:

1. To cluster the data in order to assess their patterns and to classify the content according to behavior informatics.

2. To improve the diagnostic performance of present diagnostic techniques for disease prediction by assessing diagnostic data with supervised and unsupervised machine learning algorithms.

3. To validate the two-layer nested cross-validation classification technique.

4. To assess the effectiveness of the suggested technique by measuring the precision, recall, F-measure, accuracy, and classification rate.

5. To evaluate how various classifiers and clustering techniques perform on datasets related to diabetes research.
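The two-layer nested cross-validation named in the third objective can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration only: the toy glucose readings and labels are invented, a one-feature k-nearest-neighbour classifier stands in for the classifiers the proposal would actually use, and the helper names (`k_fold_indices`, `knn_predict`, `nested_cv`) are hypothetical. The inner layer chooses the hyperparameter using the training portion only; the outer layer estimates performance on data that played no part in that choice.

```python
def k_fold_indices(n, k):
    """Split the indices 0..n-1 into k contiguous folds of near-equal size."""
    folds, start = [], 0
    for i in range(k):
        size = n // k + (1 if i < n % k else 0)
        folds.append(list(range(start, start + size)))
        start += size
    return folds


def knn_predict(train_x, train_y, x, k):
    """Majority vote among the k training points closest to x (one feature)."""
    nearest = sorted(zip(train_x, train_y), key=lambda p: abs(p[0] - x))[:k]
    positives = sum(label for _, label in nearest)
    return 1 if 2 * positives > len(nearest) else 0


def knn_accuracy(train_x, train_y, test_x, test_y, k):
    """Fraction of test points that the k-NN stand-in classifies correctly."""
    hits = sum(knn_predict(train_x, train_y, x, k) == y
               for x, y in zip(test_x, test_y))
    return hits / len(test_y)


def take(values, indices):
    return [values[i] for i in indices]


def nested_cv(xs, ys, candidate_ks, outer_k=3, inner_k=2):
    """Return (mean outer accuracy, hyperparameter chosen in each outer fold)."""
    outer_scores, chosen = [], []
    for test_idx in k_fold_indices(len(xs), outer_k):
        held_out = set(test_idx)
        train_idx = [i for i in range(len(xs)) if i not in held_out]
        tr_x, tr_y = take(xs, train_idx), take(ys, train_idx)

        # Inner layer: pick k using only the outer training portion.
        best_k, best_score = None, -1.0
        for k in candidate_ks:
            scores = []
            for val_idx in k_fold_indices(len(tr_x), inner_k):
                val = set(val_idx)
                fit_idx = [i for i in range(len(tr_x)) if i not in val]
                scores.append(knn_accuracy(
                    take(tr_x, fit_idx), take(tr_y, fit_idx),
                    take(tr_x, val_idx), take(tr_y, val_idx), k))
            mean_score = sum(scores) / len(scores)
            if mean_score > best_score:
                best_k, best_score = k, mean_score

        # Outer layer: score the selected k on the untouched outer test fold.
        chosen.append(best_k)
        outer_scores.append(knn_accuracy(
            tr_x, tr_y, take(xs, test_idx), take(ys, test_idx), best_k))
    return sum(outer_scores) / len(outer_scores), chosen


# Invented toy data: plasma glucose readings and labels (1 = diabetic).
glucose = [90, 160, 95, 170, 100, 180, 85, 165, 105, 175, 92, 168]
labels = [0, 1, 0, 1, 0, 1, 0, 1, 0, 1, 0, 1]

score, chosen = nested_cv(glucose, labels, candidate_ks=(1, 3, 5))
print(score, chosen)
```

A real study would normally shuffle and stratify the folds; contiguous folds are used here only to keep the sketch deterministic.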

The research questions are:

1. What features make up the database used to generate the model, specifically?

2. Which machine learning method is best for developing a diabetes prediction model?

3. What are the ideal validation measures to assess the effectiveness of the models?

6.0 Proposed methodology:

The major focus of this study is a knowledge-based methodology for diagnosing diabetes using a genomic database. It performs clustering, noise reduction, forecasting, and classification using a rule-based decision tree approach in combination with a deep belief network configuration.

The research implementation follows three steps. First, existing variables in the database are used to forecast unknown or future values of interest. Second, patterns that describe the data are developed so that they can be presented to the user for interpretation. Third, classification and a rule-based decision tree are used to create fuzzy rules that can be used within the knowledge-based framework.
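As a rough illustration of how a decision tree yields rules, the sketch below finds a single CART-style split on one feature by minimising the weighted Gini impurity and prints it as an IF-THEN rule. It is only a sketch under stated assumptions: the glucose values and labels are invented, the function names (`gini`, `best_split`, `as_rule`) are hypothetical, the rule is crisp, and the fuzzification step that the proposal envisages is not shown.

```python
def gini(labels):
    """Gini impurity of a list of binary labels (0 or 1)."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2 * p * (1 - p)


def best_split(xs, ys):
    """Return (threshold, weighted impurity) of the best single split."""
    best_threshold, best_impurity = None, float("inf")
    values = sorted(set(xs))
    # Candidate thresholds are midpoints between consecutive distinct values.
    for low, high in zip(values, values[1:]):
        threshold = (low + high) / 2
        left = [y for x, y in zip(xs, ys) if x <= threshold]
        right = [y for x, y in zip(xs, ys) if x > threshold]
        impurity = (len(left) * gini(left) + len(right) * gini(right)) / len(ys)
        if impurity < best_impurity:
            best_threshold, best_impurity = threshold, impurity
    return best_threshold, best_impurity


def majority(labels):
    """Most common binary label; ties go to class 1."""
    return 1 if 2 * sum(labels) >= len(labels) else 0


def as_rule(feature, threshold, xs, ys):
    """Express the split as a crisp IF-THEN rule using each side's majority."""
    names = {1: "diabetic", 0: "non-diabetic"}
    above = majority([y for x, y in zip(xs, ys) if x > threshold])
    below = majority([y for x, y in zip(xs, ys) if x <= threshold])
    return (f"IF {feature} > {threshold} THEN {names[above]} "
            f"ELSE {names[below]}")


# Invented toy data: glucose readings and labels (1 = diabetic).
glucose = [85, 89, 110, 115, 137, 148, 183]
labels = [0, 0, 0, 0, 1, 1, 1]

threshold, impurity = best_split(glucose, labels)
rule = as_rule("glucose", threshold, glucose, labels)
print(rule)
```

A full tree would apply the same split search recursively to each side and over every feature; each root-to-leaf path then becomes one rule.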

6.1 Implementation of proposed methodology:

• Step 1: Pre-processing of structured or unstructured data.

• Step 2: Classification for early diagnosis of the disease or disorder.

• Step 3: Performance evaluation of the classifier.

6.1.1 Pre-processing:

• Data Reduction: Combines noise and repetition removal with the float location module to make models easier to use.

• Normalization: To remove artifacts from the data, we apply low-pass filtering, removal of missing values (zero or negative), data review, outlier detection and removal, and statistical calculations of the maximum, minimum, mean, median, mode, standard deviation, and range, so that the data set is normalized through to the study's conclusion.
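The normalization steps listed above can be sketched with the Python standard library alone. Several choices here are assumptions rather than the proposal's stated method: outliers are detected with the 1.5 x IQR rule, the final rescaling is min-max normalization, low-pass filtering is left out, and the glucose readings are invented sample values.

```python
import statistics


def drop_missing(values):
    """Treat zero or negative readings as missing values and drop them."""
    return [v for v in values if v > 0]


def summarize(values):
    """Descriptive statistics named in the pre-processing step."""
    return {
        "max": max(values),
        "min": min(values),
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "mode": statistics.mode(values),
        "stdev": statistics.stdev(values),
        "range": max(values) - min(values),
    }


def remove_outliers(values, k=1.5):
    """Keep values within k * IQR of the quartiles (assumed outlier rule)."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if low <= v <= high]


def min_max_normalize(values):
    """Rescale values linearly to the interval [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Invented glucose readings: 0 and -1 stand for missing values, 600 is an outlier.
raw = [148, 85, 0, 183, 89, 137, 116, -1, 78, 600, 115, 110]

present = drop_missing(raw)
cleaned = remove_outliers(present)
stats = summarize(cleaned)
normalized = min_max_normalize(cleaned)
print(stats)
print(normalized)
```

The order matters: missing-value codes are removed before the quartiles are computed, otherwise the zeros and negatives would distort the outlier bounds.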

6.1.2 Classification:
The proposed study incorporates a discretization procedure in the Convolutional Neural Network (CNN) based classifier for class information, in order to account for the attributes of the convolutional layers and fully connected layers under a few class modifications.

• We design a Deep Belief Network (DBN) to address the slow learning and overfitting that traditional neural networks exhibit on training datasets in deep layered networks.
• To prevent the error from being back-propagated through time and layers, we use a rule-based decision tree strategy, which is intended to limit the errors made as the model learns across several time steps.

6.1.3 Evaluation:
The performance of the classifier is assessed on three measures: sensitivity, specificity, and accuracy.

• Sensitivity is the proportion of actual positives that the classifier identifies correctly. In other words, sensitivity reveals how many of the true positives are accurately identified.

$$\text{Sensitivity (\%)} = \frac{\text{True Positives}}{\text{True Positives} + \text{False Negatives}} \times 100$$

• Specificity, usually reported alongside sensitivity, measures the proportion of actual negatives that are identified correctly.

$$\text{Specificity (\%)} = \frac{\text{True Negatives}}{\text{True Negatives} + \text{False Positives}} \times 100$$

• Accuracy expresses the agreement between the predicted values and the actual values, that is, how close the predictions are to the real values over all samples.

$$\text{Accuracy (\%)} = \frac{\text{True Positives} + \text{True Negatives}}{\text{Total number of samples}} \times 100$$

Table 1 defines the abbreviations used in these measures: True Positive (TP), True Negative (TN), False Positive (FP), and False Negative (FN).

Table 1:

| Abbreviation | Explanation |
|---|---|
| True Positive (TP) | The number of people who really have the disease or disorder and are diagnosed with it: Target output = 1, Network output = 1 |
| False Positive (FP) | The number of people who are really healthy but are diagnosed with the disease or disorder: Target output = 0, Network output = 1 |
| True Negative (TN) | The number of people who are really healthy and are diagnosed as healthy: Target output = 0, Network output = 0 |
| False Negative (FN) | The number of people who really have the disease but are diagnosed as healthy: Target output = 1, Network output = 0 |
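The three evaluation measures follow directly from the four counts in Table 1. The short sketch below shows the calculation; the target and network output vectors are invented for illustration, and the function names are hypothetical.

```python
def confusion_counts(targets, outputs):
    """Count TP, FP, TN, FN for binary labels (1 = diseased, 0 = healthy)."""
    tp = fp = tn = fn = 0
    for target, output in zip(targets, outputs):
        if target == 1 and output == 1:
            tp += 1
        elif target == 0 and output == 1:
            fp += 1
        elif target == 0 and output == 0:
            tn += 1
        elif target == 1 and output == 0:
            fn += 1
    return tp, fp, tn, fn


def evaluate(tp, fp, tn, fn):
    """Sensitivity, specificity, and accuracy as percentages."""
    total = tp + fp + tn + fn
    return {
        "sensitivity": 100.0 * tp / (tp + fn),
        "specificity": 100.0 * tn / (tn + fp),
        "accuracy": 100.0 * (tp + tn) / total,
    }


# Invented example: target output versus network output for eight people.
targets = [1, 1, 1, 0, 0, 0, 0, 1]
outputs = [1, 0, 1, 0, 0, 1, 0, 1]

counts = confusion_counts(targets, outputs)
metrics = evaluate(*counts)
print(counts, metrics)
```

With these eight samples there are three true positives, one false positive, three true negatives, and one false negative, so all three measures come to 75%. The sketch assumes at least one actual positive and one actual negative; otherwise the corresponding ratio is undefined.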

References:
Ahmad, Parvez & Qamar, Saqib & Rizvi, Syed. (2015). Techniques of Data Mining in Healthcare: A Review. International Journal of Computer Applications. 120. 38-50. 10.5120/21307-4126. https://research.ijcaonline.org/volume120/number15/pxc3904126.pdf

Chaki, J., Thillai Ganesh, S., Cidham, S.K., Ananda Theertan, S. (2020). Machine Learning and Artificial Intelligence based Diabetes Mellitus Detection and Self-Management: A Systematic Review. Journal of King Saud University - Computer and Information Sciences. https://www.sciencedirect.com/science/article/pii/S1319157820304134

Chitra Arjun, Mr. Anto S. (2015). Diagnosis of Diabetes Using Support Vector Machine and Ensemble Learning Approach. International Journal of Engineering and Applied Sciences. 2394-3661, 2 (11). https://www.ijeas.org/download_data/IJEAS0211027.pdf

Dagliati, A., Marini, S., Sacchi, L., Cogni, G., Teliti, M., Tibollo, V., De Cata, P., Chiovato, L., & Bellazzi, R. (2018). Machine Learning Methods to Predict Diabetes Complications. Journal of Diabetes Science and Technology, 12(2), 295-302. https://doi.org/10.1177/1932296817706375

Fregoso-Aparicio, L., Noguez, J., Montesinos, L. et al. (2021). Machine learning and deep learning predictive models for type 2 diabetes: a systematic review. Diabetol Metab Syndr 13, 148. https://doi.org/10.1186/s13098-021-00767-9

Katherine Ogurtsova, Leonor Guariguata, Noël C. Barengo, Paz Lopez-Doriga Ruiz, Julian W. Sacre, Suvi Karuranga, Hong Sun, Edward J. Boyko, Dianna J. Magliano. (2022). IDF diabetes Atlas: Global estimates of undiagnosed diabetes in adults for 2021. Diabetes Research and Clinical Practice, Volume 183. https://www.sciencedirect.com/science/article/pii/S0168822721004770

Mehrbakhsh Nilashi, Othman bin Ibrahim, Hossein Ahmadi, Leila Shahmoradi. (2017). An analytical method for diseases prediction using machine learning techniques. Computers & Chemical Engineering, Volume 106, Pages 212-223. https://doi.org/10.1016/j.compchemeng.2017.06.011

Pronab Ghosh, Sami Azam, Asif Karim, Mehedi Hassan, Kuber Roy, Mirjam Jonkman. (2021). A Comparative Study of Different Machine Learning Tools in Detecting Diabetes. Procedia Computer Science, Volume 192, Pages 467-477. https://doi.org/10.1016/j.procs.2021.08.048

Qin Y, Wu J, Xiao W, Wang K, Huang A, Liu B, Yu J, Li C, Yu F, Ren Z. (2022). Machine Learning Models for Data-Driven Prediction of Diabetes by Lifestyle Type. Int J Environ Res Public Health. 2022 Nov 15;19(22):15027. doi: 10.3390/ijerph192215027. https://www2.mdpi.com/1660-4601/19/22/15027

Rahman Tahsinur, Farzana Sheikh Mastura & Khanom Aniqa Zaida. (2018). Prediction of Diabetes Induced Complications Using Different Machine Learning Algorithms. Thesis: Department of Computer Science and Engineering, BRAC University. http://dspace.bracu.ac.bd/xmlui/bitstream/handle/10361/10945/15101128_CSE.pdf?sequence=1&isAllowed=y

Ravaut, M., Sadeghi, H., Leung, K.K. et al. (2021). Predicting adverse outcomes due to diabetes complications with machine learning using administrative health data. npj Digit. Med. 4, 24. https://doi.org/10.1038/s41746-021-00394-8

Shafi, Salliah & Selvam, Venkatesan & Ansari, Gufran & Ansari, Mohd Dilshad & Rahman, Md Habibur. (2022). Prevalence and Early Prediction of Diabetes Using Machine Learning in North Kashmir: A Case Study of District Bandipora. Computational Intelligence and Neuroscience. 2022. 1-12. 10.1155/2022/2789760.

Sharma, T., Shah, M. (2021). A comprehensive review of machine learning techniques on diabetes detection. Vis. Comput. Ind. Biomed. Art 4, 30. https://doi.org/10.1186/s42492-021-00097-7

Tomar, Divya. (2013). A survey on Data Mining approaches for Healthcare. International Journal of Bio-Science and Bio-Technology. 5. 241-266. http://dx.doi.org/10.14257/ijbsbt.2013.5.5.25

Wei, Sidong & Xuejiao, Zhao & Miao, Chunyan. (2018). A comprehensive exploration to the machine learning techniques for diabetes identification. 291-295. https://hdl.handle.net/10356/89478

Zou, Quan & Qu, Kaiyang & Luo, Yamei & Yin, Dehui & Ju, Ying & Tang, Hua. (2018). Predicting Diabetes Mellitus With Machine Learning Techniques. Frontiers in Genetics. 9. 10.3389/fgene.2018.00515. https://www.frontiersin.org/articles/10.3389/fgene.2018.00515/full

No ratings yet
Using Sentiment Analysis and Machine Learning Algorithms To Determine Citizens' Perceptions
6 pages
Diabetes Mellitus Prediction and Diagnosis 2022
No ratings yet
Diabetes Mellitus Prediction and Diagnosis 2022
12 pages
54 Batch Project Documentation-1
No ratings yet
54 Batch Project Documentation-1
82 pages
Paper 105
No ratings yet
Paper 105
6 pages
Final NCM 116 PDF
No ratings yet
Final NCM 116 PDF
20 pages
Plan of Action: To Be Fit, Fine & Healthy Life-Long
No ratings yet
Plan of Action: To Be Fit, Fine & Healthy Life-Long
18 pages
Lemone/Burke/Bauldoff/Gubrud, Medical-Surgical Nursing 6Th Edition Test Bank
100% (1)
Lemone/Burke/Bauldoff/Gubrud, Medical-Surgical Nursing 6Th Edition Test Bank
48 pages
Cold Drinks
No ratings yet
Cold Drinks
20 pages
NCMB316 Lec Midterm
No ratings yet
NCMB316 Lec Midterm
28 pages
Gs Selection Medline Embase DP
No ratings yet
Gs Selection Medline Embase DP
3,876 pages
National Training On HTN and DM For HCWs at ART Clinics - Facilitator's Guide Final Draft
No ratings yet
National Training On HTN and DM For HCWs at ART Clinics - Facilitator's Guide Final Draft
98 pages
CASE STUDY - Type 2 Diabetes
No ratings yet
CASE STUDY - Type 2 Diabetes
5 pages
Progress in Cardiovascular Diseases
No ratings yet
Progress in Cardiovascular Diseases
8 pages
Diabetes
No ratings yet
Diabetes
352 pages
MCQ - Discussion On DM: DR Aditi Chaturvedi Prof. and Head Department of Pharmacology
No ratings yet
MCQ - Discussion On DM: DR Aditi Chaturvedi Prof. and Head Department of Pharmacology
57 pages
Diabetes Mellitus
No ratings yet
Diabetes Mellitus
5 pages
CCHM321 Lec
No ratings yet
CCHM321 Lec
12 pages
Herbal Anti Diabetic Tea Granules Powder
No ratings yet
Herbal Anti Diabetic Tea Granules Powder
37 pages
PDF The 10 Food Groups You Must Eat - Final For Publication
No ratings yet
PDF The 10 Food Groups You Must Eat - Final For Publication
31 pages
Wur Nutrition and Health Thesis
100% (3)
Wur Nutrition and Health Thesis
6 pages
Associations Between Dental Caries and Systemic Diseases: A Scoping Review
No ratings yet
Associations Between Dental Caries and Systemic Diseases: A Scoping Review
35 pages
Hypertension GEMS
No ratings yet
Hypertension GEMS
2 pages
Hipoglucemia en DBT2
No ratings yet
Hipoglucemia en DBT2
20 pages
Module 2 Content
No ratings yet
Module 2 Content
46 pages
Non-Alcoholic Fatty Liver Disease and Associated Lipid Profile in Type II Diabetes Et Al 2024
No ratings yet
Non-Alcoholic Fatty Liver Disease and Associated Lipid Profile in Type II Diabetes Et Al 2024
9 pages
Cureus 0015 00000040981
No ratings yet
Cureus 0015 00000040981
12 pages
Popular Diets: A Scientific Review: Obesity Research March 2001
No ratings yet
Popular Diets: A Scientific Review: Obesity Research March 2001
41 pages
Deadly Sweet
No ratings yet
Deadly Sweet
2 pages
ADN Care Plan Maternity PP Diabetes
No ratings yet
ADN Care Plan Maternity PP Diabetes
3 pages
Nutri Lec Q3 Finals Quizlet 2
No ratings yet
Nutri Lec Q3 Finals Quizlet 2
2 pages
Health Data Analytics And Informatics
From Everand
Health Data Analytics And Informatics
Mbuso Mabuza
No ratings yet



Diabetes, also known as diabetes mellitus, is a group of metabolic disorders

As a result, we conduct a thorough analysis of the most widely used methods of

2.0 Background of the study:

3.0 Problem statement:

Nowadays, the prevalence of diabetes is increasing worldwide. The International

Various data mining methods, including classification, clustering, associations, and

According to a study by Pronab et al. (2021), a sizable proportion of individuals worldwide

According to Shafi et al. (2022), the chronic metabolic disease called diabetes is

According to a study by Rahman et al. (2018), an ever-expanding area of artificial

4.1 Research Gap:

5.0 Research Aim and objectives:

The research objectives are:

2. By assessing diagnostic data with supervised and unsupervised machine learning

3. To validate the two-layer nested cross-validation classification technique.

4. To evaluate the effectiveness of the proposed technique by assessing the precision,

5. To evaluate how various classifiers and clustering techniques perform on datasets
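The two-layer nested cross-validation named in objective 3 can be illustrated with a minimal sketch. This is not the proposal's implementation: the 1-D synthetic "measurement" data, the k-nearest-neighbours classifier, the candidate values, and all function names below are assumptions chosen only to keep the example self-contained. The inner loop selects a hyperparameter using the outer-training data alone, and the outer loop estimates performance on data that played no part in that selection.

```python
import random
from statistics import mean


def k_fold_indices(n, k):
    """Split indices 0..n-1 into k interleaved folds (hypothetical helper)."""
    return [list(range(i, n, k)) for i in range(k)]


def knn_predict(train, query, k):
    """Majority vote among the k nearest training rows (1-D feature, 0/1 label)."""
    nearest = sorted(train, key=lambda row: abs(row[0] - query))[:k]
    votes = sum(label for _, label in nearest)
    return 1 if 2 * votes > len(nearest) else 0


def accuracy(train, test, k):
    """Fraction of test rows whose label is predicted correctly."""
    hits = sum(knn_predict(train, x, k) == y for x, y in test)
    return hits / len(test)


def split(data, held_out):
    """Return (training rows, held-out rows) for one fold."""
    held = set(held_out)
    train = [row for i, row in enumerate(data) if i not in held]
    test = [data[i] for i in held_out]
    return train, test


def cross_val_score(data, k, n_folds):
    """Mean accuracy of k-NN over n_folds folds of `data`."""
    scores = []
    for held_out in k_fold_indices(len(data), n_folds):
        train, test = split(data, held_out)
        scores.append(accuracy(train, test, k))
    return mean(scores)


def nested_cv(data, candidates=(1, 3, 5), outer_folds=5, inner_folds=3):
    """Outer loop estimates performance; inner loop tunes k on outer-train only."""
    outer_scores = []
    for held_out in k_fold_indices(len(data), outer_folds):
        outer_train, outer_test = split(data, held_out)
        best_k = max(
            candidates,
            key=lambda k: cross_val_score(outer_train, k, inner_folds),
        )
        outer_scores.append(accuracy(outer_train, outer_test, best_k))
    return mean(outer_scores)


# Synthetic, illustrative data only: label 0 centred at 100, label 1 at 160.
rng = random.Random(0)
data = [(rng.gauss(100, 15), 0) for _ in range(30)]
data += [(rng.gauss(160, 15), 1) for _ in range(30)]
rng.shuffle(data)

score = nested_cv(data)
print(f"nested CV accuracy estimate: {score:.3f}")
```

Because the outer test fold never influences the choice of hyperparameter, the averaged outer score is a less optimistic estimate than one obtained by tuning and scoring on the same folds.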

The research questions are:

2. Which machine learning method is best for developing a diabetes prediction

6.0 Proposed methodology:

6.1 Implementation of proposed methodology:

• Step 2: Classification for early diagnosis of the disease or disorder.

• Step 3: Performance evaluation of the classifier.

• Specificity is ordinarily paired with sensitivity, which measures the proportion of

Accuracy = (True Positives + True Negatives) / (True Positives + True Negatives + False Positives + False Negatives)

The number of people who are really diagnosed with the disease or

The number of people who are really healthy but diagnosed with

The number of people who are really healthy but diagnosed as

The number of people who really have the disease but
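The counts described above correspond to the four cells of a confusion matrix. As a minimal sketch, assuming they are named `tp`, `fp`, `tn` and `fn` (names and example numbers are illustrative, not taken from the proposal), the standard evaluation measures can be computed as follows.

```python
def safe_div(numerator, denominator):
    """Return numerator / denominator, or 0.0 when the denominator is zero."""
    return numerator / denominator if denominator else 0.0


def classification_metrics(tp, tn, fp, fn):
    """Compute standard measures from confusion-matrix counts."""
    return {
        # Share of all cases classified correctly.
        "accuracy": safe_div(tp + tn, tp + tn + fp + fn),
        # Share of people with the disease who are detected (recall).
        "sensitivity": safe_div(tp, tp + fn),
        # Share of healthy people who are correctly cleared.
        "specificity": safe_div(tn, tn + fp),
        # Share of positive diagnoses that are correct.
        "precision": safe_div(tp, tp + fp),
    }


# Hypothetical counts for 100 patients, for illustration only.
metrics = classification_metrics(tp=40, tn=45, fp=5, fn=10)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

With these illustrative counts, accuracy is 85/100, sensitivity 40/50, specificity 45/50 and precision 40/45, which shows how a classifier can look strong on accuracy while still missing a fifth of true cases.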

Qin Y, Wu J, Xiao W, Wang K, Huang A, Liu B, Yu J, Li C, Yu F, Ren Z. Machine Learning

Sharma, T., Shah, M. A comprehensive review of machine learning techniques on diabetes
