0% found this document useful (0 votes)

82 views5 pages

Lung Disease Prediction Using K-Means Clustering and Naïve Bayes Algorithm

This document proposes using k-means clustering and naive Bayes algorithms to develop a system for predicting lung diseases. The system would analyze lung disease data using Weka tools to classify data into clusters and predict disease status. This approach aims to create a faster and more accurate prediction system to help doctors detect lung diseases earlier for better treatment outcomes.

Uploaded by

Mohammad Farhan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

82 views5 pages

Lung Disease Prediction Using K-Means Clustering and Naïve Bayes Algorithm

Uploaded by

Mohammad Farhan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

Lung Disease Prediction Using K-Means Clustering and Naïve Bayes Algorithm

Introduction

In the real world, Lung cancer accounts for more deaths than any other cancer in both men
and women. Lung Cancer disease is the fifth leading cause of death in the world over the past
10 year (World Health Organization 2016). According to the WHO (World Health
Organization) report lung Disease is the leading cause of death across the world accounting
for 1.58 million, accounting for about 27 % of all cancer deaths. Death rate began declining
in 1991 in men and in 2003 in women.

Early detection of lung cancer is essential in reducing life losses. However earlier
treatment requires the ability to detect lung cancer in early stages. Early diagnosis requires an
accurate and reliable diagnosis procedure that allow physicians to distinguish benign lung
disease from malignant ones.

Health data is rapidly increasing in the world. Health data is very large and complex due to
this processing of data using traditional data processing techniques is very difficult. For
simplicity, machine learning techniques like KNN, SVM, D.T have been used. Some tool like
Python (pandas) and Weka are widely used in the data analytics field.

Objective

 To study different disease prediction algorithms and literature review.

 To design a system for lung disease prediction based on patient data.

 To design a system for higher accuracy in lung disease prediction than already
existing systems.

 To implement a system using multiple algorithms for increased time-efficiency.

Literature survey

Rucha Shinde , et.al (2015) ,nowadays people work on computers for hours and hours they
don’t have time to take care of themselves. Due to hectic schedules and consumption of junk
food, it affects the health of people and mainly heart. So to we are implementing an heart
disease prediction system using data mining technique Naïve Bayes and k-means clustering
algorithm. It is the combination of both the algorithms. This paper gives an overview for the
same. It helps in predicting the heart disease using various attributes and it predicts the
output as in the prediction form. For grouping of various attributes, it uses k-means algorithm
and for predicting it uses naïve Bayes algorithm.

V.Krishnaiah , et.al (2013) Proposed the potential use of classification based data mining
techniques such as rule based ,decision based, naïve Bayes to massive volume of healthcare
data. The healthcare industry collects huge amount of data which, unfortunately are not
mined to discover hidden information for data preprocessing and effective decision making
one dependency augmented naïve Bayes classifiers(ODANB) and naïve creedal classifiers 2
(NCC2) are used. This is extension of naïve Bayes to imprecise probabilities that aims at
delivering robust classification also when dealing with small or incomplete data sets.

S.Sudha, et.al (2013) data mining is defined as sifting through very large amounts of data
for useful information. Some of the most important and popular data mining techniques are
association rules, classification, clustering, prediction and sequential patterns. Data mining
techniques are used for variety of applications. In health care industry, data mining plays an
important role for predicting diseases. For detecting a disease number of tests should be
required from the patient. But using data mining technique the number of test should be
reduced. This reduced test plays an important role in time and performance.

T.Karthikeyan, et.al, (2014) presented a extraction algorithm used to improve the predicted
accuracy of the classification. This paper applies with Principal Component analysis as a
feature evaluator and ranker for searching method. Naive Bayes algorithm is used as a
classification algorithm. It analyzes the hepatitis patients from the UCI rvine machine
learning repository. The results of the classification model are accuracy and time. Finally, it
concludes that the proposed PCA-NB algorithm performance is better than other
classification techniques for hepatitis patients.

Pallavi Mirajkar, et.al (2011) Cancer identification and prediction are huge challenge to the
researchers. The use of various techniques of data mining techniques has revolutionized the
whole process of cancer Diagnosis and Prognosis. We are proposing integrated system which
is based on combination of various data mining techniques such as analytical hierarchy
process, rule based association, classification etc. that is helpful to predict the patient’s
disease status. Cancer disease risk can be discovered by analyzing and identifying various
factors and symptoms of the patient before recommending treatments. The vital aim of our
system is to help oncologist and medical practitioners in diagnosing the patient by analyzing
available data and relevant information.
Priyanka D, et.al (2014) Lung cancer is one of the major causes of death in both genders
when compared to all other cancers. Lung cancer has become the most hazardous types of
cancer in the world. Early detection of lung cancer is essential in reducing life losses. This
paper presents prediction on lung disease using K means algorithm. This project comprises of
three modules. First, admin module which is administrator’s login there the details of the
patient will be generated. Now the user will authenticate based on their credentials. The
second module is User module there the patient enters his username and password to predict
cancer. Third module is Cancer prediction module in which the result will be predicted at the
last stage with the help of K means algorithm. The K means will classify the input features
into two classes of cancer type (benign and malignant). This project is implemented in java as
the front end and mysql as the back end. This project aims to implement an effective
prediction on lung cancer with the help of K means algorithm user can know the cancer
status. From this project we infer that the K means is suitable for lung cancer prediction

Research Methodology

 To analyze data related to lung diseases for data mining through Weka.

 K-means clustering and naïve Bayes techniques will be use.

 Naive Bayes algorithm will be use as a classification algorithm.

 K-means clustering has the ability to handle massive data and cluster those data
efficiently and quickly.

 A simple and straightforward iterative method will be use to partition the data set into
k-number of clusters.
Tentative Outcomes

Lung disease prediction system will be developed by combining Naïve Bayes and K-
Means algorithm. Weka tools would be used to reduce the execution time of
algorithms. The prediction system may be faster, less computationally expensive, time
efficient and produce results that are more accurate. The proposed system will help
doctors to efficiently predict lung diseases in the initial stages for better treatment.

References

 [1] World Health Organization (2011) The top ten causes of death. World Health
Organization (2013) Deaths from coronary heart disease.

 [2] V.Krishnaiah, G.Narsimha, N.Subhash Chandra. 2013, “Diagnosis of Lung Cancer

Prediction System Using Data Mining Classification Techniques,” International Journal
of Computer Science and Information Technologies, Vol. 4 (1), 2013, 39 – 45

 [3] Rucha Shinde ,Sandhya Arjun,Priyanka Patil,”An intelligent heart disease prediction
system using k-means clustering and naïve bayes algorithm,” IJCSIT 2015 ,vol 6(1),2015

 [4] S.Sudha , S.Vijayarani , “Disease Prediction in Data Mining Technique” Vol. II, Issue
I, January 2013 (ISSN: 2278-7720).

 [5] T.Karthikeyan , P.Thangaraju, “PCA-NB Algorithm to Enhance the Predictive

Accuracy” 2014,IJET,vol.6(1)

 [6] Ankit Agrawal, Sanchit Misra, Ramanathan Narayanan, Lalith Polepeddi, Alok
Choudhary, “A Lung Cancer Outcome Calculator Using Ensemble Data Mining on
SEER Data,” BIOKDD 2011, August 2011, San Diego, CA, USA, 2011.

 [7] S. S. Mohamed and M. M. A. Salama, “Computer-aided diagnosis for prostate cancer

using support vector machine,” Proceedings SPIE Med. Imag., vol. 5744, pp. 898–906,
2005.

 [8] MS.Mehdi Khundmir Iliyas, “Heart disease prediction using naïve Bayes and k-
means techniques”, IJRPET, VOLUME 3, ISSUE 6, Jun.-2017, ISSN: 2454-7875

 [9] S. Vijayarani and S. Sudha ,” An Efficient Clustering Algorithm for Predicting

Diseases from Hemogram Blood Test Samples “ Vol 8(17), DOI:
10.17485/ijst/2015/v8i17/52123, August 2015

 [10] Priyanka D ,Ms S Shehar Bano , Prediction on lung disease using k-means
algorithm, IJERT vol 1 issue 11, 2014

 [11] Tanupriya Choudhury, Vivek Kumar ,“ Intelligent Classification & Clustering

Of Lung & Oral Cancer through Decision Tree & Genetic Algorithm ,”
IJARCSSE, Volume 5, Issue 12, December 2015 ISSN: 2277 128X .
 [12] P.Ramachandran , N.Girija and T.Bhuvaneswari ,“ Early Detection and
Prevention of Cancer using Data Mining Techniques ,”IJCA vol (97) no-13,2014.

 [13] Ada , Rajneet Kaur ,“ A Study of Detection of Lung Cancer Using Data
Mining Classification Techniques”,IJARCSSE, vol 3 issue 3,2013

Project Proposal Free Health Camp
78% (49)
Project Proposal Free Health Camp
2 pages
Detection of Heart Failure Using Different Machine Learning Algorithms
No ratings yet
Detection of Heart Failure Using Different Machine Learning Algorithms
5 pages
A Comparative Study of Classification Algorithms For Diseases Prediction in Medical Domain
No ratings yet
A Comparative Study of Classification Algorithms For Diseases Prediction in Medical Domain
5 pages
Thesis Updated
No ratings yet
Thesis Updated
151 pages
Early Prediction of Heart Disease Using Decision Tree Algorithm
No ratings yet
Early Prediction of Heart Disease Using Decision Tree Algorithm
16 pages
TSP CMC 41333
No ratings yet
TSP CMC 41333
14 pages
Heart Disease Prediction Using KNN Algorithm-2
No ratings yet
Heart Disease Prediction Using KNN Algorithm-2
19 pages
AI-based Smart Prediction of Clinical Disease Using Random Forest Classifier and Naive Bayes
No ratings yet
AI-based Smart Prediction of Clinical Disease Using Random Forest Classifier and Naive Bayes
22 pages
6245e19c618b73 12171037
No ratings yet
6245e19c618b73 12171037
9 pages
A Critical Study of Classification Algorithms For Lungcancer Disease Detection and Diagnosis
No ratings yet
A Critical Study of Classification Algorithms For Lungcancer Disease Detection and Diagnosis
8 pages
Disease Prediction System Using Naïve Bayes
No ratings yet
Disease Prediction System Using Naïve Bayes
7 pages
Pandi A Raj 2021
No ratings yet
Pandi A Raj 2021
8 pages
Management of Nursing Services and Education
No ratings yet
Management of Nursing Services and Education
14 pages
10 Heart Disease Prediction Kiranjit Kaur
No ratings yet
10 Heart Disease Prediction Kiranjit Kaur
15 pages
NB 1
No ratings yet
NB 1
7 pages
BP 2
No ratings yet
BP 2
6 pages
Comparison of Various Data Mining Methods For Early Diagnosis of Human Cardiology
No ratings yet
Comparison of Various Data Mining Methods For Early Diagnosis of Human Cardiology
9 pages
Farzana 2020
No ratings yet
Farzana 2020
5 pages
Heart Disease Prediction Using Machine Learning
No ratings yet
Heart Disease Prediction Using Machine Learning
7 pages
View of Cardiovascular Heart Disease Prediction Using Machine Learning Classifiers With Data Mining Techniques
No ratings yet
View of Cardiovascular Heart Disease Prediction Using Machine Learning Classifiers With Data Mining Techniques
9 pages
Sample Project Synopsis
No ratings yet
Sample Project Synopsis
5 pages
Irjet V6i31160
No ratings yet
Irjet V6i31160
7 pages
Final Project Report
No ratings yet
Final Project Report
33 pages
Paper 1
No ratings yet
Paper 1
4 pages
ML in Healthcare
No ratings yet
ML in Healthcare
5 pages
Lung Disease Prediction - Edited
No ratings yet
Lung Disease Prediction - Edited
35 pages
Jut 2
No ratings yet
Jut 2
12 pages
Survey of Heart Disease Prediction Based On Data Mining Algorithms Ijariie1844
No ratings yet
Survey of Heart Disease Prediction Based On Data Mining Algorithms Ijariie1844
5 pages
Prediction of Heart Disease by Clustering and Classification Techniques Prediction of Heart Disease by Clustering and Classification Techniques
No ratings yet
Prediction of Heart Disease by Clustering and Classification Techniques Prediction of Heart Disease by Clustering and Classification Techniques
8 pages
Heart Disease Prediction Using Data Mining Techniques: Journal of Analysis and Computation (JAC)
No ratings yet
Heart Disease Prediction Using Data Mining Techniques: Journal of Analysis and Computation (JAC)
8 pages
Prediction of Diseases Using Random Forest
No ratings yet
Prediction of Diseases Using Random Forest
8 pages
Heart Disease Prediction Using Machine Learning
No ratings yet
Heart Disease Prediction Using Machine Learning
11 pages
Three Dimensional Model For Diagnostic Prediction: A Data Mining Approach
No ratings yet
Three Dimensional Model For Diagnostic Prediction: A Data Mining Approach
5 pages
Review of Heart Disease Prediction System Using Data Mining and Hybrid Intelligent Techniques
No ratings yet
Review of Heart Disease Prediction System Using Data Mining and Hybrid Intelligent Techniques
5 pages
Applications of Machine Learning Techniques To Predict Diagnostic Breast Cancer
No ratings yet
Applications of Machine Learning Techniques To Predict Diagnostic Breast Cancer
11 pages
Heart Disease Prediction Using Data Mining
No ratings yet
Heart Disease Prediction Using Data Mining
3 pages
Lung Disease Prediction System Using Naive Bayes and K Means Clustering
No ratings yet
Lung Disease Prediction System Using Naive Bayes and K Means Clustering
36 pages
Prediction of Heart Disease Using A Hybrid Technique in Data Mining Classification
No ratings yet
Prediction of Heart Disease Using A Hybrid Technique in Data Mining Classification
3 pages
Lung Disease Prediction System Using Data Mining Techniques
No ratings yet
Lung Disease Prediction System Using Data Mining Techniques
6 pages
Heart Attack Prediction System: Sushmita Manikandan
No ratings yet
Heart Attack Prediction System: Sushmita Manikandan
4 pages
Heart Disease Prediction Using Machine Learning Te
No ratings yet
Heart Disease Prediction Using Machine Learning Te
7 pages
Data Mining Approach To Detect Heart Dieses: Authors
No ratings yet
Data Mining Approach To Detect Heart Dieses: Authors
11 pages
Diagnosis of Heart Disease Using Data Mining Algorithm
No ratings yet
Diagnosis of Heart Disease Using Data Mining Algorithm
3 pages
07 Dr. S. Anitha
No ratings yet
07 Dr. S. Anitha
9 pages
IJCRT2205103
No ratings yet
IJCRT2205103
10 pages
Heart Disease PredictionUsing
No ratings yet
Heart Disease PredictionUsing
6 pages
Prediction of Lung Cancer Using Machine Learning Classifier
No ratings yet
Prediction of Lung Cancer Using Machine Learning Classifier
11 pages
Prediction Heart Disease
No ratings yet
Prediction Heart Disease
11 pages
Heart Disease Prediction Using Naive Bayes and K-Means Techniques
No ratings yet
Heart Disease Prediction Using Naive Bayes and K-Means Techniques
5 pages
Article Eda
No ratings yet
Article Eda
7 pages
Heart Disease Prediction Using Machine Learning Techniques: Devansh Shah Samir Patel Santosh Kumar Bharti
No ratings yet
Heart Disease Prediction Using Machine Learning Techniques: Devansh Shah Samir Patel Santosh Kumar Bharti
6 pages
Decision Tree Algorithms For Prediction of Heart Disease: Srabanti Maji and Srishti Arora
No ratings yet
Decision Tree Algorithms For Prediction of Heart Disease: Srabanti Maji and Srishti Arora
8 pages
Disease Prediction Using Data Mining
No ratings yet
Disease Prediction Using Data Mining
5 pages
8 1486792440 - 10-02-2017 PDF
No ratings yet
8 1486792440 - 10-02-2017 PDF
5 pages
Ijarcce 2019 81210
No ratings yet
Ijarcce 2019 81210
3 pages
(IJCST-V7I4P8) : Nitasha
No ratings yet
(IJCST-V7I4P8) : Nitasha
4 pages
Decision Support in Heart Disease Prediction System Using Naive Bayes
No ratings yet
Decision Support in Heart Disease Prediction System Using Naive Bayes
7 pages
Multiple Disease Prediction Using Different Machine Learning Algorithms Comparatively
No ratings yet
Multiple Disease Prediction Using Different Machine Learning Algorithms Comparatively
5 pages
Iot Based Health Monitoring System
100% (1)
Iot Based Health Monitoring System
12 pages
Heart Disease Prediction Using Machine Learning Techniques: Raparthi Yaswanth, Y. Md. Riyazuddin
No ratings yet
Heart Disease Prediction Using Machine Learning Techniques: Raparthi Yaswanth, Y. Md. Riyazuddin
5 pages
8.adverse Effects of Drugs
No ratings yet
8.adverse Effects of Drugs
11 pages
Thesis Task 1
No ratings yet
Thesis Task 1
4 pages
Cultural Concepts in DSM 5
No ratings yet
Cultural Concepts in DSM 5
1 page
Types of Admission Form For Patient
100% (1)
Types of Admission Form For Patient
7 pages
Post Retirement Hospital List 8172018114112AM
No ratings yet
Post Retirement Hospital List 8172018114112AM
330 pages
Webinar Perawatan Luka Dan Tata Laksana Pemeriksaan Infeksi
No ratings yet
Webinar Perawatan Luka Dan Tata Laksana Pemeriksaan Infeksi
41 pages
Mazhaume Eugan CV
No ratings yet
Mazhaume Eugan CV
5 pages
Urban Reform Brief - FHT
No ratings yet
Urban Reform Brief - FHT
2 pages
Research Project Submission Form: Type of Project: Intramural ( )
No ratings yet
Research Project Submission Form: Type of Project: Intramural ( )
22 pages
Framework For Maternal and Child Nursing
No ratings yet
Framework For Maternal and Child Nursing
3 pages
Virginia Henderson's Theory of Nursing
No ratings yet
Virginia Henderson's Theory of Nursing
25 pages
The National Academies Press
No ratings yet
The National Academies Press
639 pages
Essentials of Community Medicine A Practical Approach 2nd Edition by Lalita Hiremath, Dhananjaya Hiremath ISBN B0B4HD73T6 9789350250440
No ratings yet
Essentials of Community Medicine A Practical Approach 2nd Edition by Lalita Hiremath, Dhananjaya Hiremath ISBN B0B4HD73T6 9789350250440
53 pages
IVF Treatment in India A Comprehensive Guide PDF
No ratings yet
IVF Treatment in India A Comprehensive Guide PDF
7 pages
Risk Factors of Ventilator-Associated Pneumonia in Critically III Patients
No ratings yet
Risk Factors of Ventilator-Associated Pneumonia in Critically III Patients
7 pages
Diocese of Bayombong Educational System
No ratings yet
Diocese of Bayombong Educational System
7 pages
Primary Health Care
No ratings yet
Primary Health Care
126 pages
ICICI Lombard Group Health Insurance (UIN: ICIHLGP02001V030102)
No ratings yet
ICICI Lombard Group Health Insurance (UIN: ICIHLGP02001V030102)
8 pages
Nurs FPX 4040 Assessment 1 Nursing Informatics in Health Care
No ratings yet
Nurs FPX 4040 Assessment 1 Nursing Informatics in Health Care
5 pages
ADHDASDregisterstudie
No ratings yet
ADHDASDregisterstudie
7 pages
Adobe Scan 27 Oct 2023
No ratings yet
Adobe Scan 27 Oct 2023
14 pages
Health Technology Assessment (HTA) - Development
No ratings yet
Health Technology Assessment (HTA) - Development
7 pages
NP 567 Week 6 Study Worksheet
No ratings yet
NP 567 Week 6 Study Worksheet
5 pages
HONEST VANIA ASARI (1710311020) Profesi Dokter 2017 Fakultas Kedokteran Universitas Andalas
No ratings yet
HONEST VANIA ASARI (1710311020) Profesi Dokter 2017 Fakultas Kedokteran Universitas Andalas
13 pages
Unified Do Not Attempt Cardiopulmonary Resuscitation (Dnacpr)
No ratings yet
Unified Do Not Attempt Cardiopulmonary Resuscitation (Dnacpr)
2 pages
All Hospital List
No ratings yet
All Hospital List
6 pages
Activity 1: Multiple Choice: Mapeh 10 - Physical Education - Week 5&6 - 3Rdq
No ratings yet
Activity 1: Multiple Choice: Mapeh 10 - Physical Education - Week 5&6 - 3Rdq
4 pages
De Ocampo, Pinky Rose Mojica: Lpu - St. Cabrini College of Allied Medicine, Inc ODC Form 5
No ratings yet
De Ocampo, Pinky Rose Mojica: Lpu - St. Cabrini College of Allied Medicine, Inc ODC Form 5
1 page

Lung Disease Prediction Using K-Means Clustering and Naïve Bayes Algorithm

Uploaded by

Lung Disease Prediction Using K-Means Clustering and Naïve Bayes Algorithm

Uploaded by

Lung Disease Prediction Using K-Means Clustering and Naïve Bayes Algorithm

 To study different disease prediction algorithms and literature review.

 To design a system for lung disease prediction based on patient data.

 To implement a system using multiple algorithms for increased time-efficiency.

 K-means clustering and naïve Bayes techniques will be use.

 Naive Bayes algorithm will be use as a classification algorithm.

 [2] V.Krishnaiah, G.Narsimha, N.Subhash Chandra. 2013, “Diagnosis of Lung Cancer

 [5] T.Karthikeyan , P.Thangaraju, “PCA-NB Algorithm to Enhance the Predictive

 [7] S. S. Mohamed and M. M. A. Salama, “Computer-aided diagnosis for prostate cancer

 [9] S. Vijayarani and S. Sudha ,” An Efficient Clustering Algorithm for Predicting

 [11] Tanupriya Choudhury, Vivek Kumar ,“ Intelligent Classification & Clustering

You might also like