Multiple Disease Prediction Using ML and Doctor Recommendation by Sentiment Analysis
Multiple Disease Prediction Using ML and Doctor Recommendation by Sentiment Analysis
Abstract - For the purpose of preventing and treating occurring diseases in the early phase as when they are not
illness, accurate and prompt examination of any health- checked or examined they can turn into a disease and
related issue is crucial. A dangerous illness might not be more dangerous diseases can even cause death. This
properly diagnosed using the conventional methods. The system will predict the most possible disease based on the
development of a machine learning (ML)-based medical
2023 6th International Conference on Contemporary Computing and Informatics (IC3I) | 979-8-3503-0448-0/23/$31.00 ©2023 IEEE | DOI: 10.1109/IC3I59117.2023.10397715
The field of machine learning uses historical data to Disease Risk Prediction was advised by Sayali
make predictions. The concept of a computer system Ambekar et al., [2] who carried out the task using a
known as "machine learning" refers to how a machine convolution neural network. Machine learning methods
learning model learns from data and experience. including the CNN-UDRP algorithm, Naive Bayes, and
KNN algorithm are employed in this research. The system
There are two stages to the machine learning employs structured data to be trained, and Naive Bayes is
algorithm: 1) Testing and 2) Training. used to attain an accuracy of 82%.
In order to improve processes and give patients better Using a fuzzy approach, Naganna Chetty et al. [3]
care, the healthcare industry also uses machine learning. built a system that provides better outcomes for disease
The disease prediction system forecasts illnesses based on prediction.He used KNN classifier, fuzzy c-means
the patient's symptoms and by using sentiment analysis clustering, and fuzzy KNN classifier approaches. The
recommended doctors for predicted ailment. accuracy of the diabetic disease and liver disorder
predictions in this research is 97.02% and 96.13
Sentiment analysis has developed into a potent tool respectively.
for tracking and comprehending online reviews as
people express their opinions and feelings about The focus of the Senthilkumar Mohan et al study [4] was
something more honestly than ever before. So, by hybrid machine learning approaches, which employ
applying sentiment analysis on the doctor reviews by algorithms such as Decision Tree, Support Vector
patients gives the best doctor for particular disease along Machine, Random Forest, Naive Bayes, Neural Network,
with other factors. and KNN to effectively anticipate cardiac illness. This
system's accuracy rating is 88.47%.
The systems that are now on the market are either
specifically designed to treat a specific condition, are in Data mining for the prediction of diabetic disease was
development, or are being researched to provide a solution examined by Deeraj Shetty et al. [5] using Naive Bayes
to the problem of generalized disease. The main motive of and KNN algorithms. The accuracy of this system's
the proposed system is the prediction of the commonly diabetes prediction is higher than Naive Bayes thanks to
1469
979-8-3503-0448-0/22/$31.00 ©2023 IEEE
Authorized licensed use limited to: Don Bosco Institute of Technology-Bengaluru. Downloaded on December 19,2024 at 06:51:52 UTC from IEEE Xplore. Restrictions apply.
2023 6th International Conference on Contemporary Computing and Informatics (IC3I)
Ankita Dewan et al[9] .'s recommendation Using a 4-step method, Hu and Liu [16] retrieved
of prediction system of disease uses a hybrid data mining opinions of features from customer reviews. To detect
classification technique. This system employs methods features, this algorithm uses association rule mining. It
like Naive Bayes, Decision Trees, and Neural Networks. then prunes irrelevant and redundant features, finds rare
This system has an accuracy rate of 87%. features, and lastly determines the semantic orientation of
each opinion sentence.
Anjan Nikhil Repaka et al. [10] used naive Bayesian to
construct and implement a prediction model for heart Several manually created criteria were implemented
disease. Any user may access the forecast results by by Agarwal et al. [17] in order to derive dependency tree
utilizing this method with any smartphone device. This patterns from phrases. Combining this data with the
method has an accuracy of 89.77%. semantic information from the Massachusetts Institute of
Technology Media Lab ConceptNet ontology, they trained
Gao et al. [11] examined patterns in physician evaluations a machine learning model to identify concept patterns in
over time to determine the factors influencing Web-based the text by using the concepts that were extracted. This
ratings. They found that, in general, evaluations were allowed them to classify documents into positive and
positive and that doctors who had been practicing for a negative categories.
long period, such as obstetricians or gynecologists, were Text from electronic medical records was used by Lix
more likely to obtain reviews than other medical et al. [18] to apply an SVM classifier to identify patients
specialties. They also found that clinicians without who had taken alcohol. A bag-of-words model was used
malpractice claims, board-certified physicians, highly to represent unigrams and bigrams in these data.
respected medical school grads, and recent graduates all
had better ratings. YeongWai Chung [19] Conduct research on huge
data and provide sentiment analysis problems. Twitter
Jiugang Li et al. [12] developed a hashtag produces 175 million tweets every day on average. 1
recommender system using the skip-gram model and Zettabyte of data has already been produced by the globe.
convolutional neural networks (CNN) to learn semantic
phrase vectors, taking into account the importance of III. CURRENT ISSUES
hashtags in sentiment analysis. These vectors employ
LSTM RNN to classify hashtags based on the features. Traditional diagnostic techniques involve physically
Results show that this model outperforms more widely assessing patients to measure things like body
used models like SVM and Standard RNN. This temperature, pulse rate, heartbeat, blood pressure, and
investigation is based on the fact that it was subjected to asking about their medical history. There are various
standard AI approaches like SVM and collaborative limitations of these techniques of diagnosis –
filtering; the semantic features are lost, which has a
significant impact on obtaining a reasonable expectation. ● It doesn't reveal the reason of the patient's illness.
● Prolonged procedure of diagnosis.
Jain, Kumar, and Mahanti [13] found that sentiment ● Don't give enough information on the illness.
1470
Authorized licensed use limited to: Don Bosco Institute of Technology-Bengaluru. Downloaded on December 19,2024 at 06:51:52 UTC from IEEE Xplore. Restrictions apply.
2023 6th International Conference on Contemporary Computing and Informatics (IC3I)
● Increase the risk of developing a disease in the sarcasm, irony, humour, which a person has minimal
future trouble recognising.
In reality, sentiment analysis is a difficult task even for
Correct doctor for specific disease is mandatory for humans, so sentiment analysis classifiers might not be as
curing of the disease. Due to lack of knowledge about the accurate as other classifiers.
doctor, people may be misdiagnosis or taking more than
enough time to recover. So, recommendation of proper
doctor is must. VI. FUNDING
1471
Authorized licensed use limited to: Don Bosco Institute of Technology-Bengaluru. Downloaded on December 19,2024 at 06:51:52 UTC from IEEE Xplore. Restrictions apply.
2023 6th International Conference on Contemporary Computing and Informatics (IC3I)
[16] Yadav, N., Banerjee, K., & Bali, V. (2020). A survey on fatigue
detection of workers using machine learning. International Journal
of E-Health and Medical Communications (IJEHMC), 11(3), 1-8
[17] Banerjee, K., & Bali, V. (2020). Design and Development of
Bioinformatics Feature Based DNA Sequence Data Compression
Algorithm. EAI Endorsed Trans. Pervasive Health Technol., 5(20),
e5.
[18] Banerjee, K., Kumar, M. S., & Tilak, L. N. (2021). Delineation of
potential groundwater zones using Analytical hierarchy process
(AHP) for Gautham Buddh Nagar District, Uttar Pradesh,
India. Materials Today: Proceedings, 44, 4976-4983.
[19] Banerjee K, Bali V, Nawaz N, Bali S, Mathur S, Mishra RK, Rani
S. A Machine-Learning Approach for Prediction of Water
Contamination Using Latitude, Longitude, and Elevation. Water.
2022; 14(5):728. https://doi.org/10.3390/w14050728
[20] Banerjee K., Santhosh Kumar M.B., Tilak L.N., Vashistha S.
(2021) Analysis of Groundwater Quality Using GIS-Based Water
Quality Index in Noida, Gautam Buddh Nagar, Uttar Pradesh (UP),
India. In: Choudhary A., Agrawal A.P., Logeswaran R., Unhelkar
B. (eds) Applications of Artificial Intelligence and Machine
Learning. Lecture Notes in Electrical Engineering, vol 778.
Springer, Singapore.
[21] Sharma, T., Banerjee, K., Mathur, S., & Bali, V. (2020). Stress
analysis using machine learning techniques. International Journal
of Advanced Science and Technology, 29(3), 14654-14665.
[22] K. Banerjee and R. A. Prasad, "Reference based inter
chromosomal similarity based DNA sequence compression
algorithm," 2017 International Conference on Computing,
Communication and Automation (ICCCA), 2017, pp. 234-238,
doi: 10.1109/CCAA.2017.8229806.
[23] Banerjee, K., & Prasad, R. A. (2014, October). A new technique in
reference based DNA sequence compression algorithm: Enabling
partial decompression. In AIP Conference Proceedings (Vol. 1618,
No. 1, pp. 799-802). American Institute of Physics.
[24] K. Banerjee et al., "A review on Artificial Intelligence based Sign
Language Recognition Techniques," 2022 5th International
Conference on Contemporary Computing and Informatics (IC3I),
Uttar Pradesh, India, 2022, pp. 2195-2201, doi:
10.1109/IC3I56241.2022.10073000.
[25] K. Banerjee et al., "Prediction of Criminal Behavior Based on
Genetic Data - A Review," 2022 5th International Conference on
Contemporary Computing and Informatics (IC3I), Uttar Pradesh,
India, 2022, pp. 2206-2210, doi:
10.1109/IC3I56241.2022.10073062.
[26] K. Banerjee et al., "Assessing Water Quality Index Near Industrial
Regions and Aiding in Effective Water Management and
Controlling Water Pollution Level," 2022 5th International
Conference on Contemporary Computing and Informatics (IC3I),
Uttar Pradesh, India, 2022, pp. 1987-1991, doi:
10.1109/IC3I56241.2022.10073296.
1472
Authorized licensed use limited to: Don Bosco Institute of Technology-Bengaluru. Downloaded on December 19,2024 at 06:51:52 UTC from IEEE Xplore. Restrictions apply.