2024-Jayaraman-Critical Review On Water Quality Analysis Using IoT
2024-Jayaraman-Critical Review On Water Quality Analysis Using IoT
A R T I C L E I N F O A B S T R A C T
Keywords: Water quality and its management are the most precise concerns confronting humanity globally. This article
Water quality indices evaluates the various sensors used for water quality monitoring and focuses on the water quality index
Internet of Things considering the multiple physical, chemical, and biological parameters. A Review of Internet of Things (IoT)
Sensor technology
research for water quality monitoring and analysis, sensors used for water quality can help remote monitoring of
GIS
and Integrated Technology
the water quality parameters using various IoT-based sensors that convey the assembled estimations utilizing
Low-Power Wide Area Network innovations. Overall, the IoT system was 95 % accurate in measuring pH,
Turbidity, TDS, and Temperature, while the traditional method was only 85 % accurate. Also, this study
reviewed the different A.I. techniques used to assess water quality, including conventional machine learning
techniques, Support Vector Machines, Deep Neural Networks, and K-nearest neighbors. Compared to traditional
methods, machine learning and deep learning can significantly increase the accuracy of measurements of
groundwater quality. However, various variables, such as the caliber of the training data, the water quality
metrics’ complexity, and the monitoring frequency, will affect the accuracy. The geographical information
system (GIS) is used for spatial data analysis and managing water resources. The quality of its data is also
reviewed in the paper. Based on these analyses, the study has forecasted the future sensors, Geospatial Tech
nology, and machine learning techniques for water quality monitoring and analysis.
Introduction plays a vital role in assessing and monitoring the health and safety of
water sources. It involves evaluating various physical, chemical, and
Water is a highly significant need of every single living being. With biological parameters to determine whether the water quality meets the
an increase in the human population, the requirement for preserving required standards for its intended use, such as drinking purposes, rec
water assets is acquiring greater significance. Several parameters are reational activities, and ecosystem preservation. Traditional water
used for identifying water quality based on the samples’ water level, pH quality monitoring systems have relied on conventional laboratory-
level, turbidity, and salt content. The machine learning techniques are based methods in which water samples are collected from various re
combined with a GIS based on data availability and quality. Machine sources and transported to the laboratory for analysis. Water samples are
learning algorithms must account for spatial autocorrelation and tem collected manually from different points of interest, such as rivers, lakes,
poral dynamics to predict water quality. Incorporating spatial analysis and wells, and detailed studies are performed using specialized equip
techniques within GIS, such as geostatistics, can help capture these ment and techniques in laboratory conditions. The obtained data is
variations and improve the accuracy of predictions. By addressing these analyzed and interpreted to assess the water quality. It is compared
problems, robust machine learning and GIS-based frameworks can be against the established standards or guidelines to meet the criteria for its
determined for water quality index calculation, aiding in efficient water designated purpose. The limitations of traditional laboratory-based
management and decision-making processes. Water quality analysis water quality monitoring systems are that they are time-consuming
* Corresponding author.
E-mail address: [email protected] (P. Jayaraman).
https://doi.org/10.1016/j.jjimei.2023.100210
and costly, lack real-time monitoring, and are inaccessible to remote R2: What strategies can be figured out by associations to develop
locations. robust water quality forecasts using machine learning models that align
with sustainability objectives?
Data-driven water quality assessment The answer to the above questions is that the present study Focuses
on the intersection of IoT, A.I., and GIS in water quality expectations.
Water is a fundamental resource that has always supported different This review surveyed 67 articles from driving diaries in the Scopus data
human activities, including farming, industry, and homegrown use. set (Sharma et al., 2021). This strategic methodology permits us to
However, the nature of water assets is progressively compromised by introduce thorough bits of knowledge and exploratory discoveries.
anthropogenic activities, such as contamination from modern releases, Sections 2 and 3 focus on literature reviews and comparison of various
farming overflow, and urbanization. Checking and evaluating water Review methodologies. Section 4 illustrates the findings of the system
quality is essential for ensuring the supportability of water assets and atic literature review on GIS, and Section 5 discusses the machine
safeguarding human well-being. Traditional methods of water quality learning algorithms and multiple techniques implemented in various
assessment deal with manual sampling and laboratory analysis. It is a applications in the research area. Section 6 is the conclusion.
time-consuming process. (Uddin et al., 2021) have developed the traditional methods of WQI
The advent of data-driven technologies, including sensors, the models. Due to its generalized structure, the evaluation cites the con
Internet of Things (IoT), A.I., and Geographic information systems (GIS), version of substantial data on water quality into a single value or index
has changed the field of water quality evaluation. Information-driven as one of the valuable techniques. They divided WQI models into four
approaches offer a few benefits over conventional techniques, consecutive steps: selection of parameters of water sub-indices genera
including Real-time monitoring, High-resolution data, spatial and tem tion, parameters weighting values, and overall WQI calculation.
poral analysis of water quality, and predictive modeling. The data- Therefore, the large volume of water quality data is compressed into a
driven water quality assessment is essential for addressing the single index with a comparison between various traditional WQI Index
following challenges: Increasing water scarcity, pollution from nonpoint related to selecting parameters, sub-index procedure, weighting, ag
sources, and the impact of climate change. Data-driven water quality gregation techniques, and rating scale. (Nayak et al., 2020) analyzed the
assessments to ensure the availability of clean and safe water for future water quality status of Indian rivers.
generations. The framework of water quality data analysis proposed by (Bui et al., 2020) Assessed WQI and concluded that traditional WQI
WHO and BIS. calculations consumed more time and identified few errors by calcu
This literature review explores the potential of data-driven ap lating sub-indices. Numerous statistical and visual evaluation indicators
proaches in accurately predicting water quality index (WQI) using ma were used to evaluate the models. The machine learning algorithm
chine learning and GIS techniques. The study will focus on the following divided the data into training and testing sets, used hybrid algorithms,
key aspects: and estimated WQI Values. (Wu et al., 2017) to obtain WQI analyses, the
research work was carried out in Lake Poyang, China, where 24 samples
1. Machine learning algorithms: Identify and evaluate the most effec of water quality values were classified into three groups. This work
tive machine learning algorithms for WQI prediction. considered 20 different water quality parameters for analyzing Total
2. GIS integration: Explore the role of GIS in spatial analysis and visu Nitrogen (TN) and Total Phosphorus (TP), whereas hazardous metals
alization of WQI data. and other criteria were rated low in the WQI Analysis. (Wang et al.,
3. Data considerations: Address the challenges and opportunities 2017) Worked with a broad classification of different fractional devia
associated with data collection, preprocessing, and management for tion methods referred to difference, ratio, and normalized difference
data-driven WQI prediction. index. The obtained WQI values vary from 56.61 to 2886.51. The above
4. Performance evaluation: Discuss the metrics and methodologies for index was estimated using the slope of the curve and the root means
evaluating the performance of data-driven WQI prediction models. square error values.
5. Case studies: Analyze real-world case studies to demonstrate the (Lamare & Singh, 2016) Did the work on Numerus’s pollution index
practical application of data-driven WQI prediction in various water (NPI), the simplest WQI evaluation method known among all the WQI
resource management scenarios. methods. Their study evaluated the specific parameters for their po
tential contribution to water bodies. (Dohare et al., 2014) Examined
Review of analysis of water quality parameters and indexes water quality for Indore City, and they calculated WQI using around 27
parameters: pH, color, Total Dissolved Solids (TDS), calcium, chromium,
Various physio-chemical and biological parameters for measuring electrical conductivity (E.C.), total alkalinity, hardness, zinc, manga
and monitoring the Water quality index (WQI) exist worldwide. The nese, and nickel. The evaluated results are as per IS 10500–2012 spec
assessment procedure of WQI is analyzed and summarized in this sec ifications. They used statistical methods to analyze the physio-chemical
tion. Analyses of various water quality measurement parameters have parameters and water pollutants characterization.
been made using traditional methods that help to calibrate the physical (GG.S, 2013) A standard formula for WQI was used by fixing the
changes. (Wu et al., 2021) Carried out experimental work on the critical numeric Q-value and weightage factor. Despite fixing the weight factors
parameters that affect water quality. Fifteen physio-chemical parame for several parameters, they adopted the dissolved oxygen level, fecal
ters were evaluated to assess the WQI and categorized into three groups. coliform, B.O.D., pH, nitrate, Phosphorus, temperature, turbidity, and
These helped establish the results with minimum error and good accu TDS Using these conditions, they arrived at an equation to calculate WQI
racy. This detection benefits low-cost and fast-track water quality with missing parameters. (Nayar, 2020) Analyzed Physio-Chemical pa
evaluation in adjacent basins. Based on the statistical values, the rameters: temperature, Transparency, hardness, pH, sulfate, chloride, D.
spatially evaluated distribution pattern and different models of WQI are O., B.O.D., C.O.D., alkalinity nitrates phosphates, and water quality.
classified. Given the expansive implications of water quality prediction They classified parameters as (0–25) - Excellent quality, (26–50) - Good
utilizing a complex methodology, our review aims to address funda quality, (51–75) - poor quality, (76–100) Inferior quality, >100 un
mental research questions, making significant contributions to the field. suitable for drinking water.
The recognized research questions directing our examination are as per Finally, (Uddin et al., 2021) employed the Analytic Hierarchy Pro
the following (Kar & Varsha, 2023): cess (A.H.P.) to categorize the World Quality Index as exceptional
R1: How do integrating Internet of Things (IoT), AI, and Geographic (90–100), good (90–75), fair (75–50), marginal (50–25), and low
Data system (GIS) functionalities add to the headway of water quality (25–5). This proves to be a very efficient method for minimizing un
expectations? certainty, weighting procedures, and improving accuracy. Table 1
2
P. Jayaraman et al. International Journal of Information Management Data Insights 4 (2024) 100210
Table 1
Comparison of WQI Models weighted parameters used by various researchers.
Parameters Horton ( National Sanitation W.J.WQI ( Environmental quality Scottish Research Bascarron Index Said ( Oregon (
Jha et al., Foundation ( Uddin et al., index (Uddin et al., Development Department (BWQI) (Sun Uddin Swamee &
2015) Dawood et al., 2022) 2021; Liu 2021) Index (Sutadian et al., et al., 2016) et al., Tyagi, 2000)
et al., 2019) 2016) 2021)
presents details of the different kinds of WQI models and weightage used five contributing variables: lithology, drainage, land cover/land use,
by various researchers. Several authors have analyzed some parameters. lineaments, seepage, and slope. The outcomes showed that around 1.2%
A comparative analysis by different authors shows that pH analysis of of the review region was assigned as having great potential for
the Horton index shows better results than any other index method. The groundwater to re-energize (Yeh et al., 2016). The curve between the
Bascarron index shows better results for calculating turbidity, and other wavelength and reflectance of the WQI value for different sample sites is
remaining parameters showed better outcomes for Bascarron discussed. Analytical hierarchy-based images are used to identify suit
index-based weightage results. A cluster analysis of water quality spatial able solid waste dumps based on the assigned weights of the various
distribution given by various research studies is presented in Table 2. parameters. The map was prepared to classify the trustworthy dump site
as highly qualified, less fortunate, and unsuitable for the site selection.
Review methodology So, the multi-criteria decision-making analysis was done to identify the
location, presented in Fig. 2.
There are several proposals for a writing survey. It involves a four- Researchers and water resource professionals effectively integrate an
step approach for doing the writing survey, as depicted in the ensuing GIS and visualize the various groundwater data layers by employing a
parts. This technique helps us recognize and concentrate on topics and methodological approach for groundwater analysis using GIS This
future exploration headings. Fig. 1 shows the flowchart of the review approach enables a better understanding of the spatial patterns, trends,
methodology adopted for Water quality prediction using Sensors, IoT, A. and relationships between groundwater parameters and environmental
I., and GIS technologies. factors. The integration of GIS with hydrological modeling and decision
support systems facilitates informed decision-making, sustainable
groundwater management, and protection of valuable resources. The
Review of water quality analysis using GIS
methodology describes the data acquisition system using wired and
wireless sensors. Hydrological modeling tools simulate the groundwater
The WQI can be predicted by plotting the spatial characteristics of
flow and evaluate the water availability. Various data sets are pre
water in lakes (Wang et al., 2017). Some of the parameters were esti
processed, and model selection, training, and evaluation of multiple
mated using remote sensing monitoring Technology. Depending on the
metrics are predicted. Some examples where the contexts based on GIS
problem domain, the boxplot analysis provides an accurate visualization
are discussed for providing ideas about spatial and temporal mapping
prediction and finalizes the outlier detection of threshold values. Ma
along with the implementation of A.I. models are listed in Table 3.
chine learning techniques were applied based on the captured image
data sets from the Gravity Recovery and Climate Experiments (GRACE).
Review of sensors-based water quality monitoring systems
These helped with accurate water quality predictions and images of
groundwater distribution.
Continuous monitoring of water quality parameters can be measured
(Nampak et al., 2014) A GIS approach was utilized to incorporate
Table 2
Water Quality Parameters Utilized by Different Researchers.
Author Reference
Water parameters (El Bilali & Taleb, (Jha et al., (Dohare et al., (Ewaid et al., (Ewaid et al., (Chen et al., (Shakhari & Banerjee,
2020) 2015) 2014) 2019) 2018) 2020) 2019)
3
P. Jayaraman et al. International Journal of Information Management Data Insights 4 (2024) 100210
Fig. 1. Flowchart on review methodology using various approaches for water quality prediction (Kar & Varsha, 2023).
Fig. 2. Methodological approach for groundwater analysis using GIS (Arabameri et al., 2021).
using different portable and low-cost approaches. (Tuna et al., 2013) Prediction & Simulation Package). The lifetime of node and trans
Focused their work on the practical usage of mini boat probes loaded mission power was analyzed, along with variations in the transmission
along with the wireless sensor network monitoring system (WW.S.N.. power of the nodes between 0 dB and − 25 dB. Fig. 3 shows the wireless
IEEE 802.15.4 is a wireless interface system meant for continuous water sensor network-based monitoring system. A field test in the water
quality monitoring, and a few parameters, including DD.O. pH, EE.C. reservoir was done in Kirklareli Dam. A web-based online interface with
temperature, turbidity, and nitrate, to determine water quality, and remote monitoring IoT devices controlled the system. Researchers were
simulation studies were focused on monitoring the water quality of focused on the parameters such as D.O. pH, EE.C. temperature,
drinking water reservoirs. (Wu et al., 2021) analyzed the maximum turbidity, and nitrate. They concluded the simulation results using a
water quality parameters meant for better accuracy. (EE.L.Watershed web-based interface and the system’s dependence on parameters such as
et al., 2017) Analyzed a few parameters to get the nearest accurate transmission frequency, power, size of the packet node parameters, etc.
values. It is important to note that the sensors monitoring the water quality
The system integrated with GG.P.S. provides an online data acqui should be cost-effective and continuous. The micro sensors fabricated
sition system with two long-range radio modems for simultaneous data along with the channel interface should measure water quality param
transmission. The MATLAB-based simulation environment was also used eters such as pH and conductivity. The final results were compared with
for analyzing the water quality parameters with the source codes of MM. the commercial sensors in static and dynamic conditions. Fig. 3 shows
A.T.S.N.L.(MM.A.T.L.A.B. Wireless Sensor Node Platform Lifetime the resistance of the pH sensor measured in static conditions. The same
4
P. Jayaraman et al. International Journal of Information Management Data Insights 4 (2024) 100210
5
P. Jayaraman et al. International Journal of Information Management Data Insights 4 (2024) 100210
for various applications for measuring water quality play an essential to analyse the water quality metrics to make predictions. The final re
role. Table 4 details various authors’ sensors for measuring water quality sults demonstrated a good indication of the model’s anticipated and
and suitable hardware, software, and networking systems locations to actual values using multidimensional input data sets and assessment for
ensure efficient and accurate output. the future emerging water quality trend using LSTM deep neural net
works to forecast drinking water quality.
Water quality analysis using machine learning techniques (Shakhari & Banerjee, 2019) The authors used a multi-class classi
fication system with continuous data sets to regulate water quality. They
Employing IoT (Internet of Things) and machine learning for water efficiently analyzed data using an algorithm known as the decision tree
quality monitoring offers advanced capabilities for collecting, classification model to forecast the water quality. The research is
analyzing, and interpreting data. (Wang et al., 2017) proposed a ma employed in water quality monitoring equipment to classify the quality
chine learning technique employing WQI and fractional derivative of bottled drinking water, pipeline drinking water, and pond water taken
methods to get remote sensing spectral indices. A support vector at center stage. They compared the quality of two methods of classifi
Regression model was implemented to analyse the model’s accuracy. cation and regression. (Sharma et al., 2021) A supervised algorithm and
(Liu et al., 2019) Work on smart water quality analysis and the predic machine learning approach accomplished the water quality prediction.
tion was done using Long short-term memory (LSTM) time series neural This approach needed to be updated with drawbacks, such as lab time
network IoT Environment. They used training and test monitoring data consumption, cost, and statistical analysis. Analysis of four different
parameters, namely turbidity, temperature, pH, and TDS, shows that
polynomial regression with a level of 2, angle assistance, and learning
Table 4 rate of 0.1 outperformed other relapse computations by its proficiency in
Review of Hardware, sensor, and I.O.T. techniques used by the researchers. most accurate WQI anticipation.
Sl. Author Sensors and Parameter measured Accuracy (Lamare & Singh, 2016) did the prediction of water quality for irri
No Reference Hardware tools level gation purposes using six models of machine learning techniques
used
exhibited A.I. models as proficient methods for the precise view of the
1 (Kar & Smart water pH, D.O. conductivity, The mean nature of water systems. Robotized sensor implementation and machine
Varsha, quality sensor temperature, value of
learning models help researchers regulate water quality and address
2023) oxidation–reduction 69.01
potential, turbidity, Accuracy irrigation water quality issues. An automated sensor technology coupled
dissolved ions level:93.08 with K-NN and S.V.R. machine learning provides accurate results.
% (Ewaid et al., 2018) performed groundwater quality prediction using
2 (Wang Optical sensors, pH, D.O. Turbidity, Accuracy machine learning techniques, and the initial step involved in the
et al., 2017) Microelectronics Escherichia Coli, and level:87 %
application of regression approach to predict groundwater images and
Mechanical Residual Chlorine.
systems, and Bio- the process adopted are shown in Fig. 5. Gaussian Mixture Model was
Sensors used to produce minor errors, and the machine learning models were
3 ( Kapta 3000 AC4 Chlorine, – applied in the GRACE Dataset to predict future datasets in the form of
Azpilicueta sensor Spectro: Temperature,
pixels.
et al., 2022) laser pressure,
conductivity,
Data mining algorithms with the combination of standalone and
Turbidity, and color,
4 (Najah The system Metal toxic pollutants Accuracy
Ahmed combines Hg2+, Cu2+ and level:93.3
et al., 2019) computer image Zn2+ %
processing
technology with
biological sensors.
5 (Nayar, DD.O.and pH D.O. pH values Accuracy
2020) sensors and level:96.3
zhorrible water %
checker, WW.S.N.
GSM Technology
6 (EE.L. SCADA, internet in Collection, storage, SCADA
Watershed Raspberry Pi analysis, and Accuracy-
et al., 2017) visualization 94 %
7 (Tuna et al., Temperature pH, turbidity, –
2013) sensor, pH, temperature
Turbidity,
Ultrasonic sensor,
Arduino Uno,
water flow sensor,
Filter unit
8 (Demetillo Data logging pH, T TDS
et al., 2019) system with a 32- Data logger
bit AA.R.M.core Accuracy-
microcontroller 97 %
board based on the
Atmel SAM3 × 8E
ARM Cortex-M3
9 (Liu et al., Arduino and Node pH, Turbidity, E.C.
2019) MCU, PP.H. Temperature, Accuracy of
Sensors, Turbidity Dissolved Oxygen, sensors-
sensor, Dallas and E 91.2 %
Temperature
sensor, DD.O.
sensor, salinity
Fig. 5. Flow chart of an implementation of machine learning models (Hussein
sensor
et al., 2020).
6
P. Jayaraman et al. International Journal of Information Management Data Insights 4 (2024) 100210
hybrid algorithms models were evaluated. Faecal coliform (FF.C. and bagging (B.A. and Random tree (RR.T. outperformed the other models,
total solids (TT.S. are the two parameters considered and revealed. and they used a boxplot to predict the errors such as R2, RMSE. MAE, and
(Mancuso et al., 2021) used post-processing techniques produced by NSE. (GG.S, 2013) used an algorithm that works with big data analysis
time series as a forecasting method, and this time series with hierar with a reduced cost to improve the prediction efficiency. They observed
chical extracts showed the end Neural Network with an accurate fore D.T. R.F. and D.C.F. give better predictions than other algorithms.
cast. The Neural Network disaggregation model was created, tested, and (Venkatachalam & Ray, 2022) The trained requirement is necessary to
trained by different data sets. (Dilmi & Ladjal, 2021) used the Long Short train the algorithm to be more accurate. (Ahmed et al., 2019) They
Term Memory deep learning approach to classify the water quality analyzed the data using a supervised algorithm and considered the error
network of recurrent neurons and achieved an accuracy of 99.72% using and deviations for predicting WQI and WQC He concluded the gradient
models for time series prediction, with popularity in deep learning boosting algorithm is more efficient than other regression algorithms in
techniques. They used a multisensory monitoring system for the con estimating MAE. MSE. RMSE and R2. and the polynomial regression
struction of the database. The performance assessment was done using shows better results compared to others. In the present study, the au
AI techniques, LSTM and RNNs, and multi-class SVM. Different types of thors mainly focus on the WQI and WQC. parameters to predict the
analysis were out for data reduction. The technology used principal future performance and efficiency of different algorithms. This research
component analysis (P.C.A., linear discriminant analysis (LDA), and primarily focuses on comprehending the challenges related to the uti
independent component analysis (II.C.A..) lization and consequences of the emerging generation of AI-based sys
(Chen et al., 2020) Did a comparative analysis of water quality tems in decision-making processes. This work’s objective involved
prediction using machine learning with extensive data analysis?. The identifying the potential research opportunities specifically for Infor
comparison was made between 10 learning models for assessing the mation Systems (II.S.. Water quality monitoring can be done in real-time
quality performance using big data. (Herath & Mittal, 2022) (Yang et al., using data analytics. This can aid in the early detection of potential is
2019) The data sets collected are related to the parameters pH, D.O. C.O. sues so that they can be fixed before doing significant harm (Pasika &
D. Mn, and NH3 and validated by the model’s Decision tree (D.T., Gandla, 2020). For instance, data analytics can be used to monitor the
Random Forest(RR.F. and Deep cascade Forest(D.C.F.. The best models concentrations of contaminants like microorganisms, heavy metals, and
were identified based on the increased efficiency in predicting surface chemicals in water. These details can be used to pinpoint regions where
water quality. (Aldhyani et al., 2020) Carried out work on water quality there is a chance of poor water quality and to take preventative
prediction using AI Algorithms that includes nonlinear autoregressive measures.
neural network (N.A.R.N.E.T. and long short-term memory (L.S.T.M. (Herath & Mittal, 2022) Analyzed high-impact impacts AI algorithms
deep learning algorithm. Seven datasets using the L.S.T.M. algorithm are used for various smart city domains, and it analysed the future work
shown in Fig. 6. Table 5 analyses the various machine learning algo based on collaborating Artificial intelligence with Big data along with
rithms used in water quality analysis by multiple researchers. virtual reality, 3D printing, etc. This work concluded with effective al
Using an artificial neural network and regression plot, the compar gorithms such as ANN, RNN/LSTM, and SVM. Data analytics can be used
ison of SVM, K.N.N. Nave Bayes models were made, and the output from to forecast patterns in water quality in the future. Water quality man
test data was found to improve the accuracy in the prediction of WW.Q.I agers can use this information to prepare for possible issues and
(Uddin et al., 2021) The authors analyzed the WQI and WQC using implement preventative measures. Data analytics, for instance, can be
machine learning techniques and proved that the GARNET result used to forecast the effects of climate change on the standard of the
demonstration provides a better forecast than LSTM Following that water. (Kalini, 2017) Using this knowledge, measures can be created to
(Dohare et al., 2014) is the prediction of WQI based on hybrid data safeguard water quality in the face of climate change and geographical
mining algorithm. Based on the above techniques, their approach may analysis. Spatial data, such as where water bodies are located and where
consume time and lead to errors and deviations in sub-indices—Stand pollutants are distributed, can be analyzed using data analytics. This can
alone (4 algorithms) and hybrid data mining (12 algorithms). Hybrid assist in locating regions most vulnerable to poor water quality. Data
analytics, for instance, can be used to visualize how pollutants are
distributed throughout a watershed. The places that require pollution
control measures the most can be determined using this information.
Data analytics can be used to create machine learning models that can
find trends in data on water quality. These models can be used to fore
cast future water quality trends and spot potential issues. (Sn et al.,
2020) Machine learning algorithms, for instance, can be used to find
trends in water quality data that are connected to certain pollutants.
Using this data, early warning systems can be created to notify managers
of water quality and potential issues. These are just a handful of the
numerous ways that data analytics can support water quality moni
toring. Water quality managers can better identify trends and patterns in
water quality by employing data analytics. This information can be
employed to control water quality more effectively.
(Yang et al., 2019) Based on the support vector machine frequency
data, it can be forecasted in short-term high-frequency data. (Duan et al.,
2019) reviews the various advancements in AI algorithms to improve
their computation of storage and power. (Kalini, 2017) (Sn et al., 2020)
I.J.I.M.’s paper used the AI research tool to analyze the various data
mining methods.
Employing IoT and machine learning for water quality monitoring
offers additional capabilities for collecting, analyzing, and interpreting
data using data analytics.
a) IoT for Data collection: IoT devices like sensors and probes are
Fig. 6. Flow diagram for estimating efficient algorithm (Uddin et al., 2021). deployed in water bodies. The collected data is transmitted
7
P. Jayaraman et al. International Journal of Information Management Data Insights 4 (2024) 100210
Table 5
Analysis of Machine Learning Algorithms implemented for assessing water quality.
Sl. Pilot scale experiments Water quality parameters Machine learning techniques Range Achieved Preferred Reference
No. Proposed algorithm
1 1. Reclaimed water pH, Cl, Conductivity, C.O.D.1. Direct SVM SVM: (DA 97.22 % T.P.R.values PCC-SVM based (Xu et al.,
2. Portable water Temperature, Turbidity, C 2. PCC-SVM-based method 91.67 % FF.P.R.being 0) PCC- method 2019)
3. Single parameter method SVM: (DA> 97 %, TT.P.R. 93 %
and FF.P.R.= 0)
2 Water quality monitoring Water Temperature, pH, 1. L L.S.T.M.2. AA.R.I.M.A.3. SS.V. L.S.T.M. (Limb et. al,
station of Guazhou Water Dissolved Oxygen, C.O.D. R.Period length m = 10 2011)
Source of the Yangtze Conductivity, Turbidity, LSTM-0.0017
River NH3-N ARIMA-0.0126
SVR-0.0046
3 packaged drinking water, pH T.S.Reference Indicator based, MAE-0.17 Reference (Shakhari &
Pipeline drinking water, T Decision Tree, C4.5, .Logistic RME-0.20 Indicator-based Banerjee,
and pond water Regression Decision Tree 2019)
4 Evaluated quality of the pH, BB.O.D. NH+ 3 , Iron, WQImethod, DI, RI and NN.D.I. R2–0.92, RMSE-58.4, RPD-2.81 Particle swarm (Wang et al.,
Ebinur Lake Watershed Copper, Zinc, D.O. TTDS correlation coefficients, SVM model optimisation 2017)
surface water Cl− , Na, etc. (20 (PSO)-support
parameters) vector regression
model
5 A case study Tilesdit dam pH, EC, T, TU Mg, B, H 1. L.S.T.M.RNNs PCA 95.17 %. LDA LSTM RNN (Dilmi &
in Bouira (Algeria) 2. Multi-class SVMs 97.84 %, ICA 95.42 % Ladjal, 2021)
6 Rawal watershed, situated Temperature, turbidity, TDSsupervised prediction MAE =1.96, F1score=0.56 Regression (Ahmed
in Pakistan pH, T algorithms Recall=0.564 Algorithms et al., 2019)
7 GRACE ΔTWS image Full image prediction to Extreme gradient boosting, MAESVR Mean absolute error- support vector (Hussein
reconstruction determine groundwater multivariate linear regression, 2.70, RMSE SVR regression et al., 2020)
quality prediction random forests, multilayer Root Mean Square Error-5.72
perceptron, and support vector
regression
8 Four monitoring stations Ammoniacal Nitrogen, Adaptive Neuro-Fuzzy Inference pH, No. of neuron-18, RMSE- Neuro-Fuzzy (Tiwari et al.,
of Johor River. suspended solid (SS.S., pH System, Radial Basis Function 0.15 Maxerror%− 3.22, R2=0.9 Inference System 2018)
Neural Networks (WDT-ANFIS)
9 Monthly data from 2 water B.O.D. C.O.D. D.O. pH, Random forest, Random tree, R2 = 0.941, RMSE = 2.71, MAE BA-RT (Dohare
quality monitoring turbidity, electrical Reduced error pruning tree, = 1.87, NSE = 0.941, PBIAS= et al., 2014)
stations within the Talar conductivity, and Total Around 12 hybrid data-mining 0.500
catchment solids algorithms
10 Kaggle-Indian water D.O. pH SVM, K-NN, Naive Bayes nonlinear N.A.R.N.E.T. LL.S.T.M. SVM model (Uddin et al.,
quality data sets Conductivity, BB.O.D. autoregressive neural network N Accuracy-97.01 % Sensitivity- (WW.Q.C. 2021)
Nitrate, Fecal coliform, 99.23 % Specificity-97.78% NARNET model
and total coliform Precision-94.93 % (WQI)
R.N.A.R.N.E.T.96:17 %
R.L.S.T.M. 4:21 %
wirelessly to a central database or cloud platform, enabling remote contaminations in most encompassing water bodies like waterways,
monitoring and access to information. lakes, and streams, which have explicit quality guidelines. e. The decay
b) Data Integration and Management: The data management system of water quality impacts oceanic biological systems by reducing the
organizes, stores, and preprocesses the collected data. supply of safe new water for domestic use. Developing nations
c) Data Analysis using Machine Learning: Various algorithms in iden frequently go through quick financial expansion, and each improvement
tifying the abnormal water quality or potential pollution sources. It project has the possibility of producing adverse consequences on the
also identifies patterns of water quality data and develops models for climate. Recently, the abundant usage of manure has generally moved
predicting future water quality conditions (Almetwally et al., 2020). into groundwater streams. Moreover, the standards and specifications
d) Prediction models: By training the models on historical data, the vary for different applications/uses. For instance, the water system is
system can identify real-time notifications. neither saline nor contains harmful materials, and it can be moved to
e) Spatial models and statistical analysis: Data analytics are used to plants or soil and accordingly annihilate the environment. In wireless
identify the most affected areas in water quality, which is highly risk. sensor applications, Sondes furnished with sunlight-based chargers for
Statistical analysis tends to identify the patterns in water quality energy harvesting is updated in recent works. Sondes are outfitted with
areas. IEEE 802.15.4-based remote connection points and mounted on floats.
They structure a Remote Sensor Organization to speak with one another
By combining IoT and machine learning, water quality monitoring and send their estimations at standard stretches to a focal PC. through
systems can provide real-time insights, predictive capabilities, and WSN in terms of GIS implementation, the movement of water and
proactive management approaches, which ultimately support the pro contaminants may be tracked, the places at risk of pollution and it can be
tection of water resources. found using geospatial technologies. Sensor technologies can be used to
continuously gather data and measure water quality indicators at spe
Scope for water quality analysis using geospatial technology, cific places. Data on water quality can be analyzed using machine
sensor technology, and machine learning learning methods to spot trends and patterns. Additionally, machine
learning can be used to forecast changes in water quality, and this can be
One of the main challenges related to collecting contaminated water helpful in developing mitigation strategies and locations that are most
is identifying the threats, analyzing water quality, and producing likely to be impacted by pollution. Processes affecting water quality can
improved water resources management. Water is the most critical need be modeled using geospatial technology, sensor technology, and ma
of any living organism and essential for supporting the existence of all chine learning methods. We should understand how contaminants affect
animals and individuals. There are definite restrictions on water quality and develop plans to enhance water quality standards.
8
P. Jayaraman et al. International Journal of Information Management Data Insights 4 (2024) 100210
9
P. Jayaraman et al. International Journal of Information Management Data Insights 4 (2024) 100210
Fig. 10. Various Artificial Intelligence Methods to Analyze Water Quality Management.
prediction of water resources and their quality. Various modeling uses learning, and GIS technologies are encouraging. These technologies can
GRACE, Visual MODFLOW., and ArcGIS 10.2.2, in which the water be utilized to create precise water quality prediction models, according
zones and multiple parameters are identified and predicted. The effec to several studies conducted in the past. WQI of the Tigris River in Iraq
tive and practical system for evaluating the nature of groundwater and was predicted using an artificial neural network. This model’s R2 and
its advancement in the utilization of physicochemical boundaries are RMSE values are 0.91 and 0.27, respectively, in the traditional methods
appropriately upheld by GIS methods, multivariate displaying, and of machine learning, and it is. 0.94 and 0.15 in the random forest model.
machine learning. Analytical Hierarchy Process is a central geological to forecast WQI most of the models achieved an R2 value of 0.93 and
technique for identifying ground-level elevation levels, and they per RMSE value of 0.17.
formed better boundary conditions and calibration levels. Using these
models, we could ensure support for the sustainable management of CRediT authorship contribution statement
water resources. Overall, numerical outcomes of predicting the water
quality using WQI sensor technology, IoT, machine learning, deep Poornima Jayaraman: . Kothalam Krishnan Nagarajan:
10
P. Jayaraman et al. International Journal of Information Management Data Insights 4 (2024) 100210
11
P. Jayaraman et al. International Journal of Information Management Data Insights 4 (2024) 100210
Jha, D. K., Devi, M. P., Vidyalakshmi, R., Brindha, B., Vinithkumar, N. V., & Journal of Information Management Data Insights, 1, Article 100050. https://doi.org/
Kirubagaran, R. (2015). Water quality assessment using water quality index and 10.1016/j.jjimei.2021.100050
geographical information system methods in the coastal waters of Andaman Sea, Sn, J., Muramthookil Thomas, G., Rohith Raj, R. V., Masetti, A., Tammana, A.,
India. Marine Pollution Bulletin, 100, 555–561. https://doi.org/10.1016/j. Motheram, M., & Gutlapalli, N. C. (2020). Assessment of water quality Index and
marpolbul.2015.08.032 study of the impact of pollution on the rivers of Kerala. Materials Today: Proceedings,
Kalini, Z., International Journal of Information Management A SEM-neural network 43, 3447–3451. https://doi.org/10.1016/j.matpr.2020.09.084
approach for predicting antecedents of m-commerce acceptance, 37 (2017) 14–24. Sun, W., Xia, C., Xu, M., Guo, J., & Sun, G. (2016). Application of modified water quality
https://doi.org/10.1016/j.ijinfomgt.2016.10.008. indices as indicators to assess the spatial and temporal trends of water quality in the
Kar, A. K., & Varsha, P. S. (2023). A review and research agenda. International Journal of Dongjiang River. Ecological Indicators, 66, 306–312. https://doi.org/10.1016/j.
Information Management Data Insights. , Article 100176. https://doi.org/10.1016/j. ecolind.2016.01.054
jjimei.2023.100176 Sutadian, A. D., Muttil, N., Yilmaz, A. G., & Perera, B. J. C. (2016). Development of river
Kothari, N., Shreemali, J., Chakrabarti, P., & Poddar, S. (2021). Design and water quality indices—A review. Environmental Monitoring and Assessment, 188,
implementation of IoT sensor based drinking water quality measurement system. 1–29. https://doi.org/10.1007/s10661-015-5050-0
Materials Today: Proceedings. https://doi.org/10.1016/j.matpr.2020.12.1142 Swamee, P. K., & Tyagi, A. (2000). Describing water quality with aggregate index.
Lamare, R.E., Singh, O.P., Application of Ccme water quality index in evaluating the Journal of Environmental Engineering, 126, 451–455. https://doi.org/10.1061/(ASCE)
water quality status in limestone mining area of, 10 (2016) 149–154. 0733-9372(2000)126:5(451)
Liu, P., Wang, J., Sangaiah, A., Xie, Y., & Yin, X. (2019). Analysis and prediction of water Thinsungnoen, M., An integrated system of geographic information and water quality :
quality using LL.S.T.M.Deep neural networks in IoT environment. Sustainability, 11, Lamtakong river, 10 (2019) 152–159. https://doi.org/10.12720/jait.10.4.152-159.
2058. https://doi.org/10.3390/su11072058 Tiwari, A. K., Singh, A. K., & Mahato, M. K. (2018). Assessment of groundwater quality of
Lumb, A., Sharma, T. C., & Bibeault, J.-F. (2011). A review of genesis and evolution of Pratapgarh district in India for suitability of drinking purpose using water quality
water quality index (WWQI and some future directions. Water Quality, Exposure and index (WWQI and GGIStechnique. Sustainable Water Resources Management, 4,
Health: Purpose and Goals, 3, 11–24. https://doi.org/10.1007/s12403-011-0040-0 601–616. https://doi.org/10.1007/s40899-017-0144-1
Mancuso, P., Piccialli, V., & Sudoso, A. M. (2021). A machine learning approach for Tuna, G., Arkoc, O., & Gulez, K. (2013). Continuous monitoring of water quality using
forecasting hierarchical time series. Expert Systems with Applications, 182, Article portable and low-cost approaches. International Journal of Distributed Sensor
115102. https://doi.org/10.1016/j.eswa.2021.115102 Networks, 2013. https://doi.org/10.1155/2013/249598
Najah Ahmed, A., Binti Othman, F., Abdulmohsin Afan, H., Khaleel Ibrahim, R., Ming Uddin, M. G., Nash, S., & Olbert, A. I. (2021). A review of water quality index models and
Fai, C., Shabbir Hossain, M., Ehteram, M., & Elshafie, A. (2019). Machine learning their use for assessing surface water quality. Ecological Indicators, 122, Article
methods for better water quality prediction. Journal of Hydrology, 578. https://doi. 107218. https://doi.org/10.1016/j.ecolind.2020.107218
org/10.1016/j.jhydrol.2019.124084 Venkatachalam, P., & Ray, S. (2022). How do context-aware artificial intelligence
Nampak, H., Pradhan, B., & Manap, M. A. (2014). Application of GGISbased data driven algorithms used in fitness recommender systems ? A literature review and research
evidential belief function model to predict groundwater potential zonation. Journal agenda. International Journal of Information Management Data Insights, 2, Article
of Hydrology, 513, 283–300. https://doi.org/10.1016/j.jhydrol.2014.02.053 100139. https://doi.org/10.1016/j.jjimei.2022.100139
Nayak, J. G., Patil, L. G., & Patki, V. K. (2020). Development of water quality index for Wang, X., Zhang, F., & Ding, J. (2017). Evaluation of water quality based on a machine
Godavari River (India) based on fuzzy inference system. Groundwater for Sustainable learning algorithm and water quality index for the Ebinur Lake Watershed, China.
Development, 10, Article 100350. https://doi.org/10.1016/j.gsd.2020.100350 Scientific Reports, 7, 1–18. https://doi.org/10.1038/s41598-017-12853-y
Nayar, R., Dr. (2020). Assessment of water quality index and monitoring of pollutants by Wu, Z., Zhang, D., Cai, Y., Wang, X., Zhang, L., & Chen, Y. (2017). Water quality
physico-chemical analysis in water bodies: A review. International Journal of assessment based on the water quality index method in Lake Poyang: The largest
Engineering Research, 9, 178–185. https://doi.org/10.17577/ijertv9is010046 freshwater lake in China. Scientific Reports, 7, 1–10. https://doi.org/10.1038/
Pasika, S., & Gandla, S. T. (2020). Smart water quality monitoring system with cost- s41598-017-18285-y
effective using IoT. Heliyon, 6, e04096. https://doi.org/10.1016/j.heliyon.2020. Wu, Z., Lai, X., & Li, K. (2021). Water quality assessment of rivers in Lake Chaohu Basin
e04096 (China) using water quality index. Ecological Indicators, 121, Article 107021. https://
Roseela, J. A., Godhavari, T., Narayanan, R. M., & Madhuri, P. L. (2020). Design and doi.org/10.1016/j.ecolind.2020.107021
deployment of IoT based underwater wireless communication system using Xu, X., Liu, Y., Liu, S., Li, J., Guo, G., & Smith, K. (2019). Real-time detection of potable-
electronic sensors and materials. Materials Today: Proceedings, 45, 6229–6233. reclaimed water pipe cross-connection events by conventional water quality sensors
https://doi.org/10.1016/j.matpr.2020.10.586 using machine learning methods. Journal of Environmental Management, 238,
Sanya, WW.M., Alawi, MM.A., Eugenio, I., Design and development of smart water 201–209. https://doi.org/10.1016/j.jenvman.2019.02.110
quality monitoring system using IoT design and development of smart water quality Yang, R., Yu, L., Zhao, Y., Yu, H., Xu, G., Wu, Y., & Liu, Z. (2019). International Journal
monitoring system using IoT, (2022). https://doi.org/10.31695/IJASRE.2022.8.3.1. of Information Management Big data analytics for fi nancial Market volatility
Shakhari, S., & Banerjee, I. (2019). A multi-class classification system for continuous forecast based on support vector machine ⋆. International Journal of Information
water quality monitoring. Heliyon, 5, e01822. https://doi.org/10.1016/j. Management. https://doi.org/10.1016/j.ijinfomgt.2019.05.027, 0–1.
heliyon.2019.e01822 Yeh, H. F., Cheng, Y. S., Lin, H. I., & Lee, C. H. (2016). Mapping groundwater recharge
Sharma, R., Kumar, A., & Chuah, C. (2021). International Journal of Information potential zone using a GGISapproach in Hualian River, Taiwan. Sustainable
Management Data Insights Turning the blackbox into a glassbox : An explainable Environment Research, 26, 33–43. https://doi.org/10.1016/j.serj.2015.09.005
machine learning approach for understanding hospitality customer. International
12