Q-Learning-Based Data-Aggregation-Aware Energy-Efficient Routing Protocol For Wireless Sensor Networks
Q-Learning-Based Data-Aggregation-Aware Energy-Efficient Routing Protocol For Wireless Sensor Networks
Q-Learning-Based Data-Aggregation-Aware
Energy-Efficient Routing Protocol for
Wireless Sensor Networks
WAN-KYU YUN AND SANG-JO YOO , (Member, IEEE)
Department of Information and Communication Engineering, Inha University, Incheon 402-751, South Korea
Corresponding author: Sang-Jo Yoo ([email protected])
This work was supported by the Inha University Research Grant.
ABSTRACT The energy consumption of the routing protocol can affect the lifetime of a wireless sensor
network (WSN) because tiny sensor nodes are usually difficult to recharge after they are deployed. Generally,
to save energy, data aggregation is used to minimize and/or eliminate data redundancy at each node and
reduce the amount of the overall data transmitted in a WSN. Furthermore, energy-efficient routing is widely
used to determine the optimal path from the source to the destination, while avoiding the energy-short
nodes, to save energy for relaying the sensed data. In most conventional approaches, data aggregation and
routing path selection are considered separately. In this study, we consider the degrees of the possible
data aggregation of neighbor nodes when a node needs to determine the routing path. We propose a
novel Q-learning-based data-aggregation-aware energy-efficient routing algorithm. The proposed algorithm
uses reinforcement learning to maximize the rewards, defined in terms of the efficiency of the sensor-
type-dependent data aggregation, communication energy and node residual energy, at each sensor node
to obtain an optimal path. We used sensor-type-dependent aggregation rewards. Finally, we performed
simulations to evaluate the performance of the proposed routing method and compared it with that of
the conventional energy-aware routing algorithms. Our results indicate that the proposed protocol can
successfully reduce the amount of data and extend the lifetime of the WSN.
INDEX TERMS Wireless sensor networks, routing, data aggregation, Q-learning, network lifetime.
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
VOLUME 9, 2021 10737
W.-K. Yun, S.-J. Yoo: Q-DAEER Protocol for WSNs
power drain rate is also considered to avoid path disconnec- the optimum path that can maximize the rewards by consid-
tion and network partition. These measures can prolong the ering the sensor-type-dependent data aggregation level of the
network lifetime because energy is dissipated more equally neighbor node, the residual energy, communication cost with
among all nodes [4], [5]. distance and hop count to the sink. In this way, the sensor
Because the data being collected by multiple sensors in a nodes can determine the optimum next hop node using their
given area are based on common phenomena, there is likely updated Q-values based on the rewards.
to be some redundancy in the source data. Data aggregation This article is organized as follows: In Section II, we review
as a form of ‘‘in-network-processing’’ in WSNs is widely the existing energy-aware routing protocols for the WSN.
used to collect data in an energy-efficient manner by elimi- In Section III, we present our proposed system model for
nating redundancy and minimizing the number of transmis- WSN routing. In Section IV, we discuss Q-DAEER algo-
sions or data size. In many WSN applications, the actual rithm. We present the simulation results in Section V and
measured raw data at each sensor node may not need to be conclude this article in Section VI.
delivered in the exact same form to the sink. The raw data
can be abstracted or compressed in networks. Depending on II. RELATED WORK
the monitoring purposes of applications, various aggregation Routing is essential in WSNs to support reliable data transfer,
techniques can be used, such as abstracting as {mean, vari- achieve low latency and provide energy-efficient operation.
ance}, maximum value, minimum value, lossy compression, Wireless communications consume significant amount of
feature domain reduction and data prediction. The efficiency power for transmitting sensed data from sensor nodes to
of data aggregation increases when the correlation among the sink nodes. However, the power consumption has become
data collected by various sensors is high [6], [7]. a limiting factor because most sensor nodes are powered
Various machine learning technologies have been used to by batteries. Sensor nodes used in wireless networks have
effectively capture the dynamic features such as node topol- limited computational capability and cannot have full infor-
ogy changes, restricted energy conditions, event detection mation about networks so that it is very difficult for nodes
and communication costs of WSNs for their energy-efficient to calculate the optimum route to the destination quickly.
operation. Among them, reinforcement learning (RL) is par- Even when a node is able to obtain the optimum routing path,
ticularly suitable for problems that include a long- versus the path may not remain optimum over time owing to various
short-term reward trade-off. It provides a framework for a types of changes in the sensing environment, for example,
system to learn from its previous interactions with its envi- the node movement, instable wireless channel condition and
ronment and to select its actions efficiently in the future. dynamic energy status of sensor nodes. Conventional ad hoc
RL-based routing protocols can determine the optimal path routing protocols can be classified into proactive and reactive
as an adaptive method for complex network conditions and protocols [15]. In proactive routing, routes are computed even
quality of service requirements [8]–[10]. when they are not needed and stored in a routing table at
Most previous studies on energy-efficient routing path every node. Therefore, the routing table maintenance over-
selection typically consider communication energy with hop head is large and limits the scalability of this routing pro-
counts and the distance to the sink node to reduce the overall tocol. In reactive routing, routes are computed only when
network-wide energy consumption and/or residual energy they are needed, and sensor nodes store routes only for their
level at each sensor node to distribute the energy burden neighbors. However, this protocol may increase latency for
equally. However, distributing the possible routes to reduce sensed data delivery. To overcome these problems, many
the overhead of specific sensor nodes may conflict with the studies on finding the optimum routing path with low energy
objective of minimizing the network-wide energy consump- consumption are underway.
tion. Notably, the optimization goals do not consider the Mohemed et al. [16] addressed the hole problem in WSNs
possibility of data aggregation through the path. Furthermore, using two distributed, energy-efficient and connectivity-
data aggregation and routing path selection are considered aware routing protocols. They used two different proto-
separately in conventional approaches [11]–[14]. cols in local and global environments. This technique can
In this article, we propose an RL-based energy-aware rout- decrease the overhead of topology reformation and pro-
ing algorithm for obtaining a global optimum path to mini- long the network lifetime. Razaque et al. [17] presented the
mize the overall energy consumption and prolong the lifetime combined protocol of low-energy adaptive-clustering hierar-
of the WSN. We define the degrees of the possible data aggre- chy (LEACH) and power-efficient gathering in sensor infor-
gation of neighbor nodes when a node needs to determine mation systems (PEGASIS), named P-LEACH. This protocol
the routing path. Because data from various sensor types can improve the performance by considering the limitation
(e.g., vibration measurement sensor and temperature sen- of cluster-based routing in LEACH and static routing in
sor) may not show strong correlation, they cannot be aggre- PEGASIS. Khan et al. [18] addressed the problem of sensor
gated together. Therefore, we define sensor-type-dependent node movement in wireless body area sensor networks using
aggregation rewards. We propose a novel Q-learning-based a dynamic routing algorithm. Owing to diverse activities of
data-aggregation-aware energy-efficient routing (Q-DAEER) humans, the positions of sensor nodes on the human body
algorithm, in which each sensor node reinforces to determine change every second. Therefore, packet and energy losses
occur during transmission when nodes use the static rout- communication distance and energy information of nodes
ing algorithm. The authors solved this problem using the to use the energy-efficient clusters to minimize packet loss.
information of the residual energies of nodes, hop count to Guo et al. [27] proposed an energy-efficient routing protocol
sink distance and throughput when nodes select the next hop based on a reinforcement learning algorithm. The nodes
node to forward data. Baker et al. [19] applied the GreeDi were reinforced to calculate the optimal routing path using
routing protocol to the ad hoc on-demand distance vector a reward policy to maximize the energy efficiency and life-
(AODV) in vehicular ad hoc networks (VANET), named time of the network. Wang et al. [28] used the ant colony
GreeAODV, to achieve an energy-efficient routing protocol optimization (ACO) algorithm to address the mobile sink
in the next hop selection. They modeled city map-based wireless sensor network routing protocol. They proposed
VANET scenarios and demonstrated that the proposed algo- an improved ACO algorithm that considered not only the
rithm was better than the original AODV. Oubbati et al. [20] time and energy but also the distance between the selected
proposed an energy-efficient routing protocol, named energy cluster head (CH) and a mobile sink to calculate the optimum
connectivity-aware data delivery, in the flying ad hoc net- mobility trajectory.
work. They ensured the connectivity of the proposed routing El Alami and Najid [29] proposed the LEACH-based fuzzy
protocol by using the information on unmanned aerial vehi- cluster head selection algorithm. They determine the chance
cles (UAVs), such as their speed and location, to minimize the value using the membership function that consists of residual
packet loss caused by the movement of UAVs. energy, expected efficiency and the closeness to base station.
There are some studies on maximizing data aggrega- The nodes which have the higher chance value are selected
tion and network lifetime. Oubbati et al. [21] addressed as CHs in a round. Lee and Teng [30] improve the LEACH
the trade-off between efficient data aggregation and total algorithm using fuzzy logic in mobile sensor network. The
link cost minimization. They used a comprehensive weight, change of location of the nodes in network causes packet
named weighted data aggregation routing strategy, for solv- losses so they use the membership function that is made
ing the trade-off. By overlapping the paths of the nodes of residual energy, the movement speed and pause time of
in a cluster-based WSN, they maximized the efficiency nodes. By the membership function, the chance values of all
of data aggregation and prolonged the network lifetime. nodes to elect the CH nodes are calculated. El Alami and
Ardakani et al. [22] presented a data-aggregation-aware Najid [31] proposed an enhanced clustering hierarchy (ECH)
efficient-routing algorithm in which the mobile agent approach to achieve energy efficiency in WSNs by using
received data from sensor nodes and aggregated and trans- sleeping-waking mechanism for overlapping and neighboring
mitted the data to the sink. They solved the delay and packet nodes. Thus, the data redundancy is minimized and then net-
loss in routing protocols using the movement scheme of work lifetime is maximized. Sert and Yazıcı [32] proposed the
the mobile agents. Haseeb et al. [23] addressed the secu- modified clonal selection algorithm (CLONALG-M) applied
rity issues in applying the conventional routing algorithm to to determine the approximate form of the output membership
a large-area Internet of things. They proposed light-weight functions to improve the performance of rule-based fuzzy
structure-based data aggregation routing, which is a secure routing. Fuzzy approach is superior to well-defined method-
protocol that uses in-route data aggregation for routing data ologies, especially where boundaries between clusters are
in the conventional routing protocols. Yazici et al. [24] pre- unclear. They derived the optimal solution by using the initial
sented a fusion-based framework to reduce the amount of membership function and iterative experiment.
data to be transmitted over the wireless multimedia sensor Some studies have focused on data aggregation-based
network by intra-node processing. They designed a sensor energy efficient routing in WSNs. Sensing data routing in
node to detect objects using machine learning techniques network aggregation provides a better solution in terms of the
and proposed a method for increasing the accuracy while reduced number of messages, high aggregation rate and reli-
reducing the data amount. For sensor network routing, a new able transmission. Zhang et al. [33] proposed the data aggre-
cluster-based routing algorithm that consume less power was gation mechanism supported by dynamic routing. Nodes in
presented. Clustering is one of the important techniques network select the neighbor node as next hop, which has the
for topology control, effective data aggregation and energy- minimum value of function that is made of residual energy,
efficient routing in WSN. hop count and the size of remained buffer. Li et al. [13]
Many researchers have applied machine learning presented differentiated data aggregation routing (DDAR)
techniques to obtain the optimal routing path with low that makes different QoS (Quality of Service) routes to sink
overhead and cost. Chang et al. [25] applied the k-means node based on aggregation threshold and aggregation dead-
algorithm and a genetic algorithm for multi-objective opti- line. Most of conventional data aggregation-based routing
mization. The sensor nodes in the network were clustered algorithms are generally utilizing tree structure or hierar-
using the k-means algorithm. They constructed a fitness chical clustering architecture to aggregate the data and to
function of the genetic algorithm to maximize the network find out the optimum route to the sink. However, they have
lifetime. Thangaramya et al. [26] presented a neuro- not considered network-wise data aggregation possibilities
fuzzy-based energy-efficient clustering algorithm. In neuro- and corresponding energy consumption for different sensor
fuzzy, they used a membership function comprising the types, in which they depend on type-dependent neighbor
FIGURE 5. Schematic of sensing and data forwarding procedures (type t data only).
In Eq. (1), if there is no scheduled sensing time for type t at D. DATA AGGREGATION MODEL
time step n, then ODti (n) = 0. Owing to the high node density in sensor networks, similar
The required energy for data transmission is generally data are sensed by many nodes, which results in redundancy
proportional to the size of the aggregated data and the dis- in the sense data. Using data aggregation techniques, tem-
tance between the sender and receiver if the sensor nodes poral and spatial redundancies can be reduced while routing
can control the transmission power. The required reception packets from the source to the sink [37]–[39].
energy depends on the size and decoding of the data. The In this study, we consider three different types of data
required energy for data aggregation is proportional to the aggregation models. The first is a representative aggregation
queue state [36]. model, in which the sink node represents only a representative
The total transmission energy required by node i at the nth value. The typical mathematical functions are sum, average,
time step is maximum, minimum or median. In this model, regardless
X ADt (n)
(
di−nt∗ β
) of the cumulative queue state size, the aggregated data can
Ei (n) =
TX i
PtxElec + Pamp (3) have a unit packet size, as depicted in Fig. 5(a). The second
∀t B dmax model is the lossy compressive aggregation model. In this
where B is the nominal bit rate; PtxElec is the transmission model, the sensed data from multiple sensors can be rep-
power; Pamp is the amplifier power; dmax is the maximum resented by the limited size of the feature vector, in which
distance for communication at each node;di−nt∗ is the distance various types of dimension reduction techniques with infor-
between node i and the selected next neighbor node for type mation loss can be applied. As depicted in Fig. 5(b), when
t using the proposed routing algorithm, and β is the path loss the queue state is less than the feature vector size of the
exponent (β = 2 for free space). transformed domain, the data in the queue are transmit-
The total reception energy required by node i at the nth time ted without further aggregation. The third model is loss-
step is less aggregation, in which the sink node can reconstruct
X ADt (n) the raw data from the aggregated data without any loss.
EiRX (n) = i
PrxElec + ADti (n) EdecBit (4) In this study, we modeled this type of aggregation using a
∀t B log function, as depicted in Fig. 5(c). The three different
where PrxElec is the reception power, and EdecBit is the decod- data aggregation models are represented mathematically as
ing energy per bit. follows:
The total energy required for data aggregation by node i at
the nth time step is DAmodel1 Qti (n)
X
EiDA (n) = Qti (n)EaggBit
(
(5) t
Um1 if Qti (n) > 0
∀t
= (6)
where EaggBit is data aggregation energy per bit. 0 if Qti (n) = 0
where Um1 t , U t and U t are the unit packet sizes for the
m2 m3
first, second and third models, respectively; DPi (n) is the
number of aggregated data packets in the queue of node i.
The data aggregation model is designed based on the WSN
application objectives and sensor data types. It means that the
actual shapes of models can be different depending on the real
applications and used aggregation methods. Table 1 lists the FIGURE 6. Q-learning model for the proposed system.
system model parameters defined in this study.
IV. Q-LEARNING-BASED DATA-AGGREGATION-AWARE set of states S and a set of actions A. By performing an action
ENERGY-EFFICIENT ROUTING PROTOCOL a ∈ A, the agent transitions from one state to another. The
Reinforcement learning methods are essential to solve agent in state s interacts with the environment with action a
optimal control problems using on-line measurements by to learn the environment, while depending on the outcome,
interacting with an environment. The objective of RL is to to acquire reward r. The decision goal for selecting one of
maximize the reward of an agent by taking a series of actions the actions in the given state is to maximize the expected sum
in response to a dynamic environment. RL can be applied to of weighted rewards, which include the current immediate
the WSN routing problem because it can capture the dynam- reward and future discounted rewards [40].
ics of the network and environment conditions efficiently, In the proposed Q-learning system for WSN routing,
in which the action at each sensor node is the selection of the agent is considered as a network-wide data flow. In the
the next node for forwarding the sensing data to the sink conventional single-agent approach, a centralized network
node. Q-learning is a model-free value-based RL algorithm controller acts as an agent that can observe the global condi-
that is used to obtain the optimal action-selection policy using tions of the entire network and control the packet transmission
a Q value function. The Q value (quality value) represents at each sensor node. This central agent approach requires a
how useful a given action is in gaining some future reward. large overhead and makes it difficult to know the status of
Q-learning uses temporal differences (TD) to estimate the the entire network in real time. In the proposed system, there
expected Q value through episodes with no prior knowledge is no explicit central agent; instead, cooperative informa-
of the environment. Q-learning is defined using an agent, a tion exchange among neighbor nodes ensures that each node
where t is the data-type node s sent and n− is the most recent where Hs is the hop count of node s, 1 E is the K -dimensional
time step at which node s0 computed rDA t0 . vector with all 1s, Rs is the sink node reward and η is the
We have defined the type-independent energy status discount factor for the reward in range [0, 1].
reward. The energy status reward (RE ) is defined as follows: When node s receives reward R, it needs to update its
Q-table. To update its action value function Q (s, a),
E r0 (n) ds−s0 β
RE = sr − (16) (16) it requires Q s0 , a of the next state node. As explained
Es0 (0) dmax previously, in our proposed mechanism, when the next node
s0 receives an aggregated data packet, it replies with the
where Esr0 (n) and Esr0 (0) are the residual energies of the next ACK packet,in which the reward vector R of Eq. (17) and
node s0 at the nth and 0th time steps, respectively; ds−s0 is max Q s0 , a vector are included. Therefore, node s can
the estimated distance between nodes s and s0 ( estimated update its Q-table based on the ACK packet information. The
at node s0 using any distance estimation techniques);dmax is max Q s0 , a vector includes the maximum Q-value for each
the maximum transmission range of the sensor nodes; and data type at the next node s0 as follows:
β is the path loss exponent (in free space β = 2). When
max Qt1 s0 , a
the remaining energy of the next state node is relatively
∀a t2 0
large and the distance between the next and current state max Q s , a
max Q s0 , a = ∀a
nodes is short (which means that the energy requirement for
(18)
..
transmission is low), the action selection is efficient in terms .
of energy. Consequently, the energy state reward increases. max QtK s0 , a
This reward policy can reduce the energy consumption of the ∀a
entire network and increase the network lifetime by evenly The general Q-table update rule of Eq. (10) can be represented
distributing the energy consumption at each node. in vector form as follows:
To forward data to the sink, the reward should be smaller Q (s, a) = (1 − α)Q (s, a) + α R + γ + max Q s0 , a
than the maximum Q-value of the parent hop count node.
However, the fixed reward for all nodes in the network has (19)
a higher probability of backwarding the nodes that are away Fig. 7 illustrates a scenario for the proposed Q-DAEER learn-
from the sink. An additional discount factor for the reward of ing procedure.
the nodes is necessary to prevent backwarding. Reward R for 1) At node s, the waiting timer of type t1 expires at time
s (n)
action a in state s is finally computed as follows: step n, and then node s aggregates data in queue Qt1
( to ADt1s (n).
ηHs × (RDA +RE × 1 E ) if s0 is not a sink
2) Node s selects action a2 (node s0 ) that has the maximum
R= (17)
Rs × 1 E else Q-value for type t1 of state s Q-table.
those of SPR and SPRwDA, respectively for three data aggre- models, the average hop count of Q-DAEER is approximately
gation models. 25%∼35% higher than those of SPR and SPRwDA.
Fig. 11 shows the average hop count of data packets from A comparison of the decrease in data sizes in the network
the data source node to the sink node. The average hop is presented in Fig. 12. Fig. 12(a) shows the decrease in the
count at each time unit is depicted in Fig. 11(a). In SPR data size at each time unit. Because SPR does not perform
and SPRwDA, because each sensor node forwards data to data aggregation, the reduced data size is zero. In SPRwDA,
the neighbor that is closest to the sink node, the average hop the reduction in the data size is almost similar at each time
count is almost constant and lower than that of the proposed step for roughly the first half of the network lifetime; after-
Q-DAEER regardless data aggregation models. However, ward, it increases suddenly. Because SPRwDA utilizes the
near the end of the simulation, the average hop counts of SPR shortest path, the energy of some nodes close to the sink
and SPRwDA increase slightly because some nodes become node depletes, eventually causing these nodes to stop func-
dead owing to the depletion of their energies. In contrast, tioning. This causes data from sensor nodes to concentrate
the proposed Q-DAEER method demonstrates a higher initial in the remaining nodes, which can significantly reduce the
average hop count for reinforcement learning. In Q-learning, data size. Therefore, the decrease in the data size increases
before the Q-table is stabilized and used, the agent needs in the second half of the simulation. However, as shown
to explore more paths. The average hop count in the pro- in Fig 9, this accelerates the energy shortage among the
posed method decreases significantly after the initial learning overloaded nodes and shortens the network lifetime. In the
period. Each sensor node dynamically learns the optimal proposed Q-DAEER algorithm, the rewards that are given by
routing path in terms of not only the hop count but also the the neighbor nodes consider the energy level and degree of
energy consumption and data aggregation degree on the path. data aggregation so that nodes always dynamically determine
The Q-DAEER algorithm may choose longer paths to obtain the best path. The results indicate that the proposed algo-
higher expected rewards by achieving more data aggregation rithm can obtain a more optimal path to improve energy and
and energy saving. Therefore, for three data aggregation data aggregation efficiency compared with the conventional
TABLE 5. The results of grid and random topologies for 100 and 400 sensor node cases.
VI. CONCLUSION
In this article, we proposed a Q-learning-based data-
aggregation-aware energy-efficient routing (Q-DAEER)
algorithm. To calculate the best path to maximize the lifetime
and minimize energy consumption of the network, we defined
a reward policy that considered the energy level, distance,
hop count and the degree of data aggregation at each node.
For efficient data aggregation at each node with different
sensor types, we presented a data aggregation and system
model in which sensor-type-dependent queue management
and transmission schedule control were used. The reward
functions defined in this study captured the changes in
the energy node, neighbor relationship and type-dependent
data aggregation dynamics of each node. In the proposed
Q-DAEER algorithm, we incorporated a data-type-dependent
action selection and Q-table updating algorithm. To demon-
strate the applicability of the proposed algorithm to various
data aggregation scenarios, we defined three different data
aggregation models. We compared the performance of the
proposed algorithm with that of the conventional routing
protocol in terms of its energy consumption, network lifetime,
average hop count and degree of data aggregation. The
results indicate that the proposed algorithm can obtain a
more optimal path to improve energy and data aggregation
FIGURE 12. Comparison of data size reduction due to data aggregation efficiencies when compared with the conventional method.
(a) Per time unit (tu) decrease in data size due to aggregation (b) Average
decrease in data size.
We demonstrated that the proposed Q-DAEER protocol can
successfully reduce the overall data transmission load and
method. As depicted in Fig. 12(b), the proposed algorithm extend the lifetime of the wireless sensor network.
achieved approximately 20%∼10% higher data reduction
ratio compared with SPRwDA for three aggregation models. REFERENCES
We applied a random topology in addition to the grid [1] Y. Jin, K. S. Kwak, and S.-J. Yoo, ‘‘A novel energy supply strategy for
topology in the previous experiments in the sensor deploy- stable sensor data delivery in wireless sensor networks,’’ IEEE Syst. J.,
ment topology, and also verified the scalability of the pro- vol. 14, no. 3, pp. 3418–3429, Sep. 2020.
[2] S. Wang, J. Wan, D. Li, and C. Xhang, ‘‘Implementing smart factory of
posed algorithm by increasing the number of nodes to industrie 4.0: An outlook,’’ Int. J. Distrib. Sensor Netw., vol. 12, pp. 1–10,
100 and 400. Table 5 shows the experimental results with the Jan. 2016.
[3] T.-H. Kim, C. Ramos, and S. Mohammed, ‘‘Smart city and IoT,’’ Future [26] K. Thangaramya, K. Kulothungan, R. Logambigai, M. Selvi,
Gener. Comput. Syst., vol. 76, pp. 159–162, Nov. 2017. S. Ganapathy, and A. Kannan, ‘‘Energy aware cluster and neuro-fuzzy
[4] K. Cengiz and T. Dag, ‘‘Energy aware multi-hop routing protocol for based routing algorithm for wireless sensor networks in IoT,’’ Comput.
WSNs,’’ IEEE Access, vol. 6, pp. 2622–2633, 2018. Netw., vol. 151, pp. 211–223, Mar. 2019.
[5] J. Huang, Y. Hong, Z. Zhao, and Y. Yuan, ‘‘An energy-efficient multi-hop [27] W. Guo, C. Yan, and T. Lu, ‘‘Optimizing the lifetime of wireless sensor
routing protocol based on grid clustering for wireless sensor networks,’’ networks via reinforcement-learning-based routing,’’ Int. J. Distrib. Sensor
Cluster Comput., vol. 20, no. 4, pp. 3071–3083, Jun. 2017. Netw., vol. 15, no. 2, Feb. 2019, Art. no. 155014771983354.
[6] P. Guo, J. Cao, and X. Liu, ‘‘Lossless in-network processing in WSNs [28] J. Wang, J. Cao, R. S. Sherratt, and J. H. Park, ‘‘An improved ant colony
for domain-specific monitoring applications,’’ IEEE Trans. Ind. Informat., optimization-based approach with mobile sink for wireless sensor net-
vol. 13, no. 5, pp. 2130–2139, Oct. 2017. works,’’ J. Supercomput., vol. 74, no. 12, pp. 6633–6645, Dec. 2018.
[29] H. El Alami and A. Najid, ‘‘Energy-efficient fuzzy logic cluster head
[7] S. A. Putra, B. R. Trilaksono, A. Harsoyo, and A. I. Kistijantoro, ‘‘Mul-
selection in wireless sensor networks,’’ in Proc. Int. Conf. Inf. Technol.
tiagent system in-network processing in wireless sensor network,’’ Int. J.
Org. Develop. (IT OD), Rabat, Morocco, Mar. 2016, pp. 1–7.
Electr. Eng. Informat., vol. 10, no. 1, pp. 94–107, Mar. 2018.
[30] J.-S. Lee and C.-L. Teng, ‘‘An enhanced hierarchical clustering approach
[8] Z. A. Khan and A. Samad, ‘‘A study of machine learning in wireless
for mobile sensor networks using fuzzy inference systems,’’ IEEE Internet
sensor network,’’ Int. J. Comput. Netw. Appl., vol. 4, no. 4, pp. 105–112,
Things J., vol. 4, no. 4, pp. 1095–1103, Aug. 2017.
Aug. 2017.
[31] H. El Alami and A. Najid, ‘‘ECH: An enhanced clustering hierarchy
[9] D. P. Kumar, T. Amgoth, and C. S. R. Annavarapu, ‘‘Machine learning approach to maximize lifetime of wireless sensor networks,’’ IEEE Access,
algorithms for wireless sensor networks: A survey,’’ Inf. Fusion, vol. 49, vol. 7, pp. 107142–107153, 2019.
pp. 1–25, Sep. 2019. [32] S. A. Sert and A. Yazici, ‘‘Optimizing the performance of rule-based fuzzy
[10] Z. Mammeri, ‘‘Reinforcement learning based routing in networks: Review routing algorithms in wireless sensor networks,’’ in Proc. IEEE Int. Conf.
and classification of approaches,’’ IEEE Access, vol. 7, pp. 55916–55950, Fuzzy Syst. (FUZZ-IEEE), New Orleans, LA, USA, Jun. 2019, pp. 1–6.
2019. [33] J. Zhang, Q. Wu, F. Ren, T. He, and C. Lin, ‘‘Effective data aggregation
[11] J. Wang, Y. Gao, W. Liu, A. K. Sangaiah, and H.-J. Kim, ‘‘Energy efficient supported by dynamic routing in wireless sensor networks,’’ in Proc. IEEE
routing algorithm with mobile sink support for wireless sensor networks,’’ Int. Conf. Commun., Cape Town, South Africa, May 2010, pp. 1–6.
Sensors, vol. 19, pp. 1–19, Jan. 2019. [34] R. Rajagopalan and P. K. Varshney, ‘‘Data-aggregation techniques in sen-
[12] S. K. Das and S. Tripathi, ‘‘Intelligent energy-aware efficient routing for sor networks: A survey,’’ IEEE Commun. Surveys Tuts., vol. 8, no. 4,
MANET,’’ Wireless Netw., vol. 24, no. 4, pp. 1139–1159, May 2018. pp. 48–63, 4th Quart., 2006.
[13] X. Li, W. Liu, M. Xie, A. Liu, M. Zhao, N. N. Xiong, M. Zhao, and W. Dai, [35] A. Ganesh, ‘‘Publish/subscribe model in a wireless sensor network,’’
‘‘Differentiated data aggregation routing scheme for energy conserving and U.S. Patent 7 590 098, Sep. 15, 2009.
delay sensitive wireless sensor networks,’’ Sensors, vol. 18, no. 7, pp. 1–29, [36] H. Karl and A. Willig, Protocols and Architectures for Wireless Sensor
Jul. 2018. Networks. Hoboken, NJ, USA: Wiley, 2007.
[14] S. Sasirekha and S. Swamynathan, ‘‘Cluster-chain mobile agent rout- [37] M. Dagar and S. Mahajan, ‘‘Data aggregation in wireless sensor network:
ing algorithm for efficient data aggregation in wireless sensor network,’’ A survey,’’ Int. J. Inf. Comput. Technol., vol. 3, no. 3, pp. 167–174, 2013.
J. Commun. Netw., vol. 19, no. 4, pp. 392–401, Aug. 2017. [38] X. Xu, R. Ansari, A. Khokhar, and A. V. Vasilakos, ‘‘Hierarchical data
[15] Y. Bai, Y. Mai, and N. Wang, ‘‘Performance comparison and evaluation of aggregation using compressive sensing (HDACS) in WSNs,’’ ACM Trans.
the proactive and reactive routing protocols for MANETs,’’ in Proc. Wire- Sensor Netw., vol. 11, no. 3, pp. 1–25, May 2015.
less Telecommun. Symp. (WTS), Chicago, IL, USA, Apr. 2017, pp. 1–5. [39] F. Marcelloni and M. Vecchio, ‘‘An efficient lossless compression algo-
[16] R. E. Mohemed, A. I. Saleh, M. Abdelrazzak, and A. S. Samra, ‘‘Energy- rithm for tiny nodes of monitoring wireless sensor networks,’’ Comput. J.,
efficient routing protocols for solving energy hole problem in wireless vol. 52, no. 8, pp. 969–987, Nov. 2009.
sensor networks,’’ Comput. Netw., vol. 114, pp. 51–66, Feb. 2017. [40] Q. Yang, S.-J. Jang, and S.-J. Yoo, ‘‘Q-learning-based fuzzy logic for multi-
objective routing algorithm in flying ad hoc networks,’’ Wireless Pers.
[17] A. Razaque, M. Abdulgader, C. Joshi, F. Amsaad, and M. Chauhan, ‘‘P-
Commun., vol. 113, no. 1, pp. 115–138, Jul. 2020.
LEACH: Energy efficient routing protocol for wireless sensor networks,’’
in Proc. IEEE Long Island Syst., Appl. Technol. Conf. (LISAT), Apr. 2016, WAN-KYU YUN received the B.S. degree from
pp. 1–5. the Information and Communication Engineer-
[18] R. A. Khan, Q. Xin, and N. Roshan, ‘‘RK-energy efficient routing protocol ing Department, Inha University, South Korea,
for wireless body area sensor networks,’’ Wireless Pers. Commun., vol. 116, where he is currently pursuing the M.S. degree
pp. 1–13, Aug. 2020. with the Multimedia Network Laboratory. His
[19] T. Baker, M. J. García-Campos, D. G. Reina, S. Toral, H. Tawfik, research interests include wireless sensor net-
D. Al-Jumeily, and A. Hussain, ‘‘GreeAODV: An energy efficient routing works, the Internet of Things, machine learning,
protocol for vehicular ad hoc networks,’’ in Proc. Int. Conf. Intell. Comput., and deep learning.
Jul. 2018, pp. 670–681.
[20] O. S. Oubbati, M. Mozaffari, N. Chaib, P. Lorenz, M. Atiquzzaman, and
A. Jamalipour, ‘‘ECaD: Energy-efficient routing in flying ad hoc net- SANG-JO YOO (Member, IEEE) received the B.S.
works,’’ Int. J. Commun. Syst., vol. 32, no. 18, p. e4156, Dec. 2019. degree in electronic communication engineering
[21] O. A. Mahdi, A. W. A. Wahab, M. Y. I. Idris, A. A. Znaid, from Hanyang University, Seoul, South Korea,
Y. R. B. Al-Mayouf, and S. Khan, ‘‘WDARS: A weighted data aggregation
in 1988, and the M.S. and Ph.D. degrees in electri-
routing strategy with minimum link cost in event-driven WSNs,’’ J. Sen-
cal engineering from the Korea Advanced Institute
sors, vol. 2016, pp. 1–12, Sep. 2016.
of Science and Technology, in 1990 and 2000,
[22] S. P. Ardakani, J. Padget, and M. De Vos, ‘‘A mobile agent routing protocol
respectively.
for data aggregation in wireless sensor networks,’’ Int. J. Wireless Inf.
Netw., vol. 24, no. 1, pp. 27–41, Mar. 2017. From 1990 to 2001, he was a Member of Tech-
nical Staff with the Korea Telecom Research and
[23] K. Haseeb, N. Islam, T. Saba, A. Rehman, and Z. Mehmood, ‘‘LSDAR:
A light-weight structure based data aggregation routing protocol with Development Group, where he was involved in
secure Internet of Things integrated next-generation sensor networks,’’ communication protocol conformance testing and network design fields.
Sustain. Cities Soc., vol. 54, pp. 1–9, Mar. 2020. From 1994 to 1995 and 2007 to 2008, he was a Guest Researcher with the
[24] A. Yazici, M. Koyuncu, S. A. Sert, and T. Yilmaz, ‘‘A fusion-based National Institute Standards and Technology, USA. Since 2001, he has been
framework for wireless multimedia sensor networks in surveillance appli- with Inha University, where he is currently a Professor with the Information
cations,’’ IEEE Access, vol. 7, pp. 88418–88434, 2019. and Communication Engineering Department. His current research interests
[25] Y. Chang, X. Yuan, B. Li, D. Niyato, and N. Al-Dhahir, ‘‘Machine- include cognitive radio network protocols, adhoc wireless networks, MAC
learning-based parallel genetic algorithms for multi-objective optimization and routing protocol design, wireless networks, QoS, and wireless sensor
in ultra-reliable low-latency WSNs,’’ IEEE Access, vol. 7, pp. 4913–4926, networks.
2019.