Module 4 & 5
Module 4 & 5
Chapter 7
IoT Connectivity Technologies
Learning Outcomes
After reading this chapter, the reader will be able to:
• List common connectivity protocols in IoT
• Identify the salient features and application scope of each connectivity protocol
• Understand the terminologies and technologies associated with IoT connectivity
• Determine the requirements associated with each of these connectivity protocols
in real-world solutions
• Determine the most appropriate connectivity protocol for each segment of their
IoT implementation
7.1 Introduction
This chapter outlines the main features of fifteen identified commonly used
and upcoming IoT connectivity enablers. These connectivity technologies can be
integrated with existing sensing, actuation, and processing solutions for extending
connectivity to them. Some of these solutions necessarily require integration with a
minimal form of processing infrastructure, such as Wi-Fi. In contrast, others, such
as Zigbee, can work in a standalone mode altogether, without the need for external
processing and hardware support. These solutions are outlined in the subsequent
sections in this chapter.
i i
i i
i i
Application
Presentation
Host layers
Session
Network
Transport
LLC
Network
SSCS
Media layers
Data link
Data link
IEEE 802.15.4
Physical
Physical
Figure 7.1 The operational part of IEEE 802.15.4’s protocol stack in comparison to the OSI
stack
The direct sequence spread spectrum (DSSS) modulation technique is used in IEEE
802.15.4 for communication purposes, enabling a wider bandwidth of operation with
enhanced security by the modulating pseudo-random noise signal. This standard
exhibits high tolerance to noise and interference and offers better measures for
improving link reliability. Typically, the low-speed versions of the IEEE 802.15.4
standard use binary phase shift keying (BPSK), whereas the versions with high data
rate implement offset quadrature phase shift keying (O-QPSK) for encoding the
message to be communicated. Carrier sense multiple access with collision avoidance
(CSMA-CA) is the channel access method used for maintaining the sequence of
transmitted signals and preventing deadlocks due to multiple sources trying to access
the same channel. Temporal multiplexing enables access to the same channel by
multiple users or nodes at different times in a maximally interference-free manner.
i i
i i
i i
The IEEE 802.15.4 standard [2] utilizes infrequently occurring and very short
packet transmissions with a low duty cycle (typically, < 1%) to minimize the power
consumption. The minimum power level defined is –3 dBm or 0.5 mW for the radios
utilizing this standard. The transmission, for most cases, is line of sight (LOS),
with the standard transmission range varying between 10 m to 75 m. The best-case
transmission range achieved outdoors can be up to 1000 m.
This standard typically defines two networking topologies: 1) Star and 2) mesh.
There are seven variants identified with in IEEE 802.15.4—A, B. C, D, E, F, and G.
Variants A/B are the base versions, C is assigned for China, and D for Japan. Variants
E, F, and G are assigned respectively for industrial applications, active RFID (radio
frequency identification) uses, and smart utility systems.
The IEEE 802.15.4 standard supports two types of devices: 1) reduced function
device (RFD) and 2) full function devices (FFD). FFDs can talk to all types of devices
and support full protocol stacks. However, these devices are costly and energy-
consuming due to increased requirements for support of full stacks. In contrast, RFDs
can only talk to an FFD and have lower power consumption requirements due to
minimal CPU/RAM requirements. Figure 7.2 shows the device types and network
types supported by the IEEE 802.15.4 standard.
Beacon-Enabled
Network
Non-Beacon-
Enabled
IEEE 802.15.4
Coordinator
FFD Router
Device Device
RFD Device
Figure 7.2 The various device and network types supported in the IEEE 802.15.4 standard
i i
i i
i i
layer discovery. This network typically requires both source and destination IDs of the
communicating nodes. As the IEEE 802.15.4 is primarily a mesh protocol, all protocol
addressing must adhere to mesh configurations such that there is a decentralized
communication amongst nodes.
Figure 7.3 shows the frame types associated with the IEEE 802.15.4 standard.
Beacon frames are used for signaling and synchronization; data transmission is
done through the data frames; and message reception is confirmed using the
acknowledgment frames. MAC and command frames are used for association
requests/responses, dissociation requests, data requests, beacon requests, coordinator
realignment, and orphan notifications.
Beacon
Date
Mac
Ack
Figure 7.3 Various frame types supported in the IEEE 802.15.4 standard
Check yourself
7.3 Zigbee
The Zigbee radio communication is designed for enabling wireless personal area
networks (WPANs). It uses the IEEE 802.15.4 standard for defining its physical and
medium access control (layers 1 and 2 of the OSI stack). Zigbee finds common usage
in sensor and control networks [4]. It was designed for low-powered mesh networks
at low cost, which can be broadly implemented for controlling and monitoring
applications, typically in the range of 10–100 meters [3]. The PHY and MAC layers in
this communication are designed to handle multiple low data rate operating devices.
The frequencies of 2.4 GHz, 902–928 MHz or 868 MHz are commonly associated with
Zigbee WPAN operations. The Zigbee commonly uses 250 kbps data rate which is
optimal for both periodic and intermittent full-duplex data transmission between two
Zigbee entities.
i i
i i
A network can be significantly extended in the Zigbee mesh and tree topologies
by using multiple routers where the root of the topology is the coordinator. These
configurations allow any Zigbee device or node to communicate with any other
i i
adjacent node. In case of the failure of one or more nodes, the information is
automatically forwarded to other devices through other functional devices. In a
Zigbee cluster tree network, a coordinator is placed in the leaf node position of the
cluster, which is, in turn, connected to a parent coordinator who initiates the entire
network.
A typical Zigbee network structure can consist of three different device types,
namely the Zigbee coordinator, router, and end device, as shown in Figure 7.4. Every
Zigbee network has a minimum of one coordinator device type who acts as the root;
it also functions as the network bridge. The coordinator performs data handling and
storing operations. The Zigbee routers play the role of intermediate nodes that connect
two or more Zigbee devices, which may be of the same or different types. Finally,
the end devices have restricted functionality; communication is limited to the parent
nodes. This reduced functionality enables them to have a lower power consumption
requirement, allowing them to operate for an extended duration. There are provisions
to operate Zigbee in different modes to save power and prolong the deployed network
lifetime.
The PHY and MAC layers of the IEEE 802.15.4 standard are used to build the
protocol for Zigbee architecture; the protocol is then accentuated by network and
application layers designed especially for Zigbee. Figure 7.5 shows the Zigbee protocol
stack. The various layer of the Zigbee stack are as follows.
Application
Presentation
Host layers
Session Zigbee
Network
Transport
LLC
Network
SSCS
Media layers
Data link
Data link
IEEE 802.15.4
Physical
Physical
Figure 7.5 The Zigbee protocol stack in comparison to the OSI stack
• Physical Layer: This layer is tasked with transmitting and receiving signals, and
performing modulation and demodulation operations on them, respectively. The
i i
i i
i i
Zigbee physical layer consists of 3 bands made up of 27 channels: the 2.4 GHz
band has 16 channels at 250 kbps the 868.3 MHz has one channel at 20 kbps; and
the 902-928 MHz has ten channels at 40 kbps.
• MAC Layer: This layer ensures channel access and reliability of data
transmission. CSMA-CA is used for channel access and intra-channel
interference avoidance. This layer handles communication synchronization
using beacon frames.
• Network Layer: This layer handles operations such as setting up the network,
connecting and disconnecting the devices, configuring the devices, and routing.
• Application Support Sub-Layer: This layer handles the interfacing services,
control services, bridge between network and other layers, and enables the
necessary services to interface with the lower layers for Zigbee device object
(ZDO) and Zigbee application objects (ZAO). This layer is primarily tasked with
data management services and is responsible for service-based device matching.
• Application Framework: Two types of data services are provided by the
application framework: provision of a key-value pair and generation of generic
messages. A key-value pair is used for getting attributes within the application
objects, whereas a generic message is a developer-defined structure.
Zigbee handles two-way data transfer using two operational modes: 1) Non-
beacon mode and 2) beacon mode. As the coordinators and routers monitor the active
state of the received data continuously in the non-beacon mode, it is more power-
intensive. In this mode, there is no provision for the routers and coordinators to sleep.
In contrast, a beacon mode allows the coordinators and routers to launch into a very
low-power sleep state in the absence of data communication from end devices. The
Zigbee coordinator is designed to periodically wake up and transmit beacons to the
available routers in the network. These beacon networks are used when there is a need
for lower duty cycles and more extended battery power consumption.
Check yourself
7.4 Thread
Thread is built upon the IEEE 802.15.4 radio standard; it is used for extremely
low power consumption and low latency deployments [5]. Unlike Zigbee, Thread
can extend direct Internet connectivity to the devices it is connected with. Thread
removes the need for a mobile phone or a proprietary gateway to be in the range
of devices for accessing the Internet. It is specially designed for IoT with the need for
interoperability, security, power, and architecture addressed in a single radio platform.
i i
i i
i i
Figure 7.6 shows the comparison of the Thread stack against the standard ISO-OSI
stack. Thread is built on open standards to achieve a low-power wireless mesh
Application
Application
Presentation
Session Zigbee
UDP+DTLS
Security
Transport IP routing
Network Ipv6
Data link
IEEE 802.15.4
Physical
Figure 7.6 The functional protocol stack of Thread in comparison to the OSI stack
networking protocol with universal Internet Protocol (IP) support. The standard is
easy to set up and simple to use; it can reliably connect thousands of devices to the
Internet or a cloud with no single point of failure. It has the distinctive feature of self-
healing and reconfiguration in the event of the addition or removal of a device. Figure
7.7 shows the Thread network architecture.
Thread enables IoT interoperability by utilizing a certification application that
validates a device’s conformance to the specification as well as its interoperability
against multiple certified stacks. This feature ensures the resilience of connectivity,
even with diverse networks, in turn enabling its users to have consistent operational
experience.
Empowering low-power wireless devices with IP connectivity enables Thread
to seamlessly accommodate itself with larger IP-based networks and be a robust
option for most IoT applications such as smart homes/buildings, connected vehicles,
and others. This feature of Thread devices removes the need for Internet-enabled
proprietary gateways and cross-stack translators for connection between other
technologies. The additional benefits of this feature include better resilience to single
point of failures, highly economical deployments, less complex infrastructure, and
enhanced IoT end-to-end device security on the Internet.
i i
i i
i i
E E
E
E
R R Cloud
R L Cloud
Border
E router
R
E R E
Smartphone
E E E
Figure 7.7 Outline of the Thread network architecture (from end devices to the cloud)
Thread devices can use common infrastructure similar to Wi-Fi networks and can
connect directly to smartphones or computers if they are on the same IP network,
without any additional setup for Thread.
Check yourself
7.5 ISA100.11A
The ISA100.11A is a very low power communication standard and has been developed
and managed by ISA (International Society of Automation) [7]. Similar to the previous
protocols, it uses the IEEE 802.15.4 standard as a base for building its protocol.
The standard was mainly proposed for industrial plant automation systems. The
ISA100.11A is characterized by an IoT compliant protocol stack, which can also be
integrated with wired networks using Ethernet, support for open access protocols
and device-level interoperability; it boasts of a 128-bit AES (Advanced Encryption
Standard) encryption securing all communications. The security in ISA100.11A is
in two layers: Transport layer and data link layer. ISA100.11A provides extensive
support for IPv6 and UDP and uses TDMA (time-division multiple access)-based
resource sharing with CSMA-CA. Both IPv6 and UDP as well as star topologies
are supported by this standard. The utilization of IPv6 provides certain distinct
benefits to ISA100.11A, such as increased address sizes, enhanced IPSec-based security
measures, savings in network bandwidth by virtue of multicasting and auto address
configuration.
i i
i i
i i
An ISA100.11A wireless network utilizes the 2.4 GHz frequency band for
communication, similar to Wi-Fi and Bluetooth. To avoid interference over wireless
channels in the same band, it uses frequency hopping spread spectrum (FHSS) over a
total of 16 channels. A definitive feature of this protocol is channel blacklisting, which
blacklists the channels already in use by other protocols. This enables the protocol
to perform even better by further achieving immunity from interference. Figure 7.8
B System
R manager
R
Control
system
I/O Handheld Routing Backbone
Device device
R device
B
device
i i
i i
i i
Application
Presentation
Host layers
Tunneling
Session
UDP
Transport
ISA100.11A Ipv6
Network
Data link control
Media layers
Data link
Data link
IEEE 802.15.4
Physical
Physical
Figure 7.9 The ISA100.11A protocol stack in comparison to the OSI stack
• Physical Layer: It is built upon the IEEE 802.15.4-2006 standard. The physical
layer communicates on the 2.4 GHz frequency band using a DSSS (direct
sequence spread spectrum) modulation.
• Data Link Layer: It handles the creation, maintenance, and forwarding
packet functionalities in addition to typical MAC functionalities. Additionally,
it is responsible for operations dealing with the structure of the data packet,
formation of the frame, detecting the error, and bus arbitration. A data link
control (DLC) layer in ISA100.11A, which uses a graph-based routing, is
responsible for specific distinctive functions such as adaptive channel hopping,
detection and recovery of message loss, and clock synchronization.
• Network Layer: The ISA100.11A network layer is 6LoWPAN-compliant and
uses IPv6 addressing for an end-to-end routing. Protocol conversion from IPv6
to 6LoWPAN and 6LoWPAN to IPv6 is executed at this layer by a router.
• Transport Layer: The ISA100.11A transport layer supports UDP-based
connectionless services.
• Application Layer: The ISA100.11A stack only specifies system management
application in this layer.
Check yourself
i i
i i
i i
7.6 WirelessHART
WirelessHART can be considered as the wireless evolution of the highway addressable
remote transducer (HART) protocol [7]. It is a license-free protocol, which was
developed for networking smart field devices in industrial environments. The lack
of wires makes the adaptability of this protocol significantly advantageous over
its predecessor, HART, in industrial settings. By virtue of its highly encrypted
communication, wireless HART is very secure and has several advantages over
traditional communication protocols. Similar to Zigbee, wirelessHART uses the IEEE
802.15.4 standard for its protocols designing.
Figure 7.10 shows the WirelessHART network architecture. WirelessHART can
communicate with a central control system in any of the two ways: 1) Direct and
2) indirect. Direct communication is achieved when the devices transmit data directly
to the gateway in a clear LOS (typically 250 m). Indirect communication is achieved
between devices in a mesh and a gateway when messages jump from device to
device until it reaches the gateway. WirelessHART communication is 99.999% reliable
due to the maintenance of a tight schedule between message transmissions. All
wirelessHART devices are back-compatible and allow for the integration of legacy
devices as well as new ones.
AP Security
R manager Host
R application
Interface
AP Network
R manager
R
Process
control
Gateway automation
The HART encompasses the most number of field devices incorporated in any field
network. WirelessHART makes device placements more accessible and cheaper, such
as the top of a reaction tank, inside a pipe, or widely separated warehouses. The wired
and unwired versions differ mainly in the network, data link, and physical layer. The
wired HART lacks a network layer. HART ensures congestion control in the 2.4 Ghz
ISM band by eliminating channel 26 because of its restricted usage in certain areas. The
i i
i i
i i
Application Application
Presentation
Network Network
Figure 7.11 The WirelessHART protocol stack in comparison to the OSI stack
• Physical Layer: The IEEE 802.15.4 standard specification is used for designing
the physical layer of this protocol. Its operation is limited to the use of the
2.4 GHz frequency band. The channel reliability is significantly increased by
utilizing only 15 channels of the 2.4 GHz band.
• Data Link Layer: The data link layer avoids collisions by the use of TDMA.
The communication is also made deterministic by the use of superframes.
i i
i i
i i
Check yourself
7.7 RFID
RFID stands for radio frequency identification. This technology uses tags and readers
for communication. RFID tags have data encoded onto them digitally [8]. The RFID
readers can read the values encoded in these tags without physically touching them.
RFIDs are functionally similar to barcodes as the data read from tags are stored in
a database. However, RFID does not have to rely on line of sight operation, unlike
barcodes.
The automatic identification and data capture (AIDC) technology can be
considered as the precursor of RFID. Similar to AIDC techniques, RFID systems are
capable of automatically categorizing objects. Categorization tasks such as identifying
tags, reading data, and feeding the read data directly into computer systems through
radio waves outline the operation of RFID systems. Typically, RFID systems are made
up of three components: 1) RFID tag or smart label, 2) RFID reader, and 3) an antenna.
Figure 7.12 shows the various RFID components.
In RFID, the tags consist of an integrated circuit and an antenna, enclosed in a
protective casing to protect from wear and tear and environmental effects. These
i i
i i
i i
RFID Antenna
Interrogator
Tagged item
RFID tag
Database
tags can be either active or passive. Passive tags find common usage in a variety of
applications due to its low cost; however, it has to be powered using an RFID reader
before data transmission. Active tags have their own power sources and do not need
external activation by readers. Tags are used for transmitting the data to an RFID
interrogator or an RFID reader. The radio waves are then converted to a more usable
form of data by this reader. A host computer system accesses the collected data on
the reader by a communication technology such as Wi-Fi or Ethernet. The data on the
host system is finally updated onto a database. RFID applications span across domains
such as inventory management, asset tracking, personnel tracking, and supply chain
management.
Check yourself
How does RFID tackle various services, such as asset tracking and inventory
management?
7.8 NFC
Near field communication (NFC) was jointly developed by Philips and Sony as a
short-range wireless connectivity standard, enabling peer-to-peer (P2P) data exchange
network. Communication between NFC devices is achieved by the principle of
magnetic induction, whenever the devices are brought close to one another [9]. NFC
can also be used with other wireless technologies such as Wi-Fi after establishing
and configuring the P2P network. The communication between compatible devices
requires a pair of transmitting and receiving devices. The typical NFC operating
i i
i i
i i
frequency for data is 13.56 MHz, which supports data rates of 106, 212, or 424 kbps.
NFC devices can be grouped into two types: 1) passive NFC and 2) active NFC. Figure
7.13 shows the various NFC types, components, and its usage.
A small electric current is emitted by the NFC reader, which creates a magnetic field
that acts as a bridge in the physical space between two NFC devices. The generated
EM (electromagnetic) field is converted back into electrical impulses through another
coil on the client device. Data such as identifiers, messages, currency, status, and
others can be transmitted using NFCs. NFC communication and pairing are speedy
due to the use of inductive coupling and the absence of manual pairing.
Passive NFC
Public network
Local network
Maintenance
NFC Reader
Tags
Active NFC Transport
Passive NFC devices do not need a power source for communicating with the NFC
reader. Tags and other small transmitters can act as passive NFC devices. However,
passive devices cannot process information; they simply store information, which is
read by an NFC reader. In contrast, active NFC devices can communicate with active
as well as passive NFC devices. Active devices are capable of reading as well as
writing data to other NFC terminals or devices. Some of the most commonly used
NFC platforms are smartphones, public transport card readers, and commercial touch
payment terminals.
NFC currently supports three information exchange modes: 1) peer-to-peer,
2) read/write, and 3) card emulation. The peer-to-peer mode is commonly used in
NFC modes; it enables two NFC devices to exchange information. In the peer-to-peer
mode of information exchange, the transmitting device goes active while the receiving
device becomes passive. During the reverse transfer, both devices change roles. The
read/write mode of information exchange allows only one-way data transmission. An
active NFC device connects to a passive device to read information from it. Finally,
i i
i i
i i
the card emulation mode enables an NFC device (generally, smartphones) to act as a
contactless credit card and make payments using just a simple tap on an NFC reader.
Check yourself
7.9 DASH7
The DASH7 protocol is based on an active RFID standard [10]. It operates in the 433
MHz frequency band and is being rapidly accepted in agriculture, vehicles, mobiles,
and other consumer electronics-related applications. The messages in DASH7 are
modulated using FSK (frequency shift keying) modulation before transmission over
the 433 MHz frequency band. A very crucial aspect of DASH7 is its capability to use its
433.92 MHz operational band to enable communications with NFC devices. Recall, as
the NFCs operate in the 13.56 MHz band, they can communicate with DASH7 radios
by temporarily modifying/altering their antenna to access the higher-order harmonics
of the DASH7 band (433.92/13.56 = 32 or 25 ). Figure 7.14 shows the DASH7 network
architecture.
Internet
Dash7
Tags
Gateway
Smart
devices NFC
Compared to the IEEE 802.15.4 and its dependent technologies, the DASH7
protocol has a fully defined and complete OSI stack. This enables the DASH7 stack to
be made adaptable to the physical layers of technologies such as Sigfox or LoRa. The
i i
i i
i i
DASH7 stack includes support for cheap processing systems by virtue of its integrated
file system. Figure 7.15 shows the protocol stack of DASH 7 in comparison to the ISO-
OSI stack. DASH7 gateways can query devices in proximity to it without waiting for
pre-defined time-slots to listen to end-device beacons.
Application Application
Presentation Presentation
Session Session
Transport Transport
Figure 7.15 The DASH7 protocol stack in comparison to the OSI stack
Check yourself
i i
i i
i i
7.10 Z-Wave
Z-Wave is an economical and less complicated alternative to Zigbee. It was developed
by Zensys, mainly for home automation solutions [11]. It boasts of a power
consumption much lower than Wi-Fi, but with ranges greater than Bluetooth. This
feature makes Z-Wave significantly useful for home IoT use by enabling inter-device
communication between Z-wave integrated sensors, locks, home power distribution
systems, appliances, and heating systems. Figure 7.16 shows the network architecture
of the Z-Wave protocol.
ID-H:1, ID-H:1,
ID-H:1, ID-H:4 ID-H:3
ID-H:7
ID-H:1,
ID-H:8
ID-H:1,
Z- Wave device ID-H:2
ID-H:1,
ID-H:1 ID-H:1
Figure 7.17 shows the stack for this protocol. The Z-Wave operational frequency
is in the range of 800–900 MHz, which makes it mostly immune to the interference
effects of Wi-Fi and other radios utilizing the 2.4 GHz frequency band. Z-wave
utilizes gaussian frequency shift keying (GFSK) modulation, where the baseband
pulses are passed through a Gaussian filter before modulation. The filtering operation
smoothens the pulses consisting of streams of –1 and 1 (known as pulse shaping),
which limits the modulated spectrum’s width. A Manchester channel encoding is
applied for preparing the data for transmission over the channel.
i i
i i
i i
Z-Wave stack
Application
Security
Network
Transport
Physical
Check yourself
i i
i i
i i
7.11 Weightless
Weightless is yet another emerging open standard for enabling networked
communication in IoT; it is especially useful for low-power wide area networks [12]. It
was designed for useful for low-power, low-throughput, and moderate to high latency
applications supporting either or both public and private networks. The operating
frequency of Weightless is restricted to sub-GHz bands, which are also exempted from
the requirements of licensing such as 138 MHz, 433 MHz, 470 MHz, 780 MHz, 868
MHz, 915 MHz, and 923 MHz. Initially, three standards were released for Weightless:
Weightless P, Weightless N, and Weightless W. Weightless P is the only currently
accepted and used standard as it has features for bi-directional communication over
both licensed as well as unlicensed ISM bands. Weightless N was designed as an
LWPAN uplink-only technology, whereas Weightless W was designed to make use of
the TV whitespace frequencies for communication.
As Weightless P was the most commonly adopted and accepted standard among
the three Weightless standards, it came to be referred to merely as Weightless.
Weightless provides a true bi-directional and reliable means of communication,
where each message transaction is validated using an acknowledgment message.
As it was designed initially for dense deployments of low-complexity IoT end
devices, its payload size was limited to less than 48 bytes. Weightless networks
can be optimized to attain ultra-low-power consumption status compared to cellular
networks. However, this is at the cost of network latency and throughput with data
rates in the range of 0.625 kbps to 100 kbps. Weightless has been identified with three
architectural components: end devices, base stations, and base station network (Figure
7.18). The end devices (ED) form the leaf nodes in the Weightless network. These
devices are typically low complexity and low cost. The duty cycle of EDs is also low,
with a nominal transmiting power of 14 dBm (which can be increased up to 30 dBm).
Figure 7.18 Typical components of the Weightless standard and its protocols
i i
i i
i i
The base stations (BS) act as the central coordinating node in each cell. A star topology
is deployed to connect the EDs to the BS. The transmit powers of a typical BS lie in the
range of 27 dBm to 30 dBm. Finally, the base station network (BSN) is responsible
for connecting all the BS of a single network. This enables the BSN to manage the
allocation and scheduling of radio resources across the network. Additional tasks of
the BSN include addressing authentication, roaming, and scheduling responsibilities.
Check yourself
7.12 Sigfox
Sigfox is a low-power connectivity solution, which was developed for various
businesses such as building automation and security, smart metering, agriculture,
and others. It uses ultra-narrowband technology (192 kHz wide) for accessing and
communicating through the radio spectrum [13]. The typical data rates achieved in
Sigfox is in the range 100–600 bits per second. A binary phase shift keying (BPSK)
is used for encoding the message transmission by changing the phase of the carrier
waves, where each message is 100 Hz wide. Sigfox in Europe utilizes the 868 and 868.2
MHz spectrum, whereas it uses 902 and 928 MHz elsewhere. As the Sigfox receiver
has to access only a very tiny part of the spectrum for receiving messages, the effects of
noise are significantly reduced. It can even communicate in the presence of jamming
signals, making this standard quite resilient.
Figure 7.19 shows the network architecture of Sigfox. Sigfox has an exciting
message forwarding principle called random access, which ensures the high quality of
services in this standard. Each Sigfox device emits a message at an arbitrary frequency;
Sigfox protocol
TCP/IP
i i
i i
i i
Application
Presentation Application
Host layers
Session
Transport
Frame
Network
Media layers
Medium access
Data link control
Physical Physical
Figure 7.20 The Sigfox protocol stack in comparison to the OSI stack
i i
i i
i i
Check yourself
7.13 LoRa
LoRa or long range is a patented wireless technology for communication developed
by Cycleo of Grenoble, France for cellular-type communications aimed at providing
connectivity to M2M and IoT solutions [14]. It is a sub-GHz wireless technology that
operationally uses the 169 MHz, 433 MHz, 868 MHz, and 915 MHz frequency bands
for communication. LoRa uses bi-directional communication links symmetrically and
a spread spectrum with a 125 kHz wideband for operating. Applications such as
electric grid monitoring are typically suited for utilizing LoRa for communications.
Typical communication of LoRa devices ranges from 15 to 20 km, with support for
millions of devices. Figure 7.21 shows the LoRa network architecture.
3G/Ethernet
TCP/IP
It is a spread spectrum technology with a broader band (usually 125 kHz or more).
LoRa achieves high receiver sensitivity by utilizing frequency-modulated chirp coding
gain. LoRa devices provide excellent support for mobility, which makes them very
i i
i i
i i
useful for applications such as asset tracking and asset management. In comparison
with similar technologies such as NB-IoT, LoRa devices have significantly higher
battery lives, but these devices have low data rates (27 to 50 kbps) and longer latency
times. Figure 7.22 shows the LoRa protocol stack.
LoRa Stack
Application
MAC
PHY
RF
LoRa devices make use of a network referred to as LoRaWAN, which enables the
routing of messages between end nodes and the destination via a LoRaWAN gateway.
Unlike Sigfox, LoRaWAN has a broader spectrum resulting in interference, which
is solved using coding gains of the chirp signals. Additionally, unlike Sigfox, the
LoRaWAN end nodes and the base stations are quite inexpensive. The LoRaWAN
protocol is designed for WAN communications and is an architecture that makes use
of LoRa, whereas LoRa is used as an enabling technology for a wide area network.
Messages transmitted over LoRaWAN is received by all base stations in proximity
to the device, which induces message redundancy in the network. However, this
enhances the resilience of the network by ensuring more messages are successfully
delivered between entities in the network.
A LoRa network follows the star topology and is made up of four crucial
entities: end points/nodes, gateways, network server, and a remote computer
(Figure 7.21). The end nodes deal with all the sensing and control solutions. The
gateways forward messages from end nodes to a backhaul network. The LoRa
network can comprise both or either of wired and wireless technologies. The gateways
themselves are connected to the network server utilizing IP-based connections (either
private or public). The LoRa network server is responsible for scheduling message
acknowledgments, modifying data rates, and removing message redundancies.
Finally, the remote computers have control over the end nodes and act as data sinks
for data originating from these nodes.
The LoRa network security is achieved through various mechanisms such as
unique network key, which ensures security on the network level, unique application
key, which ensures an end-to-end security on the application level and device specific
key.
i i
i i
i i
Check yourself
7.14 NB-IoT
NB-IoT or narrowband IoT is an initiative by the Third Generation Partnership
Project (3GPP) to develop a cellular standard, which can coexist with cellular systems
(2G/3G/4G), be highly interoperable and that too using minimum power [15]. It is
reported that a major portion of the NB-IoT applications can support a battery life
of up to ten years. NB-IoT also boasts of significant improvements in reliability,
spectrum efficiencies, and system capacities. NB-IoT uses orthogonal frequency
division multiplexing (OFDM) modulation, which enhances the system capacity and
increases spectrum efficiency (Figure 7.23). However, device complexities are quite
high. NB-IoT also provides support for security features such as confidentiality,
authentication, and integrity. Figure 7.24 shows the protocol stacks of the various
components of NB-IoT.
GSM band
Guard band
LTE LTE
200 KHz
Figure 7.23 A location of NB-IoT band within the LTE spectrum
i i
i i
i i
UE eNB MME
NAC NAC
RRC RRC
PDCP PDCP
RLC RLC
MAC MAC
PHY PHY
Figure 7.24 The NB-IoT protocol stack with respect to its entities
that the NB-IoT can achieve more extensive coverage while coexisting with cellular
systems.
NB-IoT was developed for non-IP based applications requiring quite small
volumes of daily data transactions, typically in the range of a few tens to a hundred
bytes of data per device daily. Unlike technologies such as Sigfox and LoRa, the use of
OFDM (orthogonal frequency division multiplexing’s) faster modulation rates ensures
higher data handling capacities for NB-IoT.
Check yourself
7.15 Wi-Fi
Wi-Fi or WiFi is technically referred to by its standard, IEEE 802.11, and is a wireless
technology for wireless local area networking of nodes and devices built upon similar
standards (Figure 7.25). Wi-Fi utilizes the 2.4 GHz ultra high frequency (UHF) band
or the 5.8 GHz super high frequency (SHF) ISM radio bands for communication
[16]. For operation, these bands in Wi-Fi are subdivided into multiple channels.
The communication over each of these channels is achieved by multiple devices
simultaneously using time-sharing based TDMA multiplexing. It uses CSMA/CA for
channel access.
Various versions of IEEE 802.11 have been popularly adapted, such as a/b/g/n.
The IEEE 802.11a achieves a data rate of 54 Mbps and works on the 5 GHz band using
OFDM for communication. IEEE 802.11b achieves a data rate of 11 Mbps and operates
on the 2.4 GHz band. Similarly, IEEE 802.11g also works on the 2.4 GHz band but
achieves higher data rates of 54 Mbps using OFDM. Finally, the newest version, IEEE
802.11n, can transmit data at a rate of 140 Mbps on the 5 GHz band.
i i
i i
i i
Wi-Fi Stack
Presentation
Application
Data Link
Transport
Network
Physical
Session
IEE.802.11
Host A Host B
Wireless
router 1
Wireless
router 3 Access point 2
Check yourself
TDMA
i i
i i
i i
7.16 Bluetooth
Bluetooth is defined by the IEEE 802.15.1 standard and is a short-range wireless
communication technology operating at low power to enable communication among
two or more Bluetooth-enabled devices [17]. It was initially developed as a cable
replacement technology for data communication between two or more mobile devices
such as smartphones and laptops. This standard allows the transmission of data as
well as voice-over short distances. Bluetooth functions on the 2.4 GHz ISM band
and has a range of approximately 10 m. The transmission of data is done through
frequency hopping spread spectrum (FHSS), which also reduces the interference
caused by other devices functioning in the 2.4 GHz band. The data is divided into
packets before transmitting them by Bluetooth. The packets are transmitted over the
79 designated channels, each 1MHz wide in the 2.4 GHz band. Adaptive frequency
hopping (AFH) enables this standard to perform 800 hops per second over these
channels. Initial versions of this standard followed Gaussian frequency shift keying
(GFSK) modulation, which was known as the basic rate (BR) mode, and was capable
of data rates of up to 1 Mbps. However, with the development of newer variants,
modulation schemes such as π/4 DQPSK (differential quadrature phase shift keying)
and 8-DPSK (differential phase shift keying) were adopted, which enabled data rates
of 2 Mbps and 3 Mbps respectively.
S S
S Piconet M
M B
S S
S S Piconet S
Scatternet
i i
i i
i i
Application
Serv. Disc.
Telephony
RFComm
LLC
Audio Control
L2CAP
Baseband
PHY Radio
i i
i i
i i
(iii) L2CAP: It multiplexes logical connections between two devices. It is also tasked
with data segmentation, flow control, and data integrity checks.
(iv) Service Discovery Protocol: It is tasked with the discovery of services provided
by other Bluetooth devices.
(v) Radio Frequency Communications: It is a cable replacement protocol, which
generates a virtual stream of serial data. This protocol supports many telephony
related profiles as AT commands and Object Exchange Protocol (OBEX) over
Bluetooth.
(vi) Telephony Control Protocol – Binary (TCS BIN): It is a bit-oriented protocol to
control call signaling prior to initiation of voice or data communications between
devices.
Check yourself
Summary
This chapter covered the various IoT connectivity technologies and their functional
requirements and focuses on those technologies, which primarily rely on wireless
media for communications. The standards covered in this chapter are a mix of the
ones used for general consumer electronics, household devices, as well as speciality
applications such as industries. The connectivity technologies covered here range
from near-field ones to long-range ones. After going through this chapter, readers
will be able to select various connectivity technologies, which will be suitable for an
IoT application or architecture under consideration.
Exercises
(i) What is a piconet?
(ii) What is a scatternet? Explain the working of a scatternet with a brief
description of its various members.
(iii) Describe the protocol stack of Bluetooth.
(iv) What is BLE?
(v) Differentiate between class 1, 2, and 3 Bluetooth devices.
(vi) What are the various modes of operation of Bluetooth?
(vii) Describe the L2CAP layer in Bluetooth.
(viii) Describe the RFCOMM layer in Bluetooth.
i i
i i
i i
i i
i i
i i
i i
i i
i i
References
[1] Sikora, A. and V. F. Groza. 2005. “Coexistence of IEEE 802. 15.4 with Other Systems in the
2.4 GHz-ISM-Band.” In 2005 IEEE Instrumentation and Measurement Technology Conference
Proceedings 3: 1786–1791. IEEE.
[2] 802.15.4-2015 - IEEE Standard for Low-Rate Wireless Networks, IEEE. https://standards.
ieee.org/standard/802 15 4-2015.html.
[3] Zigbee Alliance. https://zigbee.org/zigbee-for-developers/zigbee-3-0/.
[4] Farahani, S. 2011. ZigBee Wireless Networks and Transceivers. Newnes.
[5] Thread. https://www.threadgroup.org/.
i i
i i
i i
i i
i i
i i
Chapter 8
IoT Communication Technologies
Learning Outcomes
After reading this chapter, the reader will be able to:
• List common communication protocols in IoT
• Identify the salient features and application scope of each communication protocol
• Understand the terminologies and technologies in IoT communication
• Determine the requirements associated with each of these communication
protocols in real-world solutions
• Determine the most appropriate communication protocol for their IoT
implementation
8.1 Introduction
Having covered the various connectivity technologies for IoT in the previous chapter,
this chapter specifically focuses on the various intangible technologies that enable
communication between the IoT devices, networks, and remote infrastructures. We
organize the various IoT communication protocols according to their usage into
six groups: 1) Infrastructure protocols, 2) discovery protocols, 3) data protocols,
4) identification protocols, 5) device management protocols, and 6) semantic
protocols. These protocols are designed to enable one or more of the functionalities
and features associated with various IoT networks and implementations such as
routing, data management, event handling, identification, remote management, and
interoperability. Figure 8.1 outlines the distribution of these IoT communication
protocol groups [3].
i i
i i
i i
Infrastructure
Discovery
Remote management
Event handling
Data
Routing
Identification
Device
management
Semantic
Before delving into the various IoT communication protocols, we outline some of
the essential terms associated with IoT networks that are indirectly responsible for the
development of these communication protocols.
i i
i i
i i
i i
i i
i i
significant packet losses, and other similar behavior, which makes it quite unreliable,
and often compromises network stability. However, LLNs have found extensive
use in application areas such as industrial automation and monitoring, building
automation, smart healthcare, smart homes, logistics, environment monitoring, and
energy management.
i i
i i
i i
Destination address
Ipv6 Address notation (128-bit)
Figure 8.2 Differences between IPv4 and IPv6 packets and the IPv6 address notation
size is mainly attributed to the increased number of bits needed for addressing
purposes.
(iii) End-to-End Connectivity: Unlike IPv4, the IPv6 paradigm allows for globally
unique addresses on a significantly massive scale. This scheme of addressing
enables packets from a source node using IPv6 to directly reach the destination
node without the need for network address translations en route (as is the case
with IPv4).
(iv) Auto-configuration: The configuration of addresses is automatically done in
IPv6. It supports both stateless and stateful auto-configuration methods and
can work even in the absence of DHCP (dynamic host configuration protocol)
servers. This mechanism is not possible in IPv4 without DHCP servers.
(v) Faster Packet Forwarding: As IPv6 headers have all the seldom-used optional
fields at the end of its packet, the routing decisions by a router are taken much
faster, by checking only the first few fields of the header.
(vi) Inbuilt Security: IPv6 supports inbuilt security mechanisms (IPSec) that IPv4
does not directly support. IPv4 security measures were attained using separate
mechanisms in conjunction with IPv4. The present-day version of IPv6 has
security as an optional feature.
(vii) Anycast Support: Multiple networking interfacesare assigned the same IPv6
addresses globally; these addresses are known as anycast addresses. This
mechanism enables routers to send packets to the nearest available destination
during routing.
i i
i i
i i
(viii) Mobility Support: IPv6 has one of the essential features that is crucial for IoT
and the modern-day connected applications: mobility support. The mobility
support of IPv6 allows for mobile nodes to retain their IP addresses and remain
connected, even while changing geographic areas of operation.
(ix) Enhanced Priority Support: The priority support system in IPv6 is entirely
simplified as compared to IPv4. The use of traffic classes and flow labels
determine the most efficient routing paths of packets for the routers.
(x) Extensibility of Headers: The options part of an IPv6 header can be extended by
adding more information to it; it is not limited in size. Some applications may
require quite a large options field, which may be comparable to the size of the
packet itself.
IPv6 Addressing
The IPv6 addressing scheme has a crucial component: the interface identifier (IID). IID
is made up of the last 64 bits (out of the 128 bits) in the IPv6 address. IPv6 incorporates
the MAC (media access control) address of the system for IID generation. As a device’s
MAC address is considered as its hardware footprint and is globally unique, the use
of MAC makes IID unique too. The IID is auto-configured by a host using IEEE’s
extended unique identifier (EUI-64) format. Figure 8.2 illustrates the IPv6 addressing
notation. IPv6 supports three types of unicasting: Global unicast address (GUA), link
local address (LL), and unique local address (ULA).
The GUA is synonymous with IPv4’s static addresses (public IP). It is globally
identifiable and uniquely addressable. The global routing prefix is designated by the
first (most significant) 48 bits. The first three bits of this routing prefix is always set
to 001; these three bits are also the most significant bits of this prefix. In contrast,
LLs are auto-configured IPv6 addresses, whose communication is limited to within a
network segment only (under a gateway or a router). The first 16 bits of LL addresses
are fixed and equals FE80 in hexadecimal. The subsequent 48 bits are set to 0. As
these addresses are not routable, the LLs’ scope is restricted to within the operational
purview of a router or a gateway. Finally, ULAs are locally global and unique. They
are meant for use within local networks only. Packets from ULAs are not routed to
the Internet. The first half of an ULA is divided into four parts and the last half is
considered as a whole. The four parts of the first part are the following: Prefix, local
bit, global ID, and subnet ID, whereas the last half contains the IID. ULA’s prefix is
always assigned as FD in hexadecimal (1111 110 in binary). If the least significant bit
in this prefix is assigned as 1, it signifies locally assigned addresses.
i i
i i
i i
IPv6 Communication
An IPv6 configured node starts by sending a router solicitation message to its
network segment; this message is essentially a multicast packet. It helps the node
in determining the presence of routers in its network segment or path. Upon receiving
the solicitation message, a router responds to the node by advertising its presence on
that link. Once discovered, the router is then set as that node’s default gateway. In case
the selected gateway is made unavailable due to any reason, a new default gateway is
selected using the previous steps.
If a router upon receiving a solicitation message determines that it may not be the
best option for serving as the node’s gateway, the router sends a redirect message
to the node informing it about the availability of a better router (which can act as a
gateway) within its next hop.
IPv6 Mobility
A mobile IPv6 node located within its home link uses its home address for routing all
communication to it. However, when the mobile IPv6 node goes beyond its home
link, it has to first connect to a foreign link for enabling communication. A new
IPv6 address is acquired from the foreign link, which is also known as the mobile
node’s care-of-address (CoA). The mobile node now binds its CoA to its home agent (a
router/gateway to which the node was registered in its home segment). This binding
between the CoA and the home agent is done by establishing a tunnel between them.
Whenever the node’s home agent receives a correspondence message, it is forwarded
to the mobile node’s CoA over the established tunnel. Upon receiving the message
from a correspondent node, the mobile node may choose not to reply through its
home agent; it can communicate directly to the correspondent node by setting its
home address in the packet’s source address field. This mechanism is known as route
optimization.
Check yourself
IPv6 header structure, IPv6 extension header types, Neighbor discovery using
IPv6, Mobility in IPv6
i i
i i
i i
8.2.2 LOADng
LOADng stands for Lightweight On-demand Ad hoc Distance vector Routing
Protocol – Next Generation. This protocol is inspired by the AODV (Ad hoc On-
Demand Distance Vector) routing protocol, which is primarily a distance vector
routing scheme [6]. Figure 8.3 illustrates the LOADng operation. Unlike AODV,
LOADng was developed as a reactive protocol by taking into consideration the
challenges of Mobile Ad hoc Networks (MANETs). The LOADng process starts
with the initiation of the action of route discovery by a LOADng router through
the generation of route requests (RREQs), as illustrated in Figure 8.3(a). The router
forwards packets to its nearest connected neighbors, each of which again forwards
the packets to their one-hop neighbors. This process is continued until the intended
destination is reached. Upon receiving the RREQ packet, the destination sends back
a route reply (RREP) packet toward the RREQ originating router (Figure 8.3(b)). In
continuation, route error (RERR) messages are generated and sent to the origin router
if a route is found to be down between the origin and the destination.
2
Source
3 Source
1 2
1 2 3
2
2 3
3
3 PREQ
1 PREQ PREQ_ACK
Destination Destination
(a) Flooding (b) Route discovery
Data
Destination
i i
i i
i i
• Generation of control and signaling traffic in the network only when data is to
be transferred or a route to the destination is down.
Operational Principle
A LOADng router transmits an RREQ over all of its LOADng interfaces whenever
a data packet from a local data source is received by it for transmission to a
destination whose routing entry (a tuple) is not present with it. Figure 8.3(a) shows
the flooding operation, where each LOADng’s forward interfaces are numbered
separately. Considering that it takes three hops to discover the destination from the
source LOADng node, the individual forward interfaces are numbered from 1 to 3.
The RREQ encodes the destination address received from the local source through the
packet. The routing set managing the routing entries at each LOADng router updates
or inserts an entry (with information of the originating address, and the immediate
neighbor LOADng router) upon receiving an RREQ. This also works to enable a
record of the reverse route between the source and destination (Figure 8.3(b)). The
received RREQ initiates the checking of the destination address so that if the packets
are intended for a local interface of a LOADng router, an RREP is sent back using
the reverse route. In case the destination address is not local, it is forwarded to other
LOADng interfaces in a hop-by-hop unicast manner through flooding.
When an RREP is received, it is recorded in the routing entry as the forward
path toward the origin of the RREP along with the LOADng router that forwarded
the message. The route metrics are additionally updated using RREQ and RREP
messages. The LOADng determines the desired metric to be used (Figure 8.3(c)).
Check yourself
8.2.3 RPL
RPL stands for routing protocol for low-power and lossy networks (LLN) and is
designed for IPv6 routing. It follows a distance vector based routing mechanism [7].
The protocol aims to achieve a destination-oriented directed acyclic graph (DODAG).
The network DODAG is formed based on an objective function and a set of network
metrics. The DODAG built by RPL is a logical routing topology which is built over a
physical network. The logical topology is built using specific criteria set by network
administrators. The most optimum path (best path) is calculated from the objective
function, a set of metrics, and constraints. The metrics in RPL may be expected
transmission values (ETX), path latencies, and others. Similarly, the constraints in
RPL include encryption of links, the presence of battery-operated nodes, and others.
In general, the metrics are either minimized or maximized, whereas the constraints
need to be minimized. The objective function dictates the rules for the formation of the
i i
i i
i i
DODAG. Interestingly, in RPL, a single node in the mesh network may have multiple
objective functions. The primary reason for this is attributed to the presence of
different network traffic path quality requirements that need separate addressal within
the same mesh network. Using RPL, a node within a network can simultaneously
join more than one RPL instance (graphs). This enables RPL to support QoS-aware
and constraint-based routing. An RPL node can also simultaneously take on multiple
network roles: leaf node, router, and others. Figure 8.4 shows the RPL mechanism
with different intra-mesh addressing arising due to different requirements of network
and objective functions. The RPL border router, which is also the RPL root (in the
illustrated figure), handles the intra-mesh addressing.
Internet
RPL nodes
End host
Figure 8.4 RPL information flow mechanism with different intra-mesh addressing and paths
RPL Instances
There are two instances associated with RPL: global and local. Global RPL instances
are characterized by coordinated behavior and the possibility of the presence of more
than one DODAG; they have a long lifetime. In contrast, local RPL instances are
characterized by single DODAGs. The local RPL DODAG’s root is associated directly
with the DODAG-ID. The RPL instance ID is collaboratively and unilaterally allocated;
it is divided between global and local RPL instances. Even the RPL control and data
i i
i i
i i
messages are tagged with their corresponding RPL instances using RPL instance IDs
to avoid any ambiguity in operations.
Check yourself
8.2.4 6LoWPAN
6LoWPAN allows low power and constrained devices/nodes to connect to the
Internet. 6LoWPAN stands for IPv6 over low power wireless personal area networks.
As the name suggests, it enables IPv6 support for WPANs, which are limited
concerning power, communication range, memory, and throughput [8]. 6LoWPAN
is designed to be operational and straightforward over low-cost systems, and extend
IPv6 networking capabilities to IEEE 802.15.4-based networks. Popular uses of this
protocol are associated with smart grids, M2M applications, and IoT. 6LoWPAN
allows constrained IEEE 802.15.4 devices to accommodate 128-bit long IPv6 addresses.
This is achieved through header compression, which allows the protocol to compress
and retro-fit IPv6 packets to the IEEE 802.15.4 packet format.
6LoWPAN networks can consist of both limited capability (concerning throughput,
processing, memory, range) devices—called reduced function devices (RFD)—and
devices with significantly better capabilities, called full function devices (FFD). The
RFDs are so constrained that for accessing IP-based networks, they have to forward
their data to FFDs in their personal area network (PAN). The FFDs yet again forward
the forwarded data from the RFD to a 6LoWPAN gateway in a multi-hop manner. The
gateway connects the packet to the IPv6 domain in the communication network. From
here on, the packet is forwarded to the destination IP-enabled node/device using
regular IPv6-based networking.
6LoWPAN Stack
The 6LoWPAN stack rests on top of the IEEE 802.15.4 PHY and MAC layers, which are
generally associated with low rate wireless personal area networks (LR-WPAN). The
choice of IEEE 802.15.4 for the base layer makes 6LoWPAN suitable for low power
LR-WPANs. The network layer in 6LoWPAN enabled devices (layer 3) serves as
an adaptation layer for extending IPv6 capabilities to IEEE 802.15.4 based devices.
Figure 8.5 shows the 6LoWPAN packet structure.
• PHY and MAC layers: The PHY layer consists of 27 wireless channels, each
having their separate frequency band and varying data rates. The MAC layer
defines the means and methods of accessing the defined channels and use them
for communication. The 6LoWPAN MAC layer is characterized by the following:
i i
i i
i i
Pan ID
IEEE 802.154
Destination ID (64-bits)
Source ID (64-bits)
(i) Beaconing tasks for device identification. These tasks include both beacon
generation and beacon synchronization.
(ii) Channel access control is provided by CSMA/CA.
(iii) PAN membership control functions. Membership functions include
association and dissociation tasks.
i i
i i
i i
only 13 octets available at the disposal of the upper layers and the data itself. The
6LoWPAN adaptation layer between the MAC and the network layers takes care
of these issues through the use of header compression, packet forwarding, and
packet fragmentation.
64-bits 16-bits
Extended Short
• Address Format: The 6LoWPAN address format is made up of two parts: 1) the
short (16-bit) address and 2) the extended (64-bit) address. The short address is
PAN specific and is used for identifying devices within a PAN only, which makes
its operational scope highly restricted and valid within a local network only. In
contrast, the globally unique extended address is valid globally and can be used
to identify devices, even outside the local network uniquely. Figure 8.6 illustrates
the 6LoWPAN address format.
Check yourself
i i
i i
i i
Dispatch header
0 1 Dispatch Type specific header
8.2.5 QUIC
Quick UDP internet connections (QUIC) was developed (and still undergoing
developments) to work as a low-latency and independent TCP connection [9]. The
aim behind the development of this protocol is to achieve a highly reduced latency
(almost zero round-trip-time) communication scheme with stream and multiplexing
support like the SPDY protocol developed by Google. Figure 8.8 illustrates the
differences between the positions of the various functionalities in QUIC and regular
HTTP protocols.
Http/2 QUIC
IP IP
i i
i i
i i
The connection latency in QUIC is reduced by reducing the number of round trips
incurred during connection establishment in TCP, such as those for handshaking,
data requests, and encryption exchanges. This is achieved by including session
negotiation information in the initial packet itself. The QUIC servers further enhance
this compression by publishing a static configuration record corresponding to the
connections. Clients synchronize connection information through cookies received
from QUIC servers.
TCP request
1 2 3 4 5 6
Data transfer
UDP request
1 2
3
4 5 6
Data transfer
Quic server Quic client
Figure 8.9 Differences between stream of packets over HTTP and QUIC protocols
QUIC uses advanced techniques such as packet pacing and proactive speculative
retransmission to avoid congestion. Proactive speculative retransmission sends copies
of most essential packets, which contain initial negotiation for encryption and error
correction. The additional speedup is achieved using compression of data such as
headers, which are generally redundant and repetitive. This feature enables QUIC
connections to make multiple secured requests within a single congestion window,
which would not have been possible using TCP. Figure 8.9 shows the difference in
regular streaming of packets over HTTP and the improved performance of HTTP-
over-QUIC during packet streaming. The use of UDP and multiple transmission
paths significantly speeds up the performance of streaming over QUIC as compared
to regular HTTP-based packet streaming.
Check yourself
i i
i i
i i
uIP
Socket API
UDP TCP
ICMP
Iov6 LoWPAN
The main highlighting features of uIP, which makes it stand out from other IP-
based protocols are as follows:
• The software interface of uIP does not require any operating system for working,
making it quite easy to integrate with small computers.
• When called in a timed loop within the embedded system, it also manages all the
network behavior and connection retries.
• The hardware driver for the uIP is responsible for packet builds, packet sending;
it may also be used for response reception for the packets sent.
• uIP uses a minimal packet buffer (packet buffer = 1) in contrast to normal IP
protocol stacks. This makes uIP suitable for low-power operations.
• The packet buffer is used in a half-duplex manner so that the same buffer can be
repurposed for use in transmission and reception.
• Unlike regular TCP/IP protocols, uIP does not store data in buffers in case there
is a need for retransmission. In the event of retransmission of packets, the
previous data has to be reproduced and is recalled from the application code
itself.
• Unlike conventional IP-based protocols, where a task is dedicated for each
connection to a distant networked device/node, uIP stores connections in an
array, and serves each connection sequentially through subroutine calls to the
application for sending data.
i i
i i
i i
Check yourself
Nano-TCP
8-bits 8-bits 8-bits
Flags (FIN | SYN | ACK) Length
Acknowledgment
Payload (n-Bytes)
Nano-UDP
8-bits 8-bits 8-bits
Flags (FIN | SYN | ACK) Length
Payload
i i
i i
i i
Check yourself
Cache
Content
Cache
Untrusted Data
User
Content server
Trusted content
server
i i
i i
i i
Check yourself
Points to ponder
i i
i i
i i
ent
trol
m
Con
Pay
A
dv
er s
ate
ti
d
se
Up
m
en
st
Notif /status
icatio
ns Alerts
Physical Status
alerts web
Status
Al
ert Internet
s
ts
r
le
A
Occupancy
Sta
tus
The physical web broadcasts a list of URLs within a short radius around it so that
anyone in range can see the available URLs and interact with them. This paradigm
is primarily built upon Bluetooth low energy (BLE), which is used to broadcast the
content as beacons. The primary requirement of any supporting beacons to function in
the physical web and broadcast URLs is their ability to support the Eddystone protocol
specification. BLE was primarily chosen for the physical web due to its ubiquity,
efficiency, and extended battery life of several years.
URLs are one of the core principles of the web and can be either flexible or
decentralized. These URLs allow any application to have a presence on the web
and enables the digital presence of an object or thing. As of now, physical web
deployments have been undertaken in public spaces, and any device with a physical
web scanner can detect these URLs. The use of URLs extends the benefits of the
web security model to the physical web. Features such as secured login, secure
communication over HTTPS (HTTP over secure socket layer), domain obfuscation,
and others can be easily integrated with the physical web.
Check yourself
i i
i i
i i
Check yourself
i i
i i
i i
Vendor specification
Forum specification
Device arcchitecture
GENA
GENA SSDP SSDP
Soap (Events)
(Control)
HTTPU
HTTMU (Discovery) HTTP
(Discovery)
HTTP
(Discovery)
UDP TCP
Internet protocol
reason for this is because, on a large scale, the cost of this solution would be infeasible
from an operational point of view.
Check yourself
Multicasting, Unicasting
i i
i i
i i
8.4.1 MQTT
Message queue telemetry transport or MQTT is a simple, lightweight publish–
subscribe protocol, designed mainly for messaging in constrained devices and
networks [16]. It provides a one-to-many distribution of messages and is payload-
content agnostic. MQTT works reliably and flawlessly over high latency and limited
bandwidth of unreliable networks without the need for significant device resources
and device power. Figure 8.15 shows the working of MQTT. The MQTT paradigm
consists of numerous clients connecting to a server; this server is referred to as a
broker. The clients can have the roles of information publishers (sending messages
to the broker) or information subscribers (retrieving messages from the broker). This
allows MQTT to be largely decoupled from the applications being used with MQTT.
Refrigerator
Thermometer
Control panel
Lights
Topics
Topics
Microwave
Tablet
Television Smart
Appliances home
Operational Principle
MQTT is built upon the principles of hierarchical topics and works on TCP for
communication over the network. Brokers receive new messages in the form of
topics from publishers. A publisher first sends a control message along with the
data message. Once updated in the broker, the broker distributes this topic’s content
to all the subscribers of that topic for which the new message has arrived. This
paradigm enables publishers and subscribers to be free from any considerations of
the address and ports of multiple destinations/subscribers or network considerations
of the subscribers, and vice versa. In the absence of any subscribers of a topic, a broker
normally discards messages received for that topic unless specified by the publisher
otherwise. This feature removes data redundancies and ensures that maximally
updated information is provided to the subscribers. It also reduces the requirements
of storage at the broker. The publishers can set up default messages for subscribers in
agreement with the broker if the publisher’s connection is abruptly broken with the
broker. This arrangement is referred to as the last will and testament feature of MQTT.
i i
i i
i i
i i
i i
i i
Check yourself
8.4.2 MQTT-SN
The primary MQTT protocols heavily inspire MQTT for sensor networks or MQTT-
SN; however, the MQTT-SN is robust enough to handle the requirements and
challenges of wireless communications networks in sensor networks [17]. Typical
features of MQTT-SN include low bandwidth usage, ability to operate under high
link failure conditions; it is suitable for low-power, low-cost constrained nodes and
networks. The major differences between the original MQTT and MQTT-SN include
the following:
• The CONNECT message types are broken into three messages in which two are
optional and are tasked with the communication of the testament message and
testament topic to the broker.
• The topic name in the PUBLISH messages are replaced by topic identifiers, which
are only 2 bytes long. This reduces the traffic generated from the protocol and
enables the protocol to operate over bandwidth-constrained networks.
• A separate mechanism is present for topic name registration with the broker in
MQTT-SN. After a topic identifier is generated for the topic name, the identifier
is informed to the publisher/subscribers. This mechanism also supports the
reverse pathway.
• In special cases in MQTT-SN, pre-defined topic identifiers are present that need
no registration mechanism. The mapping of topic names and identifiers are
known in advance to the broker as well as the publishers/subscribers.
• The presence of a special discovery process is used to link the
publisher/subscriber to the operational broker’s network address in the absence
of a preconfigured broker address.
• The subscriptions to a topic, Will topic, and Will message are persistent in MQTT-
SN. The publishers/subscribers can modify their Will messages during a session.
• Sleeping publishers/subscribers are supported by a keep-alive procedure, which
is offline, and which helps buffer the messages intended for them in the broker
until they wake up. This feature of MQTT-SN is not present in regular MQTT.
i i
i i
i i
Figure 8.16 shows the two gateway types in MQTT-SN: 1) the transparent gateway and
2) the aggregating gateway. The MQTT-SN converts/translates MQTT and MQTT-
SN traffic by acting as a bridge between these two network types. The transparent
gateway (Figure 8.16(a)) creates as many connections to the MQTT broker as there are
MQTT-SN nodes within its operational purview; whereas the aggregating gateway
(Figure 8.16(b)) creates a single connection to the MQTT broker, irrespective of the
number of MQTT-SN nodes under it.
SN-2 SN-2
SN-3 SN-3
SN-4 SN-4
Transparent Transparent
gateway gateway
SN-5 SN-5
8.4.3 CoAP
The constrained application protocol, or CoAP as it is more popularly known, is
designed for use as a web transfer protocol in constrained devices and networks,
which are typically low power and lossy [18]. The constrained devices typically have
minimal RAM and an 8-bit processor at most. CoAP can efficiently work on such
devices, even when these devices are connected to highly lossy networks with high
packet loss, high error rates, and bandwidth in the range of kilobits.
CoAP follows a request–response paradigm for communication over these lossy
networks. Additional highlights of this protocol include support for service discovery,
resource discovery, URIs (uniform resource identifier), Internet media handling
support, easy HTTP integration, and multicasting support, that too while maintaining
low overheads. Typically, CoAP implementations can act as both clients and servers
(not simultaneously). A CoAP client’s request signifies a request for action from
an identified resource on a server, which is similar to HTTP. The response sent by
the server in the form of a response code can contain resource representations as
well. However, CoAP interchanges are asynchronous and datagram-oriented over
UDP. Figure 8.17 shows the placement of CoAP in a protocol stack. Packet traffic
i i
i i
i i
collisions are handled by a logical message layer incorporating the exponential back-
off mechanism for providing reliability. The reliability feature of CoAP is optional. The
two seemingly distinct layers of messaging (which handle the UDP and asynchronous
messaging) and request-response (which handles the connection establishment) are
part of the CoAP header.
CoAP
Constrained PHY
Internet protocol
Application
UDP
Messages
Request
CoAP Features
The CoAP is characterized by the following main features:
(i) It has suitable web protocol for integrating IoT and M2M services in constrained
environments with the Internet.
(ii) CoAP enables UDP binding and provides reliability concerning unicast as well
as multicast requests.
(iii) Message exchanges between end points in the network or between nodes is
asynchronous.
(iv) The limited packet header incurs significantly lower overheads. This also results
in less complexity and processing requirements for parsing of packets.
(v) CoAP has provisions for URI and other content-type identifier support. CoAP
additionally provides DTLS (datagram transport layer security) binding.
(vi) It has a straightforward proxy mechanism and caching capabilities, which is
responsible for overcoming the effects of the lossy network without putting extra
constraints on the low-power devices. The caching is based on the concept of the
maximum age of packets.
(vii) The protocol provides a stateless mapping with HTTP. The server or receiving
node does not retain information about the source of the message; rather, it is
expected that the message packet carries that information with it. This enables
CoAP’s easy and uniform integration with HTTP.
i i
i i
i i
CoAP Messaging
CoAP defines four messaging types: 1) Confirmable (CON), 2) non-confirmable
(NON), 3) acknowledgment (ACK), and 4) reset. The method codes and the response
codes are included in the messages being carried. These codes determine whether the
message is a request message or a response message. Requests are typically carried
in confirmable and non-confirmable message types. However, responses are carried
in both of these message types as well as with the acknowledgment message. The
transmission of responses with acknowledgment messages is known as piggybacking
and is quite synonymous with CoAP.
Operational Principle
CoAP is built upon the exchange of messages between two or more UDP end
points. Options and payload follow the compact 4-byte binary header in CoAP. This
arrangement is typical of request and response messages of CoAP. A 2-byte message
ID is used with each message to detect duplicates.
Whenever a message is marked as a CON message, it signifies that the message
is reliable. In the event of delivery failure of a CON message, subsequent retries are
attempted with exponential back-off until the receiving end point receives an ACK
with the same message ID (Figure 8.18). In case the recipient does not have the
resources to process the CON message, a RESET message is sent to the originator of
the CON message instead of an ACK message.
Piggybacking
message
Separate
CON(Data)
Non-
NON CON
ACK+Data ACK
A B C
Figure 8.18 Various CoAP response–response models. (A): CON and NON messages, (B):
Piggyback messages, and (C): Separate messages
i i
i i
i i
NON or CON response from a server, based on the settings and semantics of the
application. If the receiver of the NON cannot process the message, a RESET message
is sent to the originator of the NON message.
If a server fails to respond immediately to a request received by it in a
CON message, an empty ACK response is sent to the requester to stop request
retransmissions. Whenever the response is ready, a new CON message is used to
respond to the previous request by the client. Here, the client then has to respond to
the server using an ACK message. This scheme is known as a separate response (Figure
8.18).
The multicast support of CoAP over UDP results in multicast CoAP requests. The
request and response semantics of CoAP is carried in the form of method and response
codes in the CoAP messages itself. The options field of CoAP carries information
about the requests and responses such as URI and MIME (multipurpose Internet mail
extensions). The concept of tokens is used to match requests with their corresponding
responses. The need for a token mechanism arose due to the asynchronous nature
of the CoAP messaging. Similar to HTTP, CoAP uses GET, PUT, POST, and DELETE
methods.
Check yourself
8.4.4 AMQP
AMQP or the advanced message queuing protocol is an open standard middleware
at the application layer developed for message-oriented operations [19]. It tries to
bring about the concept of interoperability between clients and the server by enabling
cross-vendor implementations. Figure 8.19 shows the various components of AMQP
and their relationships. An AMQP broker is tasked with maintaining message queues
between various subscribers and publishers. The protocol is armed with features of
message orientation, queuing, reliability, security, and routing. Both request–response
and publish–subscribe methods are supported. AMQP is considered as a wire-level
protocol. Here, the data format description is released on the network as a stream
of bytes. This description allows AMQP to connect to anyone who can interpret and
create messages in the same format. It also results in a level of interoperability where
anyone with compliant or supporting means can make use of this protocol without
any need for a specific programming language.
AMQP Features
AMQP is built for the underlying TCP and is designed to support a variety of
messaging applications efficiently. It provides a wide variety of features such as
flow-controlled communication, message-oriented communication, message delivery
i i
i i
i i
Subscriber
AMQP
broker
Publisher
Subscriber
Publisher
Subscriber
Publisher
Subscriber
Queues
guarantees (at most once, at least once, and exactly once), authentication support, and
an optional SSL or TLS based encryption support. The AMQP is specified across four
layers: 1) type system, 2) process to process asynchronous and symmetric message
transfer protocol, 3) extensible message format, and 4) set of extensible messaging
capabilities. In continuation, the primary unit of data in AMQP is referred to as a
frame. These frames are responsible for the initiation of connections, termination of
connections, and control of messages between two peers using AMQP. There are nine
frame types in AMQP:
Operational Principle
The workings of AMQP revolve around the link protocol. A new link is initiated
between peers that need to exchange messages by sending an ATTACH frame. A
DETACH frame terminates the link between peers. Once a link is established,
unidirectional messages are sent using the TRANSFER frame. Flow control is
maintained by using a credit-based flow-control scheme, which protects a process
from being overloaded by voluminous messages. Every message transfer state has
to be mutually settled by both the sender and the receiver of the message. This
i i
i i
i i
settlement scheme ensures reliability measures for messaging in AMQP. Any change
in state and settlement of transfer is notified using a DISPOSITION frame. This allows
for the implementation of various reliability guarantees. A session can accommodate
multiple links in both directions. Unlike the link, a session is bidirectional and
sequential. Upon initiation with a BEGIN frame, a session enables a conversation
between peers. The session is terminated using an END frame. Multiple logically
independent sessions can be multiplexed between peers over a connection. The OPEN
frame initiates a connection and the connection is terminated by using a CLOSE frame.
8.4.5 XMPP
The extensible messaging and presence protocol, or XMPP, which was initially
named as Jabber, is designed for message-oriented middlewares based on the
extensible markup language (XML) [20]. XMPP was developed for instant messaging,
maintenance of contacts, and information about network presence. Structured and
extensible data between two networked nodes/devices can be exchanged in near real-
time using this protocol. XMPP has found use in VOIP (voice-over Internet protocol)
presence signaling, video and file transfers, smart grid, social networks, publish–
subscribe systems, IoT applications, and others. The protocol, being open-source, has
enabled a spurt of developments in various freeware as well as commercial messaging
software. As XMPP follows a client–server architecture, peers in a network cannot talk
directly to one another through XMPP. All communication between peers has to be
routed through an XMPP server. The XMPP model is considered to be decentralized
as anyone can host an XMPP server to which various clients can subscribe. Figure 8.20
shows the basic communication between the various XMPP stakeholders.
Client-1A Client-1A
Client-1B Client-1B
Server Server
Client-1C Client-1C
XMPP
Client-1D Client-1D
Plug-ins Componet
i i
i i
i i
Operational Principle
A unique XMPP address, which is also referred to as a Jabber ID (JID), is assigned
to every user on the network. The JID, similar to an email address, has a username
and a domain name ([email protected]). The domain name is mostly the IP address
of the server hosting the XMPP service. XMPP allows its users to login from multiple
devices by means of specifying resources. The resource is used to identify a user’s
clients/devices (home, mobile, work, laptop, and others), which is generally included
in the JID by appending the JID with the resource name separated by a slash. A typical
JID looks like this: [email protected]/resource. Resources are prioritized using
numerical IDs. Any message arriving at the default JID (without resource name) is
forwarded to the resource with the highest priority (largest numerical ID value). Often
JIDs without usernames are used for specific control messages and system messages,
which are meant for the server. The use of JID in this mode—without an explicit IP
address—allows XMPP to be used as an overlay network on top of multiple underlay
networks.
XMPP Technologies
XMPP is an extensible, flexible, and diverse protocol; it has resulted in the
development of a significant number of technologies based on it. Some key XMPP
technologies include the following:
• Core: It deals with information about the core XMPP technologies for XML
streaming over a network. The core includes the base XML layer for streaming,
provides TLS-based encryption, imparts simple authentication and security
layer (SASL) based authentication, informs about the availability of a network,
provides UTF-8 support, and contact lists, which are presence enabled.
• Jingle: This provides session initiation protocol (SIP)-compatible multimedia
signaling for voice, video, file transfer, and other applications. Various media
transfer protocols such as TCP, UDP, RTP, or even in-band XMPP is supported.
The Jingle session initiation signal is sent over XMPP, and the media transfer
takes place in a peer-to-peer manner or over media relays.
• Multi-user chat: MUC is a flexible, multiparty communication exchange
extension for XMPP. Here multiple users can exchange information in a chat
room or channel. Support for strong chat room controls is also provided, which
enables the banning of users and updation of chat room moderators.
• Pub–sub: This provides publish–subscribe functionality to XMPP by proving
alerts and notifications for data syndication, vibrant presence, and more such
features. Pub–sub enables XMPP clients to create topics at a pub–sub service and
publish/subscribe to them.
• BOSH: It stands for bidirectional streams over synchronous HTTP. This is an
HTTP binding for XMPP (and other) traffic. BOSH incurs lower latencies and
lesser network bandwidth usage by doing away with HTTP polling. It is mainly
used for the XMPP traffic exchange between clients and servers.
i i
i i
i i
Check yourself
8.4.6 SOAP
SOAP or simple object access protocol is used for exchanging structured information
in web services by making use of XML information set formatting over the application
layer protocol (HTTP, SMTP) based transmission and negotiation of messages, as
shown in Figure 8.21 [21]. This allows SOAP to communicate with two or more
systems with different operating systems using XML, making it language and
platform independent. The use of SOAP facilitates the messaging layer of the web
services protocol stack.
Endpoint URI
Application
A SOAP application can send a request with the requisite search parameters to a
server with web services enabled. The target server responds in a SOAP response
format with the results of the search. The response from the server can be directly
integrated with applications at the requester’s end, as it is already in a structured and
parsable format. Figure 8.22 illustrates the basic working of SOAP.
SOAP is made up of three broad components: 1) Envelope (which defines
the structure of the message and its processing instructions), 2) encoding rules
(which handles various datatypes arising out of the numerous applications), and
3) convention (which is responsible for web procedure calls and their responses).
This messaging protocol extends the features of neutrality (can operate over any
application layer protocol), independence (independent of programming models),
and extensibility (features such as security and web service addressing can be
extended) to its services. The use of SOAP with HTTP-based request–response
exchanges does not require the modification of the communication and processing
infrastructures. It can easily pass through network/system firewalls and proxies
(similar to tunneling), as illustrated in Figure 8.22. However, the use of XML affects the
i i
i i
i i
Firewall Firewall
Internet
Request Response
parsing speed and hence, the performance of this protocol. Additionally, the verbose
nature of SOAP is not recommended for use everywhere. The specifications of the
SOAP architecture are defined across several layers, such as message format layer,
message exchange patterns (MEP) layer, transport protocol binding layer, message
processing model layer, and protocol extensibility layer.
Check yourself
8.4.7 REST
Representational state transfer or REST encompasses a set of constraints for the
creation of web services, mainly using a software architectural style [22]. The web
services adhering to REST styles are referred to as RESTful services; these services
enable interoperability between various Internet-connected devices. RESTful systems
are stateless: the web services on the server do not retain client states. The use
of stateless protocols and standards makes RESTful systems quite fast, reliable, and
scalable. The reuse of components can be easily managed without hindering the
regular operations of the system as a whole. Requesting systems can manipulate
textual web resource representations by making use of this stateless behavior of REST.
RESTful web services, in response to requests made to a resource’s URI, mainly
responds with either an HTML, XML, or JSON (JavaScript Object Notation) formatted
payload. As RESTful services use HTTP for transfer over the network, the following
four methods are commonly used: 1) GET (read-only access to a resource), 2) POST
(for creating a new resource), 3) DELETE (used for removing a resource), and 4) PUT
(used for updating an existing resource or creating a new one). Figure 8.23 represents
the REST style and its components.
i i
i i
i i
Resource URI
Application
i i
i i
i i
Check yourself
8.4.8 WebSocket
Websocket is an IETF (Internet Engineering Task Force)-standardized full-duplex
communication protocol. Websockets (WS), an OSI layer seven protocol, enables
reliable and full-duplex communication channels over a single TCP connection [23].
Figure 8.24 shows the position of a websocket layer in a stack. The WS relies on the
OSI layer 4 TCP protocol for communication. Despite being different from the HTTP
protocol, WS is compatible with HTTP and can work over HTTP ports 80 and 443,
enabling support for network mechanisms such as the HTTP proxy, which is usually
present during organizational Internet accesses through firewalls.
WS enables client–server interactions over the Web. Web servers and clients
such as browsers can transfer real-time data between them without incurring many
overheads. Upon establishment of a connection, servers can send content to clients
without the clients requesting them first. Messages are exchanged over the established
connection, which is kept open, in a standardized format. Support for WS is present in
almost all modern-day browsers; however, the server must also include WS support
for the communication to happen.
The full-duplex communication provided by WS is absent in protocols such as
HTTP. Additionally, the use of TCP (which supports byte stream transfers) is also
i i
i i
i i
Application layer
Websockets
Transport layer
Network layer
enhanced by enabling it to provide message stream transfers using WS. Before the
emergence of WS, comet channels were used for attaining full-duplex communication
over port 80. However, comet systems were very complicated and incurred significant
overheads, which made their utility limited for constrained application scenarios
mainly associated with IoT.
Websocket (WS) and websocket secure (WSS) have been specified as uniform
resource identifier (URI) schemes in the WS specification, which are meant for
unencrypted and encrypted connections, respectively. The WS handshake process and
the frames can be quickly inspected using browser development tools.
Operational Principle
A client initiates the WS connection process by sending a WS handshake request. In
response, a WS server responds with a WS handshake response. As the servers have
to incorporate both HTTP and WS connections on the same port, the handshaking
is initiated by an HTTP request/response mechanism. Upon establishment of a
connection between the client and server, the WS communication separates as a bi-
directional protocol that is non-conformant with the HTTP protocol. The WS client
sends an update header and a sec-websocket-Key header, which contains base64 encoded
random bytes. The server responds to the client’s request using a hash of the key
included in the Sec-WebSocket-Accept header. This allows the WS to overcome a caching
proxy’s efforts to resend previous WS communication. A fixed string, 258EAFA5-
E914-47DA-95CA-C5AB0DC85B11, is appended to the undecoded value from the Sec-
WebSocket-Key header by a hashing function using the SHA (secure hash algorithm)-1,
which is finally encoded using base64 encoding. Once the WS full-duplex connection
is established, minimally framed data (small header and a payload), which may be
data or text, can be exchanged. The WS transmissions or messages can be further split
i i
i i
i i
into multiple data frames whenever the full message length is not available during
message transfer. This feature is occasionally exploited to include/multiplex several
simultaneous streams, using simple extensions to the WS protocol. This multiplexing
avoids the monopoly of a single large payload over the WS port.
Check yourself
8.5.1 EPC
EPC or the electronic product code identification system was designed to act as a
universal identifier and provide unique identities accommodating all physical objects
in the world [24]. The open standard and free EPCglobal Tag Data Standard defines
the EPC structure. The official representation of EPC is an URI (uniform resource
identifier) and referred to as the pure identity URI. Figure 8.25 illustrates the standard
EPC representation. This representation is used for referring to physical objects in
communicating information and business systems and application software. The
standard also defines representations for EPC identifiers: tag encoding URI formats
and formats for binary EPC identifier storage. In systems such as passive RFIDs that
generally have low memory, the EPC binary identifier storage format plays a crucial
role. The standard also provides EPC encoding and decoding rules to use URI and
binary formats interchangeably seamlessly. Being a very flexible framework, external
support for various coding schemes such as those used with barcodes is also possible
with EPC. The EPC standard currently supports EPC identifiers, general identifiers,
and seven types of identification keys from the GS1 Application Identifier system. As
the EPC is not designed to be restricted for use only with RFID data carriers, the data
carrier technology-agnostic behavior of EPC is further enhanced by the pure identity
URI.
i i
i i
i i
8.5.2 uCode
Another identification number system, the uCode is designed to uniquely identify
real-world things and objects whose information is digitally associated with the uCode
system [25]. The uID center in Japan provides support for the uCode system. The
uCode system can be used with any application, business processes, and technology
(RFID, barcodes, matrix codes). uCode is application and technology independent and
uses 128 bit codes for uniquely tagging/naming physical objects. The uCode provides
3.4 × 1038 unique codes for individually tagging objects. These features make uCode a
crucial enabling technology for IoT. Figure 8.26 represents the working of uCode tags
and its various stakeholders.
The uCode tags are generally grouped into five categories: 1) print tags, 2) acoustic
tags, 3) active RF tags, 4) active infrared tags, and 5) passive RFID tags. In contrast to
other identification systems, the uCode system has the following distinct features:
(i) It does not display product types, albeit it identifies individual objects. Existing
codes identify products by individual vendors, making the possibility of
identifier tag reuse a possibility, which is avoided in the uCode system.
(ii) In addition to physical objects, the uCode can be associated with places,
concepts, and contents, enabling this system to identify such items universally.
(iii) Being application and business agnostic, the uCode system can be used across
industries and organizations. The system provides a unique identification
number, which does not carry any meaning or information about the tagged
object/item. This enables the same system to be used seamlessly across
organizations, industries, and product types.
(iv) uTRON, a ubiquitous security framework, which is incorporated with the
ubiquitous ID architecture of the uCode system, makes it entirely secure and
enables information privacy protection.
(v) The tag agnostic nature of the uCode system makes it possible for various
systems such as RFIDs, and barcodes to store uCode information. This makes
uCode highly ubiquitous and pervasive.
(vi) The uCode represents pure numbers and is devoid of any meaning or
information related to the tagged item/object. This makes the reassignment of
uCode tags quite robust and straightforward.
i i
i i
i i
TLD servers are located around the globe in Japan, Finland, and a few other countries.
Finally, the second level domain (SLD) is at the bottom of the hierarchy, below the
TLD. The TLD and SLD servers are not restricted and can be added to the existing
system.
Check yourself
8.5.3 URIs
One of the most common identifiers in use is the uniform resource identifier (URI).
[26] The URI is used to identify individual resources only by using character strings
distinctly. As with other protocols, the uniformity of this protocol is ensured by an
agreed-upon set of syntax rules. These rules also allow for extensibility through the
incorporation of separate hierarchical naming schemes such as “http://”. URIs enable
interaction with network-based resource representations through specific protocols,
especially over the WWW. Some terms commonly derived from URIs are URLs and
URNs. URLs or uniform resource locators are very commonly encountered during
resource search over the Web or a network. URLs are generally referred to as web
addresses and specify the location as well as the access mechanism for a remote
resource. For example, “http://www.abc.xz/home/index” denotes the location of
the resource at “/home/index”, which is hosted at the domain “www.abc.xz”, and
can be accessed using HTTP. A less encountered form of URIs is the uniform resource
name (URN), which identifies resources in particular namespaces only. URNs were
initially designed to complement URLs. However, unlike URLs, URNs only identify
resources and do not provide the location or method to access the identified resource.
Figure 8.27 shows the typical URI format.
http://www.examplelink.com/directory_path/target_file.html
i i
i i
Check yourself
Difference between URI, URL, and URN, Advantages of URI over URL and URN,
Limitations of URI
8.6.1 TR-069
Owing to the rising need for remote management of customer premises equipment
(CPE), the Broadband Forum defined the technical specifications for the application
layer protocol for CPE over IP networks; these specifications are referred to as
Technical Report 069 or TR-069 [27]. The TR-069 mainly focuses on the auto-
configuration of Internet-connected devices using auto configuration servers (ACS).
Within the premises of this report, the CPE WAN management protocol (CWMP)
outlines the various support functions for CPE, which encompasses software and
firmware management, status and performance report management, diagnostics, and
auto-configuration. CWMP, a primarily SOAP/HTTP-based bi-directional protocol,
which is also text-based, provides communication and management support between
CPE and servers within a single framework. Devices connecting over the Internet
such as routers, gateways, and end devices such as set-top boxes and VoIP devices
fall under its purview. Figure 8.28 shows the main components of TR-069 and their
relations between each other.
i i
(i) The commands between the CPE and the ACS during provisioning sessions are
either HTTP or HTTPS based, where the ACS is the server and the CPE are
clients.
(ii) The provisioning session is responsible for the communications and operations
between the CPE and ACS.
(iii) Session initiation is performed by the CPE through an “inform” message, to
which the ACS indicates its readiness using an “inform response”.
(iv) In the subsequent stage, the CPE transmits orders to the ACS, which is invoked
using a “transfer complete” message. An empty HTTP-request completes the
transmission from the CPE to the ACS.
(v) In response to the empty HTTP request, the HTTP response from the ACS to the
device contains a CWMP (CPE WAN management protocol) request. An empty
HTTP-response from the ACS indicates the completion of pending orders.
(vi) Information security during transmission (login, password, and others) is
handled using HTTPS and ACS certificate verification. Authentication of CPE
is done based on a shared secret key between the CPE and ACS.
(vii) A time limit of 30 seconds is imposed on the start of the provisioning session
after receiving device confirmation.
Points to ponder
The use of TR-069 for remote management of home networked devices and
terminals is endorsed by various forums such as Home Gateway Initiative (HGI),
Digital Video Broadcasting (DVB), and WiMAX Forum.
Check yourself
8.6.2 OMA-DM
The open mobile alliance (OMA) device management (DM) protocol is specified
by the OMA working group and the data synchronization (DS) working group for
remote device management of mobile devices, including mobile phones and tablets
[28]. The management functions include provisioning, device configuration, software
upgrades, fault management, and others. On the device end, any or all of these
features may be implemented. The OMA-DM specification is designed for constrained
devices with limited bandwidth, memory, storage, and processing. Data exchanges
i i
i i
i i
take place through SyncML, which is a subset of XML. OMA-DM supports both wired
as well as wireless data transport (USB, RS-232, GSM, CDMA, Bluetooth, and others)
over transport layer protocols such as WAP, HTTP, or OBEX.
Operations
Device specific interface
XML representation
DM tree
Parameters
Resources
Protocols
Transport layer
Physical layer
Client DM server
Check yourself
i i
i i
i i
8.7.1 JSON-LD
JavaScript object notation for linked data or JSON-LD is a lightweight protocol,
which is designed for JSON-based encoding of linked data by seamlessly converting
older JSON-based representations of data. The representations of the data are
highly human-understandable and highly suitable for RESTful environments and
unstructured data over the Web [29]. JSON-LD has an additional resource description
framework (RDF) over and above the typical JSON model and is built to be contextual.
This feature allows for the interoperability of JSON data over the Web. The contextual
linking of the object properties of a JSON document follows a fixed ontology in
JSON-LD through strategies such as tagging with a language by or forcibly assigning
values to pre-defined groups/bins. Context embeddings in JSON-LD documents
can be either direct or through the use of separate file references using HTTP link
headers. Linked data allows for the existence of a network of machine-readable and
standardized data over the Web, which can be parsed by starting at a singular piece
of data and subsequently traversing the embedded links within it; this may lead to
different locations across the Web.
i i
i i
i i
Check yourself
(i) Integration Patterns: Dictates how the Things in IoT connect to the Web.
It is mainly composed of three schemes: Direct connectivity, gateway based
connectivity, and cloud-based connectivity.
(ii) Web Things (WT) Requirements: Provides guidelines and recommendations
for handling various constraints and protocol implementations to enhance the
seamless interaction between the WoT entities. A typical web-server is referred
to as a Web Thing; it should also confirm to these recommendations.
(i) Web Thing Model: Data exchange over the WoT ensues once a Web Thing is
compliant. Additionally, in order to achieve context-awareness, this specification
outlines RESTful web protocol, which has a defined set of payload syntax, data
models, and resources. A fully compliant model is referred to as the Extended
Web of Things model.
Summary
This chapter provided an outline of various communication technologies that are
deemed as core technologies for developing IoT-based solutions. We initially explain
the requirements and classification of IoT devices and communication types. We
i i
i i
i i
divide the various communication protocols under six heads based on their usability
and functionalities: 1) Infrastructure, 2) discovery, 3) data, 4) identification, 5) device
management, and 6) semantic. After this chapter, readers will be able to distinguish
between various requirements and constraints associated with these protocols and
select the best one amongst them according to their application’s requirements.
Exercises
(i) What are the salient features of 6LoWPAN?
(ii) What is a WPAN?
(iii) Describe the addressing types in 6LoWPAN.
(iv) Describe the LOADng routing.
(v) Describe the RPL routing.
(vi) What are the different header types in 6LoWPAN?
(vii) What constitutes a low power lossy network (LLN)?
(viii) What is AMQP? Describe in detail.
(ix) What are the various message guarantees provided by AMQP? Explain each in
detail.
(x) List some of the salient features of AMQP.
(xi) What are the frame types in AMQP?
(xii) Differentiate between OPEN, BEGIN and ATTACH frame types in AMQP.
(xiii) Differentiate between DETACH, END, and CLOSE frame types in AMQP.
(xiv) Differentiate between TRANSFER and FLOW frame types in AMQP.
(xv) What are BINDINGS in the context of AMQP?
(xvi) What are the various types of AMQP exchanges? Describe each.
(xvii) What are the popular applications of AMQP?
(xviii) Explain the working of MQTT
(xix) How is MQTT different from HTTP?
(xx) What are the various MQTT methods?
(xxi) What is SMQTT? How is it different from MQTT?
(xxii) List the salient features of MQTT.
(xxiii) List the salient features of XMPP.
(xxiv) Describe the XMPP protocol.
(xxv) Differentiate between structured and unstructured data.
(xxvi) What is XML?
i i
i i
i i
i i
i i
i i
(lviii) What are the advantages of using REST over regular web-based services?
(lix) What are the various methods used in REST for transferring data over the
network?
(lx) What is statelessness in the context of REST?
(lxi) How are websockets different from simple HTTP?
(lxii) Describe the working of websockets?
(lxiii) What is the functional mechanism for EPC in IoT?
(lxiv) What is uCode and how is it different from EPC?
(lxv) What are the various categories associated with uCode tags?
(lxvi) Describe the uCode resolution system.
(lxvii) What are URIs? How is it used for identifying individual resources?
(lxviii) How is auto-configuration over Internet-connected devices achieved using the
auto configuration server?
(lxix) What are the various components of TR-069?
(lxx) What is OMA-DM?
(lxxi) How is OMA-DM functionally different from TR-069?
(lxxii) Differentiate between JSON-LD and XML.
(lxxiii) What is the Web Thing model? Illustrate its strengths and weaknesses.
References
[1] Bormann, C., M. Ersue and A. Keranen. 2014. “Terminology for Constrained Node
Networks.” https://tools.ietf.org/html/rfc7228.
[2] Annamalaisamy, Vijay. 2019. “Introduction to IoT Constrained Node Networks.”
https://www.hcltech.com/blogs/introduction-iot-constrained-node-networks.
[3] Postscapes. 2019. “IoT Standards and Protocols.” https://www.postscapes.com/internet-
of-things-protocols/.
[4] Vasseur, J. P., Cisco Systems, Internet Engineering Task Force (IETF). 2014. RFC-7102
ISSN: 2070-1721. https://tools.ietf.org/html/rfc7102.
[5] Deering, S., R. Hinden. 1998. “Internet Protocol, Version 6 (IPv6) Specification, IETF.”
https://tools.ietf.org/html/rfc2460.
[6] Sobral, J., J. Rodrigues, R. Rabelo, K. Saleem, and V. Furtado. 2019. “LOADng-IoT:
An Enhanced Routing Protocol for Internet of Things Applications over Low Power
Networks.” Sensors 19(1): 150.
[7] Winter, T. 2012. “RPL: IPv6 Routing Protocol for Low-Power and Lossy Networks, IETF.”
https://tools.ietf.org/html/rfc6550.
i i
i i
i i
[8] Kushalnagar, N., G. Montenegro, and C. Schumacher. 2005. “IPv6 over Low-Power
Wireless Personal Area Networks (6LoWPANs): Overview, Assumptions, Problem
Statement, and Goals, IETF.” https://datatracker.ietf.org/doc/rfc4919/.
[9] The Chromium Projects. 2015. “QUIC, a Multiplexed Stream Transport over UDP.”
Available online: https://www.chromium.org/quic.
[10] Dunkels, A. 2002. “uIP-A Free Small TCP/IP Stack.” The uIP 1.
[11] Shelby, Z., J. Riihijärvi, O. Raivio, and O. Mähönen. 2003. “NanoIP: The Zen of Embedded
Networking.” IEEE International Conference on Communications.
[12] Franck, F., S. A. S. Alcatel Lucent. 2016. “Content-centric Networking.” U. S. Patent
9,338,150.
[13] The Physical Web, Available online: https://google.github.io/physical-web/.
[14] Cheshire, S. and M. Krochmal. 2013. “Multicast dns.” RFC 6762, February.
[15] Jeronimo, M. and J. Weast. 2003. UPnP Design by Example (Vol. 158). Intel Press.
[16] Banks, A. and R. Gupta. 2014. “MQTT Version 3.1. 1.” OASIS Standard 29: 89.
[17] Stanford-Clark, A. and H. L. Truong. 2013. “MQTT for Sensor Networks (MQTT-SN)
Protocol Specification.” International Business Machines (IBM) Corporation version 1: 2.
[18] Shelby, Z., K. Hartke, and C. Bormann. 2014. “The Constrained Application Protocol
(CoAP).” IETF, RFC 7252.
[19] Vinoski, S. 2006. “Advanced Message Queuing Protocol.” IEEE Internet Computing
(6): 87–89.
[20] Saint-Andre, P., K. Smith, R. Tronçon, and R. Troncon. 2009. XMPP: The Definitive Guide.
"O’Reilly Media, Inc.”
[21] Box, D., D. Ehnebuske, G. Kakivaya, A. Layman, N. Mendelsohn, H. F. Nielsen, S. Thatte,
and D. Winer. 2000. “Simple Object Access Protocol (SOAP) 1.1.”
[22] Battle, R. and E. Benson. 2008. “Bridging the Semantic Web and Web 2.0 with
Representational State Transfer (REST).” Web Semantics: Science, Services and Agents on
the World Wide Web 6(1): 61–69.
[23] Fette, I. 2011. “The Websocket Protocol.”
[24] Song, B. and C. J. Mitchell. 2008. “RFID Authentication Protocol for Low-cost Tags.” In
Proceedings of the First ACM Conference on Wireless Network Security. ACM. 140–147.
[25] Ishikawa, C. 2012. “A URN Namespace for uCode.”
[26] Berners-Lee, T., R. Fielding, and L. Masinter. 1998. “Uniform Resource Identifiers (URI):
Generic Syntax.”
[27] Broadband Forum, TR-069: CPE WAN Management Protocol. 2018. https://www.
broadband-forum.org/download/TR-069 Amendment-6.pdf.
[28] Open Mobile Alliance, OMA Device Management Protocol. 2016. http://www.
openmobilealliance.org/release/DM/V1 3-20160524-A/OMA-TS-DM Protocol-V1 3-20
160524-A.pdf.
i i
i i
i i
[29] JSON-LD Working Group. 2018. “JSON for Linking Data.” https://www.w3.org/2018/
json-ld-wg/.
[30] Guinard Dominique. 2017. “The Web Thing Model”, WEB OF THINGS
INTEREST GROUP. https://www.w3.org/blog/wotig/2017/01/13/web-thing-model-
member-submission/.
i i
i i
i i
Chapter 9
IoT Interoperability
Learning Outcomes
After reading this chapter, the reader will be able to:
• Understand the importance of interoperability in IoT
• List various interoperability types
• Identify the salient features and application scope of each interoperability type
• Understand the challenges associated with interoperability in IoT
• Comprehend the importance of real-world use of interoperability frameworks in
IoT
9.1 Introduction
The introduction of billions of connected devices under the IoT environment,
which may extend to trillions soon, has contributed massively to the evolution of
interoperability. As more and more manufacturers and developers are venturing into
IoT, the need for uniform and standard solutions is felt now more than ever before
[1]. Figure 9.1 shows the various facets of interoperability in IoT. Interoperability
is considered as the interface between systems or products—hardware, software,
or middleware—designed in such a manner that the connecting devices can
communicate, exchange data, or services with one another seamlessly irrespective of
the make, model, manufacturer, and platform.
The urgency in the requirement for interoperability and interoperable solutions in
IoT arose mainly due to the following reasons:
i i
i i
i i
Smartphone
gateway
PC/workstation
gateway Aggregation
server
Smart camera
gateway Smarthome
gateway
i i
i i
i i
(i) Device: The existence of a vast plethora of devices and device types in an
IoT ecosystem necessitates device interoperability. Devices can be loosely
categorized as low-end, mid-end, and high-end devices based on their
processing power, energy, and communication requirements. Low-end devices
are supposed to be deployed in bulk, with little or no chance of getting
their energy supplies replenished, depending on the application scenario.
These devices rely on low-power communication schemes and radios, typically
accompanied by low-data rates. The interface of such devices with high-end
devices (e.g., smartphones, tablets) requires device-level interoperability [2].
(ii) Platform: The variations in the platform may be due to variations in operating
systems (Contiki, RIOT, TinyOS, OpenWSN), data structures, programming
languages (Python, Java, Android, C++), or/and application development
environment. For example, the Android platform is quite different from the iOS
one, and devices running these are not compatible with one another [3].
(iii) Semantic: Semantic conflicts arise during IoT operations, mainly due to the
presence of various data models (XML, CSV, JSON), information models (◦ C, ◦ F,
K, or different representations of the same physical quantity), and ontologies [4].
There is a need for semantic interoperability, especially in a WoT environment,
which can enable various agents, applications, and services to share data or
knowledge in a meaningful manner.
(iv) Syntactic: Syntactic interoperability is a necessity due to the presence of conflicts
between data formats, interfaces, and schemas. The variation in the syntactical
grammar between a sender and a receiver of information results in massive
stability issues, redundancies, and unnecessary data handling efforts [5]. For
i i
i i
i i
9.2 Standards
Toward enabling IoT interoperability, various technologies have been standardized
and are recognized globally for incorporating consistent interoperability efforts
worldwide across various industries, domains, and technologies. We list seven of the
popular ones in this chapter.
9.2.1 EnOcean
EnOcean is a wireless technology designed for building automation systems,
primarily based on the principle of energy harvesting [6]. Due to the robustness
and popularity of EnOcean, it is being used in domains such as industries,
transportation, logistics, and homes. As of 2012, EnOcean was adopted as a wireless
standard under ISO/IEC 14543-3-10, providing detailed coverage of the physical,
data link, and networking layers. EnOceanbased devices are batteryless. They
use ultra-low power consuming electronics along with micro energy converters to
enable wireless communication among themselves; the devices include networking
components such as wireless sensors, switches, controllers, and gateways. The energy
harvesting modules in EnOcean use micro-level variations and differences in electric,
electromagnetic, solar, or other forms of energy to transform the energy into usable
energy through highly efficient energy converters. The wireless signals from the
batteryless EnOcean sensors and switches, which are designed to be maintenance-free,
can operate up to 30 meters in buildings and homes and up to 300 meters in the open.
EnOcean wireless sensor modules wirelessly transmit their data to EnOcean system
modules, as shown in Figure 9.2.
EnOcean is typically characterized by low data rates (of about 125 kbit/s) for
wireless packets that are 14 bytes long. This reduces the energy consumption of the
EnOcean devices. Additional features such as the transmission of RF (radio frequency)
energy only during transmission of 1s in the binary encoded message further reduce
i i
i i
i i
Measurable
Energy source
quantity
Energy Energy Processor Sensor
converter management
RF transceiver
EnOcean
RF transceiver
Processor Sensor
the energy consumption of these devices. Frequencies of 902 MHz, 928.35 MHz, 868.3
MHz, and 315 MHz are employed for transmission of messages in this technology.
Check yourself
9.2.2 DLNA
The Digital Living Network Alliance (DLNA), previously known as the Digital Home
Working Group (DHWG), was proposed by a consortium of consumer electronics
companies in 2003 to incorporate interoperability guidelines for digital media sharing
among multimedia devices such as smartphones, smart TVs, tablets, multimedia
servers, and storage servers. Primarily designed for home networking, this standard
relies majorly on WLAN for communicating with other devices in its domain and can
easily incorporate cable, satellite, and telecom service providers to ensure data transfer
link protection at either end. The inclusion of a digital rights management layer
allows for multimedia data sharing among users while avoiding piracy of data. The
consumers in DLNA, which may consist of a variety of devices such as TVs, phones,
tablets, media players, PCs, and others, can view subscribable content without any
i i
i i
i i
additional add-ons or devices through VidiPath. Figure 9.3 shows the steps involved
in a typical DLNA-based multimedia streaming application. As of 2019, DLNA has
over a billion devices following its guidelines globally [7].
se Re
qu
ow Digital media es
Br controller t
DLNA
DLNA media
server DLNA media
player
Stream
Figure 9.3 A representation of the various roles in a DLNA-based media streaming application
Check yourself
DLNA Home Network and Infrastructure devices and components, DLNA mobile
infrastructure
i i
i i
i i
9.2.3 Konnex
Konnex or KNX is a royalty-free open Home Automation Network (HAN) based
wired standard for domestic building and home applications. It relies on wired
communication for achieving automation [8]. Wired configurations such as a star,
tree, or line topologies can be achieved by using a variety of physical communication
technologies involving twisted pair, power line, RF (KNX-RF), or IP-based (KNX-
net/IP) ones. KNX evolved from three previous standards: 1) BatiBUS, 2) European
Home Systems Protocol (EHS), and 3) European Installation Bus (EIB or Instabus). It
has a broad scope of applications in building automation, which involve tasks such
as controlling lighting, doors, windows, high-voltage AC (HVAC) systems, security
systems, audio/video systems, and energy management. Figure 9.4 represents a
typical Konnex-based building network. The KNX facilitates automation through
distributed applications and their interaction using standard data types, objects,
logical devices, and channels, which form an interworking model. The technology is
robust enough to be supported by a wide range of hardware platforms, starting from
a simple microcontroller to a sophisticated computer. The requirements of building
automation often dictate the hardware requirements.
Internet
User control
Home
(Browser)
gateway
Smart home
appliances
Building network
i i
i i
i i
system devices and components (couplers). Typically, the KNX uses a twisted pair
bus for communication, which is channeled through the building/home alongside the
electrical wiring. Using a 16-bit address bus, KNX can accommodate 57375 devices. A
KNXnet/IP installation allows the integration of KNX sub-networks via IP. A system
interface component is used for loading application software, system topology, and
operating software onto the devices, after which the devices can be accessed over LAN
or phone networks. This feature also allows for the centralized as well as distributed
control of systems remotely. KNX has three different configuration modes according
to device categories.
(i) Automatic mode (A mode): Typically used for auto-configurable devices, and is
generally installed by the end users.
(ii) Easy mode (E mode): Devices require initial training for installation, where the
configuration is done as per the user’s requirements; the device behavior is pre-
programmed using E mode.
(iii) System mode (S mode): Some devices generally require specialists to install; the
system mode is used for this. The devices do not have a default behavior but can
be used for deploying complex building automation systems.
Points to ponder
Check yourself
9.2.4 UPnP
The Universal Plug and Play (UPnP) was designed primarily for home networks
as a set of protocols for networking devices such as PCs, printers, mobile devices,
gateways, and wireless access points. UPnP can discover the presence of other
UPnP devices on the network, as well as establish networks amongst them for
communication and data sharing [9]. Whenever they are connected to a network,
UPnP devices can establish working configurations with other devices. As of
2016, UPnP is managed by the Open Connectivity Forum (OCF). The underlying
assumption of UPnP is the presence of an IP network over which it uses HTTP
to share events, data, actions, and service/device descriptions through a device-to-
device networking arrangement. Device search and advertisements are multicast
through HTTP over UDP (HTTPMU) over port 1900. The responses are returned in
i i
i i
i i
a unicast manner through HTTP over UDP (HTTPU). UPnP is based on established
protocols and architectures such as TCP/IP protocol suite, HTTP, XML, and SOAP.
UPnP is a distributed and open standard. Devices controlled by UPnP are handled by
UPnP control points (CPs). The networked UPnP devices are designed to dynamically
join networks, obtain IP addresses, advertise its presence and capabilities, and detect
the presence and capabilities of other neighboring and networked devices through a
process known as zero configuration networking.
Non-UPnP device
UPnP bridge
Control
point
UPnP devices are typically characterized by a control point and service(s). The
service(s) need to communicate with the control point for further
instructions/execution. Figure 9.5 shows a typical UPnP operation. A central control
point in a room can be used to control various UPnP services across a home. Non-
UPnP devices can be easily integrated with the UPnP services through a bridge.
UPnP supports a range of IP supporting media such as Ethernet, IR, Bluetooth,
Wi-Fi, FireWire, and others, without the need for individual device drivers. UPnP,
being an OS and language independent protocol, typically uses web browsers for the
user interface. Each UPnP device implements a DHCP (dynamic host configuration
protocol) client and searches for a DHCP server during its first initiation in the
network. These devices can also use a feature known as AutoIP to assign itself an IP
address, in case a DHCP server is not available. The UPnP device then discovers the
network through the simple service discovery protocol (SSDP), which advertises the
device through the CPs (coordination protocols) on the network. The CP then retrieves
the device’s information through a location URL sent by the device. The device
information is in the form of an XML schema using SOAP; it additionally contains
i i
i i
i i
a list of services: commands, actions, and actionable variables and parameters. To the
control URL in the description, CPs use control messages to send actions to a device’s
service. Finally, if a device has a URL for presentation, the CP retrieves the contents,
allowing a user to control or view the device and device status.
Check yourself
9.2.5 LonWorks
LonWorks or local operating network, as it was initially named, is a protocol
developed by the Echelon Corp [10]. It was primarily developed for addressing
the needs of networked control applications within buildings over physical
communication media such as twisted pair, fiber optic cables, powerlines, and RF. The
twisted pair uses differential Manchester encoding and has a data rate of 78 kbit/s,
whereas the powerline is much slower and can have either 5.4 kbit/s or 3.6 kbit/s
depending on the frequency of the power line. This protocol was standardized by
ANSI (American National Standards Institute) as early as 1999 when it was known
as LonTalk and was used for control networking. This protocol has been used
in a variety of deployment areas such as the pneumatic braking system of trains,
semiconductor equipment manufacturing, petrol station controls, and as a building
automation standard. LonWorks extends backward compatibility support to its legacy
installations through an IP-based tunneling standard (ISO/IEC 14908-4). Regular IP-
based services can be readily used with LonWorks platforms or installations for UI or
control level applications. Figure 9.6 illustrates a typical LonWorks network.
Initially, a LonTalk protocol node could only be installed using a custom-designed
IC with an 8-bit processor; this IC was referred to as the “neuron chip”. The neuron
chip is a system on a chip and is essentially the soul of the LonWorks-based devices.
There are two types of neuron chips based on the memory capabilities and packaging:
1) the 3120 and 2) the 3150. Presently, a significant number of LonWorks-based
devices use the neuron chip, which is also accessible by general processors by porting
to an IP-based or 32-bit chip. A neuron chip has three CPUs, one each for MAC
processing, network processing, and application processing. The MAC processor is
tasked with CRCs (cyclic redundancy checks), transmitting and receiving messages
over the physical media, and confirming message destinations. The network processor
deals with addressing, routing, acknowledgments, and other network layer tasks.
Finally, the application processor is used for deploying custom applications which
typically support 8-bit operations; it can also be used as a communication co-processor
for high-end processors. The decoupling of processors based on tasks enables the
robust and speedy performance of the neuron chips. Each neuron chip has three
memory types available with it: 1) ROM, 2) RAM, and 3) EEPROM. The LonTalk,
i i
i i
i i
User interface
Twisted pair
Building network
along with the OS and I/O libraries are typically programmed in the ROM during
manufacturing.
Check yourself
9.2.6 Insteon
Insteon was developed as a home automation technology by Smartlabs in 2005
and marketed under its subsidiary Insteon. Insteon enables interoperability and
automation among household devices such as lights, switches, thermostats, motion
and gas sensors, and others through RF or powerline communication [11]. Insteon-
connected devices act as peers and can independently perform network-based
functions such as message transmission and reception by using a dual mesh network
topology. These devices operating over the powerline have a frequency of 131.65 kHz;
the devices use binary phase shift keying (BPSK), with a minimum receive signal level
of 10 mV. In contrast, Insteon devices using RF operates over a frequency of 915 MHz;
these devices use frequency shift keying for communication and Manchester codes
for encoding data with a data rate of 4.56 kbit/s over ranges of approximately 120 m
without obstructions. Figure 9.7 shows a typical home-based Insteon network.
i i
i i
i i
Insteon networks can have 16 million+ unique IDs and can support 65,000+
devices. Each of these devices has a built-in engine, which has an 80 byte RAM
and a 3 kbyte ROM. Application-specific requirements of Insteon devices such as
lights and switches require 256 bytes of RAM and EEPROM, and 7 kbytes of flash
memory. Insteon devices have an average data rate of 180 bit/s, using which a
standard message of 10 bytes or extended message of 24 bytes is transmitted. Each
Insteon message can accommodate up to 14 bytes of user data and contain a two-
bit field meant for counting hops. Message originating nodes initialize this field
value to 3, which is decremented by the number of times a node repeats the message
during its transmission. Upon receiving a message, each device performs error
detection and correction. Retransmission of erroneous messages in this manner
enhances the reliability of Insteon technology. All devices transmit the same message
simultaneously using PSK to ensure synchronicity with the powerline frequency. This
ensures message integrity and strengthens the signal over the powerline.
Home automation CO
Monitoring sensors
Controller
Meter
RF
Bridge
Powerline
Gateway Internet
The dual mesh/ dual band network topology of Insteon is named so mainly
because, during operations over the RF band, interferences are mitigated by
transmitting data over the powerline, and vice versa. As this is a peer-to-peer network,
it can operate without the need for central controllers. Central controllers can be
integrated with this technology to extend control operations over smartphones and
tablets. As a security measure to avoid hijacking a neighbor’s Insteon devices, Insteon
requires users to have physical ownership of the devices they want to connect to their
network and the respective device IDs (which is unique and similar to a MAC ID). The
inbuilt firmware on the devices prevents Insteon devices from forming connections
and identifying themselves to other devices until a button is physically pressed on
them during their installation.
i i
i i
i i
Points to ponder
Legacy Insteon chipsets are interoperable with X10 powerline messaging, but with
reduced functionalities. Present-day initiatives have incorporated compatibility
for certain functionalities of Insteon with Amazon Echo, Microsoft Cortana, Apple
Watch, and the Google-owned NEST.
Check yourself
9.2.7 X-10
The X-10 protocol was developed by Pico Electronics (Scotland) in 1975 as a means of
achieving communication and automation among household devices over powerlines.
It was one of the first home automation technologies, and yet it remains one of the
most widely used even in the present day [12]. Data and controls are encoded as
brief RF bursts for signaling and control over the powerlines. Household electrical
wiring is used for sending data between X-10 devices by encoding it over a 120 kHz
carrier frequency, which is transmitted during zero crossings of 50–60 Hz consumer
AC signals as RF bursts, one bit per crossing. The data is made up of an address
and a command between the controller and the device. X-10 signals are restricted
within the power supply of a house/network using inductive filters, which act as
attenuators. Coupling capacitors or active repeaters for X-10 are used to facilitate
signal transmission over multiphase systems. An X-10 system can have 256 possible
addresses, which is made up of 4-bit house codes (numbered from alphabets A to
P), 4-bit unit codes, and finally, a 4-bit command. More than one house code can
be simultaneously called within a single house. X-10 devices may be either one-
way or two-way. One-way devices are typically very cheap and can only receive
commands, whereas two-way devices are more expensive and can send as well as
receive commands. These two-way devices are generally used as controllers. Figure
9.8 represents a typical X-10 setup and controller, which allows a user to connect to
and control a variety of appliances and devices at home.
i i
i i
i i
HVAC appliances
CT
Sensors X-10
Transceiver
CO
Lighting
Existing
powerline
X-10 User
Controller
A typical X-10 command may look like: “select code A5”, which is followed by
the command for that device such as “turn on/off”. This command signals an
X-10 device with address A5 to turn on/off.
Check yourself
i i
i i
i i
9.3 Frameworks
Similar to the standards, there has been a rise in universal interoperability
frameworks. These frameworks span across platforms, devices, technologies, and
application areas. We discuss five of the most popular interoperability frameworks
in this chapter.
9.3.1 universAAL
UniversAAL is an open-source software framework designed for enabling runtime
support in distributed service oriented environments comprising mainly of the
system of systems [13]. This framework extends semantic interoperability by sharing
compatible models/ontologies with service consumers such as mobile devices,
embedded systems, and others. Managers, along with middleware, collectively form
the universAAL platform. These managers are considered low-level applications and
provide functional APIs (application programming interfaces) to final applications
utilizing universAAL. Hardware such as sensors and actuators connect to the
universAAL platform through exporters, which are specific for different technologies
such as Zigbee, Konnex, and others.
The universAAL middleware is tasked with core coordination among the nodes
within a peer-to-peer connectivity layout, referred to as the uSpace. The sharing
of various universAAL communication semantics such as the shared ontological
model, context, service interactions, and user interactions is performed in this uSpace,
which creates a logical environment for enabling communications irrespective of
the underlying device, technology, or network. The services or set of services
run by a universAAL application is human/user-centric. A coordinator node is
responsible for creating each uSpace, and subsequently keeping track of its status,
and adding/deleting new nodes to it.
A container is responsible for supporting the middleware and the code and
building rules under different environments such as Java environments, Android
environments, and other embedded systems. As of now, universAAL supports only
Bundles in OSGi (for embedded systems) and APKs in Android. The peering part
handles various instances of middleware communication and interconnections. A
UPnP-like connector is tasked with the discovery of universAAL nodes and multi-
technology bridging.
The most crucial aspect of the middleware is the communication, which provides
the logic for semantic information flow between the peers. This flow is enabled
through purpose-specific buses to which various applications connect irrespective of
the device, container, or peering technology. Buses have been defined for purposes
such as context, service and user interactions, internal strategy handling, semantics,
peer matchmaking, and others. The ontology model, encryption, and message parsing
through message serialization are defined in a representation model. A uSpace
i i
i i
i i
Check yourself
9.3.2 AllJoyn
The AllJoyn is an open-source software initiative proposed by Qualcomm in 2011 that
allows devices within this framework to communicate with other devices near its
vicinity [14]. The flexible AllJoyn framework encourages proximal connectivity and
even has the option of including cloud connectivity to it. It was subsequently signed
over to the Linux Foundation under the aegis of the AllSeen Alliance, which was
formed primarily to promote IoT interoperability. Major global consumer electronics
corporations such as LG, Sony, Panasonic, Haier, Cisco, HTC, Microsoft, and many
others are part of the AllSeen Alliance. In 2016, AllJoyn merged with IoTvity and
joined the Open Connectivity Forum (OCF), which allowed various open-source
projects to include it within their framework. The AllJoyn and IoTvity technologies
are currently interoperable and backward compatible with one another.
The open-source AllJoyn software framework enables interoperability amongst
connected devices and applications, resulting in the creation of dynamic proximal
networks using a D-Bus message bus. The software framework and the core
components of the system seamlessly discover, communicate, and collaborate
irrespective of platform, product, brand, or connection types, although within the
limitations of the collaborating brands only (which is quite large). As of now,
communication is only through Wi-Fi, but it includes devices concerning smart homes,
smart TVs, smart audio, gateways, and even automotive devices.
The AllJoyn framework follows a client–server model. The clients are often
referred to as “consumer” and the server as the “producer”. For example, in a
smart home environment, a proximity sensor senses the presence of humans in the
house and switches on appliances based on the occupancy of the house. If the
house is empty, the appliances are turned off. Here, the proximity sensor is the
consumer, and the appliance (maybe, a light) is a producer. In this framework,
each producer is characterized by an introspection file, which is an XML schema
of the producer’s capabilities and functionalities. The requests for each producer
are based on its introspection file. The framework’s capabilities can be extended by
incorporating other protocols with it through bridging. Complex functionalities such
as simultaneous audio streams to multiple devices can also be executed using this
framework.
i i
i i
i i
Some of the core services provided by the AllJoyn framework include onboarding
services (attaching a new device to the framework’s Wi-Fi network), configuration
service (configuring device attributes such as languages, passwords, and names),
notification service (text/view-URL based audio and image notifications), control
panel (remote app-based control of all connected devices), and common device model
service (unified monitoring of IoT devices irrespective of vendors or manufacturers).
Check yourself
Device XML schema, AllJoyn source code, AllJoyn products and services
9.3.3 IoTivity
Similar to the AllJoyn, IoTvity is an open-source project which is sponsored by the OSF
(Open Science Framework) and hosted by the Linux Foundation [15]. This framework
was developed to unify billions of IoT devices, be it wired or wireless, across the
Internet, to achieve a robust and interoperable architecture for smart and thin devices.
IoTvity is interoperable and backward compatible with AllJoyn. This framework can
connect across profiles ranging from consumer, health, enterprise, industrial, and even
automotive.
The IoTvity framework uses CoAP at the application layer and is not bothered
with the physical layer requirements of devices. However, the network layer of
the connecting devices must communicate using IP. The connectivity technologies
of IoTvity connecting devices can consist of Wi-Fi, Ethernet, Bluetooth Low Energy,
Thread, Z-Wave, Zigbee, or other legacy standards.
The IoTvity architecture supports the following core functionalities: Discovery
(finding devices in one’s vicinity and offering services to them), data transmission
(standardized message transmission between devices), device management, and data
management.
Under the purview of the resource-bounded context in IoTvity’s OCF (Open
Connectivity Foundation) Native Cloud 2.0 framework, which aims to utilize and
enhance the benefits of IoT for companies fully, a resource hosting server has to be
accessible through the OCF’s native cloud. A resource is an object, which consists of
a type, associated data, resource relationships, and operational methods. A server can
only publish discoverable resources (which can be found by other connected clients),
once it is successfully connected, authenticated, and authorized. Clients can discover
resources, either based on the resource type or server identifiers.
Check yourself
IoTvity services and functionalities, IoTvity source code, IoTvity use cases
i i
i i
i i
Check yourself
9.3.5 HomeKit
The HomeKit software framework is designed by Apple to work with its iOS mobile
operating system for achieving a centralized device integrating and control framework
[17]. It enables device configuration, communication, and control of smart home
appliances. Home automation is achieved by incorporating room designs, items,
and their actions within the HomeKit service. Users can interact with the framework
using speech-based voice commands through Apple’s voice assistant, Siri, or through
external apps. Smart home devices such as thermostats, lights, locks, cameras, plugs,
and others, spread over a house can be controlled by a single HomeKit interface
through smartphones. HomeKit-enabled device manufacturers need to have an MFi
program, and all devices were initially required to have an encryption coprocessor.
Later, the processor-based encryption was changed to a software-based one.
i i
i i
i i
Non-HomeKit devices can have the benefits of HomeKit through the use of HomeKit
gateways and hubs.
HomeKit devices within a smart home securely connect to a hub either through
Wi-Fi or Bluetooth. However, as the range of Bluetooth is severely limited, the full
potential of the HomeKit may not be adequately exploited. This framework allows
for individual as well as grouped control of connected devices based on scenarios.
Features such as preconfigured devices settings can be collectively commanded using
voice commands to Siri.
Points to ponder
Check yourself
Summary
This chapter introduces the concept of interoperability in the context of IoT
architectures, frameworks, and application domains. We initially outline the
taxonomy of interoperability to give the readers a perspective of the challenges
and the present-day solutions or attempts to solve these challenges. We outline
the various standardization efforts to address interoperability issues in different
domains. Further, at the end of this chapter, we also provide a brief description of
the different interoperability-enabling frameworks that are under development by
various corporations across the globe.
Exercises
(i) Differentiate between semantic and syntactic interoperability.
(ii) What are the various types of interoperability encountered in IoT environments?
(iii) What is meant by the heterogeneity of IoT devices in the context of
interoperability?
(iv) How is device interoperability different from platform interoperability?
(v) Describe the following standards:
(a) EnOcean
(b) DLNA
i i
i i