Module - 3 - IoT Processing Topologies and Types
Module - 3 - IoT Processing Topologies and Types
MODULE-3
Data Format
• Huge quantities and varieties of data are generated regularly and
flow freely through internet.
• The massive volume of data generated is further enhanced by
the multiple devices utilized by most users and non-human data
generation also adds up to this volume.
• e-mails, text documents (Word docs, PDFs, and others),social
media posts, videos, audio files, and images
• Iot generated data can be broadly grouped into two types based on
how they can be accessed and stored:
• 1) Structured data and
• 2) unstructured data.
Structured data
• These are typically text data that have a pre-defined
structure.
• Structured data are associated with relational database
management systems (RDBMS).
• Even if the data is human or machine generated, these data
are easily searchable.
• Common usage of this type of data is associated with flight
or train reservation systems, banking systems, inventory
controls, and other similar systems.
• Structured Query Language (SQL) are used for accessing these data in
RDBMS.
• Structured data holds a minor share of the total data volume
Unstructured data:
• All the data on the Internet, which is not structured, is
categorized as unstructured.
• These data types have no pre-defined structure and can
vary according to applications and data-generating
sources.
• This data type does not have fixed formats associated with
it, which makes it very difficult for querying algorithms to
perform a look-up.
• Human-generated unstructured data include text, e-mails,
videos, images, phone recordings, chats, and others
• Machine-generated unstructured data include sensor data
from traffic, buildings, industries, satellite imagery,
surveillance videos, and others
Importance of Processing in IoT:
• As a large amount of data is being flowing through the internet we need
an intelligent and resourceful processing techniques. when to process and
what to process? Is the crucial part.
• The data to be processed can be divided into three types based on the
urgency of processing
1) Very time critical,2) time critical, and 3) normal.
• Data from sources such as flight control systems, healthcare, etc. which
need immediate decision support, are deemed as very critical.
• Data from sources that can tolerate normal processing latency/delay of
few seconds are deemed as time critical data.
• Data from vehicles, traffic, machine systems, smart home systems,
surveillance systems, and others, which can tolerate a delay of few second
• Last category of data can tolerate a processing latency of a few minutes to
a few hours and are typically associated with less data-sensitive domains
such as agriculture, environmental monitoring, and others.
• The processing requirements of data from very time-
critical sources are exceptionally high.
• Time critical data demands processing in place or
almost near to the source.
• Second category of data allows transmission of data to
remote processor/cloud processor.
• The last category of data sources (normal) typically
have no particular time requirements for processing
urgently and are pursued leisurely as such.
Processing Topologies:
• Depending on the processing requirements of the IoT application the
architecture of the IoT system decided, a properly designed IoT
system would result in massive savings in network bandwidth and
conserve significant amounts of overall energy in the architecture.
• No dependency on latency,
• it is significantly cheaper than on-site processing
topologies. This difference in cost is mainly due to the
low demands and requirements of processing at the
source itself.
• The sensor node is responsible for the collection and
framing of data that is eventually to be transmitted to
another location for processing
• Dedicated high-processing enabled devices, which can
be borrowed by multiple simpler sensor nodes to
accomplish their tasks.
Remote processing:
• It encompasses sensing of data by various sensor nodes; the
data is then forwarded to a remote server or a cloud-based
infrastructure for further processing and analytics.
• Enables the reuse and reallocation of the same processing
resource.
• This setup also ensures massive scalability of solutions.
• Use up a lot of network bandwidth and relies heavily on the
presence of network connectivity between the sensor nodes
and the remote processing infrastructure
Collaborative processing:
• This works by clubbing together the processing power of
nearby processing nodes and collaboratively process the
data in the vicinity of the data source itself.
• This is used in scenarios with limited or no network
connectivity. latencies due to transfer of data over network is
reduced.