UNIT IV - Iot - 1

The document discusses data analytics, focusing on structured and unstructured data, their advantages and disadvantages, and the differences between data in motion and data at rest. It also covers the role of machine learning in IoT, the benefits of NoSQL databases, and the Hadoop ecosystem, including components like HDFS and MapReduce. Additionally, it highlights the importance of edge streaming analytics and network analytics in managing IoT systems.

Uploaded by

drblessyexams

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views27 pages

UNIT IV - Iot - 1

Uploaded by

drblessyexams

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 27

UNIT IV

DATA ANALYTICS AND SUPPORTING

SERVICES
Structured data
• Structured data is data that has been
predefined and formatted to a set structure
before being placed in data storage
• Example: Relational database
• Advantages
• Easily used by machine learning algorithms
• Easily used by business users
• Increased access to more tools
Disadvantages
• Lack of flexibility
• predefined purpose limits use
• Limited storage options
Unstructured data
• Unstructured data is data stored in its native
format and not processed until it is used,
which is known as schema-on-read.
• Advantages
• Freedom of the native format
• Faster accumulation rates
• Data lake storage
Disadvantages
• Requires data science expertise
• Requires specialized tools
Structured data vs. unstructured data

• Structured Data
– Self-service access
– Only select data types
– Schema-on-write
– Commonly stored in data warehouses
– Predefined format
Unstructured Data
Requires data science expertise
Many varied types conglomerated
Schema-on-read
Commonly stored in data lakes
Native format
Data in motion vs Data in rest
• Data in motion
• The collection process for data in motion is similar to
that of data at rest; however, the difference lies in the
analytics. In this case, the analytics occur in real-time
as the event happens
• Data at rest
This refers to data that has been collected from various
sources and is then analyzed after the event occurs.
The point where the data is analyzed and the point
where action is taken on it occur at two separate times
Data in motion
• Data in transit, or data in motion, is data
actively moving from one location to another
such as across the internet or through a
private network.
• Data protection in transit is the protection of
this data while it’s travelling from network to
network or being transferred from a local
storage device to a cloud storage device
Data at rest
• Data at rest is data that is not actively moving
from device to device or network to network
such as data stored on a hard drive, laptop,
flash drive, or archived/stored in some other
way.
• Data protection at rest aims to secure inactive
data stored on any device or network.
Role of machine learning in IoT
• IoT and machine learning deliver insights
otherwise hidden in data for rapid, automated
responses and improved decision making.
• Machine learning for IoT can be used to
project future trends, detect anomalies, and
augment intelligence by ingesting image,
video and audio.
Need for ML
• Machine learning -demystify the hidden patterns in IoT
data by analyzing massive volumes of data
• Machine learning inference - supplement or replace
manual processes with automated systems using
statistically derived actions in critical processes.
• With machine learning for IoT, you can:
• Ingest and transform data into a consistent format
• Build a machine learning model
• Deploy this machine learning model on cloud, edge and
device
Benefits
• Simplify machine learning model training
• Flexibility to use your data science library of choice
• Rapid model deployment to operationalize
machine learning quickly
• Prebuilt connectors for operational & historical
datastores
• Integration with Cumulocity IoT Streaming
Analytics
• Notebook integration
No SQL Databases
• NoSQL is a class of databases that support semi-
structured and unstructured data, in
• addition to the structured data handled by data
warehouses and MPPs.
• Includes different types of databases
• Document stores
• Key-value stores
• Wide column stores
• Graph stores
Hadoop
• Most popular choice as a data repository and
processing engine.
• Two key elements are still present in current
Hadoop distributions and provide the
foundation for other projects
• Hadoop Distributed File System (HDFS): A system
for storing data across multiple nodes
• MapReduce: A distributed processing engine that
splits a large task into smaller ones that can be run
in parallel
Distributed hadoop cluster
Hadoop
• Both MapReduce and HDFS take advantage of
this distributed architecture to store and
process massive amounts of data and are thus
able to leverage resources from all nodes in
the cluster.
• For HDFS, this capability is handled by
specialized nodes in the cluster, including
NameNodes and DataNodes
Hadoop
• NameNodes: These are a critical piece in data
adds, moves, deletes, and reads on HDFS. They
coordinate where the data is stored, and
maintain a map of where each block of data is
stored and where it is replicated.
• DataNodes: These are the servers where the
data is stored at the direction of the NameNode.
It is common to have many DataNodes in a
Hadoop cluster to store the data.
Writing file to HDFS
Hadoop Ecosystem
• Many organizations have adopted Hadoop
clusters for storage and processing of data and
have looked for complimentary software
packages to add additional functionality to their
distributed Hadoop clusters.
• Since the initial release of Hadoop in 2011, many
projects have been developed to add incremental
• functionality to Hadoop and have collectively
become known as the Hadoop ecosystem.
Apache Kafka
• Apache Kafka is a distributed publisher-
subscriber messaging system that is built to be
scalable and fast.
• It is composed of topics, or message brokers,
where producers write data and consumers
read data from these topics.
• The goal of Kafka is to provide a simple way to
connect to data sources and allow consumers to
connect to that data in the way they would like.
Apache Kafka dataflow
Apache Spark
• Apache Spark is an in-memory distributed
data analytics platform designed to accelerate
processes in the Hadoop ecosystem.
• The “in-memory” characteristic of Spark is
what enables it to run jobs very quickly.
• At each stage of a MapReduce operation, the
data is read and written back to the disk,
which means latency is introduced through
each disk operation.
Edge Streaming Analytics
• Key values of edge streaming analytics include
the following
• Reducing data at the edge
• Analyzing and response at the edge
• Time sensitivity
Edge Streaming Analytics – Core Functions

• Streaming analytics at the edge can be broken

down into three simple stages:
• Raw input data
• Analytics Processing Unit
• Output Streams
Edge analytics processing unit
Edge Analytics
• In order to perform analysis in real-time, the
APU needs to perform the following functions:
• Filter
• Transform
• Time
• Correlate
• Match patterns
• Improve business intelligence
Network Analytics
• Extremely important in managing IoT systems is
network-based analytics
• Network analytics is concerned with
discovering patterns in the communication
flows from a network traffic perspective
• Network analytics has the power to analyze
details of communications patterns made by
protocols and correlate this across the network.

Microsoft Windows Questions
0% (1)
Microsoft Windows Questions
5 pages
Data Analytics and Hadoop
No ratings yet
Data Analytics and Hadoop
21 pages
IOT 4 Module
No ratings yet
IOT 4 Module
48 pages
Data Analytics Iot Unit5 Modified
No ratings yet
Data Analytics Iot Unit5 Modified
35 pages
IIOT Unit 3 NOTES
No ratings yet
IIOT Unit 3 NOTES
22 pages
IoT - Module 4 - 8th Sem
No ratings yet
IoT - Module 4 - 8th Sem
17 pages
Iot Module4 RMR
No ratings yet
Iot Module4 RMR
121 pages
CPE 445-Internet of Things - Chapter 7
No ratings yet
CPE 445-Internet of Things - Chapter 7
39 pages
UNIT IV - Iot
No ratings yet
UNIT IV - Iot
13 pages
IoT & Its Applications Unit-IV
No ratings yet
IoT & Its Applications Unit-IV
44 pages
IOT Module 4
No ratings yet
IOT Module 4
17 pages
IOT Mod4@AzDOCUMENTS - in
No ratings yet
IOT Mod4@AzDOCUMENTS - in
17 pages
Big Data Unit 1 AKTU Notes
No ratings yet
Big Data Unit 1 AKTU Notes
87 pages
Hadoop & BigData (UNIT - 2)
No ratings yet
Hadoop & BigData (UNIT - 2)
22 pages
Iot M4
No ratings yet
Iot M4
12 pages
Data Analytics For IoT Solutions (Module VI)
No ratings yet
Data Analytics For IoT Solutions (Module VI)
81 pages
IOT Mod-4
No ratings yet
IOT Mod-4
42 pages
IOT Unit-IV
No ratings yet
IOT Unit-IV
74 pages
Unified Big Data Lambda Architecture Wit
No ratings yet
Unified Big Data Lambda Architecture Wit
13 pages
BIT4440 BSE4040 CloudComputing 3.big Data Technologies
No ratings yet
BIT4440 BSE4040 CloudComputing 3.big Data Technologies
43 pages
Data Science and Big Data UNIT 3
No ratings yet
Data Science and Big Data UNIT 3
11 pages
BDA I Unit
No ratings yet
BDA I Unit
44 pages
BigData Terminology Hadoop MapReduce Yarn Spark File Formats
No ratings yet
BigData Terminology Hadoop MapReduce Yarn Spark File Formats
42 pages
Hadoop - MapReduce
No ratings yet
Hadoop - MapReduce
51 pages
Chapter Two Data Science: by Abdulaziz Oumer
No ratings yet
Chapter Two Data Science: by Abdulaziz Oumer
29 pages
Data Analytics in Iot: Cs578: Internet of Things
No ratings yet
Data Analytics in Iot: Cs578: Internet of Things
27 pages
BDA_Unit-1
No ratings yet
BDA_Unit-1
33 pages
ReductStore - White Paper - Review
No ratings yet
ReductStore - White Paper - Review
7 pages
IoT - New 6
No ratings yet
IoT - New 6
186 pages
Big Data Analytics - Project
50% (2)
Big Data Analytics - Project
27 pages
Future of Big Data
No ratings yet
Future of Big Data
3 pages
Ashish Presentation Stage1 Modify LR
No ratings yet
Ashish Presentation Stage1 Modify LR
24 pages
INTERNET OF THINGS Unit IV
No ratings yet
INTERNET OF THINGS Unit IV
9 pages
Big Data Distributed Platforms
No ratings yet
Big Data Distributed Platforms
18 pages
Machine Learning and Cloud Computing: Survey of Distributed and Saas Solutions
No ratings yet
Machine Learning and Cloud Computing: Survey of Distributed and Saas Solutions
13 pages
Unit 4 Iot
No ratings yet
Unit 4 Iot
92 pages
Biggdata
No ratings yet
Biggdata
24 pages
Big Data and BDA
No ratings yet
Big Data and BDA
44 pages
IoT Notes
No ratings yet
IoT Notes
21 pages
UNIT-4-IOT Notes
No ratings yet
UNIT-4-IOT Notes
74 pages
Unit III Lecture Notes
No ratings yet
Unit III Lecture Notes
6 pages
Course1 Summary
No ratings yet
Course1 Summary
4 pages
IOT Design 1. IOT Topology
No ratings yet
IOT Design 1. IOT Topology
5 pages
Expose Iot Data Mining Yagoub - Semida
No ratings yet
Expose Iot Data Mining Yagoub - Semida
19 pages
Bda Unit 1
No ratings yet
Bda Unit 1
32 pages
Data Science
No ratings yet
Data Science
87 pages
BDA Unit 2 1
No ratings yet
BDA Unit 2 1
42 pages
In9040 PHD Presentation Selimozcan 2
No ratings yet
In9040 PHD Presentation Selimozcan 2
36 pages
Chapter 14
No ratings yet
Chapter 14
12 pages
BigData Unit1
No ratings yet
BigData Unit1
74 pages
Data Handling & Analytics: Unit 5
No ratings yet
Data Handling & Analytics: Unit 5
18 pages
Introduction To Data Analytics For IoT
No ratings yet
Introduction To Data Analytics For IoT
4 pages
Module4-Data Analytics-Ppt-Dlb-Chapter5
No ratings yet
Module4-Data Analytics-Ppt-Dlb-Chapter5
50 pages
eiot5
No ratings yet
eiot5
14 pages
BIG DATA Notes
No ratings yet
BIG DATA Notes
11 pages
Unit1 - BDH
No ratings yet
Unit1 - BDH
77 pages
Group Assingment 1: - Which Emerging Technologies Will Have More Effect On Our Day-to-Day Life and How?
No ratings yet
Group Assingment 1: - Which Emerging Technologies Will Have More Effect On Our Day-to-Day Life and How?
4 pages
Big Data Deals With Large Data Sets
No ratings yet
Big Data Deals With Large Data Sets
4 pages
Big Data Analytics
From Everand
Big Data Analytics
Nitin Kumar Yadav
No ratings yet
Practical Data Strategies and Recipes
From Everand
Practical Data Strategies and Recipes
Tom Henricksen
No ratings yet
Mastering Apache Iceberg: Managing Big Data in a Modern Data Lake
From Everand
Mastering Apache Iceberg: Managing Big Data in a Modern Data Lake
Robert Johnson
No ratings yet
Zsdzs
No ratings yet
Zsdzs
35 pages
Alcatel-Lucent Omnipcx Office Technical Bulletins & Release Notes Table of Content
No ratings yet
Alcatel-Lucent Omnipcx Office Technical Bulletins & Release Notes Table of Content
6 pages
Xray 8000 Adv
No ratings yet
Xray 8000 Adv
125 pages
Honeywell En0b0564 Ge51r0711
No ratings yet
Honeywell En0b0564 Ge51r0711
18 pages
Paper. How To Make A Key Generator Using - Aksel Heinlein, Andrew
No ratings yet
Paper. How To Make A Key Generator Using - Aksel Heinlein, Andrew
14 pages
Ntime Dapr
No ratings yet
Ntime Dapr
312 pages
Cutviewer Turn User Guide V3
No ratings yet
Cutviewer Turn User Guide V3
21 pages
Windows XP Mode Installation: February 2012 Author: Regupathy Ragavan Reviewed By: Mark Stout
No ratings yet
Windows XP Mode Installation: February 2012 Author: Regupathy Ragavan Reviewed By: Mark Stout
8 pages
BLE 5, Thread, Zigbee Modules, BT840/F/E/X/XE: Specifications
No ratings yet
BLE 5, Thread, Zigbee Modules, BT840/F/E/X/XE: Specifications
35 pages
Objectives Overview: Discovering Computers Fundamentals Fundamentals, 2012 Edition
No ratings yet
Objectives Overview: Discovering Computers Fundamentals Fundamentals, 2012 Edition
17 pages
CLEO ReadMeoewufhewiugf
No ratings yet
CLEO ReadMeoewufhewiugf
2 pages
Ch-01 (ICS I) - Basics of Information Technology
No ratings yet
Ch-01 (ICS I) - Basics of Information Technology
79 pages
Reda Rashad
No ratings yet
Reda Rashad
3 pages
Nextcloud Manual
No ratings yet
Nextcloud Manual
179 pages
DX Diag
No ratings yet
DX Diag
49 pages
PC Magazine - Linux Solutions Malestrom
No ratings yet
PC Magazine - Linux Solutions Malestrom
475 pages
Character Conversion
No ratings yet
Character Conversion
37 pages
Samsung CMOS Card
No ratings yet
Samsung CMOS Card
2 pages
ARM in Embedded Applications - David Rose@ARM
100% (3)
ARM in Embedded Applications - David Rose@ARM
24 pages
DELL D620 Schematics Document (UMA) - LA-2791
33% (3)
DELL D620 Schematics Document (UMA) - LA-2791
63 pages
Fundamentals of Operating Systems
No ratings yet
Fundamentals of Operating Systems
18 pages
ASIC RTL Design Engineer
No ratings yet
ASIC RTL Design Engineer
2 pages
OS Practical Slips
No ratings yet
OS Practical Slips
20 pages
8 Hacks To Make Firefox Ridiculously Fast
No ratings yet
8 Hacks To Make Firefox Ridiculously Fast
4 pages
Draft Manual For Quicknsr
No ratings yet
Draft Manual For Quicknsr
3 pages
RC 4000 Software: Multiprogramming System: Per Brinch Hansen (1969)
No ratings yet
RC 4000 Software: Multiprogramming System: Per Brinch Hansen (1969)
45 pages
Iterators: Well What Is Iteration???
No ratings yet
Iterators: Well What Is Iteration???
7 pages
Investigating Frequent System Hang in Samsung Smartphones
No ratings yet
Investigating Frequent System Hang in Samsung Smartphones
9 pages
Software
No ratings yet
Software
3 pages

UNIT IV - Iot - 1

Uploaded by

UNIT IV - Iot - 1

Uploaded by

UNIT IV

DATA ANALYTICS AND SUPPORTING

• Streaming analytics at the edge can be broken

You might also like