IoT With Cloud Computing - Unit 3
IoT With Cloud Computing - Unit 3
PRUDHVI KIRAN P
Assistant Professor, CSE - IOT Dept.
R. V. R. & J. C. College of Engineering
UNIT - 3 [ 13 Periods ]
loT Analytics for Cloud
Review of loT analytics for the cloud, IoT Analytics challenges, Microsoft Azure Analytics
overview, Designing data processing for analytics, Designing visual analytics for loT data,
Data Science for loT, Feature engineering with loT data.
IOT SENSOR 1
ML
IOT SENSOR 2
IOT SENSOR n
Feature engineering can involve the creation of new features from existing data, combining existing
features, or selecting the most relevant features.
Feature engineering is part of the data pre-processing stage in a machine learning development
lifecycle, as mentioned in the above figure. It typically consists of four different techniques;
1. Feature Creation
2. Feature Transformation
3. Features Extraction
4. Feature Selection
1. Feature Creation
• Feature creation is one of the most common techniques in feature engineering. It is also known as
feature construction or feature synthesis. It involves the creation of new features from existing
data that is performed by combining or transforming the existing features. For example, if you have
a dataset with the birth date of individuals, you can create a new feature age by subtracting it from
the current date.
• This technique is advantageous when the existing features do not provide enough information to
train and develop a robust machine-learning model or when the existing features are not in a
usable format for an ML model development. But it can also be time-consuming and require a
deep understanding of the data and the problem at hand.
2. Feature Transformation
• Transformations are a set of mathematical operations that can be applied to existing features to
create new ones. Some of the most common transformation techniques include data aggregation
(summary), scaling (up & down), normalization (tabulation), binning (clustering), etc.
3. Feature Extraction
• Feature extraction is a feature engineering technique that creates new variables by extracting
them from the raw data. I.e. transforming the original data into a new feature space. The main
objective of this process is to reduce the volume and dimensionality of the input data.
• A few of the most common techniques used for this method include - Principal Component Analysis
(PCA), Cluster Analysis, etc., which are proved efficient in dimensionality reduction of data.
4. Feature Selection
• The feature selection technique involves the selection of a subset of the most relevant features
from the data and dropping the remaining features. It is performed using criteria such as the
importance of the features, the correlation between the features and target variables, etc.
• It can greatly reduce the complexity of the ML model by removing redundant and irrelevant
features from the training process.
Feature engineering is a crucial step in the machine learning development lifecycle; it results in, Better
data representation, Improvement in model performance, Reduced complexity of the model.
PROGRAM ELEMENTS-STRUCTURE FEATURE ENGINEERING - EXAMPLE
For example, if we would like to predict the price of a car, the target variable would be the Market
Value. The predictor variables start as a long list of attributes that, through feature engineering, is
slimmed down and manipulated to produce a set of effective predictor variables.
The process of feature engineering would involve more explanatory questions like;
• Is number of seats a good predictor?
• Should horsepower and torque be separate predictor variables, or do they provide similar
information and only one of them is needed?
As the above example conveys, feature engineering is a process that is highly dependent on the
dataset and the target variables. As a result, there is no single correct method of conducting feature
engineering. Feature engineering is a process that is heavily dependent on the efficiency of the
automated tools used and experience and expertise of the data scientists conducting the analysis.
FEATURE ENGINEERING WITH IOT DATA
Feature engineering in the context of Internet of Things (IoT) data
involves extracting, creating, and transforming relevant features
from raw sensor data collected by IoT devices.
Feature engineering is a crucial step in the process of preparing data
for machine learning, and it's equally important when working with
IoT (Internet of Things) data.
IoT data typically involves a wide range of sensor readings,
timestamps, and potentially other contextual information. The goal
of feature engineering is to extract meaningful and informative
features from this raw data that can be used to build effective
machine learning models in IoT space.
Steps in Feature Engineering
A. Data Preparation
This data pre-processing step consists of collecting, manipulating, and
combining raw data from various sources into a standardized format.
B. Data Cleaning
This step requires handling missing values, removing erroneous and
inconsistent values, etc.
C. Exploratory Data Analysis (EDA)
• EDA involves analysing and investigating datasets through various statistical and visualization
methods.
• It can help data scientists to understand data better, its characteristics, how to manipulate it,
choose the right features for the models, etc.
D. Benchmark/Evaluation
This step involves evaluating the performance of the ML model using the engineered features and
comparing it to the performance of the ML model without feature engineering.
Feature engineering is a critical step in the data preprocessing process when working with IoT
(Internet of Things) data; because IoT or sensor data consists of a very large continuous stream of
data where the time interval between successive updates of the data is very small; usually minutes,
seconds or even milli-seconds.
UNIT - 3 [ 13 Periods ]
END OF UNIT 3