
Feature and Feature Extraction
What is a Feature?
• A feature is an individual measurable property within a recorded
dataset. In machine learning and statistics, features are often called
“variables” or “attributes.”
• Relevant features have a correlation or bearing on a model’s use case.
• Example:
• In a patient medical dataset, features could be age, gender, blood
pressure, cholesterol level, and other observed characteristics relevant
to the patient.
Feature extraction
• Feature extraction is a process in machine learning and
data analysis that involves identifying and extracting
relevant features from raw data. These features are
later used to create a more informative dataset, which
can be further utilized for various tasks such as:
• Classification
• Prediction
• Clustering
Advantages
• Feature extraction aims to reduce data complexity (often known as
“data dimensionality”) while retaining as much relevant information
as possible.
• This helps to improve the performance and efficiency of machine
learning algorithms and simplify the analysis process.
• Feature extraction may involve the creation of new features
(“feature engineering”) and data manipulation to separate and
simplify the use of meaningful features from irrelevant ones.
Why is Feature Extraction
Important?
• Feature extraction plays a vital role in many real-world
applications. Feature extraction is critical for processes
such as image and speech recognition, predictive
modeling, and Natural Language Processing (NLP).
• In these scenarios, the raw data may contain many
irrelevant or redundant features. This makes it difficult
for algorithms to accurately process the data.
• By performing feature extraction, the relevant features
are separated (“extracted”) from the irrelevant ones.
• With fewer features to process, the dataset becomes simpler, and
the accuracy and efficiency of the analysis improve.
Common Feature Types:
• Numerical Features: Values with numeric types (int, float, etc.).
Examples: age, salary, height.
• Categorical Features: Features that can take one of a limited
number of values. Examples: gender (male, female, X), color (red,
blue, green).
• Ordinal Features: Categorical features that have a clear
ordering. Examples: T-shirt size (S, M, L, XL).
• Binary Features: A special case of categorical features with only
two categories. Examples: is_smoker (yes, no), has_subscription
(true, false).
• Text Features: Features that contain textual data. Textual data
typically requires special preprocessing steps (like tokenization) to
transform it into a format suitable for machine learning models.
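
Each of these types typically needs its own preprocessing before
modeling. A minimal sketch, assuming hypothetical column names, using
pandas and scikit-learn:

```python
# Sketch: preparing each feature type for a model (column names are
# hypothetical examples, matching the types listed above).
import pandas as pd
from sklearn.preprocessing import OrdinalEncoder

df = pd.DataFrame({
    "age": [25, 32, 47],                 # numerical: used as-is
    "color": ["red", "blue", "green"],   # categorical: no inherent order
    "size": ["S", "L", "M"],             # ordinal: S < M < L < XL
    "is_smoker": ["yes", "no", "no"],    # binary: two categories
})

# Categorical: one-hot encode, since the categories have no ordering.
df = pd.get_dummies(df, columns=["color"])

# Ordinal: map categories to integers that respect the ordering.
enc = OrdinalEncoder(categories=[["S", "M", "L", "XL"]])
df["size"] = enc.fit_transform(df[["size"]]).ravel()

# Binary: map the two categories to 0/1.
df["is_smoker"] = (df["is_smoker"] == "yes").astype(int)
print(df)
```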
Feature Normalization
• Since data features can be measured on different scales, it's often
necessary to standardize or normalize them, especially when using
algorithms that are sensitive to the magnitude and scale of variables
(like gradient descent-based algorithms, k-means clustering, or
support vector machines).
• Normalization standardizes the range of independent variables or
features of the data. This process can make certain algorithms
converge faster and lead to better model performance, especially for
algorithms sensitive to the scale of input features.
Feature normalization helps in
the following ways:
• Scale Sensitivity: Features on larger scales can
disproportionately influence the outcome.
For example, if you have a dataset where one feature ranges from 1 to
1000 and another ranges from 0.1 to 1, the model might pay more
attention to the feature with the larger range.
• Better Performance: Normalization can lead to better performance in
many machine learning models by ensuring that each feature
contributes approximately proportionately to the final decision. This
is especially meaningful for optimization algorithms, as they can
achieve convergence more quickly with normalized features.
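
As a sketch of the 1-to-1000 versus 0.1-to-1 example above, here is
how the two most common normalization schemes rescale such data with
scikit-learn:

```python
# Sketch: rescaling two features measured on very different scales.
import numpy as np
from sklearn.preprocessing import MinMaxScaler, StandardScaler

X = np.array([[1.0,    0.1],
              [500.0,  0.5],
              [1000.0, 1.0]])  # feature 1 spans 1-1000, feature 2 spans 0.1-1

# Min-max normalization: each feature is rescaled to the [0, 1] range.
print(MinMaxScaler().fit_transform(X))

# Standardization: each feature gets zero mean and unit variance.
print(StandardScaler().fit_transform(X))
```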
Common Feature Extraction
Techniques
• Autoencoders:
Autoencoders can identify key data features. The autoencoder concept
hinges on learning from the coding of the original data sets to
derive new, more potent features. It achieves this by training a
neural network to recreate its input, which forces it to discover and
exploit structures in the data. Through this process, autoencoders
reduce dimensionality and extract significant features from the data,
contributing to more effective machine-learning models.
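
A minimal Keras sketch of this idea, with illustrative layer sizes:
the network is trained to reconstruct its own input, and the
bottleneck layer then serves as the extracted feature representation:

```python
# Sketch: an autoencoder whose 16-unit bottleneck becomes the features.
import numpy as np
from tensorflow import keras
from tensorflow.keras import layers

inputs = keras.Input(shape=(100,))                        # 100 raw features
encoded = layers.Dense(16, activation="relu")(inputs)     # bottleneck
decoded = layers.Dense(100, activation="sigmoid")(encoded)

autoencoder = keras.Model(inputs, decoded)  # trained to recreate its input
encoder = keras.Model(inputs, encoded)      # reused for feature extraction

autoencoder.compile(optimizer="adam", loss="mse")
X = np.random.rand(256, 100)                # placeholder data
autoencoder.fit(X, X, epochs=5, batch_size=32, verbose=0)

features = encoder.predict(X)               # compressed 16-dim features
print(features.shape)                       # (256, 16)
```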
• Principal Component Analysis (PCA):
This feature extraction method reduces the dimensionality of
large data sets while preserving the maximum amount of
information. Principal Component Analysis emphasizes
variation and captures important patterns and relationships
between variables in the dataset.
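
A minimal scikit-learn sketch, projecting 10-dimensional toy data
onto its two highest-variance directions:

```python
# Sketch: reduce 10 features to the 2 principal components.
import numpy as np
from sklearn.decomposition import PCA

X = np.random.rand(200, 10)            # placeholder data with 10 features
pca = PCA(n_components=2)
X_reduced = pca.fit_transform(X)

print(X_reduced.shape)                 # (200, 2)
print(pca.explained_variance_ratio_)   # variance preserved per component
```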

• Bag of Words (BoW):
BoW is an effective technique in Natural Language Processing (NLP)
where the words (i.e. features) used in a text can be extracted and
classified by their usage frequency. A vector of word counts
represents each document. Machine learning algorithms then use the
word counts as input.
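
A minimal sketch with scikit-learn's CountVectorizer, turning two
short documents into word-count vectors:

```python
# Sketch: Bag of Words as a document-term count matrix.
from sklearn.feature_extraction.text import CountVectorizer

docs = ["the cat sat on the mat", "the dog sat"]
vectorizer = CountVectorizer()
X = vectorizer.fit_transform(docs)          # sparse count matrix

print(vectorizer.get_feature_names_out())   # the vocabulary (features)
print(X.toarray())                          # word counts per document
```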
• Term Frequency-Inverse Document Frequency (TF-IDF):
An extension of BoW, TF-IDF is an NLP feature extraction
technique that uses a numerical statistic to reflect how
important a word is to a document in a collection or corpus.
Compared to BoW, it considers not only the frequency of a word
in a single document, but all other documents in the corpus. This
helps to adjust for the fact that some words appear more
frequently in general.
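
The same two documents run through scikit-learn's TfidfVectorizer; a
word like "the", which appears in every document, is downweighted
relative to its raw count:

```python
# Sketch: TF-IDF weighting over the same tiny corpus.
from sklearn.feature_extraction.text import TfidfVectorizer

docs = ["the cat sat on the mat", "the dog sat"]
tfidf = TfidfVectorizer()
X = tfidf.fit_transform(docs)

print(tfidf.get_feature_names_out())
print(X.toarray().round(2))   # corpus-wide words get lower weights
```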
• Image Processing Techniques:
Image processing techniques involve raw data analysis to
identify and isolate significant characteristics or patterns in an
image. This could involve identifying edges and corners or
extracting features like color, texture, and shape. These features
can then be used for tasks such as image classification, object
detection, and image segmentation.
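
A minimal OpenCV sketch of two such classic features, edges and a
color histogram (the image path is hypothetical):

```python
# Sketch: extract an edge map and a color histogram from an image.
import cv2

img = cv2.imread("example.jpg", cv2.IMREAD_GRAYSCALE)  # hypothetical path
edges = cv2.Canny(img, 100, 200)   # binary edge map from the Canny detector

color = cv2.imread("example.jpg")
# 32-bin intensity histogram of the first (blue) channel.
hist = cv2.calcHist([color], [0], None, [32], [0, 256])
print(edges.shape, hist.shape)
```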
Learning
• Learning is the process through which a system gets trained and
becomes adaptable so that it gives accurate results.
• Learning is the most important phase: how well the system performs
on the data it is given depends on which algorithms are applied to
that data.
• The entire dataset is divided into two parts: one used to train the
model (the training set) and one used to test the model after
training (the testing set).
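
A minimal sketch of that split with scikit-learn (an 80/20 split is a
common convention; the data here is a placeholder):

```python
# Sketch: dividing a dataset into a training set and a testing set.
import numpy as np
from sklearn.model_selection import train_test_split

X = np.random.rand(100, 5)              # placeholder features
y = np.random.randint(0, 2, size=100)   # placeholder labels

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)
print(len(X_train), len(X_test))        # 80 20
```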
1. Supervised Learning
• In supervised learning, the model is trained on a labeled dataset,
which means that each training example is paired with an output
label. The goal is to learn a mapping from inputs to outputs so that
the model can predict the label for new, unseen data.
• Classification: The task of assigning input data to one of several
predefined categories. For example, handwriting recognition, where
the input is an image of a handwritten character and the output is
the corresponding letter; or labeling an input as red or blue.
• Regression: The task of predicting a continuous/real value. For
example, predicting the price of a house based on its features, or
predicting a person's weight.
• Supervised learning involves training a machine from
labeled data.
• Labeled data consists of examples with the correct
answer or classification.
• The machine learns the relationship between inputs
(fruit images) and outputs (fruit labels).
• The trained machine can then make predictions on
new, unlabeled data.
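
A minimal end-to-end sketch: a model is fit on labeled training
examples, then makes predictions on held-out data (the Iris dataset
stands in for any labeled dataset, and logistic regression for any
supervised model):

```python
# Sketch: supervised learning as learning an input -> label mapping.
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)       # labeled examples
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0)

model = LogisticRegression(max_iter=1000)
model.fit(X_train, y_train)             # learn from labeled data
print(model.score(X_test, y_test))      # accuracy on unseen data
```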
• For Regression
• Mean Squared Error (MSE): MSE measures the average
squared difference between the predicted values and the actual
values. Lower MSE values indicate better model performance.
• Root Mean Squared Error (RMSE): RMSE is the square root of
MSE, representing the standard deviation of the prediction
errors. Similar to MSE, lower RMSE values indicate better model
performance.
• Mean Absolute Error (MAE): MAE measures the average
absolute difference between the predicted values and the actual
values. It is less sensitive to outliers compared to MSE or RMSE.
• R-squared (Coefficient of Determination): R-squared
measures the proportion of the variance in the target variable
that is explained by the model. Higher R-squared values indicate
better model fit.
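
All four metrics are available in scikit-learn; a sketch on
illustrative predicted versus actual values:

```python
# Sketch: computing the regression metrics described above.
import numpy as np
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score

y_true = np.array([3.0, 5.0, 7.5, 10.0])   # actual values
y_pred = np.array([2.8, 5.4, 7.0, 10.5])   # model predictions

mse = mean_squared_error(y_true, y_pred)
print("MSE :", mse)
print("RMSE:", np.sqrt(mse))                # square root of MSE
print("MAE :", mean_absolute_error(y_true, y_pred))
print("R2  :", r2_score(y_true, y_pred))
```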
• For Classification
• Accuracy: Accuracy is the percentage of predictions that the model
makes correctly. It is calculated by dividing the number of correct
predictions by the total number of predictions.
• Precision: Precision is the percentage of positive predictions that the
model makes that are actually correct. It is calculated by dividing the
number of true positives by the total number of positive predictions.
• Recall: Recall is the percentage of all positive examples that the
model correctly identifies. It is calculated by dividing the number of
true positives by the total number of positive examples.
• F1 score: The F1 score combines precision and recall into a single
measure. It is calculated by taking the harmonic mean of precision
and recall.
• Confusion matrix: A confusion matrix is a table that shows the
number of predictions for each class, along with the actual class
labels. It can be used to visualize the performance of the model and
identify areas where the model is struggling.
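
A matching sketch for the classification metrics, again with
scikit-learn on illustrative binary labels:

```python
# Sketch: computing the classification metrics described above.
from sklearn.metrics import (accuracy_score, confusion_matrix, f1_score,
                             precision_score, recall_score)

y_true = [1, 0, 1, 1, 0, 1, 0, 0]   # actual classes
y_pred = [1, 0, 1, 0, 0, 1, 1, 0]   # model predictions

print("Accuracy :", accuracy_score(y_true, y_pred))
print("Precision:", precision_score(y_true, y_pred))
print("Recall   :", recall_score(y_true, y_pred))
print("F1       :", f1_score(y_true, y_pred))
print(confusion_matrix(y_true, y_pred))  # rows: actual, columns: predicted
```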
Applications of Supervised
learning
• Spam filtering: Supervised learning algorithms can be trained to identify and
classify spam emails based on their content, helping users avoid unwanted
messages.
• Image classification: Supervised learning can automatically classify images
into different categories, such as animals, objects, or scenes, facilitating tasks
like image search, content moderation, and image-based product
recommendations.
• Medical diagnosis: Supervised learning can assist in medical diagnosis by
analyzing patient data, such as medical images, test results, and patient
history, to identify patterns that suggest specific diseases or conditions.
• Fraud detection: Supervised learning models can analyze financial
transactions and identify patterns that indicate fraudulent activity, helping
financial institutions prevent fraud and protect their customers.
• Natural language processing (NLP): Supervised learning plays a crucial role
in NLP tasks, including sentiment analysis, machine translation, and text
summarization, enabling machines to understand and process human language
effectively.
Advantages of Supervised
learning
• Supervised learning makes use of previously collected, labeled data
to produce outputs informed by that experience.
• Helps to optimize performance criteria with the help of
experience.
• Supervised machine learning helps to solve various
types of real-world computation problems.
• It performs classification and regression tasks.
• It allows estimating or mapping the result to a new
sample.
• We have complete control over choosing the number of
classes we want in the training data.
Disadvantages of Supervised
learning
• Classifying big data can be challenging.
• Training for supervised learning needs a lot of computation time,
so the overall process can be slow.
• Supervised learning cannot handle all complex tasks in
Machine Learning.
• It requires a labelled data set.
• It requires a training process.
2. Unsupervised Learning
• In unsupervised learning, the model is trained on a dataset without
labeled responses. The goal is to infer the natural structure present
within the data: to discover patterns and relationships without any
explicit guidance.
• Clustering: The task of grouping a set of objects in such a way
that objects in the same group (or cluster) are more similar to each
other than to those in other groups. Examples include K-means
clustering and hierarchical clustering, e.g. grouping customers by
purchasing behavior.
• Dimensionality Reduction: The process of reducing the number of
random variables under consideration. Techniques include Principal
Component Analysis (PCA) and t-Distributed Stochastic Neighbor
Embedding (t-SNE).
• Association: discovering rules such as "people that buy X also tend
to buy Y."
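
A minimal K-means sketch with scikit-learn on toy 2-D points:

```python
# Sketch: clustering unlabeled points into two groups.
import numpy as np
from sklearn.cluster import KMeans

X = np.array([[1, 2], [1, 4], [1, 0],      # one group of points
              [10, 2], [10, 4], [10, 0]])  # another group

kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
print(kmeans.labels_)            # cluster assigned to each point
print(kmeans.cluster_centers_)   # learned cluster centers
```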
Application of Unsupervised
learning
• Anomaly detection: Unsupervised learning can identify unusual
patterns or deviations from normal behavior in data, enabling the
detection of fraud, intrusion, or system failures.
• Scientific discovery: Unsupervised learning can uncover hidden
relationships and patterns in scientific data, leading to new hypotheses
and insights in various scientific fields.
• Recommendation systems: Unsupervised learning can identify patterns
and similarities in user behavior and preferences to recommend
products, movies, or music that align with their interests.
• Customer segmentation: Unsupervised learning can identify groups of
customers with similar characteristics, allowing businesses to target
marketing campaigns and improve customer service more effectively.
• Image analysis: Unsupervised learning can group images based on their
content, facilitating tasks such as image classification, object detection,
and image retrieval.
Advantages of Unsupervised learning
• It does not require training data to be labeled.
• Dimensionality reduction can be easily accomplished
using unsupervised learning.
• Capable of finding previously unknown patterns in data.
• Unsupervised learning can help you gain insights from
unlabeled data that you might not have been able to get
otherwise.
• Unsupervised learning is good at finding patterns and
relationships in data without being told what to look for.
This can help you learn new things about your data.
3. Semi-Supervised Learning
• This approach involves using both labeled and unlabeled data for
training. Typically, a small amount of labeled data and a large amount
of unlabeled data are used. This is useful when obtaining a fully
labeled dataset is expensive or time-consuming.
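
A minimal sketch of one semi-supervised approach, scikit-learn's
LabelSpreading: unlabeled examples are marked with -1 and receive
labels propagated from the few labeled ones:

```python
# Sketch: label propagation from 2 labeled points to 4 unlabeled ones.
import numpy as np
from sklearn.semi_supervised import LabelSpreading

X = np.array([[1.0], [1.1], [1.2], [5.0], [5.1], [5.2]])
y = np.array([0, -1, -1, 1, -1, -1])   # -1 marks unlabeled examples

model = LabelSpreading().fit(X, y)
print(model.transduction_)             # inferred labels for all points
```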
4. Reinforcement Learning
• Reinforcement learning is a type of machine learning where an agent
learns to make decisions by taking actions in an environment to
maximize cumulative reward. It is often used in scenarios where the
agent interacts with the environment in a sequential manner, such as
game playing or robotic control.
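
A minimal sketch of the core update behind tabular Q-learning, one
common reinforcement learning algorithm (the environment itself is
left hypothetical; only the update rule is shown):

```python
# Sketch: the Q-learning update that drives the agent's decisions.
import numpy as np

n_states, n_actions = 5, 2
Q = np.zeros((n_states, n_actions))   # value estimate per (state, action)
alpha, gamma = 0.1, 0.9               # learning rate, discount factor

def q_update(state, action, reward, next_state):
    # Move Q(s, a) toward the observed reward plus the discounted
    # value of the best action available in the next state.
    best_next = np.max(Q[next_state])
    Q[state, action] += alpha * (reward + gamma * best_next - Q[state, action])

q_update(state=0, action=1, reward=1.0, next_state=2)
print(Q[0])   # the estimate for (state 0, action 1) moved toward 1.0
```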
5. Feature Extraction and
Selection
• Effective pattern recognition depends on identifying the right features
that capture the underlying structure of the data. Feature extraction
involves transforming raw data into a set of features that can be
effectively used in modeling. Feature selection involves choosing the
most relevant features for the task at hand.
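
A minimal scikit-learn sketch contrasting the two ideas: PCA extracts
new features as combinations of the originals, while SelectKBest
selects a subset of the original features:

```python
# Sketch: feature extraction vs. feature selection on the same data.
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.feature_selection import SelectKBest, f_classif

X, y = load_iris(return_X_y=True)

X_extracted = PCA(n_components=2).fit_transform(X)            # new features
X_selected = SelectKBest(f_classif, k=2).fit_transform(X, y)  # subset
print(X_extracted.shape, X_selected.shape)                    # (150, 2) each
```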
6.Deep Learning
• Deep learning is a subset of machine learning that involves neural
networks with many layers (deep neural networks). These models can
automatically learn hierarchical representations of data, making them
powerful for tasks involving large and complex datasets.
