Feature Engineering For Machine Learning
• Feature engineering is a pre-processing step in machine learning
that transforms raw data into features suitable for building a
predictive model.
• In other words, it is the process of selecting, extracting, and
transforming the most relevant features from the available
data to build more accurate and efficient machine learning
models.
• Feature engineering involves a set of techniques that enable us
to create new features by combining or transforming the
existing ones.
Need for Feature Engineering
• Feature engineering is needed to improve the performance of
machine learning models by providing them with relevant and
informative input data.
• Feature engineering can also help in addressing issues such as
overfitting, underfitting, and high dimensionality.
• Feature engineering is a crucial step in preparing data for
analysis and decision-making in various fields, such as finance,
healthcare, marketing, and social sciences.
Processes Involved in Feature Engineering
• Feature engineering in machine learning consists mainly of five
processes:
1) Feature Creation,
2) Feature Transformation,
3) Feature Extraction,
4) Feature Selection, and
5) Feature Scaling.
• The success of a machine learning model largely depends on
the quality of the features used in the model.
Feature Creation
• Feature Creation is the process of generating new features
based on domain knowledge or by observing patterns in the
data.
• New features are created by combining existing features through
operations such as addition, subtraction, and ratios, which gives
flexibility in how the data is represented.
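The arithmetic combinations described above can be sketched in a few lines of plain Python. This is only an illustration: the loan-style records, the field names (income, debt, expenses), and the derived feature names are all hypothetical and do not come from any particular dataset.

```python
# Hypothetical records; field names and values are made up for illustration.
records = [
    {"income": 5000.0, "debt": 1000.0, "expenses": 3000.0},
    {"income": 4000.0, "debt": 2000.0, "expenses": 3500.0},
]


def add_combined_features(row):
    """Return a copy of row with three features derived from existing ones."""
    out = dict(row)
    # Addition: total money going out.
    out["total_outflow"] = row["debt"] + row["expenses"]
    # Subtraction: what is left after expenses.
    out["disposable_income"] = row["income"] - row["expenses"]
    # Ratio: debt relative to income, guarding against division by zero.
    out["debt_to_income"] = row["debt"] / row["income"] if row["income"] else 0.0
    return out


enriched = [add_combined_features(r) for r in records]

for row in enriched:
    print(row)
```

Each derived column is computed only from columns that already exist, so the original values stay available alongside the new ones.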
Types of Feature Creation:
• Domain-Specific: creating new features based on domain
knowledge
• Data-Driven: creating new features by observing patterns in
the data
• Synthetic: generating new features by combining existing
features
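One way to picture the three types is to derive one feature of each kind from the same small dataset. The sketch below assumes hypothetical retail orders. The field names, the revenue rule, and the idea that weekend orders behave differently are illustrative assumptions, not findings from real data.

```python
from datetime import date

# Hypothetical retail orders; field names and values are invented for illustration.
orders = [
    {"order_date": date(2024, 3, 2), "unit_price": 4.0, "quantity": 3, "discount": 0.5},
    {"order_date": date(2024, 3, 4), "unit_price": 10.0, "quantity": 1, "discount": 0.0},
]


def create_features(order):
    """Return a copy of order with one feature of each creation type."""
    out = dict(order)
    # Domain-specific: domain knowledge says revenue is price times quantity.
    out["revenue"] = order["unit_price"] * order["quantity"]
    # Data-driven: assumes exploration showed weekend orders behave differently.
    # weekday() returns 5 for Saturday and 6 for Sunday.
    out["is_weekend"] = order["order_date"].weekday() >= 5
    # Synthetic: combine two existing features into a single new one.
    out["price_after_discount"] = order["unit_price"] * (1.0 - order["discount"])
    return out


featured = [create_features(o) for o in orders]

for row in featured:
    print(row)
```

In practice the boundaries between the three types overlap. The same derived column can often be justified by domain knowledge and by a pattern seen in the data.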
Benefits of Feature Creation
• Improves Model Performance
• Increases Model Robustness
• Improves Model Interpretability
• Increases Model Flexibility
2. Feature Transformation
2. Ordinal encoding