
ICE 516: REGRESSION ANALYSIS

MODERN APPROACHES TO PREDICTIVE MODELING


Learning Objectives
By the end of this lecture, you will:
 Understand the fundamentals of regression analysis.
 Differentiate between linear and logistic regression.
 Learn how to implement both regression types.
 Apply regression concepts to real-world problems.

What is Predictive Modelling?


 The process of using data and statistical algorithms to predict future outcomes.
 Combines statistics, machine learning, and data mining techniques.
 Forms the backbone of modern data-driven decision-making.

Types of Predictive Models


Supervised Learning Models
1. Regression Models
o Linear Regression
o Polynomial Regression
o Ridge/Lasso Regression
o Elastic Net
2. Classification Models
o Logistic Regression
o Decision Trees
o Random Forests
o Support Vector Machines
o Neural Networks
Unsupervised Learning Models
1. Clustering
o K-means
o Hierarchical Clustering
o DBSCAN
2. Dimensionality Reduction
o Principal Component Analysis (PCA)
o t-SNE
o UMAP

The Modelling Process


1. Problem Definition: This is the crucial first step where you establish what you're
trying to achieve
 Define objectives: This involves clearly articulating your model's goals. For
example, are you trying to predict customer churn, classify images, or forecast
sales? You must translate the business problem into a specific modelling task
(classification, regression, clustering, etc.).
 Identify key metrics: Here you determine how success will be measured. For a
classification problem, this might include accuracy, precision, recall, or F1-score.
For regression, you might use MAE, MSE, or RMSE. The choice depends on your
specific use case and business impact
 Establish success criteria: This means setting concrete thresholds for your
metrics that define when the model is suitable for deployment. For example,
"We need 95% accuracy" or "The model must have less than 5% false positives.
2. Data Preparation: This phase focuses on getting your data ready for modelling
 Data collection: Gathering relevant data from various sources (databases,
APIs, files, etc.). This includes understanding data availability, quality, and
accessibility. You might need to set up data pipelines or work with stakeholders
to get access to necessary information.
 Cleaning and preprocessing: This involves handling missing values,
removing duplicates, dealing with outliers, and correcting inconsistencies. You
might need to standardise formats, handle encoding of categorical variables,
and ensure data quality.
 Feature engineering: Creating new features from existing ones to better
capture the underlying patterns.
 Data splitting (train/validation/test): Dividing your data into:
i. Training set (typically 60-80% of data) for model learning
ii. Validation set (10-20%) for hyperparameter tuning.
iii. Test set (10-20%) for final evaluation
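As an illustration, below is a minimal sketch of a 60/20/20 split using scikit-learn's train_test_split; the feature matrix X and target y are placeholders:

import numpy as np
from sklearn.model_selection import train_test_split

X = np.arange(100).reshape(-1, 1)  # placeholder features
y = np.arange(100)                 # placeholder target

# First carve out 20% of the data for the test set...
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
# ...then 25% of the remaining 80% (i.e. 20% of the total) for validation
X_train, X_val, y_train, y_val = train_test_split(X_train, y_train, test_size=0.25, random_state=42)
# Result: 60% train, 20% validation, 20% test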
3. Model Development: This is where you build and refine your model
 Algorithm selection: Choosing appropriate algorithms based on
i. Problem type (classification, regression, etc.)
ii. Data size and characteristics
iii. Interpretability requirements
iv. Computational constraints
 Hyperparameter tuning: Finding the optimal configuration for your model
using techniques such as grid search, random search, or Bayesian optimisation
(see the sketch after this list).
 Cross-validation: Implementing techniques to ensure model robustness.
 Ensemble creation: Combining multiple models to improve performance.
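As referenced above, here is a minimal sketch of grid-search hyperparameter tuning with built-in 5-fold cross-validation, using a Ridge model on synthetic data; the parameter grid is illustrative, not prescriptive:

from sklearn.datasets import make_regression
from sklearn.linear_model import Ridge
from sklearn.model_selection import GridSearchCV

# Synthetic regression data for illustration only
X, y = make_regression(n_samples=200, n_features=5, noise=10, random_state=0)

# Try each regularisation strength with 5-fold cross-validation
param_grid = {'alpha': [0.01, 0.1, 1.0, 10.0]}
search = GridSearchCV(Ridge(), param_grid, cv=5, scoring='neg_mean_squared_error')
search.fit(X, y)
print(search.best_params_, search.best_score_)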
4. Model Evaluation: This phase ensures your model performs as expected
 Performance metrics: Calculating the metrics relevant to your problem type
(classification, regression, or clustering).
 Error analysis: Understanding where and why your model makes mistakes.
 Model interpretation: Making sense of how your model works.
 Validation strategies: Ensuring model generalisation.
5. Deployment: The final phase where your model goes into production
 Model Serving: Setting up the infrastructure to serve predictions.
 Monitoring: Tracking model performance in production
 Maintenance: Keeping the model healthy
 Updates: Continuous improvement process

Model Evaluation and Selection


Evaluation Metrics
1. Regression Metrics: These metrics help evaluate models that predict
continuous values
o Mean Squared Error (MSE): Calculates the average of squared
differences between predicted and actual values
 Formula: MSE = (1/n) Σ(y_true − y_pred)²
 Penalizes larger errors more heavily due to squaring
 Always positive, with 0 indicating perfect predictions
 Units are the square of the target variable's units
o Root Mean Squared Error (RMSE): Square root of MSE
 Formula: RMSE = √MSE
 Returns error in same unit as target variable
 More interpretable than MSE
 Commonly used in practice
 Like MSE, penalizes larger errors more heavily
o Mean Absolute Error (MAE): Average of absolute differences between
predicted and actual values
 Formula: MAE = (1/n) Σ|y_true − y_pred|
 More robust to outliers than MSE/RMSE
 Easier to interpret as average error magnitude
 Treats all errors linearly
o R-squared (R²): Proportion of variance in the dependent variable explained
by model
 Formula: R² = 1 − (SS_res / SS_tot)
 Typically ranges from 0 to 1 (1 being a perfect fit)
 Can be negative if the model performs worse than a horizontal line
 Useful for comparing models on same dataset

2. Classification Metrics
o Accuracy
o Precision
o Recall
o F1-Score
o ROC-AUC: Area under Receiver Operating Characteristic curve
 Plots true positive rate vs false positive rate at various thresholds.
 Range is 0 to 1 (0.5 is random, 1 is perfect)
 Threshold-independent metric
 Good for imbalanced classes
o Precision-Recall Curves: Plots precision vs recall at various thresholds
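A minimal sketch computing the metrics above with scikit-learn; the predictions and labels are made up purely for illustration:

import numpy as np
from sklearn.metrics import (mean_squared_error, mean_absolute_error,
                             r2_score, accuracy_score, roc_auc_score)

# Regression metrics on hypothetical predictions
y_true = np.array([3.0, 5.0, 2.5, 7.0])
y_pred = np.array([2.8, 5.4, 2.9, 6.5])
mse = mean_squared_error(y_true, y_pred)
print('MSE:', mse, 'RMSE:', np.sqrt(mse))
print('MAE:', mean_absolute_error(y_true, y_pred))
print('R²:', r2_score(y_true, y_pred))

# Classification metrics on hypothetical labels and predicted probabilities
labels = np.array([0, 1, 1, 0, 1])
scores = np.array([0.2, 0.8, 0.6, 0.4, 0.9])
print('Accuracy:', accuracy_score(labels, (scores >= 0.5).astype(int)))
print('ROC-AUC:', roc_auc_score(labels, scores))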
Model Selection Techniques
1. Cross-validation
o K-fold
o Stratified K-fold
o Time-series cross-validation
2. Hyperparameter Optimization
o Grid Search
o Random Search
o Bayesian Optimization
o Neural Architecture Search
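To illustrate, a minimal sketch of stratified K-fold cross-validation on synthetic data:

from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score

# Synthetic binary classification data for illustration only
X, y = make_classification(n_samples=200, n_features=5, random_state=0)

# Stratified 5-fold CV preserves the class proportions in every fold
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=cv)
print(scores.mean(), scores.std())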

Introduction to Regression Analysis


What is Regression Analysis?
 A statistical technique for modelling relationships between variables
 Used to predict outcomes based on one or more input variables
 Fundamental tool in statistics, data science, and machine learning
 Essential for prediction, forecasting, and understanding variable relationships

Primary Uses
1. Prediction and forecasting (machine learning applications)
2. Measuring variable relationships and influences
3. Data-driven decision making
Linear Regression
Linear regression predicts continuous values by modelling linear relationships
between variables. It's one of the most widely used statistical techniques in data
science.

Types of Linear Regression


1. Simple Linear Regression
 One independent variable
 One dependent variable
 Uses straight-line relationship
2. Multiple Linear Regression
 Multiple independent variables
 One dependent variable
 Creates multidimensional relationship plane

Mathematical Representation of Linear Regression


Basic Formula: Y_i = f(X_i, β) + e_i

Where:
Y_i = the dependent variable
f = the function
X_i = the independent variable
β = the unknown parameters
e_i = the error term

Steps Involved when performing Linear Regression


As the name suggests, the idea behind performing Linear Regression (simple linear
regression) is to come up with a linear equation that describes the relationship
between the dependent and independent variables.
Step 1
Let’s assume that we have a dataset where x is the independent variable and Y is a
function of x (Y=f(x)). Thus, by using Linear Regression we can form the following
equation (equation for the best-fitted line):
Y = mx + c
Y denotes the response (dependent) variable
x denotes the predictor (independent) variable
This is an equation of a straight line where m is the slope of the line and c is the
intercept.
Step 2
Now, to derive the best-fitted line, we first assign random values to m and c and
calculate the corresponding predicted value of Y for each x in the training data.
This predicted Y value is the output value.
Step 3
Now, as we have our calculated output value (let’s represent it as ŷ), we can verify
whether our prediction is accurate or not. In the case of Linear Regression, we
calculate this error (residual) using the MSE method (mean squared error), and we
call it the loss function:
L = (1/n) Σ(y − ŷ)²

Step 4

To achieve the best-fitted line, we have to minimise the value of the loss function. To
minimise the loss function, we use a technique called gradient descent.

Gradient Descent

A cost function is a mathematical formula used to calculate the error: the
difference between the predicted value and the actual value. Our loss function is
the mean squared error, so the error appears in second-order terms. If we plot the
loss function against the weights (in our equation the weights are m and c), we get
a parabolic curve. Since our goal is to minimise the loss function, we have to
reach the bottom of the curve.
To achieve this, we take the first-order derivative of the loss function with
respect to the weights (m and c). We then subtract the derivative, multiplied by a
learning rate (α), from the current weight. We keep repeating this step until we
reach the minimum value (the global minimum). In practice we fix a very small
threshold (for example, 0.0001) as the stopping criterion; if we don't set the
threshold, it may take forever to reach an exact zero value.
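Below is a minimal sketch of Steps 2-4 in NumPy, assuming zero initial weights and a fixed learning rate; the data points are made up for illustration:

import numpy as np

def gradient_descent(x, y, lr=0.01, tol=1e-4, max_iters=10000):
    # Fit Y = m*x + c by minimising the MSE loss L = (1/n) Σ(y − ŷ)²
    m, c = 0.0, 0.0
    n = len(x)
    for _ in range(max_iters):
        y_hat = m * x + c                        # Step 2: predicted outputs
        dm = (-2 / n) * np.sum(x * (y - y_hat))  # dL/dm
        dc = (-2 / n) * np.sum(y - y_hat)        # dL/dc
        m -= lr * dm                             # Step 4: update weights
        c -= lr * dc
        if abs(dm) < tol and abs(dc) < tol:      # stop near the minimum
            break
    return m, c

# Example: points scattered around y = 2x + 1
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([3.1, 4.9, 7.2, 9.0, 10.8])
m, c = gradient_descent(x, y)
print(f'm = {m:.2f}, c = {c:.2f}')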

Step 5

Once the loss function is minimized, we get the final equation for the best-fitted line
and we can predict the value of Y for any given X.

Requirements for Linear Regression


1. Continuous variables
2. Linear relationship between variables
3. Independent observations
4. No significant outliers
5. Homoscedasticity
6. Normal distribution of residuals
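The residual-based requirements (homoscedasticity, normality of residuals) are usually checked after fitting. As a small illustration, with made-up residual values, the Shapiro-Wilk test from SciPy can screen residuals for normality:

import numpy as np
from scipy import stats

# Hypothetical residuals from a fitted linear model
residuals = np.array([1.2, -0.8, 0.3, -1.1, 0.5, -0.2, 0.9, -0.7])

# Shapiro-Wilk test: a large p-value (> 0.05) is consistent with normality
stat, p_value = stats.shapiro(residuals)
print(f'Shapiro-Wilk p-value: {p_value:.3f}')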

Logistic Regression
Overview
Logistic regression is used for classification problems, particularly binary outcomes. It
predicts categorical variables by calculating probabilities.
Types of Logistic Regression
1. Binary Logistic Regression
 Two possible outcomes (Yes/No, 0/1)
 Most common form

2. Multinomial Logistic Regression


 Multiple unordered outcomes
 Example: Transportation type prediction

3. Ordinal Logistic Regression


 Multiple ordered outcomes
 Example: Rating scales (1-5 stars)

Steps involved when performing Logistic Regression

In a logistic regression model, we choose a probability threshold. If the predicted
probability for an observation is higher than the threshold, we classify it into
one class; otherwise, into the other.

Step 1

To calculate the binary separation, first, we determine the best-fitted line by following
the Linear Regression steps.

Step 2

The regression line we get from Linear Regression is highly susceptible to outliers.
Thus it will not do a good job of separating the two classes.

Thus, the predicted value gets converted into probability by feeding it to the sigmoid
function.

The logistic regression hypothesis generalises the linear regression hypothesis by
passing its output through the logistic function, also known as the sigmoid
function (an activation function).

The equation of the sigmoid: S(x) = 1 / (1 + e^(−x))

Thus, if we feed the output value ŷ to the sigmoid function, it returns a
probability value between 0 and 1.
Step 3

Finally, the output value of the sigmoid function gets converted into 0 or 1
(discrete values) based on the threshold value. We usually set the threshold value
to 0.5. In this way, we get the binary classification.
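A minimal sketch of Steps 2-3: feed linear-model outputs through the sigmoid, then threshold at 0.5; the linear outputs here are made up for illustration:

import numpy as np

def sigmoid(z):
    # Maps any real value into the interval (0, 1)
    return 1.0 / (1.0 + np.exp(-z))

# Hypothetical linear-model outputs (ŷ = m*x + c)
y_hat = np.array([-2.0, -0.3, 0.1, 1.5])
probs = sigmoid(y_hat)               # probabilities in (0, 1)
labels = (probs >= 0.5).astype(int)  # 0/1 classes at the 0.5 threshold
print(probs, labels)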

Requirements
1. Binary/categorical dependent variable
2. Independent predictor variables
3. Low/no multicollinearity
4. Large sample size
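As an illustration of the multicollinearity requirement, variance inflation factors (VIFs) can be computed with statsmodels; the predictor values here are made up, and a VIF above roughly 5-10 is commonly read as a warning sign:

import pandas as pd
import statsmodels.api as sm
from statsmodels.stats.outliers_influence import variance_inflation_factor

# Hypothetical predictor matrix
X = pd.DataFrame({
    'age': [45, 52, 38, 60, 42, 55, 49, 61],
    'blood_pressure': [130, 145, 125, 150, 135, 148, 140, 152],
    'cholesterol': [200, 250, 180, 260, 220, 255, 240, 262],
})

# Add a constant so VIFs are computed against an intercept model
X_const = sm.add_constant(X)
vifs = {col: variance_inflation_factor(X_const.values, i)
        for i, col in enumerate(X_const.columns) if col != 'const'}
print(vifs)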

Comparison: Linear vs. Logistic Regression


Similarities
 Both are supervised learning algorithms
 Use parametric regression approaches
 Require training data
 Based on linear relationships

Key Differences
Aspect           | Linear Regression      | Logistic Regression
Output           | Continuous values      | Categorical values
Purpose          | Prediction             | Classification
Function         | Best-fit line          | Sigmoid curve
Loss calculation | Mean squared error     | Maximum likelihood
Application      | Quantitative response  | Binary/categorical response

Modern Applications
Linear Regression
 Price prediction
 Sales forecasting
 Resource allocation
 Performance analysis

Logistic Regression
 Spam detection
 Medical diagnosis
 Credit risk assessment
 Customer behaviour prediction

Practical Applications of Regression (Examples)


1. Real Estate Price Prediction (Linear Regression)
Problem Statement
Predicting house prices based on location, square footage, number of bedrooms, etc.
Implementation
import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
from sklearn.preprocessing import StandardScaler

# Sample real estate data
real_estate_data = {
    'sqft': [1500, 2000, 1800, 2200, 1600],
    'bedrooms': [3, 4, 3, 4, 3],
    'age': [10, 5, 15, 2, 8],
    'location_score': [8, 9, 7, 9, 8],
    'price': [300000, 450000, 320000, 500000, 330000]
}
df = pd.DataFrame(real_estate_data)

# Feature preparation
X = df[['sqft', 'bedrooms', 'age', 'location_score']]
y = df['price']

# Split and scale data
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
scaler = StandardScaler()
X_train_scaled = scaler.fit_transform(X_train)
X_test_scaled = scaler.transform(X_test)

# Train model
model = LinearRegression()
model.fit(X_train_scaled, y_train)

# Example prediction (a DataFrame keeps feature names consistent with the scaler)
new_house = pd.DataFrame([[1900, 3, 5, 8]], columns=X.columns)  # sqft, bedrooms, age, location_score
new_house_scaled = scaler.transform(new_house)
predicted_price = model.predict(new_house_scaled)
print(f'Predicted price: {predicted_price[0]:,.0f}')
2. Medical Diagnosis (Logistic Regression)
Problem Statement
Predicting the likelihood of a disease based on patient symptoms and characteristics.
Implementation
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.preprocessing import StandardScaler

# Sample patient data
patient_data = {
    'age': [45, 52, 38, 60, 42],
    'blood_pressure': [130, 145, 125, 150, 135],
    'cholesterol': [200, 250, 180, 260, 220],
    'has_disease': [0, 1, 0, 1, 0]  # 0: No, 1: Yes
}
df = pd.DataFrame(patient_data)

# Prepare features
X = df[['age', 'blood_pressure', 'cholesterol']]
y = df['has_disease']

# Scale features
scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)

# Train model
model = LogisticRegression()
model.fit(X_scaled, y)

# Predict for new patient (a DataFrame keeps feature names consistent with the scaler)
new_patient = pd.DataFrame([[50, 140, 230]], columns=X.columns)  # age, blood_pressure, cholesterol
new_patient_scaled = scaler.transform(new_patient)
risk_probability = model.predict_proba(new_patient_scaled)[:, 1]
print(f'Disease risk probability: {risk_probability[0]:.2f}')
Recommended Tools
- Python (sklearn, statsmodels)
- R (stats package)
- MATLAB
- Excel (basic analysis)

Further Reading
- Statistical Learning Theory
- Advanced Regression Techniques
- Machine Learning Applications
- Model Optimization Methods
