B.Tech - 5thsem - KCS055 - Unit 2 - 1
1. It is a statistical tool used to understand and quantify the relation between two or more
variables.
4. In addition to helping managers predict such things as future demand for their products,
regression analysis helps fine-tune the manufacturing and delivery process.
Objectives:
a. More specifically, establish whether there is a statistically significant relationship between the two variables.
Objectives:
a. Can we use what we know about the relationship to forecast unobserved values?
b. Examples: What will our sales be over the next quarter? What will the ROI of a new store
opening be, contingent on store attributes?
Variable’s Role:
Dependent Variable:
3. We denote it as Y.
Variable’s Role:
Independent Variable:
y = a + bx
Y = mx + b
Y = beta0 + beta1 x
We call it “linear” because the equation represents a straight line in a two-dimensional plot.
Y = beta0 + beta1 x + epsilon
Y = dependent variable
x = independent variable
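The equation above can be fitted to data with ordinary least squares. The following is a minimal sketch in plain Python, not a definitive implementation: the function names `fit_line` and `predict` and the sample data are assumptions made up for illustration. It estimates beta0 and beta1 and then forecasts Y for an unobserved x.

```python
def fit_line(xs, ys):
    """Estimate (beta0, beta1) by ordinary least squares for Y = beta0 + beta1 x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-deviations and sum of squared deviations of x
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    beta1 = sxy / sxx                 # slope
    beta0 = mean_y - beta1 * mean_x   # intercept
    return beta0, beta1


def predict(beta0, beta1, x):
    """Forecast Y for a given x using the fitted line."""
    return beta0 + beta1 * x


# Hypothetical data: roughly Y = 1 + 2x plus a small error term (epsilon)
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [3.1, 4.9, 7.2, 8.8, 11.0]

beta0, beta1 = fit_line(xs, ys)
forecast = predict(beta0, beta1, 6.0)   # forecast for an unobserved x

# Residuals are the part of each Y that the line does not explain
residuals = [y - predict(beta0, beta1, x) for x, y in zip(xs, ys)]
print(beta0, beta1, forecast)
```

The residuals play the role of epsilon in the model: they sum to approximately zero for a least-squares fit that includes an intercept.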
3. Trend Forecasting
Core Concept: In linear regression the data is modelled using a straight line, but in logistic
regression the log-odds of an event are modelled as a linear function of a combination of
predictor variables, and the probability of the event is obtained by passing that value through
the logistic (sigmoid) function.
Used with: Linear regression is used when the dependent variable is continuous, but logistic
regression is used when the dependent variable is categorical.
Output/Prediction: In linear regression we get the value of the variable, but in logistic
regression we get the probability of occurrence of an event.
Accuracy and Goodness of fit: In linear regression it is measured by the loss (for example,
mean squared error), R-squared, Adjusted R-squared, etc., but in logistic regression it is
measured by Accuracy, Precision, Recall, F1 score, ROC curve and confusion matrix.
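To make the logistic side of this comparison concrete, here is a small sketch in plain Python. The coefficients, data and helper names (`sigmoid`, `predict_proba`, `classification_metrics`) are all hypothetical and chosen only for illustration; the coefficients are assumed rather than fitted. It shows a linear combination being turned into a probability, and the resulting predictions being scored with confusion-matrix metrics.

```python
import math


def sigmoid(z):
    """Logistic function: maps any real number to a value in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def predict_proba(beta0, beta1, x):
    """Probability of the event for a single predictor x."""
    return sigmoid(beta0 + beta1 * x)


def classification_metrics(y_true, y_pred):
    """Confusion-matrix counts plus accuracy, precision, recall and F1."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"tp": tp, "tn": tn, "fp": fp, "fn": fn,
            "accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical (assumed, not fitted) coefficients and labelled data
beta0, beta1 = -4.0, 1.0
xs = [1, 2, 3, 5, 6, 7]
y_true = [0, 0, 1, 0, 1, 1]

probs = [predict_proba(beta0, beta1, x) for x in xs]
y_pred = [1 if p >= 0.5 else 0 for p in probs]   # 0.5 decision threshold
metrics = classification_metrics(y_true, y_pred)
print(metrics)
```

The 0.5 threshold is a common default rather than a requirement; moving it trades precision against recall, which is what the ROC curve summarises.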
Selection Criteria:
2. Data Quality
3. Computational Complexity
Where it is used:
1. R-squared value is a statistical measure of how close the data are to the fitted regression line.
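One common way to compute R-squared is 1 minus the ratio of the residual sum of squares to the total sum of squares. The sketch below assumes that definition; the function name `r_squared` and the sample values are hypothetical.

```python
def r_squared(y_true, y_pred):
    """R-squared = 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / len(y_true)
    # Variation left unexplained by the predictions
    ss_res = sum((y - p) ** 2 for y, p in zip(y_true, y_pred))
    # Total variation of the observations around their mean
    ss_tot = sum((y - mean_y) ** 2 for y in y_true)
    return 1.0 - ss_res / ss_tot


# Hypothetical observed values and three sets of predictions
observed = [2.0, 4.0, 6.0, 8.0]
r2_perfect = r_squared(observed, [2.0, 4.0, 6.0, 8.0])    # predictions match exactly
r2_close = r_squared(observed, [3.0, 4.0, 6.0, 7.0])      # small errors
r2_mean_only = r_squared(observed, [5.0, 5.0, 5.0, 5.0])  # always predict the mean
print(r2_perfect, r2_close, r2_mean_only)
```

A value near 1 means the data lie close to the fitted line, while a value near 0 means the line explains no more than simply predicting the mean.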
Drawbacks:
2. Computation cost.
Drawbacks:
1. One of the problems with linear regression is that it fits a single straight line to your
data, and the form of that line is fixed once the model has been created.
2. Such behavior might be okay when your data follows a linear pattern and does not have much
noise.
3. However, when the dataset is not linear, linear regression tends to underfit the training data.
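The underfitting described above can be seen on a small non-linear dataset. The sketch below uses made-up points lying exactly on Y = x^2 and hypothetical helper names (`fit_line`, `r_squared`); it fits a least-squares line and measures how much of the variation that line explains.

```python
def fit_line(xs, ys):
    """Ordinary least-squares estimates (beta0, beta1) for Y = beta0 + beta1 x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    beta1 = sxy / sxx
    beta0 = mean_y - beta1 * mean_x
    return beta0, beta1


def r_squared(y_true, y_pred):
    """R-squared = 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((y - p) ** 2 for y, p in zip(y_true, y_pred))
    ss_tot = sum((y - mean_y) ** 2 for y in y_true)
    return 1.0 - ss_res / ss_tot


# Hypothetical non-linear data: points on the parabola Y = x^2
xs = [-2.0, -1.0, 0.0, 1.0, 2.0]
ys = [x ** 2 for x in xs]

beta0, beta1 = fit_line(xs, ys)
fitted = [beta0 + beta1 * x for x in xs]
r2 = r_squared(ys, fitted)
print(beta0, beta1, r2)
```

On this symmetric data the best straight line is flat (slope 0), so it explains none of the variation even though the relationship between x and Y is exact. This is an extreme case, chosen to make the effect easy to see.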