0% found this document useful (0 votes)
14 views3 pages

Statistics

The document provides steps to perform multiple linear regression in Excel using student CGPA as the dependent variable and study hours and attendance percentage as independent variables. It collects data, performs the regression analysis, interprets the results including coefficients and R-squared value, and provides an example equation to predict CGPA based on study hours and attendance percentage.

Uploaded by

Abu Saleh Sadhin
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
14 views3 pages

Statistics

The document provides steps to perform multiple linear regression in Excel using student CGPA as the dependent variable and study hours and attendance percentage as independent variables. It collects data, performs the regression analysis, interprets the results including coefficients and R-squared value, and provides an example equation to predict CGPA based on study hours and attendance percentage.

Uploaded by

Abu Saleh Sadhin
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

The example of multiple linear regression in Microsoft Excel using a dataset of

student CGPA (Cumulative Grade Point Average) as the dependent variable and
two independent variables: study hours and attendance percentage. We'll
perform the regression analysis to predict CGPA based on these two factors.

Step 1:

Collect and Prepare my Data Assume that have a dataset with the following
variables:

Dependent variable: "CGPA"

Independent variables: "Study Hours" and "Attendance Percentage"

Step 2:

Input Data into Excel Open Microsoft Excel. Enter my data into a new
spreadsheet.

My data might look like this:

Name CGPA (Y) Study Hour (X1) Attendance Percentage (X2)


Morshed Alom 3.25 7 3
Mahfuz Alom 3.85 7 2
Sadhin 3.70 5 4
Rafsan 3.25 3 6
Thohidul 3.00 4 3

Step 3: Perform Multiple Linear Regression

Select a cell where I want to display the regression results. Let's say I want the
results to start in cell E2. Go to the "Data" tab and click on "Data Analysis" (If I
don't see this option, I may need to enable the "Analysis Tool Pack" add-in in
Excel). Choose "Regression" from the list of analysis tools and click "OK." In the
Regression dialog box:

For "Input Y Range," select the cells containing my dependent variable (e.g.,
B2:B6, which is "CGPA").

1|Page
For "Input X Range," select the cells containing my independent variables (e.g.,
C2:D6, which are "Study Hours" and "Attendance Percentage"). Check the
"Labels" box if your data includes headers. Under "Output Options," specify
where I want the regression results to be displayed (e.g., E2 for "Output Range").
Click "OK."

Step 4: Interpret the Results:

Excel will populate the selected output range with the regression results,
including coefficients, standard errors, R-squared, p-values, and more.

Here's an example of what you might see:

The "Coefficients" section provides the estimated coefficients (intercept, "Study


Hours," and "Attendance Percentage"). The "t-statistics" section shows the t-
statistic for each coefficient. The "P-value" section displays the p-value for each
coefficient. The "R Square" section shows the R-squared value.

Step 5: Interpretation:

The estimated equation for this example would be:

Y= α+ β1(X1) + β2(X2) + ε

[CGPA = 4.05 + 0.30 *{Study Hours} + 0.15 *{Attendance Percentage} ]


2|Page
In this example, the R-squared value is 0.95, indicating that 95% of the variance
in CGPA can be explained by the variables "Study Hours" and "Attendance
Percentage. I can interpret the coefficients: The intercept (4.05) represents the
estimated CGPA when both "Study Hours" and "Attendance Percentage" are
zero. The coefficient for "Study Hours" (0.30) suggests that, on average, a 0.30
increase in CGPA can be attributed to each additional hour of study while holding
attendance percentage constant. The coefficient for "Attendance Percentage"
(0.15) implies that, on average, a 0.15 increase in CGPA can be attributed to each
additional percentage point in attendance while holding study hours constant.

Step 6: Validate the Model

As with any regression analysis, it's important to validate the model's


assumptions and assess its goodness of fit. I can create scatter plots, residual
plots, and conduct hypothesis tests to check the model's validity.
8.00
CGPA (Y) Study Hour (X1) Attendance Percentage (X2)
7.00

6.00

5.00

4.00

3.00

2.00

1.00

0.00
0 1 2 3 4 5 6

This example provides a practical guide to performing multiple linear regression


in Microsoft Excel using a sample dataset of CGPA prediction. Please note that
real-world analyses may involve more complex datasets and additional
considerations.

3|Page

You might also like