0% found this document useful (0 votes)
6 views

BA file (1)

The document is a lab file for the Business Analytics course at Maharaja Agrasen Institute of Technology, detailing various exercises including Goal Seek, VLOOKUP, HLOOKUP, and Data Validation in Excel. It includes step-by-step instructions for using these functions to analyze data and solve problems. The exercises aim to enhance students' understanding of business analytics tools and their applications in real-world scenarios.

Uploaded by

radhika6902
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
6 views

BA file (1)

The document is a lab file for the Business Analytics course at Maharaja Agrasen Institute of Technology, detailing various exercises including Goal Seek, VLOOKUP, HLOOKUP, and Data Validation in Excel. It includes step-by-step instructions for using these functions to analyze data and solve problems. The exercises aim to enhance students' understanding of business analytics tools and their applications in real-world scenarios.

Uploaded by

radhika6902
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 131

MAHARAJA AGRASEN INSTITUTE OF TECHNOLOGY

(Approved by AICTE & Affiliated to GGSIP University, New Delhi)

Department of Management (MBA)


Business Analytics Lab File
MS-265: Business Analytics
Lab

Submitted By: Submitted To:

Student Name: Divneet Kaur Shah Faculty Name: Ms. Anjli Gupta
Enrollment No.:01114803922 Designation: Assistant Professor

Class : MBA 3A

Batch: 2022-2024

PSP area, Plot No.-1 Sector-22, Rohini, New Delhi – 110085


Ph. No : 8448186936, Website: www.mba.mait.ac.in
INDEX

S. No. Assignment Date Remarks


1. Exercise 1- Goal Seek
2. Exercise 2- VLOOKUP HLOOKUP
3. Exercise 3- Data Validation
4. Exercise 4- Correlation
5. Exercise 5- Simple Regression
6. Exercise 6- Multiple Regression
7. Exercise 7- Pivot Table and Chart
8. Exercise 8- IQR Outliers
9. Exercise 9- Google Dashboard
10. Exercise 10- Power BI
EXERCISE 1 GOAL SEEK
Goal seeking is the process of finding the correct input value when only the output
is known. The function of goal seeking can be built into different kinds of
computer software programs like Microsoft Excel.
Goal seeking is a general term used to describe the process involved in figuring out
your input value based on an already known output value. The process involves
using a specific operator in a formula, which can be calculated using computer
software. Goal seeking is one of the tools used in "what-if analysis" on computer
software programs. A what-if analysis is a process of changing values in
(Microsoft Excel) cells to see how these changes will affect formula outcomes on
the worksheet. When you are goal seeking, you are performing what-if analysis on
a given value, or the output. So, in essence, you would be creating a scenario by
asking "what if the output was X"—or basically, a cause-and-effect situation.

How Goal Seeking Work in Microsoft Excel


As mentioned above, goal seeking software will only work if you already know the
output value (or the result) but want to determine one input value. Here are some
key steps you can follow if you want to use the goal seek feature in the program:

• Open up a new spreadsheet


• Label your columns. This will make it easier for you to read everything in
the worksheet. So, using the example from above, the first column will be
“Loan Amount,” the second column would be “Term in Months,” the
third would be
“Interest Rate” and the final one would be “Payment.”
• Type in the values that you know.
• Enter the formula to calculate the goal — in this case, the payment. You
can ignore the interest rate for now, which the formula will assume is 0
percent.

Goal Seek Parameters

The parameters to be specified in the “Goal Seek” dialog box are stated as follows:
• Set cell–In this box, enter the cell reference of the formula to be resolved.
This is the target cell or the formula cell.
• To value–In this box, enter the desired output to be achieved. This is
the value of the “set cell” to be attained.
• By changing cell–In this box, enter the reference of the input cell
(variable) to be adjusted. This is the cell we want to change in order to
impact the “set cell.”

Note: At a given time, Goal Seek works with only one variable input. For
working with multiple input values, use the solver in Excel.

The Rules Governing the Parameters

• The “set cell” should always contain a formula.


• The formula of the “set cell” should depend, whether directly or indirectly,
on the “by changing cell.” This is necessary to study the impact on the “set
cell.”
• The “by changing cell” should not contain any special characters.

Question.1

Aryan Ltd. Is trading with generators. The price of each generator is Rs 18,000,
and the quantity sold is 100 pcs. We can see that the company is suffering a loss of
13.8 lacs. It is identified that the maximum price for which a generator can be sold
is Rs. 18000.

Ques. Using Goal seek function, you are required to identify the quantity of
generators that can be sold, which will return the break-even value (No Profit No
Loss). List all the steps with screenshots.

STEPS
STEP 1-Enter the value according to table and use formula given above.

Step 2 – click on data opting then select what if analysis Step

3 – Select Goal seek option .

Step 4 - Go to Data Tab > What If Analysis > Goal Seek.

The Goal Seek dialog box will appear.

Step 5 - Enter 0 in “To Value” box. Select the cell F3 in “By Changing cell” box.
Step 6 – Press ok . it will automatically replace quantity according to the profit.

INTERPRETATION:

It is found that 185.18 quantities are to be sold to achieve the break even point
i.e., No profit No loss situation.

Question.2

Suppose you have a data set as shown below:


The following formula is used to calculate the Monthly Payment:

=PMT(B2/12,B3,B1)

Ques. Using goal seek function calculate the following :

1) What should be the interest rate if monthly payment is set to 2000


2) What should be the number of monthly payments if monthly payment is to
be reduced to 1500?
3) What will be the loan amount at interest rate 6% and 60 months period,
but monthly payment is set to 1200?

Steps to perform Goal Seek Function.

1. Enter the values in Excel.


2. ENTER THIS formula in G5
3. Formula is =PMT(G3/12,G4,G2)

Ques. 1. What should be the interest rate if monthly payment is set to 2000?
STEPS:
4. Select the cell C7.
5. Go to Data Tab > What If Analysis > Goal
Seek. The Goal Seek dialog box will appear.

6. Enter 2000 in “To Value” box.


7. Select the cell C4 in “By Changing cell” box.
INTERPRETATION:
Interest Rate should be 7% if monthly payment is set to 2000.
Ques. 2. What should be the number of monthly payments if monthly payment
is to be reduced to 1500?
STEPS:
1. Select the cell C7.
2. Go to Data Tab > What If Analysis > Goal Seek.
3. The Goal Seek dialog box will appear. Enter 1500 in “To Value” box.
4. Select the cell C6 in “By Changing cell” box.

INTERPRETATION:

Number of monthly payments should be 81.29 if monthly payment is set to 1500.


Ques. 3. What will be the loan amount at interest rate 6% and 60 months
period, but monthly payment is set to 1200?
STEPS:
1. Select the cell C7.
2. Go to Data Tab > What If Analysis > Goal Seek.
3. The Goal Seek dialog box will appear.
4. Enter 1200 in “To Value” box.
5. Select the cell C4 in “By Changing cell” box.
6. Click OK.

INTERPRETATION:
The loan amount should be 62070.673 to achieve the monthly payment as 1200.
EXERCISE 2 VLOOKUP HLOOKUP

What is VLOOKUP in Excel?

The VLOOKUP Function[1] in Excel is a tool for looking up a piece of information


in a table or data set and extracting some corresponding data/information. In
simple terms, the VLOOKUP function says the following to Excel: “Look for this
piece of information (e.g., bananas), in this data set (a table), and tell me some
corresponding information about it (e.g., the price of bananas)”.

VLOOKUP Formula

=VLOOKUP(lookup_value, table_array, col_index_num, [range_lookup])

To translate this to simple English, the formula is saying, “Look for this piece of
information, in the following area, and give me some corresponding data from
another column”.

The VLOOKUP function uses the following arguments:

1. Lookup_value (required argument) – Lookup_value specifies the value


that we want to look up in the first column of a table.
2. Table_array (required argument) – The table array is the data array that is
to be searched. The VLOOKUP function searches in the left-most column
of this array.
3. Col_index_num (required argument) – This is an integer, specifying the
column number of the supplied table_array, that you want to return a
value from.
4. Range_lookup (optional argument) – This defines what this function
should return in the event that it does not find an exact match to the
lookup_value. The argument can be set to TRUE or FALSE, which means: o
TRUE –
Approximate match, that is, if an exact match is not found, use the
closest match below the lookup_value. o FALSE – Exact match, that is,
if an exact match not found, then it will return an error.

What is the HLOOKUP Function?

HLOOKUP stands for Horizontal Lookup and can be used to retrieve information
from a table by searching a row for the matching data and outputting from the
corresponding column. While VLOOKUP searches for the value in a column,
HLOOKUP searches for the value in a row.

Formula

=HLOOKUP(value to look up, table area, row number)

Important points to keep in mind about HLOOKUP

1. It is a case-insensitive lookup. It will consider, for example, “TIM”


and “tim” as the same.
2. The ‘Lookup_value’ should be the topmost row of the ‘table_array’ when
we are using HLOOKUP. If we need to look somewhere else, then we
must use another Excel formula.
3. HLOOKUP supports wildcard characters such as ‘*’ or ‘?’ in
the ‘lookup_value’ argument (only if ‘lookup_value’ is text).

Question.1

From the data provided to you in the excel sheet salarycomputation.xls

Create Salary slip of the employees where name of the employee is to be


entered and remaining all values should be picked up from the data using
VLookup & HLookup from the data provided in the data set sheet. After
coding
Name: ENTER THE NAME
Basic Payment
Dearness Allowance Rs. Value using Lookup Function
House Rent Allowance Rs. Value using Lookup Function
Conveyance Rs. Value using Lookup Function
Medical Expenses Rs. Value using Lookup Function
Special Rs. Value using Lookup Function
Bonus Rs. Value using Lookup Function
TA Rs. Value using Lookup Function

Total Gross Salary Add all above values


Less : Deductions
a.Contribution to PF Rs. Value using Lookup Function
b.Salary Advance Rs. Value using Lookup Function
c.Profession Tax Rs. Value using Lookup Function
d.TDS Rs. Value using Lookup Function
Total Deductions Add all deductions

NET SALARY AFTER Compute net salary i.e.


DEDUCTION Gross Salary - Deductions

STEPS:

1. Enter the required heads in the Excel sheet.

2. Go to Data > Data Validation.


3. On settings tab, in “Allow” box, select “List”.
4. In “Source” box, select the “Name of the Employees”.
5. Click OK.

The drop down list will be created for name of the employees.
6. For Basic Payment, enter the formula

=VLOOKUP(C3,Database,11,FALSE)

7. Press Enter.The value of Basic Payment will be displayed automatically.


For

8. Dearness Allowance, enter the formula

=VLOOKUP(C3,Database,12,FALSE)

9. Press EnterThe value of Dearness Allowance will be


displayed automatically.
For

10. House Rent Allowance, enter the formula

=VLOOKUP(C3,Database,13,FALSE)

11. Press Enter.The value of House Rent Allowance will be


displayed automatically.

12. Conveyance, enter the formula


For

=VLOOKUP(C3,Database,14,FALSE)

13. Press Enter.The value of Conveyance will be displayed automatically.

14. Medical Expenses, enter the formula


For

=VLOOKUP(C3,Database,16,FALSE)

15. Press Enter.


The value of Medical Expenses will be displayed automatically.

16. Special, enter the formula

=VLOOKUP(C3,Database,19,FALSE)
For

17. Press Enter.


The value of Special will be displayed automatically.

18. Bonus, enter the formula

=VLOOKUP(C3,Database,21,FALSE)
For

19. Press Enter.


The value of Bonus will be displayed automatically.

20. TA, enter the formula

=VLOOKUP(C3,Database,22,FALSE)
For

21. Press Enter.


The value of TA will be displayed automatically.
22.In cell E12, enter the formula

23. In cell E13, enter the formula

=SUM(E4:E11)

24. Contribution to PF, enter the formula


For

=VLOOKUP(C3,Database,24,FALSE)

25. Press Enter


The value of Contribution to PF will be displayed automaticallY
For

26. Salary Advance, enter the formula

=VLOOKUP(C3,Database,27,FALSE)

27. Press Enter.


The value of Salary Advance will be displayed automatically.
For

28. Profession Tax, enter the formula

=VLOOKUP(C3,Database,25,FALSE)

29. Press Enter.


The value of Profession Tax will be displayed automatically.
For

30. TDS, enter the formula

=VLOOKUP(C3,Database,26,FALSE)

31. Press Enter


The value of TDS will be displayed automatically.
For
32.In cell D19, enter the formula

33. In cell E20, enter the formula

34.In cell E21, enter the formula for NET SALARY AFTER DEDUCTION
35. Net Salary after Deductions will be displayed as:
EXERCISE 3 DATA VALIDATION

Data validation is a feature in Excel which is used to control what users can enter
into a cell. It allows you to dictate specific rules. It also allows users to display a
custom message if users try to enter invalid data.

What Is Data Validation in Excel?

Data validation in Excel is a technique that restricts user input in a worksheet. It is


often used to limit user entry.

Settings Tab

The settings tab is where you enter the validation criteria. There are eight options
available to validate for user input:

• Any Value - It removes any existing data validation.

• Whole Number - It allows only whole numbers. For example, you


can specify that the user must enter the number between 0 to 30.
• Decimal - The user must enter a number with decimal values.

• List - The user will have to create a drop-down list to choose from.

• Date - The user will have to enter the date format.

• Time - The user should enter a time.

• Text Length - It validates input based on the length of the data.

• Custom - It validates the user input using a custom formula.

Input Message Tab

You can set the input message to explain what data is allowed in a given cell. This
tab is optional.

• Check the 'show input message when the cell is selected'.

• Enter a title.

• Enter an Input message.

Error Alert Tab


You can show the error message if the user tries to enter the invalid data.

On the error message tab:

• Check the ‘Show error alert after invalid data is entered’ box.

• Enter a title.

• Enter an error message.

Now, when you try to enter the value beyond the range, you will get an error
message.

Question.1

Create a datasheet containing following headings for a company recording the


data of the employees as:
• Employee Code
• Employee Name
• Department
• Date of Joining
• Employee Phone Number
• Employee Date of Birth
• Gross Salary of the employee
• Account Number
• Name of the Bank
Following should be the data validation controls that is required to be build
for the sheet.

1. Employee code should be entered from a list of employee codes


(Starting EMP001 to EMP010)
2. Department should be entered from a list of departments
(Sales, Purchase, Finance, HR, Marketing, Operations)
3. Date of Joining must be a date
4. Employee phone number must be 10 digit number
5. Date of birth must be a date
6. Gross Salary must range between 20,000 to 2,00,000
7. Account number should be a 8 digit number
8. Name of the Bank must be picked from a list of banks (Union
Bank, PNB, SBI, Axis Bank, Canara Bank and others)
Run the sheet and enter at least 10 rows of data under the above-mentioned
heads.

Steps to perform Data Validation:

1. Enter the data in different columns.


2. Go to Data Tab.
3. Click on Data Validation.Data Validation dialog box appears.
A.

Employee code should be entered from a list of employee codes (Starting


EMP001 to EMP010)
Steps:
a. Create the list of items.

b. On the settings tab, in the “Allow” box select “List”.


c. In the source box, select the list of items.
d. Click OK
B.

Department should be entered from a list of departments (Sales,


Purchase, Finance, HR, Marketing, Operations) Steps:Create the list of

items.

1. On the settings tab, in the “Allow” box select “List”.

2. In the source box, select the list of items Click OK.


C.

Date of Joining must be a date


1. On the settings tab, in the “Allow” box, select “Date”.
2. In the “Data” box, select “Between”.
3. Enter start and end date. Click OK.

D. Employee phone number must be 10 digit number


1. On the settings tab, in the “Allow” box, select “Text Length”.
2. In the “Data” box, select “Equal to”.
3. In the “Length” box, enter “10”. Click OK.
E. Date of birth must be a date
1. On the settings tab, in the “Allow” box, select “Date”.
2. In the “Data” box, select “Greater than”.
3. Enter start date. Click OK.

F. Gross Salary must range between 20,000 to 2,00,000.


1. On the settings tab, in the “Allow” box, select “Whole Number”.
2. In the “Data” box, select “Between”.
3. Enter minimum and maximum values. Click OK.
G. Account number should be a 8 digit number.
1. On the settings tab, in the “Allow” box, select “Text Length”.
2. In the “Data” box, select “Equal to”.
3. In the “Length” box, enter 8. Click OK.

H. Name of the Bank must be picked from a list of banks (Union Bank,
PNB, SBI, Axis Bank, Canara Bank and others) 1. Create the list of items.

2. On the settings tab, in the “Allow” box, select “List”.


3. In the “Source” box, select the list of items.
4. Click OK.

Run the sheet and enter at least 10 rows of data under the above-mentioned
heads.
Values have been entered in 10 rows.
EXERCISE 4 CORRELATION

What is correlation?
Correlation is a statistical measure that expresses the extent to which two variables
are linearly related (meaning they change together at a constant rate). It’s a
common tool for describing simple relationships without making a statement about
cause and effect.
Correlations describe data moving together

Correlations are useful for describing simple relationships among data. For
example, imagine that you are looking at a dataset of campsites in a mountain park.
You want to know whether there is a relationship between the elevation of the
campsite (how high up the mountain it is), and the average high temperature in the
summer.

For each individual campsite, you have two measures: elevation and temperature.
When you compare these two variables across your sample with a correlation, you
can find a linear relationship: as elevation increases, the temperature drops. They
are negatively correlated.

What do correlation numbers mean?

We describe correlations with a unit-free measure called the correlation


coefficient which ranges from -1 to +1 and is denoted by r. Statistical significance
is indicated with a p-value. Therefore, correlations are typically written with two
key numbers: r = and p =.

• The closer r is to zero, the weaker the linear relationship.


• Positive r values indicate a positive correlation, where the values of
both variables tend to increase together.
• Negative r values indicate a negative correlation, where the values of one
variable tend to increase when the values of the other variable decrease.
• The p-value gives us evidence that we can meaningfully conclude that the
population correlation coefficient is likely different from zero, based on
what we observe from the sample.
• "Unit-free measure" means that correlations exist on their own scale: in our
example, the number given for r is not on the same scale as either elevation
or temperature. This is different from other summary statistics. For
instance, the mean of the elevation measurements is on the same scale as its
variable.

A correlation coefficient of +1 indicates a “perfect positive correlation”, which


means that as variable X increases, variable Y increases at the same rate. A
correlation value of -1, meanwhile, is a “perfect negative correlation”, which
means that as variable X increases, variable Y decreases at the same rate.
Correlation analysis may also return results anywhere between -1 and +1, which
indicates that variables change at similar but not identical rates.

Correlation values can help businesses evaluate the impact of specific actions on
other actions. For example, companies may find that as spending on social media
marketing increases, so does customer engagement, indicating that more spending
might make sense.

Or they may find that specific advertising campaigns result in a correlated decrease
of customer engagement, in turn suggesting the need for a reevaluation of current
efforts. The discovery that variables do not correlate can also be valuable; while
common sense might suggest that a new function or feature in your product would
correlate with increased engagement, it might have no measurable impact.
Correlation analysis allows companies to view this relationship (or lack thereof)
and make sound strategic decisions.

Using CORREL function

In Excel to find the correlation coefficient use the formula :


=CORREL(array1,array2)
array1 : array of variable x array2:
array of variable y

To insert array1 and array2 just select the cell range for both.

Question.1

Create a dataset as given below:


Find the correlation of Variable X with Y1, Y2 and Y3.
Steps to calculate correlation:

1. Enter the formula of correlation

=CORREL(B5:B9,C5:C9)

2. Press Enter.
3. Enter the formula of correlation

=CORREL(B5:B9,D5:D9)

4. Press Enter.
5. Enter the formula of correlation

=CORREL(B5:B9,E5:E9)

6. Press Enter.
EXERCISE 5 SIMPLE REGRESSION

Simple linear regression is used to estimate the relationship between two


quantitative variables. You can use simple linear regression when you want to
know:

1. How strong the relationship is between two variables (e.g., the relationship
between rainfall and soil erosion).
2. The value of the dependent variable at a certain value of the independent
variable (e.g., the amount of soil erosion at a certain level of rainfall).

Regression models describe the relationship between variables by fitting a line to


the observed data. Linear regression models use a straight line, while logistic and
nonlinear regression models use a curved line. Regression allows you to estimate
how a dependent variable changes as the independent variable(s) change.

What Is Linear Regression?

Linear regression is a type of data analysis that considers the linear relationship
between a dependent variable and one or more independent variables. It is
typically used to visually show the strength of the relationship or correlation
between various factors and the dispersion of results – all for the purpose of
explaining the behavior of the dependent variable. The goal of a linear regression
model is to estimate the magnitude of a relationship between variables and
whether or not it is statistically significant.

Say we wanted to test the strength of the relationship between the amount of ice
cream eaten and obesity. We would take the independent variable, the amount of
ice cream, and relate it to the dependent variable, obesity, to see if there was a
relationship. Given a regression is a graphical display of this relationship, the
lower the variability in the data, the stronger the relationship and the tighter the fit
to the regression line.

In finance, linear regression is used to determine relationships between asset prices


and economic data across a range of applications. For instance, it is used to
determine the factor weights in the Fama-French Model and is the basis for
determining the Beta of a stock in the capital asset pricing model (CAPM).
Here, we look at how to use data imported into Microsoft Excel to perform a linear
regression and how to interpret the results.
Assumptions of simple linear regression
Simple linear regression is a parametric test, meaning that it makes certain
assumptions about the data. These assumptions are:

1. Homogeneity of variance (homoscedasticity): the size of the error in our


prediction doesn’t change significantly across the values of the independent
variable.
2. Independence of observations: the observations in the dataset were
collected using statistically valid sampling methods, and there are no hidden
relationships among observations.
3. Normality: The data follows a normal distribution.

Linear regression makes one additional assumption:

4. The relationship between the independent and dependent variable is linear:


the line of best fit through the data points is a straight line (rather than a
curve or some sort of grouping factor).

If your data do not meet the assumptions of homoscedasticity or normality, you


may be able to use a nonparametric test instead, such as the Spearman rank test.

How to perform a simple linear regression

Simple linear regression formula


The formula for a simple linear regression is:

• y is the predicted value of the dependent variable (y) for any given value of
the independent variable (x).
• B0 is the intercept, the predicted value of y when the x is 0.
• B1 is the regression coefficient – how much we expect y to change as x
increases.
• x is the independent variable ( the variable we expect is influencing y).
• e is the error of the estimate, or how much variation there is in our
estimate of the regression coefficient.
Linear regression finds the line of best fit line through your data by searching for
the regression coefficient (B1) that minimizes the total error (e) of the model.

While you can perform a linear regression by hand, this is a tedious process, so
most people use statistical programs to help them quickly analyze the data.

Outputting a Regression in Excel


The first step in running regression analysis in Excel is to double-check that the
free Excel plugin Data Analysis ToolPak is installed. This plugin makes calculating
a range of statistics very easy. It is not required to chart a linear regression line, but
it makes creating statistics tables simpler. To verify if installed, select "Data" from
the toolbar. If "Data Analysis" is an option, the feature is installed and ready to use.
If not installed, you can request this option by clicking on the Office button and
selecting "Excel options".

Given the S&P 500 returns, say we want to know if we can estimate the strength
and relationship of Visa (V) stock returns. The Visa (V) stock returns data
populates column 1 as the dependent variable. S&P 500 returns data populates
column 2 as the independent variable.

1. Select "Data" from the toolbar. The "Data" menu displays.


2. Select "Data Analysis". The Data Analysis - Analysis Tools dialog
box displays.
3. From the menu, select "Regression" and click "OK".
4. In the Regression dialog box, click the "Input Y Range" box and select
the dependent variable data (Visa (V) stock returns).
5. Click the "Input X Range" box and select the independent variable data
(S&P 500 returns).
6. Click "OK" to run the results.

Question.1

Create a dataset as given below:


Assuming Mask sold to be a dependent variable and COVID cases to be an
independent variable. Compute Regression using data analytics.

Steps to perform Regression:

1. In the Data Tab, click on Data Analysis.


The Data Analysis dialog box will
appear.
2. Select Regression.
3. Click OK.
The Regression dialog box will appear.

4. In “Input Y Range” box, select the range of Masks Sold.


5. In “Input X Range” box, select the range of Covid Cases.
6. Check the “Labels” box,
7. In “Output Range” box, select any cell on the sheet.
8. Click OK.
The Regression summary will be shown as:
EXERCISE 6 MULTIPLE REGRESSION

What Is Multiple Linear Regression (MLR)?


Multiple linear regression (MLR), also known simply as multiple regression, is a
statistical technique that uses several explanatory variables to predict the outcome
of a response variable. The goal of multiple linear regression is to model the linear
relationship between the explanatory (independent) variables and response
(dependent) variables. In essence, multiple regression is the extension of ordinary
least-squares (OLS) regression because it involves more than one explanatory
variable.

Formula and Calculation of Multiple Linear Regression


yi=β0+β1xi1+β2xi2+...+βpxip+ϵ where, for i=n
observations:
yi = dependent variable xi = explanatory variables β0 =
y-intercept (constant term) βp = slope coefficients for
each explanatory variable ϵ = the model’s error term
(also known as the residuals)

What Multiple Linear Regression Can Tell You


Simple linear regression is a function that allows an analyst or statistician to make
predictions about one variable based on the information that is known about
another variable. Linear regression can only be used when one has two continuous
variables—an independent variable and a dependent variable. The independent
variable is the parameter that is used to calculate the dependent variable or
outcome. A multiple regression model extends to several explanatory variables.

The multiple regression model is based on the following assumptions:

• There is a linear relationship between the dependent variables and the


independent variables
• The independent variables are not too highly correlated with each other
• yi observations are selected independently and randomly from the population
• Residuals should be normally distributed with a mean of 0 and variance σ

The coefficient of determination (R-squared) is a statistical metric that is used to


measure how much of the variation in outcome can be explained by the variation in
the independent variables. R2 always increases as more predictors are added to the
MLR model, even though the predictors may not be related to the outcome
variable.

R2 by itself can't thus be used to identify which predictors should be included in a


model and which should be excluded. R2 can only be between 0 and 1, where 0
indicates that the outcome cannot be predicted by any of the independent variables
and 1 indicates that the outcome can be predicted without error from the
independent variables.

When interpreting the results of multiple regression, beta coefficients are valid
while holding all other variables constant ("all else equal"). The output from a
multiple regression can be displayed horizontally as an equation, or vertically in
table form.

Outcome 1: Find simple regression. Find intercept and coefficient of


independent variable (Model A only).

1. In the Data Tab, Click on Data Analysis.


2. Select Regression. Click OK
3. In “Input Y Range” box, select the range of Cost.
4. In “Input X Range” box, select the range of Model A.
The Regression Summary will be shown as:
Interpretation:

INTERCEPT = 41119.21776

COEFFICIENT = 3.816988178

Outcome 2: Find multiple regressions. Find intercept and coefficient of


independent variables (Model A, B and C).

1. In the Data Tab, Click on Data Analysis.


2. Select Regression. Click OK.
3. In “Input Y Range” box, selEct the range of Cost.
4. In “Input X Range” box, select the range of Model A, B and C.
The Regression Summary will be shown as:
We will check the p values of all the models.

Model A = 0.62825

Model B = 0.015696

Model C = 0.015588

The p value of Model A is greater than 0.15.

So, we will exclude Model A from the regression analysis and perform the
regression again.

1. In the Data Tab, Click on Data Analysis.


2. Select Regression. Click OK.
3. In “Input Y Range” box, select the range of Cost. 4. In “Input X Range”
box, select the range of Model B and C only.
The Regression Summary will be shown as:
INTERPRETATION:

The intercept and coefficients of Model B and Model C are as follows:

INTERCEPT = 35196.26124

COEFFICIENTS

MODEL B = 5.416562594

MODEL C = 5.689764974
Outcome 3: Predict monthly cost for 1200 Model A, 800 Model B, 100 C.

To calculate the monthly cost we will use the following formula

Y = Constant + B1*(X1) + B2*(X2) + … + BnXn


1. Enter the formula

=I43+(800*I44)+(100*I45)

2. Press EnterThe monthly cost will be displayed.

INTERPRETATION:

The monthly cost at 1200 Model A, 800 Model B and 100 Model C is found to be
40098.49.

Outcome 4: Predict monthly cost for 1250 Model A, 850 Model B, 150 C.

To calculate the monthly cost we will use the following formula

Y = Constant + B1*(X1) + B2*(X2) + … + BnXn


1. Enter the formula

=I43+(850*I44)+(150*I45)

2. Press Enter.

The monthly cost will be displayed.

INTERPRETATION:
The monthly cost at 1250 Model A, 850 Model B and 150 Model C is found to be
40653.8.

Outcome 5: Draw Scatter Model by using X and Y variables.

1. On the Insert Tab, Click on Scatter Chart.


2. Click on Select Data.

The Select Data Source dialog box appears.

3. Select the range of all three models.


4. Click OK.
5. Scatter Chart will be displayed as:
EXERCISE 7 PIVOT TABLE AND PIVOT CHART

PIVOT TABLE IN EXCEL

A pivot table summarizes the given data set bundled within a grid-like matrix that
helps explore or create reports based on useful information. In particular, it enables
users to extract the data in a customized format (such as reports or dashboards)
from the large, detailed data sets recorded within the Excel sheet.

Unlike the regular Excel reports, the Pivot Tables represent our essential data sets
in an interactive view, allowing us to view our data from a different perspective
with little tricks. We can easily sort & filter data, group data into desired
categories, create charts, break down the data month-wise or year-wise, and
perform complex calculations using various functions or formulas.

The Pivot Table helps us view our data effectively and saves crucial time by
summarizing the data into essential categories. It is a kind of reporting tool and
contains mainly the following four fields:

o Rows: This refers to data taken as a specifier. o Values:


This represents the count of the data. o Filters: This helps us
hide or highlight specific data. o Columns: This refers to
values under various conditions.
Why do we use Pivot Tables in Excel?

Following are some of the scenarios when using the Pivot Tables in Excel can be
an effective solution for us:

o Comparing sales totals of various listed products o


Highlighting the product sales as a percentage portion of the total
sales o Combining duplicate data sets o Adding or inserting the
default values to the empty cells o Counting or showing rows that
have some common data

PIVOT CHART IN EXCEL


A pivot chart is a useful tool, especially when the user is dealing with large
amounts of data. For instance, an XYZ company has 200 employees. The HR has
maintained each candidate's working hours in Excel. Now they want to find the
employee name who has taken the minimum leave in the entire year so you can
reward him for his sincerity and devotion towards the company. If you manually
browse through the entire list, it would be time-consuming or fetched results could
also be inaccurate. Don't worry, because Microsoft Excel has provided a built-in
feature named "pivot table" or a "pivot chart" to cater to such tasks. Pivot Charts
enable instant reorganization and understanding of your data visually, facilitating
the complete process.

Advantages of Pivot Charts


1. Pivot charts are a powerful way of interpreting data pictorially.
2. Pivot charts make the process of visualization of data effortless.
3. Pivot Charts are widely used for Data Analysis.
4. Pivot charts effectively facilitate various data conclusions and determine the
basis of statistical calculations.
5. Pivot Chars efficiently handle massive unsized raw data by correlating them
through Pivot filtering and Pivot slicing.

Limitations of Pivot Charts o Pivot Charts doesn't allow you to create reports
based on Multi-select / Checkbox field types. o Whenever you insert a new field
in an existing Pivot Table for which a Pivot Chart has already been created,
Excel will automatically add the new field in the last column. It is impossible to
change this order or make the added field (column) appear somewhere in the
middle of the remaining columns.

Exercise 7: Pivot Table and Pivot Chart

In your dataset, 213 records and 6 fields are given i.e., Order ID, Product,
Category, Amount, Date and Country. (File – T7 Pivot Chart and Table
213.slxs)

Steps to create Pivot Table:

1. Select the data.


2. Go to Insert Tab.
3. Click on Pivot Table button.

The Pivot table dialog box will appear.

4. Click on “Existing worksheet”.


5. In the “Location” box, select any cell.
6. Click OK.
The Pivot Table will appear on the sheet.

Outcome 1: Find sum of amount of each type of eatable being sold by the
enterprise.

Steps:

1. Drag “Product” field in the “Rows” area.


2. Drag “Amount” field in the “Values” area.

The resulting Pivot table will display the sum of amount of each type of eatable
being sold by the enterprise.
Outcome 2: Find sale of each type of eatable in Canada, Germany and United
States by using filters.

1. Drag “Country” field in the “Filters” area.


2. Drag “Product” field in the “Rows” area.
3. Drag “Amount” field in the “Values” area.
4. In the cell J3, select “Canada” from the drop down menu.

The resulting Pivot Table will display the sale of each eatable in Canada.
5. In the cell J3, select “Germany” from the drop down menu.

The resulting Pivot Table will display the sale of each eatable in Germany.

6. In the cell J3, select “United States” from the drop down menu.

The resulting Pivot Table will display the sale of each eatable in United States.
Outcome 3: Find out the highest sold fruit irrespective of its country.

Steps:

1. Drag “Category” field in the “Filters” area.


2. Drag “Product” field in the “Rows” area.
3. Drag “Count of Product” field in the “Values” area.

The resulting Pivot table will display the count of fruits sold irrespective of the
country.

Interpretation:

It can be concluded that highest sold fruit is “Banana” whose sold count is 71.
Outcome 4: Change summary calculation and find how many orders of
apple have been received from Canada?

Steps:

1. Drag “Country” and “Product” field in the “Filters” area.


2. Drag “Product” field in the “Rows” area.
3. Drag “Count of Product” field in the “Values” area.
4. In the cell J3, select Canada from drop down menu.
5. In the cell I5, select Apple from drop down menu.

The resulting Pivot Table will display the number of orders of apple that have been
received from Canada.

Interpretation:

It can be concluded that 6 orders of apple have been received from Canada.
Outcome 5: Prepare a Pivot Chart (Bar Chart) showing sum of amount
as per each eatable (both fruits and vegetables).

a. Drag “Product” field in the “Rows” area.


b. Drag “Sum of Amount” field in “Values” area.

The resulting Pivot table will display the sum of amount of each eatable.
c. In the Insert Tab, Click on Pivot Chart > Bar Chart.

d. Click OK.

The Pivot Chart showing the sum of amounts of each eatable is as follows:
T-8 IQR OUTLIER
OUTLIERS:

An outlier is an extremely high or extremely low data point relative to the nearest
data point and the rest of the neighbouring co-existing values in a data graph or
dataset you're working with.

Outliers are extreme values that stand out greatly from the overall pattern of values
in a dataset or graph.

Significance of Outliers:

• Outliers badly affect mean and standard deviation of the dataset. These may
statistically give erroneous results.

• Most machine learning algorithms do not work well in the presence of


outlier. So it is desirable to detect and remove outliers.

• Outliers are highly useful in anomaly detection like fraud detection where
the fraud transactions are very different from normal transactions.

How to calculate outliers:

1. Sorting Method

You can sort quantitative variables from low to high and scan for extremely low
or extremely high values. Flag any extreme values that you find.

This is a simple way to check whether you need to investigate certain data
points before using more sophisticated methods.
2. Using visualizations

You can use software to visualize your data with a box plot, or a box-
andwhisker plot, so you can see the data distribution at a glance. This type of
chart highlights minimum and maximum values (the range), the median, and
the interquartile range for your data. Many computer programs highlight an
outlier on a chart with an asterisk, and these will lie outside the bounds of
the graph.

3. Statistical outlier detection

Statistical outlier detection involves applying statistical tests or procedures to


identify extreme values. You can convert extreme data points into z scores
that tell you how many standard deviations away they are from the mean. If
a value has a high enough or low enough z score, it can be considered an
outlier. As a rule of thumb, values with a z score greater than 3 or less than –
3 are often determined to be outliers.

4. Using Inter Quartile Range

The interquartile range (IQR) tells you the range of the middle half of your
dataset. You can use the IQR to create “fences” around your data and then
define outliers as any values that fall outside those fences.
IQR is used to measure variability by dividing a data set into quartiles. The
data is sorted in ascending order and split into 4 equal parts. Q1, Q2, Q3
called first, second and third quartiles are the values which separate the 4
equal parts.

• Q1 represents the 25th percentile of the data.

• Q2 represents the 50th percentile of the data.

• Q3 represents the 75th percentile of the data.

This method is helpful if you have a few values on the extreme ends of your
dataset, but you aren’t sure whether any of them might count as outliers.

Interquartile range method

1. Sort your data from low to high

2. Identify the first quartile (Q1), the median, and the third quartile (Q3).

3. Calculate your IQR = Q3 – Q1

4. Calculate your upper fence = Q3 + (1.5 * IQR)

5. Calculate your lower fence = Q1 – (1.5 * IQR)

6. Use your fences to highlight any outliers, all values that fall outside your
fences.

Your outliers are any values greater than your upper fence or less than your
lower fence.

Exercise 8: Identifying Outliers through Inter-Quartile Range Method

Ques. Identify three data sets (File – T8) and analyze in order to find the
following outcomes:

Data Set 1
Outcome 1: Find Upper and Lower Fence

Steps to calculate Upper and lower fence

1. Select the data.

A2:A16

2. Under Data tab, click on sort option.


3. Click ok.
Data is sorted from low to high.

4. Calculate Q1 using the formula


=QUARTILE.EXC(A2:A16,1)

5. Q1 will be calculated as : 6.
6. Calculate Q3 using the formula

=QUARTILE.EXC(A2:A16,3)

7. Q3 will be calculated as : 39
8. Calculate IQR using the formula

=(E6-E4)
9. IQR will be calculated as : 33

10.Calculate Upper Fence using the

formula: Upper Fence = Q3 + (1.5*IQR)

=E6+(1.5*E8)

11.Upper Fence will be calculated as : 88.5


12.Calculate Lower Fence using the

formula: Lower Fence = Q1 – (1.5*IQR)

=E4-(1.5*E8)
13.Lower Fence will be calculated as : -43.5

INTERPRETATION:
UPPER FENCE = 88.5

LOWER FENCE = -43.5

Outcome 2: Find outliers from the given data set.

Steps to find Outliers from the data set

1. Find out whether the value is an outlier or not by using the formula:
=OR(A2>$E$10,A2<$E$12)

2. TRUE / FALSE value will be shown.


3. Drag the formula to all the columns.

4. The value 104 is found to be the Outlier.


Outcome 3: Remove Outlier and find both the fences in order to find outliers.

Steps –

1. Remove the outlier – 104 from the data set.


2. The new Upper and Lower fences are calculated as:

INTERPRETATION:
UPPER FENCE = 48

LOWER FENCE = -20

Data Set 2

Outcome 1: Find Upper and Lower Fence

Steps to calculate Upper and lower fence

1. Select the data.

A2:A18

2. Under Data tab, click on sort option.


3. Click ok.
Data is sorted from low to high.
4. Calculate Q1 using the formula

=QUARTILE.EXC(A2:A18,1)

5. Q1 will be calculated as : 6.
6. Calculate Q3 using the formula

=QUARTILE.EXC(A2:A18,3)

7. Q3 will be calculated as : 25
8. Calculate IQR using the formula

=(E6-E4)

9. IQR will be calculated as : 19


10.Calculate Upper Fence using the

formula: Upper Fence = Q3 + (1.5*IQR)

=E6+(1.5*E8)

11.Upper Fence will be calculated as : 53.5


12.Calculate Lower Fence using the

formula: Lower Fence = Q1 – (1.5*IQR)

=E4-(1.5*E8)

13.Lower Fence will be calculated as : -22.5


INTERPRETATION:

UPPER FENCE = 53.5

LOWER FENCE = -22.5


Outcome 2: Find outliers
from the given data set.

Steps to find Outliers from the data set

5. Find out whether the value is an outlier or not by using the formula:
=OR(A2>$E$10,A2<$E$12)
6. TRUE / FALSE value will be shown.

7. Drag the formula to all the columns.


8. The values 86 and 99 are found to be the Outliers.

Outcome 3: Remove Outlier and find both the fences in order to find outliers.

Steps –

1. Remove the outliers – 86 and 99 from the data set.


2. The new Upper and Lower fences are calculated as:
INTERPRETATION:

UPPER FENCE = 49

LOWER FENCE = -23

Data Set 3

Outcome 1: Find Upper and Lower Fence

Steps to calculate Upper and lower fence

1. Select the data.

A2:A21
2. Under Data tab, click on sort option.
3. Click ok.
Data is sorted from low to high.
4. Calculate Q1 using the formula

=QUARTILE.EXC(A2:A21,1)
5. Q1 will be calculated as : 29.5.

6. Calculate Q3 using the formula

=QUARTILE.EXC(A2:A21,3)
7. Q3 will be calculated as : 96.75

8. Calculate IQR using the formula

=(E6-E4)
9. IQR will be calculated as : 67.25

10.Calculate Upper Fence using the

formula: Upper Fence = Q3 +

(1.5*IQR)

=E6+(1.5*E8)
11.Upper Fence will be calculated as : 197.625

12.Calculate Lower Fence using the

formula: Lower Fence = Q1 –

(1.5*IQR)

=E4-(1.5*E8)
13.Lower Fence will be calculated as : -71.375

INTERPRETATION:

UPPER FENCE = 197.625


LOWER FENCE = -71.375
Outcome 2: Find outliers
from the given data set.

Steps to find Outliers from the data set

1. Find out whether the value is an outlier or not by using the formula:
=OR(A2>$E$10,A2<$E$12)

9. TRUE / FALSE value will be shown.

10.Drag the formula to all the columns.


11.The values 289 and 355 are found to be the Outliers.

Outcome 3: Remove Outlier and find both the fences in order to find outliers.

Steps –

1. Remove the outliers – 289 and 355 from the data set.
2. The new Upper and Lower fences are calculated as:
INTERPRETATION:

UPPER FENCE = 159.375

LOWER FENCE = -49.625


EXERCISE 9 GOOGLE DASHBOARD
A dashboard is an easy to read, one page summary of the analysis of the
information. It is an overview of your system at a glance. There are many
advantages that result in the utilization of this tool. Some of the most important
benefits are:

1. Customizable.– Dashboards could be customized in terms of users and


expectations. Each decision level dashboard can be customized to present
the most valuable and useful set of information. This allows each person
to see the level of detail that they need in order to get their job done and
meet their goals.

2. All-in-one.- In the past users would spend large amount of time


reviewing and analyzing different reports to end in a final conclusion.
This tool allows to see, at a glance, an overall situation report of the
desired information.

3. Drill into detail.- But, having all-in-one does not means the absence of
details. Dashboards are developed with the ability to get as deeper in
information as required by simply selecting the desired variable or object.

4. Intuitive data presentation.- There is no need for complicated and


exhaustive training. Dashboards are design to be intuitive to any user.
The graphic design allows an easy and smooth navigation throughout the
information.

5. Mobile device accessible.- Most dashboards software are programmed to


suit any mobile device. The idea is to reach anywhere, to everyone, in a
timely manner with the most accurate information.

The dashboard method of reviewing details and viewing the status of operations
provides a significant opportunity to make your business more efficient and
quicker to respond to issues and opportunities.

In computing, Google Dashboard lets users of the Internet view and manage
personal data collected about them by Google. With an account, Google Dashboard
allows users to have a summary view of their Google+, Google location history,
Google
web history, Google Play apps, YouTube and more. Once logged in, it summarizes
data for each product the user uses and provides direct links to the products. The
program allows setting preferences for personal account products.

The only information that is shared with Google Dashboard is information


generated while one is logged into an account. All data in Dashboard is considered
private unless settings are changed. Google allows the user control of all the
information that they provide and allows the data submitted to be purged from each
app.

Two step verification, web history, location history, and preferences are available
for all applications.

USAGE

To access, users sign into a Google account with username and password. Once
signed in, select Google account settings option in top right corner of the web page
and then click on the Dashboard link titled View data stored with the account.
After verifying account password, users can view Dashboard organized according
to the products of use. From Dashboard, users will also be able to view data
associated with the account.

Some applications link to new sections, while other settings stay within the page.
The user also has access, from the dashboard, to special settings. Two-step
verification is an example of this, which requires a verification code to be entered
that is sent to the user's phone when logging into a new machine and everytime
cookies are cleaned.

PURPOSE

Privacy and Convenience

The main purpose of Google Dashboard was that people would have a central
location to see what data has been collected about them. Google Dashboard also
provides users with a way to manage their account for each service they use. Links
are located next to each service, meaning that if a user wanted to manage their
Gmail account, a direct link to their account management page will be located in
their Google Dashboard. Using that link will enable users to manage their privacy
settings or sharing options on the video site. Google Dashboard enables users to
gain access to the company's most often used services, such as Google+, Google
search, Google Maps, YouTube and many more. These items are all supported by
Dashboard and as the user scroll through them, Dashboard displays all the account
settings for each service and any recent activity. For example, Dashboard will
show appointments on the Google Calendar, messages in the Gmail inbox, recently
shared or viewed documents from Google Docs, and status in Google Chat, among
other updates.
If a user is uncomfortable with the presence of their Google web history, they can
remove specific items or clear the entire history using their Google
Dashboard. Every service listed in a user's Dashboard also includes a link to that
service's privacy policies.

Exercise 9: Google Dashboard


In your dataset, detail of 100 employees is given (File – T9)

Outcome 1: Draw a pie chart of male and female recruits in the organization.

1. Select the range of Gender.

2. In the Insert Tab, Click on Charts.


Outcome 2: Draw a bar chart showing source of recruitment of all the
employees.

1. Select the range of Recruitment Source.

2. In the Insert Tab, Click on Charts.


3. In Chart type, select Bar Chart.
Outcome 3: Draw a map depicting location of all the employees.

1. Select the range of City.

2. In the Insert Tab, Click on Charts.


3. In Chart type, select Map.
Outcome 4: Outline the level of employee satisfaction by using a suitable chart.

1. Select the range of EmpSatisfaction.


2. In the Insert Tab, Click on Charts.
3. In Chart type, select Histogram.

Outcome 5: Describe span of management and highlight the name of manager


who has the highest number of employees under him.

1. Select the range of ManagerName.


2. Enter the formula

=INDEX(J2:J101,MODE.SNGL(MATCH(J2:J101,J2:J101,0)))

3. Press Enter.

Interpretation:

It can be concluded that Hadiya Hetal S. is the manager who has the highest
number of employees under hims.
EXERCISE 10 POWER BI
What is Microsoft Power BI?
Microsoft Power BI is a business intelligence (BI) platform that provides
nontechnical business users with tools for aggregating, analyzing, visualizing
and sharing data. Power BI's user interface is fairly intuitive for users familiar
with Excel, and its deep integration with other Microsoft products makes it a
versatile self-service tool that requires little upfront training.

Users can download an application for Windows 10, called Power BI Desktop, and
native mobile apps for Windows, Android and iOS devices. There is also Power
BI Report Server for companies that must maintain their data and reports on
premises. That version of Power BI requires a special version of the desktop app --
aptly called Power BI Desktop for Power BI Report Server.

Common uses of Power BI


Microsoft Power BI is used to find insights within an organization's data. Power BI
can help connect disparate data sets, transform and clean the data into a data model
and create charts or graphs to provide visuals of the data. All of this can be shared
with other Power BI users within the organization.

The data models created from Power BI can be used in several ways for
organizations, including the following:

• telling stories through charts and data visualizations;


• examining "what if" scenarios within the data; and
• creating reports that can answer questions in real time and help
with forecasting to make sure departments meet business metrics.

Power BI can also provide executive dashboards for administrators or managers,


giving management more insight into how departments are doing.
Key features of Power BI
Microsoft has added a number of data analytics features to Power BI since its
inception, and continues to do so. Some of the most important features are the
following:

• Artificial intelligence. Users can access image recognition and text


analytics in Power BI, create machine learning models using automated
ML capabilities and integrate with Azure Machine Learning.
• Hybrid deployment support. This feature provides built-in connectors
that allow Power BI tools to connect with a number of different data
sources from Microsoft, Sales force and other vendors.
• Quick Insights. This feature allows users to create subsets of data
and automatically apply analytics to that information.
• Common data model support. Power BI's support for the common data
model allows the use of a standardized and extensible collection of data
schemas (entities, attributes and relationships).
• Cortana integration. This feature, which is especially popular on
mobile devices, allows users to verbally query data using natural
language and access results using Cortana, Microsoft's digital assistant.
• Customization. This feature allows developers to change the
appearance of default visualization and reporting tools and import new
tools into the platform.
• APIs for integration. This feature provides developers with sample
code and application program interfaces (APIs) for embedding the Power
BI dashboard in other software products.

• Self-service data prep. Using Power Query, business analysts can


ingest, transform, integrate and enrich big data into the Power BI web
service. Ingested data can be shared across multiple Power BI models,
reports and dashboards.
• Modeling view. This allows users to divide complex data models by
subject area into separate diagrams, multiselect objects and set
common
properties, view and modify properties in the properties pane, and set
display folders for simpler consumption of complex data models.
Power BI components
Microsoft Power BI works by connecting data sources and providing a dashboard
of BI to the users. It can connect with just an Excel spreadsheet or bring together
cloud-based and on-premises data warehouses. Data pulled from cloud-based
sources, such as Salesforce CRM, is automatically refreshed.

With applications such as an Excel workbook or Power BI Desktop file connected


to online or on-premises data sources, Power BI users must manually refresh or
setup a refresh schedule to ensure the data in Power BI reports and dashboards use
the most current data available.

Power BI consists of a collection of apps and can be used either on desktop, as a


SaaS product or on a mobile device. Power BI Desktop is the on-premises version,
Power BI Service is the cloud-based offering and mobile Power BI runs on mobile
devices.

The different components of Power BI are meant to let users create and share
business insights in a way that fits with their role.

Included within Power BI are several components that help users create and share
data reports. Those are the following:

• Power Query: a data mashup and transformation tool


• Power Pivot: a memory tabular data modeling tool
• Power View: a data visualization tool
• Power Map: a 3D geospatial data visualization tool
• Power Q&A: a natural language question and answering engine
Additionally, there are dozens of data sources that connect into Power BI, ranging
from files (Excel, PDF, SharePoint, XML), databases (SQL Server Database,
Oracle Database, IBM databases, Amazon Redshift, Google Big Query), other
Power BI data sets, Azure data connections and many online services (Dynamics
365, Salesforce Reports, Google Analytics, Adobe Analytics, Facebook and
others).

Exercise 10: Power BI


As per dataset of 7991customers is given showing sales analysis
report (File – T10 Power BI)

Outcome 1: Draw a card showing total sales in rounded border.


Outcome 2: Draw a bar chart showing sum of total revenue as per each
channel of distribution.
Outcome 3: Draw a Donut chart depicting sum of unit cost by
warehouse code.
Outcome 4: Outline the count of order quantity by month by using funnel
tool.
Outcome 5: Draw a Table showing sum of unit price on monthly basis.

Note:
1. All five outcomes are to be shown on single screen.
2. The color contrast is to be added to make your visualization
more creative and clearer.

Outcome 1: Draw a card showing total sales in rounded border.

1. Under Visualizations tab, Go to Build Visual.


2. Click on Card option.
Select “Sum of Total Revenue” field.
Go to Format Visual > General.
Turn “On” Visual Border.

6. Increase the “Rounded Corners”.

The resulting card will show the total sales with rounded border.
Outcome 2: Draw a bar chart showing sum of total revenue as per each channel
of distribution.

1. Click on Stacked Bar Chart option.

2. Select “Total Revenue” and “Channel” fields.

The resulting bar chart will show the sum of total revenue as per each channel
of distribution.

Outcome 3: Draw a Donut chart depicting sum of unit cost by warehouse code.
1. Click on Donut Chart option.

2. Select “Sum of unit cost” and “Warehouse code” fields.


The resulting donut chart will depict the sum of unit cost by warehouse code.

Outcome 4: Outline the count of order quantity by month by using funnel tool.

1. Click on Funnel option.


2. Select “Order Quantity” and “Month” fields.

The resulting funnel chart will show the count of order quantity.

Outcome 5: Draw a Table showing sum of unit price on monthly basis.

1. Click on Table option.


2. Select “Sum of Unit price” and “Month” fields.

The resulting table will show the sum of unit price on monthly basis.
Colour contrasts are added.

The final visualization is as follows:

You might also like