CCW331 Ba Manual
CCW331 Ba Manual
SEMESTER-VI
Procedure:
Ms-Excel features:
MS Excel is used for processing the data that is in tabular form and then performing
mathematical functions on it to analyze it. This is what the Excel window looks like(version 2007):
Excel is a tool for coordinating and performing calculations on data. It can examine data, compute
statistics, create pivot tables, and express data as a chart or graph. MS Excel performs the following basic
functions:
a) Rows and Columns
In MS Excel, there are rows and columns. The intersection of rows and columns makes a cell. So each
of the cells is an individual unit of data. Each cell has a cell address which is the number of rows and
alphabet of the column it appears in. No two cells have the same address ever.
Font size, font color, font styles, alignment, background color, formatting options and styles,
insertion, deletion, and editing in the cells options are also available.
One can insert images and figures, header, and footer, charts, and spark lines and even attach graphs,
equations, and symbols.
b) Home and Insert
The Home & Insert menu of MS Excel is similar to MS Word. Users can change the formatting of
the content from home & include pie charts, tables, and other files related to data from the insert menu.
c) Formulas
The unique functions that MS Excel has are Formulas & Data. Users can perform the formula on
data to analyze it quickly. Users have to select the cells for that and one cell becomes one unit of data.
So if the user selects 10 cells and applies an average formula to them, the user will get an average
of the data output of those 10 cells.
To apply a formula to any data, the user needs to select it without any space. Then in the function
bar, the user needs to type ‘=’ and the abbreviation of the formula the user wishes to apply.
d) Data
From the Data menu, the user can perform functions without changing the original data. Users can
filter, add external data from the web & sort data without changing it. For example, the user can sort the
data in alphabetical order.
Right from basic functions like addition & subtraction, the user can perform complex statistical
functions like correlation & t-test. Moreover, users can convert them into Pie charts or graphs within
moments. This makes data analysis easy.
e) Page Layout
Users can apply themes, orientation, and check the page setup through the page layout option.
f) Review
Proofreading like spell check can be performed for an excel sheet in the review section and a user
can even add comments or remarks in this part.
g) View
Different views and layouts in which the user wants the spreadsheet to be displayed can be selected
here. Options to zoom in and out, full screen, and pane arrangement are available under this section.
MS Excel Terminologies
Cell- A cell is a rectangle block/box present in a worksheet. Any sort of data that a user wants to enter into
the worksheet must be arranged in a cell.
Cells can be color-coded, show text, numbers, and the outcomes of calculations, based on the operations.
An Active Cell is currently an open-cell for editing.
Workbook- The workbook relates to an Excel spreadsheet record. It includes all the data that have been
listed/sorted/calculated. A workbook that is available to be seen and edited by various users on a network
is recognized as a Shared Workbook.
Worksheet- Inside the workbook is where we get documents called worksheets. Also recognized as
spreadsheets, a user can have multiple worksheets in a single workbook.
Workspace- Very similar to worksheets in a workbook, a workspace enables users to open numerous
filessimultaneously.
Formula Bar- The Formula Bar displays the data of an active cell. In the case of formulas, the formula
barwill showcase all components of the formula.
Toolbar- The toolbar that holds quick shortcuts like save, undo and redo, etc.
Data Validation- This feature of MS Excel helps to limit incorrect data from being inserted into the
worksheet. Data validation aids consistency and precision in the data to be entered.
Ribbon- Over the workbook is a section of command tabs named the Ribbon. Several options are
present behind every tab of the ribbon.
There are several features that are available in Excel to make our task more manageable. Some
of the main features are:
1. AutoFormat: It allows the excel users to use predefined table formatting option.
2. AutoSum: AutoSum feature helps us to calculate the sum of a row or column automatically by
inserting an addition formula for a range of cells.
3. List AutoFill: It automatically develops cell formatting when a new component is added to the
end of a list.
4. AutoFill: This feature allows us to quickly fill cells with a repetitive or sequential record such
as chronological dates or numbers and repeated documents. AutoFill can also be used to copy
functions. We can also alter text and numbers with this feature.
5. AutoShapes: AutoShapes toolbar will allow us to draw some geometrical shapes, arrows,
flowchart items, stars, and more. With these shapes, we can draw our graphs.
6. Wizard: It guides us to work effectively while we work by displaying several helpful tips and
techniques based on what we are doing. Drag and Drop feature will help us to reposition the
record and text by simply dragging the data with the help of the mouse.
7. Charts: This feature will help you to present the data in graphical form by using Pie, Bar, Line
charts, and more.
8. PivotTable: It flips and sums data in seconds and allows us to execute data analysis and
generating documents like periodic financial statements, statistical documents, etc. We can
also analyze complex data relationships graphically.
9. Shortcut Menus: The shortcut menu helps users to make the work done.
RESULT:
GET THE INPUT FROM USER AND PERFORM NUMERICAL OPERATIONS
(MAX,MIN, AVG, SUM, SQRT, ROUND)
Ex.No:2(i)
Date:
Aim:
Procedure:
1. Open a new Excel document in Excel → New
Create a blank workbook or worksheet.
Enter the data to calculate numerical operations
2. MAX: The MAX function in Excel is used to find the highest value in a range of cells. Suppose
wehave a dataset of student test score in cells A1 to A10 and we want to find the highest score.
Select the cell where you want the maximum value to be displayed (let's say it's cell B1).
In cell B1, enter the following formula: =MAX(A1:A10), then Press Enter.
Excel will display the highest value from the specified range (A1 to A10) in cell B1.
3. MIN: The MIN function in Excel is used to find the lowest value in a range of cells. Let's use
thesame dataset of test scores in cells A1 to A10
Select the cell where you want the minimum value to be displayed (let's say it's cell B2).
In cell B2, enter the following formula: =MIN(A1:A10), then Press Enter.
Excel will display the lowest value from the specified range (A1 to A10) in cell B2.
4. AVG: The AVERAGE function in Excel is used to calculate the average (arithmetic mean)
of arange of cells. Let's use the same dataset of test scores in cells A1 to A10
Select the cell where you want the average value to be displayed (let's say it's cell B3).
In cell B3, enter the following formula: =AVERAGE(A1:A10), then Press Enter.
Excel will calculate the average of the values in the specified range (A1 to A10)
and displaythe result in cell B3.
5. SUM: The SUM function in Excel is used to add up the values in a range of cells. Let's
use thesame dataset of test scores in cells A1 to A10
Select the cell where you want the sum to be displayed (let's say it's cell B4).
In cell B4, enter the following formula: =SUM(A1:A10), then Press Enter.
Excel will add up all the values in the specified range (A1 to A10) and display
the total sumin cell B4.
6. SQRT: The SQRT function in Excel is used to calculate the square root of a given number. Let's
say we have a value in cell A1 (let's say, A1 contains 25), and we want to find the square root of
this value.
Select the cell where you want the square root to be displayed (let's say it's cell B5).
In cell B5, enter the following formula: =SQRT(A1), Press Enter.
Excel will calculate the sq root of the value in cell A1 and display the result in cell B5.
7. ROUND (Rounding): The ROUND function in Excel is used to round a number to a specified
number of decimal places. Let's say we have a value in cell A1 (let's say, A1 contains 6.578),
and we want to round this value to two decimal places.
Select the cell where you want the rounded value to be displayed (let's say it's cell B6).
In cell B6, enter the following formula: =ROUND(A1, 2), Press Enter.
Excel will round the value in cell A1 to two decimal places and display the
rounded result (6.58) in cell B6.
RESULT:
PERFORM DATA IMPORT/EXPORT OPERATIONS FOR DIFFERENT FILE FORMATS
Ex.No:2(ii)
Date:
Aim:
Procedure:
a) Import Data
Excel can import data from external data sources including other files, databases, or web pages.
If you have data to import from Access, the web, or another source, select one of those options in
the Get External Data group instead.
5. Select the file you want to import.
6. Click Import.
If, while importing external data, a security notice appears saying that it is connecting to an
external source that may not be safe, click OK.
When you have data that needs to be transferred to another system, export it from Excel in a
format that can be interpreted by other programs, such as a text or CSV file.
Procedure:
1. Enter the set of data into an EXCEL spreadsheet.
2. Mean: The formula to calculate mean is
=AVERAGE(select cells here)
Type this formula into the appropriate cell and hit Enter.
3. Median: The formula to calculate median is
=MEDIAN(select cells here)
Type this formula into the appropriate cell and hit Enter.
4. Mode: The formula to calculate mode is
=MODE(select cells here)
Type this formula into the appropriate cell and hit Enter.
5. Standard Deviation: The formula to calculate Standard Deviation is
=STDEV(select cells here)
Type this formula into the appropriate cell and hit Enter.
Median:
Mode:
Standard Deviation:
Vaiance:
Skewness:
Kurtosis:
RESULT:
PERFORM Z-TEST, T-TEST & ANOVA
Ex.No:4
Date:
Aim:
Procedure:
a) Z-TEST
1. Create a dataset
Mark 1 Mark 2
85 90
90 92
85 85
80 82
97 97
80 80
85 88
80 83
92 92
90 90
Variance 1 Variance 2
33.15555556 28.32222222
2. To obtain Z-Test,
Choose the Data Tab and the Analysis Group and Click on Data Analysis.
3. In the menu list highlight Z-Test.
4. Selecting OK yields the output. Your popup should look like the image below,
Variable
Variable 1 2
Mean 86.4 87.9
Known Variance 33.1555 28.3222
Observations 10 10
Hypothesized Mean
Difference 0
z -0.60496807
P(Z<=z) one-tail 0.272600106
z Critical one-tail 1.644853627
P(Z<=z) two-tail 0.545200213
z Critical two-tail 1.959963985
We can reject the null hypothesis if z < -z Critical two-tail or z stat > z Critical two-tail.
Here -0.60496807 > -1.9599 and -0.60496807 < 1.9599; hence we can’t reject the null hypothesis.
Thus, the means of both populations don’t differ significantly.
b) T-TEST
1. Create a dataset
2. To obtain T-Test,
Choose the Data Tab and the Analysis Group and Click on Data Analysis.
Variable 1 Variable 2
Mean 86.4 87.9
Variance 33.15555556 28.32222222
Observations 10 10
Pearson Correlation 0.951435832
Hypothesized Mean Difference 0
df 9
t Stat -2.66556995
P(T<=t) one-tail 0.012904567
t Critical one-tail 1.833112933
P(T<=t) two-tail 0.025809134
t Critical two-tail 2.262157163
This will show the mean of each data set, their variance, how many observations are
considered, correlation, and P-value.
We need to see the P-value i.e. 0.025809134, which is way below the expected P-value of 0.05.
1. Create a dataset.
2. To obtain ANOVA,
Choose the Data Tab and the Analysis Group and Click on Data Analysis.
4. Selecting OK yields the output.We will get the below result as shown below.
We can see that the F value is greater than the F crit value so we can reject the null hypothesis,
and wecan also say that at least one of the groups is significantly different.
RESULT:
PERFORM DATA PRE-PROCESSING OPERATIONS - HANDLING MISSING DATA
Ex.No:5(i)
Date:
Aim:
Procedure:
Handling missing data in Excel involves various methods, and the most suitable approach depends
on your dataset and analysis goals. Here are some common techniques:
i. Remove Rows or Columns: You can delete rows or columns containing missing data, but
this canresult in a loss of valuable information. Be cautious not to remove too much data,
making your dataset less informative.
ii. Impute Missing Values: Imputation means filling in missing values with estimated ones.
Commonimputation methods include:
Mean Imputation: Replacing missing values with the mean (average) value of the
column.
Median Imputation: Replacing missing values with the median (middle) value of
thecolumn.
Mode Imputation: Replacing missing values with the most frequent value in the
column.
iii. Flag Missing Data: Create a new column indicating whether a value is missing. This
approach retains the original dataset while allowing you to filter or analyze data based on
missing values.
Excel offers special functions like IFERROR or IFNA that can be used to handle errors or
specific types of missing values. These functions can be handy when working with formulas
and calculations.
v. Statistical Methods: Use advanced statistical techniques designed to handle missing data.
These methods can be more complex but may provide more accurate results.
The choice of the best method depends on your specific data and analysis requirements.
Consider a dataset with sales data, where some values are missing in the "Sales" column.
i) Remove Rows or Columns:
Resulting dataset:
ii) Impute Missing Values
Mean Imputation:
To impute missing sales values with the mean of the "Sales" column:
1. In a cell, enter the formula: =AVERAGE(C2:C7) and press Enter.
2. Copy the result.
Replace missing values with the calculated mean:
1. Select cell C2 (the first missing value).
2. Paste the mean value.
3. Repeat for other missing values.
Resulting dataset:
The procedure for handling missing values using the median and mode imputation methods in Excel is
verysimilar to the procedure for mean imputation.
Resulting dataset:
=ISERROR(VLOOKUP(C2,$C$2:$C$7,1,0))
Here,
C2= the data to find
$C$2:$C7= Lookup array
2. Drag the formula down to fill the entire column.
Note:
The VLOOKUP function uses C2 as the lookup value and searches within the fixed range
$C$2:$C7as the lookup array. It specifies 1 as the column index number and 0 as the range lookup
parameter.
The ISERROR function evaluates whether an error exists and returns TRUE in case of an error
or FALSE when no error is present.
Resulting dataset:
RESULT:
PERFORM DATA PRE-PROCESSING OPERATIONS - NORMALIZATION
Ex.No:5(ii)
Date:
Aim:
Procedure:
1. Prepare dataset in Excel
Procedure:
After generating the covariance matrix, you can use Excel's built-in matrix
functions likeMINVERSE and MMULT to calculate eigenvalues and
eigenvectors.
=MINVERSE(B2:D4-F2:H4)
=MMULT(B2:D4,F2:H4)
Press Ctrl + Shift + Enter to make it an array formula (you should see curly braces {}
appeararound the formula).
=MDETERM(B2:D4-F2:H4)*MINVERSE(B2:D4-F2:H4)
Press Ctrl + Shift + Enter to make it an array formula (you should see curly braces {} appear
around the formula). This formula calculates the determinant of the matrix.
Manually sort the eigenvalues and choose how many principal components
to retain basedon your criteria.
5. Create the Projection Matrix:
Multiply your original data by the projection matrix to obtain the transformed data
withprincipal components.
RESULT:
PERFORM BIVARIATE AND MULTIVARIATE ANALYSIS ON THE DATASET
Ex.No:7
Date:
Aim:
Procedure:
Bivariate analysis:
Bivariate analysis is a statistical method used to study the relationship between two variables. There
arethree common ways to perform bivariate analysis in Excel:
1. Scatterplots: A scatterplot is a graphical representation of the relationship between two variables.
2. Correlation coefficients: A correlation coefficient is a numerical measure of the
strength anddirection of the relationship between two variables.
3. Simple linear regression: Simple linear regression is a statistical technique that can be
used tomodel the relationship between two variables and to make predictions about the
future.
Scatterplots:
Initial Dataset
1. To create a scatterplot in Excel, simply select the two columns of data that you want to plot
andthen click on the "Insert Scatter Chart" button.
Correlation coefficients:
1. To calculate the correlation coefficient in Excel, use the CORREL() function.
=CORREL(A2:A7, B2:B7)
The correlation coefficient turns out to be 1, which indicates a strong positive correlation between
hoursstudied and exam score received.
Simple linear regression:
Steps:
This tells us that each additional hour studied is associated with an average increase of 0.7143 in
exam score.We can also use the regression equation to estimate the score that a student will
receive based ontheir total hours studied.
For example, a student who studies for 3 hours is estimated to receive a score of 81.6147:
Multivariate analysis:
Step 1: Organize Your Data
Before you can conduct multivariate regression analysis, you need to organize your data. Your data
should include a dependent variable and one or more independent variables. Each row of your data
should represent a unique observation or case.
Step 2: Open the Data Analysis Tool
Excel has a built-in data analysis tool that you can use to conduct multivariate regression analysis. To
accessthis tool, you first need to enable it. To do this, go to the “File” menu, select “Options,” and
then select “Add-ins.” From there, select “Excel Add-ins” and click “Go.” Check the “Analysis
ToolPak” box and click“OK.”
Step 3: Select the “Regression” Option
With the data analysis tool enabled, select the “Data” tab, and then select “Data Analysis” from the
“Analysis” group. Select “Regression” from the list of analysis tools and click “OK.”
Step 4: Choose Your Input Range
In the Regression dialog box, select the dependent variable and the independent variables. Make sure
to choose your input range carefully, as this will determine the variables that are used in the analysis.
Step 5: Choose Your Output Range
Next, choose where you want the output of the analysis to be displayed. This can be in a new
worksheet orin a specific range on an existing worksheet.
Step 6: Check the “Residuals” Box
In the Regression dialog box, make sure to check the “Residuals” box. This will allow you to see how
wellthe model fits the data.
Step 7: Interpret the Results
Once you have conducted the analysis, you can interpret the results. The output will include a
summary ofthe model, including the coefficient values for each independent variable, the standard
error, t-value, and p-value. You can use these values to determine the significance of each
independent variable.
RESULT:
APPLY AND EXPLORE VARIOUS PLOTTING FUNCTIONS ON THE DATA SET
Ex.No:8
Date:
Aim:
Procedure:
Bar charts are one of the easiest charts to interpret, enabling the person viewing the chart an easy
way to compare categorical data quickly. On a bar chart, the categorical data is on the y-axis, and
the values areon the x-axis.
A column chart, also known as a vertical bar chart, helps visualize data where categories are placed on
thex-axis and the values on the y-axis. Similar to bar charts, they help visualize data across categories.
You can now see a column chart that displays the number of units sold for each product category by
the month.
A line chart is the most useful way to capture how a numerical variable changes over time. This is
helpfulto identify trends in numeric values.
A pie chart is most commonly used to show the proportions of a whole. It’s like visualizing fractions
whenyou were in high school. With this pie chart, we want to compare the total sales between the three
categories.
A waterfall chart is a special chart that helps illustrate how positive and negative values can contribute
to atotal. They can be great for visualizing changes over time. In our example, we’ll compare the total
sales for each month regardless of the categories.
To create a waterfall chart in Excel:
We’ll only see positive values in our example because the Total column only contains positive
values, butthis chart can be great for comparing financial data and changes over time.
Aside from creating charts in Excel, there are many options to customize and format a chart. From
here, we’ll discuss some common formatting options that’ll help you enhance your visuals.
First, create a column chart based on our previous guidance that displays the number of units sold for
eachproduct category by the month.
Chart elements
Chart elements include titles, legends, data labels, gridlines, and axes. You can add, remove, or
modify these elements as needed.
To add a chart title, click "Add Chart Element" > "Chart Title" and choose one of the available
options (e.g., "Above Chart" or "Centered Overlay"). You can then click the title placeholder
and type your desired title
To remove a chart title, right-click on the title and select “Delete”rename the chart “Electronic
Store Sales 2022”
Legend
To modify the chart legend, click "Add Chart Element" > "Legend" and choose a position for
the legend (e.g., "Right" or "Top")
To remove the legend, click "Add Chart Element" > "Legend" > "None" move the legend to
the top of the chart.
Data labels
To add data labels, click "Add Chart Element" > "Data Labels" and choose one of the
availableoptions (e.g., "Center" or "Above"). This will display the data values directly on the
chart
To remove data labels, click “Add Chart Element” > “Data Labels” > “None”
To remove data labels from individual categories, right-click on the data label of the
chosencategory; this will highlight all relevant data labels. You can then click “Delete” add
data labels for all categories. Your chart should now look like this:
Chart Styles and colors
On top of customizing individual chart elements, you can change your chart's overall look and feel by
applying different styles and color schemes.
To customize a chart style, select the visual you want to update. Then take the following steps:
By implementing small visual changes, we can see a big difference in the aesthetic of the whole chart.
Here’s an example of how our chart has changed by choosing Style 6 and Monochromatic Palette 8.
Formatting Excel chart axes
To improve the readability of our chart, we can format our axis in several ways, including axis titles,
scale, or even visibility.
Axis titles
To add an axis title, click "Add Chart Element" > "Axis Titles" and choose "Primary
Horizontal" or"Primary Vertical." You can then click the title placeholder and type your desired
title
To remove an axis title, right-click on the axis title and click “Delete”add an x-axis title
“Month” and a y-axis title “Products Sold.”
Axis scale and number format
Right-click the axis you want to modify and choose "Format Axis"
In the "Format Axis" pane, you can change the minimum and maximum values, major and minor units, or
number format.
We intend to keep these options the same but feel free to play around and test what each does.
Axis visibility
Finally, if you would like to remove the axis labels from showing, you can take the following steps:
There are other formatting options available, including modifying data series colors, adjusting
chart andplot area backgrounds, and customizing gridlines.
To access these options:
Right-click the chart element you want to modify (e.g., data series, plot area, or gridlines)
Select "Format [element]"
Procedure:
Power BI Desktop is a powerful business intelligence tool developed by Microsoft that allows users
to create interactive reports and dashboards from various data sources. It's widely used in the industry
for dataanalysis and visualization.
1. Data Import: Power BI Desktop enables users to connect to a wide range of data sources
such as databases (SQL Server, MySQL), Excel files, CSV, web services, and more.
Eg: A dataset imported from an Excel spreadsheet, showing columns with data
like sales figures, dates, and product names.
3. Data Modeling: It allows to create relationships between tables, define hierarchies, and add
calculated columns using DAX (Data Analysis Expressions) formulas.
Eg: A data model diagram displaying tables and their relationships, such as a Sales table linked to a
Products table through a ProductID.
Eg: A bar chart displaying sales by product category, with categories on the x-axis and sales amounts on
the y-axis.
Eg: A report with customized visuals, such as changing the color palette, fonts, and addinga custom
company logo.
6. Interactive Reports: Power BI reports are interactive by nature. It is important to learn how to
createdrill-through actions, cross-filtering, and slicers to enable users to explore data
dynamically.
A report with drill-through actions allowing users to click on a data point to see detailed information
relatedto that point.
7. DAX Calculations: DAX, which stands for Data Analysis Expressions, is a powerful formula
and expression language used in Power BI and other Microsoft products like Excel and SQL
Server Analysis Services (SSAS). DAX is primarily used for creating custom calculations,
aggregations, and business logic in data models. In Power BI, DAX calculations are used to
perform calculations on data within your data model.
8. Publishing: Power BI Desktop allows users to publish reports to the Power BI service. They
should learn how to publish and share their reports with others, granting appropriate access and
permissions.
1. A published report hosted on the Power BI service, accessible via a web link, with
sharing options and access permissions.
9. Data Refresh: Power BI supports scheduled data refresh, ensuring that reports always display
up-to-dateinformation.
A report that automatically updates with new data at scheduled intervals, ensuring it always reflects
the latest information.
10. Power Query: Power BI Desktop incorporates Power Query, a powerful ETL (Extract,
Transform, Load)tool. Students can use Power Query to transform and clean data before it's
loaded into the model.
1. Transformed data with Power Query steps, showing the sequence of operations
performed onthe original dataset.
11. Mobile Optimization: Mobile optimization, in the context of Power BI, refers to the process
of designing and configuring Power BI reports and dashboards to ensure they are user-friendly
and visually appealing when accessed on mobile devices such as smartphones and tablets.
Mobile optimization is essential because it allows users to access and interact with Power BI
content on smaller screens, ensuring a seamless and responsive experience.Encourage
students to practice by creating their own Power BI reports and dashboards using sample
datasets or data relevant to their field of study. Hands-on experience is crucial for mastering
Power BI Desktop's features.
1. A mobile-friendly report layout with responsive visuals and touch-friendly navigation for
smartphones and tablets.
RESULT:
PREPARE AND LOAD DATA
Ex.No:10
Date:
Aim:
Procedure:
1. Power Query Editor, a part of Power BI Desktop, will open, allowing you to transform and clean
your data before loading it into your report.
Here's some of the data transformation technique you can do in Power Query:
a. Select the checkbox beside the table name (if it's not already selected). This will select the
entiretable.
b. In the "Transform" tab, find the "Use First Row as Headers" option. Click on it. This action
willreplace the current column names with the values from the first row, making them the column
headers.
Change type (Reg No)
b. In the "Transform" tab, find the "Data Type" option. Click on it.
c. A dropdown menu will appear, displaying various data types that you can choose for the
column. Select the appropriate data type for your data. The available data types include Whole
Number, DecimalNumber, Text, Date, and more.
d. After selecting the new data type, the values in the column will be cast to that data type,
ensuring proper formatting and calculations.
Removing Columns:
a. Open Power Query Editor by clicking "Edit Queries" on the Home tab.
b. Right-click on the column header you want to remove and select "Remove."
c. Click the "Close & Apply" button to apply the changes.
Removing Rows:
a. In the Home tab, you can use the "Remove Rows" option to filter or remove rows based on
specificconditions. You can choose to remove rows based on filter criteria or keep rows that
meet specificcriteria.
b. Click "Close & Apply" to save your changes.
Removing Duplicate Rows:
a. Use the "Remove Duplicates" feature located in the Home tab. This allows you to specify the
columns that should be considered when identifying duplicates. Rows with the same values in
thesespecified columns will be considered duplicates.
b. Click "OK" to remove duplicate rows.
c. Finally, click "Close & Apply" to apply the changes.
a. Use the "Remove Rows" option in the Home tab. Choose "Remove Bottom Rows" and
specify the number of rows you want to remove from the bottom of your dataset.
b. Click "OK."
c. Click "Close & Apply" to save the changes.
Replace values (AB into ABSENT)
a. In the Power Query Editor, select the column where you want to replace values.
b. Once the column is selected, go to the Transform tab.
c. Find and click on the "Replace Values" option. This will open a dialog box where you can
specifythe values you want to replace and the replacement values.
d. In the "Replace Values" dialog, you'll see two text boxes. In the left box, type the value
you want to replace, and in the right box, type the replacement value.
e. You can add multiple pairs of values to replace by clicking the "Add" button.
f. Click "OK" to apply the replacements.
Custom column-formula
Procedure:
1. Open the Power BI Desktop application and click on the “File” ribbon.
2. Click on the “Open” option and import the data for making Power BI Data Model.
6. Try to create a “Zone-wise” sales summary table by choosing “Region Names” from
“CityTable”and “Sales” Value from the “Sales” table.
7. Our summary table shows the same value for all the regions, as shown below, because we
have usedtwo different columns from two different tables.
8. Similarly, create one more table visualization to create a “City-wise” summary table. This
time use “City Names” from the “Manager Table” and “Sales” value from the “Sales” table.
Even though the “City Names” are in both tables, we still get the same numbers because Power BI cannot
recognize the relationship between these three tables.
9. Now, come back to the “Relationship” tab. As you can see, we have three tables with their
respectivenames.
First, look at the “Sales_Table” and “CityTable” in these two tables’ common column is “City Name.” So,
use this column to create a relationship between these two tables using Power BI data modeling.
10. Now, from “CityTable,” drag and drop the “City Names” column to the “City” column
placethe cursor on the connector line between these two lines. It will highlight two common
columns inrespective tables.
Here you go, it says the relationship between two columns in these two tables is “City.”
12. Now, we have one more table, i.e., “Manager_Table.” This table includes the city-wise
manager list.The city is there in the other two tables as well, but this time, we will create a
relationship between“CityTable” and “Manager_Table,” not with “Sales_Table.”
13. Now, return to the “Report” tab and see the result in the two initially created tables. This time
it recognizes the relationship between tables. Accordingly, it has displayed results against
each city and region.
14. You may click “Manage Relationship” under the “Home” tab to see and manage the
As you can see above, the first relationship list says, “CityTable (City Names)” is connected to
“Manager_Table (City Names),” and the second list says “Sales_Table (City)” is connected with
“City_Table (City Names).”
RESULT:
PERFORM DAX CALCULATIONS
Ex.No:12
Date:
Aim:
Procedure:
DAX functions are used to create custom calculations, aggregations, and expressions in Power BI
to workwith data in Power BI Desktop or Power BI Service.
1. Load Your Data into Power BI.
Procedure:
Step 1: Launch Power BI desktop
(Ex; here, we’ll be using the “Retail Analysis Sample” prepared by Microsoft and
Obvious)
Step 3: go to the “Datasets” section in your workspace and click the ‘Create report’ icon.
Step 4: Select the fields first then visualizations after [Select “Sales” and then “Total
Step 6: Modifying;
Enlarge your visual to the size you want (Simply click on the visual and drag its
corners)
The next part is enlarging the texts — legend, detail labels, and title — of your report.
Step 7: On the visualizations pane, go to the ‘Format’ section.
Step 9: Save your report by clicking ‘File’ from the tab list and select ‘Save’.
OUTPUT:
RESULT:
CREATE A DASHBOARD AND PERFORM DATA ANALYSIS
Ex.No:14
Date:
Aim:
Procedure:
1. Import Data.
5. Create relationships between tables if you have multiple data tables (in this case, you might
not need to create relationships).
7. In the "Visualizations" pane on the right, select the type of visualization you want (e.g., bar chart,
table, card).
8. Drag and drop fields from your dataset into the "Values," "Axis," or other relevant sections
of the visualization.
9. Customize your visualizations by adjusting colors, fonts, labels, and other properties. Add
titles and legends to make them more informative.
11. You can publish it to the Power BI service for sharing with others. (Optional).
For the sample sales dataset, you can start with the following key visualizations to create an
effective Power BIreport:
Create a bar chart with "Product" on the X-axis and "Revenue" on the Y-axis to visualize total
revenue for eachproduct.
Pie Chart for Revenue by Category:
Use a pie chart to represent the distribution of revenue by category. Each slice of the pie represents a
category's share of the total revenue.
Show how revenue changes over time by creating a line chart with "Date" on the X-axis and
"Revenue" on the Y- axis.
Include a card visualization that displays the total revenue for the entire dataset as a single, prominent
number.
Create a table visualization to display detailed data, including columns for Date, Product, Category,
and Revenue.
These visualizations provide a good balance of insights and allow you to convey the key information
from the dataset. Depending on the specific objectives of your report and the audience's needs, you
can also consider adding more visualizations, such as a treemap, donut chart, or scatter plot, to
provide additional perspectives on the data. Remember to keep the report clean, well-organized,
and easy to understand.
RESULT:
PRESENTATION OF A CASE STUDY
Ex.No:15
Date:
Aim:
Procedure:
1. Import Data
5. Create relationships between tables if you have multiple data tables (in this case, you might
not needto create relationships).
7. In the "Visualizations" pane on the right, select the type of visualization you want (e.g.,
bar chart,table, card).