0% found this document useful (0 votes)
16 views58 pages

BA Record

The document is a laboratory record notebook for the Business Analytics Laboratory (CCW331) at the University College of Engineering, Tindivanam, for the academic year 2024-2025. It includes various experiments related to MS Excel and Power BI, covering topics such as numerical operations, statistical operations, data import/export, and data pre-processing techniques. The document serves as a certification of the work done by students in the Business Analytics course.

Uploaded by

HEMACHANDAR . S
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
16 views58 pages

BA Record

The document is a laboratory record notebook for the Business Analytics Laboratory (CCW331) at the University College of Engineering, Tindivanam, for the academic year 2024-2025. It includes various experiments related to MS Excel and Power BI, covering topics such as numerical operations, statistical operations, data import/export, and data pre-processing techniques. The document serves as a certification of the work done by students in the Business Analytics course.

Uploaded by

HEMACHANDAR . S
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 58

UNIVERSITY COLLEGE OF ENGINEERING

TINDIVANAM
(A Constituent College of Anna University, Chennai)

B.TECH SIXTH SEMESTER


RECORD FOR

BUSINESS ANALYTICS LABORATORY (CCW331)

DEPARTMENT OF INFORMATION TECHNOLOGY

LABORATORY RECORD NOTE BOOK


2024-2025

Page | 1
University College of Engineering, Tindivanam
UNIVERSITY COLLEGE OF ENGINEERING
TINDIVANAM
(A Constituent College of Anna University, Chennai)

DEPARTMENT OF INFORMATION TECHNOLOGY


LABORATORY RECORD NOTE BOOK
2024-2025

This is to certify that is a bonafide record of the work done by


Mr./Ms. Register Number
Of 3rd Year B.Tech, Department of Information Technology in the Business
Analytics Laboratory (CCW331) in the VI Semester.

University Examination held on ………………………………………….

Staff In-Charge Head of the Department

Internal Examiner External Examiner

Page | 3
University College of Engineering, Tindivanam
INDEX
S.No. Date Experiment Page No Signature

1 FEATURES OF MS-EXCEL

Get the input from user and perform


2(i) numerical operations (MAX, MIN,
AVG, SUM, SQRT, ROUND)

2(ii) Perform Data Import/ Export operations for


different File Formats.

Perform Statistical Operations - Mean,


3
Median, Mode, Standard Deviation,
Variance, Skewness, Kurtosis

4 Perform z – Test , T- Test & Anova

Perform Data Pre-Processing


5(i) (i)Handling Missing Data

(ii)Normalization
5(ii)

Perform dimensionality reduction operation


6
using PCA, KPCA & SVD

7 Bivariate and Multivariate

Apply and Explore various plotting


8
functions on Dataset

9 Explore the feature of Power BI Desktop

Prepare and Load Data Using Power BI


10

11 Develop the Data Model

12 Perform DAX Calculation using Power BI

13 Design a Report

14

15
EX.NO: 1

DATE: FEATURES OF MS-EXCEL

Aim:

To explore the features of Ms-Excel and understand its capabilities for data analysis and
visualization.

Features of MS Excel:

MS Excel is used for processing the data that is in tabular form and then performing
mathematical functions on it to analyze it.

Excel is a tool for coordinating and performing calculations on data. It can examine data,
compute statistics, create pivot tables, and express data as a chart or graph.
MS Excel performs the following basic functions:

In MS Excel, there are rows and columns. The intersection of rows and columns makes a cell.
So each of the cells is an individual unit of data. Each cell has a cell address which is the
number of rows and alphabet of the column it appears in. No two cells have the same address
ever.

Home and Insert:

The Home & Insert menu of MS Excel is similar to MS Word. Users can change the
formatting of the content from home & include pie charts, tables, and other files related to
data from the insert menu.

Fontsize, font color, font styles, alignment, background color, formatting options and styles,
insertion, deletion, and editing in the cells options are also available.

One can insert images and figures, header, and footer, charts, and sparklines and even attach
graphs, equations, and symbols.

Formulas:

The unique functions that MS Excel has are Formulas & Data. Users can perform the
formula on data to analyze it quickly. Users have to select the cells for that and one cell
becomes one unit of data.

So, if the user selects 10 cells and applies an average formula to them, the user will get an
average of the data output of those 10 cells.

To apply a formula to any data, the user needs to select it without any space. Then in the
function bar, the user needs to type ‘=’ and the abbreviation of the formula the user wishes to
apply.

Data :

From the Data menu, the user can perform functions without changing the original data.
Users can filter, add external data from the web & sort data without changing it. For example,
the user can sort the data in alphabetical order.
Page Layout :

Users can apply themes, orientation, and check the page setup through the page layout option.

Review:

Proofreading like spell check can be performed for an excel sheet in the review section and a
user can even add comments or remarks in this part.

View:

Different views and layouts in which the user wants the spreadsheet to be displayed can be
selected here. Options to zoom in and out, full screen, and pane arrangement are available
under this section.

There are several features that are available in Excel to make our task more manageable.
Some of the main features are:

1. AutoFormat: It allows the excel users to use predefined table formatting option.

2. AutoSum: AutoSum feature helps us to calculate the sum of a row or column


automatically by inserting an addition formula for a range of cells.
3. List AutoFill: It automatically develops cell formatting when a new component is added
to the end of a list.

4. AutoFill: This feature allows us to quickly fill cells with a repetitive or sequential record
such as chronological dates or numbers and repeated documents. AutoFill can also be used to
copy functions. We can also alter text and numbers with this feature.

5. AutoShapes: AutoShapes toolbar will allow us to draw some geometrical shapes, arrows,
flowchart items, stars, and more. With these shapes, we can draw our graphs.

6. Wizard: It guides us to work effectively while we work by displaying several helpful tips
and techniques based on what we are doing. Drag and Drop feature will help us to reposition
the record and text by simply dragging the data with the help of the mouse.

7. Charts: This feature will help you to present the data in graphical form by using Pie, Bar,
Line charts, and more.

8. PivotTable: It flips and sums data in seconds and allows us to execute data analysis and
generating documents like periodic financial statements, statistical documents, etc. We can
also analyze complex data relationships graphically.

9. Shortcut Menus: The shortcut menu helps users to make the work done through shortcut
commands that need a lengthy process.

Result:

Thus the features of Ms-Excel is studied successfully.


EX.NO :2(1)Get the input from user and perform numerical operations
(MAX, MIN, AVG, SUM, SQRT, ROUND)

AIM:
To perform numerical operations (MAX, MIN, AVG, SUM, SQRT, ROUND).

PROCEDURE:
1.Get the input values from the user or dataset.

2.MAXIMUM:

The MAX function returns the highest value of income in a set of data.
Firstly, in a cell D1 type the formula given below-
=MAX (D2:D16)
3.MINIMUM:

The MIN function returns the lowest value of income in a set of data.
Firstly, in a cell D1 type the formula given below-
=MIN (D2:D16)
4.AVERAGE:
The AVG function returns the average value of income in a set of data.
Firstly, in a cell D1 type the formula given below-
=AVG (D2:D16)

5.SUM:
The SUM function returns the total value of income in a set of data.
Firstly, in a cell D1 type the formula given below-
=SUM (D2:D16)
6.SQRT:
The SQRT function returns the square root value of income in a set of data.
Firstly, in a cell D1 type the formula given below-
=SQRT (D3:D4).

7.ROUND:
The ROUND function rounds a value to specified number in income.
Firstly, in a cell D1 type the formula given below-
=ROUND (D2:D16).
RESULT:
Thus ,the numerical operations in excel can be demonstrated and executed
successfully.
Ex.no : 2(ii)
Date :
Perform Data Import/ Export operations for different File Formats.

Aim :
To Perform the data import/export operations for different file formats.
Algorithm:
Export Data:
Step 1: First, Click the tab.
Step 2: Next at the left, Click Export.
Step 3: Click the change file type.
Step 4: Select a file type in TEXT, CSV, FORMATTED TYPE, SAVE AS
ANOTHER FILE TYPE.
Step 5: Click Save as.
Step 6: Specify where you want to save the file.
Step 7: Next, Click the Save button, And click Yes.
Import Data:
Step 1: First, Click the Data tab on the Ribbon.
Step 2: Next, Click the Get Data button.
Step 3: Select the From File.
Step 4: Select from the Text/CSV.
Step 5: Select the file you want to import. Click the Import button.
Step 6: Verify the Preview looks correct.
Step 7: Click load. Display the Dataset.

EXPORT DATA:
1.Click the file tab.
2. Click Save as.

3. Click SAVE.
2. IMPORT DATA:
i) Click data tab. next go to GET DATA button. Select FROM FILE . then -> Click
FROM TEXT/CSV.

ii). Click IMPORT.


iii) Click LOAD.

iv) Get the DATA SET:

RESULT :
Thus the analysis for import/ export operation in different file format was
executed and demonstrated successfully.
Ex No:3 PERFORM STATISTICAL OPERATIONS - MEAN,MEDIAN,
Date: MODE ,STANDARD DEVIATION,VARIANCE,
SKEWNESS,KURTOSIS.

Aim:
To perform sratistical operations for mean,median,mode standard deviation , variance
Skewness,kurtosis.
Algorithm:

Mean:

Step 1: Enter your data. Data Entered.

Step 2: Select a Cell.


Step 3: Use the AVERAGE function.
Step 4: Select Your Data Range.
Step 5: Close the Parenthesis and Press Enter.
Step 6: Preview the Result.
Median:
Step 1: Enter your data. Data Entered.
Step 2: Select a Cell.
Step 3: Use the MEDIAN Function.
Step 4: Select Your Data Range.
Step 5: Close the Parenthesis and Press Enter.
Step 6: Preview the Result.
Mode:

Step 1: Enter your data. Data Entered.


Step 2: Select a Cell. ...
Step 3: Use the MODE Function. ...
Step 4: Select Your Data Range. ...
Step 5: Close the Parenthesis and Press Enter. ...
Step 6: Preview the Result.
Standard deviation:

Step 1: Enter your data. Data Entered.


Step 2: Select a Cell.
Step 3: Use the Stdev Function.
Step 4: Select Your Data Range.
Step 5: Close the Parenthesis and Press Enter.
Step 6: Preview the Result.

Variance:

Step 1: Enter your data. Data Entered.


Step 2: Select a Cell.
Step 3: Use the variance Function.
Step 4: Select Your Data Range.
Step 5: Close the Parenthesis and Press Enter.
Step 6: Preview the Result.

Kurtosis:
Step 1: Enter the data values into cells.
Step 2: In a new cell type =KURT(
Step 3: Highlight the cells where the data are at. Or type the range of cells containing the
data.
Step 4: Make sure to close the parentheses by typing )
Step 5: Then press the enter key.

Skewness:
Step 1: Open the Excel file containing the data set that you want to skew.
Step 2: Select the data set and click on the Insert tab.
Step 3: In the Charts group, click on the Line chart icon.
Step 4:Choose the “skew” option from the drop-down menu.
Step 5: Then press the enter key.

Output:

Result:
Thus median,,mode ,mean, standard deviation ,variance ,skewness and kurtosis has
been executed successfully.
Ex No:4 PERFORM Z-TEST,T-TEST& ANOVA
Date:

Aim:
To perform operations on z-test,t-test & anova
Algorithm:

Z test:

Step 1: First, we need to calculate the variables for these two values using the VAR.P
function.
Step 2: Go to the “Data” tab and click “Data Analysis.”
Step 3: For the “Variable 1 Range,” select Student tamil scores. For the “Variable 2
Range,” select Students english scores.
Step 4: For “Variable 1 Variance(known),” select Student variance tamil score, and for
“Variable 2 Variance(known),” select Student variance english score.
Step 5: Select the “Output Range” as a cell and press “OK.”

T test:

Step 1: Open the data tab.


Step 2:Define your t-test variances.
Step 3:Set your first variable range.
Step 4: Set your second variable range.
Step 5: Finalize your test.
Anova:

Step 1: Input your data into columns or rows in Excel. For example, if three groups
of students for music treatment are being tested, spread the data into three columns.
Step 2: Click the “Data” tab and then click “Data Analysis.” If you don’t see Data
Analysis, load the ‘Data Analysis Toolpak’ add-in.
Step 3: Click “ANOVA Single Factor” and then click “OK.”
Step 4: Type an input range into the Input Range box. For example, if the data is in
cells A1 to C10, type “A1:C10” into the box. Check the “Labels in the first row” if we
have column headers, and select the Rows radio button if the data is in rows.
Step 5: Select an output range. For example, click the “New Worksheet” radio button.
Step 6: Choose an alpha level. For most hypothesis tests, 0.05 is standard.
Step 7: Click “OK.” The results from ANOVA will appear in the worksheet.

Z test:
T test:
Anova:

Result:
Thus the implementation of z-test,t-test and anova was executed successfully.
EX.NO:5(i) PERFORM DATA PRE-PROCESSING OPERATIONS

DATE: (i)HANDLING MISSING DATA

AIM:

To perform the data pre-processing operations by Handling the missing data.

PROCEDURE:

Handling missing data

 Get the Input Dataset.


 Identify the columns with missing data.
 Decide how to handle the missing data.
 Techniques for handling missing data.
 Imputation: Replace missing values with estimated values (mean, median,
mode, etc.).
 Deletion: Remove rows or columns with missing data.
 Model-Based Methods: Use statistical models to predict missing values.
 Multiple Imputation: Generate multiple imputed datasets and combine results.
 Some options include :
o Ignore the missing Data.
o Treat missing values as Zero.
INPUT TABLE:

 In the above table ,it shows the missing values.


 Ignoring the missing values.

 To treat the missing value as Zero.


 Interpolate the missing data.

RESULT:

Thus the operations for handling the missing data using data pre-processing
operations was performed successfully.
EX .No: 5(ii) PERFORM DATA PRE-PROCESSING OPERATIONS
DATE: (ii)NORMALIZATION

AIM:
To perform the data pre-processing operations by normalize the given data.
PROCEDURE:
Step 1: Identify the columns that need to be normalized. Normalization is typically used fornumerical
columns, such as height, weight, and age.
Step 2: Choose a normalization method. Some common methods include:
Min-max normalization: This method scales the data so that the smallest value is 0 and the largest value is 1.

Z-score normalization: This method scales the data so that the mean is 0 and the standard deviation is 1.

Decimal scaling: This method scales the data so that the largest decimal place is 2
Step 3:Implement the chosen method. For example, to perform min-max normalization, you can use the
following formula
=(A2-MIN(A:A))/(MAX(A:A)-MIN(A:A))

Height (in cm)

152

155

168

175

153

162

173

166

158

156
RESULT:
Thus the implementation of normalization in Exel was executed succcesfully.
Exp No: 6
Date:

Perform dimensionality reduction operation using PCA, KPCA & SVD

Aim:

To compare the effectiveness of Principal Component Analysis (PCA), Kernel PCA (KPCA),
and Singular Value Decomposition (SVD) in reducing the dimensionality of a dataset while
preserving essential information.

Apparatus Required:

1. Python or R environment with required libraries (NumPy, SciPy, Scikit-learn).


2. Sample dataset (e.g., Iris dataset) for experimentation.

Procedure:

Step 1:
Data Preparation:
 Load the sample dataset (e.g., Iris dataset) into your Python environment.
 Preprocess the data by scaling it if necessary.

Step 2:
PCA (Principal Component Analysis):
 Apply PCA to reduce the dimensionality of the dataset.
 Choose the number of principal components based on the explained variance ratio (e.g.,
95% variance explained).
 Fit the PCA model to the data and transform the dataset to the reduced dimensional space.
 Visualize the data in the reduced dimensional space using scatter plots or other suitable
visualization techniques.

Step 3:
KPCA (Kernel PCA):
 Apply KPCA using different kernel functions (e.g., linear, polynomial, Gaussian) to explore
the effect of kernels on dimensionality reduction.
 Choose appropriate hyperparameters for each kernel (e.g., gamma for Gaussian kernel,
degree for polynomial kernel).
 Fit the KPCA model to the data and transform the dataset to the reduced dimensional space.
 Visualize the data in the reduced dimensional space for each kernel function.

Step 4: SVD (Singular Value Decomposition):


 Apply SVD to decompose the dataset into its singular values and vectors.
 Choose the number of singular values to retain based on the desired level of dimensionality
reduction.
 Reconstruct the dataset using the retained singular values and vectors.
 Visualize the reconstructed data to assess the effectiveness of dimensionality reduction
using SVD.

Step 5:
Performance Evaluation:
 Evaluate the performance of each dimensionality reduction technique based on metrics such
as explained variance, reconstruction error, and computational efficiency.
 Compare the visualizations of the reduced dimensional spaces from PCA, KPCA, and SVD.
 Discuss each technique's strengths and limitations in terms of preserving important features
and reducing dimensionality.
Results:
Thus the dimensionally reduction operation was performed successfully.
EX.NO:07 BIVARIATE AND MULTIVARIATE
DATE :

AIM:
To perform bivariate and multivariate analysis on dataset.

ALGORITHM:
BIVARIATE
STEP 1: Load the dataset into the excel.
STEP 2:Access the insert tab and select the scatter chart.
STEP3: Add the custom title by double clicking the Title Box , i.e study hours vs score.
STEP4: Add titles to x-asis and y-axis by selecting the scatter chart. Click on the PLUS(+)
button on right corner, then click axis title and rename it both x-axis and y axis as “study
hours and score”.
STEP 5: Select x-axis chart area and right click on it . A drop down appears , select the
“Format axis” format axis dialogue box appears . under axis option change the default value
as MIN value in dataset .
STEP 6: Follow step5 for y-axis and change the MAX value of y-asix in dataset .This
makes visualize you clearly.
STEP 7: Select the trend line in chart elements by clicking plus (+) button on right corner of
scatter chart .
STEP8:click the trend line previewed in chart and go to Format Trendline , select the option
“display equation on chart”.
STEP 9: Use CORREL Function to show correlation value between two variables.
FORMULA: =CORREL (A2:A21,B2:B21).
STEP10: If value is correlation value is positive it signifies study of hours increase and
score of student also increase.
MULTIVARIATE
STEP1: Load the dataset ,then Go to data tab .
STEP2: Select the data analysis dialogue box . choose the FILE option and click on button
labeled OPTIONS .
STEP3: Click on options labeled “Add-ins”. You should see “Analysis ToolPack click on
it.and then click the “GO” button at bottom and another dialogue box will appear as Data
analysis.
STEP4: Now click the data analysis tab , select the regression then enter input X-Range
,inputY-Range .
STEP5: In the section Radio button labeled “OUTPUT RANGE” , Click on it and enter
range for your data .
STEP6: Display the result .

OUTPUT:

RESULT :
Thus the analysis for performing bivariate and multivariate was executed and
demonstrated successfully.
Ex No:8 APPLY AND EXPLORE VARIOUS PLOTTING
Date: FUNCTIONS ON DATASET

Aim:
To apply and explore various plotting functions on the dataset.

Algorithm:

Step 1: Select data for the chart.

Step 2: Select Insert > Recommended Charts.

Step 3: Select a chart on the Recommended Charts tab, to preview the chart. ...

Step 4: Select a chart.

Step 5: Select OK.


Output:
Result:
Thus various plotting functions on the dataset is applied and explored successfully.
EX NO: 9

DATE: EXPLORE THE FEATURE OF POWER BI DESKTOP

AIM:

To explore the features of Power BI Deskstop and understanding its capabilities for data
visualization,analysis and reporting.

PROCEDURE:
Step1:Install POWER BI desktop.

Step 2:Select download from Microsoft Power BI Desktop


Step 3: Explore the Features.
 Get data (Data Source)
In the Home tab, from the Get data option, we can get data from various data sources
such as Excel workbook, Text/CSV, SQL Server, Power BI datasets, Power BI dataflows,
Web, OData feed, Analysis Services, Parquet, SharePoint folder,Azure Blod Storage,Azure
Databricks,MariaDB,etc.

 Range of Attractive Visualizations

1. We can add visuals to the report page by clicking the New visual option.
2. you can create reports and dashboards using as simple or as complex
visualizations as you want to represent your data set with.
3. There is also a library available for custom visualizations.
4. In Power BI we can create visuals such as; Ribbon chart,Waterfall chart,Scatter
chart,pie chart,map,filled map,clustered bar chart,etc.,
 Text Box

1. With the Text box option, we can insert a text box in the report page and set the
font type and font size for the text visual.
2. In the Insert tab, from the Text box option, we can insert a text box in the report
page and set the font type and font size for the text visual.
3. Using the Shapes option, we can insert shapes like block arrows pointing to
different directions and basic shapes such as circles, ovals, lines, pentagons,
hexagons, etcetera.,
 Filters and bookmarks options

Using Filters and bookmarks options, the user can apply filters and bookmarks to the
Power BI Report.
RESULT:
Thus the features of POWER BI desktop is executed successfully.
EXNO:10

DATE:

PREPARE & LOAD DATA USING POWER BI


AIM:
To prepare & load data set using Power Bi.

PROCEDURE:
1. Prepare the dataset.
2. Launch Power Bi Desktop on your computer.
3. Click on the “Get Dat” button located on the Home tab in the Power Bi desktop
ribbon.

4). Choose the type of data source you want to connect to, power BI supports a wide range of
data sources including excel. Click on the data source type and then select the specific data
fill or database.
5). To select the files in salas_pipeline.csv.
6). After select the data set and to load the dataset.

7). After select the data source you may need to provide credentials or other authentication
details to connect to the data source
8). To click the Build visual to the power bi tools.

9). To connect the two dataset in power bi.

Result:
Thus prepare & load data set using Power Bi has been demonstrated and executed
successfully.
EX.NO: 11

DATE:

DEVELOP THE DATA MODEL

AIM

Develop the data model using power BI.

PROCEDURE:

1. Launch Power BI Desktop on your computer.

2. Click on "Get Data" to import your data into Power BI. You can connect to various
data sources like Excel.
3. Load the dataset.

4. Select the data,


5. To click the built visual.

6. Select the data in sales_data_sample.


7. Develop the pie chart.

8. Develop the line chart.


9.Develop the 100% stacked bar chart.

10. Develop the scatter plot.


RESULT:

Thus , the data model has bee developed and demonstrated successfully.
Exp no:12

Date : PERFORM DAX CALCULATIONS USING BI TOOLS

Aim:
To perform DAX calculations using power BI tool.

Algorithm:
Step 1: Prepare the dataset
Step 2: Open power BI desktop
Step 3: Select the dataset and load the dataset
Step 4: Choose the new measure option
Step 5: Give name to this measure and write the query for count
Step 6: Now the measure gives the total marks
Step 7: Cross check the total marks in excel as well

Load the dataset directly to the power BI file:


Right click on the table and choose the “New measure” option:
Now this measure gives the total marks:

This is the total marks for the subject :


We can check the total marks in excel as well:

Result:
Thus the DAX calculations was executed successfully using power BI tools.
EX:NO:13
DATE: DESIGN REPORT

AIM:
To design report using power bi tool.
ALGORITHM:
Step1:Download the health care dataset contains two input as a inpatient and outpatient data.
Step2:Both dataset contain same number of column and column header.
Step3:Click get data, select the folder combine dataset and load it.
Step4:Data transformation and modeling.
Step5:Identify key entities and make relationship in your data “patient as multiple
appointment ”,”each appointment belongs to department”.
Step6:Use power query to merge and join tables,handling machine data ,create calculated
columns for relevant metrices i.e.wait times.
Step7:Build variety of visualization for this primary metrics
*average wait times
*patient count on the waitlist by department
*percentage of patients waiting for more than a specified time
Step8:Design the report layout with easy navigation,use report tabs
E.g.overview,trends,department,breakdown.
Step8.1:Add slicers and filters for user interaction for department filter,data range slicer to
view a specified period,dynamic filtering based on wait time.
Step8.2:Add annotation to explain key findings or trends.
Step9:Test the report to ensure data accuracy and visualization correctness .
Step10:Stop

OUTPUT:
`

RESULT:
Thus the report for healthcare patient in power BI was executed and demonstrated
successfully.

You might also like