Dwbi Lab
Program-2: How to perform the Data Analysis and how to visualize the data using various charts in
Excel?
[Figure: chart titled "AI", with data labels giving each student's roll number and name]
b) Clustered Column – Line on Total vs Gained
[Figure: "Total vs Gained" chart; value axis 125 to 155, category axis students 1 to 14]
[Figure: "All Subjects" chart; value axis 24 to 31]
[Figure: "AVG" chart; value axis 25.5 to 29.5, category axis students 1 to 14]
e) Pareto on all attributes from Name-Avg
[Figure: Pareto chart ("Chart Title") of the AI marks by student name; left axis 0 to 40, right axis 0 to 100]
Program-3: How to analyse the excel data using various formulas for a given dataset?
MID EXAMINATIONS-I RESULTS
S.NO ROLL NO NAME AI ATCD R&A DWBI CN MARKS GAINED TOTAL MARKS AVG
1 22N31A6701 Sai 28 27 26 29 30 140 150 28
2 22N31A6702 Teja 28 29 28 30 27 142 150 28.4
3 22N31A6703 Raj 27 26 28 28 29 138 150 27.6
4 22N31A6704 Rishi 29 28 29 30 27 143 150 28.6
5 22N31A6705 Yesh 30 29 28 29 27 143 150 28.6
6 22N31A6706 Charan 26 28 27 29 28 138 150 27.6
7 22N31A6707 Harsha 30 28 28 29 26 141 150 28.2
8 22N31A6708 Alok 27 26 27 28 26 134 150 26.8
9 22N31A6709 Asad 30 29 28 29 28 144 150 28.8
10 22N31A6710 Saketh 29 28 29 27 26 139 150 27.8
11 22N31A6711 Chethan 29 28 28 27 29 141 150 28.2
12 22N31A6712 Nuthan 30 29 29 28 30 146 150 29.2
13 22N31A6713 Prashanth 28 27 27 26 27 135 150 27
14 22N31A6714 Shashank 27 29 29 27 28 140 150 28
Table-1
b) CountIf: Counts the number of cells within a range that meets the given condition.
Formula: =countif(range, criteria)
Example: =countif(d3:d16,30)
c) CountA: Counts the number of cells in a range that are not empty.
Formula: =counta(range)
Example: =counta(a1:k16)
d) Index: Returns the value of the cell at the intersection of a particular row and
column in a given range.
Formula: =index(range,row_num,column_num)
Example: =index(D3:H3,1,4)
e) Vlookup: Looks for a value in the leftmost column of a table, and then returns a value
in the same row from a column you specify.
Formula: =vlookup(lookup_value,range,col_index_num,[range_lookup])
Example: =vlookup("Raj",C3:I16,7,FALSE)
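The four formulas above can be cross-checked outside Excel. The following is a minimal Python sketch, not part of the lab sheet: it assumes, as the example ranges imply, that the Table-1 data occupies rows 3 to 16 with NAME in column C and AI in column D. The names `table`, `countif_result` and so on are illustrative only.

```python
# Table-1 rows as (NAME, AI, ATCD, R&A, DWBI, CN), i.e. sheet columns C to H.
table = [
    ("Sai", 28, 27, 26, 29, 30), ("Teja", 28, 29, 28, 30, 27),
    ("Raj", 27, 26, 28, 28, 29), ("Rishi", 29, 28, 29, 30, 27),
    ("Yesh", 30, 29, 28, 29, 27), ("Charan", 26, 28, 27, 29, 28),
    ("Harsha", 30, 28, 28, 29, 26), ("Alok", 27, 26, 27, 28, 26),
    ("Asad", 30, 29, 28, 29, 28), ("Saketh", 29, 28, 29, 27, 26),
    ("Chethan", 29, 28, 28, 27, 29), ("Nuthan", 30, 29, 29, 28, 30),
    ("Prashanth", 28, 27, 27, 26, 27), ("Shashank", 27, 29, 29, 27, 28),
]

# COUNTIF(D3:D16, 30): how many AI marks equal 30.
countif_result = sum(1 for row in table if row[1] == 30)

# COUNTA over this block: how many cells are not empty.
counta_result = sum(1 for row in table for cell in row if cell not in (None, ""))

# INDEX(D3:H3, 1, 4): row 1, column 4 of the first student's five marks.
index_result = table[0][1:][4 - 1]

# VLOOKUP("Raj", C3:I16, 7, FALSE): exact match on NAME, return the 7th column
# (MARKS GAINED, rebuilt here as the sum of the five subject marks).
with_total = [row + (sum(row[1:]),) for row in table]
vlookup_result = next(row[7 - 1] for row in with_total if row[0] == "Raj")

print(countif_result, counta_result, index_result, vlookup_result)
```

Here `counta_result` covers only the 14 x 6 block held in `table`, so it is not the value Excel returns for the wider range a1:k16.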
Program-4: How to perform various conditional formats in Excel for a given dataset.
MID EXAMINATIONS-I RESULTS
S.NO ROLL NO NAME AI ATCD R&A DWBI CN MARKS GAINED TOTAL MARKS AVG
1 22N31A6701 Sai 28 27 26 29 30 140 150 28
2 22N31A6702 Teja 28 29 28 30 27 142 150 28.4
3 22N31A6703 Raj 27 26 28 28 29 138 150 27.6
4 22N31A6704 Rishi 29 28 29 30 27 143 150 28.6
5 22N31A6705 Yesh 30 29 28 29 27 143 150 28.6
6 22N31A6706 Charan 26 28 27 29 28 138 150 27.6
7 22N31A6707 Harsha 30 28 28 29 26 141 150 28.2
8 22N31A6708 Alok 27 26 27 28 26 134 150 26.8
9 22N31A6709 Asad 30 29 28 29 28 144 150 28.8
10 22N31A6710 Saketh 29 28 29 27 26 139 150 27.8
11 22N31A6711 Chethan 29 28 28 27 29 141 150 28.2
12 22N31A6712 Nuthan 30 29 29 28 30 146 150 29.2
13 22N31A6713 Prashanth 28 27 27 26 27 135 150 27
14 22N31A6714 Shashank 27 29 29 27 28 140 150 28
Table-1
Select the data you want to format, then on the Home tab click Conditional
Formatting; the available formatting rules are shown in the figures below.
1) Greater than
Select the marks of the five subjects and highlight the cells with values greater than 27.
[Figure: Table-1 with every subject mark greater than 27 highlighted]
2) Less than
Select the Marks Gained column and highlight the cells with values less than 140.
[Figure: Table-1 with every Marks Gained value less than 140 highlighted]
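The two rules above amount to simple comparisons on each cell. As a rough sketch in Python (an illustration only, using four rows of Table-1; the dictionary layout and variable names are assumptions), the cells each rule would highlight can be listed like this:

```python
subjects = ["AI", "ATCD", "R&A", "DWBI", "CN"]

# Four rows of Table-1: name -> marks in the five subjects.
rows = {
    "Sai":  [28, 27, 26, 29, 30],
    "Teja": [28, 29, 28, 30, 27],
    "Raj":  [27, 26, 28, 28, 29],
    "Alok": [27, 26, 27, 28, 26],
}

# Rule 1 (Greater than 27): subjects whose mark would be highlighted per student.
above_27 = {
    name: [s for s, mark in zip(subjects, marks) if mark > 27]
    for name, marks in rows.items()
}

# Rule 2 (Less than 140): students whose marks gained would be highlighted.
below_140 = [name for name, marks in rows.items() if sum(marks) < 140]

print(above_27)
print(below_140)
```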
TASK:
Do the rest of the Conditional Formatting.
Unique values:
Formula: =unique(range)
Example: =unique(i3:i16)
Unique
140
142
138
143
141
134
144
139
146
135
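The same order-preserving de-duplication can be sketched in Python. This is an illustration rather than part of the lab sheet; it assumes column I holds the MARKS GAINED values of Table-1.

```python
# MARKS GAINED column of Table-1 (cells I3:I16 in the example above).
marks_gained = [140, 142, 138, 143, 143, 138, 141, 134, 144, 139, 141, 146, 135, 140]

# dict.fromkeys keeps the first occurrence of each value, in original order.
unique_marks = list(dict.fromkeys(marks_gained))
print(unique_marks)
```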
Numbers to Currency:
Formula: =text(cell/range,"$#,##0.00")
Example:
Number Currency
25 $25.00
35 $35.00
100 $100.00
66 $66.00
120 $120.00
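For comparison, a similar result can be produced in Python with a format specification. This is a small sketch; the helper name `to_currency` is hypothetical.

```python
def to_currency(value):
    """Format a number with a dollar sign, thousands separators and two decimals."""
    return f"${value:,.2f}"

numbers = [25, 35, 100, 66, 120]
for n in numbers:
    print(n, to_currency(n))
```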
Numbers to Romans:
Formula: =roman(cell/range)
Example:
Number Roman
25 XXV
35 XXXV
100 C
66 LXVI
120 CXX
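The conversion can also be sketched in Python. The function below is a hypothetical helper that uses the usual greedy subtraction over value/symbol pairs and is limited here to 1 through 3999; it is not a description of how Excel implements =roman().

```python
def to_roman(number):
    """Convert an integer from 1 to 3999 to a Roman numeral."""
    if not 0 < number < 4000:
        raise ValueError("number must be between 1 and 3999")
    pairs = [
        (1000, "M"), (900, "CM"), (500, "D"), (400, "CD"),
        (100, "C"), (90, "XC"), (50, "L"), (40, "XL"),
        (10, "X"), (9, "IX"), (5, "V"), (4, "IV"), (1, "I"),
    ]
    parts = []
    for value, symbol in pairs:
        count, number = divmod(number, value)  # how many times this symbol fits
        parts.append(symbol * count)
    return "".join(parts)

for n in [25, 35, 100, 66, 120]:
    print(n, to_roman(n))
```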
Program-5: Perform Pivot Table Operations in Excel for a given dataset?
MID EXAMINATIONS-I RESULTS
S.NO ROLL NO NAME AI ATCD R&A DWBI CN MARKS GAINED TOTAL MARKS AVG
1 22N31A6701 Sai 28 27 26 29 30 140 150 28
2 22N31A6702 Teja 28 29 28 30 27 142 150 28.4
3 22N31A6703 Raj 27 26 28 28 29 138 150 27.6
4 22N31A6704 Rishi 29 28 29 30 27 143 150 28.6
5 22N31A6705 Yesh 30 29 28 29 27 143 150 28.6
6 22N31A6706 Charan 26 28 27 29 28 138 150 27.6
7 22N31A6707 Harsha 30 28 28 29 26 141 150 28.2
8 22N31A6708 Alok 27 26 27 28 26 134 150 26.8
9 22N31A6709 Asad 30 29 28 29 28 144 150 28.8
10 22N31A6710 Saketh 29 28 29 27 26 139 150 27.8
11 22N31A6711 Chethan 29 28 28 27 29 141 150 28.2
12 22N31A6712 Nuthan 30 29 29 28 30 146 150 29.2
13 22N31A6713 Prashanth 28 27 27 26 27 135 150 27
14 22N31A6714 Shashank 27 29 29 27 28 140 150 28
Table-1
Pivot Table allows users to extract useful insights from data by transforming it into a
structured format, enabling quick summarization and comparison of data.
On the Insert tab, click the PivotTable option.
Names | Sum of AI | Sum of ATCD | Sum of R&A | Sum of DWBI | Sum of CN
Alok 27 26 27 28 26
Asad 30 29 28 29 28
Charan 26 28 27 29 28
Chethan 29 28 28 27 29
Harsha 30 28 28 29 26
Nuthan 30 29 29 28 30
Prashanth 28 27 27 26 27
Raj 27 26 28 28 29
Rishi 29 28 29 30 27
Sai 28 27 26 29 30
Saketh 29 28 29 27 26
Shashank 27 29 29 27 28
Teja 28 29 28 30 27
Yesh 30 29 28 29 27
Grand Total 398 391 391 396 388
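The pivot table above can be reproduced outside Excel as a cross-check. The sketch below assumes the pandas library is available (the lab sheet does not name it) and uses `DataFrame.pivot_table` with a margin row named "Grand Total"; the variable names are illustrative.

```python
import pandas as pd

# Subject marks from Table-1.
df = pd.DataFrame({
    "NAME": ["Sai", "Teja", "Raj", "Rishi", "Yesh", "Charan", "Harsha",
             "Alok", "Asad", "Saketh", "Chethan", "Nuthan", "Prashanth", "Shashank"],
    "AI":   [28, 28, 27, 29, 30, 26, 30, 27, 30, 29, 29, 30, 28, 27],
    "ATCD": [27, 29, 26, 28, 29, 28, 28, 26, 29, 28, 28, 29, 27, 29],
    "R&A":  [26, 28, 28, 29, 28, 27, 28, 27, 28, 29, 28, 29, 27, 29],
    "DWBI": [29, 30, 28, 30, 29, 29, 29, 28, 29, 27, 27, 28, 26, 27],
    "CN":   [30, 27, 29, 27, 27, 28, 26, 26, 28, 26, 29, 30, 27, 28],
})
subjects = ["AI", "ATCD", "R&A", "DWBI", "CN"]

# One row per name, sum of each subject, plus a totals row.
pivot = df.pivot_table(index="NAME", values=subjects, aggfunc="sum",
                       margins=True, margins_name="Grand Total")
pivot = pivot[subjects]  # put the columns back in the sheet's order
print(pivot)
```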
Program-6 : Perform ANOVA Operations in Excel for a given dataset?
Anova: Single Factor
SUMMARY
Groups Count Sum Average Variance
7 2 13 6.5 4.5
3 2 13 6.5 4.5
7 2 10 5 2
ANOVA
Source of Variation SS df MS F P-value
Between Groups 3 2 1.5 0.409090909 0.696461276
Within Groups 11 3 3.666666667
Anova: Two-Factor Without Replication
ANOVA
Source of Variation SS df MS F P-value
Rows 0.888888889 2 0.444444444 0.084210526 0.920824406
Columns 2.888888889 2 1.444444444 0.273684211 0.773748285
Error 21.11111111 4 5.277777778
Total 24.88888889 8
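To show where the single-factor figures come from, here is a minimal Python sketch of the sum-of-squares arithmetic. The worksheet data is not shown in this record, so the observations below are hypothetical values chosen only because they reproduce the Count, Sum, Average and Variance rows of the SUMMARY table. The P-value is not computed, since that needs the F distribution.

```python
from statistics import mean

# Hypothetical observations (NOT the original worksheet data): each group
# matches one SUMMARY row, e.g. [5, 8] has sum 13, average 6.5, variance 4.5.
groups = [[5, 8], [5, 8], [4, 6]]

grand_mean = mean(x for g in groups for x in g)

# Between-groups and within-groups sums of squares.
ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

df_between = len(groups) - 1
df_within = sum(len(g) for g in groups) - len(groups)

ms_between = ss_between / df_between
ms_within = ss_within / df_within
f_stat = ms_between / ms_within

print(ss_between, df_between, ms_between, f_stat)
print(ss_within, df_within, ms_within)
```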
Program-7: Execute the below programs using Python (Jupyter Notebook/Anaconda)?
i) How to create a dataset using python
ii) How to read and display a dataset (in Excel or CSV) using python
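The code for these two steps is not preserved in this record. One possible sketch, assuming the pandas library and a hypothetical file name, builds a small dataset from three rows of Table-1, writes it to CSV, then reads it back and displays it:

```python
import os
import tempfile

import pandas as pd

# i) Create a dataset from a dictionary of columns (three rows of Table-1).
df = pd.DataFrame({
    "ROLL NO": ["22N31A6701", "22N31A6702", "22N31A6703"],
    "NAME": ["Sai", "Teja", "Raj"],
    "AI": [28, 28, 27],
    "DWBI": [29, 30, 28],
})

# ii) Save it as CSV, read it back and display it.
csv_path = os.path.join(tempfile.gettempdir(), "mid_exam_results.csv")  # hypothetical name
df.to_csv(csv_path, index=False)
loaded = pd.read_csv(csv_path)
print(loaded)
```

For an Excel workbook, `pd.read_excel` plays the same role as `pd.read_csv`; reading .xlsx files typically also requires the openpyxl package.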
iii) Consider a given dataset and perform the following operation:
a) How to display the top 5 records:
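A minimal sketch of this step, assuming the dataset is held in a pandas DataFrame (the small frame below is illustrative):

```python
import pandas as pd

df = pd.DataFrame({
    "NAME": ["Sai", "Teja", "Raj", "Rishi", "Yesh", "Charan", "Harsha"],
    "AI": [28, 28, 27, 29, 30, 26, 30],
})

top5 = df.head()  # head() returns the first 5 rows by default
print(top5)
```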
e) How to display the Blank or Null values
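One way to do this with pandas, shown as a sketch. Table-1 has no missing values, so the two gaps below are hypothetical and inserted only for illustration.

```python
import pandas as pd

df = pd.DataFrame({
    "NAME": ["Sai", "Teja", "Raj"],
    "AI": [28, None, 27],     # hypothetical missing mark
    "DWBI": [29, 30, None],   # hypothetical missing mark
})

null_counts = df.isnull().sum()                  # missing values per column
rows_with_nulls = df[df.isnull().any(axis=1)]    # rows with at least one missing value

print(null_counts)
print(rows_with_nulls)
```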
g) How to display the records based on condition.
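A sketch of condition-based selection with pandas boolean indexing, using five rows of Table-1. The thresholds are arbitrary examples.

```python
import pandas as pd

df = pd.DataFrame({
    "NAME": ["Sai", "Teja", "Raj", "Rishi", "Yesh"],
    "AI": [28, 28, 27, 29, 30],
    "MARKS GAINED": [140, 142, 138, 143, 143],
})

# Single condition: AI marks of at least 29.
high_ai = df[df["AI"] >= 29]

# Two conditions combined with & (each wrapped in parentheses).
both = df[(df["AI"] >= 28) & (df["MARKS GAINED"] > 140)]

print(high_ai)
print(both)
```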
h) How to drop or delete an attribute or column in a dataset
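A sketch with pandas. `DataFrame.drop` returns a new frame and leaves the original unchanged unless the result is assigned back; the column chosen here is only an example.

```python
import pandas as pd

df = pd.DataFrame({
    "NAME": ["Sai", "Teja", "Raj"],
    "MARKS GAINED": [140, 142, 138],
    "TOTAL MARKS": [150, 150, 150],
})

trimmed = df.drop(columns=["TOTAL MARKS"])  # remove one column by name
print(trimmed)
```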
Program -8: How to perform the Descriptive Statistics Analysis and display the summary
information for a given dataset
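The program's code is not preserved in this record. A plausible sketch with pandas uses `describe()` for the descriptive statistics and `info()` for the summary information; the four-row frame is taken from Table-1.

```python
import pandas as pd

df = pd.DataFrame({
    "NAME": ["Sai", "Teja", "Raj", "Rishi"],
    "AI": [28, 28, 27, 29],
    "DWBI": [29, 30, 28, 30],
})

summary = df.describe()  # count, mean, std, min, quartiles, max of numeric columns
print(summary)

df.info()  # column names, non-null counts and dtypes
```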
Program-9: Perform basic plotting (stack, bar, scatter, pie, box plot, bubble plot, etc.)
with and without the legend function for a given dataset.
a) Bar plot:
With legend
Without legend
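The plotting code is not preserved in this record. A minimal sketch, assuming matplotlib and four rows of Table-1, with a hypothetical helper whose `with_legend` flag switches the legend on or off:

```python
import matplotlib.pyplot as plt

names = ["Sai", "Teja", "Raj", "Rishi"]
ai_marks = [28, 28, 27, 29]

def bar_plot(with_legend):
    """Draw the AI marks as bars; add a legend only when asked."""
    fig, ax = plt.subplots()
    ax.bar(names, ai_marks, label="AI")
    ax.set_xlabel("Student")
    ax.set_ylabel("Marks")
    if with_legend:
        ax.legend()
    return ax

ax_with = bar_plot(True)
ax_without = bar_plot(False)
```

In a notebook, calling `plt.show()` afterwards displays both figures.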
b) Stack plot:
With legend
Without legend
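A possible version with matplotlib's `stackplot`, stacking three subjects for four students of Table-1. The helper name and the choice of subjects are assumptions.

```python
import matplotlib.pyplot as plt

students = [1, 2, 3, 4]      # S.NO of the first four students
ai = [28, 28, 27, 29]
atcd = [27, 29, 26, 28]
dwbi = [29, 30, 28, 30]

def stack_plot(with_legend):
    """Stack the three subjects on top of each other per student."""
    fig, ax = plt.subplots()
    ax.stackplot(students, ai, atcd, dwbi, labels=["AI", "ATCD", "DWBI"])
    ax.set_xlabel("S.NO")
    ax.set_ylabel("Marks")
    if with_legend:
        ax.legend(loc="upper left")
    return ax

ax_with = stack_plot(True)
ax_without = stack_plot(False)
```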
c) Scatter plot:
With legend
Without legend
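A sketch of a scatter plot with matplotlib, plotting AI against DWBI marks for five students of Table-1. The column pairing is an arbitrary choice for illustration.

```python
import matplotlib.pyplot as plt

ai = [28, 28, 27, 29, 30]
dwbi = [29, 30, 28, 30, 29]

def scatter_plot(with_legend):
    """One point per student: AI on the x-axis, DWBI on the y-axis."""
    fig, ax = plt.subplots()
    ax.scatter(ai, dwbi, label="Students")
    ax.set_xlabel("AI")
    ax.set_ylabel("DWBI")
    if with_legend:
        ax.legend()
    return ax

ax_with = scatter_plot(True)
ax_without = scatter_plot(False)
```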
d) Pie Chart:
With legend
Without legend
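A sketch of a pie chart with matplotlib, showing how one student's marks (Sai, from Table-1) split across the five subjects. The legend placement is an arbitrary choice.

```python
import matplotlib.pyplot as plt

subjects = ["AI", "ATCD", "R&A", "DWBI", "CN"]
sai_marks = [28, 27, 26, 29, 30]

def pie_chart(with_legend):
    """Pie of one student's marks by subject."""
    fig, ax = plt.subplots()
    ax.pie(sai_marks, labels=subjects, autopct="%1.1f%%")
    if with_legend:
        # The wedges carry the subject labels, so legend() picks them up.
        ax.legend(loc="center left", bbox_to_anchor=(1.0, 0.5))
    return ax

ax_with = pie_chart(True)
ax_without = pie_chart(False)
```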
f) Box plot:
With legend
Without legend
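Box plots have no automatic legend entries, so one option is to pass the box artists to `legend()` explicitly. This is a sketch under that assumption, with matplotlib and three subject columns for six students of Table-1; the colours are arbitrary.

```python
import matplotlib.pyplot as plt

subjects = ["AI", "ATCD", "R&A"]
data = [
    [28, 28, 27, 29, 30, 26],   # AI
    [27, 29, 26, 28, 29, 28],   # ATCD
    [26, 28, 28, 29, 28, 27],   # R&A
]
colors = ["tab:blue", "tab:orange", "tab:green"]

def box_plot(with_legend):
    """One box per subject; the legend maps box colours to subjects."""
    fig, ax = plt.subplots()
    boxes = ax.boxplot(data, patch_artist=True)["boxes"]
    for box, color in zip(boxes, colors):
        box.set_facecolor(color)
    ax.set_xticklabels(subjects)
    ax.set_ylabel("Marks")
    if with_legend:
        ax.legend(boxes, subjects)
    return ax

ax_with = box_plot(True)
ax_without = box_plot(False)
```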
g) Bubble Plot:
With legend
Without legend
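A bubble plot can be sketched as a scatter plot whose marker size encodes a third variable. The example below assumes matplotlib; the scaling of marks gained into bubble sizes is an arbitrary choice made for visibility.

```python
import matplotlib.pyplot as plt

ai = [28, 28, 27, 29]
dwbi = [29, 30, 28, 30]
marks_gained = [140, 142, 138, 143]

# Arbitrary scaling so that differences in marks gained are visible as area.
sizes = [(g - 130) * 20 for g in marks_gained]

def bubble_plot(with_legend):
    """AI vs DWBI, with bubble area driven by marks gained."""
    fig, ax = plt.subplots()
    ax.scatter(ai, dwbi, s=sizes, alpha=0.5, label="Size = marks gained")
    ax.set_xlabel("AI")
    ax.set_ylabel("DWBI")
    if with_legend:
        ax.legend()
    return ax

ax_with = bubble_plot(True)
ax_without = bubble_plot(False)
```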
Program-11: Python Program to print Wordcloud for a given text.
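The program itself is not preserved in this record. The sketch below assumes the third-party `wordcloud` package for the rendering step and uses a hypothetical sample text and output file name. The word counting uses only the standard library, and the package is imported inside the rendering function so the counting step can be run on its own.

```python
import re
from collections import Counter

# Hypothetical sample text.
text = "data warehouse business intelligence data analysis data visualization"

def word_frequencies(source):
    """Lower-case the text and count each alphabetic word."""
    return Counter(re.findall(r"[a-z]+", source.lower()))

def render_wordcloud(frequencies, path="wordcloud.png"):
    """Draw the word cloud to an image file (requires the wordcloud package)."""
    from wordcloud import WordCloud
    cloud = WordCloud(width=800, height=400, background_color="white")
    cloud.generate_from_frequencies(frequencies)
    cloud.to_file(path)

freqs = word_frequencies(text)
print(freqs.most_common(3))
```

Calling `render_wordcloud(freqs)` then writes the image, with more frequent words drawn larger.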
Program-12&13: Microsoft Power BI visualizations.
Loading and transforming the dataset.
Let us see some visuals based on a dataset we have loaded into Power BI.
In the Data section we select the attributes (they can also be dragged and dropped).
In the Visualizations section we choose how to visualize the data (treemap, bar
chart, line chart, pie chart, doughnut chart, etc.), and in the same section we can
also format the visual (legend, labels, colours, data format, etc.).
1) Sum of sales by state:
4) Sum of sales by shipping mode using a matrix:
We can also transform the data along the way by adding new attributes. Let us see how.
Click Transform data in the Home section.
In the Add Column section we can insert a custom column with a suitable formula, as
shown in the figure below.
Here, we want to add a Delay days column (the difference between the ordered date and
the shipped date), so we apply the difference inside the Table.TransformColumnTypes
step.
This results in a new column of Delay days in the original dataset as shown in the below
figure.
We save this data and load it back into the Data section.
Let us visualize the newly added Delay days column.
In the clustered column chart above, we count the order IDs for each number of delay days.
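The logic behind the Delay days column and the count shown in the chart can be illustrated outside Power BI. The sketch below is plain Python, not Power Query, and the three orders are hypothetical; it only shows the date difference and the count of order IDs per delay value.

```python
from collections import Counter
from datetime import date

# Hypothetical orders: (order id, ordered date, shipped date).
orders = [
    ("A-1", date(2024, 1, 1), date(2024, 1, 4)),
    ("A-2", date(2024, 1, 2), date(2024, 1, 4)),
    ("A-3", date(2024, 1, 5), date(2024, 1, 8)),
]

# Delay days = shipped date minus ordered date, in whole days.
delay_days = {oid: (shipped - ordered).days for oid, ordered, shipped in orders}

# Count of order IDs for each number of delay days (what the column chart plots).
orders_per_delay = Counter(delay_days.values())

print(delay_days)
print(orders_per_delay)
```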