0% found this document useful (0 votes)
10 views35 pages

Dwbi Lab

Uploaded by

ramu34080
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
10 views35 pages

Dwbi Lab

Uploaded by

ramu34080
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 35

Program-1: Using Excel, Perform the following operations compute a) Sum b) Maximum c) Minimum

d) Average e) Median f) Standard Deviation g) round.

Below is a table of MID-I Examinations Marks.

MID EXAMINATIONS-I RESULTS


S.NO ROLL NO NAME AI ATCD R&A DWBI CN MARKS GAINED TOTAL MARKS AVG
1 22N31A6701 Sai 28 27 26 29 30 140 150 28
2 22N31A6702 Teja 28 29 28 30 27 142 150 28.4
3 22N31A6703 Raj 27 26 28 28 29 138 150 27.6
4 22N31A6704 Rishi 29 28 29 30 27 143 150 28.6
5 22N31A6705 Yesh 30 29 28 29 27 143 150 28.6
6 22N31A6706 Charan 26 28 27 29 28 138 150 27.6
7 22N31A6707 Harsha 30 28 28 29 26 141 150 28.2
8 22N31A6708 Alok 27 26 27 28 26 134 150 26.8
9 22N31A6709 Asad 30 29 28 29 28 144 150 28.8
10 22N31A6710 Saketh 29 28 29 27 26 139 150 27.8
11 22N31A6711 Chethan 29 28 28 27 29 141 150 28.2
12 22N31A6712 Nuthan 30 29 29 28 30 146 150 29.2
13 22N31A6713 Prashanth 28 27 27 26 27 135 150 27
14 22N31A6714 Shashank 27 29 29 27 28 140 150 28
Table-1

a) Sum: It adds all the numbers in a range of cells


Formula: =sum(d3:h3)
b) Maximum: Returns the largest value in a set of values.
Formula: =max(d3:h3)
c) Minimum: Returns the smallest value in a set of values.
Formula: =min(d3:h3)
d) Average: Returns the average value.
Formula: =average(d3:h3)
e) Median: Returns the middle value in a set of values.
Formula: =median(d3:h3)
f) Standard deviation: Estimates standard deviation based on a sample.
Formula: =stdev.s(d3:h3)
g) Round: Rounds a number to a specified number of digits
Formula: =round(k4,0)

Sum Maximum Minimum Average Median Standard Deviation Round


140 30 26 28 28 1.58113883 28

1
Program-2: How to perform the Data Analysis and how to visualize the data using various charts in
Excel?

MID EXAMINATIONS-I RESULTS


S.NO ROLL NO NAME AI ATCD R&A DWBI CN MARKS GAINED TOTAL MARKS AVG
1 22N31A6701 Sai 28 27 26 29 30 140 150 28
2 22N31A6702 Teja 28 29 28 30 27 142 150 28.4
3 22N31A6703 Raj 27 26 28 28 29 138 150 27.6
4 22N31A6704 Rishi 29 28 29 30 27 143 150 28.6
5 22N31A6705 Yesh 30 29 28 29 27 143 150 28.6
6 22N31A6706 Charan 26 28 27 29 28 138 150 27.6
7 22N31A6707 Harsha 30 28 28 29 26 141 150 28.2
8 22N31A6708 Alok 27 26 27 28 26 134 150 26.8
9 22N31A6709 Asad 30 29 28 29 28 144 150 28.8
10 22N31A6710 Saketh 29 28 29 27 26 139 150 27.8
11 22N31A6711 Chethan 29 28 28 27 29 141 150 28.2
12 22N31A6712 Nuthan 30 29 29 28 30 146 150 29.2
13 22N31A6713 Prashanth 28 27 27 26 27 135 150 27
14 22N31A6714 Shashank 27 29 29 27 28 140 150 28
Table-1

Let us apply some charts on table-1.

a) Pie Chart on AI marks

AI
22N31A67122N31A671 22N31A670
3 4 Shashank 1 Sai 22N31A670
Prashanth 2 Teja
22N31A671 22N31A670
2 Nuthan 3 Raj

22N31A671 22N31A670
1 Chethan 4 Rishi

22N31A671 22N31A670
0 Saketh 5 Yesh

22N31A670 22N31A670
9 Asad 22N31A670 22N31A6706 Charan
8 Alok 7 Harsha

2
b) Clustered Column – Line on Total vs Gained

Total vs Gained
155
150
145
140
135
130
125
1 2 3 4 5 6 7 8 9 10 11 12 13 14

MARKS GAINED TOTAL MARKS

c) Clustered Column on all subjects

All Subjects
31
30
29
28
27
26
25
24

AI ATCD R&A DWBI CN

d) Line Chart with markers on Average

AVG
29.5

29

28.5

28

27.5

27

26.5

26

25.5
1 2 3 4 5 6 7 8 9 10 11 12 13 14

3
e) Pareto on all attributes from Name-Avg

Chart Title
AI

40 100
30 0
60
20
40
10 20
0 0

S ai
esh

j
hi
a

a
an

k
nk
d
an

n
eth

Ra
Alo
rsh

Tej
Asa

ant

ara
Ris

sh a
eth
th

S ak
Ha

Ch
sh
Nu

Ch

Sha
Pra
Program-3: How to analyse the excel data using various formulas for a given dataset?
MID EXAMINATIONS-I RESULTS
S.NO ROLL NO NAME AI ATCD R&A DWBI CN MARKS GAINED TOTAL MARKS AVG
1 22N31A6701 Sai 28 27 26 29 30 140 150 28
2 22N31A6702 Teja 28 29 28 30 27 142 150 28.4
3 22N31A6703 Raj 27 26 28 28 29 138 150 27.6
4 22N31A6704 Rishi 29 28 29 30 27 143 150 28.6
5 22N31A6705 Yesh 30 29 28 29 27 143 150 28.6
6 22N31A6706 Charan 26 28 27 29 28 138 150 27.6
7 22N31A6707 Harsha 30 28 28 29 26 141 150 28.2
8 22N31A6708 Alok 27 26 27 28 26 134 150 26.8
9 22N31A6709 Asad 30 29 28 29 28 144 150 28.8
10 22N31A6710 Saketh 29 28 29 27 26 139 150 27.8
11 22N31A6711 Chethan 29 28 28 27 29 141 150 28.2
12 22N31A6712 Nuthan 30 29 29 28 30 146 150 29.2
13 22N31A6713 Prashanth 28 27 27 26 27 135 150 27
14 22N31A6714 Shashank 27 29 29 27 28 140 150 28
Table-1

a) Count: Counts the number of cells in a range that contains numbers.


Formula: =count(range)
Example: =count(a3:a16)

b) CountIf: Counts the number of cells within a range that meets the given condition.
Formula: =countif(range, criteria)
Example: =countif(d3:d16,30)

4
c) CountA: Counts the number of cells in a range that are not empty.
Formula: =counta(range)
Example: =counta(a1:k16)

d) Index: Returns a value of the cell at the intersection of a particular row and and
column, in a given a range.
Formula: =index(range,row_num,column_num)
Example: =index(D3:H3,1,4)

e) Vlookup: Looks for a value in the leftmost column of a table, and then returns a value
in the same row from a column you specify.
Formula: =vlookup(look_up value,range,col_index_num)
Example: =vlookup("Raj",C3:I16,7,FALSE)

f) Length: Returns the number of characters in a text string.


Formula: =len(text/ cell containing text)
Example: =len(c3)

Count CountIf CountA Index Vlookup Length


14 4 166 29 138 3

Program-4: How to perform various conditional formats in Excel for a given dataset.
MID EXAMINATIONS-I RESULTS
S.NO ROLL NO NAME AI ATCD R&A DWBI CN MARKS GAINED TOTAL MARKS AVG
1 22N31A6701 Sai 28 27 26 29 30 140 150 28
2 22N31A6702 Teja 28 29 28 30 27 142 150 28.4
3 22N31A6703 Raj 27 26 28 28 29 138 150 27.6
4 22N31A6704 Rishi 29 28 29 30 27 143 150 28.6
5 22N31A6705 Yesh 30 29 28 29 27 143 150 28.6
6 22N31A6706 Charan 26 28 27 29 28 138 150 27.6
7 22N31A6707 Harsha 30 28 28 29 26 141 150 28.2
8 22N31A6708 Alok 27 26 27 28 26 134 150 26.8
9 22N31A6709 Asad 30 29 28 29 28 144 150 28.8
10 22N31A6710 Saketh 29 28 29 27 26 139 150 27.8
11 22N31A6711 Chethan 29 28 28 27 29 141 150 28.2
12 22N31A6712 Nuthan 30 29 29 28 30 146 150 29.2
13 22N31A6713 Prashanth 28 27 27 26 27 135 150 27
14 22N31A6714 Shashank 27 29 29 27 28 140 150 28
Table-1

5
Select the dataset you want to apply the formatting and then in the dashboard click
conditional formatting, there you can use various formatting as given the below figures.

6
1) Greater than
Selected the 5 subjects marks and highlighting the cells with above 27.
MID EXAMINATIONS-I RESULTS
S.NO ROLL NO NAME AI ATCD R&A DWBI CN MARKS GAINED TOTAL MARKS AVG
1 22N31A6701 Sai 28 27 26 29 30 140 150 28
2 22N31A6702 Teja 28 29 28 30 27 142 150 28.4
3 22N31A6703 Raj 27 26 28 28 29 138 150 27.6
4 22N31A6704 Rishi 29 28 29 30 27 143 150 28.6
5 22N31A6705 Yesh 30 29 28 29 27 143 150 28.6
6 22N31A6706 Charan 26 28 27 29 28 138 150 27.6
7 22N31A6707 Harsha 30 28 28 29 26 141 150 28.2
8 22N31A6708 Alok 27 26 27 28 26 134 150 26.8
9 22N31A6709 Asad 30 29 28 29 28 144 150 28.8
10 22N31A6710 Saketh 29 28 29 27 26 139 150 27.8
11 22N31A6711 Chethan 29 28 28 27 29 141 150 28.2
12 22N31A6712 Nuthan 30 29 29 28 30 146 150 29.2
13 22N31A6713 Prashanth 28 27 27 26 27 135 150 27
14 22N31A6714 Shashank 27 29 29 27 28 140 150 28

2) Less than
Selected the Marks Gained column and highlighting the with less than 140.
MID EXAMINATIONS-I RESULTS
S.NO ROLL NO NAME AI ATCD R&A DWBI CN MARKS GAINED TOTAL MARKS AVG
1 22N31A6701 Sai 28 27 26 29 30 140 150 28
2 22N31A6702 Teja 28 29 28 30 27 142 150 28.4
3 22N31A6703 Raj 27 26 28 28 29 138 150 27.6
4 22N31A6704 Rishi 29 28 29 30 27 143 150 28.6
5 22N31A6705 Yesh 30 29 28 29 27 143 150 28.6
6 22N31A6706 Charan 26 28 27 29 28 138 150 27.6
7 22N31A6707 Harsha 30 28 28 29 26 141 150 28.2
8 22N31A6708 Alok 27 26 27 28 26 134 150 26.8
9 22N31A6709 Asad 30 29 28 29 28 144 150 28.8
10 22N31A6710 Saketh 29 28 29 27 26 139 150 27.8
11 22N31A6711 Chethan 29 28 28 27 29 141 150 28.2
12 22N31A6712 Nuthan 30 29 29 28 30 146 150 29.2
13 22N31A6713 Prashanth 28 27 27 26 27 135 150 27
14 22N31A6714 Shashank 27 29 29 27 28 140 150 28

TASK:
Do the rest of the Conditional Formatting.

7
Unique values:
Formula: =unique(range)
Example: =unique(i3:i16)
Unique
140
142
138
143
141
134
144
139
146
135

Numbers to Currency:
Formula: =text(cell/range,”$#,##0.00”)
Example:
Number Currency
25 $25.00
35 $35.00
100 $100.00
66 $66.00
120 $120.00

Numbers to Romans:
Formula: =roman(cell/range)
Example:

Number Roman
25 XXV
35 XXXV
100 C
66 LXVI
120 CXX
Program-5: Perform Pivot Table Operations in Excel for a given dataset?
MID EXAMINATIONS-I RESULTS
S.NO ROLL NO NAME AI ATCD R&A DWBI CN MARKS GAINED TOTAL MARKS AVG
1 22N31A6701 Sai 28 27 26 29 30 140 150 28
2 22N31A6702 Teja 28 29 28 30 27 142 150 28.4
3 22N31A6703 Raj 27 26 28 28 29 138 150 27.6
4 22N31A6704 Rishi 29 28 29 30 27 143 150 28.6
5 22N31A6705 Yesh 30 29 28 29 27 143 150 28.6
6 22N31A6706 Charan 26 28 27 29 28 138 150 27.6
7 22N31A6707 Harsha 30 28 28 29 26 141 150 28.2
8 22N31A6708 Alok 27 26 27 28 26 134 150 26.8
9 22N31A6709 Asad 30 29 28 29 28 144 150 28.8
10 22N31A6710 Saketh 29 28 29 27 26 139 150 27.8
11 22N31A6711 Chethan 29 28 28 27 29 141 150 28.2
12 22N31A6712 Nuthan 30 29 29 28 30 146 150 29.2
13 22N31A6713 Prashanth 28 27 27 26 27 135 150 27
14 22N31A6714 Shashank 27 29 29 27 28 140 150 28
Table-1

Pivot Table allows users to extract useful insights from data by transforming it into a
structured format, enabling quick summarization and comparison of data.
Click insert in the dashboard and click the pivot table option.
Sum of Sum of Sum of Sum of Sum of
Names AI ATCD R&A DWBI CN
Alok 27 26 27 28 26
Asad 30 29 28 29 28
Charan 26 28 27 29 28
Chethan 29 28 28 27 29
Harsha 30 28 28 29 26
Nuthan 30 29 29 28 30
Prashanth 28 27 27 26 27
Raj 27 26 28 28 29
Rishi 29 28 29 30 27
Sai 28 27 26 29 30
Saketh 29 28 29 27 26
Shashank 27 29 29 27 28
Teja 28 29 28 30 27
Yesh 30 29 28 29 27
Grand Total 398 391 391 396 388

9
Program-6 : Perform ANOVA Operations in Excel for a given dataset?

Group 1 Group 2 Group 3


7 3 7
5 8 4
8 5 6

10
Anova: Single
Factor

SUMMARY
Groups Count Sum Average Variance
7 2 13 6.5 4.5
3 2 13 6.5 4.5
7 2 10 5 2

ANOVA
Source of
Variation SS df MS F P-value
Between Groups 3 2 1.5 0.409090909 0.696461276
Within Groups 11 3 3.666666667

Anova: Single
Factor

Anova: Two-
Factor Without
Replication

SUMMARY Count Sum Average Variance


Row 1 3 17 5.666666667 5.333333333
Row 2 3 17 5.666666667 4.333333333
Row 3 3 19 6.333333333 2.333333333

Column 1 3 20 6.666666667 2.333333333


Column 2 3 16 5.333333333 6.333333333
Column 3 3 17 5.666666667 2.333333333

ANOVA
Source of
Variation SS df MS F P-value
Rows 0.888888889 2 0.444444444 0.084210526 0.920824406
Columns 2.888888889 2 1.444444444 0.273684211 0.773748285
Error 21.11111111 4 5.277777778

Total 24.88888889 8

11
Program-7: Execute the below programs using Python (Jupyter Notebook/Anaconda)?
i) How to create a dataset using python

ii) How to read and display a dataset (in Excel or CSV) using python

12
iii) Consider a given dataset and perform the following operation:
a) How to display the top 5 records:

b) How to display the last 5 records

c) How to display the specific records

d) How to display only the attributes of a dataset

13
e) How to display the Blank or Null values

f) How to rename the column name of a dataset

14
g) How to display the records based on condition.

15
h) How to drop or delete an attribute or column in a dataset

Program -8: How to perform the Descriptive Statistics Analysis and display the summary
information for a given dataset

16
Program - 9: Perform the basic plotting’s (Stack, Bar, Scatter, pie, boxplot, Bubble Plot etc)
with and without legend function for a given dataset.
a)Barplot:
With legend

17
Without legend

1
b) Stack plot :
With legend

19
Without legend

20
c)Scatter plot:
With legend

Without legend

21
d) Pie Chart:
With legend

22
Without legend

23
f) Box plot:
With legend

24
Without legend

25
g) Bubble Plot:
With legend

26
Without legend

Program-10: Python Program to print Waffle Charts with various categories.

27
Program-11: Python Program to print Wordcloud for a given text.

2
Program-12&13: Microsoft Power BI visualizations.
Loading and transforming the dataset.

29
Let us see some visuals based on a dataset we have loaded into Power BI.

Here, in the data section we will select(we can drag and drop) the attributes.

In the visualizations section we get to select the various way to visualize the
data(Tree map,bar chart,line chart, pie chart, doughnut,etc) and in this same
section we can format the visuals as well(legend, labels, colours,data format,etc).

30
1) Sum of sales by state:

2) Sum of sales by ear and Quarter:

3) Sum of sales by category:

31
4) Sum of sales by shipping mode using a matrix:

We can even transform the data in between by adding new attributes. Let us see it.
Click the transform data in the home section.
In the add column section we can insert custom columns with a formula appropriate as
shown in the below figure.

Here, we want a add a column with Delay days(difference between ordered date and
shipped date) so, we can apply the difference inside of the table.transformcolumntypes
method.
This results in a new column of Delay days in the original dataset as shown in the below
figure.

32
We save this data and load back into the Data section.
Let us apply the visualizations of the newly added column of Delay days.

Count of Order ID by delay days:

In the above clustered column chart we count the order id by the number of delay days .

The next page contains some more visualizations.

33
34
35

You might also like