0% found this document useful (0 votes)
33 views2 pages

Be Computer Engineering Ai, DS, ML Semester 6 2023 December Data Analytics and Visualization Rev 2019 C Scheme

Uploaded by

Sakshi Gholap
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
33 views2 pages

Be Computer Engineering Ai, DS, ML Semester 6 2023 December Data Analytics and Visualization Rev 2019 C Scheme

Uploaded by

Sakshi Gholap
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

5

1
F
E2

A8

46
71
Paper / Subject Code: 37471 / Data Analytics and Visualization

6
26

5E

19
2C

F1
94

A8

46
71
6E
1T01876 - T.E. Computer Science and Engineering (Artificial Intelligence and Machine Learning) (Choice

96
68

5E
2C

F1
42

61
A4
Based) (R-19-20 'C' Scheme)SEMESTER - VI / 37471 - Data Analytics and Visualization

8
71
89

E4
18

26
QP CODE: 10039481 DATE: 11/12/2023

C
6

F1

5
4
87

94

8
A

1
E

A
7
8B

68
8

2C

F1
1

2
4
7

4
4C
(3 Hours) (Total Marks: 80)

A8
A

71
8

6E
B

8
8
96

2C
46

F1
1

42
8

7
4C
61

71
B8

6E
N.B.: 1. Question No. 1 is compulsory.

68
E4

18
6

2C
42
8
19

4
87
C
2. Answer any three out of the remaining questions.

85

6E
46

8
1A

8
6
3. Assume suitable data if necessary.

6
E

42
C8
9

4
7
5

1
1F

8A
8

89

6E
8

6
4. Figures to the right indicate full marks.

64

8B
A

4
C7

46
E

71

42
19
1

C
5
1F
E2

8A
B8

89
8

46

4
1A

6
C7
Q1. Attempt the following (any 4): (20)
26

6
5E

71
C8
19

4
F
94

8A
B8
8

6
a. Why is data analytics lifecycle essential?

64
1
E

4
C7
68

71
C8
9
1
2
b. The regression lines of a sample are and .

1
A4

1F
4

8A
B8
A8

64
9

E
Find (i) sample means ̅ and ̅.

4
C7
8
18

26

5E

71
C8
19
6

1
A4

1F
7

94

B8
8

46
(ii) coefficient of correlation between and

64
B8

A
C7
8
18

26

5E

C8
19
46

F1
C8

c. Differentiate between linear regression and logistic regression.


7

E2

A8

6
A

64
71
B8

9
64

4
8
d. What is Pandas? State and explain key features of Pandas.
8

26

5E

19
2C
6

F1
1
C8
19

4
7

e. Explain term frequency (TF), document frequency (DF), and inverse document 94

A8

46
A

1
8

E
6

64

C7
B

68
E4

18

26

5E
F1
8

frequency (IDF).
19

4
87

94

2
C
85

A8
A

71
6E
6

8B

68
1A

E4

18
6

2C

F1
42
9

A4
7
4C
85

1
1F

Q2. Attempt the following:

71
8

89

6E
6

8B
1A

E4

18
6
C7

2C
a. Explain the data analytics lifecycle. (10) 6

42
9

4
87
4C
85

1
1F
E2

89

6E
6

b. Find two lines of regression from the following data: (10)


B
1A

18
96
C7
26

46
E

42
C8

87
85

61
1F

Age of husband
94

E2

8A

89
4

25 22 28 26 35 20 22 40 20 18
8B
A

E4

96
C7
68

26

46
( )
71
1

4C
5

1
A4

1F
94

E2

8A
B8
8

Age of wife ( 18 15 20 17 22 14 16 21 15 14
A

E4

96
C7
68
18

26

71
8
1

C
5

61
A4

1F
87

Estimate (i) the age of husband when the age of wife is 19 and (ii) the age of wife when
94

E2

B8
8

4
A

96
7
8B

68
18

26

5E

the age of the husband is 30.


C8
C

F1

61
A4
87

94

2
4C

64
1
E

1A

4
C7
B

68
8

6
96

5E

19
71

2
C8

1F

Attempt the following:


94

E2

A8

46
8A
8
64

C7
8B

68

26

a. Explain Box-Jenkins intervention analysis. (10)


5E
F1
71
9

94

2
4C
61

A8
8A

71
8

6E

b. What is text mining? Enlist and explain the seven practice areas of text analytics. (10)
8B

68
E4

2C

F1
71

42
19

4
4C
5

71
8

E
A8

46

8B

68
8

26
6

Q4. Attempt the following:


2C
5E

1
19

A4
7

94
4C

a. Explain different types of data visualizations in R programming language. (10)


6E
A8

46

8B

68
18
6
5E

42
19
1

b. Fit a regression equation to estimate , , and to the following data of a transport


4
87
4C
1F

8A

89
A8

46

company on the weights of 6 shipments, the distances they were moved and the damage
6
7

46
5E

71
C8
19
2C

1
1F

8A

of the goods that was incurred. (10)


8
A8

46

64
6E

8B
C7

5E

71
9
1

Weight (1000 4.0 3.0 1.6 1.2 3.4 4.8


4C
1
1F
E2

B8
A8

46

kg)
96
C7
6

5E

C8
1
42

61
1F
E2

Distance (100 1.5 2.2 1.0 2.0 0.8 1.6


A8

64
89

4
C7
6

5E

19
46

1
42

km)
1F
E2

A8

46
8A

89

Damage (Rs.) 160 112 69 90 123 186


C7
6

5E
46

1
42

1F
E2

A8
A

89

C7
18

6
46

F1
42

Estimate the damage when a shipment of 3700 kg is moved to a distance of 260 km.
87

E2
A

71
89
8B

39481 Page 1 of 2
18

2C
46

42
87
4C

89

6E
8B

18

46

42
87
4C

89
8B

4C8B8718A4689426E2C71F1A85E46196
18

46
4C 8A 6E A8 96
46 2C 5 4C
8B 89 7 E4 8B
87 42 1F 61 87
18 6 1 9 18
4C A 46 E2 A8
5E
64
C8 A4
8B 89 C7 46 B 68
87 42 1F 87 94
18 6 1 19
6 18 26
A 46 E2 A8
5E 4C A4 E2
8B 89 C7 46 8B 68 C7
87 42 1F
1 19 8 71 94 1F
18
A 6 6 8 26 1A

39481
E2 A8 4C A4 E2
46 C7 5E 8B 85
89 46 68 C7 E4
42 1F
1 19 8 71 94 1F 6 19
18
A 6E2 A8 6 4C 8A 26
E2
1A 64
46
89 C7 5E 8B 4 68 C7 85
E4 C8
42 1F 46 8 94 1F 6 B8
6 1 19 71 19 7

regression.
6 26

SD
E2 A8 4C 8A 1A 64 18
4 E2 85
Mean
46 C7 5E 8B 68
A4 C8

c. Regression plot
89 46 C7 68 E4
42 1F
1 1 7 8 1F94 1 94 6 B8
7
6E 9 64 1 8 26 1 A 9 6 18 26
A E

Coefficient
A8

Correlation
4

a. Time series analysis


2C 5E C8 4 2 C 85 C A4 E2
Q5. Attempt the following:

71 4 B 68 7 E 4 8 B 6 8 C7
61 87 94 6 8 9
of
F1 1F 1 7 4 1F
1

b. Exploratory data analysis


A8 96 18 26 A 9 6 1 8 2 6 1A
4C A4 E2 8 5 4C A E 2

Q6. Write short notes on (any 2):


5E 8B 68 C7 E4 8B 46 85
46 8 9 1 8 C7 E4 0.52
36.8

19 61 87 94 1F 61
508.4

71 42 F1 1

d. Generalized linear model (GLM)


64 8A 6E A 8
96 18
A
26
E A 96
C8 4 2 5 4C 4 2 8 5 4C
B8 68 C7 E4 8B 6 8 C7 E 8B
94 1F 6 1 8 9 1 46
Yield in Kg.

71 26 1A 96 7 18 4 26 F 1A 19 87
8A E 8 4 A E 8 6 18
46 2C 5E C8 4 2 5 E
4C
8
A4
89 71 4 B 68 C7 4 6 B 68
61 8

Page 2 of 2
87 94 1F 1

*******
42 F1 1 7 94
6E A8 96 18 26 A 96 1 8 26
2C 5E 4C A4 E2 8 5E 4 C A 4 E2
71 4 61
8B 68 C7 46 8 B8 689
F1 87 94 1F 7
96 18 26 1 19 1 42

4C8B8718A4689426E2C71F1A85E46196
A8 4 A E 64 8 A 6E
4.6

C A8
4 5 4
26.7

5E 8 6 2C E C8 2C
46 B8 89 71 46 B 87
68
94 71
19 71 42 F1
A
19 F
64 8A 6E 8 6 4
18
A
26
E
C8 2 C
when the rainfall is 29 cm and the rainfall when the yield is 600 kg.

46 C7 5E 8B 46 2C
B8 89 1 4 6 8 8 9 71
Rainfall in cm

71 42 F1 1 9 7 1 42 F1
8A 6E A8 64 8 A 6 E
Paper / Subject Code: 37471 / Data Analytics and Visualization

A8
46 2C 5E C8 4 68 2 C 5
89
b. What is stepwise regression? State and explain different types of stepwise

71 46 B8 9 71
42 F1 19 71 42 F1
6E A8 64 8A 6E A8
2C 5E C8 4 6 2C 5E
71 4 6 B8 8 9 7 1 46
a. From the following results, obtain two regression equations and estimate the yield

F1 19 71 42 F1 1
A8 64 8A 6E A 8
(10)

5E C8 2C
(10)

(20)

46 5E
46 B8 89 71 46
19 71 42 F1 19
64 8A 6E A 8 6
C8 4 6 2C 5 E4
B8 89 71 61
71 42 F1 96
8A 6E A8

You might also like