0% found this document useful (0 votes)
87 views2 pages

Dsbda May2022

The document discusses topics related to data science and big data analytics including sources of big data, data analytics architecture, data preprocessing steps, machine learning algorithms like linear regression and logistic regression, clustering, time series analysis, TF-IDF, and data visualization tools.

Uploaded by

cryptoshubz1
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
87 views2 pages

Dsbda May2022

The document discusses topics related to data science and big data analytics including sources of big data, data analytics architecture, data preprocessing steps, machine learning algorithms like linear regression and logistic regression, clustering, time series analysis, TF-IDF, and data visualization tools.

Uploaded by

cryptoshubz1
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Total No. of Questions : 8] SEAT No.

8
23
P812 [5870] - 1133
[Total No. of Pages : 2

ic-
T.E. (Computer Engineering)

tat
7s
DATA SCIENCE AND BIG DATA ANALYTICS

6:5
(2019 Pattern) (Semester - II) (310251)

02 91
8:3
Time : 2½ Hours] [Max. Marks : 70

0
20
Instructions to the candidates:
9/0 13
1) Answer Q.1 or Q.2, Q.3 or Q.4, Q.5 or Q.6, Q.7 or Q.8.
0
1) Neat diagrams must be drawn whenver necessary.
6/2
.23 GP

2) Figures to the right side indicate full marks.


3) Use of logarithmic tables slide rule, mollier charts, electronic pocket calculator
E

and steam tables is allowed.


82

8
C

23
4) Assume suistable data, if necessary.

ic-
16

Q1) a) What is driving data deluge? Explain with one example. [9]

tat
8.2

7s
b) What is data science? Differentiate between Business Intelligence and
.24

6:5
Data Science. [9]
91
49

8:3
30

OR
20
01
02

Q2) a) What are the sources of Big Data. Explain model building phase with
6/2

example. [9]
GP
9/0

b) Explain big data analytics architecture with diagram. What is data


CE
82

8
discovery phase. Explain with example. [9]

23
.23

ic-
16

tat
8.2

7s

Q3) a) Explain various data pre-processing steps. Discuss essential python


.24

6:5

libraries for preprocessing. [8]


91
49

8:3

b) What are association rules? Explain Apriori Algorithm in brief. [9]


30
20
01
02

OR
6/2
GP

Q4) a) Explain the following


9/0
CE
82

i) Linear Regression
.23

ii) Logistic Regression [8]


16
8.2

b) Explain scikit-learn library for matplotlib with example. [9]


.24

[5870] - 1133 1 P.T.O.


49
Q5) a) Write short note on

8
23
i) Time series Analysis

ic-
tat
ii) TF - IDF. [9]

7s
6:5
02 91
b) What is clustering? With suitable example explain the steps involved in

8:3
k - means algorithm. [9]

0
20
9/0 13
OR
0
6/2
.23 GP

Q6) a) Write short note on


E
82

8
i) Confusion matrix
C

23
ic-
ii) AVC - ROC curve [9]
16

tat
8.2

7s
b) Discuss Holdout method and Random Sub Sampling methods. [9]
.24

6:5
91
49

8:3
30

Q7) a) With a suitable example explain Histogram and explain its usages. [8]
20
01
02

b) Describe the Data visualization tool “Tableau”. Explain its applications


6/2

in brief. [9]
GP
9/0

OR
CE
82

8
23
Q8) a) With a suitable example explain and draw a Box plot and explain its
.23

usages. [8] ic-


16

tat
8.2

7s

b) Describe the challenges of data visualization. Draw box plot and explain
.24

6:5

its usages. [9]


91
49

8:3
30
20


01
02
6/2
GP
9/0
CE
82
.23
16
8.2
.24

[5870] - 1133 2
49

You might also like