Data Warehousing and Mining (Notes)
Data Warehousing and Mining (Notes)
com
http://www.rtmnuonline.com
B.E. (Computer
B.E. (Computer Science
Science &
& Engineering)
Engineering) Seventh
Seventh Semester
Semester (C.B.S.)
(C.B.S.)
Data Warehousing
Data Warehousing &
& Mining
Mining
P. Pages
P. Pages :: 2
2 NJR/KS/18/4626
NJR/KS/18/4626
*0093*
Time :: Three
Time Three Hours
Hours INA ll Max. Marks
Max. Marks :: 80
80
_____________________________________________________________________
Notes: 1.
Notes : 1. All questions
All questions carry
carry marks
marks as
as indicated.
indicated.
2.
2. Solve Question
Solve Question 11 OR
OR Questions
Questions No. 2.
No. 2.
3.
3. Solve Question
Solve Question 33 OR
OR Questions
Questions No. 4.
No. 4.
4.
4. Solve Question
Solve Question 55 OR
OR Questions
Questions No.
No. 6.6.
5.
5. Solve Question
Solve Question 77 OR
OR Questions
Questions No.
No. 8.8.
6.
6. Solve Question 9 OR Questions No. 10.
Solve Question 9 OR Questions No. 10.
7.
7. Solve Question
Solve Question 1111 OR
OR Questions
Questions No.
No. 12.12.
8.
8. Due credit will be given to neatness and adequate
Due credit will be given to neatness and adequate dimensions.
dimensions.
9. |
9. Assume suitable
Assume suitable data
data whenever
whenever necessary.
necessary.
10.
10. Illustrate your
Illustrate your answers
answers whenever
whenever necessary
necessary with with the
the help
help of
of neat
neat sketches.
sketches.
1.
1. a)
a) Define Data
Define Data mining.
mining. What
What are
are the
the steps
steps involved
involved in
in KDD
KDD process?
process? 88
b) | Write
b) Write aa short
short note
note on:-
on:- 66
i)
i) Classification of
Classification of Data
Data mining.
mining.
11) Data
ii) Data mining
mining Task
Task Primitive
Primitive
OR
OR
2.
2. a) — Describe
a) Describe the
the typical
typical architecture
architecture of
of Data
Data mining
mining system?
system? 66
b) | Why
b) Why preprocessing
preprocessing is
is necessary
necessary in
in Data
Data mining.
mining. Explain
Explain various
various preprocessing technique
preprocessing technique 88
in brief.
in brief.
3.
3. a) | Define
a) Define Data
Data warehousing.
warehousing. State
State its
its characteristics
characteristics features.
features. 33
b)
b) Differentiate between
Differentiate between OLTP
OLTP and
and OLAP.
OLAP. 66
c)
c) List different
List different DW
DW schemas.
schemas. Explain
Explain in
in brief
brief STAR
STAR SCHEMA.
SCHEMA. 44
OR
OR
4.
4. a)
a) Discuss the
Discuss the different
different types
types of
of OLAP
OLAP servers.
servers. 88
b)
b) Write aa short
Write short note
note on
on any
any one.
one. 55
i)
i) Cube materialization.
Cube materialization. ii)
ii) Attribute oriented
Attribute oriented Induction
Induction
5.
5. a) | What
a) What are
are the
the two
two necessary
necessary steps
steps of
of association
association rule
rule mining.
mining. 33
b) — Explain
b) Explain closed
closed and
and maximal
maximal frequent
frequent item
item set.
set. 33
c)
c) Write the
Write the step
step involved
involved in
in finding
finding frequent
frequent item
item set
set using
using apriori
apriori algorithm.
algorithm. 77
OR
OR
6.
6. a) _ Discuss
a) Discuss various
various kinds
kinds of
of Association
Association rules.
rules. 88
b) — Explain
b) Explain the
the concept
concept of
of constraint
constraint based
based association
association mining.
mining. 55
NJR/KS/18/4626
NJR/KS/18/4626 11 P.T.O
7. a) Differentiate between
Differentiate between classification
classification and
and prediction.
prediction. 4
b) Give the
Give the steps
steps involved
involved in
in decision
decision tree
tree algorithm.
algorithm. States
States its
its advantages
advantages and
and 6
disadvantages.
disadvantages.
c) Write in
Write in brief
brief about
about attribute
attribute selection
selection measure.
measure. 3
OR
OR
8. a) Briefly explain
Briefly explain regression
regression methods
methods used
used in
in prediction.
prediction. 4
b) Describe Naive
Describe Naive Bayesians
Bayesians classification.
classification. 6
c) Write aa short
Write short note
note on:-
on:- 3
i)
i) Bagging
Bagging ii)
ii) Boosting
Boosting
9. a) What is
What is clustering?
clustering? How
How it
it differs
differs from
from classification?
classification? Also
Also give
give its
its application
application area.
area. 4
b) Briefly explain
Briefly explain with
with example,
example, how
how dissimilarity
dissimilarity between
between object
object can
can be
be computed
computed inin the
the 9
following data
following data types:-
types:-
i)
i) Interval-scaled variable
Interval-scaled variable ii) Binary
ii) Binary variable
variable iii) | Categorical
iii) Categorical variable.
variable.
OR
OR
10.
10. Write aa short
Write short note
note on:-
on:-
a) K-means partitioned
K-means partitioned method
method 4
b) Agglomerative and
Agglomerative and decisive
decisive hierarchical
hierarchical clustering.
clustering. 2
c) Outlier detection
Outlier detection 3
d) DBCAN clustering
DBCAN clustering 4
11.
11. a) Explain in
Explain in brief
brief the
the complex
complex data
data type.
type. 3
1)
i) Data stream
Data stream 11)
ii) Time series
Time series data
data 111)
iii) Sequence data
Sequence data
b) Describe Trend
Describe Trend analysis
analysis w.
w. r.
r. t.
t. time
time series
series data.
data. 6
c) Explain the
Explain the concept
concept of
of sequence
sequence pattern
pattern in
in detail.
detail. 5
OR
OR
12.
12. Write short
Write short notes
notes on
on :: any
any three.
three.
i)
i) Methodology of
Methodology of Data
Data streaming.
streaming. 5
WN
ii) Graph 5
ili)
iii) Social Networking
Social Networking 4
iv)
iv) Task and
Task and challenges
challenges in
in link
link mining.
mining. 4
ah
KREREEK
*******
NJR/KS/18/4626
NJR/KS/18/4626 22
rtmnuonline.com
rtmnuonline.com
B.E. (Computer
B.E. (Computer Science
Science &
& Engineering)
Engineering) Seventh
Seventh Semester
Semester (C.B.S.)
(C.B.S.)
Data Warehousing
Data Warehousing & & Data
Data Mining
Mining
P. Pages
P. Pages: : 22 NRJ/KW/17/4626
NRJ/KW/17/4626
Time :: Three
Time Three Hours
Hours *0934*
MTA ll Max. Marks
Max. Marks :: 80
80
_____________________________________________________________________
Notes: 1.
Notes : 1. All questions carry
All questions carry marks
marks as
as indicated.
indicated.
2.
2. Solve Question
Solve Question 11 OROR Questions
Questions No. 2.
No. 2.
3.
3. Solve Question
Solve Question 33 OROR Questions
Questions No. 4.
No. 4.
4.
4. Solve Question
Solve Question 55 OROR Questions
Questions No. 6.
No. 6.
5.
5. Solve Question
Solve Question 77 OROR Questions
Questions No. 8.
No. 8.
6.
6. Solve Question
Solve Question 9 9 OR
OR Questions
Questions No. 10.
No. 10.
7.
7. Solve Question
Solve Question 11 11 OR
OR Questions
Questions No. 12.
No. 12.
8.
8. Assume suitable
Assume suitable data
data whenever
whenever necessary.
necessary.
9.
9. Illustrate your
Illustrate your answers
answers whenever
whenever necessary
necessary with
with the
the help
help of
of neat
neat sketches.
sketches.
1.
1. a)
a) Explain KDD
Explain KDD in
in detail
detail with
with neat
neat diagram.
diagram. 88
b)
b) Explain Data
Explain Data integration
integration &
& Transformation
Transformation in
in Data
Data mining.
mining. 66
OR
OR
2.
2. a)
a) Discuss major
Discuss major issues
issues in
in Data
Data mining.
mining. 77
b) — Explain
b) Explain applications
applications of
of data
data mining
mining in
in detail.
detail. 77
3.
3. a)
a) Define data
Define data warehouse.
warehouse. Explain
Explain an
an architecture
architecture of
of Data
Data warehouse
warehouse with
with suitable
suitable diagram.
diagram. 77
b)
b) Briefly explain
Briefly explain the
the OLAP
OLAP guidelines
guidelines suggested
suggested by
by Dr.
Dr. Codd.
Codd. 66
OR
OR
4.
4. a)
a) Differentiate between
Differentiate between OLTP
OLTP and
and OLAP.
OLAP. 77
b) — Explain
b) Explain in
in detail
detail life
life cycle
cycle of
of Data
Data warehouse.
warehouse. 66
5.
5. a) | What
a) What do
do you
you mean
mean by
by mining
mining frequent
frequent patterns,
patterns, Association
Association &
& correlation
correlation with
with an
an 66
example.
example.
b) — Explain
b) Explain constraint-
constraint- Based
Based mining
mining with suitable example.
with suitable example. 77
OR
OR
6.
6. a)
a) What is
What is correlation
correlation Analysis?
Analysis? 44
b)
b) Define the
Define the following
following terms.
terms. 99
1)
i) Association mining.
Association mining.
11)
ii) Frequent item
Frequent item sets.
sets.
111) Closed
iii) Closed item
item sets.
sets.
NRJ/KW/17/4626
NRJ/KW/17/4626 11 P.T.O
www.rtmnuonline.com
rtmnuonline.com
rtmnuonline.com
7. :))
a) Explain support
Explain support vector
vector machine
machine with
with suitable
suitable diagram.
diagram. 6
b)
b) Discuss different
Discuss different issues
issues related
related to
to classification
classification &
& prediction.
prediction. 7
OR
OR
8.
8. a)
a) Explain classification
Explain classification by
by Decision
Decision Tree
Tree Induction
Induction with
with an
an example.
example. 7
b)
b) Write short
Write short note on Back
note on Back propagation.
propagation. 6
9.
9. a)
a) What is
What is clustering?
clustering? Why
Why it
it is
is required.
required. 3
b)
b) Differentiate between
Differentiate between K-means
K-means and
and K-medoids.
K-medoids. 6
oy)
c) What is
What is outlier?
outlier? Why
Why outlier
outlier mining
mining is
is important.
important. 5
OR So
om
OR
e.c
of
10.
10. a)
a) Explain any
Explain any two
two clustering
clustering methods
methods with
with their
their types
types in
in detail.
detail. SS 14
14
lin
oO S
on
11.
11. a)
a) Explain the
Explain the techniques
techniques for
for mining
mining time-
time- series
series data.
data. > 7
nu
rtm
b)
b) Define the
Define the following
following terms.
terms. A’ ¥ 6
< w.
ww
i)
i) Data stream.
Data stream.
ii)
ii) Time series
Time series Data.
Data.
111) Sequence
iii) Sequence Data.
Data.
S
om
cS OR
OR
e.c
12. a) Write 13
s
on
nu
i)
i) Graph mining.
Graph mining. YS
s&s
rtm
ii)
ii) Link mining.
Link mining. <\’
w.
aN
ww
iii) Social
iii) Social Network Analysis.
Network Analysis.
iv) Multi
iv) Multi relational
relational Data
Data Mining.
Mining.
KRREEKRKEKRKREEEEE
************
NRJ/IKW/17/4626
NRJ/KW/17/4626 22
www.rtmnuonline.com
B.E. Seventh
B.E. Seventh Semester
Semester (Computer
(Computer Science
Science &
& Engineering)
Engineering) (C.B.S.)
(C.B.S.)
Data Warehousing
Data Warehousing & & Mining
Mining
P. Pages
P. Pages: : 2
2 NKT/KS/17/7487
NKT/KS/17/7487
Time :: Three
Time Three Hours
Hours *0113*
MAA ll Max. Marks
Max. Marks :: 80
80
_____________________________________________________________________
Notes: 1
Notes : 1. All questions
All questions carry
carry marks
marks as
as indicated.
indicated.
2.
2. Solve Question
Solve Question 11 OR
OR Questions
Questions No. 2.
No. 2.
3
3. Solve Question
Solve Question 33 OR
OR Questions
Questions No. 4.
No. 4.
4.
4. Solve Question
Solve Question 55 OR
OR Questions
Questions No. 6.
No. 6.
5.
5. Solve Question
Solve Question 77 OR
OR Questions
Questions No. 8.
No. 8.
6
6. Solve Question
Solve Question 99 OR
OR Questions
Questions No. 10.
No. 10.
7
7. Solve Question
Solve Question 1111 OR
OR Questions
Questions No.
No. 12.12.
8.
8. Illustrate your
Illustrate your answers
answers whenever
whenever necessary
necessary with the help
with the help of
of neat
neat sketches.
sketches.
1.
1. a)
a) Explain various
Explain various data
data mining
mining functionalities
functionalities along
along with
with examples.
examples. 88
b) | What
b) What is
is the
the need
need of
of Data
Data Preprocessing?
Preprocessing? Explain
Explain steps
steps involved
involved in
in data
data prepressing.
prepressing. 66
OR
OR
2.
2. a) — Discuss
a) Discuss Major
Major issues
issues in
in Data
Data Mining.
Mining. 77
b)
b) Give classification
Give classification of
of data
data mining
mining system
system and
and also
also explain
explain concept
concept hierarchy
hierarchy generation.
generation. 77
3.
3. a) — Explain
a) Explain the
the need
need of
of Multidimensional
Multidimensional Data
Data Model.
Model. 33
b)
b) List and
List and explain
explain various
various OLAP
OLAP operations.
operations. 88
c)
c) Differentiate between
Differentiate between OLAP
OLAP and
and OLTP.
OLTP. 33
OR
OR
4.
4. a)
a) Explain three
Explain three tier
tier architecture
architecture of
of data
data warehouse
warehouse with
with neat
neat sketch.
sketch. 88
b) — Discuss
b) Discuss the
the architecture
architecture of
of ROLAP
ROLAP and
and MOLAP
MOLAP in
in detail
detail with
with the
the help
help of
of suitable
suitable diagram.
diagram. 66
5.
5. a) — Explain
a) Explain constraint-based
constraint-based Association
Association mining
mining with
with example.
example. 77
b)
b) Associate rule
Associate rule mining
mining often
often generate
generate large
large number
number of
of rules.
rules. Discuss
Discuss effective
effective methods
methods that
that 66
can be
can be used to reduce
used to reduce the
the number
number of
of rules
rules generated
generated while
while still
still preserving most of
preserving most of the
the
interesting rules.
interesting rules.
OR
OR
NKT/KS/17/7487
NKT/KS/17/7487 11 P.T.O
6. a) Define following
Define following terms:
terms: 6
1)
i) Frequent Item
Frequent Item Sets.
Sets.
ii)
ii) Closed Item
Closed Item sets.
sets.
ili) Association
iii) Association rules.
rules.
b) Explain in
Explain in brief Market-Basket analysis
brief Market-Basket analysis using
using example.
example. 7
7. a) Why is
Why is Bayesian
Bayesian classification
classification called
called naive?
naive? Briefly
Briefly outline
outline the
the major
major ideas
ideas of
of naive
naive 7
Bayesian classification.
Bayesian classification.
b) How the
How the accuracy
accuracy of
of aa classifier
classifier or
or aa predictor is evaluated?
predictor is evaluated? Explain.
Explain. 6
OR
OR
8. a) What are
What are the
the different
different issues
issues regarding
regarding classification
classification and
and prediction.
prediction. 7
b) Explain classification
Explain classification by
by Decision
Decision Tree
Tree Induction
Induction with
with example.
example. 6
9. a) Give classification
Give classification ofof clustering
clustering algo's
algo's and
and explain
explain partition
partition based
based clustering
clustering algorithm
algorithm 8
namely k-means
namely k-means stating
stating its
its merits,
merits, demerits
demerits and
and application
application area.
area.
b) Write aa short
Write short note
note on
on SVM
SVM (Support
(Support Vector
Vector M/C).
M/C). 5
OR
OR
10.
10. a) Explain various
Explain various requirements
requirements of
of clustering.
clustering. 4
b) Compare agglomerative
Compare agglomerative and
and divisive
divisive hierarchical
hierarchical clustering
clustering methods.
methods. 6
c) What is
What is outlier?
outlier? Why
Why outlier
outlier analysis
analysis is
is important?
important? 3
11.
11. a) Describe the
Describe the process
process of
of mining
mining Time-series
Time-series data
data with
with suitable
suitable example.
example. 7
b) Define following
Define following terms:
terms: 6
1)
i) Data streams.
Data streams.
11)
ii) Time series
Time series data.
data.
111) Sequence
iii) Sequence Data.
Data.
OR
OR
12.
12. Write short
Write short notes
notes on
on ::
1)
i) Graph Mining.
Graph Mining. 4
11)
ii) Network analysis and
Network analysis and Multi
Multi relational
relational Data
Data Mining.
Mining. 5
111) Mining
iii) Mining sequence
sequence pattern in Biological
pattern in Biological Data.
Data. 4
iv) Mining
iv) Mining Data
Data streams.
streams. 4
KRREEKEKEKERERKEER
************
NKT/KS/17/7487
NKT/KS/17/7487 2
B.E. (Computer
B.E. (Computer Science
Science &
& Engineering)
Engineering) Seventh
Seventh Semester
Semester (C.B.S.)
(C.B.S.)
Data Warehousing
Data Warehousing & & Mining
Mining
hehe Hou
P. Pages
P. Pages: : 2
2
Time : Three Hours *0885* wu Max. Marks : 80
_____________________________________________________________________
TKN/KS/16/7574
TKN/KS/16/7574
Nn Matsa
Notes: 1
Notes : 1. All questions
All questions carry
carry marks
marks as
as indicated.
indicated.
2.
2. Solve Question
Solve Question 11 OROR Questions
Questions No. 2.
No. 2.
3
3. Solve Question
Solve Question 33 OROR Questions
Questions No. 4.
No. 4.
4.
4. Solve Question
Solve Question 55 OROR Questions
Questions No.
No. 6.6.
5.
5. Solve Question
Solve Question 77 OROR Questions
Questions No.
No. 8.8.
6.
6. Solve Question
Solve Question 9 9 OR
OR Questions
Questions No. 10.
No. 10.
7
7. Solve Question
Solve Question 11 11 OR
OR Questions
Questions No.
No. 12.12.
8.
8. Due credit will be given to neatness and adequate
Due credit will be given to neatness and adequate dimensions.
dimensions.
9.
9. Assume suitable
Assume suitable data
data whenever
whenever necessary.
necessary.
10.
10. Illustrate your
Illustrate your answers
answers wherever
wherever necessary
necessary with with the
the help
help of
of neat
neat sketches.
sketches.
11.
11. Use of
Use of non-programmable
non-programmable calculator
calculator isis permitted.
permitted.
1.
1. a)
a) What are
What are the
the different
different data
data Mining
Mining Functionalities?
Functionalities? 77
b)
b) Discuss the
Discuss the Major
Major issues
issues in
in Data
Data mining.
mining. 77
OR
OR
2.
2. a)
a) Give the
Give the classification
classification of
of data
data mining
mining system.
system. Explain
Explain in
in detail.
detail. 77
b)
b) Explain the
Explain the different
different techniques
techniques for
for data
data reduction.
reduction. 77
3.
3. a)
a) What is
What is data
data warehouse?
warehouse? Explain
Explain architecture
architecture of
of data
data warehouse.
warehouse. 77
b) | Enumerate
b) Enumerate three
three classes
classes of
of Schemas
Schemas that
that are
are popularly
popularly used for modeling
used for modeling data
data Warehouse.
Warehouse. 77
Write features of each Schema.
Write features of each Schema.
OR
OR
4.
4. a)
a) Write the difference between OLAP & OLTP.
Write the difference between OLAP & OLTP. 44
b) | What
b) What is
is OLAP?
OLAP? What
What are
are the
the different
different OLAP
OLAP operations
operations that
that can
can be performed on
be performed on 77
multidimensional data
multidimensional data model.
model.
c)
c) Write short
Write short note
note on
on ROLAP
ROLAP model.
model. 33
5.
5. a)
a) Consider following
Consider following transactional
transactional dataset.
dataset. find
find frequent
frequent item
item sets
sets and
and association
association rules
rules 99
using apriori algorithm.
using apriori algorithm. With
With support
support = = 30%
30% && confidence
confidence == 70%
70%
TID
TID List of
List of Items
Items IDS
IDS
T100
T100 Il, I2
I1, 12 I5
15
T200
T200 12, I4
I2, 14
T300
T300 12, I3.
I2, 13.
T400
T400 11, I2,
I1, 12, I4.
14.
T500
T500 Il, I3,
I1, B,
T600
T600 12, I3
I2, 13
T700
T700 1, I3
I1, 13
T800
T800 Il, I2,
I1, 12, I3,
13, I5
15
T900
T900 Il, I2,
I1, 12, I3
13
TKN/KS/16/7574
TKN/KS/16/7574 11
www.rtmnuonline.com
www.rtmnuonline.com
b)
b) Write in
Write in brief
brief about
about constraint
constraint Based
Based Association
Association Mining.
Mining. 4
OR
OR
6.
6. a)
a) Explain in
Explain in brief
brief market
market Basket
Basket Analysis.
Analysis. 5
b)
b) Define the
Define the following
following terms.
terms. 6
i)
i) Frequent Item
Frequent Item sets.
sets.
11)
ii) Closed item
Closed item sets.
sets.
iii) Association
iii) Association rules.
rules.
c)
c) What is
What is correlation
correlation Analysis?
Analysis? 2
7.
7. a)
a) What are
What are the
the different
different issues
issues regarding
regarding classification
classification and
and prediction.
prediction. 6
b) — Write
b) Write short
short note
note on.
on. 7
i)
i) Support Vector
Support Vector Machine
Machine (SVM)
(SVM)
ii)
ii) Classification by
Classification by Back
Back propagation.
propagation.
OR
OR
8.
8. a) — Explain
a) Explain classification
classification by
by Decision
Decision Tree
Tree Induction
Induction with
with example.
example. 6
b) | What
b) What are
are the
the different
different measures
measures for
for Accuracy
Accuracy and
and error
error in
in classification
classification or
or prediction.
prediction. 5
c) | What
c) What do
do you
you mean
mean by Lazy Learners?
by Lazy Learners? 2
9.
9. a)
a) How the
How the clustering
clustering methods
methods are
are categorize?
categorize? 6
b)
b) Illustrate and
Illustrate and explain
explain partitioning
partitioning method
method for
for clustering.
clustering. 7
OR
OR
10.
10. a)
a) What do
What do you
you mean
mean by hierarchical clustering
by hierarchical clustering approach?
approach? Explain
Explain agglomerative
agglomerative and
and 7
divisive hierarchical
divisive hierarchical clustering.
clustering.
b) | What
b) What is
is outlier?
outlier? Why
Why outlier
outlier analysis
analysis is
is important?
important? 3
c)
c) Write short
Write short note
note on
on ''Constraint-
"Constraint- Based
Based Cluster
Cluster Analysis.''
Analysis." 3
11.
11. a) — Explain
a) Explain the
the technique
technique for
for mining
mining time
time series
series Data?
Data? 7
b)
b) Define following
Define following terms
terms 6
1)
i) Data streams.
Data streams.
11)
ii) Time series
Time series Data
Data
111) Sequence
iii) Sequence Data
Data
OR
OR
12.
12. a)
a) Write short
Write short on
on any
any three.
three. 13
1)
i) Graph mining
Graph mining
11)
ii) Mining sequence
Mining sequence pattern in Biological
pattern in Biological Data.
Data.
111) Social
iii) Social Network Analysis.
Network Analysis.
iv) Multirelational
iv) Multirelational Data
Data mining.
mining.
v)
v) Link Mining.
Link Mining.
KREKKKKEEK
********
TKN/KS/16/7574
TKN/KS/16/7574 22
www.rtmnuonline.com
www.rtmnuonline.com
www.rtmnuonline.com
B.E.(Computer Science
B.E.(Computer Science &
& Engineering)
Engineering) Semester
Semester Seventh
Seventh (C.B.S.)
(C.B.S.)
Data Warehousing
Data Warehousing & & Mining
Mining
0 000 ve Mara. 6a
P. Pages
P. Pages: : 2
2 KNT/KW/16/7487
KNT/KW/16/7487
Time :: Three
Time Three Hours
Hours *0837* Max. Marks : 80
_____________________________________________________________________
Notes: 1.
Notes : 1. All questions
All questions carry
carry marks
marks as
as indicated.
indicated.
2.
2. Solve Question 1 OR Questions No.
Solve Question 1 OR Questions 2.
No. 2.
3.
3. Solve Question
Solve Question 33 OR
OR Questions
Questions No. 4.
No. 4.
4.
4. Solve Question
Solve Question 55 OR
OR Questions
Questions No. 6.
No. 6.
5.
5. Solve Question
Solve Question 77 OR
OR Questions
Questions No. 8.
No. 8.
6.
6. Solve Question
Solve Question 99 OR
OR Questions
Questions No. 10.
No. 10.
7.
7. Solve Question
Solve Question 1111 OR
OR Questions
Questions No.
No. 12.12.
1. a) State different
different criterion
criterion on
on which
which data
data mining
mining systems
systems can
can be categorized;and write aa note
note 88
om
1. a) State be categorized and write
on each
on each of
of them.
them.
e.c
lin
b) Explain the
the major
major issues
issues in
in Data
Data Mining.
Mining. 66
on
b) Explain
nu
m
OR
OR
.rt
w
w
2.
2. a)
a) Explain major
Explain major Tasks
Tasks in
in Data
Data preprocessing.
preprocessing. 88
w
b)
b) Write short
Write short note
note on
on Discretization
Discretization &
& concept
concept Hierarchy
Hierarchy Generation.
Generation. 66
3.
3. a)
a) Explain three
Explain three tier
tier Data
Data warehousing architecture with
warehousing architecture with diagram.
diagram. 77
b) Explain all
all OLAP
OLAP operations
operations in
in the
thesmultidimensional Data model.
model. 66
om
OR
OR
lin
on
4. a) What is
is Data
Data cube
cube computation?
computation? What
What are
are the
the efficient
efficient methods
methods for
for Data
Data cube
cube 77
nu
4. a) What
m
computation.
computation.
.rt
w
w
b)
b) Differentiate between.
Differentiate between. 66
w
i)
i) Datamart &
Datamart & Data
Data warehouse.
warehouse.
ii)
ii) OLTP &
OLTP & OLAP.
OLAP.
5.
5. a) | What
a) What is
is the
the process
process of
of generating
generating association
association rules
rules from
from frequent
frequent item
item sets?
sets? Explain
Explain with
with 77
example?
example?
b) — Explain
b) Explain various
various kinds
kinds of
of association
association rule mining.
rule mining. 66
OR
OR
6.
6. a)
a) Explain Apriori
Explain Apriori algorithm
algorithm for
for frequent
frequent Item
Item sets.
sets. 77
b) — Explain
b) constraint —– Based
Explain constraint Based association
association mining
mining in
in short.
short. 66
KNT/KW/16/7487
KNT/KW/16/7487 11 P.T.O
P.T.O
www.rtmnuonline.com
www.rtmnuonline.com
www.rtmnuonline.com
www.rtmnuonline.com
7. a) | What
a) What is
is Back
Back propagation? Explain classification
propagation? Explain classification by Back propagation
by Back propagation with
with example.
example. 8
b) — Explain
b) Explain support
support vector machine in
vector machine in short.
short. 6
OR
OR
8. a)
a) Why is
Why is Bayesian
Bayesian classification
classification called
called naïve?
naive? Briefly
Briefly outline
outline the
the major
major ideas
ideas of
of naive
naive 7
Bayesian classification.
Bayesian classification.
b) | What
b) What are
are the
the different
different issues
issues regarding
regarding classification
classification &
& prediction.
prediction. 7
9. a)
a) What is
What is clustering?
clustering? Briefly
Briefly describe
describe the
the approach
approach of
of clustering
clustering in
in partitioning
partitioning method.
method. 7
b) | What
b) What dodo you
you mean
mean byby Hierarchical
Hierarchical clustering
clustering approach?
approach? Explain
Explain agglomerative
agglomerative and
and divise
divise 6
Hierarchical clustering.
Hierarchical clustering.
OR
OR
om
e.c
10.
10. a)
a) Illustrate and
Illustrate and explain
explain Grid
Grid Based
Based clustering.
clustering. 7
lin
on
b) — Give
b) Give any
any one
one application
application to
to explain
explain clustering
clustering as
as major
major data
data mining
mining function.
function. 6
nu
m
11.
11. a) — Explain
a) Explain constraint
constraint -Based
-Based sequential
sequential pattern mining for
pattern mining for transactional
transactional databases.
databases.
.rt
6
w
w
b) — Explain
b) Explain sequence
sequence pattern mining for
pattern mining for Biological
Biological data
data‘inin short.
short. 7
w
OR
OR
12.
12. a)
a) Write short
Write short note
note on.
on.
om
1)
i) Graph mining.
Graph mining. 4
e.c
lin
ii)
ii) Social Network
Social Analysis.
Network Analysis. 4
on
nu
ii1) Mining
iii) Mining Time-series
Time-series’ Data.
Data. 5
m
.rt
**************
w
w
KNT/KW/16/7487
KNT/KW/16/7487 22
www.rtmnuonline.com
www.rtmnuonline.com
http://www.rtmnuonline.com
http://www.rtmnuonline.com
11. (a)
11. (a) Explain
Explain the
the method
method for
for mining
mining sequence
sequence patterns
patterns in
in NTK/KW/15/7574
NTK/KW/15/7574
Biological Databases.
Biological Databases. 66
(b) How
(b) How are
are we
we able
able to
to achieve
achieve Social
Social Network Analysis
Network Analysis Faculty of
Faculty of Engineering
Engineering & & Technology
Technology
and Multirelational
and Multirelational Data
Data Mining
Mining ?? 77 Seventh Semester
Seventh Semester B.E.
B.E. (C.S.E.)
(C.S.E.) (C.B.S.)
(C.B.S.) Examination
Examination
OR DATA WAREHOUSING
DATA WAREHOUSING & & MINING
MINING
OR
12.
12. (a)
(a) Write
Write short
short notes
notes on
on :: (any TWO) : :
(any TWO) Time—Three Hours]
Time—Three Hours] [Maximum Marks—80
[Maximum Marks—80
(i) Graph
(i) Graph Mining
Mining INSTRUCTIONS TO
INSTRUCTIONS TO CANDIDATES
CANDIDATES
(i)
(ii) Data Stream
Data Stream Mining
Mining (1) All
(1) All questions
questions carry
carry marks
marks as
as indicated.
indicated.
(ii) Task
(iii) Task and
and challenges
challenges in
in link
link mining.
mining. 88 (2) Solve
(2) Solve Question
Question No.
No. 11 OR
OR Question
Question No. 2
No. 2.
(b) What
(b) What dodo you
you mean
mean by
by Time
Time Series
Series and
and Sequence
Sequence (3) Solve
(3) Solve Question
Question No.
No. 3
3 OR
OR Question
Question No. 4
No. 4.
Data
Data ?? Explain
Explain with
with example.
example. 55 (4) Solve
(4) Solve Question
Question No.
No. 55 OR
OR Question
Question No. 6.
No. 6.
(5) Solve
(5) Solve Question
Question No.
No. 77 OR
OR Question
Question No. 8
No. 8.
(6) Solve
(6) Solve Question
Question No.
No. 99 OR
OR Question
Question No. 10.
No. 10.
(7) Solve
(7) Solve Question
Question No. 11 OR
No. 11 OR Question
Question No. 12.
No. 12.
(8) Due
(8) Due credit
credit will
will be
be given
given to
to neatness
neatness and
and adequate
adequate
dimensions.
dimensions.
(9) Assume
(9) Assume suitable
suitable data
data wherever
wherever necessary.
necessary.
(10) Use
(10) Use of
of non
non programmable
programmable calculator
calculator is
is permitted.
permitted.
1.
1. (a) Give
(a) Give classification
classification of
of Data
Data Mining
Mining Systems.
Systems. 55
(b) Describe
(b) Describe why
why concept
concept hierarchies
hierarchies are
are useful
useful in
in Data
Data
Mining.
Mining. 44
(c) Give
(c) Give the
the application
application of
of Data
Data Mining.
Mining. 55
OR
OR
MVM—47658
MVM—47658 44 3050
3050 MVM—47658
MVM—47658 1l Contd.
Contd.
http://www.rtmnuonline.com
http://www.rtmnuonline.com
www.rtmnuonline.com
www.rtmnuonline.com http://www.rtmnuonline.com
http://www.rtmnuonline.com
2.
2. (a) What
(a) What isis the
the need
need of
of Data
Data Preprocessing
Preprocessing ?? Also
Also explain
explain 7.
7. (a) Write
(a) Write short
short notes
notes on
on ::
different steps
different steps involved
involved inin Data
Data Preprocessing.
Preprocessing. 88
(i)
(i) SVM (Support
SVM (Support Vector
Vector Machine)
Machine)
(b) Discuss
(b) Discuss the
the major
major issues
issues in
in data
data mining.
mining. 66 (ii)
(ii) Bayesian classification.
Bayesian classification. 88
3.
3. Explain three-tier
(a) Explain
(a) three-tier architecture
architecture of
of data
data warehouse
warehouse in
in (b) How
(b) How do do you
you evaluate
evaluate the
the accuracy
accuracy of
of a a classifier
classifier in
in
detail.
detail. 77 classification ?? Explain.
classification Explain. 66
OR
OR
(b) Explain
(b) Explain various
various OLAP
OLAP operations in
operations in the
the
multidimensional Data
multidimensional Data Model.
Model. 66 8.
8. (a) Explain
(a) Explain different
different issues
issues regarding
regarding classification
classification and
and
prediction.
prediction. 77
OR
OR
(b) Differentiate
(b) Differentiate classification
classification by
by Back
Back propagation
propagation and
and
4.
4. (a) Write
(a) Write the
the difference
difference between
between OLTP
OLTP and
and OLAP.
OLAP. 6
6 classification by
classification by Decision
Decision Tree
Tree Induction.
Induction. 77
(b) Discuss
(b) Discuss possible
possible design
design approaches
approaches used
used in
in the
the design
design 9.
9. (a) Discuss
(a) Discuss typical
typical requirements
requirements of
of clustering
clustering in
in data
data mining.
mining.
process of
process of aa data
data warehouse.
warehouse. Also
Also write
write the
the general
general 55
steps in
steps in data
data warehouse
warehouse design
design process.
process. 77
(b) What
(b) What isis Clustering
Clustering ?? Briefly
Briefly describe
describe the
the approach
approach
of clustering
of clustering in
in partitioning
partitioning method.
method. 33
5.
5. (a) What
(a) What dodo you
you mean
mean byby mining
mining frequent
frequent patterns,
patterns,
associations and
associations and correlations
correlations ?? Elaborate
Elaborate by
by giving
giving (c) What
(c) What is
is Outlier
Outlier ?? Why
Why outlier
outlier mining
mining is
is important
important ??
example.
example. 66 55
(b) Write
(b) Write short
short note
note on
on Constraint-Based
Constraint-Based Association
Association OR
OR
Mining.
Mining. 77
10. (a)
10. (a) Write
Write andand explain
explain DBSCANLA
DBSCANLA Density
Density Based
Based
OR
OR Clustering method
Clustering method based
based on
on connected
connected regions
regions with
with
sufficiently high
sufficiently high density.
density. 77
6.
6. What are
What are the
the different
different methods
methods available
available for
for Efficient
Efficient and
and
scalable frequent
scalable frequent item
item set
set mining
mining ?? Explain
Explain any
any one
one method
method (b) Explain
(b) Explain Grid-based
Grid-based clustering
clustering approach
approach byby considering
considering
along with
along with example
example in in detail.
detail. 13
13 STING (Statistical
STING (Statistical Information
Information Grid).
Grid). 66
MVM—47658
MVM—47658 22 Contd.
Contd. MVM—47658
MVM—47658 33 Contd.
Contd.
http://www.rtmnuonline.com
http://www.rtmnuonline.com
www.rtmnuonline.com