I. Choose the Correct Alternative; II. Fill in the Blanks

This document contains 20 multiple-choice and fill-in-the-blank questions on data mining concepts. The questions cover prediction, clustering algorithms, time-series data, text mining, spatial data, and overfitting. Algorithms and concepts asked about include decision trees, k-means clustering, DBSCAN, hierarchical clustering, spatiotemporal data, web content mining, and frequent patterns.

Uploaded by

Srimanth Reddy
Copyright
© All Rights Reserved

I. Choose the correct alternative:

1. Prediction can be viewed as forecasting a _________ value. [ ]
a) non-continuous  b) constant  c) continuous  d) variable
2. A _____________ is a flowchart-like tree structure. [ ]
a) Naive Bayesian  b) Rule-based  c) Decision tree  d) Binary tree
3. In the ________ algorithm, each cluster is represented by the center of gravity of the cluster. [ ]
a) k-medoid  b) k-means  c) STIRR  d) ROCK
4. Pick out a hierarchical clustering algorithm. [ ]
a) DBSCAN  b) BIRCH  c) PAM  d) CURE
5. Clustering Large Applications is abbreviated as _____. [ ]
a) DBSCAN  b) OPTICS  c) STING  d) CLARA
6. DBSCAN is a _____ clustering algorithm. [ ]
a) partitioning  b) hierarchical
c) density-based  d) grid-based
7. ______________ sequences include DNA and protein sequences. [ ]
a) Time-series data  b) Symbolic sequence data
c) Biological sequences  d) Sequence data
8. ________________ data are data that relate to both space and time. [ ]
a) Time-series  b) Spatiotemporal  c) Multimedia  d) Text and web
9. ________ mining analyzes web content such as text, multimedia data, and structured data. [ ]
a) Web content  b) Spatial  c) Multimedia  d) Data stream
10. _____ mining is an interdisciplinary field that draws on information retrieval, data mining, machine learning, statistics, and computational linguistics. [ ]
a) Data stream  b) Text  c) Spatial  d) Multimedia

II. Fill in the blanks:

11. Many time-series similarity queries require _______________ matching.

12. A pattern is considered frequent if its count satisfies a _______________ support.

13. “High quality” in text mining usually refers to a combination of _______________.

14. Spatial data, in many cases, refer to _______________ data stored in geospatial data repositories.

15. A data matrix is also called a _______________ structure.

16. _______________ is a statistical information grid approach.

17. A hierarchical method can be classified as being either _______________.

18. ROCK stands for _______________.

19. A categorical variable is a generalized form of a _______________, with more than two states.

20. Overfitting means _______________.