Data Mining Q&A
3. The __________ split is defined as one that does the best job of separating the records into
groups
Answer: Best
5. Which of the following is generally used in finding hidden structure and patterns in a
given unlabeled data?
6. Which of the following refers to obtaining information from unstructured textual data?
11. The total number of categories of functions involved in Data Mining is:
Answer: 5
12. Which clustering technique uses the merging approach?
Answer: Hierarchical
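Note on question 12: the merging approach can be illustrated with a small agglomerative (bottom-up) sketch. This is only a minimal illustration, assuming 1-D points and single linkage; the function names `single_linkage` and `agglomerative` and the sample points are hypothetical and not taken from the quiz.

```python
def single_linkage(a, b):
    """Smallest pairwise distance between two clusters of 1-D points."""
    return min(abs(x - y) for x in a for y in b)


def agglomerative(points, n_clusters):
    """Repeatedly merge the two closest clusters until n_clusters remain."""
    clusters = [[p] for p in points]  # start: every point is its own cluster
    while len(clusters) > n_clusters:
        # Find the pair of clusters with the smallest linkage distance.
        i, j = min(
            ((i, j) for i in range(len(clusters)) for j in range(i + 1, len(clusters))),
            key=lambda ij: single_linkage(clusters[ij[0]], clusters[ij[1]]),
        )
        clusters[i] = clusters[i] + clusters[j]  # the merge step
        del clusters[j]
    return [sorted(c) for c in clusters]


# Illustrative (made-up) data: three visually obvious groups.
result = agglomerative([1.0, 1.5, 5.0, 5.2, 9.0], n_clusters=3)
```

Each pass merges exactly one pair of clusters, which is why the hierarchical method is associated with merging; the divisive variant works in the opposite direction by splitting.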
16. What are the advantages of Classification and Regression Trees (CART)?
Answer: Both
20. Suppose you are using a Decision Tree and your target variable is the price of a house.
What type of tree do you need to predict the target variable?
Answer: Dividing a node into two or more sub-nodes based on if-else conditions
Answer: The end of the decision tree where it cannot be split into further sub-nodes.
24. In a decision tree algorithm, entropy helps to determine a feature or attribute that gives
maximum information about a class which is called _____.
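Note on question 24: the quantity described here is usually called information gain, that is, the parent node's entropy minus the weighted entropy of the child nodes after a split. The sketch below is a minimal illustration under that standard definition; the feature names, rows, and labels are made up for the example.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(rows, labels, feature):
    """Parent entropy minus the weighted entropy of the groups formed by one feature."""
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(row[feature], []).append(label)
    weighted = sum(len(g) / len(labels) * entropy(g) for g in groups.values())
    return entropy(labels) - weighted


# Hypothetical toy data: "outlook" separates the classes perfectly, "windy" does not.
rows = [
    {"outlook": "sunny", "windy": "no"},
    {"outlook": "sunny", "windy": "yes"},
    {"outlook": "rainy", "windy": "no"},
    {"outlook": "rainy", "windy": "yes"},
]
labels = ["play", "play", "stay", "stay"]

gains = {f: information_gain(rows, labels, f) for f in ("outlook", "windy")}
best = max(gains, key=gains.get)  # feature giving the most information about the class
```

In this toy data the feature with the highest gain is the one whose values line up exactly with the class labels.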
25. In Decision Trees, for predicting a class label, the algorithm starts from which node of
the tree?
Answer: Root
26. CART uses the ____________ for determining the best split.
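Note on question 26: CART is commonly described as choosing splits for classification with the Gini index (Gini impurity). The sketch below assumes that criterion and a single numeric feature; `gini`, `best_threshold`, and the sample values are hypothetical names and data used only for illustration.

```python
from collections import Counter


def gini(labels):
    """Gini impurity: 1 minus the sum of squared class proportions."""
    total = len(labels)
    return 1.0 - sum((n / total) ** 2 for n in Counter(labels).values())


def best_threshold(values, labels):
    """Try each midpoint between sorted distinct values.

    Returns (threshold, weighted_gini) for the binary split with the lowest
    weighted impurity, or None if the feature has fewer than two distinct values.
    """
    distinct = sorted(set(values))
    best = None
    for lo, hi in zip(distinct, distinct[1:]):
        t = (lo + hi) / 2
        left = [l for v, l in zip(values, labels) if v <= t]
        right = [l for v, l in zip(values, labels) if v > t]
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
        if best is None or score < best[1]:
            best = (t, score)
    return best


# Made-up data: low values belong to class "a", high values to class "b".
threshold, score = best_threshold([1, 2, 3, 10, 11, 12], ["a", "a", "a", "b", "b", "b"])
```

A weighted impurity of zero means both child nodes are pure, which is what "the best job of separating the records into groups" amounts to under this criterion.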
27. Which of the following is an essential process in which intelligent methods are applied
to extract data patterns?
28. Which one of the following statements is TRUE for a Decision Tree?
29. How do you choose the right node while constructing a decision tree?
30. A __________ is a classification scheme which generates a tree and a set of rules
representing the model of different classes from a given data set
32. K-means clustering requires prior knowledge of the number of clusters as
its input.
Answer: True
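Note on question 32: the reason the answer is True shows up directly in the algorithm's signature, since the number of clusters `k` must be passed in before any assignment can happen. The following is a minimal 1-D sketch, assuming a naive initialisation (the first `k` points) and a fixed iteration count; these choices and the sample data are assumptions made for the example, not part of the quiz.

```python
def kmeans_1d(points, k, iterations=10):
    """Very small k-means for 1-D data; k must be supplied by the caller."""
    centroids = list(points[:k])  # assumption: naive, deterministic initialisation
    clusters = []
    for _ in range(iterations):
        # Assignment step: each point goes to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda c: abs(p - centroids[c]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster
        # (an empty cluster keeps its previous centroid).
        centroids = [
            sum(c) / len(c) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return centroids, clusters


# Made-up data with two obvious groups.
centroids, clusters = kmeans_1d([1.0, 2.0, 9.0, 10.0], k=2)
```

Practical implementations typically use a smarter initialisation and a convergence check instead of a fixed number of iterations.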
35. __________ is a classification scheme which generates a tree and a set of rules.
36. A ______________ database stores a large amount of space-related data, such as maps,
preprocessed remote sensing or medical imaging data etc.
37. The method of arranging data into homogeneous classes according to the common
features present in the data is known as
Answer: Clustering
Answer: 5
40. ___________ is defined as a process used to extract usable data from a larger set of any raw
data.
44. Web content mining describes the discovery of useful information from the _______
contents.
Answer: Web
45. _______________ describes the discovery of useful information from web contents.
46. Which one of the following can be defined as the data object which does not comply with
the general behavior (or the model of available data)?
47. In the example predicting the number of newborns, the final number of total newborns
can be considered as the _________
Answer: Outcome
48. The following statement can be considered an example of _________: Suppose one
wants to predict the number of newborns according to the size of the stork population by
performing supervised learning.
Answer: Regression
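Note on questions 47 and 48: predicting a numeric outcome (the number of newborns) from a numeric input (the size of the stork population) can be sketched with ordinary least squares on a single variable. The numbers below are invented purely for illustration and do not come from any real dataset; `fit_line` is a hypothetical helper name.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = slope * x + intercept for one input variable."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance = sum((x - mean_x) ** 2 for x in xs)
    slope = covariance / variance
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Invented training data: stork counts (input) and newborn counts (outcome).
storks = [10, 20, 30, 40]
newborns = [25, 45, 65, 85]

slope, intercept = fit_line(storks, newborns)
predicted = slope * 50 + intercept  # predicted outcome for a population of 50 storks
```

The outcome variable is continuous, which is what distinguishes this task from classification.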
49. Which of the following is an essential process in which intelligent methods are applied
to extract data patterns?
50. Multiple data sources are combined in which step of the Knowledge
Discovery process?