7/8/22, 4:53 PM [MCQ] - Data warehouse and Data mining - LMT

[MCQ]-Data warehouse and Data mining

Introduction to Data Warehouse and Dimensional modelling

ETL Process and OLAP

Introduction to Data Mining, Data Exploration and Preprocessing

Classification, Prediction and Clustering

Module 4

1. How many terms are required for building a Bayes model?


a) 1
b) 2
c) 3
d) 4

Answer: c
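
Explanation (illustrative sketch): the three terms are usually taken to be one conditional probability and two unconditional probabilities, combined by Bayes' rule. The function name `bayes_posterior` and the numbers below are assumptions made up for this example, not part of the quiz.

```python
def bayes_posterior(likelihood: float, prior: float, evidence: float) -> float:
    """Return P(H | E) = P(E | H) * P(H) / P(E)."""
    if evidence <= 0:
        raise ValueError("P(E) must be positive")
    return likelihood * prior / evidence


# Illustrative numbers only (hypothetical, not from the quiz).
p_e_given_h = 0.9   # conditional probability P(E | H)
p_h = 0.01          # unconditional probability P(H)
p_e = 0.05          # unconditional probability P(E)

posterior = bayes_posterior(p_e_given_h, p_h, p_e)
print(round(posterior, 4))
```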

2. What is needed to make probabilistic systems feasible in the world?


a) Reliability
b) Crucial robustness
c) Feasibility
d) None of the mentioned

Answer: b

3. Where can Bayes' rule be used?


a) Solving queries
b) Increasing complexity
c) Decreasing complexity
d) Answering probabilistic query

Answer: d

4. What does a Bayesian network provide?


a) Complete description of the domain
b) Partial description of the domain
c) Complete description of the problem
d) None of the mentioned

Answer: a

5. How can the entries in the full joint probability distribution be calculated?
a) Using variables
b) Using information
c) Both Using variables & information
d) None of the mentioned

Answer: b
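
Explanation (illustrative sketch): each entry of the full joint distribution can be computed from the information stored in the network, as the product of every node's probability given its parents. The two-node network Rain -> WetGrass and all of its probabilities are hypothetical values chosen for this example.

```python
from itertools import product

# Hypothetical two-node network: Rain -> WetGrass (numbers are made up).
p_rain = {True: 0.2, False: 0.8}
p_wet_given_rain = {
    True: {True: 0.9, False: 0.1},   # P(wet | rain)
    False: {True: 0.1, False: 0.9},  # P(wet | no rain)
}


def joint(rain: bool, wet: bool) -> float:
    """P(rain, wet) = P(rain) * P(wet | rain)."""
    return p_rain[rain] * p_wet_given_rain[rain][wet]


# Enumerate every entry of the full joint distribution.
full_joint = {(r, w): joint(r, w) for r, w in product([True, False], repeat=2)}
for assignment, probability in full_joint.items():
    print(assignment, round(probability, 4))
```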

6. How can a Bayesian network be used to answer any query?


a) Full distribution
b) Joint distribution
c) Partial distribution
d) All of the mentioned

Answer: b

7. How can the compactness of a Bayesian network be described?
a) Locally structured
b) Fully structured
c) Partial structure
d) All of the mentioned

Answer: a

8. With what is the local structure associated?


a) Hybrid
b) Dependent
c) Linear
d) None of the mentioned

Answer: c

9. Which condition is used to influence a variable directly by all the others?


a) Partially connected
b) Fully connected
c) Local connected
d) None of the mentioned

Answer: b

10. What is the relationship between a node and its predecessors when constructing a Bayesian network?
a) Functionally dependent
b) Dependent
c) Conditionally independent
d) Both Conditionally dependent & Dependent

Answer: c

11. A _________ is a decision support tool that uses a tree-like graph or model of decisions and their possible consequences, including chance event outcomes, resource costs, and utility.
a) Decision tree
b) Graphs
c) Trees
d) Neural Networks

Answer: a

12. Decision Tree is a display of an algorithm.


a) True
b) False

Answer: a

13. What is Decision Tree?


a) Flow-Chart
b) Structure in which each internal node represents a test on an attribute, each branch represents an outcome of the test, and each leaf node represents a class label
c) Both a flow-chart and a structure in which each internal node represents a test on an attribute, each branch represents an outcome of the test, and each leaf node represents a class label
d) None of the mentioned

Answer: c
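
Explanation (illustrative sketch): the structure described in option b can be written as nested dictionaries, where internal nodes name the attribute being tested, branches are keyed by the test outcome, and leaves are class labels. The attributes (`outlook`, `windy`) and labels are hypothetical.

```python
# Hypothetical tree: internal nodes test an attribute, branches are test
# outcomes, and leaves (plain strings) are class labels.
tree = {
    "attribute": "outlook",
    "branches": {
        "sunny": {
            "attribute": "windy",
            "branches": {True: "stay in", False: "play"},
        },
        "rainy": "stay in",
    },
}


def classify(node, record):
    """Follow branches until a leaf (class label) is reached."""
    while isinstance(node, dict):
        outcome = record[node["attribute"]]
        node = node["branches"][outcome]
    return node


label = classify(tree, {"outlook": "sunny", "windy": False})
print(label)
```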

14. Decision Trees can be used for Classification Tasks.


a) True
b) False

Answer: a

15. Which of the following are decision tree nodes?
a) Decision Nodes
b) End Nodes
c) Chance Nodes
d) All of the mentioned

Answer: d

16. Decision Nodes are represented by ____________


a) Disks
b) Squares
c) Circles
d) Triangles

Answer: b

17. Chance Nodes are represented by __________


a) Disks
b) Squares
c) Circles
d) Triangles

Answer: c

18. End Nodes are represented by __________


a) Disks
b) Squares
c) Circles
d) Triangles

Answer: d

19. Which of the following are advantages of decision trees?


a) Possible scenarios can be added
b) Use a white-box model, if a given result is provided by a model
c) Worst, best and expected values can be determined for different scenarios
d) All of the mentioned

Answer: d

20. Which of the following is a valid component of a predictor?


a) data
b) question
c) algorithm
d) all of the mentioned

Answer: d

21. Point out the wrong statement.


a) In-sample error is also called generalization error
b) Out-of-sample error is the error rate you get on a new dataset
c) In-sample error is also called resubstitution error
d) All of the mentioned

Answer: a

22. Which of the following is the correct order of working?


a) questions -> input data -> algorithms
b) questions -> evaluation -> algorithms
c) evaluation -> input data -> algorithms
d) all of the mentioned

Answer: a

23. Which of the following shows the correct relative order of importance?


a) question->features->data->algorithms
b) question->data->features->algorithms
c) algorithms->data->features->question
d) none of the mentioned

Answer: b

24. Point out the correct statement.


a) In-sample error is the error rate you get on the same dataset used to build the predictor
b) Data have two parts: signal and noise
c) The goal of a predictor is to find the signal
d) All of the mentioned

Answer: d

25. Which of the following is a characteristic of the best machine learning method?


a) Fast
b) Accurate
c) Scalable
d) All of the mentioned

Answer: d

26. True positive means correctly rejected.


a) True
b) False

Answer: b
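
Explanation (illustrative sketch): a true positive is a case that was correctly identified; "correctly rejected" describes a true negative. The helper `confusion_counts` and the sample labels are hypothetical.

```python
def confusion_counts(actual, predicted):
    """Count the four outcomes of a binary classifier."""
    counts = {"TP": 0, "FP": 0, "TN": 0, "FN": 0}
    for a, p in zip(actual, predicted):
        if p and a:
            counts["TP"] += 1   # correctly identified
        elif p and not a:
            counts["FP"] += 1   # incorrectly identified
        elif not p and not a:
            counts["TN"] += 1   # correctly rejected
        else:
            counts["FN"] += 1   # incorrectly rejected
    return counts


# Made-up labels for illustration.
actual = [True, True, False, False, True, False]
predicted = [True, False, False, True, True, False]
counts = confusion_counts(actual, predicted)
print(counts)
```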

27. Which of the following trade-offs occurs during prediction?


a) Speed vs Accuracy
b) Simplicity vs Accuracy
c) Scalability vs Accuracy
d) All of the mentioned

Answer: d

28. Which of the following expressions is true?


a) In-sample error < out-of-sample error
b) In-sample error > out-of-sample error
c) In-sample error = out-of-sample error
d) All of the mentioned

Answer: a
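
Explanation (illustrative sketch): a model tuned to its training data typically looks better on that data than on new data. The toy 1-nearest-neighbour predictor below memorises the training set, so its in-sample (resubstitution) error is zero, while a noisy training point causes a mistake on unseen data. The data and function names are assumptions made up for this example, and the inequality is typical rather than guaranteed.

```python
def predict_1nn(train, x):
    """Return the label of the training point closest to x."""
    nearest = min(train, key=lambda point: abs(point[0] - x))
    return nearest[1]


def error_rate(train, data):
    """Fraction of (x, label) pairs in data that the predictor gets wrong."""
    wrong = sum(1 for x, y in data if predict_1nn(train, x) != y)
    return wrong / len(data)


# Made-up 1-D data; (2.9, 0) plays the role of a noisy training point.
train = [(1.0, 0), (2.0, 0), (2.9, 0), (3.0, 1), (4.0, 1)]
test = [(1.5, 0), (2.8, 1), (3.1, 1), (3.5, 1)]

in_sample_error = error_rate(train, train)
out_of_sample_error = error_rate(train, test)
print(in_sample_error, out_of_sample_error)
```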

29. Backtesting is a key component of effective trading-system development.


a) True
b) False

Answer: a

30. Which of the following is a correct use of cross validation?


a) Selecting variables to include in a model
b) Comparing predictors
c) Selecting parameters in prediction function
d) All of the mentioned

Answer: d
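
Explanation (illustrative sketch): all three uses rely on the same mechanism, which is to hold out each fold once, fit on the remaining folds, and compare the held-out errors. The splitter below shows only the index bookkeeping; `k_fold_splits` is a hypothetical helper, not a library function.

```python
def k_fold_splits(n_samples: int, k: int):
    """Yield (train_indices, test_indices) for k contiguous folds."""
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    indices = list(range(n_samples))
    # Spread the remainder over the first folds so sizes differ by at most 1.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        yield train_idx, test_idx
        start += size


splits = list(k_fold_splits(10, 3))
for train_idx, test_idx in splits:
    print(len(train_idx), test_idx)
```

In practice the samples are usually shuffled before splitting; that step is left out here to keep the output reproducible.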

31. Point out the wrong combination.


a) True negative = correctly rejected
b) False negative = incorrectly rejected
c) False positive = correctly identified
d) All of the mentioned

Answer: c

32. Which of the following is a common error measure?


a) Sensitivity
b) Median absolute deviation
c) Specificity
d) All of the mentioned

Answer: d
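
Explanation (illustrative sketch): sensitivity and specificity are both ratios taken from the confusion matrix. The counts below are made up for illustration.

```python
def sensitivity(tp: int, fn: int) -> float:
    """True positive rate: TP / (TP + FN)."""
    return tp / (tp + fn)


def specificity(tn: int, fp: int) -> float:
    """True negative rate: TN / (TN + FP)."""
    return tn / (tn + fp)


# Made-up confusion-matrix counts for illustration.
tp, fp, tn, fn = 40, 10, 45, 5
sens = sensitivity(tp, fn)
spec = specificity(tn, fp)
print(round(sens, 3), round(spec, 3))
```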

33. Which of the following is not a machine learning algorithm?


a) SVG
b) SVM
c) Random forest
d) None of the mentioned

Answer: a

34. Point out the wrong statement.


a) ROC curve stands for receiver operating characteristic
b) For time series, data must be in chunks
c) Random sampling must be done with replacement
d) None of the mentioned

Answer: d

35. Which of the following is a metric for a categorical outcome?


a) RMSE
b) RSquared
c) Accuracy
d) All of the mentioned

Answer: c

36. For k-fold cross-validation, a larger k value implies more bias.


a) True
b) False

Answer: b

37. Which of the following methods is used for trainControl resampling?


a) repeatedcv
b) svm
c) bag32
d) none of the mentioned

Answer: a

38. Which of the following can be used to create the most common graph types?
a) qplot
b) quickplot
c) plot
d) all of the mentioned

Answer: a

39. For k-fold cross-validation, a smaller k value implies less variance.


a) True
b) False

Answer: a

40. Predicting with trees evaluates _____________ within each group of data.
a) equality
b) homogeneity
c) heterogeneity
d) all of the mentioned

Answer: b
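
Explanation (illustrative sketch): one common way to score how homogeneous a group is when growing a tree is Gini impurity, where 0 means every item in the group has the same label. Using Gini here is an assumption for illustration; the quiz does not name a specific measure.

```python
from collections import Counter


def gini_impurity(labels) -> float:
    """Return 1 - sum(p_k^2); 0.0 means the group is perfectly homogeneous."""
    n = len(labels)
    if n == 0:
        raise ValueError("labels must not be empty")
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


# Made-up groups for illustration.
pure_group = ["yes", "yes", "yes", "yes"]
mixed_group = ["yes", "no", "yes", "no"]
print(gini_impurity(pure_group), gini_impurity(mixed_group))
```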

41. Point out the wrong statement.


a) Training and testing data must be processed in different ways
b) Test transformation would mostly be imperfect
c) The first goal is statistical and second is data compression in PCA
d) All of the mentioned

Answer: a

42. Which of the following method options is provided by train function for bagging?
a) bagEarth
b) treebag
c) bagFDA
d) all of the mentioned

Answer: d

43. Which of the following is correct with respect to random forest?


a) Random forests are difficult to interpret but often very accurate
b) Random forests are easy to interpret and often very accurate
c) Random forests are difficult to interpret and much less accurate
d) None of the mentioned

Answer: a

44. Point out the correct statement.


a) Prediction with regression is easy to implement
b) Prediction with regression is easy to interpret
c) Prediction with regression performs well when linear model is correct
d) All of the mentioned

Answer: d

45. Which of the following libraries is used for boosting generalized additive models?
a) gamBoost
b) gbm
c) ada
d) all of the mentioned

Answer: a

46. The principal components are equal to left singular values if you first scale the variables.
a) True
b) False

Answer: b

47. Which of the following is statistical boosting based on additive logistic regression?
a) gamBoost
b) gbm
c) ada
d) mboost

Answer: a

48. Which of the following is one of the largest subclasses of boosting?
a) variance boosting
b) gradient boosting
c) mean boosting
d) all of the mentioned

Answer: b

49. PCA is most useful for nonlinear models.


a) True
b) False

Answer: b

50. Which of the following clustering types has the characteristic shown in the figure below? (Figure not reproduced.)

a) Partitional
b) Hierarchical
c) Naive bayes
d) None of the mentioned

Answer: b

51. Point out the correct statement.


a) The choice of an appropriate metric will influence the shape of the clusters
b) Hierarchical clustering is also called HCA
c) In general, the merges and splits are determined in a greedy manner
d) All of the mentioned

Answer: d

52. Which of the following is finally produced by Hierarchical Clustering?


a) final estimate of cluster centroids
b) tree showing how close things are to each other
c) assignment of each point to clusters
d) all of the mentioned

Answer: b
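
Explanation (illustrative sketch): agglomerative hierarchical clustering repeatedly merges the two closest clusters, and the recorded sequence of merges is the tree (dendrogram). The sketch below uses single linkage on 1-D points; the linkage choice, the function name `single_linkage`, and the data are assumptions made for this example.

```python
def single_linkage(points):
    """Agglomerative clustering of 1-D points; returns the list of merges."""
    clusters = [(p,) for p in points]  # start with one cluster per point
    merges = []
    while len(clusters) > 1:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                # Single linkage: distance between the closest members.
                d = min(abs(a - b) for a in clusters[i] for b in clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        merges.append((clusters[i], clusters[j], d))
        merged = clusters[i] + clusters[j]
        # Greedy step: replace the two closest clusters with their union.
        clusters = [c for idx, c in enumerate(clusters) if idx not in (i, j)] + [merged]
    return merges


merges = single_linkage([1.0, 2.0, 6.0, 7.5, 20.0])
for left, right, distance in merges:
    print(left, right, distance)
```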

53. Which of the following is required by K-means clustering?


a) defined distance metric
b) number of clusters
c) initial guess as to cluster centroids
d) all of the mentioned

Answer: d
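
Explanation (illustrative sketch): the three requirements appear directly as inputs to a basic k-means loop (Lloyd's algorithm). Here the distance metric is absolute difference on 1-D points, the number of clusters is the length of the initial centroid list, and the initial guess is passed in explicitly. All names and data are hypothetical.

```python
def k_means(points, centroids, max_iter=100):
    """Lloyd's algorithm on 1-D points with absolute distance."""
    centroids = list(centroids)
    k = len(centroids)  # number of clusters
    for _ in range(max_iter):
        # Assignment step: nearest centroid under the chosen distance metric.
        assignments = [
            min(range(k), key=lambda c: abs(p - centroids[c])) for p in points
        ]
        # Update step: move each centroid to the mean of its points.
        new_centroids = []
        for c in range(k):
            members = [p for p, a in zip(points, assignments) if a == c]
            new_centroids.append(sum(members) / len(members) if members else centroids[c])
        if new_centroids == centroids:
            break  # converged
        centroids = new_centroids
    return centroids, assignments


# Made-up data and initial guess for illustration.
points = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]
centroids, assignments = k_means(points, centroids=[0.0, 5.0])
print(centroids, assignments)
```

A different initial guess can lead to a different final clustering, which is why k-means is usually described as non-deterministic when the starting centroids are chosen at random.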

54. Point out the wrong statement.


a) k-means clustering is a method of vector quantization
b) k-means clustering aims to partition n observations into k clusters
c) k-nearest neighbor is the same as k-means
d) none of the mentioned

Answer: c

55. Which of the following combinations is incorrect?


a) Continuous – euclidean distance
b) Continuous – correlation similarity
c) Binary – manhattan distance
d) None of the mentioned

Answer: d
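
Explanation (illustrative sketch): the three measures named in the options can be written in a few lines each. For binary vectors, Manhattan distance reduces to the number of positions that differ. The function names and sample vectors are assumptions for this example.

```python
import math


def euclidean(a, b):
    """Straight-line distance, commonly used for continuous data."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a, b):
    """Sum of absolute differences; counts mismatches for binary data."""
    return sum(abs(x - y) for x, y in zip(a, b))


def correlation(a, b):
    """Pearson correlation, used as a similarity for continuous data."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


print(euclidean([0, 0], [3, 4]))
print(manhattan([1, 0, 1, 1], [0, 0, 1, 0]))
print(correlation([1, 2, 3], [2, 4, 6]))
```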

56. Hierarchical clustering should be primarily used for exploration.


a) True
b) False

Answer: a

57. Which of the following functions is used for k-means clustering?


a) k-means
b) k-mean
c) heatmap
d) none of the mentioned

Answer: a

58. Which of the following clustering types requires a merging approach?


a) Partitional
b) Hierarchical
c) Naive Bayes
d) None of the mentioned

Answer: b

59. K-means is not deterministic and consists of a number of iterations.


a) True
b) False

Answer: a

60. Hierarchical clustering should be mainly used for exploration.


a) True
b) False

Answer: a

61. K-means clustering consists of a number of iterations and is not deterministic.


a) True
b) False

Answer: a

62. Which is needed by K-means clustering?


a) defined distance metric
b) number of clusters
c) initial guess as to cluster centroids
d) all of these

Answer: d

63. Which function is used for k-means clustering?


a) k-means
b) k-mean
c) heatmap
d) none of the mentioned

Answer: a

64. Which is conclusively produced by Hierarchical Clustering?


a) final estimation of cluster centroids
b) tree showing how nearby things are to each other
c) assignment of each point to clusters
d) all of these

Answer: b

65. Which clustering technique requires a merging approach?


a) Partitional
b) Hierarchical
c) Naive Bayes
d) None of the mentioned

Answer: b

Mining Frequent Patterns and Association Rules
