Mcqs Bank Unit 1: A) The Autonomous Acquisition of Knowledge Through The Use of Computer Programs
MCQs Bank
Unit 1:
1. What is Machine learning?
a) The autonomous acquisition of knowledge through the use of computer
programs
b) The autonomous acquisition of knowledge through the use of manual
programs
c) The selective acquisition of knowledge through the use of computer
programs
d) The selective acquisition of knowledge through the use of manual programs
2. Which of the following is NOT a factor that affects the performance of a learner system?
a) Representation scheme used
b) Training scenario
c) Type of feedback
d) Good data structures
3. Which of the following statements about regression is FALSE?
a) It relates inputs to outputs.
b) It is used for prediction.
c) It discovers causal relationships.
d) It may be used for interpretation.
4. What are the different types of models in Machine Learning?
a) Predictive
b) Descriptive
c) Both a and b
d) None of these
5. Which learning technique is used in regression?
a) Supervised Learning
b) Unsupervised Learning
c) Reinforcement Learning
6. Several sets of data related to each other and used to make decisions in machine
learning algorithms, e.g. comments that make you happy and comments that make you
sad.
a) Dataset
b) Supervised learning
c) Unsupervised learning
d) Classifiers
7. Type of machine learning algorithm used to infer information from data without
input from humans.
a) Dataset
b) Supervised learning
c) Unsupervised learning
d) Classifiers
8. Where does the observer reward the agent for correct responses?
a) Decision tree learning
b) Reinforcement learning
c) Predictive models
d) Sentiment analysis
9. After training the ML model, we see how accurately it predicts the
answer/responds. For example – does it cry when I say something mean to it. What
is that called?
a) Recognition
b) Training
c) Predictive models
d) Testing
10. Suppose you are working on weather prediction, and use a learning algorithm
to predict tomorrow's temperature (in degrees Centigrade/Fahrenheit). Would you
treat this as a classification or a regression problem?
a) Regression
b) Classification
11. Which of the following is NOT a learning method?
a) Memorization
b) Analogy
c) Deduction
d) Introduction
12. Which of the following is NOT a level of knowledge in language understanding?
a) Phonological
b) Syntactic
c) Empirical
d) Logical
13. A model of language consists of several categories. Which of the following is NOT one of them?
a) Language units
b) Role structure of units
c) System constraints
d) Structural units
14. Which of the following is a widely used and effective machine learning
algorithm based on the idea of bagging?
a) Decision Tree
b) Regression
c) Classification
d) Random Forest
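The bagging idea referred to in question 14 can be sketched in a few lines. This is a minimal illustration, not a Random Forest: it assumes a single numeric feature, uses one-threshold "stumps" in place of full decision trees, and omits the random feature selection that a Random Forest adds. The names `fit_stump`, `bagged_stumps` and `predict` and the toy data are hypothetical.

```python
import random
from collections import Counter

def fit_stump(sample):
    """Pick the threshold on a 1-D feature that misclassifies the fewest points.

    The stump predicts 1 when x > threshold and 0 otherwise.
    """
    best_threshold, best_errors = None, None
    for threshold, _ in sample:
        errors = sum((x > threshold) != bool(y) for x, y in sample)
        if best_errors is None or errors < best_errors:
            best_threshold, best_errors = threshold, errors
    return best_threshold

def bagged_stumps(data, n_models=25, seed=0):
    """Train each stump on a bootstrap sample (drawn with replacement)."""
    rng = random.Random(seed)
    return [fit_stump([rng.choice(data) for _ in data]) for _ in range(n_models)]

def predict(thresholds, x):
    """Aggregate the individual stumps by majority vote."""
    votes = Counter(int(x > t) for t in thresholds)
    return votes.most_common(1)[0][0]

# Toy (feature, label) pairs: small values are class 0, large values class 1.
data = [(0.5, 0), (1.0, 0), (1.5, 0), (2.0, 0),
        (3.0, 1), (3.5, 1), (4.0, 1), (4.5, 1)]
model = bagged_stumps(data)
print(predict(model, 0.0), predict(model, 5.0))
```

Each model sees a different resampled view of the data, and the vote averages out their individual errors.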
15. To find the minimum or the maximum of a function, we set the gradient to zero
because:
a) The value of the gradient at extrema of a function is always zero
b) Depends on the type of problem
c) Both A and B
d) None of the above
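A short worked example for question 15, using a function chosen here purely for illustration. Note the assumptions: the statement holds for differentiable functions at interior extrema, and a zero gradient is a necessary condition rather than a sufficient one (for instance, x^3 has zero derivative at x = 0 without having an extremum there).

```latex
f(x) = (x - 3)^2 + 2, \qquad
f'(x) = 2(x - 3) = 0 \;\Rightarrow\; x = 3, \qquad
f''(3) = 2 > 0 \;\Rightarrow\; \text{minimum, with } f(3) = 2.
```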
16. Which of the following is a good test dataset characteristic?
a) Large enough to yield meaningful results
b) Is representative of the dataset as a whole
c) Both A and B
d) None of the above
17. How do you handle missing or corrupted data in a dataset?
a) Drop missing rows or columns
b) Replace missing values with mean/median/mode
c) Assign a unique category to missing values
d) All of the above
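The three strategies listed in question 17 can be illustrated on a tiny table. This is a sketch using only the standard library; the records and column names are made up, and missing values are assumed to be stored as `None`.

```python
from statistics import mean

# Hypothetical records; None marks a missing value.
rows = [
    {"age": 20, "city": "Pune"},
    {"age": None, "city": "Delhi"},
    {"age": 30, "city": None},
    {"age": 40, "city": "Pune"},
]

# (a) Drop every row that contains a missing value.
dropped = [r for r in rows if None not in r.values()]

# (b) Replace a missing numeric value with the mean of the known values.
age_fill = mean(r["age"] for r in rows if r["age"] is not None)
imputed = [{**r, "age": age_fill if r["age"] is None else r["age"]} for r in rows]

# (c) Assign missing categorical values their own category.
flagged = [{**r, "city": r["city"] or "Unknown"} for r in imputed]

print(len(dropped), imputed[1]["age"], flagged[2]["city"])
```

Which strategy is appropriate depends on how much data is missing and why; the sketch only shows the mechanics.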
18. What is the purpose of performing cross-validation?
a) To assess the predictive performance of the models
b) To judge how the trained model performs outside the sample on test data
c) Both A and B
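As a rough illustration of question 18, k-fold cross-validation can be written without any libraries. The sketch below assumes contiguous, unshuffled folds and a deliberately trivial model that always predicts the mean training target; `k_fold_indices`, `cross_validate`, `fit_mean` and `mse` are hypothetical helper names.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_indices, test_indices) for each of k contiguous folds."""
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test = list(range(start, start + size))
        train = [i for i in range(n_samples) if i < start or i >= start + size]
        yield train, test
        start += size

def cross_validate(xs, ys, k, fit, score):
    """Average a score over the k held-out folds."""
    scores = []
    for train, test in k_fold_indices(len(xs), k):
        model = fit([xs[i] for i in train], [ys[i] for i in train])
        scores.append(score(model, [xs[i] for i in test], [ys[i] for i in test]))
    return sum(scores) / len(scores)

def fit_mean(xs, ys):
    """Toy model: remember the mean of the training targets."""
    return sum(ys) / len(ys)

def mse(model, xs, ys):
    """Mean squared error of the constant prediction on held-out targets."""
    return sum((y - model) ** 2 for y in ys) / len(ys)

xs = list(range(6))
ys = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
cv_error = cross_validate(xs, ys, k=3, fit=fit_mean, score=mse)
print(cv_error)
```

Every sample is held out exactly once, so the averaged score estimates performance on data the model was not trained on.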
19. The problem of finding hidden structures in unlabeled data is:
a) Supervised learning
b) Unsupervised Learning
c) Reinforcement Learning
20. A telecommunication company wants to segment its customers into
distinct groups in order to send appropriate subscription offers. This is an example
of
a) Supervised Learning
b) Data Extraction
c) Serration
d) Unsupervised learning
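Customer segmentation of the kind described in question 20 is often done with clustering. The following is a minimal k-means sketch on a single made-up feature (monthly data usage), with hand-picked starting centers; it is one possible technique, not the only way to segment customers.

```python
def k_means_1d(values, centers, n_iter=10):
    """Plain k-means on scalar values, starting from the given centers."""
    centers = list(centers)
    clusters = []
    for _ in range(n_iter):
        clusters = [[] for _ in centers]
        for v in values:
            # Assign each value to its nearest current center.
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            clusters[nearest].append(v)
        # Move each center to the mean of its cluster (keep it if the cluster is empty).
        centers = [sum(c) / len(c) if c else centers[i] for i, c in enumerate(clusters)]
    return centers, clusters

# Hypothetical monthly data usage (GB) per customer; no labels are supplied.
usage = [1.0, 1.5, 2.0, 20.0, 22.0, 24.0]
centers, clusters = k_means_1d(usage, centers=[0.0, 10.0])
print(centers, clusters)
```

The groups emerge from the data alone, which is what makes the task unsupervised.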
21. When is learning not used?
a) Human expertise does not exist
b) Humans are unable to explain their expertise
c) Solution changes in time
d) Solution needs to be adapted to particular cases
e) None of these
22. When is learning used?
a) Human expertise does not exist
b) Humans are unable to explain their expertise
c) Solution changes in time
d) Solution needs to be adapted to particular cases
e) All of these
23. Which of the following is NOT supervised learning?
a) PCA
b) Decision Tree
c) Linear Regression
d) Naive Bayesian
24. In which of the following types of learning does the teacher return reward and
punishment to the learner?
a) Active learning
b) Reinforcement learning
c) Supervised learning
d) Unsupervised learning
25. Which of the following is not an application of learning?
a) Data mining
b) WWW
c) Speech recognition
d) None of the mentioned
26. Which of the following is a component of a learning system?
a) Goal
b) Model
c) Learning rules
d) All of the mentioned
27. Which statement is true about prediction problems?
a) The output attribute must be categorical.
b) The output attribute must be numeric.
c) The resultant model is designed to determine future outcomes.
d) The resultant model is designed to classify current behavior.
28. Which statement is true about neural network and linear regression models?
a) Both models require input attributes to be numeric.
b) Both models require numeric attributes to range between 0 and 1.
c) The output of both models is a categorical attribute value.
d) Both techniques build models whose output is determined by a linear sum
of weighted input attribute values.
e) More than one of a, b, c or d is true.
29. Unlike traditional production rules, association rules
a) allow the same variable to be an input attribute in one rule and an
output attribute in another rule.
b) allow more than one input attribute in a single rule.
c) require input attributes to take on numeric values.
d) require each rule to have exactly one categorical output attribute.
30. Which of the following is a common use of unsupervised clustering?
a) detect outliers
b) determine a best set of input attributes for supervised learning
c) evaluate the likely performance of a supervised learner model
d) determine if meaningful relationships can be found in a dataset
e) All of a, b, c and d are common uses of unsupervised clustering.
Unit 2:
1. Class learning is finding descriptions shared by all _________.
a) Positive Examples
b) Negative Examples
2. A statement made about a population for testing purposes is called a:
a) Statistic
b) Hypothesis
c) Level of Significance
d) Test-Statistic
3. Which of the following best describes the most specific hypothesis?
a) The tightest rectangle that includes all the positive examples and none
of the negative examples
b) The largest rectangle that includes all the positive examples and none of the
negative examples
c) The largest rectangle that includes all the positive examples and all the
negative examples
4. An axis-aligned rectangle can shatter four points in two dimensions. What is its
VC Dimension?
a) 2
b) 3
c) 4
d) 5
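The shattering claim in question 4 can be checked by brute force for a particular point set. The sketch below relies on one observation: a labeling is realizable by an axis-aligned rectangle exactly when the tightest rectangle around the positive points contains no negative point. The point sets are chosen here for illustration, and the check on five points covers only that one configuration, not every possible one.

```python
from itertools import product

def tightest_rectangle(points):
    """Smallest axis-aligned rectangle (x_min, x_max, y_min, y_max) around points."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    return min(xs), max(xs), min(ys), max(ys)

def inside(rect, p):
    """True if point p lies in the closed rectangle rect."""
    x_min, x_max, y_min, y_max = rect
    return x_min <= p[0] <= x_max and y_min <= p[1] <= y_max

def shattered_by_rectangles(points):
    """True if every +/- labeling of points is realized by some rectangle."""
    for labels in product([False, True], repeat=len(points)):
        positives = [p for p, lab in zip(points, labels) if lab]
        negatives = [p for p, lab in zip(points, labels) if not lab]
        if not positives:
            continue  # an empty rectangle labels every point negative
        rect = tightest_rectangle(positives)
        # If even the tightest rectangle captures a negative, no rectangle works.
        if any(inside(rect, n) for n in negatives):
            return False
    return True

diamond = [(0, 1), (1, 0), (0, -1), (-1, 0)]
with_center = diamond + [(0, 0)]
print(shattered_by_rectangles(diamond), shattered_by_rectangles(with_center))
```

The four diamond points are shattered, while adding the center point breaks the labeling in which the outer four are positive and the center is negative.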
5. In Probably Approximately Correct (PAC) learning, given a class, C, we want to
find the number of examples, N, such that with probability at least 1 - δ, the
hypothesis h has error at most ε, for arbitrary δ ≤ 1/2 and ε > 0. Which one is the
correct value of N?
a) at least (4/ε) log(4/δ)
b) at most (4/ε) log(4/δ)
c) (1 - ε/4)^N
d) None
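To get a feel for the magnitude of the bound in question 5, it can be evaluated numerically. The sketch assumes the logarithm is the natural logarithm and rounds up to the next whole example; `pac_sample_bound` is a hypothetical helper name.

```python
import math

def pac_sample_bound(epsilon, delta):
    """Smallest integer N with N >= (4 / epsilon) * ln(4 / delta)."""
    return math.ceil((4.0 / epsilon) * math.log(4.0 / delta))

# Error at most 0.1 with probability at least 0.95 (delta = 0.05).
print(pac_sample_bound(0.1, 0.05))
```

Tightening either ε or δ increases the number of examples needed, with ε having the stronger effect because N grows as 1/ε but only logarithmically in 1/δ.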
6. Which one is not a correct example of noise in data?
a) Imprecision in collecting training data
b) Errors in labeling data points (positive to negative and vice versa)
c) Hidden or neglected attributes which are important but not included
d) Correctly recorded class labels
7. How many hypotheses are needed for a K-Class classification problem?
a) 2
b) k
c) k-1
d) 4
8. When the output class is a numeric value, we learn a numeric function. What is
this process called when there is noise or no noise respectively?
a) Interpolation, Extrapolation
b) Extrapolation, Regression
c) Interpolation, Regression
d) Regression, Interpolation
9. What is inductive bias?
a) Set of learning rules
b) Set of assumptions to make learning possible
c) Set of class labels to make learning successful
10. The key purpose of splitting the dataset into training and test sets is:
a) To speed up the training process
b) To reduce the number of features
c) To validate the results of testing unknown inputs
11. What is the purpose of performing cross-validation?
a) To assess the predictive performance of the models
b) To judge how the trained model performs outside the sample on test data
c) Both A and B
12. Suppose you have trained a logistic regression classifier and, for a new
example x, it outputs the prediction h_θ(x) = 0.2. This means
a) Our estimate for P(y=1 | x)
b) Our estimate for P(y=0 | x)
c) Our estimate for P(y=1 | x)
d) Our estimate for P(y=0 | x)
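For question 12, the standard reading of a logistic regression output is that the hypothesis value estimates P(y=1 | x), so P(y=0 | x) is its complement. A small sketch under that assumption follows; the helper names are hypothetical, and the input `z` stands for the linear score that the sigmoid squashes into a probability.

```python
import math

def sigmoid(z):
    """Logistic function mapping a real-valued score to the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def class_probabilities(h):
    """Given h = P(y=1 | x), return (P(y=1 | x), P(y=0 | x))."""
    return h, 1.0 - h

# A linear score of ln(0.25) corresponds to a hypothesis output of 0.2.
h = sigmoid(math.log(0.25))
p_y1, p_y0 = class_probabilities(h)
print(round(p_y1, 3), round(p_y0, 3))
```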
13. The VC dimension of hypothesis space H1 is larger than the VC dimension of
hypothesis space H2. Which of the following can be inferred from this?
a) The number of examples required for learning a hypothesis in H1 is
larger than the number of examples required for H2
b) The number of examples required for learning a hypothesis in H1 is smaller
than the number of examples required for H2
c) No relation to number of samples required for PAC learning.
14. For a particular learning task, suppose the required error parameter changes
from 0.1 to 0.01. How many times more samples will be required for PAC learning?
a) Same
b) 2 times
c) 10 times
d) 1000 times
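One way to reason about question 14 is to assume a sample bound of the form given in question 5 of this unit, where N scales as 1/ε for fixed δ. Under that assumption the ratio of required samples works out as:

```latex
N(\varepsilon) \ge \frac{4}{\varepsilon}\,\log\frac{4}{\delta}
\quad\Rightarrow\quad
\frac{N(0.01)}{N(0.1)} = \frac{4/0.01}{4/0.1} = \frac{400}{40} = 10 .
```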
15. Suppose the VC dimension of a hypothesis space is 4. Which of the following
is true?
a) No sets of 4 points can be shattered by the hypothesis space.
b) At least one set of 4 points can be shattered by the hypothesis space.
c) All sets of 4 points can be shattered by the hypothesis space.
d) A set of 5 points can be shattered by the hypothesis space.
16. Computational complexity of classes of learning problems depends on which
of the following?
a) The size or complexity of the hypothesis space considered by learner
b) The accuracy to which the target concept must be approximated
c) The probability that the learner will output a successful hypothesis
d) All of the above
17. VC dimension for conjunctions of n Boolean literals is:
a) At least n
b) At most n
c) Can’t say
d) None
18. If the null hypothesis is false then which of the following is accepted?
a) Null Hypothesis
b) Positive Hypothesis
c) Negative Hypothesis
d) Alternative Hypothesis
19. Which of the following methods do we use to find the best fit line for data in
Linear Regression?
a) Least Square Error
b) Maximum Likelihood
c) Logarithmic Loss
d) Both A and B
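The least-squares criterion in question 19 has a closed-form solution for a single input variable. The sketch below implements that formula directly on made-up data; under the usual assumption of Gaussian noise, the maximum-likelihood fit coincides with this least-squares fit.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    # Covariance-like and variance-like sums (the common 1/n factor cancels).
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    return slope, intercept

# Points lying exactly on y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]
slope, intercept = fit_line(xs, ys)
print(slope, intercept)
```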
20. Linear Regression is a supervised machine learning algorithm.
a) True
b) False
21. What is generalization?
a) It is how well a model trained on the training set predicts the right
output for new instances.
b) It is about choosing between possible hypotheses H.
c) Both a and b
d) None of these
22. ___________ refers to a model that can neither model the training data nor
generalize to new data.
a) good fitting
b) overfitting
c) underfitting
d) all of the above
23. Suppose your model is overfitting. Which of the following is NOT a valid way
to try and reduce the overfitting?
a) Increase the amount of training data.
b) Improve the optimization algorithm being used for error minimization.
c) Decrease the model complexity.
d) Reduce the noise in the training data.
24. Why is learning an ill-posed problem?
a) Because of too much data
b) Because of too little data
c) Because of too few hypotheses
d) None of these
25. Which of the following is not a noise type?
a) Imprecision in collecting training data
b) Errors in labeling data points (positive to negative and vice versa)
c) Hidden or neglected attributes which are important but not included
d) None of these