K.L.N. College of Engineering, Pottapalayam – 630 612.
(An Autonomous Institution, Affiliated to Anna University)
DEPARTMENT OF ELECTRONICS AND COMMUNICAION
ENGINEERING
MCQ Practice - Question Bank
Course Code & Course Name: CS8082 Machine learning Techniques
Academic Year : 2020-2021, Year & Semester : IV Year & VII semester
Unit – I
Part – A ( Remember / Understand Level ) – 1 Mark Questions
Q1 : A machine learning problem involves four attributes plus a class. The attributes
have 3, 2, 2, and 2 possible values each. The class has 3 possible values. How many
maximum possible different examples are there?
a. 12
b. 24
c. 48
d. 72
Answer : d)72
Q2 : Inductive learning is
a. Learning new things from our past experience
b. Learning from a teacher/expert
c. Deriving new facts from past facts
d. On the basis of past experience, formulating a generalized concept
Answer : d) On the basis of past experience, formulating a generalized concept
Q3 : What is Machine learning?
a. The autonomous acquisition of knowledge through the use of computer programs
b. The autonomous acquisition of knowledge through the use of manual programs
c. The selective acquisition of knowledge through the use of computer programs
d. The selective acquisition of knowledge through the use of manual programs
Answer : a) The autonomous acquisition of knowledge through the use of computer programs
Q4 : Which of the factors affect the performance of learner system does not include?
a. Representation scheme used
b. Training scenario
c. Type of feedback
d. Good data structures
Answer : d) Good data structures
Q5 : FIND-S Algorithm starts from the most specific hypothesis and generalize it by
considering only _________examples
a. Negative
b. Positive
c. Negative or Positive
d. None of the above
Answer : b) Positive
Q6 : FIND-S algorithm ignores _______ examples.
a. Negative
b. Positive
c. Both
d. None of the above
Answer : a) Negative
Q7 : The Candidate-Elimination Algorithm represents the _____.
a. Solution Space
b. Version Space
c. Elimination Space
d. All of the above
Answer : b) Version Space
Q8: Inductive learning is based on the knowledge that if something happens a lot it is
likely to be generally.
a. True
b. False
Answer : a) True
Q9: Inductive learning takes examples and generalizes rather than starting with
__________ knowledge
a. Inductive
b. Existing
c. Deductive
d. None of these
Answer : b) Existing
Q10: A drawback of the FIND-S is that, it assumes the consistency within the training
set.
a. True
b. False
Answer : a) True
Q11: Concept learning inferred a --------------- valued functions from training examples of
its input and output.
a. Decimal
b. Boolean
c. Hexadecimal
d. All of the above
Answer : b) Boolean
Q12: A _________ is a decision support tool that uses a tree-like graph or model of
decisions and their possible consequences, including chance event outcomes, resource
costs, and utility.
a. Decision tree
b. Graphs
c. Trees
d. Neural Networks
Answer : a) Decision tree
Q13: Which of the following are the advantage/s of Decision Trees?
a. Possible Scenarios can be added
b. Use a white box model, If given result is provided by a model
c. Worst, best and expected values can be determined for different scenarios
d. All of the mentioned
Answer : d) All of the mentioned
Q14: Which of the following is NOT supervised learning?
a. PCA
b. Decision Tree
c. Linear Regression
d. Naive Bayesian
Answer : a) PCA
Q15: Which of the following cross validation versions may not be suitable for very large
datasets with hundreds of thousands of samples?
a. k-fold cross-validation
b. Leave-one-out cross-validation
c. Hold out method
d. All of the above
Answer : b) Leave-one-out cross-validation
Part – B (Apply Level) – 2 Mark Questions
Q1 : Target Concept: The days on which Mr.X goes to cinema
Task: Learn to predict the value of watching Movie on any day
Binary Features:
Has money
Has free time
It’s a holiday
Training Examples: x1:<1,1,0> +ve
X2:<1,1,1> +ve
X3:<1,0,0> -ve
h1:<1,1,?>
h2:<1,?,?>
Choose whether the following statement is true or false
h1 is more general than h2
a. YES
b. NO
Answer : b) NO
Q2 : Target Concept: The days on which Mr.X goes to cinema
Task: Learn to predict the value of watching Movie on any day
Binary Features:
Has money
Has free time
It’s a holiday
Training Examples: x1:<1,1,0> +ve
X2:<1,1,1> +ve
X3:<1,0,0> -ve
Find the possible number of syntactically distinct hypothesis for the above
example
a. 16
b. 32
c. 64
d. 128
Answer : c) 64
Q3 : Target Concept: The days on which Mr.X goes to cinema
Task: Learn to predict the value of watching Movie on any day
Binary Features:
Has money
Has free time
It’s a holiday
Training Examples: x1:<1,1,0> +ve
X2:<1,1,1> +ve
X3:<1,0,0> -ve
In Find S algorithm for the given example, what is the hypothesis after the first
iteration
a. h1:<True, True, False>
b. h1:<False, False, False>
c. h1: <Θ,Θ,Θ>
d. h1:<True, False, False
Answer : a) h1:<True, True, False>
Q4 : Target Concept: The days on which Mr.X goes to cinema
Task: Learn to predict the value of watching Movie on any day
Binary Features:
1. Has money
2. Has free time
3. It’s a holiday
Training Examples: x1:<1,1,0> +ve
X2:<1,1,1> +ve
X3:<1,0,0> -ve
In Find S algorithm for the given example, What is the hypothesis before the first
iteration
a. h1:<True, True, False>
b. h1:<False, False, False>
c. h1: <Θ,Θ,Θ>
d. h1:<True, False, False
Answer : c) h1: <Θ,Θ,Θ>
Q5 :
Let h = < Sunny, warm, ?, strong,?,?>
Check whether h(x) is consistent with c(x)
a. YES
b. NO
Answer : a) YES
Q6 :
In decision tree representation, <Outlook=Sunny, Temp=Hot, Humidity=High,
Wind=Strong No, Write the expression for playing tennis
a. (Outlook = Sunny Humidity = Normal)
(Outlook = Overcast)
(Outlook = Rain Wind = Weak)
b. (Outlook = Sunny Humidity = Normal)
(Outlook = Overcast)
(Outlook = Rain Wind = Weak)
c. (Outlook = Sunny Humidity = Normal)
(Outlook = Overcast)
(Outlook = Rain Wind = Weak)
d. (Outlook = Sunny Humidity = Normal)
(Outlook = Overcast)
(Outlook = Rain Wind = Weak)
Answer : b) (Outlook = Sunny Humidity = Normal)
(Outlook = Overcast)
(Outlook = Rain Wind = Weak)
Q7 : This Version Space, containing all 6 hypotheses can be compactly represented with its
most specific (S) and most general (G) sets. How to generate all h in VS, given G and S?
x1 = <Sunny, Warm, Normal, Strong, Warm, Same>, +
x2 = <Sunny, Warm, High, Strong, Warm, Same>, +
x3 = <Rainy, Cold, High, Strong, Warm, Change>, -
x4 = <Sunny, Warm, High, Strong, Cool, Change>, +
a. <Sunny,?, Strong, ?, Change> <Sunny, warm, Normal, ?, ?, ?> <?, warm, ?, Strong, ?, ?
>
b. <Sunny,?, Strong, ?, Same> <Sunny, warm, ?, ?, cool, ?> <?, warm, ?, Strong, ?, ?>
c. <Sunny,?, Strong, ?, ?> <Sunny, warm, ?, ?, ?, ?> <?, warm, ?, Strong, ?, ?>
d. <Rainy,?, Strong, ?, ?> <Sunny, warm, ?, ?, ?, ?> <?, warm, High, Strong, ?, ?>
Answer : c) <Sunny,?, Strong, ?, ?> <Sunny, warm, ?, ?, ?, ?> <?, warm, ?, Strong, ?, ?>
Unit – II
Part – A ( Remember / Understand Level ) – 1 Mark Questions
Q1:
Q2: What is the use of Multi-Layer Feed-Forward Neural Network?
a. to realize structure of MLP
b. to solve pattern classification problem
c. to solve pattern mapping problem
d. to realize an approximation to an MLP
Answer: d) to realize an approximation to an MLP
Q3 : What are the issues on which biological networks proves to be superior than AI
networks?
a. robustness & fault tolerance
b. flexibility
c. collective computation
d. all of the mentioned
Answer: d) all of the mentioned
Q4 : The fundamental unit of network is____________
a. brain
b. nucleus
c. neuron
d. axon
Answer: d) axon
Q5 : What are dendrites?
a. fibers of nerves
b. nuclear projections
c. other name for nucleus
d. none of the mentioned
Answer: a) fibers of nerves
Q6: What is shape of dendrites like
a. oval
b. round
c. tree
d. rectangular
Answer: c) tree
Q7: Signal transmission at synapse is a?
a. physical process
b. chemical process
c. physical & chemical both
d. none of the mentioned
Answer: b) chemical process
Q8: How does the transmission/pulse acknowledged?
a. by lowering electric potential of neuron body
b. by raising electric potential of neuron body
c. both by lowering & raising electric potential
d. none of the mentioned
Answer: c) both by lowering & raising electric potential
Q9: Where does the chemical reactions take place in neuron?
a. dendrites
b. axon
c. synapses
d. nucleus
Answer: c) synapses
Q10: Function of dendrites is?
a. receptors
b. transmitter
c. both receptor & transmitter
d. none of the mentioned
Answer: a) receptors
Q11: What is purpose of Axon?
a. receptors
b. transmitter
c. transmission
d. none of the mentioned
Answer: c) transmission
Q12: The process of forming general concept definitions from examples of concepts to be
learned.
a. Deduction
b. abduction
c. induction
d. conjunction
Answer: c) induction
Q13: Neuro software is:
a. A software used to analyse neurons
b. It is powerful and easy neural network
c. Designed to aid experts in real-world
d. It is software used by Neuro surgeon
Answer: b) It is powerful and easy neural network
Q14: Why is the XOR problem exceptionally interesting to neural network researchers?
a. Because it can be expressed in a way that allows you to use a neural network
b. Because it is complex binary operation that cannot be solved using neural networks
c. Because it can be solved by a single layer perceptron
d. Because it is the simplest linearly inseparable problem that exists.
Answer: d) Because it is the simplest linearly inseparable problem that exists.
Q15: What is back propagation?
a. It is another name given to the curvy function in the perceptron
b. It is the transmission of error back through the network to adjust the inputs
c. It is the transmission of error back through the network to allow weights to be adjusted so that
the network can learn.
d. None of the mentioned
Answer: c) It is the transmission of error back through the network to allow weights to be adjusted so
that the network can learn.
Part – B (Apply Level) – 2 Mark Questions
Q1:
Q2:
Q3:
Answer: 0.9
Q4: Having multiple perceptron can actually solve the XOR problem satisfactorily: this
is because each perceptron can partition off a linear part of the space itself, and they can
then combine their results.
a. True –this works always, and these multiple perceptron learn to classify even complex
problems.
b. False –perceptron are mathematically incapable of solving linearly inseparable
functions
c. True –perceptron can do this but are unable to learn to do it –they have to be explicitly
hand-coded
d. False –just having a single perceptron is enough
Answer: c) True –perceptron can do this but are unable to learn to do it –they have to be
explicitly hand-coded
Q5 : Which of the following is true?
Single layer associative neural networks do not have the ability to:
(i) perform pattern recognition
(ii) find the parity of a picture
(iii)determine whether two or more shapes in a picture are connected or not
a. (ii) and (iii) are true
b. (ii) is true
c. All of the mentioned
d. None of the mentioned
Answer: a) (ii) and (iii) are true
Q6 : . Which of the following is true?
(i) On average, neural networks have higher computational rates than conventional computers.
(ii) Neural networks learn by example.
(iii) Neural networks mimic the way the human brain works.
a. All of the mentioned are true
b. ii) and iii) are true
c. (i), (ii) and (iii) are true
d. None of the mentioned
Answer: a) All of the mentioned are true
Q7: Which of the following is true for neural networks?
(i) The training time depends on the size of the network.
(ii) Neural networks can be simulated on a conventional computer.
(iii) Artificial neurons are identical in operation to biological ones.
a. All of the mentioned
b. (ii) is true
c. (i) and (ii) are true
d. None of the mentioned
Answer: c) (i) and (ii) are true
Q8: Which of the following is true?
Single layer associative neural networks do not have the ability to:
(i) perform pattern recognition
(ii) find the parity of a picture
(iii)determine whether two or more shapes in a picture are connected or not
a. (ii) and (iii) are true
b. (ii) is true
c. All of the mentioned
d. None of the mentioned
Answer: a) (ii) and (iii) are true
Unit – III
Part – A ( Remember / Understand Level ) – 1 Mark Questions
Q1:
Q2 & Q3 :
Q4 & Q5:
Q6:
Q7 & Q8:
Q9: Predicting the amount of rainfall for a particular day.
What is the term for the probability of a hypothesis before a new evidence E is observed
in the context of Bayesian reasoning?
a. A POSTERIORI
b. A PRIORI
c. UNCONDITIONAL
Answer: A PRIORI
Q10: Which of the following is not a classification task?
a. Predicting whether it will rain or not on a particular day
b. Grouping students into the groups- primary, high school, college
c. Given all the actors in a movie, predicting its genre
d. Predicting the amount of rainfall for a particular day.
Answer: d) Predicting the amount of rainfall for a particular day.
Q11 : Which of the following is assumed when classifying using the Naive Bayes method?
a. The features have strong correlation with each other
b. The features are all normalized before classification
c. The features are independent/conditionally independent of each other
d. The data is low dimensional in nature
Answer: c) The features are independent/conditionally independent of each other
Q12: Compared to the variance of the Maximum Likelihood Estimate (MLE), the
variance of the Maximum A Posteriori (MAP) estimate is ________
a) higher
b) same
c) lower
d) it could be any of the above
Answer: c) lower
Q13: High entropy means that the partitions in classification
a. Pure
b. not pure
c. useful
d. useless
Answer: b) Not pure
Q14: A and B are two events. If P(A, B) decreases while P(A) increases, which of the
following is true?
a) P(A|B) decrease
b) P(B|A) decreases
c) P(B) decreases
d) All of above
Answer: b) P(B|A) decrease
Q15: MLE estimates are often undesirable because
a. they are biased
b. they have high variance
c. they are not consistent estimators
d. None of the above
Answer: b) they have high variance
Part – B (Apply Level) – 2 Mark Questions
Q1: What is the name of the genetic algorithm operator that exchange values of two strings up to a
certain position?
Answer: CROSSOVER
Q2 : Let S be the state of a Patient having stiff neck, m be the state of the patient having
meningitis. A priori probability P(m) = 1/50000, P(s) = 0.01. P(s/m) = 0.7. What is the value of
P(m/s)?
a. 0.14
b. 0.014
c. 0.0014
d. 0.00014
Answer: c) 0.0014 (Explanation : P(m/s) = P(s/m)*P(m)/P(s))
Q3 : Two chromosomes
A 11101110
B 00110100
A two-point crossover operation between position 2 and 7 should be performed to give
two new chromosomes. Which constellation is correct?
a. 11101110, 00101100
b. 11110110 , 00101100
c. 11100110, 00110100
Answer: b) 11110110 , 00101100
Q4 : Which of the following is the joint probability of H, U, P, and W described by the
given Bayesian Network? [note: as the product of the conditional probabilities]
a. P(H, U, P, W) = P(H) * P(W) * P(P) * P(U)
b. P(H, U, P, W) = P(H) * P(W) * P(P | W) * P(W | H, P)
c. P(H, U, P, W) = P(H) * P(W) * P(P | W) * P(U | H, P)
d. None of the above
Q5: Which of the following methods can achieve zero training error on any linearly
separable dataset?
a. Decision tree
b. 15-nearest neighbors
c. Perceptron
d. Logistic regression
Answer: (a) Decision tree (b) Perceptron
Q6 : You are increasing the size of the layers (more hidden units per layer) in your
neural network. What kind of impact it will have on bias and variance?
a. increases, increases
b. increases, decreases
c. decreases, increases
d. decreases, decreases
Answer: d) decreases, decreases
Q7: In higher dimensions, covariance is represented by a matrix
The Gaussian is always symmetric about the mean
The mean of the Gaussian is the same as its mode
Which of the following statements is true?
a. Independent events must be mutually exclusive
b. The sum of probabilities of mutually exclusive events must be 1.
c. The sum of probabilities of mutually exclusive and collectively exhaustive events
must be 1
d. None of the above
Answer: c) The sum of probabilities of mutually exclusive and collectively exhaustive events
must be 1.
Q8: Suppose you are given an EM algorithm that finds maximum likelihood estimates for
a model with latent variables. You are asked to modify the algorithm so that it finds
MAP estimates instead. Which step or steps do you need to modify?
a. Expectation
b. Maximization
c. No modification necessary
d. Both
Answer: b) Maximization
Unit – IV
Part – A ( Remember / Understand Level ) – 1 Mark Questions
Q1: Which of the following sentence is FALSE regarding regression?
a. It relates inputs to outputs
b. It is used for prediction.
c. It may be used for interpretation
d. It discovers causal relationships.
Answer: d) It discovers causal relationships.
Q2 : K-fold cross-validation is
a. linear in K
b. quadratic in K
c. cubic in K
d. exponential in K
Answer: a) linear in K
Q3 : The training phase of a textual data mining process involves
a. removing common words from a dictionary
b. creating an attribute dictionary.
c. determining whether a document is about the topic under investigation.
d. modifying an initially created attribute dictionary.
Answer: b) creating an attribute dictionary
Q4 : A nearest neighbour approach is best used
a. with large-sized datasets
b. when irrelevant attributes have been removed from the data
c. when a generalized model of the data is desirable
d. when an explanation of what has been found is of primary importance
Answer: b) when irrelevant attributes have been removed from the data
Q5: Which statement is true about neural network and linear regression models?
a. Both models require input attributes to be numeric
b. Both models require numeric attributes to range between 0 and 1.
c. The output of both models is a categorical attribute value
d. Both techniques build models whose output is determined by a linear sum of weighted
input attribute values.
Answer: a) Both models require input attributes to be numeric
Q6 : Bayesian Belief network is characterized by
a. Cyclic graph, Conditional independence and causal
b. Acyclic graph, Conditional independence and causal
c. Acyclic graph, Conditional dependence and causal
d. Cyclic graph, Conditional dependence and noncausal
Answer: b) Acyclic graph, Conditional independence and causal
Q7: If a hypothesis says it should be positive, but in fact it is negative, we call it as
a. A consistent hypothesis
b. A false negative hypothesis
c. A false positive hypothesis
d. A True negative hypothesis
Answer: c) A false positive hypothesis
Q8: How the Bayesian network can be used to answer any query?
a. Full distribution
b. Joint distribution
c. Partial distribution
d. None of the above
Answer: b) Joint distribution
Q9: From the given diagram, write the independent attributes of campfire
a. Storm
b. Lightning
c. Thunder
d. Bus tour group
Answer: b) c)
Q10: Suppose you are given an EM algorithm that finds maximum likelihood estimates
for a model with latent variables. You are asked to modify the algorithm so that it finds
MAP estimates instead. Which step or steps do you need to modify?
a. Expectation
b. Maximization
c. Both
d. No modification
Answer: b) Maximization
Q11: Advantages of Distance weighted KNN algorithm __________
a. It is robust to noisy training data
b. Irrelevant attributes can be eliminated
c. highly effective inductive inference method
d. All of the above
Answer: d) All of the above
Q12: Instance based learning is also called____________
a. concept based learning
b. content based learning
c. memory based learning
d. hypothesis based leaning
Answer: c) memory-based learning
Q13: Give one of the instance-based learning algorithms____________________
a. KNN algorithm
b. Optimization algorithm
c. Gradient decent algorithm
d. decision tree algorithm
Answer: a) KNN algorithm
Q14: Write the weight measuring formula in distance weighted nearest neighbour
algorithm___________
Answer: wi=1/square(d(xq,xi))
Q15: write the minimize the square error equation in locally weighted linear
regression___________
Answer: E=1/2 Summation(square(f(x)-E(f(x))
Part – B (Apply Level) – 2 Mark Questions
Q1: A dataset of 1000 instances contains one attribute specifying the color of an object.
Suppose that 800 of the instances contain the value red for the color attribute. The
remaining 200 instances hold green as the value of the color attribute. What is the
domain predictability score for color = green?
a. 0.80
b. 0.20
c. 0.60
d. 0.40
Answer: b) 0.20
Q2 : From the given diagram, write the conditional probability for the campfire attribute
PDF value 0.4
Answer: P(Campfire = True/Storm = True, BusTourGroup = True) = 0.4
Q3 : Which of the following is true for the Gaussian distribution?
a. The mean of the distribution is always zero
b. In higher dimensions, covariance is represented by a matrix
c. The Gaussian is always symmetric about the mean
d. The mean of the Gaussian is the same as its mode
Answer: b) c) d)
Q4: Instances generated by a mixture of two Normal distributions with identical variance
'a'. The instances are shown by the points along the x axis. If the means ('m1 and m2')of
the Normal distributions are unknown, the EM algorithm can be used to search for their
maximum likelihood estimates. For the given diagram write the expectation step in the
EM algorithm
Answer: Calculate the expected value E[zij] of each hidden variable zi,, assuming the current
hypothesis h = (m1, m2) holds.
Q5: For a particular learning task, if the requirement of error parameter ϵ changes from
0.1 to 0.01. How many more samples will be required for PAC learning?
Same
2 times
10 times
1000 times
Answer: c) 10 times
Q6: Write the performance measures of the rules generated during the sequential
covering algorithm
Answer: Accuracy and coverage
Q7 : Match the following :
(i) OLAP (a) Regression
(ii) OLTP (b) Data Warehouse
(iii) Decision Tree (c) RDBMS
(iv) Neural Network (d) Classification
code:
(i) (ii) (iii) (iv)
a. (b) (c) (a) (d)
b. (b) (c) (d) (a)
c. (c) (b) (a) (d)
d. (c) (b) (d) (a)
Answer: (b). (b) (c) (d) (a)
Unit – V
Part – A ( Remember / Understand Level ) – 1 Mark Questions
Q1: Data mining is best described as the process of
a. identifying patterns in data.
b. deducing relationships in data.
c. representing data.
d. simulating trends in data.
Answer: a) identifying patterns in data
Q2 : Translate the following statement into FOL.
“For every a, if a is a philosopher, then a is a scholar”
a. ∀ a philosopher(a) scholar(a)
b. ∃ a philosopher(a) scholar(a)
c. All of the mentioned
d. None of the mentioned
Answer: a) ∀ a philosopher(a) scholar(a)
Q3 : The statement comprising the limitations of FOL is/are ____________
a. Expressiveness
b. Formalizing Natural Languages
c. Many-sorted Logic
d. All of the mentioned
Answer: d) All of the mentioned
Q4 : First Order Logic is also known as ___________
a. First Order Predicate Calculus
b. Quantification Theory
c. Lower Order Calculus
d. All of the mentioned
Answer: d) All of the mentioned
Q5: The rule of Universal Instantiation (UI for short) says that we can infer any sentence
obtained by substituting a ground term (a term without variables) for the variable.
a. True
b. False
Answer: a) True
Q6 : The corresponding Existential Instantiation rule: for the existential quantifier is
slightly more complicated. For any sentence a, variable v, and constant symbol k that
does not appear elsewhere in the knowledge base.
True
False
Answer: a) True
Q7: Translate the following statement into FOL.
“For every a, if a is a PhD student, then a has a master degree”
a. ∀ a PhD(a) -> Master(a)
b. ∃ a PhD(a) -> Master(a)
c. A is true, B is true
d. A is false, B is false
Answer: a) ∀ a PhD(a) -> Master(a)
Q8: What is/are the requirement for the Learn-One-Rule method?
a. Input, accepts a set of +ve and -ve training examples.
b. Output, delivers a single rule that covers many +ve examples and few -ve.
c. Output rule has a high accuracy but not necessarily a high coverage.
d. A & B
e. A, B & C
Answer: e) A, B & C
Q9: Suppose the reinforcement learning player was greedy, that is, it always played the
move that brought it to the position that it rated the best. Might it learn to play better, or
worse, than a non greedy player?
a. Worse
b. Better
Answer: b) Better
Q10: The model will be trained with data in one single batch is known as
a. Batch learning
b. Offline learning
c. Both A and B
d. None of the above
Answer: c) Both A and B
Q11: How do you handle missing or corrupted data in a dataset?
a. Drop missing rows or columns
b. Replace missing values with mean/median/mode
c. Assign a unique category to missing values
d. All of the above
Answer: d) All of the above
Q12: Data can be updated in _____environment.
A. data warehouse.
B. data mining.
C. operational.
D. informational.
Answer: c) operational.
Q13: _________maps the core warehouse metadata to business concepts, familiar and
useful to end-users.
a. Application level metadata.
b. User level metadata.
c. End user level metadata.
d. Core level metadata
Answer: a) Application level metadata.
Q14: ____________predicts future trends & behaviours, allowing business managers to
make proactive, knowledge-driven decisions.
a. Data warehouse.
b. Data mining.
c. DataMart’s.
d. Metadata.
Answer: b) Data mining
Q15: _________ is a subject-oriented, integrated, time-variant, non-volatile collection of
data in support of management decisions.
a. Data Mining.
b. Data Warehousing
c. Web Mining.
d. Text Mining.
Answer: b) Data Warehousing
Part – B (Apply Level) – 2 Mark Questions
Q1: A doctor knows that Cold causes fever 50% of the time. Prior probability of any
patient having cold is 1/50,000. Prior probability of any patient having fever is 1/20. If a
patient has fever, what is the probability he/she has cold?
a. P(C/F)= 0.0003
b. P(C/F)=0.0004
c. P(C/F)= 0.0002
d. P(C/F)=0.0045
Answer: c) P(C/F)= 0.0002
Q2 : Suppose you have trained a logistic regression classifier and it outputs a new
example x with a prediction ho(x) = 0.2. This means
a. Our estimate for P(y=1 | x)
b. Our estimate for P(y=0 | x)
c. Our estimate for P(y=1 | x)
d. Our estimate for P(y=0 | x)
Answer: b) Our estimate for P(y=0 | x)
Q3 : What is the rule for this pattern?
1st term:32
2nd term:36
3rd term:40
a. 4x+28
b. 4x+32
c. x+4
d. 4x
Answer: b) 4x+32
Q4 : Q-learning with linear function approximation (features) will always converge to the
optimal policy.
The number of parameters in a Bayesian network is exponential in the total number of
arcs in the graph.
a. True, True
b. True, False
c. False, True
d. False, False
Answer: d) False, False
Q5: What is the correct representation of GA?
a. GA(Fitness, Fitness_threshold, p)
b. GA(Fitness, Fitness_threshold, p, r )
c. GA(Fitness, Fitness_threshold, p, r, m)
d. GA(Fitness, Fitness_threshold)
Answer: c) GA(Fitness, Fitness_threshold, p, r, m)
Q6 : Correct ( h ) is the percent of all training examples correctly classified by hypothesis
h. then Fitness function is equal to
a. Fitness ( h) = (correct ( h)) 2
b. Fitness ( h) = (correct ( h)) 3
c. Fitness ( h) = (correct ( h))
d. Fitness ( h) = (correct ( h)) 4
Answer: a) Fitness ( h) = (correct ( h)) 2
Q7: When would the genetic algorithm terminate?
a. Maximum number of generations has been produced
b. Satisfactory fitness level has been reached for the population
c. Both A & B
d. None of these
Answer: c) Both A & B