ML Mock Test Solution
Ans B
Ans B
Ans C
Ans A
Ans C
6. Choose the options that are correct regarding machine learning (ML) and
artificial intelligence (AI):
(A) ML is an alternate way of programming intelligent machines.
(B) ML and AI have very different goals.
(C) ML is a set of techniques that turns a dataset into software.
(D) AI is software that can emulate the human mind.
Ans A, D
a. Decision Tree
b. Regression
c. Classification
d. Random Forest
Ans D
c. Both A and B
Ans C
Ans D
10. In order to determine the p-value of a hypothesis test, which of the following
is not needed?
a. whether the test is one-tail or two-tail
b. the value of the test statistic
c. the form of the null and alternate hypotheses
d. the level of significance
e. all of the above are needed to determine the p-value
Ans D
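The reasoning behind this answer can be sketched in code. The sketch assumes a z-test, which the question does not specify, and the function name `z_test_p_value` and its `tail` argument are hypothetical. Note that the level of significance never appears as an input; it only enters later, when the p-value is compared against it.

```python
import math

def z_test_p_value(z, tail="two"):
    """P-value of a z-test, computed from the test statistic and tail type only."""
    # Upper-tail probability P(Z >= z) for a standard normal variable.
    upper = 0.5 * math.erfc(z / math.sqrt(2))
    if tail == "upper":
        return upper
    if tail == "lower":
        return 1.0 - upper
    if tail == "two":
        # Both tails beyond |z|.
        return math.erfc(abs(z) / math.sqrt(2))
    raise ValueError("tail must be 'upper', 'lower', or 'two'")

p_two_tail = z_test_p_value(1.96, tail="two")
p_upper_tail = z_test_p_value(1.645, tail="upper")
```

Both calls give a p-value close to 0.05, and neither needed a significance level to be computed.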
11. The probability of rejecting the null hypothesis when it is true is called the:
a) Level of Confidence
b) Level of Significance
c) Level of Margin
d) Level of Rejection
Ans B
13. A feature F1 can take the values A, B, C, D, E, and F, and represents the grades
of students from a college.
Ans B
Ans B
15. To find the minimum or the maximum of a function, we set the gradient to zero
because:
c. Both A and B
Ans A
a. Factor analysis
Ans C
Ans C
Ans C
Ans A
Ans C,D
Ans C
Ans D
Ans D
25. Suppose you want to apply the AdaBoost algorithm to data D, which has T
observations. You initially set aside half the data for training and half for
testing. Now you want to increase the number of training data points T1, T2, …, Tn,
where T1 < T2 < … < Tn-1 < Tn. Which of the following is true about training and
testing error in this case?
A) The difference between training error and test error increases as number of
observations increases
B) The difference between training error and test error decreases as number of
observations increases
C) The difference between training error and test error will not change
D) None of These
Ans B
Ans C
27. Consider the hyperparameter “number of trees” and arrange the options in order
of the time taken to build the Gradient Boosting model. Assume the remaining
hyperparameters are the same.
1. Number of trees = 100
2. Number of trees = 500
3. Number of trees = 1000
A) 1~2~3
B) 1<2<3
C) 1>2>3
D) None of these
Ans B
28. In gradient boosting, it is important to choose the learning rate well to get
optimum output. Which of the following is true about choosing the learning rate?
A) Learning rate should be as high as possible
B) Learning Rate should be as low as possible
C) Learning Rate should be low but it should not be very low
D) Learning rate should be high but it should not be very high
Ans C
29. Given the following data pairs (x, y), find the regression equation.
(1, 1.24), (2, 5.23), (3, 7.24), (4, 7.60), (5, 9.97),
(6, 14.31), (7, 13.99), (8, 14.88), (9, 18.04), (10, 20.70)
a. y = 0.490 x - 0.053
b. y = 2.04 x
c. y = 1.98 x + 0.436
d. y = 0.49 x
Ans C
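The answer can be checked with a minimal ordinary least squares sketch. The helper name `least_squares_line` is hypothetical; the data pairs are the ones given in the question.

```python
def least_squares_line(points):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    # Sum of cross-deviations and sum of squared x-deviations.
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    s_xx = sum((x - mean_x) ** 2 for x, _ in points)
    slope = s_xy / s_xx
    intercept = mean_y - slope * mean_x
    return slope, intercept

data = [(1, 1.24), (2, 5.23), (3, 7.24), (4, 7.60), (5, 9.97),
        (6, 14.31), (7, 13.99), (8, 14.88), (9, 18.04), (10, 20.70)]
slope, intercept = least_squares_line(data)
print(f"y = {slope:.2f} x + {intercept:.3f}")  # y = 1.98 x + 0.436
```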
Ans D
Ans B
Ans A
33. When asked questions concerning personal hygiene, people commonly lie. This is
an example of:
a. sampling bias
b. confounding
c. non-response bias
d. response bias
Ans D
34. Selection of a football team for the FIFA World Cup is an example of:
a) random sampling
b) systematic sampling
c) purposive sampling
d) cluster sampling
Ans C
Ans B
36. How can a Bayesian network be used to answer any query?
a) Full distribution
b) Joint distribution
c) Partial distribution
d) All of the mentioned
Ans B
Ans A
38. What is the relationship between a node and its predecessors when creating a
Bayesian network?
a) Functionally dependent
b) Dependent
c) Conditionally independent
d) Both conditionally dependent and dependent
Ans C
39. Bag I contains 4 white and 6 black balls while another Bag II contains 4 white
and 3 black balls. One ball is drawn at random from one of the bags and it is found
to be black. Find the probability that it was drawn from Bag I.
A. 1/2
B. 3/5
C. 3/7
D. 7/12
Ans D
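Solution: Let E1 be the event of choosing bag I, E2 the event of choosing bag II,
and A the event of drawing a black ball.
By Bayes’ theorem, the probability that the ball came from bag I, given that it is
black, is
P(E1|A) = (P(E1) * P(A|E1)) / (P(E1) * P(A|E1) + P(E2) * P(A|E2))
With P(E1) = P(E2) = 1/2, P(A|E1) = 6/10 and P(A|E2) = 3/7:
P(E1|A) = (1/2 * 6/10) / (1/2 * 6/10 + 1/2 * 3/7) = (3/10) / (18/35) = 7/12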
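As a check on the bag problem, here is a minimal sketch of Bayes' theorem using exact fractions. The function name `posterior` is hypothetical, and the sketch assumes that each bag is equally likely to be chosen, which the question implies but does not state outright.

```python
from fractions import Fraction

def posterior(priors, likelihoods, index):
    """Bayes' theorem: P(E_index | A) from priors P(E_i) and likelihoods P(A | E_i)."""
    # Joint probabilities P(E_i) * P(A | E_i) for every hypothesis.
    joint = [p * l for p, l in zip(priors, likelihoods)]
    return joint[index] / sum(joint)

# Assumption: bag I and bag II are chosen with probability 1/2 each.
priors = [Fraction(1, 2), Fraction(1, 2)]
# P(black | bag I) = 6/10, P(black | bag II) = 3/7.
black_given_bag = [Fraction(6, 10), Fraction(3, 7)]
p_bag1 = posterior(priors, black_given_bag, 0)
print(p_bag1)  # 7/12
```

The same function handles any number of hypotheses, so it also applies to the three-supplier problem in question 42.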
40. Prior probabilities in Bayes' theorem that are revised with the help of newly
available information are classified as _________________
a) independent probabilities
b) posterior probabilities
c) interior probabilities
d) dependent probabilities
Ans B
41. At a certain university, 4% of men are over 6 feet tall and 1% of women are
over 6 feet tall. The total student population is divided in the ratio 3:2 in
favour of women. If a student is selected at random from among all those over six
feet tall, what is the probability that the student is a woman?
a) 2⁄5
b) 3⁄5
c) 3⁄11
d) 1⁄100
Ans C
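A short worked derivation of this answer, with W = woman, M = man, and T = over six feet tall (symbols introduced here for illustration). The 3:2 ratio in favour of women gives P(W) = 0.6 and P(M) = 0.4.

```latex
P(W \mid T) = \frac{P(T \mid W)\,P(W)}{P(T \mid W)\,P(W) + P(T \mid M)\,P(M)}
            = \frac{0.01 \times 0.6}{0.01 \times 0.6 + 0.04 \times 0.4}
            = \frac{0.006}{0.022}
            = \frac{3}{11}
```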
42. Three companies A, B and C supply 25%, 35% and 40% of the notebooks to a
school. Past experience shows that 5%, 4% and 2% of the notebooks produced by these
companies are defective. If a notebook was found to be defective, what is the
probability that the notebook was supplied by A?
a) 44⁄69
b) 25⁄69
c) 13⁄24
d) 11⁄24
Answer: b
Explanation: Let A, B and C be the events that notebooks are provided by A, B and C
respectively.
Let D be the event that notebooks are defective
Then,
P(A) = 0.25, P(B) = 0.35, P(C) = 0.4
P(D|A) = 0.05, P(D|B) = 0.04, P(D|C) = 0.02
P(A|D) = (P(D|A) * P(A)) / (P(D|A) * P(A) + P(D|B) * P(B) + P(D|C) * P(C))
= (0.05 * 0.25) / ((0.05 * 0.25) + (0.04 * 0.35) + (0.02 * 0.4)) = 0.0125/0.0345
= 25⁄69.
Ans D
Ans B
45. The relationship between number of beers consumed (x) and blood alcohol content
(y) was studied in 16 male college students by using least squares regression. The
following regression equation was obtained from this study:
ŷ = -0.0127 + 0.0180x
The above equation implies that:
a. each beer consumed increases blood alcohol by 1.27%
b. on average it takes 1.8 beers to increase blood alcohol content by 1%
c. each beer consumed increases blood alcohol by an average amount of 1.8%
d. each beer consumed increases blood alcohol by exactly 0.018
Ans C
Ans C
Ans C
A. 2 Only
B. 1 and 2
C. 1 and 3
D. 2 and 3
E. 1, 2 and 3
Ans C
Regression
Classification
Clustering
Reinforcement Learning
Options:
A. 1 and 3
B. 1, 2 and 3
C. 1, 2 and 4
D. 1, 2, 3 and 4
Ans C
50. How can clustering (unsupervised learning) be used to improve the accuracy of
a linear regression model (supervised learning)?
A. 1 only
B. 1 and 2
C. 1 and 4
D. 3 only
E. 2 and 4
51. Which of the following is the most appropriate strategy for data cleaning
before performing clustering analysis, given a less than desirable number of data
points?
A. 1 only
B. 2 only
C. 1 and 2
Ans A
52. What is the minimum number of variables/features required to perform clustering?
A. 0
B. 1
C. 2
D. 3
Ans B
53. If two variables, V1 and V2, are used for clustering, which of the following
are true for K-means clustering with k = 3?
A. 1 only
B. 2 only
C. 1 and 2
Ans A
54. Which of the following can act as possible termination conditions in K-Means?
1. For a fixed number of iterations.
2. Assignment of observations to clusters does not change between iterations,
except for cases with a bad local minimum.
3. Centroids do not change between successive iterations.
4. Terminate when RSS falls below a threshold.
Options:
A. 1, 3 and 4
B. 1, 2 and 3
C. 1, 2 and 4
D. All of the above
Ans D
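The four termination conditions can be seen side by side in a minimal K-Means sketch. This is plain Lloyd's algorithm on tuples, not any particular library's implementation. The function name `kmeans`, its parameters, and the sample points are hypothetical.

```python
import math

def kmeans(points, centroids, max_iter=100, rss_threshold=0.0):
    """Lloyd's k-means on tuples of floats; returns (centroids, labels, stop reason)."""
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(len(centroids)), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        # Condition 2: assignments did not change between iterations.
        if new_labels == labels:
            return centroids, labels, "assignments unchanged"
        labels = new_labels

        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for j, old in enumerate(centroids):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centroids.append(tuple(sum(c) / len(members) for c in zip(*members)))
            else:
                new_centroids.append(old)  # keep an empty cluster's centroid in place
        # Condition 3: centroids did not change between iterations.
        if new_centroids == centroids:
            return centroids, labels, "centroids unchanged"
        centroids = new_centroids

        # Condition 4: residual sum of squares (RSS) fell below a threshold.
        rss = sum(math.dist(p, centroids[lab]) ** 2 for p, lab in zip(points, labels))
        if rss < rss_threshold:
            return centroids, labels, "RSS below threshold"
    # Condition 1: the fixed number of iterations was reached.
    return centroids, labels, "max iterations reached"

points = [(1.0, 1.0), (1.5, 2.0), (8.0, 8.0), (9.0, 9.5)]
init = [(1.0, 1.0), (9.0, 9.5)]
final_centroids, final_labels, reason = kmeans(points, init)
```

On this toy data the default call stops because the assignments repeat. Passing `max_iter=1` or `rss_threshold=3.0` triggers the other exits.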
Ans A
56. Feature scaling is an important step before applying the K-Means algorithm. What
is the reason behind this?
A. In distance calculation it will give the same weights for all features
B. You always get the same clusters whether or not you use feature scaling
C. In Manhattan distance it is an important step but in Euclidean it is not
D. None of these
Ans A
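A small sketch of why scaling matters for distance-based clustering. The data (age in years, income in dollars) and the helper name `standardize` are hypothetical. Before scaling, the income column dominates the Euclidean distance; after standardization, both features carry comparable weight.

```python
import math

def standardize(rows):
    """Rescale each column to zero mean and unit (population) standard deviation."""
    columns = list(zip(*rows))
    means = [sum(col) / len(col) for col in columns]
    stds = [math.sqrt(sum((v - m) ** 2 for v in col) / len(col))
            for col, m in zip(columns, means)]
    return [tuple((v - m) / s for v, m, s in zip(row, means, stds)) for row in rows]

# Hypothetical data: (age in years, income in dollars).
rows = [(25, 30000), (60, 31000), (26, 50000)]
raw_0_1 = math.dist(rows[0], rows[1])  # rows differ mostly in age
raw_0_2 = math.dist(rows[0], rows[2])  # rows differ mostly in income

scaled = standardize(rows)
scaled_0_1 = math.dist(scaled[0], scaled[1])
scaled_0_2 = math.dist(scaled[0], scaled[2])
```

On the raw data the second distance is roughly twenty times the first, purely because of units. After scaling the two distances are nearly equal.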
A. Evolutionary Computing
Ans D
Ans B
Ans C
60. A represents the dominant allele and a represents the recessive allele of a
pair. If, in 1000 offspring, 500 are aa and 500 are of some other genotype, which
of the following are most probably the genotypes of the parents?
a. Aa and Aa
b. Aa and aa
c. AA and Aa
d. AA and aa
e. aa and aa
Ans B
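The expected offspring ratios for each candidate cross can be enumerated with a Punnett-square sketch. It assumes simple single-gene Mendelian inheritance with equally likely gametes, and the function name `offspring_ratios` is hypothetical.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

def offspring_ratios(parent1, parent2):
    """Expected genotype fractions from a single-gene cross (Punnett square)."""
    # Each parent contributes one allele; sorting writes "aA" as "Aa".
    counts = Counter("".join(sorted(a + b)) for a, b in product(parent1, parent2))
    total = sum(counts.values())
    return {genotype: Fraction(c, total) for genotype, c in counts.items()}

for cross in [("Aa", "Aa"), ("Aa", "aa"), ("AA", "Aa"), ("AA", "aa"), ("aa", "aa")]:
    print(cross, offspring_ratios(*cross))
```

Under this model, Aa x aa yields one half aa offspring, while Aa x Aa yields one quarter aa, so a 500:500 split points to the Aa x aa cross.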
61. Sickle cell anaemia is a genetic disorder. Which of the following does not hold
true for it?
a) It can be analysed by PCR
b) It destroys a restriction site
c) The mutation is in alpha globulin gene
d) The conventional approach took weeks for the whole analyses to be carried out
Ans C
62. Which of the following is the most likely explanation for a high rate of
crossing-over between two genes?
a. The two genes are far apart on the same chromosome.
b. The two genes are both located near the centromere.
c. The two genes are sex-linked.
d. The two genes code for the same protein.
e. The two genes are on different chromosomes.
Ans A
Ans B
Ans D
Ans C
Ans 3
68. All of the following are suitable problems for genetic algorithms EXCEPT:
1. dynamic process control
2. pattern recognition with complex patterns
3. simulation of biological models
4. simple optimization with few variables
Ans 4
Ans C
Ans B