100% found this document useful (1 vote)
2K views12 pages

Machine Learning Multiple Choice Questions - Free Practice Test

Machine Learning Multiple Choice Questions _ Free Practice Test

Uploaded by

arafaliwijaya
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
100% found this document useful (1 vote)
2K views12 pages

Machine Learning Multiple Choice Questions - Free Practice Test

Machine Learning Multiple Choice Questions _ Free Practice Test

Uploaded by

arafaliwijaya
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 12

Big Data and Analytics

Machine Learning Multiple Choice Questions - Free Practice Test

6042 Tests taken

Machine learning is a revolutionary technology that’s changing how businesses and industries
function across the globe in a good way. This Machine Learning quiz, is a free practice test that
is focused to help people wanting to start their career in the Machine learning industry. This
Machine Learning Bootcamp helps you assess how prepared are you for the certification exam.
Here, you get Machine Learning MCQs that test your knowledge on the technology. These
Machine Learning Questions are prepared by subject matter experts and are in line with the
questions you can come across in certification exam. Take this test today!

Improve Your Skills with this Course

Machine Learning

6729 Ratings

Exclusive 30% OFF

EXPLORE COURSE

Explanations

20 9 45
Questions Correct Answers % Correct Answers

1. Which of the following is a widely used and effective machine learning algorithm based
on the idea of bagging?

Decision Tree

Regression

Classification

Random Forest

Explanations:

The Radom Forest algorithm builds an ensemble of Decision Trees, mostly trained with the bagging
method.

2. To find the minimum or the maximum of a function, we set the gradient to zero because:

The value of the gradient at extrema of a function is always zero

Depends on the type of problem

Both A and B

None of the above

Explanations:

The gradient of a multivariable function at a maximum point will be the zero vector of the function,
which is the single greatest value that the function can achieve.

3. The most widely used metrics and tools to assess a classification model are:

Confusion matrix

Cost-sensitive accuracy

Area under the ROC curve


All of the above

Explanations:

The model performance assessment for classification algorithms encorporates all of the above
techniques.

4. Which of the following is a good test dataset characterstic?

Large enough to yield meaningful results

Is representative of the dataset as a whole

Both A and B

None of the above

Explanations:

A good test dataset has a good amount of sample population and equal ratios of class representation.

5. Which of the following is a disadvantage of decision trees?

Factor analysis

Decision trees are robust to outliers

Decision trees are prone to be overfit

None of the above

Explanations:

Allowing a decision tree to split to a granular degree makes decision trees prone to learning every point
extremely well to the point of perfect classification that is overfitting.

6. How do you handle missing or corrupted data in a dataset?

Drop missing rows or columns


Drop missing rows or columns

Replace missing values with mean/median/mode

Assign a unique category to missing values

All of the above

Explanations:

All of the above techniques are different ways of imputing the missing values.

7. What is the purpose of performing cross-validation?

To assess the predictive performance of the models

To judge how the trained model performs outside the sample on test data

Both A and B

Explanations:

Cross-validation is a model validation technique for assessing how the results of a statistical analysis
will generalize to an independent data set.

8. Why is second order differencing in time series needed?

To remove stationarity

To find the maxima or minima at the local point

Both A and B

None of the above

Explanations:

If the second-order difference is positive, the time series will curve upward and if it is negative, the time
series will curve downward at that time.

9. When performing regression or classification, which of the following is the correct way
to preprocess the data?
to preprocess the data?

Normalize the data -> PCA -> training

PCA -> normalize PCA output -> training

Normalize the data -> PCA -> normalize PCA output -> training

None of the above

Explanations:

You need to always normalize the data first. If not, PCA or other techniques that are used to reduce
dimensions will give different results.

10. Which of the folllowing is an example of feature extraction?

Constructing bag of words vector from an email

Applying PCA projects to a large high-dimensional data

Removing stopwords in a sentence

All of the above

Explanations:

All of the above techniques transform raw data into features which can be used as inputs to machine
learning algorithms.

11. What is pca.components_ in Sklearn?

Set of all eigen vectors for the projection space

Matrix of principal components

Result of the multiplication matrix

None of the above options

Explanations:
pca.components_ is the set of all eigen vectors for the projection space.

12. Which of the following is true about Naive Bayes ?

Assumes that all the features in a dataset are equally important 

Assumes that all the features in a dataset are independent

Both A and B

None of the above options

Explanations:

Naive Bayes assumes that all the features in a data set are equally important and independent.

13. Which of the following statements about regularization is not correct?

Using too large a value of lambda can cause your hypothesis to underfit the data.

Using too large a value of lambda can cause your hypothesis to overfit the data.

Using a very large value of lambda cannot hurt the performance of your hypothesis.

None of the above

Explanations:

A large value results in a large regularization penalty and therefore, a strong preference for simpler
models, which can underfit the data.

14. How can you prevent a clustering algorithm from getting stuck in bad local optima?

Set the same seed value for each run

Use multiple radom initializations

Both A and B

None of the above


Explanations:

K-Means clustering algorithm has the drawback of converging at local minima which can be prevented
by using multiple radom initializations.

15. Which of the following techniques can be used for normalization in text mining?

Stemming

Lemmatization

Stop Word Removal

Both A and B

Explanations:

Lemmatization and stemming are the techniques of keyword normalization.

16. In which of the following cases will K-means clustering fail to give good results?
1)
Data points with outliers
2) Data points with different densities
3) Data points with
nonconvex shapes

1 and 2

2 and 3

1, 2, and 3

1 and 3

Explanations:

K-means clustering algorithm fails to give good results when the data contains outliers, the density
spread of data points across the data space is different, and the data points follow nonconvex shapes.

17. Which of the following is a reasonable way to select the number of principal
components "k"?
Choose k to be the smallest value so that at least 99% of the varinace is retained.

Choose k to be 99% of m (k = 0.99*m, rounded to the nearest integer).

Choose k to be the largest value so that 99% of the variance is retained.

Use the elbow method.

Explanations:

This will maintain the structure of the data and also reduce its dimension.

18. You run gradient descent for 15 iterations with a=0.3 and compute J(theta) after each
iteration. You find that the value of J(Theta) decreases quickly and then levels off. Based
on this, which of the following conclusions seems most plausible?

Rather than using the current value of a, use a larger value of a (say a=1.0)

Rather than using the current value of a, use a smaller value of a (say a=0.1)

a=0.3 is an effective choice of learning rate

None of the above

Explanations:

You need the gradient descent to quickly converge to the minimum. So the current setting of a seems to
be good.

19. What is a sentence parser typically used for?

It is used to parse sentences to check if they are utf-8 compliant.

It is used to parse sentences to derive their most likely syntax tree structures.

It is used to parse sentences to assign POS tags to all tokens.

It is used to check if sentences can be parsed into meaningful tokens.

Explanations:
Sentence parsers analyze a sentence and automatically build a syntax tree.

20. Suppose you have trained a logistic regression classifier and it outputs a new example
x with a prediction ho(x) = 0.2. This means

Our estimate for P(y=1 | x

Our estimate for P(y=0 | x

Our estimate for P(y=1 | x

Our estimate for P(y=0 | x

Improve Your Score with this Course

Machine Learning

6729 Ratings

Exclusive 30% OFF

EXPLORE COURSE

course page fpt faqs

What is a Machine Learning practice test?

The Machine Learning free practice test is a simulation of the actual Machine Learning certification exam.
The test consists of 20 multiple choice questions that are likely to be faced in the actual exam. The test is
helpful in understanding whether you have the skills that are required to become a Machine Learning
engineer.
Who can take up this Machine Learning quiz?

This Machine Learning online test can be taken by anyone who is preparing to pass the Machine Learning
certification exam.

What will I learn from this mock exam?

The Machine Learning practice exam is designed to test your knowledge of machine learning concepts
and techniques. There are 25 multiple choice questions in the test which are helpful in analyzing your
strong and weak areas in topics like supervised and unsupervised learning, mathematical and heuristic
aspects, hands-on modeling, and more.

What is included in this machine learning exam simulator?

There are 20 machine learning practice test questions in the test that need to be answered in 25 minutes.
You can even pause the test and resume it later.

Will the Machine Learning quiz questions help in clearing the actual certification exam?

Yes, this Machine Learning mock test gives a complete overview of what you will face in the actual
certification exam. While we do not guarantee excellent results in the exam, you will definitely be able to
pass the exam with much ease.

Are these the same questions I'll see on the real exam?

The questions asked in this test are much like the questions expected in the actual certification exam.

Will the Practice Tests be updated frequently?

Yes, the Machine Learning MCQs are periodically updated and all the latest information related to

machine learning is incorporated.

Can I retake the Machine Learning certification dumps?

Yes, the Machine Learning objective questions can be taken repeatedly if you find it suitable. However,
you should ensure that your preparation is adequate so as to get a better test experience. 
What are the requirements to take the Machine Learning dumps?

This Machine Learning exercise comes with no prior conditions.

I didn’t do well on this practice test. What should I do now?

If you are dissatisfied with your performance, you can retake the Machine Learning exam dumps multiple
times. You can also get a better grasp of all the machine learning concepts by taking our Machine
Learning Certification Course and then attempt the practice test.

Recommended Resources

Introduction to Machine Learning: A Beginner's Guide

How to Develop a Machine Learning Career

Machine Learning Career Guide: A complete playbook to becoming a


Machine Learning Engineer

How to Build a Career in AI and Machine Learning

© 2009 -2021- Simplilearn Solutions


Disclaimer
PMP, PMI, PMBOK, CAPM, PgMP, PfMP, ACP, PBA, RMP, SP, and OPM3 are registered marks of the Project Management Institute, Inc.

You might also like