Reading 3 Machine Learning
Reading 3 Machine Learning
The technique in which a machine learns to model a set of output data from a given set of
inputs is best described as:
A) unsupervised learning.
B) deep learning.
C) supervised learning.
Over the past several years, Kowalski has become aware that investment firms are
increasingly using technology to improve their investment decision making. Kowalski has
become particularly interested in machine learning techniques and how they might be
applied to investment management applications.
Kowalski has read a number of articles about machine learning in various journals for
financial analysts. However, she has only a minimal knowledge of how she might source
appropriate model inputs, interpret model outputs, and translate those outputs into
investment actions.
Kowalski and Nowak meet to discuss plans for incorporating machine learning into their
investment model. Kowalski asks Nowak to research machine learning and report back on
the types of investment problems that machine learning can address, how the algorithms
work, and what the various terminology means.
After spending a few hours researching the topic, Nowak makes a number of statements to
Kowalski on the topics of:
Kowalski is left to work out which of Nowak's statements are fully accurate and which are
not.
Question #2 - 5 of 23 Q ti ID 1472228
Question #2 - 5 of 23 Question ID: 1472228
Nowak first tries to explain classification and regression tree (CART) to Kowalski. CART is
least likely to be applied to predict a:
Which of the following statements Nowak makes about hierarchical clustering is most
accurate?
Bottom-up hierarchical clustering begins with each observation being its own
A)
cluster.
B) In divisive hierarchical clustering, the algorithm seeks out the two closest clusters.
Hierarchical clustering is a supervised iterative algorithm that is used to build a
C)
hierarchy of clusters.
Which of the following statements Nowak makes about neural networks is most accurate?
Neural networks:
have four types of layers: an input layer, agglomerative layers, regularization layers,
A)
and an output layer.
have an input layer node that consists of a summation operator and an activation
B)
function.
C) are effective in tasks with non-linearities and complex interactions among variables.
The unsupervised machine learning algorithm that reduces highly correlated features into
fewer uncorrelated composite variables by transforming the feature covariance matrix best
describes:
Joyce Tan manages a medium-sized investment fund at Marina Bay Advisors that specializes
in international large cap equities. Over the four years that she has been portfolio manager,
Tan has been invested in approximately 40 stocks at a time.
Tan has used a number of methodologies to select investment opportunities from the
universe of investable stocks. In some cases, Tan uses quantitative measures such as
accounting ratios to find the most promising investment candidates. In other cases, her
team of analysts suggest investments based on qualitative factors and various investment
hypotheses.
Tan begins to wonder if her team could leverage financial technology to make better
decisions. Specifically, she has read about various machine learning techniques to extract
useful information from large financial datasets, in order to uncover new sources of alpha.
Tan is interested in using a supervised learning algorithm to analyze stocks. This task is least
likely to be a classification problem if the target variable is:
A) ordinal.
B) continuous.
C) categorical.
After Tan implements a particular new supervised machine learning algorithm, she begins to
suspect that the holdout samples she is using are reducing the training set size too much. As
a result, she begins to make use of K-fold cross-validation. In the K-fold cross-validation
technique, after Tan shuffles the data randomly it is most likely that:
At first Tan bases her stock picks on the results of a single machine-learning model, but then
begins to wonder if she should instead be using the predictions of a group of models.
Compared to a single machine-learning model, an ensemble machine learning algorithm is
most likely to produce predictions that are:
A) more precise but less dependable.
B) more accurate and more stable.
C) less reliable but more steady.
Tan is interested in applying neural networks, deep learning nets, and reinforcement
learning to her investment process. Regarding these techniques, which of the following
statements is most accurate?
Neural networks with one or more hidden layers would be considered deep learning
A)
nets (DLNs).
Reinforcement learning algorithms achieve maximum performance when they stay
B)
as far away from their constraints as possible.
Neural networks work well in the presence of non-linearities and complex
C)
interactions among variables.
Which supervised learning model is most appropriate (1) when the Y-variable is continuous
and (2) when the Y-variable is categorical
A) Regression Classification
What is the appropriate remedy in the presence of excessive number of features in a data
set?
A) Dimension reduction.
B) Unsupervised learning.
C) Big data analysis.
The degree to which a machine learning model retains its explanatory power when
predicting out-of-sample is most commonly described as:
A) hegemony.
B) generalization.
C) predominance.
Considering the various supervised machine learning algorithms, a linear classifier that
seeks the optimal hyperplane and is typically used for classification, best describes:
A) unsupervised learning.
B) clustering.
C) supervised learning.
An algorithm that involves an agent that performs actions that will maximize its rewards
over time, taking into consideration the constraints of its environment, best describes: