Machine Learning - Trading
Machine Learning - Trading
Now that we have a basic understanding of Training & Testing Data sets, let us understand the
different types of machine learning tasks. Machine learning tasks can roughly fall into two main
categories
The 1st one where the expected outcome is clearly defined is called “Supervised Learning”
supervised learning, each training data consists of a pair of input objects and the desired output value
or target. The main task is to produce a function that will map input values to the output value in a
way such that when you have new input data you should be able to make reasonable prediction about
the target value. Such type of learning is called supervised learning because training data sets are
predefined and can be thought of as a teacher supervising the learning process.
Supervised learning problems can be divided into classification and regression problems. If a
supervised learning algorithm analyzes the training data and produces an output in classes or in a
discrete form such as 0/1, then it is called a classification problem. If the output is continuous, then it
is called a regression problem. For example, Suppose you want to classify pictures into labels such as
animals and birds. In order to make the classification, your algorithm is provided with labeled
pictures. After training data on these labeled pictures, your algorithm should be able to categorize
unlabeled images with some level of accuracy. This would be considered a classification problem.
Similarly, if you are given a data set containing the parameters which affect the price of a house and
you are expected to make a prediction for the price of the house then it is a regression problem. The
second one where the expected outcome is not defined is called unsupervised learning. Unsupervised
learning refers to a broad array of machine learning algorithms which are used to draw inferences
from datasets consisting of input data without labeled responses. This implies that the algorithm is
not presented with the right output for a sample input, but instead is forced to learn the correct way to
produce an output in an unsupervised manner. Unsupervised learning mostly forms the significant
part of learning for the human brain and hence is an important segment of machine learning. There
are two approaches to unsupervised learning, the first involves using some sort of reward system to
indicate success. This type of system falls under the decision problem framework because the goal is
to make decisions that maximizes rewards. The second type of unsupervised learning is called
clustering. Here, the goal is to find similarities in the training data. The assumption in such a system
is that the clusters discovered will match reasonably well with an intuitive classification. For
example, the clustering of individuals based on demographics might result in clustering the high and
low income groups in two cluster. It might seem that all machine learning machine learning
algorithms can be categorized into supervised and unsupervised learning, but that is not the case.
Reinforcement learning involves techniques that try to retro-feed the model to improve performance.
In order to accomplish this, the model needs to be able to interpret signals, decide on an action and
then compare the outcome against a predefined reward system. Reinforcement learning tries to
understand what needs to be done in order to maximize the rewards. This is not a supervised type of
learning, because it does not strictly depend on supervised or labelled data. It relies on the ability to
interpret the response of the actions being taken and measure them against the definition of the
reward. And it is not unsupervised learning either, because it receives and modifies the model
according to the feedback received against the predefined reward system. In the upcoming units, your
concept will be tested through several multiple choice questions and after which we will discuss an
application of machine learning in trading using reinforcement learning model.