Module 1
Matrix Decomposition.
Version Spaces,
PAC Learning,
VC Dimension.
PROBABILITY
Probability is the branch of mathematics concerning
numerical descriptions of how likely an event is to occur,
or how likely it is that a proposition is true.
The probability of an event is a number between 0 and 1,
where, roughly speaking, 0 indicates impossibility of the
event and 1 indicates certainty.
PROBABILITY OF AN EVENT
Bernoulli Distribution
Uniform Distribution
Normal Distribution
BERNOULLI DISTRIBUTION
The probabilities of success and failure need not be equally likely, like the
result of a fight between me and Mary Kom. She is pretty much certain to win.
So in this case the probability of my success is 0.15, while the probability of my failure is 0.85.
The expected value of a random variable X from a
Bernoulli distribution is found as follows:
E(X) = 1*p + 0*(1-p) = p
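As a minimal sketch, the expectation above can be checked numerically. The function names, the seed and the number of simulated draws are assumptions made for illustration; p = 0.15 is the success probability from the example.

```python
import random

def bernoulli_expectation(p):
    """Exact expected value of a Bernoulli(p) variable: 1*p + 0*(1-p)."""
    return 1 * p + 0 * (1 - p)

def bernoulli_sample_mean(p, n, seed=0):
    """Average of n simulated draws (1 = success, 0 = failure)."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    draws = [1 if rng.random() < p else 0 for _ in range(n)]
    return sum(draws) / n

p = 0.15
exact = bernoulli_expectation(p)
estimate = bernoulli_sample_mean(p, n=100_000)
print(exact, estimate)
```

With many draws the sample mean should land close to p, which is what E(X) = p predicts.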
The Uniform distribution curve is rectangular in shape, which is
why the Uniform distribution is also called the rectangular distribution.
For a Uniform Distribution, the parameters are a and b, the minimum and maximum values.
EXAMPLE
The number of bouquets sold daily at a flower shop is
uniformly distributed with a maximum of 40 and a
minimum of 10.
Let’s try calculating the probability that the daily sales
will fall between 15 and 30.
The probability that daily sales will fall between 15 and
30 is (30-15)*(1/(40-10)) = 0.5
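The same calculation can be sketched in code. The helper name is hypothetical, and sales are treated as a continuous quantity on [10, 40], as the example does.

```python
def uniform_interval_probability(a, b, lo, hi):
    """P(lo <= X <= hi) for X uniformly distributed on [a, b]."""
    if not a < b:
        raise ValueError("need a < b")
    # Only the part of [lo, hi] that overlaps [a, b] carries probability.
    lo, hi = max(lo, a), min(hi, b)
    if hi <= lo:
        return 0.0
    # Interval width times the constant density 1/(b - a).
    return (hi - lo) / (b - a)

prob = uniform_interval_probability(a=10, b=40, lo=15, hi=30)
print(prob)
```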
The mean and variance of X following a uniform
distribution on [a, b] are:
E(X) = (a + b) / 2
Var(X) = (b - a)^2 / 12
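For the flower shop example, these two quantities can be computed with the standard closed forms for the continuous uniform distribution, which are written out in the docstrings. This is a small sketch and the function names are assumptions.

```python
def uniform_mean(a, b):
    """Mean of a continuous uniform distribution on [a, b]: (a + b) / 2."""
    return (a + b) / 2

def uniform_variance(a, b):
    """Variance of a continuous uniform distribution on [a, b]: (b - a)^2 / 12."""
    return (b - a) ** 2 / 12

# Daily bouquet sales between a = 10 and b = 40.
mean_sales = uniform_mean(10, 40)
var_sales = uniform_variance(10, 40)
print(mean_sales, var_sales)
```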
Exactly half of the values are to the left of the center and the other
half are to the right.
The line comes from plugging in different values into the unknown x to find out what
the equation or model does to the value of y.
We can line up a system of equations with the same form with two or more
unknowns; for example:
y = 0.1 × x1 + 0.4 × x2
y = 0.3 × x1 + 0.9 × x2
y = 0.2 × x1 + 0.3 × x2
· · ·                                   (1.2)
The column of y values can be taken as a column vector of outputs from the equation.
The two columns of decimal values are the data columns, say a1 and a2, and can be
taken as a matrix A.
The two unknown values x1 and x2 can be taken as the coefficients of the equation
and together form a vector of unknowns b to be solved.
We can write this compactly using linear algebra notation as:
y = A · b
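To make the notation concrete, here is a minimal sketch of the product of A and b using plain Python lists. The rows of A are the data columns from the equations in (1.2). The values chosen for b are hypothetical, since the text leaves x1 and x2 unknown.

```python
def matvec(A, b):
    """Multiply matrix A (a list of rows) by vector b."""
    if any(len(row) != len(b) for row in A):
        raise ValueError("each row of A must have len(b) entries")
    # Each output is the dot product of one row of A with b.
    return [sum(a_ij * b_j for a_ij, b_j in zip(row, b)) for row in A]

A = [[0.1, 0.4],
     [0.3, 0.9],
     [0.2, 0.3]]
b = [1.0, 2.0]  # hypothetical values for the coefficients x1 and x2
y = matvec(A, b)
print(y)
```

Each entry of y is one line of the system evaluated at the chosen coefficients.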
NUMERICAL LINEAR ALGEBRA
The application of linear algebra in computers is often called
numerical linear algebra.
“Numerical” linear algebra is really applied linear algebra.
It is more than just the implementation of linear algebra
operations in code libraries; it also includes the careful
handling of the problems of applied mathematics, such as
working with the limited floating point precision of digital
computers.
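A short illustration of the floating point issue, using only the Python standard library: decimal fractions such as 0.1 and 0.2 are stored as binary approximations, so exact equality tests can fail and a tolerance-based comparison is the usual workaround.

```python
import math

total = 0.1 + 0.2

# Exact comparison fails because 0.1 and 0.2 have no exact binary representation.
print(total == 0.3)              # False

# Comparing within a relative tolerance gives the intended answer.
print(math.isclose(total, 0.3))  # True
```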
Computers are good at performing linear algebra
calculations, and much of the dependence on Graphics
Processing Units (GPUs) by modern machine learning
methods such as deep learning is because of their ability to
compute linear algebra operations fast.
LINEAR ALGEBRA AND STATISTICS
Linear algebra is a valuable tool in other branches of
mathematics, especially statistics.
Use of vector and matrix notation, especially with
multivariate statistics.
Solutions to least squares and weighted least squares,
such as for linear regression.
Estimates of mean and variance of data matrices.
Each row is encoded as a binary vector, a vector with zero or one values.
This is an example of a sparse representation, a whole sub-field of linear
algebra.
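The encoding described above can be sketched as follows. The categorical column of colors and the function name are hypothetical, chosen only to show what the binary vectors look like.

```python
def one_hot_encode(values):
    """Encode each value as a binary vector with a single 1."""
    categories = sorted(set(values))
    index = {category: i for i, category in enumerate(categories)}
    rows = []
    for value in values:
        row = [0] * len(categories)
        row[index[value]] = 1  # mark the position of this row's category
        rows.append(row)
    return categories, rows

colors = ["red", "green", "blue", "green"]  # hypothetical data column
categories, encoded = one_hot_encode(colors)
print(categories)
for row in encoded:
    print(row)
```

Most entries in each row are zero, which is why the representation is called sparse.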
LINEAR REGRESSION
Linear regression is an old method from statistics for
describing the relationships between variables.
It is often used in machine learning for predicting numerical
values in simpler regression problems.
There are many ways to describe and solve the linear
regression problem, i.e. finding a set of coefficients that
when multiplied by each of the input variables and added
together results in the best prediction of the output variable.
Even the common way of summarizing the linear regression
equation uses linear algebra notation:
y = A · b
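One way to find the coefficients b is least squares. The sketch below solves the normal equations for the special case of two input variables, in plain Python. The function name and the sample data are assumptions: A reuses the data columns from (1.2) and y is generated from the hypothetical coefficients [1.0, 2.0], so the fit should recover values close to them.

```python
def least_squares_2(A, y):
    """Least squares fit of y = A · b for two coefficients (normal equations)."""
    # Entries of the 2x2 matrix A^T A and of the vector A^T y.
    s11 = sum(r[0] * r[0] for r in A)
    s12 = sum(r[0] * r[1] for r in A)
    s22 = sum(r[1] * r[1] for r in A)
    t1 = sum(r[0] * yi for r, yi in zip(A, y))
    t2 = sum(r[1] * yi for r, yi in zip(A, y))
    det = s11 * s22 - s12 * s12
    if abs(det) < 1e-12:
        raise ValueError("columns of A are (nearly) linearly dependent")
    # Solve the 2x2 system with Cramer's rule.
    b1 = (t1 * s22 - s12 * t2) / det
    b2 = (s11 * t2 - s12 * t1) / det
    return [b1, b2]

A = [[0.1, 0.4],
     [0.3, 0.9],
     [0.2, 0.3]]
y = [0.9, 2.1, 0.8]  # hypothetical outputs, generated from b = [1.0, 2.0]
b = least_squares_2(A, y)
print(b)
```

The normal equations keep the example short. For larger or ill-conditioned problems, numerical libraries typically prefer more stable matrix factorizations.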