Machine Learning and Pattern Recognition Week 2 Univariate Gaussian
The Gaussian distribution, also called the normal distribution, is widely used in probabilistic machine learning. This week we’ll see Gaussians in the context of doing some basic
statistics of experimental results. Later in the course we’ll use Gaussians as building blocks
of probabilistic models, and to represent beliefs about unknown quantities in inference
algorithms.
We know that many of you will have seen all of this material before (some of you several
times). However, not everyone has, and large parts of the MLPR course depend on thoroughly
understanding Gaussians. This note is more detailed (slow) than many of the machine learning textbooks, and provides some exercises. A later note on multivariate Gaussians
will also be important.
If x ∼ N(0, 1) is a draw from a standard normal, we can scale it by σ and shift it by µ:
z = σx + µ. (3)
If we scale and shift many draws from a standard normal, the histogram of values will be
stretched horizontally, and then shifted. Scaling by σ multiplies the variance by σ² (see the
notes on expectations), and leaves the mean at zero. Adding µ doesn’t change the width of
the distribution, or its variance, but adds µ to the mean.
The distribution of z maintains the same bell-curve shape, with the points of inflection now
at µ ± σ (note, not ±σ²). We still say the variable is Gaussian distributed, but with different
parameters: z ∼ N(µ, σ²). By convention, the second parameter of the normal distribution is
usually its variance σ², not its width or standard deviation σ. However, if you are reading a
paper, or using a new library routine, it is worth checking the parameterization being used,
just in case. Sometimes people choose to define a Gaussian by its precision, 1/σ², instead of
the variance.
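For example, NumPy’s np.random.normal takes the standard deviation as its scale argument, not the variance (SciPy’s scipy.stats.norm uses the same convention). A quick sketch, with arbitrary values:

import numpy as np

mu, sigma = 1.5, 2.0
z = np.random.normal(loc=mu, scale=sigma, size=100_000)  # scale is sigma, NOT sigma**2
print(np.var(z))  # close to sigma**2 = 4.0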
For a general univariate Gaussian variable z ∼ N(µ, σ²), we can identify a standard normal
by undoing the shift and scale above:
x = (z − µ)/σ. (4)
We now work out the probability density for z, by transforming the density for this x.
[See the further reading if you can’t follow this section, or want more detail.]
Substituting the above expression into the PDF for the standard normal suggests that the shape
of the shifted and scaled distribution that we imagined above is described by the density
p(z) = N(z; µ, σ²) ∝ exp(−(z − µ)²/(2σ²)). (5)
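We can check this shape numerically. A sketch, using the standard normalizing constant 1/(σ√(2π)) so that the density integrates to one (the constant isn’t derived here):

import numpy as np
from matplotlib import pyplot as plt

mu, sigma = 1.5, 2.0
z = sigma * np.random.randn(100_000) + mu
grid = np.linspace(mu - 4*sigma, mu + 4*sigma, 200)
pdf = np.exp(-(grid - mu)**2 / (2*sigma**2)) / (sigma * np.sqrt(2*np.pi))
plt.hist(z, bins=100, density=True)  # normalized histogram of the draws
plt.plot(grid, pdf)                  # the density N(z; mu, sigma**2)
plt.show()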
4 Further reading
https://en.wikipedia.org/wiki/Normal_distribution
If you would like to work through the change of variables more rigorously, or in more detail,
we are using a specific form of the following result:
If an outcome x has probability density p_X(x), and an invertible, differentiable function g is
used to create a new quantity z = g(x), then the probability density of the new quantity is
p_Z(z) = p_X(g⁻¹(z)) |dx/dz|. In our case the derivative is simple: dx/dz = 1/σ.
This method for transforming densities for a change of variables is in Bishop Section 1.2.1,
with more detail and the multivariate case in Murphy Section 2.6.
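To see the general result in action with a nonlinear function, here is a sketch using z = exp(x) as an arbitrary example of an invertible, differentiable g, so that g⁻¹(z) = log z and dx/dz = 1/z:

import numpy as np
from matplotlib import pyplot as plt

x = np.random.randn(100_000)  # standard normal draws
z = np.exp(x)                 # z = g(x)
grid = np.linspace(0.01, 8, 200)
p_x = lambda u: np.exp(-u**2 / 2) / np.sqrt(2*np.pi)  # standard normal PDF
p_z = p_x(np.log(grid)) / grid   # p_Z(z) = p_X(g^{-1}(z)) |dx/dz|
plt.hist(z, bins=100, range=(0, 8), density=True)
plt.plot(grid, p_z)              # should match the histogram
plt.show()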
5 Python code
To generate a million outcomes from a standard normal and plot a histogram:
import numpy as np
from matplotlib import pyplot as plt

x = np.random.randn(int(1e6))  # a million standard normal draws
plt.hist(x, bins=100)
plt.show()