Class Notes v1

Uploaded by

Parvat Chavhan

Probability, Distributions and Descriptive Statistics

Experiment
A one-off or repeated process whose possible outcomes are known in advance, but whose actual outcome on any given
run is known only with some probability. The set of all possible outcomes is known as the sample space. The set of all
probabilities is known as the probability space; each element in the probability space lies between 0 and 1, and
the sum of all elements is one.

Random Variable
A function that maps each outcome in the sample space to a real number; the probabilities of the outcomes then
determine the probability of each value the variable can take.

Probability Density Function

The set of ordered pairs (x, f(x)) is a probability density function (pdf) if:
1. $f(x) \ge 0, \ \forall x \in \mathbb{R}$
2. $\sum_{x} f(x) = 1$ if x is discrete, or $\int_{-\infty}^{\infty} f(x)\,dx = 1$ if x is continuous
3. $P(X = x) = f(x)$ if x is discrete, or $P(a < X < b) = \int_{a}^{b} f(x)\,dx$ if x is continuous

The pdf of a variable x is commonly referred to as the distribution of x.
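
As a minimal sketch of conditions 1 and 2 in the discrete case, the check below uses a fair six-sided die as a made-up example. The names `die_pmf` and `is_valid_pmf` are illustrative assumptions, not part of the notes.

```python
from fractions import Fraction

# Hypothetical example: f(x) for a fair six-sided die.
die_pmf = {face: Fraction(1, 6) for face in range(1, 7)}

def is_valid_pmf(pmf):
    """Check conditions 1 and 2: every f(x) >= 0 and the values sum to one."""
    non_negative = all(p >= 0 for p in pmf.values())
    sums_to_one = sum(pmf.values()) == 1
    return non_negative and sums_to_one

print(is_valid_pmf(die_pmf))   # True
print(die_pmf[3])              # P(X = 3) = 1/6
```

Exact fractions are used so that the sum-to-one test is not affected by floating-point rounding.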

Cumulative Distribution Function

The cumulative distribution function (cdf), F(x), for a random variable X with probability density function f(x), is
defined as $F(x) = P(X \le x) = \sum_{t \le x} f(t)$ if x is discrete, or $F(x) = P(X \le x) = \int_{-\infty}^{x} f(t)\,dt$ if x is continuous.
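
For the discrete case, F(x) is simply a running sum of f(t) over all t up to x. A small sketch, again assuming a fair die as the example (`cdf_from_pmf` is a hypothetical helper name):

```python
from fractions import Fraction
from itertools import accumulate

# Hypothetical example: f(x) for a fair six-sided die.
die_pmf = {face: Fraction(1, 6) for face in range(1, 7)}

def cdf_from_pmf(pmf):
    """Return F(x) = P(X <= x) for each x in the support, as running sums of f."""
    xs = sorted(pmf)
    running = accumulate(pmf[x] for x in xs)
    return dict(zip(xs, running))

die_cdf = cdf_from_pmf(die_pmf)
print(die_cdf[4])   # P(X <= 4) = 2/3
print(die_cdf[6])   # 1
```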

Expected Value of a Random Variable

Expected value is a generalised form of the weighted arithmetic average. Essentially, it tells one where the centre of
a distribution is. With f(x) the pdf of X, it is defined as $E(X) = \mu = \sum_{x} x\,f(x)$ if x is discrete, and
$E(X) = \mu = \int_{-\infty}^{\infty} x\,f(x)\,dx$ if x is continuous.
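
A short worked example of the discrete formula, assuming a fair die (the data and the name `expected_value` are illustrative only):

```python
from fractions import Fraction

# Hypothetical example: f(x) for a fair six-sided die.
die_pmf = {face: Fraction(1, 6) for face in range(1, 7)}

def expected_value(pmf):
    """E(X) = sum over x of x * f(x): a probability-weighted average."""
    return sum(x * p for x, p in pmf.items())

mu = expected_value(die_pmf)
print(mu, float(mu))   # 7/2 3.5
```

Note that 3.5 is not itself a possible outcome; the expected value is the centre of the distribution, not necessarily a value the variable can take.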

Expected Value of a Linear Combination of Random Variables

𝐸(𝑎𝑋 + 𝑏𝑌 + ⋯ ) = 𝑎. 𝐸(𝑋) + 𝑏. 𝐸(𝑌) + ⋯
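
The identity can be checked numerically. The sketch below assumes two fair dice X and Y and hypothetical coefficients a = 2, b = 3; it computes E(aX + bY) directly over all 36 joint outcomes and compares it with a.E(X) + b.E(Y).

```python
from fractions import Fraction
from itertools import product

faces = range(1, 7)
p_pair = Fraction(1, 36)   # each (x, y) pair of two fair dice is equally likely
a, b = 2, 3                # hypothetical coefficients

# Left-hand side: E(aX + bY), summed over the joint outcomes.
lhs = sum((a * x + b * y) * p_pair for x, y in product(faces, faces))

# Right-hand side: a.E(X) + b.E(Y), where E(X) = E(Y) for identical dice.
e_die = sum(Fraction(x, 6) for x in faces)
rhs = a * e_die + b * e_die

print(lhs, rhs)   # 35/2 35/2
```

The dice here happen to be independent, but linearity of expectation does not require independence.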

Variance
A measure of spread of variable x around its mean: $\mathrm{Var}(X) = E(X^2) - (E(X))^2$

Standard Deviation
$\sigma(X) = \sqrt{\mathrm{Var}(X)}$
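
As a sketch, the variance and standard deviation of a fair die (an assumed example) follow directly from the two expectations E(X^2) and E(X); the helper name `expectation` is hypothetical.

```python
import math
from fractions import Fraction

# Hypothetical example: f(x) for a fair six-sided die.
die_pmf = {face: Fraction(1, 6) for face in range(1, 7)}

def expectation(pmf, g=lambda x: x):
    """E(g(X)) = sum over x of g(x) * f(x)."""
    return sum(g(x) * p for x, p in pmf.items())

mean = expectation(die_pmf)                             # E(X) = 7/2
var = expectation(die_pmf, lambda x: x**2) - mean**2    # E(X^2) - (E(X))^2
sd = math.sqrt(var)                                     # square root of the variance

print(var)            # 35/12
print(round(sd, 4))   # 1.7078
```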

Range, Minimum, Maximum

The minimum and maximum are the smallest and largest values in the ordered set of all observations of the variable x.
The range is the span between them, usually reported as maximum minus minimum.

Median, Percentiles, Deciles, Quartiles

For a variable x, the mid-point of its ordered set is called the median.

If all the observations of the variable x are ordered, then the nth percentile is the value of x below which n% of
all observations lie. For example, the 90th percentile is the value of x below which 90% of all the observations
of x lie. It follows that the median is the 50th percentile, the minimum value of x is the 0th percentile, and the
maximum value is the 100th percentile.

Quartiles, quintiles, deciles, etc. break the ordered set of observations into n equal parts: quartiles into four equal
parts, with cut points at the 25th, 50th and 75th percentiles; quintiles into five; deciles into ten.
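
A small sketch using Python's standard `statistics` module on made-up data. Software packages use slightly different interpolation conventions for percentiles; the `"inclusive"` method chosen here is one such convention (an assumption, not something the notes specify), under which the minimum and maximum are treated as the 0th and 100th percentiles.

```python
import statistics

data = [2, 4, 4, 5, 7, 9, 10, 12, 15]   # made-up observations

# Cut points that split the ordered data into 4 and into 100 equal parts.
quartiles = statistics.quantiles(data, n=4, method="inclusive")
percentiles = statistics.quantiles(data, n=100, method="inclusive")

print(quartiles)                 # [4.0, 7.0, 10.0]
print(statistics.median(data))   # 7, the same value as the 50th percentile
print(percentiles[49])           # 50th percentile (index 49: the list starts at the 1st)
print(percentiles[89])           # 90th percentile
```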

Inter-quartile Range (IQR)

Difference between third (75th percentile) and first (25th percentile) quartiles. 𝐼𝑄𝑅 = 𝑄3 − 𝑄1
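
A minimal sketch of the IQR on made-up data, assuming the `"inclusive"` quartile convention of Python's `statistics.quantiles` (other conventions can give slightly different quartiles); `iqr` is a hypothetical helper name.

```python
import statistics

def iqr(xs):
    """IQR = Q3 - Q1, with quartiles from the 'inclusive' interpolation rule."""
    q1, _q2, q3 = statistics.quantiles(xs, n=4, method="inclusive")
    return q3 - q1

data = [2, 4, 4, 5, 7, 9, 10, 12, 15]   # made-up observations
print(iqr(data))   # 6.0, since Q1 = 4 and Q3 = 10
```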

Linear Transforms of Data

Allows one to change the centre and spread of data and bring multiple variables onto a common point of reference
to allow for comparative analysis.

If transform is of the type 𝑌 = 𝑎 + 𝑏. 𝑋, then a represents a change of level and b represents a change of scale.
Some common effects of linear transforms on descriptive statistics:

average(a + bX) = a + b.average(X)
var(a + bX) = b^2.var(X)
median(a + bX) = a + b.median(X)
stdev(a + bX) = |b|.stdev(X)
IQR(a + bX) = |b|.IQR(X)
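
These identities can be verified numerically. The sketch below uses made-up data and hypothetical values a = 10, b = -2 (a negative b is chosen deliberately, to show why the absolute value appears for stdev and IQR). Population variance and the `"inclusive"` quartile rule are assumptions of this example.

```python
import math
import statistics

def iqr(xs):
    """IQR = Q3 - Q1, with quartiles from the 'inclusive' interpolation rule."""
    q1, _q2, q3 = statistics.quantiles(xs, n=4, method="inclusive")
    return q3 - q1

x = [2, 4, 4, 5, 7, 9, 10, 12, 15]   # made-up observations
a, b = 10, -2                        # hypothetical change of level and scale
y = [a + b * v for v in x]           # the transformed data Y = a + b.X

checks = {
    "average": math.isclose(statistics.fmean(y), a + b * statistics.fmean(x)),
    "var": math.isclose(statistics.pvariance(y), b**2 * statistics.pvariance(x)),
    "median": math.isclose(statistics.median(y), a + b * statistics.median(x)),
    "stdev": math.isclose(statistics.pstdev(y), abs(b) * statistics.pstdev(x)),
    "IQR": math.isclose(iqr(y), abs(b) * iqr(x)),
}
print(checks)   # every entry should be True
```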

Z-score
A special (and important) type of linear transform of data. It is also called a standard score. Z-score of any
observation of a variable x, gives the number of standard deviations above or below the mean value that the
observation is located at.

𝑧 = (𝑥 − 𝜇)/𝜎

This transform maps the variable x to a distribution with mean 0 and variance 1, without altering the general
shape/characteristics of the original distribution. Essentially, it puts x on the same scale as the standard normal
distribution; the transformed variable is itself normal only if x was normal to begin with.
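
A brief sketch of the z-score transform on made-up data, assuming the population standard deviation is used for σ (`z_scores` is a hypothetical helper name). The transformed values have mean 0 and standard deviation 1.

```python
import statistics

def z_scores(xs):
    """Return (x - mu) / sigma for each observation, using population sigma."""
    mu = statistics.fmean(xs)
    sigma = statistics.pstdev(xs)
    return [(x - mu) / sigma for x in xs]

data = [2, 4, 4, 4, 5, 5, 7, 9]   # made-up data with mean 5 and stdev 2
z = z_scores(data)

print(z)                      # [-1.5, -0.5, -0.5, -0.5, 0.0, 0.0, 1.0, 2.0]
print(statistics.fmean(z))    # 0.0
print(statistics.pstdev(z))   # 1.0
```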

Skewness and Kurtosis

Skewness measures the asymmetry of a distribution about the mean, and kurtosis measures the fatness of the tails of
the distribution. Both are defined in terms of the z-score Z.

Skewness = $E(Z^3)$. If skewness < 0, the left tail is longer; if > 0, the right tail is longer.
Kurtosis = $E(Z^4) - 3$, i.e. kurtosis in excess of a normal distribution's. If kurtosis < 0, the tails are thinner than
a normal distribution's; if > 0, the tails are fatter.
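
As a sketch, both measures can be computed as averages of powers of the z-scores. The data sets and function names below are made up for illustration, and the population standard deviation is assumed.

```python
import statistics

def standardised_moment(xs, k):
    """Average of Z**k over the data, where Z = (x - mean) / stdev."""
    mu = statistics.fmean(xs)
    sigma = statistics.pstdev(xs)
    return statistics.fmean([((x - mu) / sigma) ** k for x in xs])

def skewness(xs):
    return standardised_moment(xs, 3)

def excess_kurtosis(xs):
    return standardised_moment(xs, 4) - 3

symmetric = [1, 2, 3, 4, 5]           # made-up, balanced around its mean
right_skewed = [1, 1, 2, 2, 3, 10]    # made-up, with a long right tail

print(skewness(symmetric))          # approximately 0
print(skewness(right_skewed) > 0)   # True: the right tail is longer
print(excess_kurtosis(symmetric))   # negative: flatter than a normal distribution
```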

Important Descriptive Statistics

1. Mean
2. Minimum
3. Maximum
4. Median
5. Inter-quartile range
6. Variance, or, standard deviation
7. Skewness

Normal Distribution

Special case: mean and median occur at same point; balanced distribution around mean.

Many people believe that datasets with a very large number of observations will follow, or be close to, a normal
distribution. In my opinion this is not necessarily so, and it is safer not to make this assumption.

Chebyshev’s Rule

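Important takeaway: for any distribution, at least $1 - 1/k^2$ of all observations lie within k standard deviations of
the mean, so at least 75% lie within two standard deviations. The often-quoted 95% figure holds only for
(approximately) normal data.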
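
As a hedged illustration, Chebyshev's inequality in its usual form guarantees that at least 1 - 1/k^2 of observations lie within k standard deviations of the mean, whatever the distribution. The sketch below compares that bound with the actual share for a deliberately lopsided, made-up data set; the function names are hypothetical.

```python
import statistics

def fraction_within(xs, k):
    """Share of observations within k population standard deviations of the mean."""
    mu = statistics.fmean(xs)
    sigma = statistics.pstdev(xs)
    inside = sum(1 for x in xs if abs(x - mu) <= k * sigma)
    return inside / len(xs)

def chebyshev_bound(k):
    """Minimum share guaranteed by Chebyshev's inequality for k > 1."""
    return 1 - 1 / k**2

data = [0, 0, 0, 0, 0, 0, 0, 0, 0, 100]   # made-up data: mean 10, stdev 30

print(chebyshev_bound(2))         # 0.75
print(fraction_within(data, 2))   # 0.9
```

Here 90% of the observations fall within two standard deviations: more than the guaranteed 75%, but less than the 95% one would expect from normal data.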

Outliers
Classic rule: all observations for which |Z| > 2, i.e. more than two standard deviations above or below the mean.

Preferred rule: using the boxplot, all observations that lie below Q1 - 1.5.IQR or above Q3 + 1.5.IQR
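
Both rules can be sketched in a few lines. The data are made up, the function names are hypothetical, and the quartiles assume the `"inclusive"` convention of Python's `statistics.quantiles`; the z-score rule is applied to the absolute value so that unusually low observations are flagged as well.

```python
import statistics

def z_rule_outliers(xs, threshold=2.0):
    """Classic rule: observations more than `threshold` stdevs from the mean."""
    mu = statistics.fmean(xs)
    sigma = statistics.pstdev(xs)
    return [x for x in xs if abs(x - mu) / sigma > threshold]

def boxplot_outliers(xs):
    """Boxplot rule: observations below Q1 - 1.5.IQR or above Q3 + 1.5.IQR."""
    q1, _q2, q3 = statistics.quantiles(xs, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    return [x for x in xs if x < lower or x > upper]

data = [2, 4, 4, 5, 7, 9, 10, 12, 40]   # made-up data with one extreme value

print(z_rule_outliers(data))    # [40]
print(boxplot_outliers(data))   # [40]
```

One reason to prefer the boxplot rule is that Q1, Q3 and the IQR are barely affected by the extreme values themselves, whereas the mean and standard deviation used by the z-score rule are pulled towards them.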
