PSM 2k23
PSM 2k23
2k23 REVISION
PART 2
where:
Oi = Observed frequencies
Ei = Expected frequencies if the null hypothesis were true
d.f. = (r-1)x (c-1)
Probability distribution
• Can be discrete or continuous probability distributions based on the nature of the random
variable
2012 Q1b, 2011 Q2a, 2013 Q1a
• The probability of ‘r’ successes out of ‘n’ trials of a binomial experiment is given by the formula
• Pr (r out of n) = nCr pr qn-r
• = n! pr qn-r
• (n-r)! r!
• n is the total number of trials or attempts
• r is the number of successes
• P is the probability of success
• q is the probability of failure
•2013 Q1b, 2014 Q2c
•Parameters of Binomial distribution are
i. the number of trials (n)
ii. the probability of success (p)
•
•2013 Q1c
i. mean is np
ii. variance np(1- p)
• Standard deviation= npq
•2013 Q1d
•The binomial distribution is appropriate here because only two outcomes are possible: died or
survived
• p = 36% or 0.36 hence q = 1- 0.36= 0.64
• n = 6 cases
• r for the first question is 0, it’s 3 for the second and takes more than one value for the third
question (4 or more = 4, 5 and 6)
•Pr (r out of n) = nCr pr qn-r
•= n! pr qn-r
r! (n-r)!
a) Pr (0 out of 6 cases) = 6! 0.360 x 0.64(6-0)
0! X 6!
=
b) Pr (3 out of 6 cases) = 6! 0.363 x 0.64(6-3)
3! X 3!
•Nominal, Ordinal,Interval,Ratio
VARIABLES
•Types of variables
• Quantitative variables (numeric data)
– Have numerical dimension
• Qualitative variables (attributive or categorical variables)
• No numerical dimension
2012 Q2
QUALITATIVE VARIABLES
•The normal distribution is used when the population distribution of data is assumed
normal. It is characterized by the mean and the standard deviation of the data.
•A sample of the population is used to estimate the mean and standard deviation.
•The t statistic is an estimate of the standard error of the mean of the population or
how well known is the mean based on the sample size.
•Which to use depends entirely on the question you are trying to answer. If the
question concerns the entire population as it is distributed, then the normal distribution
should be used. If the question concerns the mean of the population then the t statistic
may be used. For the use of either, a larger sample size gives a better result.
•
2014 Q2a
• Standard deviation
• Variance
• Range
• Mean absolute deviation
• Interquartile range
• Coefficient of variation
2013 Q2a
•Nominal Vs Ordinal
– Merely classifies or categorizes vs Has Rank or order property
– No notion of numerical magnitude vs Has nominal properties.
– Focus on attributes or qualities of the phenomenon vs Differences between any two levels cannot be quantified meaningfully. e.g. level of pain: Mild,
Moderate, Severe.
– Examples: Blood Group( A, B, AB,O), Sex(male, female),Socio-economic status, Marital Status, Site of Cancer,Type of Disease VS Stage of cancer disease,
Patient’s pain condition, Level of Education, Doctors’ Professional Status
2017 Q1a
2017 Q1a
SKEWNESS
•Skewed Distributions.
–Positive if mean is greater than median.
–Negative if mean is less than median.