Ug Statistic Notes
STATISTICS FORMULAE
Measures of central tendency (mean, median, mode)
a. Mean = ∑x / N (sum of observations divided by no. of observations)
b. Median = middle value = value of middle observation
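A minimal Python sketch of the definitions above, using the standard library. The observations are made up for illustration only.

```python
from statistics import median, multimode

# Hypothetical observations, chosen only for illustration.
observations = [4, 7, 7, 9, 13]

avg = sum(observations) / len(observations)  # sum of observations / no. of observations
mid = median(observations)                   # value of the middle observation after sorting
modes = multimode(observations)              # most frequently occurring value(s)

print(avg, mid, modes)
```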
______________________________________________________________
Coefficient of variation (CV) = σ / x̄ × 100
It is the relative measure of variation. It is also used for comparing variability between
2 or more samples with different magnitudes & between 2 or more variables in the
same sample.
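The relative measure of variation above (SD divided by mean, times 100; commonly called the coefficient of variation) can be sketched as below. The function name, the use of the population SD, and the data are assumptions made for illustration.

```python
from statistics import mean, pstdev

def coefficient_of_variation(values):
    """Return SD / mean x 100. Population SD is an assumption here."""
    return pstdev(values) / mean(values) * 100

# Hypothetical samples with different magnitudes and units.
heights_cm = [150, 160, 170]
weights_kg = [50, 60, 70]

cv_height = coefficient_of_variation(heights_cm)
cv_weight = coefficient_of_variation(weights_kg)
print(round(cv_height, 1), round(cv_weight, 1))
```

Both samples have the same SD, but the weights vary more relative to their mean, which is the comparison the measure is meant for.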
Z score (also known as relative variate / standard normal variate): It is the deviation of an individual
observation from the mean x̄ in a normal curve & is measured in terms of standard deviation.
It indicates how far an observation lies above or below the mean, on either side,
expressed in units of S.D.
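A short sketch of the standard normal variate described above, taking the usual form (observation minus mean) divided by SD. The function name and the example values are assumptions.

```python
def z_score(x, mean, sd):
    """Deviation of an observation from the mean, in units of SD."""
    return (x - mean) / sd

# Hypothetical values: mean 10, SD 1.
above = z_score(12, 10, 1)  # observation above the mean
below = z_score(9, 10, 1)   # observation below the mean
print(above, below)
```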
12. Chi-square test (χ²): χ² = ∑ (O − E)² / E, where O = observed value & E = expected value
E = (Row total × Column total) / N; calculate E for each cell
Chi-square test is used to test the significance of the difference between 2 or more
groups (proportions) when data are expressed in frequencies or counts,
such as no. of responses (eg patients) in 2 or more categories.
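The calculation above can be sketched in plain Python: E is computed for each cell from the row and column totals, then (O − E)² / E is summed over all cells. The table of counts is hypothetical, and the sketch stops at the statistic (no p-value, no continuity correction).

```python
def chi_square(observed):
    """Chi-square statistic for a table of observed counts (list of rows)."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(observed):
        for j, o in enumerate(row):
            e = row_totals[i] * col_totals[j] / n  # E = row total x column total / N
            chi2 += (o - e) ** 2 / e
    return chi2

# Hypothetical 2 x 2 table of counts.
table = [[20, 30],
         [30, 20]]
print(chi_square(table))
```

Here every expected value is 25, so each cell contributes 5² / 25 = 1 and the statistic is 4.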
Pie diagram: The frequencies of the groups are shown as sectors of a circle, & the angle
of each sector denotes the frequency of that group.
In the above eg, 30% = 30/100 × 360 = 108°; 23% = 82.8°; 15% = 54°; 7% = 25.2°; 4% = 14.4°; 2% = 7.2°
Total is 360°
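The conversion from percentage to sector angle used above can be sketched as follows. The function name is an assumption, and only a few of the percentages are checked.

```python
def pie_angle(percent):
    """Angle in degrees of a pie sector holding `percent` of the total."""
    return percent * 360 / 100

for pct in (30, 15, 7, 4, 2):
    print(pct, round(pie_angle(pct), 1))
```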
Histogram:
It is a graphical representation of the frequency distribution of quantitative data. The
groups (class intervals) of the variable are marked on the horizontal axis & the frequency,
i.e. no. of observations, on the vertical axis.
Total area covered by histogram gives total frequency. Hence, it is also called Area
diagram.
Frequency polygon is useful for comparison of two or more groups in terms of frequency
distribution.
Line chart:
It is a frequency polygon presenting variation by lines. It shows the trend of events occurring
over a period of time, eg IMR, CBR, CDR. It brings out the components that are
important in epidemiological studies, as it gives an idea about time, place & persons.
Scatter diagram:-
It is used to assess the relationship between two continuous variables (like ht, wt, etc.).
One variable is taken on the X-axis & the other on the Y-axis; each point is plotted where
the perpendiculars drawn from the 2 readings meet. A line is drawn to show the nature of correlation.
Screening: TP = true positive; FP = false positive; FN = false negative; TN = true negative
Test Present Absent Total
• Exercise: In a pilot study of 1600 subjects to identify breast cancer with the help of
breast carcinoma promoting factor, 880 individuals had a negative test result of
which did not have breast cancer. Calculate negative predictive value of the test.
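Negative predictive value is the proportion of test-negative individuals who truly do not have the disease, TN / (TN + FN) × 100. The sketch below shows only the arithmetic. The number of test-negative subjects without breast cancer is missing from the exercise as given, so the value 800 used here is a placeholder and not the answer to the exercise.

```python
def negative_predictive_value(true_negatives, false_negatives):
    """NPV in percent: TN / (TN + FN) x 100."""
    return true_negatives / (true_negatives + false_negatives) * 100

test_negatives = 880    # from the exercise
true_negatives = 800    # PLACEHOLDER: this count is missing in the notes
false_negatives = test_negatives - true_negatives

npv = negative_predictive_value(true_negatives, false_negatives)
print(round(npv, 1))
```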
VITAL STATISTICS
• Census population in a city was 6,00,000. The following events occurred during the year
1991.
• Total LBs - 15000; total deaths – 6000; total maternal deaths – 60; infant deaths – 800 ;
neonatal deaths – 720; still births in 1991 – 140; early neonatal deaths – 480
• Calculate CBR, CDR, IMR, MMR, NMR, PNMR, Early NMR, Late NMR, PMR
• IMR = infant deaths /total LBs × 1000 = 53.3 per 1000 LBs
• PNMR = (infant deaths − neonatal deaths) / TLBs × 1000 = (800 − 720)/15000 × 1000 = 5.3 per 1000 LBs
• Early NMR = early neonatal deaths / TLBs × 1000 = 480/15000 × 1000 = 32 per 1000 LBs
• Late NMR = (neonatal deaths − early neonatal deaths) / TLBs × 1000 = (720 − 480)/15000 × 1000 = 16
per 1000 LBs
• PMR = (SBs + early neonatal deaths) / total births (LBs + SBs) × 1000 = (140 + 480)/(15000 + 140) × 1000 = 40.95 per 1000 total births
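The whole exercise can be checked with a short script. The rates not worked out above (CBR, CDR, MMR, NMR) use their usual definitions, which is an assumption on top of the notes: the census population stands in for the mid-year population, and MMR is expressed per 1000 LBs here although it is often quoted per 1,00,000 LBs.

```python
# Figures from the 1991 exercise above.
population = 600_000
live_births = 15_000
total_deaths = 6_000
maternal_deaths = 60
infant_deaths = 800
neonatal_deaths = 720
still_births = 140
early_neonatal_deaths = 480

def per_1000(numerator, denominator):
    """Express a ratio per 1000 of the denominator."""
    return numerator / denominator * 1000

rates = {
    "CBR": per_1000(live_births, population),
    "CDR": per_1000(total_deaths, population),
    "IMR": per_1000(infant_deaths, live_births),
    "MMR": per_1000(maternal_deaths, live_births),
    "NMR": per_1000(neonatal_deaths, live_births),
    "PNMR": per_1000(infant_deaths - neonatal_deaths, live_births),
    "Early NMR": per_1000(early_neonatal_deaths, live_births),
    "Late NMR": per_1000(neonatal_deaths - early_neonatal_deaths, live_births),
    # PMR uses total births (live births + still births) as the denominator.
    "PMR": per_1000(still_births + early_neonatal_deaths, live_births + still_births),
}

for name, value in rates.items():
    print(f"{name}: {value:.2f}")
```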
• Total cases
• Total cases
• OR = ad/bc
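A small sketch of the odds ratio formula above. The notes do not define the cells, so the usual 2 × 2 layout is assumed here: a = exposed cases, b = exposed controls, c = unexposed cases, d = unexposed controls. The counts are hypothetical.

```python
def odds_ratio(a, b, c, d):
    """Cross-product ratio ad / bc of a 2 x 2 table (assumed layout, see above)."""
    return (a * d) / (b * c)

# Hypothetical case-control counts.
result = odds_ratio(a=30, b=10, c=20, d=40)
print(result)
```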
Secondary attack rate (SAR) = No. of exposed persons developing the disease within the
Q. In a population of 100 females the mean Hb concentration was 10 & the SD was 1
Ans: 0.4
Q. Calculate the standard error (SE) for a sample of 25 persons with a history of fever
of 8 days & with SD 2
Ans: SE = SD / √n = 2 / √25 = 0.4
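The answer to the fever question above (25 persons, SD 2) follows from SE = SD / √n, sketched here. The function name is an assumption.

```python
import math

def standard_error(sd, n):
    """Standard error of the mean: SD divided by the square root of the sample size."""
    return sd / math.sqrt(n)

se = standard_error(sd=2, n=25)
print(se)
```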
Ans: 0.01
Sample size = 4pq / L²
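A minimal sketch of the sample size formula above, 4pq divided by L squared. The notes do not define the symbols, so the usual reading is assumed: p = positive character in percent, q = 100 − p, L = allowable error in percent. The example values are hypothetical.

```python
def sample_size(p, allowable_error):
    """Sample size from 4pq / L^2, with p and L in percent and q = 100 - p."""
    q = 100 - p
    return 4 * p * q / allowable_error ** 2

# Hypothetical: p = 10%, allowable error L = 2%.
n = sample_size(p=10, allowable_error=2)
print(n)
```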