0% found this document useful (0 votes)
11 views40 pages

Lecture 2

Uploaded by

Yuvarani
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
11 views40 pages

Lecture 2

Uploaded by

Yuvarani
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 40

PLG 500 Penaakulan Statistik dalam Pendidikan

Statistical Reasoning in Educational Research

Dr Nooraida Yakob
Dr Nor Asniza Ishak
Dr Muzirah Musa
• Summarising and displaying data, describe
the distribution
– Data distribution. Constructing charts and graphs
– bar chart, histogram, box plot, stem and leaf
plot
– Describing the distribution, calculating mean,
mode and median, standard deviation and
variance, skewness of a distribution.
Data distribution.
Constructing charts and graphs – bar chart, histogram,
box plot, stem and leaf plot
Mathematics score
63,59,53,61,58,63, 77,65,59,63

59,59,88,58,59,64,60,64,75,66

62,61,60,70,64,63,59,62,65,76

• This set of data makes sense when it is simplified


• We can report the number of students with specific scores in
a table
• This number is called frequency
Carta Bar

• Diagram showed frequency


• The height of each bar indicates the frequency of
each category
• The bars are the same width and separated
Histogram
• Showed the frequency of each interval class for
continuous variable
• The width of each bar represents the size of class
interval, the height represents the frequency of class
interval
• The bars have the same width and touch to one
nother
Stem & Leaf Plot

A Stem and Leaf Plot is a special table where each data


value is split into a "stem" (the first digit or digits) and a
"leaf" (usually the last digit).
Boxplot
• A Box Plot is the visual representation of the statistical five
number summary of a given data set.
• A Five Number Summary includes:
– Minimum
– First Quartile
– Median (Second Quartile)
– Third Quartile
– Maximum
Box Plot
• A Five Number Summary includes:
– Minimum = 53
– First Quartile = 59
– Median (Second Quartile) = 62.5
– Third Quartile = 65
– Maximum = 88
Example 3-24: Solution

Step 5.
Figure 3.14

Prem Mann, Introductory Statistics, 7/E Copyright © 2010 John Wiley & Sons. All right reserved
Boxplot

To determine the outliers

 Outlier is a value exceed 1.5*IQR form the nearest quartile (1.5 x 6


= 9, Q3 = 65) ; 65 + 9 = 74

 Extreme outliers is a value exceed 3*IQR from the nearest quartile


- was labelled as * (3 x 6 =18, Q3 = 65); 65 + 18 = 84

Extreme outliers – ID
13
Outliers – ID 7,19,30

IQR – interquartile
range
Describing the distribution, calculating mean,
mode and median, standard deviation and
variance, skewness of a distribution.
Central Tendency Measurement
 Numerical value to represent the center point of
data set
 Indicate where most values in a distribution fall
 Describe the whole data with one value
 Three types :
 Mode
 Median
 Mean
Central Tendency Measurement

 Mode = score with the highest frequency


 Median = score at the middle when the data is arranged
from smallest to largest
 Mean = total score  number of score

3, 7, 9, 4, 5, 4, 6, 9, 9
Mode = 9 (perhatikan mod ≠ 3)
Median = 3, 4, 4, 5, 6, 7, 9, 9, 9
Mean = (3+7+9+4+5+4+6+9+9)/9 = 6.22
Central Tendency Measurement

Advantages Weaknesses
Min Used the most as it is Dipengaruhi oleh nilai ekstrim
stable
Take into account all 22, 25, 21, 26, 67
data Min = 32.20
Suitable for continuous
data
Median Tidak dipengaruhi oleh Hanya mengambil nilai di tengah-
nilai ekstrim tengah
Mod The simplest Some of the data did not have
measurement mode value
Suitable for discrete data 11, 14, 19, 16, 25, 36
Measures of Dispersion
 A measurement that showed how well the values in
a data set differ from one another or from the
central of the data set
 Variance
 Standard of deviation
=
.

Varians & Sisihan Piawai


 Varians merupakan statistik yang mengukur sejauh manakah skor-skor
di dalam data berbeza dengan min.
 Nilai varians yang besar menunjukkan data berada jauh daripada min.
Ini bermaksud data tersebut lebih terserak. Sebaliknya, nilai varians
yang kecil menunjukkan data berada hampir dengan min, yang
bermaksud data tersebut lebih terkumpul.
 Varians bagi satu-satu data, 2 diberikan dalam rumus-rumus berikut:
σ 𝑥−𝑥ҧ 2 σ 𝑥2
2 = atau 2 = − 𝑥ҧ 2 ,
𝑁 𝑁

dengan 𝑥ҧ = min skor bagi data tersebut


 Sisihan piawai,  = varians .
Variance

4, 6, 9, 3, 5, 12, 10
Min, 𝑥ҧ =  X 4  6  9  3  5  12  10
 7
N 7
x (x – 𝑥)ҧ (x – 𝑥)ҧ 2 x x2
x 4 16
4 4 – 7 = –3 (–3)2 = 9 6 36
6 6 – 7 = –1 (–1)2 = 1 9 81
9 9–7=2 (2)2 = 4 3 9
3 3 – 7 = –4 (–4)2 = 16 5 25
5 5 – 7 = –2 (–2)2 = 4 12 144
12 12 – 7 = 5 (5)2 = 25 10 100
10 10 – 7 = 3 (3)2 = 9 x2 = 411
(x – 𝑥)ҧ 2 = 68
σ 𝑥2 411
2 =
σ 𝑥−𝑥ҧ 2
=
68
= 9.71 2 = − 𝑥ҧ 2 = − 72 = 9.71
𝑁 7 𝑁 7
Variance & Standard Deviation

Advantages:
 More accurate as the calculation involved all values in
the data set
 Measure the dispersion of each value from the mean
value in the data set.
 The calculation did not involved the square of deviation.
 The unit for standard deviation and the unit for data –
same
Skewness Dispersion

Give the shape of distribution

Positive skewness –
Negative skewness – tail on the right side
tail on the left side
Terima

Kasih

You might also like