MIT24 915F15 Lec14

Uploaded by

mail2vinaykk

24.963 Linguistic Phonetics
Basic statistics

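[Figure: normal distribution centered on the mean X, with the horizontal axis marked from -3S to +3S. 68% of values lie within ±S, 95% within ±2S, and 99.7% within ±3S; 2.5% lie in the tail beyond 2S. Image by MIT OCW. Adapted from Kachigan, S. K. Multivariate Statistical Analysis. 2nd ed. New York, NY: Radius, 1991.]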

1
Assignments:
• Send me a paragraph on your final project (due 11/24)
• Write up the affricates experiment (due 12/1)

2
Writing up an experiment

The report on an experiment usually consists of four basic parts:
1. Introduction
2. Procedure
3. Results
4. Discussion

3
Writing up an experiment
1. Introduction
• Outline the purpose of the experiment.
• State the hypotheses tested.
• Provide background information (possibly including descriptions of relevant previous results, theoretical issues, etc.).
2. Procedure - what was done and how.
• instructions for replication, e.g.
– Experimental materials
– Subjects
– Recording procedure
– Measurement procedures (especially measurement
criteria).

4
Writing up an experiment
3. Results
• Presentation of results, including descriptive statistics
(means etc) and statistical tests of hypotheses.

4. Discussion
• Discuss the interpretation and significance of the results

5
Some Statistics
Two uses of statistics in experiments:
• Summarize properties of the results (descriptive statistics).
• Test the significance of results (hypothesis testing).

6
Descriptive statistics
Measures of central tendency:
• Mean: $M = \frac{\sum_i x_i}{N}$
– M is used for sample mean, μ for population mean.

• Median: The value that separates the lower half of a set of observations from the higher half.
– Arrange the values from low to high. The median is in the middle of this list.
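
The two measures of central tendency can be sketched in a few lines of Python. The VOT values below are hypothetical, chosen only so that one large value pulls the mean away from the median; they are not data from the course.

```python
# Hypothetical VOT measurements in ms (illustrative values, not course data).
vot_ms = [102, 95, 131, 118, 110, 242]

# Mean: M = (sum of x_i) / N
m = sum(vot_ms) / len(vot_ms)

# Median: arrange the values from low to high and take the middle of the list.
# With an even number of values, average the two middle values.
ordered = sorted(vot_ms)
mid = len(ordered) // 2
if len(ordered) % 2 == 1:
    med = ordered[mid]
else:
    med = (ordered[mid - 1] + ordered[mid]) / 2

print("mean:", m)
print("median:", med)
```

Python's standard `statistics` module also provides `mean` and `median` functions that could be used instead of the explicit arithmetic.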

7
Descriptive statistics
A measure of dispersion:
• Variance: mean of the squared deviations from the mean

$\sigma^2 = \frac{\sum_i (x_i - \mu)^2}{N}$

• Standard deviation: σ (square root of the variance).
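
As a rough illustration of these definitions, the sketch below computes the variance and standard deviation of a small set of intensity values. The numbers are hypothetical, and the set is treated as a complete population (division by N), as in the formula above.

```python
import math

# Hypothetical frication intensities in dB (illustrative values only).
x = [64, 66, 67, 63, 65]

n = len(x)
mu = sum(x) / n  # mean of the values

# Variance: mean of the squared deviations from the mean.
variance = sum((xi - mu) ** 2 for xi in x) / n

# Standard deviation: square root of the variance.
sd = math.sqrt(variance)

print("variance:", variance)
print("standard deviation:", round(sd, 3))
```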

8
9
10
11
Hypothesis Testing

Which properties differentiate /tS/ and /dZ/?

        mean VOT (ms)   mean intensity (dB)   mean F1 onset (Hz)
/tS/    112             67                    531
/dZ/    58              64                    423

12
Hypothesis Testing

• Is peak intensity of frication in /tS/ higher than in /dZ/? I.e. is 67 dB significantly higher than 64 dB?
• Could the apparent differences have arisen by chance, although the true (population) means of frication intensities in the two affricates are the same?
• I.e. given that intensity of /tS/ and /dZ/ frications varies, we might happen to sample most of our /tS/ tokens from the high end of the distribution, and most of our /dZ/ tokens from the low end.
• Statistical tests allow us to assess the probability that this is the case.

13
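• Could the apparent differences have arisen by chance, although the true (population) means of /tS/ and /dZ/ are the same?
[Figure: sketched distributions, one labeled "dZ/tS" and others labeled "dZ" and "tS", with a question mark between the alternatives.]
• Statistical tests allow us to assess the probability that this is the case.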

14
Hypothesis Testing: t-test
• The t-test allows us to test hypotheses concerning means
and differences between means.
– The mean frication intensity of /tS/ differs from the
mean frication intensity of /dZ/ (Mean difference ≠ 0)

• We actually evaluate two exhaustive and mutually exclusive hypotheses, a null hypothesis that the mean has a particular value, and the alternative hypothesis that the mean does not have that value.
– The mean frication intensity of /tS/ = the mean
frication intensity of /dZ/ (Mean difference = 0).
• Statistical tests allow us to assess the probability of
obtaining the observed data if the null hypothesis were
true.
15
Hypothesis Testing: t-test
• Basic concept: If we know what the distribution
of sample means would be if the null hypothesis
were true, then we can calculate the probability
of obtaining the observed mean, given the null
hypothesis.
• We arrive at the parameters of the distribution of
sample means through assumptions and
estimation.

16
Hypothesis Testing
Distribution of sample means

[Figure: distribution of sample means with σ = 10 ms; horizontal axis marked from 110 to 190. Annotation: if this were the population mean, it is unlikely that we would get a sample mean of this value.]

17
Hypothesis Testing
• Basic assumption: The samples are drawn from normal populations.

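[Figure: normal distribution centered on the mean X, with the horizontal axis marked from -3S to +3S. 68% of values lie within ±S, 95% within ±2S, and 99.7% within ±3S; 2.5% lie in the tail beyond 2S. Image by MIT OCW. Adapted from Kachigan, S. K. Multivariate Statistical Analysis. 2nd ed. New York, NY: Radius, 1991.]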
18
Distribution of Sample Means
• Basic assumption: The distribution of sample
means is normal.
• This is guaranteed to be true if the population
from which the sample is taken is normally
distributed.
• But the distribution of sample means is
approximately normal even if the population is
non-normal, as long as the samples are large
enough.

• http://onlinestatbook.com/stat_sim/sampling_dist/index.html
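
The claim that sample means are approximately normal even for a non-normal population can be checked with a small simulation. This is only a sketch: the uniform population, the sample size of 30, and the number of samples are all arbitrary assumptions. If the sample means are roughly normal, about 95% of them should fall within two standard deviations of the population mean.

```python
import random

random.seed(0)  # fixed seed so the sketch is repeatable

N = 30              # size of each sample (arbitrary illustrative choice)
NUM_SAMPLES = 5000  # number of samples drawn (arbitrary illustrative choice)

# Population: uniform on [0, 1), which is flat rather than normal.
# Its mean is 0.5 and its variance is 1/12.
pop_mean = 0.5
sd_of_means = ((1 / 12) / N) ** 0.5  # standard deviation of the sample means


def sample_mean(n):
    """Draw n values from the uniform population and return their mean."""
    return sum(random.random() for _ in range(n)) / n


sample_means = [sample_mean(N) for _ in range(NUM_SAMPLES)]

# Proportion of sample means within two standard deviations of the mean.
within_2sd = sum(
    abs(m - pop_mean) <= 2 * sd_of_means for m in sample_means
) / NUM_SAMPLES

print("proportion within 2 SD:", within_2sd)
```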

19
Hypothesis Testing
• Properties of distribution of means of samples of size N
drawn from a normal population:
– The sample means are normally distributed.
– Mean is the same as the population mean.
– The variance is less than the population variance: $\sigma_M^2 = \frac{\sigma^2}{N}$
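
A simulation can make the relation between sample size and the variance of sample means concrete. The sketch below assumes a hypothetical normal population (mean 450, standard deviation 50, values chosen for illustration only) and compares the observed variance of sample means with σ²/N for samples of size 4, 16, and 64.

```python
import random
from statistics import pvariance

random.seed(1)  # fixed seed so the sketch is repeatable

MU, SIGMA = 450.0, 50.0  # hypothetical population parameters (assumptions)
NUM_SAMPLES = 4000       # number of samples drawn per sample size


def variance_of_sample_means(n):
    """Draw NUM_SAMPLES samples of size n; return the variance of their means."""
    means = [
        sum(random.gauss(MU, SIGMA) for _ in range(n)) / n
        for _ in range(NUM_SAMPLES)
    ]
    return pvariance(means)


observed = {}
for n in (4, 16, 64):
    observed[n] = variance_of_sample_means(n)
    predicted = SIGMA ** 2 / n  # sigma^2 / N
    print(n, round(observed[n], 1), predicted)
```

The observed values should be close to the predicted ones, and both shrink as the sample size grows.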

20
[Figure: (a) Parent population. (b) Sampling distributions of the mean based on samples of size n = 4, n = 16, and n = 64. Each panel has an x axis running from 300 to 600. Image by MIT OCW. Adapted from Kachigan, S. K. Multivariate Statistical Analysis. 2nd ed. New York, NY: Radius, 1991.]

21
Hypothesis Testing
• The mean of the distribution is determined by hypothesis.
– E.g. mean = 1500 Hz or mean difference = 0.
• Population variance is estimated from the sample variance. Unbiased estimate of the population variance:
$S^2 = \frac{\sum_i (x_i - M)^2}{N - 1}$
– N-1 is the number of degrees of freedom of the sample.
• So estimated variance of distribution of sample means: $S_M^2 = \frac{S^2}{N}$
• t score: $t = \frac{M - \mu}{S_M}$
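
The steps above (estimate the variance with N-1, divide by N, then form the t score) can be sketched as a short function. The function name and the formant values are hypothetical; the hypothesized mean of 1500 Hz follows the example in the text.

```python
import math


def t_score(sample, mu):
    """Return (t, degrees of freedom) for a sample against a hypothesized mean mu."""
    n = len(sample)
    m = sum(sample) / n                                # sample mean M
    s2 = sum((x - m) ** 2 for x in sample) / (n - 1)   # unbiased variance estimate S^2
    s_m = math.sqrt(s2 / n)                            # estimated SD of sample means, S_M
    return (m - mu) / s_m, n - 1


# Hypothetical formant measurements in Hz (illustrative values only).
formant_hz = [1510, 1490, 1530, 1520, 1500]

t, df = t_score(formant_hz, mu=1500)
print("t =", round(t, 3), "with", df, "degrees of freedom")
```

The resulting t score would then be compared against the t-distribution with the matching number of degrees of freedom.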

22
Hypothesis Testing
• t scores follow a t-distribution - similar to a normal distribution, but
with slightly fatter tails (more extreme values) because S may
underestimate σ.
• t-distribution is actually a family of distributions, one for each
number of degrees of freedom.
• Calculate t-score then consult relevant t distribution to determine the
probability of obtaining that t-score or greater (more extreme).

23
t test for independent means
• When we compare means, we are actually sampling a population of
differences (e.g. differences in intensity of frication of /tS/ and /dZ/).
• If the null hypothesis is correct, then the mean difference is 0.
• Variance of the distribution of mean differences is estimated based
on the variances and sizes of the two samples.
– Or, if the observations are paired, based on the variance of the
differences (‘paired t-test’)
• Using a paired t-test (ignoring repeated measures):
– $S_M = 3.34/\sqrt{252} = 0.21$
– $M = M_1 - M_2 = 66.7 - 64.3 = 2.4$ (difference between sample means)
– $\mu = 0$ (null hypothesis is that the difference between population means is 0).

$t(252) = \frac{M - \mu}{S_M} = \frac{2.4 - 0}{0.21} = 11.4$

$p(t(252) \geq 11.4) = 2.56 \times 10^{-6}$
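
The arithmetic in the paired t-test example can be reproduced from the summary values given in the text. This sketch assumes that 3.34 is the standard deviation of the paired differences and 252 the number of pairs, which is how the expression for S_M reads; it computes only the t score, not the p value.

```python
import math

s_diff = 3.34          # SD of the differences (interpretation of the text's 3.34)
n = 252                # number of paired observations
m_diff = 66.7 - 64.3   # difference between sample means, in dB
mu0 = 0.0              # null hypothesis: population mean difference is 0

s_m = s_diff / math.sqrt(n)   # estimated SD of the mean difference, S_M
t = (m_diff - mu0) / s_m      # t score

print("S_M =", round(s_m, 2))
print("t =", round(t, 1))
```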

24
Hypothesis testing
• Statistical tests like the t test give us the probability of obtaining the
observed results if the null hypothesis were correct - the p value.
E.g. p < 0.01, p = 0.334.
• We reject the null hypothesis if the experimental results would be
very unlikely to have arisen if the null hypothesis were true.
• How should we set the threshold for rejecting the null hypothesis?
– Choosing a lower threshold increases the chance of incorrectly
accepting the null hypothesis.
– Choosing a higher threshold increases the chance of incorrectly
rejecting the null hypothesis.
– A common compromise is to reject the null hypothesis if p < 0.05,
but there is nothing magical about this number.
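
The trade-off in choosing a threshold can be illustrated by simulating many experiments in which the null hypothesis is true and counting how often it is rejected. To stay within the Python standard library, this sketch substitutes a z test with a known population standard deviation for the t test; the sample size and number of experiments are arbitrary assumptions.

```python
import random
from statistics import NormalDist

random.seed(2)  # fixed seed so the sketch is repeatable

SIGMA = 1.0             # known population SD (assumption that permits a z test)
N = 20                  # observations per simulated experiment
NUM_EXPERIMENTS = 4000  # number of simulated experiments
std_normal = NormalDist()


def p_value_under_null():
    """Simulate one experiment where the true mean is 0; return a two-sided p value."""
    sample_mean = sum(random.gauss(0.0, SIGMA) for _ in range(N)) / N
    z = sample_mean / (SIGMA / N ** 0.5)
    return 2 * (1 - std_normal.cdf(abs(z)))


p_values = [p_value_under_null() for _ in range(NUM_EXPERIMENTS)]

# Proportion of experiments that incorrectly reject the null at each threshold.
false_rejection_rate = {}
for alpha in (0.01, 0.05, 0.10):
    false_rejection_rate[alpha] = sum(p < alpha for p in p_values) / NUM_EXPERIMENTS
    print(alpha, false_rejection_rate[alpha])
```

The rate of incorrect rejections should track the threshold itself, so a higher threshold yields more of them.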

25
Hypothesis testing
• In most experiments we need more complex statistical analyses than
the t test (e.g. ANOVA), but the logic is the same: Given certain
assumptions, the test allows us to determine the probability that our
results could have arisen by chance in the absence of the
hypothesized effect (i.e. if the null hypothesis were true).

26
Fitting models
• Statistical analyses generally involve fitting a model to the
experimental data.
• The model in a t-test is fairly trivial, e.g.
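affricate VOT = μ + voice (voice is ‘voiced’ or ‘voiceless’)
$\mathrm{VOT}_{ij} = \mu + \mathrm{voice}_i + \mathrm{error}_{ij}$

• Analysis of Variance (ANOVA) involves more complex models, e.g.
$\mathrm{VOT}_{ijk} = \mathrm{voice}_i + \mathrm{context}_j + \mathrm{error}_{ijk}$
(context takes different values for /stop_, /vowel_)
$\mathrm{VOT}_{ijk} = \mathrm{voice}_i + \mathrm{context}_j + (\mathrm{voice} \times \mathrm{context})_{ij} + \mathrm{error}_{ijk}$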

• Model fitting involves finding values for the model parameters that
yield the best fit between model and data (e.g. minimize the squared
errors, maximize the probability of the observed data).
• Hypothesis testing generally involves testing whether some term or
coefficient in the model is significantly different from zero.
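
A minimal sketch of fitting the simple model VOT = μ + voice + error by least squares follows. The VOT values are hypothetical, and the constraint that the two voice effects sum to zero is an assumption added here to make the parameters unique; the text does not specify one. Under that constraint, the least-squares fit puts μ at the average of the group means and each voice effect at its group mean minus μ.

```python
# Hypothetical VOT measurements in ms, grouped by voicing (illustrative only).
vot = {
    "voiceless": [110, 114, 112],
    "voiced": [56, 60, 58],
}

# Least-squares predictions for each group are the group means.
group_means = {v: sum(xs) / len(xs) for v, xs in vot.items()}

# Assumed sum-to-zero coding: mu is the average of the group means.
mu = sum(group_means.values()) / len(group_means)
voice = {v: gm - mu for v, gm in group_means.items()}


def sum_squared_errors(mu_value, voice_effects):
    """Sum of squared differences between the data and the model's predictions."""
    return sum(
        (x - (mu_value + voice_effects[v])) ** 2
        for v, xs in vot.items()
        for x in xs
    )


best = sum_squared_errors(mu, voice)
# Any other parameter values give a worse fit, for example:
worse = sum_squared_errors(mu, {"voiceless": 26.0, "voiced": -26.0})

print("mu =", mu, "voice effects =", voice)
print("squared error at fit:", best, "at perturbed values:", worse)
```

Testing whether voicing matters then amounts to asking whether the fitted voice effect differs significantly from zero.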

28
MIT OpenCourseWare
https://ocw.mit.edu

24.915 / 24.963 Linguistic Phonetics

Fall 2015

For information about citing these materials or our Terms of Use, visit: https://ocw.mit.edu/terms.
