30.9 - Unit 4 - Experimental
30.9 - Unit 4 - Experimental
Research Design
By Vuong Yen
Source: N01- Griffe, D. T, An introduction to second
language research methods: Design and Data. TESL- EJ
Publication, 2012 (p70-93)
Objectives:
• After finishing the lesson, students can grasp:
1. Experimental design defined
2. Key components of experimental design
3. The basic structure and function of experimental research
4. Advantages and disadvantages of ERD
Experimental design
1. Some technical terms used in Experimental research Design
1. Control group (Nhóm kiểm soát): A group of participants used for comparison purposes, similar
to the experimental group, but to whom no treatment (tác động) is administered. What
happens in the control group, it is argued, is what would happen if no experiment or treatment
were done. Results from the control group are known as counter-factual evidence.
2. Experimental group (nhóm thí nghiệm) : Also known as the treatment group, the group of
participants with whom the innovation or treatment is applied. The experimental group is
where change of some sort will be compared with the performance in the control group.
3. Dependent variable (Biến phụ thuộc): A variable that is usually the test used to measure results.
In terms of cause and effect, this is a measure of the effect.
4. Independent variable (Biến độc lập): The variable under investigation. For example, if a TREE
were using EXD to investigate listening comprehension, then listening comprehension would be
the independent variable. If a TREE were investigating how students acquire vocabulary, then
vocabulary acquisition would be the independent variable.
5. Hypothesis A hypothesis is a possible explanation. The null-hypothesis states
that there is no relationship between variables of interest. The research
hypothesis is the preferred explanation put forward by the TREE to explain
the results of the experiment. An alternative or rival hypothesis is an
explanation other than the research hypothesis that also might explain the
results.
6. Inferential statistics (thống kê suy luận) A class of statistics that does not
directly describe scores, but allows the TREE to infer or suppose something
that cannot be directly observed, usually the probability
that two scores are related (correlation) or not related (a significance test).
7. Intact class A class that is whole or intact before the TREE comes into
contact with it. A class is intact because either it forms itself (students select
the class and then enrol for some reason such as convenient time) or the class
is formed by others (administrative control). Intact classes are the norm
because most teachers cannot randomly assigned students.
8. Mean scores (Giá trị trung bình): The average score for a group of taking a test. A
mean score of 82.7 indicates that the average score for the whole class was about 83
points.
10. p-value (probability value)The small letter p stands for probability. A TREE
administers a test to two groups of students and wonders if the difference between
the resulting scores is likely the result of chance
variation. The answer comes in the form of a percent called the p-value. If the p-
value is small, traditionally .05 or smaller, the TREE can make the claim there is a
small probability, 5%, that they would get these scores if there were no relationship;
therefore, they can infer that there is a relationship. If the point of the experiment
was to show score improvement, the TREE hopes the mean difference is large and
the p-value is small.
11. Parametric statistics (Thống kê tham số): A family of statistical procedures
that requires normal distribution and a continuous scale. Some familiar
parametric statistics are the t-test, ANOVA, and Pearson correlation.
12. Pretest (tiền kiểm): A test that is administered before the experiment,
intervention, treatment, or teaching takes place. The purpose of the pretest is to
establish baseline data; in other words, it establishes what the situation is before
the intervention or treatment is administered.
13. Posttest (hậu kiểm) A test that is the same or very similar to the pretest,
and is administered after the experiment or teaching takes place.
14. Random assignment (Phân công ngẫu nhiên ): A type of placement that
refers to the process by which any participant has an equal chance to be
assigned to any group. One way of random assignment is to write names on
pieces of paper, put them into a container, and have somebody draw them out
one by one without seeing the names. Another way is to use a computer
program that can randomize a list of names or numbers.
15. Reliability (độ tin cậy) The idea of consistency of results or getting the same
or similar results each time data is collected. It was originated by Spearman as a
comparison of a hypothesized true score and observed score. It is often given in
terms of a percent marked with a decimal point, such as .82.
16. Scale Refers to unit of measurement. Every measuring device must have a
scale in which the measurement is reported. Tests are usually reported in one of
three scales: nominal (e.g., gender, nationality), ordinal (e.g., frequency counts), or
continuous (e.g., test scores).
17. Significance test (Kiểm định độ tin cậy) A statistical procedure that estimates
the difference between two (sometimes more depending on the type of test) sets
of scores. The results of a significance test is a p-value, which is an indicator of
the likelihood of obtaining this difference given the null hypothesis. A common
significance test for testing the difference between two and only two scores is
called a t-test. Another significance test is called analysis of variance - Phân tích
phương sai (ANOVA) and is used for testing the difference between two or more
groups.
18. Stratified sampling (Chọn mẫu phân tổ) Although identified with
SRD, stratified sampling can be used in EXD to equalize groups. For
example, a TREE considers ethnic groups important variable. The
TREE has a control group with 50% one ethnic group and 50%
another group. In order to make the treatment group similar to the
control group, the TREE wishes to use only those students in the two
ethnic groups in approximately a 50-50 proportion.
19. Threats The name given to a rival hypothesis used to explain the
difference between treatment groups.
For example, if a researcher were investigating the effects of an
innovative writing curriculum, a threat would be that, unknown to the
researcher, many students went to the writing center, and those visits
accounted for the score increase, not the innovative curriculum.
Attendance at the writing center, usually called a threat of history,
threatens the research hypothesis that it was the innovative writing
curriculum that caused the increase in scores.
20. Time-Series (dãy số thời gian) A variation of EXD where
instead of comparison of two groups, the TREE uses only one
group, takes several measurements to establish baseline data (to
establish which is usually the case), introduces an innovation or
treatment, and continues with additional measurements
(to establish the effect of the treatment). The main advantage is
only one group is necessary and can function as its own control
group.
21. Unit of Assignment The persons or things being studied.
Things being studied can be classes, schools, or even cities and
towns. An EXD can study one person, a group of persons, an
entire class, or a school.
22. Validity (Độ chuẩn xác) Test validity is traditionally defined
as providing evidence that shows or indicates that what the
TREE believes he or she is testing is actually being tested. In
practice, validity means offering an explanation of how one
checked or verified instrument results.
23. Variable (biến) A trait or quality that can be measured by
a test or other kind of data collection instrument. A variable
can be anything a TREE is interested in.
2. What is Experimental Research?
• Experimental research is a scientific approach to research, where one or
more independent variables are manipulated and applied to one or more
dependent variables to measure their effect on the latter. The effect of
the independent variables on the dependent variables is usually observed
and recorded over some time, to aid researchers in drawing a reasonable
conclusion regarding the relationship between these 2 variable types.
Eg: Một giảng viên muốn tìm hiểu về sự thay đổi về nhận thức của sinh viên đối với môn
học Pháp luật đại cương thông qua một bài trắc nghiệm được áp dụng trước và ngay sau
khi kết thúc học phân. Tuy kết quả lần trắc nghiệm sau cao hơn lần trắc nghiệm đầu,
giảng viên khó có thể đưa ra kết luận về sự thay đổi do môn học mang lợi bởi trong suốt
thời gian học, các phương tiện báo- đài có những chương trình về giáo dục pháp luật cho
thanh niên.
c. Static-group Comparison
(Thiết kế so sánh nhóm tĩnh)
• In a static-group comparison study, 2 or more groups are placed under
observation, where only one of the groups is subjected to some
treatment while the other groups are held static. All the groups are
post-tested, and the observed differences between the groups are
assumed to be a result of the treatment.
3.2. Quasi-experimental Research Design (Thiết kế bán
thử nghiệm)
• The word “quasi” means partial, half, or pseudo. Therefore, the quasi-
experimental research bearing a resemblance to the true experimental
research, but not the same. In quasi-experiments, the participants are not
randomly assigned, and as such, they are used in settings where
randomization is difficult or impossible.
• This is very common in educational research, where administrators are
unwilling to allow the random selection of students for experimental
samples.
• Some examples of quasi-experimental research design include: the time
series, no equivalent control group design, and the counterbalanced
design.
3.3. True Experimental Research Design
(thiết kế thử nghiệm đích thực)
• The true experimental research design relies on statistical analysis
to approve or disapprove a hypothesis. It is the most accurate type
of experimental design and may be carried out with or without a
pretest on at least 2 randomly assigned dependent subjects.
• The true experimental research design must contain a control
group, a variable that can be manipulated by the researcher, and
the distribution must be random. The classification of true
experimental design include:
a. The posttest-only Control Group Design
(Thiết kế chỉ kiểm tra sau)
• In this design, subjects are randomly selected and assigned to the
2 groups (control and experimental), and only the experimental
group is treated. After close observation, both groups are post-
tested, and a conclusion is drawn from the difference between
these groups.
Eg: Sau khi học hết HP NCKH trong giảng dạy tiếng Anh (X), sinh viên
năm 3 ngành sư phạm Anh, HPU2 (A) tham gia một cuộc kiểm tra (O)
để được đánh giá khả năng tiếp thu môn học.
b. The pretest-posttest Control Group Design
(T/kế kiểm tra trước sau)
• For this control group design, subjects are randomly assigned to the 2 groups,
both are presented, but only the experimental group is treated. After close
observation, both groups are post-tested to measure the degree of change in
each group.
c. Solomon four-group Design
(Thiết hiết kế 4 nhóm)
• This is the combination of the pretest-only and the pretest-
posttest control groups. In this case, the randomly selected
subjects are placed into 4 groups.
=> The first two of these groups are tested using the
posttest-only method, while the other two are
tested using the pretest-posttest method.
4. Examples of Experimental Research