Explicit and Implicit Lexical Knowledge Acquisition of Collocations Under Different Input Condictions (2013)
Explicit and Implicit Lexical Knowledge Acquisition of Collocations Under Different Input Condictions (2013)
Norbert Schmitt
University of Nottingham
To date, there has been little empirical research exploring the relationship between
implicit and explicit lexical knowledge (of collocations). As a first step in addressing
this gap, two laboratory experiments were conducted that evaluate different conditions
(enriched, enhanced, and decontextualized) under which both adult native speakers
(Experiment 1) and advanced nonnative speakers of English (Experiment 2) acquire
collocations. Three different tests of collocational knowledge were used to assess gains
after treatment: two traditional explicit tests (form recall and form recognition) and an
innovative implicit test (priming). Results from mixed-effects modeling showed that all
conditions led to significant long-term gains in explicit form recall and recognition both
for natives and nonnatives, while no condition facilitated implicit collocational priming
effects either for natives or nonnatives.
Keywords vocabulary; collocations; explicit knowledge; implicit knowledge; priming;
mixed-effects modeling
Introduction
The relationship between explicit and implicit knowledge has long been the
focus of research in psychology (for an overview, see Squire, 2004), and a
similar interest has also emerged in applied linguistics and second language
acquisition (SLA) circles (N. Ellis, 2008). To date, most SLA research into
the explicit/implicit linguistic knowledge distinction has focused on second
Correspondence concerning this article should be addressed to Suhad Sonbul, P.O. Box 10424,
Makkah, Saudi Arabia. Internet: [email protected]
DOI: 10.1111/j.1467-9922.2012.00730.x
14679922, 2013, 1, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/j.1467-9922.2012.00730.x by Pontificia Universidad Catolica De Chile, Wiley Online Library on [20/08/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
Sonbul and Schmitt Explicit and Implicit Lexical Knowledge
language (L2) grammar acquisition (de Graaff, 1997; R. Ellis, 2005), although
the nature of the relationship between explicit and implicit grammatical knowl-
edge remains controversial. In contrast, there has been very little consider-
ation of the distinction’s implications for lexical knowledge. This might be
due to the traditional dictionary metaphor which views the lexicon as noth-
ing more than a list of forms and meanings that can be associated fairly
easily through rote learning. However, the view that vocabulary knowledge
is a much more complex construct is now gaining acceptance, with several
lexical scholars claiming that it entails a number of different dimensions in-
volving both explicit and implicit mental representations (Aitchison, 2003; N.
Ellis, 1994; Nation, 2001; Schmitt, 2010). Still, the exact nature and interre-
lationship of explicit and implicit vocabulary knowledge has only begun to be
investigated.
This study addresses this emerging lexical area by reporting on a two-
experiment study (one with natives and one with nonnatives) exploring how
well various direct and indirect input conditions promote both explicit and
implicit knowledge of one aspect of vocabulary knowledge: collocations.
is contrasted with strategic priming in that the former operates rapidly without
participants’ conscious intention (implicit knowledge) while the latter requires
intention and reflects slower processes (explicit knowledge) (Dagenbach, Horst,
& Carr, 1990). After discussing different variables determining the nature of
priming processes (stimulus onset asynchrony [SOA], the nature of task, the
type of semantic relation, relatedness proportion [RP] measured as the number
of related word-prime trials divided by all word-target trials, and nonword ratio
[NR] measured as the ratio of nonword trials to all unrelated trials), McNamara
(2005) concludes:
An investigator interested in the automatic component of semantic
priming would be well advised to use a short SOA (e.g., 200 ms or less), a
low RP (e.g., <0.20?), and an NR of 0.50. Under these conditions, the
task and the type of semantic relation do not seem to matter much. (p. 72)
Unfortunately, none of the three experiments mentioned above met these re-
quirements. Thus, although they did show collocational priming, how this prim-
ing relates to implicit knowledge (vs. strategic explicit knowledge) is not clear.
Method
Participants
The 35 native participants (15 males and 20 females) were undergradu-
ates studying at a British university who received a payment (£8) for their
participation. They had no previous medical training and ranged in age from 18
to 27 (M = 19.54, SD = 1.74). Data for a further 8 participants was excluded
from the experiment because of (1) low (less than 60%) scores in the reading
comprehension test (see Procedure Section below) (N = 1), (2) having too
many (more than 15%) incorrect or extreme responses in the lexical decision
task (N = 2), or (3) not showing up in the delayed testing session (N = 5).
Stimuli
We intended to set up as natural a pedagogical setting as possible, and so wished
to use real collocations as stimuli. However, this required finding collocations
which educated native speakers would not already know. We decided to use
medical collocations, as these are unlikely to be known by people without
medical training. Choosing medical collocations for the present study involved
two stages. First, five medical resources1 were consulted in an attempt to
identify candidates that fit the following criteria:
Materials
We then chose a reading passage from The Language of Medicine in English
(Tiersky & Tiersky, 1992) which covered a wide range of medical condi-
tions, thus facilitating the inclusion of the intact collocations. According to
Rott (2009), a single exposure can result in an initial memory trace (see also
Durrant & Schmitt, 2010) which should later be strengthened through subse-
quent exposures. Thus, the passage was adopted and shortened to include three
occurrences of 10 intact collocations, varying according to our two contextu-
alized experimental conditions (i.e., enriched and enhanced) as follows. Five
collocations were embedded in the passage (the enriched condition) while the
other five were embedded and made salient with red, bold font (the enhanced
condition). In the third learning condition, five collocations were taught in
isolation in a PowerPoint presentation (the decontextualized condition), where
each collocation was presented individually on a different slide (no meaning
was provided). Each collocation was presented in red font, flashing on the
screen three times for a total of 10 seconds.
The 15 collocations and the 3 learning conditions were placed into a counter-
balanced design consisting of three experimental blocks, as shown in Figure 1.
Because it was not possible to administer a pretest in order not to attract partic-
ipants’ attention to the critical stimuli before the experiment, a control group
was included as a baseline for comparison. The 35 participants were randomly
assigned to the experimental groups (Block 1, N = 10; Block 2, N = 8; Block
3, N = 8) or to the control group (N = 9).
Measures
Because the focus of the present experiment is on the distinction between
explicit and implicit collocational knowledge, three measures were developed:
two explicit and one implicit. The two explicit measures were based on Schmitt,
Dornyei, Adolphs, and Durow (2004). The first (form recall) employed a cloze
format to measure participants’ ability to recall the form of the collocation.
Participants were presented with a summary of the reading passage with target
collocations in parentheses. A brief definition of each collocation was included
in the margin as a clue. To avoid any potential effects of this test on the
subsequent form recognition one, the first letter of each missing word was not
included. Alternatively, the number of letters in each space was represented by
dashes to restrict participants’ responses. The following example is the form
recall test item for vanishing lung:
Lung infections might lead to different conditions such as the Decrease in the lung’s
( _ _ _ _ _ _ _ _ _ lung) syndrome. size as shown in an X-ray
The second explicit test measured form recognition. It used the same pas-
sage as in the form recall test, but with multiple-choice items given in the
margin. Each item included five options: the correct first word (the key), three
plausible distracters (one is the control and two other synonyms with the same
part of speech), and one “I don’t know” option to reduce guessing. Below is
the form recognition test item for vanishing lung:
a. fading
b. declining
Lung infections might lead to different conditions such as the ( ________ c. decaying
lung) syndrome. d. vanishing
e. I DON’T KNOW
While the explicit measures are fairly traditional, the measurement of implicit
collocation knowledge required a more innovative approach. We opted to use the
priming paradigm to capture implicit knowledge. In our priming task (designed
R
on E-Prime , Psychology Software Tools, Inc., www.pstnet.com), participants
were presented with the first word of the collocation as the prime and the second
as the target. They had to decide whether the second string of letters is a real
English word or not by pressing the appropriate key (YES or NO) on a response
box. The priming task was designed in accordance with McNamara’s (2005)
guidelines to ensure automatic, nonstrategic priming.
Critical stimuli for this task included the 15 intact (exposed) collocations
and their 15 control pairs. The intact and control primes were combined into
two counterbalanced lists such that seven or eight intact pairs and seven or eight
control pairs were included in each list. Targets which were matched with their
intact primes in one list were matched with their control primes in the other.
No prime or target word was used more than once in either list. In addition,
53 filler items were included, in which the second item of the pair was either
a word (K = 22) or a nonword (K = 31).2 All nonwords were selected from
the ARC nonword database (Rastle, Harrington, & Coltheart, 2002) and were
pronounceable and orthographically legal. In order to reduce the effect of the
implicit test on the explicit form recognition test, 30 fillers included distracters
from the multiple-choice test as primes. Thus, in each stimuli list, participants
were presented with 68 trials (7 or 8 intact pairs, 7 or 8 control pairs, and 53
fillers). Half of the participants (N = 18) were tested with the first stimuli set,
while the other half (N = 17) were tested with the second set. Twenty practice
trials consisting of nonstudy items were constructed to allow participants to get
used to the lexical decision task.
In each trial, a fixation point was presented (2,000 milliseconds) followed
by the first word of each pair (the prime), which remained on the screen for 150
milliseconds. This was immediately replaced by the second string (the target),
which remained on the screen until the participant had made a lexical decision.
The order of the trials was randomized across subjects to avoid potential order
effects. E-Prime recorded accuracy and response latency for each participant
in all trials.
Procedure
Upon arrival at the lab for the single-person session, the participant signed the
consent form in which he or she was briefed on the purpose of the study. Then,
the study phase began. While participants in the experimental blocks took part
in both study and test phases, those in the control group only took part in the
test phase. The participant was instructed to read the whole passage once within
10 minutes and to try to focus on the message as there would be some compre-
hension questions at the end. Upon finishing, the first author took the reading
passage and participants were instructed to watch a PowerPoint presentation
that included five collocations initiated with the following instruction:
Now, you will be presented with five different phrases that usually occur
in medical contexts. Please study them carefully and try to remember the
combination.
that participants were reading for meaning; the mean score gained by the 26
participants in the experimental group was acceptable (7.26 out of 8.00, 90.87%,
SD = 0.96). Second, the test functioned as an interval between the study phase
and the test phase.
To avoid earlier measures affecting later ones, the tests were carefully
sequenced. The first was the computerized priming task, which took 6 minutes.
The participants then performed a distracter task (a digit-span memory test)
that was intended to minimize any effect of the implicit test on the subsequent
explicit tests. Next the participants completed the paper-and-pencil form recall
test within 10 minutes, followed by the form recognition test (5 minutes).
Finally, participants filled out a short background questionnaire.
At the end of the session, participants were invited to a follow-up experiment
to be conducted two weeks later. This entailed a delayed testing session where
the participants took the explicit and implicit tests without being exposed to
any treatment. To minimize any possible transfer from the immediate to the
delayed testing sessions, the choices were reordered in the delayed multiple-
choice test.
Third, it can handle interval-scale (e.g., reaction time) measures through linear
mixed-effects (LME; see Baayen, 2008) models as well as categorical (e.g.,
correct versus incorrect) measures using mixed logit models (the LME alter-
native for categorical data; see Jaeger, 2008). Lastly, it can cope with missing
values and imbalanced designs. Because the present study involved both a cor-
rect/incorrect categorical measure along with an interval-scale reaction time
measure and because we aimed at testing various item-related and participant-
related factors, we opted to use mixed-effects modeling as the analysis method.
The method starts with the simplest (null) model, which includes only the
dependent measure and the random variables. Fixed effects are then added incre-
mentally and χ 2 (likelihood ratio) tests are used to check whether the inclusion
of additional predictors contributes significantly to the model. Once the interim
best-fit model is reached, variables are excluded one by one to check whether
any predictor is redundant (i.e., a variable whose presence/absence in the model
does not lead to any significant difference) and can thus be excluded. Once these
are excluded, we are left with the final best-fit model, which best describes the
overall variance in the research design. As mixed-effects modeling is likely
to be new for many readers, we provide additional detailed description of the
procedure we followed in Appendix S3 of the Supporting Information online.
We developed two mixed-effects models, one for explicit knowledge and
one for implicit knowledge. For explicit knowledge, the analysis (mixed logit
modeling) started with a null model including our binomial dependent variable
(CORRECT versus INCORRECT responses) and participants and items as ran-
dom effects. We then added predictor variables incrementally in the following
order: main effects (TEST TYPE: form recall, form recognition; CONDITION:
enriched, enhanced, decontextualized, control; and SESSION: immediate, de-
layed3 ) and then item-related effects (length of Word 1, log frequency of Word
1, length of Word 2, log frequency of Word 2, frequency of the collocation).
Finally, variables were excluded one by one to check for redundancy to arrive
at the final best-fit model. The procedure for implicit knowledge (using LME
modeling) was the same except that the dependent variable was log response
time4 (reaction times were log-transformed to reduce skewness in the distribu-
tion) instead of CORRECT/INCORRECT responses, and the first main effect
was ITEM TYPE (intact, control) instead of TEST TYPE.
Results
Explicit Form Recall and Form Recognition Tests
An assumption of the experiment is that the medical collocations were unknown
to the participants. This assumption is supported by the control group results
Table 1 Responses and percentile scores in explicit tests (form recall and recognition)
under all conditions (natives)
Explicit Knowledge Tested
(Table 1), which showed very low form recall collocation knowledge (1.5%).
However, it should be noted that the recall percentage score for the delayed test
(9.6%) was higher, which suggests that the implicit priming test might have had
a direct effect on explicit test results (see Limitations section). As opposed to
the lack of form recall knowledge by the control group, this group answered one
third of the items in the recognition test correctly (around 32% immediate and
34% delayed). This implies that participants were merely guessing on the four-
option multiple-choice test. Thus, if participants under the treatment conditions
score higher than the 9.6% level (recall) and 34.1% level (recognition), gains
should be attributed to the treatment.
We modeled CORRECT answers likelihood using a mixed logit model.
Main effects were added to the null model incrementally. The addition of TEST
TYPE significantly improved the null model (χ 2 (1) = 429.80, p < .001, N =
2,100). Adding CONDITION further improved the model (χ 2 (3) = 26.68, p <
.001, N = 2,100). Then, a TEST TYPE × CONDITION interaction was added
to the model but this did not lead to any significant improvement (χ 2 (3) = 0.15,
p = 0.99, N = 2,100). Moreover, adding SESSION as a main effect did not
improve the model (χ 2 (1) = 0.08, p = .78, N = 2,100), neither did adding it as
an interacting variable with CONDITION (χ 2 (4) = 5.15, p = .27, N = 2,100) or
TEST TYPE (χ 2 (2) = 1.14, p = .56, N = 2100). SESSION was thus removed
from the model. Item-related variables were then added incrementally, but none
Table 2 Summary of the best fit mixed logit model for variables predicting natives’
correct explicit responses
improved the model. Finally, significant predictors were removed backward but
none was found to be redundant. The final best-fit model is presented in Table 2.
The best-fit mixed logit model indicates that more correct responses were
produced in the recognition test (634/863) than in the recall test (229/863).
Also, more correct responses (237, 252, 270/863) were produced under the
three experimental conditions (enriched, enhanced, decontextualized, respec-
tively) than those produced by the control group (104/863). In order to assess
differences across experimental conditions, a multiple comparison analysis (see
Bretz, Hothorn, & Westfall, 2010) was conducted on the CONDITION variable.
Results showed that more correct responses were produced under the decon-
textualized condition than the enriched condition (Wald Z = 2.55, p = .046)
while no significant difference was found between the enhanced condition and
the other two conditions (enriched: Wald Z = -1.17, p = .63, decontextualized:
Wald Z = 1.38, p = .49). Third, lack of interaction between TEST TYPE and
CONDITION implies that (1) the advantage of the recognition gains over re-
call gains was present under all conditions and (2) significant differences across
conditions were present both for recall and recognition gains. Finally, the fact
that SESSION was not a significant predictor implies that explicit gains did not
drop significantly after the 2-week gap and that the effects reported above are
true for immediate and delayed results.
Overall, immediate and delayed results (for both recall and recognition
tests) can be summarized as follows:
Figure 2 Percentages and comparisons of explicit (form recall and recognition) scores
under all conditions in both sessions (natives).
These results are also graphically illustrated in Figure 2. The three experimental
conditions led to higher explicit gains than the control group (around 16–19
percentage points for recall and 30–37 percentage points for recognition both
on the delayed test5 ). Also, the decontextualized condition led to more correct
answers than the enriched condition on the delayed test (two percentage point
difference in recall and six in recognition).
Table 3 Mean reaction time (RT) in milliseconds for intact and control items under all
conditions (natives)
lead to any significant improvement in the null model (χ 2 (3) = 4.00, p = .26,
N = 1,001), and so was also removed. SESSION was added as a predictor and
was found to significantly improve the model (χ 2 (1) = 10.11, p = .001, N =
1,001) with shorter (faster) reaction time in the delayed session. After testing
all main effects, item-related variables were added incrementally to assess their
effects on log reaction time. Only Word 1 length (χ 2 (1) = 5.27, p = .02, N =
1,001), Word 2 length (χ 2 (1) = 18.13, p < .001, N = 1,001), and Word 2 log
frequency (χ 2 (1) = 15.04, p < .001, N = 1,001) led to significant improvement
in the model. Thus, the shorter the first word and the shorter/more frequent the
second word, the shorter the log reaction time is. We then tested interactions
between significant predictors (SESSION, W1 length, W2 length, and W2 log
Frequency). Only the interaction between W2 length and W2 log Frequency led
to a significant improvement in the model (χ 2 (1) = 5.71, p = .02, N = 1,001).
In this model, W2 log frequency ceased to be a significant predictor. Thus, the
effect of the second word frequency on reaction times (RT) does not seem to be
direct but is rather modulated by length (i.e., more frequent second words lead
to faster RT only if they are short). Finally, removing the significant predictors
backward did not show any redundant effects, and so all were retained. Table 4
presents the final best-fit model.
The nonsignificant interaction between CONDITION and ITEM TYPE in-
dicates that natives’ RT for intact versus control primes was not predicted by
the type of input (enriched, enhanced, decontextualized, or no input [control])
either in the immediate or the delayed sessions. This result provides a clear
answer to the third research question; namely, natives did not develop implicit
Table 4 Summary of the best fit LME model for variables predicting natives’ log
reaction time
Predictor Estimate MCMC SE t pMCMC Pr(>|t|)
Figure 3 Mean reaction times for intact and control items under all conditions in both
sessions (natives).
Discussion
Experiment 1 compared three treatment conditions for their ability to promote
collocation learning. On the explicit side of knowledge (form recall and form
recognition), all treatment conditions led to learning. Moreover, the decontex-
tualized condition led to higher gains than the enriched condition for both types
of knowledge and in both testing sessions. However, the enhanced condition
seems to stand in the middle between the other two treatment conditions, in that
it did not lead to any significant difference. Furthermore, just as with single
words, form recognition knowledge is stronger than form recall knowledge.
In terms of implicit knowledge, one may have speculated that explicit
learning conditions would lead to explicit knowledge, while implicit knowl-
edge would accrue best from an indirect learning condition without an explicit
component. However, we found that natives did not develop either short-term
or long-term implicit memory traces for intact collocations under any treat-
ment condition. To conclude, it seems that promoting explicit knowledge in
native speakers is possible with a wide range of treatment conditions. Implicit
knowledge, on the other hand, seems to be difficult to develop from such a short
treatment period.
university, who had all met the university entry requirement (minimum IELTS
score of 6.00 or TOEFL score of 550). They ranged in age from 21 to 41 (M =
28.65, SD = 4.91) and came from a variety of first language (L1) backgrounds
(Arabic = 6, Bengali = 1, Bulgarian = 1, Chinese = 9, Dutch = 2, Farsi = 1,
French = 2, Hindi = 1, Italian = 1, Kurdish = 1, Luxemburgish = 1, Malay =
2, Polish = 2, Portuguese = 2, Romanian = 2, Russian = 1, Spanish = 3,
Swedish = 1, Swiss = 1, Thai = 1, Turkish = 1, and Yoruba = 1). The
participants had spent a mean of 25.59 months in the United Kingdom (SD =
26.91, Min = 6, Max = 132), but were first exposed to English at an average age
of 10.78 years (SD = 4.41, Min = 3, Max = 22). Their self-rated proficiency
scores (on a scale from 1 = very poor to 5 = excellent) were: reading M =
4.14, SD = 0.74; writing M = 3.97, SD = 0.89; speaking M = 3.86, SD = 0.64,
and listening M = 3.95, SD = 0.69. Their overall self-rated proficiency score
(averaged across skills) was 3.98 (SD = 0.62). The participants were offered
a £10 payment for their participation. Data for a further 5 participants were
excluded from the experiment for having too many (more than 15%) incorrect
or extreme responses in the lexical decision task (N = 3) or for not showing up
for the delayed test (N = 2).
Stimuli
The procedures from the previous experiment were employed in choosing med-
ical intact collocations and their controls, except for the following additional
criterion:
This step was intended to ensure that the nonnatives knew the individual words
comprising each collocation. We ended up with 15 viable items (see Appendix
S2 in the online Supporting Information). These were divided into three bal-
anced sets of five collocations, which were then counterbalanced across the
three treatment conditions, as shown in Figure 4.
Materials
The passage from Experiment 1 was adapted to include three occurrences of
each collocation and simplified somewhat to be of a suitable level of difficulty
for the nonnative participants.7 Other than this, the treatment conditions were
the same. The 43 participants were randomly assigned to the experimental
groups (Block 1, N = 11; Block 2, N = 11; Block 3, N = 12) or to the
control group (N = 9). Participants in these blocks did not differ in terms of
the time they spent in the United Kingdom, age they were exposed to English,
or self-rated proficiency (all ps > .20).
Results
Explicit Form Recall and Form Recognition Tests
The experiment assumed that medical collocations were unknown to the non-
native participants, and the results of the control group on the immediate tests
seem to confirm this (Table 5), with only a 2.2% score on the form recall test,
and a 29.6% on the form recognition test, which is close to chance. However, it
should be noted that, similar to results of Experiment 1, the percentage scores
for the delayed tests (10.4% recall, 45.9% recognition) are far higher.
Table 5 Responses and percentile scores in explicit tests (form recall and recognition)
under all conditions (nonnatives)
Explicit Knowledge Tested
Table 6 Summary of the best fit mixed logit model for variables predicting nonnatives’
correct explicit responses
Predictor Estimate SE Wald-Z p
(Intercept) −7.63 1.05 −7.25 <.001
TEST TYPE: recognition 3.87 0.72 5.36 <.001
CONDITION: decontextualized 1.60 0.35 4.54 <.001
enhanced 2.12 0.36 5.96 <.001
enriched 1.42 0.35 4.04 <.001
SESSION: immediate 1.42 0.72 1.98 0.047
Overall Proficiency 1.32 0.25 5.37 <.001
CONDITION: (decontextualized) X SESSION: (immediate) 1.48 0.31 4.70 <.001
(enhanced) 1.42 0.32 4.48 <.001
(enriched) 1.17 0.31 3.76 <.001
TEST TYPE: (recognition) X Overall Proficiency −0.44 0.18 −2.45 0.01
SESSION: (immediate) X Overall Proficiency −0.61 0.17 −3.55 <.001
Note. The model has random intercepts for participants and items. N = 2,580, log-
likelihood = −1,249.
Figure 5 Interaction between proficiency and TEST TYPE (left panel) and between
proficiency and SESSION (right panel) in the logit model for the nonnative explicit data
(Note. The vertical axis depicts the probability of CORRECT answers transformed from
log odds).
(3) a significant difference between the immediate and the delayed sessions
was only present for the control group (with fewer correct responses in the
immediate session). Finally, CONDITION did not interact with TEST TYPE
suggesting similar results for both recall and recognition tests.
To conclude, the three treatment conditions led to delayed recall gains of
27–37 percentage points and to delayed recognition gains of 21–32 percentage
points over the control group. Also, the enhanced condition led to more correct
answers than the enriched condition in the delayed test (10 and 11 percentage
point difference in recall and recognition, respectively). These results can be
summarized as follows (and are illustrated in Figure 6):
Table 8 Mean reaction rime (RT) in milliseconds for intact and control items under all
conditions (nonnatives)
Figure 6 Percentages and comparisons of explicit (form recall and recognition) scores
under all conditions in both sessions (nonnatives).
to the null model. Similar to Experiment 1, ITEM TYPE did not improve the
null model (χ 2 (1) = 0.00, p = 1.00, N = 1,246), neither did CONDITION
(χ 2 (3) = 6.77, p = .08, N = 1,246). However, SESSION did improve the model
(χ 2 (1) = 17.63, p < .001, N = 1,246) with shorter (faster) reaction time in the
delayed session. Only two-item related variables further improved the model:
Word 2 length (χ 2 (1) = 27.92, p < .001, N = 1,246) and Word 2 log frequency
(χ 2 (1) = 13.42, p < .001, N = 1,246). The shorter and more frequent the
second word, the shorter the log reaction time is. Finally, overall proficiency
was added to the model and was found to significantly improve it (χ 2 (1) = 7.05,
p = .008, N = 1,246). The more proficient the participant, the shorter his/her
log reaction time is. We then tested interactions between significant predictors
(SESSION, W2 length, W2 Frequency, and overall proficiency), but none led to
any significant improvement in the model. Finally, when removing significant
predictors backward, none was found to be redundant. Table 9 presents the final
best-fit model.
Similar to Experiment 1, the best fit LME model seems to suggest that
nonnatives’ RT performance for intact as opposed to control items was
not predicted by condition (i.e., no interaction between ITEM TYPE and
CONDITION). It can be concluded, thus, that implicit knowledge was not
Table 9 Summary of the best fit LME model for variables predicting nonnatives’ log
reaction time
Predictor Estimate MCMCmean SE t pMCMC Pr(>|t|)
Figure 7 Mean reaction times for intact and control items under all conditions in both
sessions (nonnatives).
developed under any condition. The results can be summarized as follows (see
Figure 7):
Discussion
As opposed to native speakers, explicit learning plays a much more prominent
role in L2 vocabulary acquisition (e.g., Nation, 2001; Schmitt, 2008), although
little is known about the learning of collocations. Experiment 2 explored three
learning conditions and found that they all led to significant gains in collocation
knowledge at both the form recall and form recognition levels of mastery. This
was true for the immediate test, but more importantly, it was also the case of the
delayed test. Therefore, all three of the treatments were effective in facilitating
durable learning. Moreover, the enhanced condition was more effective than
the enriched condition in enhancing both types of explicit knowledge (though
not more effective than the decontextualized condition). Finally, recognition
gains were always higher than recall gains and this difference increased as the
proficiency of the L2 learners increased.
What about implicit knowledge? The immediate and delayed priming tests
showed that there was no improvement of this knowledge. It seems that the types
and amounts of treatment exposure in this experiment were not sufficient to
establish long-term implicit knowledge, at least not that the priming paradigm
could capture. Overall, although the treatments all facilitated the learning of
explicit collocational knowledge, they did not facilitate measurable learning of
implicit knowledge.
General Discussion
The Relationship Between Explicit and Implicit Lexical Knowledge
Previous research has shown that direct conditions are generally more effec-
tive than indirect conditions in promoting lexical learning, but it must be said
that this research has a number of important limitations: (1) the lexical items
researched have almost always been individual words, (2) the aspect of word
knowledge addressed has almost always been the form-meaning link, and (3)
the research measurements have almost always been explicit tests. It is there-
fore perhaps not surprising that direct approaches have usually been shown
to have an advantage, because the form-meaning link of individual words is
relatively amenable to explicit introspection, and the explicit measurements
have privileged explicit knowledge. Thus, it is unclear whether nonexplicit in-
direct learning conditions can be as, or more, effective in promoting implicit
knowledge, simply because implicit knowledge has seldom been tested before.
the learning of implicit knowledge more than that of explicit knowledge. That
is, it may be that recycling is more important for the learning of implicit knowl-
edge than it is for explicit knowledge. R. Ellis (2007) observes that studies that
did show growth in implicit grammatical knowledge involved prolonged and
substantial instruction.
that while natives tend to focus mainly on the communicative meaning, non-
natives inevitably focus relatively more on the language form, and this may be
a factor in enhancing their learning of collocational word combinations from
input (we did not test the learning of the collocation’s meaning) at least on the
explicit side. This conclusion is sustained by the finding that, in comparison
with the enriched condition, natives benefited more from decontextualized ex-
posure where their attention was explicitly focused on the collocation’s form,
while advanced nonnatives benefited more from having their attention focused
on the collocations through contextualized, salient exposure.
just beginning to consider how to facilitate fluent and automatic (i.e., implicit)
knowledge. Future research should attempt to develop and validate other tests
of implicit lexical knowledge (eye tracking, ERP, among other behavioral, free
constructed, tasks) to inform the explicit/implicit debate. Another interesting
line for future research concerns how beginning and intermediate English-as-
a-foreign-language learners (as opposed to the advanced ESL learners inves-
tigated here) develop implicit, in addition to explicit, collocational knowledge
in language classrooms. Our results, along with Webb and Kagimoto (2009),
have shown proficiency as a major predictor of performance on explicit tests
of collocational knowledge. It is interesting to see what would happen when
implicit collocational knowledge is assessed.
The nature of explicit and implicit lexical knowledge and the interrelation-
ship between them is an emerging lexical issue that is likely to attract increased
attention in the future. This study is a start in exploring this topic, but can
only be considered a first step. It will surely raise more questions than it has
answered, but hopefully it has at least begun the discussion.
Revised version accepted 20 December 2011
Notes
1 (i) Blakiston’s Gould medical dictionary (Gould & Gennaro, 1979), (ii) Collins
dictionary [of] medicine (Youngson, 2004), (iii) Mosby’s medical, nursing, and
allied health dictionary (Anderson, Keith, & Novak, 2002), (iv) Stedman’s medical
terms and phrases (Stedman, 2004), and (v) Stedman’s medical dictionary for the
health professions and nursing (Stedman, 2005).
2 We tried our best to achieve McNamara’s (2005) recommended ratios for automatic
priming. Relatedness proportion (RP) was close to the 0.20 value (the number of
related trials was 7 or 8 out of all 37 word-word trials (7 or 8/37 ≈ 0.20). As for
nonword ratio (NR), there were 31 nonword trials out of all the 60 or 61 unrelated
trials resulting in the acceptable NR of ≈ 0.50 (31/60 or 61).
3 When categorical variables are entered as predictors in a logit mixed model or a
LME model, the levels are ordered alphabetically and the first is treated as the
reference level (e.g., control is the reference level for the CONDITION variable
here). Thus, when any level is found to be significant, this would mean that it is
significant in comparison to the reference level.
4 For the implicit, priming test, responses shorter than 150 ms or longer than 1300 ms
(constituting only 2.15% of all data points) were excluded from further analysis. The
long cut-off time in the present priming task is far higher than that used in previous
research. However, it should be noted that words used in the present lexical decision
task are rather low-frequency (see Appendix S1 in the Supporting Information) and
would be processed slower than those included in previous research.
5 Because immediate tests can only inform about short-term effects, we prefer to
interpret immediate gains as short-term enhancement effects. We reserve the notion
of “learning” for gains on delayed tests, as only they can demonstrate acquisition
that is durable (see Schmitt, 2010, for more on this).
6 For both experiments, only correct responses were included in the final reaction time
analysis.
7 The mean reading-comprehension score gained by the 34 participants in the
experimental group was acceptable. All participants scored higher than 60% in this
test (Mean = 6.88 out of 8.00, 86.03%, SD = 0.84).
8 Only 1.09% of the data points coded as CORRECT in the explicit recall test
represent partial, rather than complete, knowledge in Experiment 2.
9 This analysis was run in a model including the CONDITION × SESSION
interaction only (excluding all other significant predictors) in order to catch all
possible contrasts for this interaction.
References
Aitchison, J. (2003). Words in the mind: An introduction to the mental lexicon (3rd
ed.). Oxford, UK: Blackwell.
Anderson, D. M., Keith, J., & Novak, P. D. (2002). Mosby’s medical, nursing, and
allied health dictionary (6th ed.). London: Mosby.
Andringa, S., de Glopper, K., & Hacquebord, H. (2011). Effect of explicit and implicit
instruction on free written response task performance. Language Learning, 61,
868–903.
Baayen, R. H. (2008). Analyzing linguistic data: A practical introduction to statistics
using R. Cambridge, UK: Cambridge University Press.
Bialystok, E. (1981). Some evidence for the integrity and interaction of two knowledge
sources. In R. Anderson (Ed.), New dimensions in research on the acquisition and
use of a second language (pp. 62–74). Rowley, MA: Newbury House.
Bishop, H. (2004). Noticing formulaic sequences—A problem of measuring the
subjective. LSO Working Papers in Linguistics, 4, 15–19.
Bretz, F., Hothorn, T., & Westfall, P. (2010). Multiple comparisons using R. Boca
Raton, FL: Chapman & Hall/CRC.
Dagenbach, D., Horst, S., & Carr, T. H. (1990). Adding new information to semantic
memory: How much learning is enough to produce automatic priming? Journal of
Experimental Psychology: Learning, Memory, and Cognition, 16, 581–591.
Davies, M. (2007). VIEW: Variation in English words and phrases. Retrieved July 22,
2011, from http://corpus.byu.edu/bnc/
de Graaff, R. (1997). The eXperanto experiments: Effects of explicit instruction on
second language acquisition. Studies in Second Language Acquisition, 19, 249–276.
Ellis, R. (2009). Implicit and explicit learning, knowledge, and instruction. In R. Ellis
et al. (Eds.), Explicit and implicit knowledge in second language learning, testing
and teaching (pp. 3–25). Bristol, UK: Multilingual Matters.
Gagné, C. L., & Spalding, T. L. (2009). Constituent integration during the processing
of compound words: Does it involve the use of relational structures? Journal of
Memory and Language, 60, 20–35.
Gould, G. M., & Gennaro, A. R. (1979). Blakiston’s Gould medical dictionary: A
modern comprehensive dictionary of the terms used in all branches of medicine and
allied sciences, with illustrations and tables (4th ed.). New York/London:
McGraw-Hill.
Hoey, M. (1991). Patterns of lexis in text. Oxford, UK: Oxford University Press.
Hoey, M. (2005). Lexical priming: A new theory of words and language. London:
Routledge.
Hulstijn, J. H. (2007). Psycholinguistic perspectives on language and its acquisition. In
J. Cummins & C. Davison (Eds.), The international handbook of English language
teaching (pp. 783–796). Norwell, MA: Springer.
Hulstijn, J. H., & de Graaff, R. (1994). Under what conditions does explicit knowledge
of a second language facilitate the acquisition of implicit knowledge? AILA Review,
11, 97–112.
Jaeger, T. F. (2008). Categorical data analysis: Away from ANOVAs (transformation or
not) and towards logit mixed models. Journal of Memory and Language, 59,
434–446.
Krashen, S. D. (1981). Second language acquisition and second language learning.
London: Pergamon Press.
Laufer, B., & Girsai, N. (2008). Form-focused instruction in second language
vocabulary learning: A case for contrastive analysis and translation. Applied
Linguistics, 29, 694–716.
Leech, G., Rayson, P., & Wilson, A. (2001). Word frequencies in written and spoken
English: Based on the British National Corpus. London: Longman.
Long, M. H. (1983). Does second language instruction make a difference? A review of
the research. TESOL Quarterly, 17, 359–382.
Mackey, A., & Goo, J. (2007). Interaction research in SLA: A meta-analysis and
research synthesis. In A. Mackey (Ed.), Conversational interaction in second
language acquisition: A series of empirical studies (pp. 407–452). Oxford, UK:
Oxford University Press.
McKoon, G., & Ratcliff, R. (1992). Spreading activation versus compound cue
accounts of priming: Mediated priming revisited. Journal of Experimental
Psychology: Learning, Memory, and Cognition, 18, 1155–1172.
McLaughlin, B., Rossman, T., & McLeod, B. (1983). Second language learning: An
information-processing perspective. Language Learning, 33, 135–158.
McLaughlin, J., Osterhout, L., & Kim, A. (2004). Neural correlates of second
language word learning: Minimal instruction produces rapid change. Nature
Neuroscience, 7, 703–704.
157 Language Learning 63:1, March 2013, pp. 121–159
14679922, 2013, 1, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/j.1467-9922.2012.00730.x by Pontificia Universidad Catolica De Chile, Wiley Online Library on [20/08/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
Sonbul and Schmitt Explicit and Implicit Lexical Knowledge
Supporting Information
Additional Supporting Information may be found in the online version of this
article at the publisher’s website:
Appendix S1. Intact Medical Collocations and their Controls (Experiment 1).
Appendix S2. Intact Medical Collocations and their Controls (Experiment 2).
Appendix S3. Mixed-Effects Modeling Procedures in the Study.