(The World of Linguistics 6) Geoffrey Haig, Geoffrey Khan - The Languages and Linguistics of Western Asia - An Areal Perspective-De Gruyter (2018)

The Languages and Linguistics of Western Asia
WOL 6
The World of Linguistics
Editor
Hans Henrich Hock
Volume 6
De Gruyter Mouton
The Languages
and Linguistics
of Western Asia
An Areal Perspective
Edited by
Geoffrey Haig
Geoffrey Khan
De Gruyter Mouton
ISBN 978-3-11-042608-3
e-ISBN (PDF) 978-3-11-042168-2
e-ISBN (EPUB) 978-3-11-042174-3
Library of Congress Cataloging-in-Publication Data

Names: Haig, Geoffrey, editor. | Khan, Geoffrey, editor.
Title: The languages and linguistics of Western Asia : an areal perspective /
edited by Geoffrey Haig, Geoffrey Khan.
Description: Berlin ; Boston : De Gruyter Mouton, 2018. | Series: The world
of linguistics ; volume 6
Identifiers: LCCN 2018012560 | ISBN 9783110426083 (hardcover)
Subjects: LCSH: Middle East--Languages.
Classification: LCC P381.M53 .L36 2018 | DDC 409.5--dc23 LC record available at
https://lccn.loc.gov/2018012560
Bibliographic information published by the Deutsche Nationalbibliothek

The Deutsche Nationalbibliothek lists this publication in the Deutsche Nationalbibliografie; detailed
bibliographic data are available in the Internet at http://dnb.d-nb.de.
© 2019 Walter de Gruyter GmbH & Co. KG, Berlin/Boston

Cover image: G. Haig
Typesetting: Dörlemann Satz GmbH & Co. KG, Lemförde
Printing and binding: CPI books GmbH, Leck
www.degruyter.com
Acknowledgements
We have been very fortunate in having the assistance of various people in the prepa-
ration of this volume. In the early stages Nicholas Peterson (Bamberg) assisted
in copy-editing, later Soner Ö. Barthoma (Cambridge) copy-edited several chap-
ters, while the bulk of the final copy-editing and preparation of the indices was
undertaken by Laurentia Schreiber (Bamberg). Nils Norman Schiborr (Bamberg)
produced the maps. We gratefully acknowledge their valuable contribution to the
project.
Many thanks also to the staff at Mouton de Gruyter for their guidance and
efficiency during the long process of producing this volume: to Birgit Sievert for
organisation and support in the early phases of the project, to Barbara Karlson for
accompanying the editing process through to completion, and to Frauke Schafft
for supervising the final production of the volume.
We would also like to thank all the scholars who participated in the project for
their excellent contributions, and their enduring patience.
Finally, we would like to express our gratitude to the many anonymous speak-
ers of the languages described in this volume, who provided much of the data on
which the contributions are based. Without their interest and enthusiasm for their
own languages, we could not have produced this volume.
Geoffrey Haig and Geoffrey Khan (editors) November 2018
https://doi.org/10.1515/9783110421682-201
Table of contents
1. Introduction · Geoffrey Haig and Geoffrey Khan . . . . . . . . . . . 1
2. Eastern Anatolia and northwestern Iran . . . . . . . . . . . . . . . . . . . 31

2.1. Eastern Anatolia and northwestern Iran: overview · Geoffrey Khan . 31
2.2. The Armenian dialects · Hrach Martirosyan . . . . . . . . . . . . . . . . . . . 46
2.3. Northern Kurdish (Kurmanjî) · Geoffrey Haig . . . . . . . . . . . . . . . . . 106
2.4. The Arabic dialects of eastern Anatolia · Stephan Procházka . . . . . 159
2.5. The Neo-Aramaic dialects of eastern Anatolia and northwestern
Iran · Geoffrey Khan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 190
3. Northern Iraq . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 237

3.1. Northern Iraq: overview · Geoffrey Khan and Geoffrey Haig . . . . . 237
3.2. The Arabic dialects of northern Iraq · Stephan Procházka . . . . . . . . 243
3.3. The Iranian languages of northern Iraq · Geoffrey Haig . . . . . . . . . 267
3.4. The Neo-Aramaic dialects of northern Iraq · Geoffrey Khan . . . . . . 305
3.5. Iraq-Turkic · Christiane Bulut . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 354
4. Western Iran . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 385

4.1. Western Iran: overview · Geoffrey Khan . . . . . . . . . . . . . . . . . . . . . 385
4.2. The Turkic varieties of Iran · Christiane Bulut . . . . . . . . . . . . . . . . . 398
4.3. Bakhtiari · Erik Anonby and Mortaza Taheri-Ardali . . . . . . . . . . . . 445
4.4. The Neo-Aramaic dialects of western Iran · Geoffrey Khan . . . . . . 481
4.5. Hawrāmī of western Iran · Parvin Mahmoudveysi and
Denise Bailey . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 533
4.6. Persian · Ludwig Paul . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 569
4.7. Kumzari · Christina van der Wal Anonby . . . . . . . . . . . . . . . . . . . . . 625
5. The Caspian region and south Azerbaijan: Caspian and

Tatic · Donald L. Stilo . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 659
viii Table of contents
6. Caucasian rim and southern Black Sea coast . . . . . . . . . . . . . . . 825
6.1. Caucasian rim and southern Black Sea coast: overview ·

Geoffrey Haig and Geoffrey Khan . . . . . . . . . . . . . . . . . . . . . . . . . . 825
6.2. Laz · René Lacroix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 830
6.3. Ossetic · David Erschler . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 861
6.4. Romeyka · Laurentia Schreiber . . . . . . . . . . . . . . . . . . . . . . . . . . . . 892
7. Appendix: Basic lexical items in selected languages of

Western Asia · Geoffrey Haig, Geoffrey Khan and
Laurentia Schreiber . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 935
7.1. Background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 935
7.2. Iranian languages . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 936
7.3. Semitic languages . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 941
7.4. Other language families . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 945
7.5. Notes on sources . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 947
Index of language names . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 953
Index of place names . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 961
Index of subjects . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 969

1. Introduction1
Geoffrey Haig and Geoffrey Khan
1. Aims and scope of the volume
This volume contains a series of descriptions of around 20 languages (or language

groups), spoken across a region that includes most of eastern Turkey, western Iran
and northern Iraq (see Fig. 1 for the locations), together with overview articles of
sub-regions, and an appendix with selected lexical items from these languages.
The region that we refer to here loosely as “Western Asia” is not clearly demar-
cated, either politically or topographically, and requires a few words of explana-
tion. Essentially it is linked to an ancient cultural core, namely the northern part of
Mesopotamia, the upper catchment regions of the Tigris and Euphrates rivers. The
book’s coverage goes beyond Mesopotamia in the narrower sense to include the
surrounding mountainous regions of the Zagros in the southeast, and the elevated
regions northwards and eastwards across the Anatolian plateau into the outliers
of the Caucasus. These areas were all involved, at least economically and stra-
tegically, in the succession of empires that arose in and around ancient Mesopo-
tamia, beginning with the Sumerians in the third millennium bce and continuing
down to the Ottoman Empire into the dawn of the twentieth century. The sense of
a common Kulturraum, while difficult to delineate precisely, is reflected by the
growing recognition of linguistic parallels shared among the region’s languages,
some of which we take up below.
We have further divided the region into five sub-areas, each of which is treated
in a section of the volume, with its own overview chapter. The individual language
chapters are quite varied in composition, some focussing on a single language
spoken in a geographically narrow location (e. g. Kumzari, chapter 4.7) while
others cover geographically dispersed and internally diverse groups (e. g. Arme-
nian, chapter 2.2, or Iran-Turkic, chapter 4.2). For this reason, it has not been pos-
sible to impose a single format on the presentations, but we have ensured that lin-
guistic examples are glossed according to standard practice, that all contributions
include a map, and where possible, a glossed text sample of natural language. In
addition, an Appendix contains comparative lists of lexical items from a selection
of the languages, plus some additional languages that were not treated in the main
body of the book. The approximate locations of the languages and language groups
covered in the volume is shown in Fig. 1, whereby the symbols are only intended
1
We are very grateful to Christina van der Wal Anonby, Erik Anonby, René Lacroix and
Ludwig Paul for comments and corrections on an earlier version of this introduction.
We of course bear the responsibility for the remaining shortcomings.
2
Figure 1: Approximate locations of main languages and language groups covered in this volume
Introduction 3
to indicate the approximate centre of the respective language variety. Language

names in brackets indicate important languages of the region, but which do not
have a dedicated chapter in this volume.
The focus of the volume is on the minority languages of the region, for the
most part under-described and endangered, while the respective state languages
are generally outside the purview of the volume. This perspective is motivated
by the recognition that taken together, the minority languages yield a much richer
picture of the region’s long-standing multilingual tapestry than the handful of cur-
rently dominant official languages (Arabic, Persian, Turkish). The latter have all
been described exhaustively elsewhere, and have only achieved their current dom-
inance very recently, hence give at best a distorted picture of the historical pro-
cesses that engendered the region’s linguistic composition (we have nevertheless
included an overview chapter of Persian, chapter 4.6, because of its unique status
as a hegemonial language across much of the region over at least two millennia).
The volume’s scope is also restricted by practical considerations. For example,
the Caucasus is largely excluded because it is treated within a companion volume
in the same series. Zazaki, an Iranian language of Central Anatolia, is not included
because there is already an extensive overview chapter on Zazaki in Windfuhr
(2009). In other cases, gaps have arisen simply due to the unavailability of appro-
priate material or suitably qualified authors at the right time, or through consid-
erations of space, or through various organisational setbacks that we faced over
the six years of compilation. A particularly unfortunate ommission is Domari, the
language of the Doms of the Middle East (see Herin 2012 on Domari of Aleppo,
and Matras 2012 for Domari of Jerusalem). Similarly, Southern Kurdish has not
been treated in any depth (see Fattah 2000, and Belleli 2016). We can only trust
that anyone familiar with the task of compiling a volume of this scope will appre-
ciate that comprehensive coverage is an ideal that is seldom achieved. And while
we are very much aware of the gaps in the current compilation, we are confident
that the present volume provides the first reasonably representative survey of the
linguistic diversity of the region.
We turn now to the major thematic focus of the volume, the areal perspec-
tive. By this we mean that the volumeʼs structure reflects broad areal divisions,
rather than genetic groupings of the languages concerned. In keeping with the
general approach, we have eschewed in-depth historical treatment of the language
families; all the major families and branches have been dealt with extensively in
recent handbooks (see for example Windfuhr 2009 on Iranian, Johanson and Csató
2006 on Turkic, and Weninger et al. 2011 on Semitic). With regard to the ancient
language contacts of the region, we refer to Butts (2015) on Semitic, and Hassel-
bach-Andee (in prep.).2 The individual chapters of the present volume primarily
2
We are grateful to Ilya Yakubovich for drawing our attention to this latter volume in
preparation.
4 Geoffrey Haig and Geoffrey Khan
focus on a synchronic description of the respective language or language group,

but with explicit reference to contact issues. In the overview chapters to each
sub-region, we provide a short synopsis of the most striking contact phenomena.
Interest in areal contact across the region has steadily increased in the last years
(see for example the joint Frankfurt/Cambridge initiative on language contact and
language change in Western Asia,3 and the recent study on relative clauses from an
areal perspective by Gandon 2016). Because the present volume combines exper-
tise from diverse philological traditions, it will hopefully become a valuable one-
stop resource for future research in this rapidly growing field.
2. Western Asia from an areal typological perspective
Within language typology, the relevance of large-scale areal units in shaping the
global distribution of typological features is increasingly emphasized. The region
under consideration here has not, however, generally been recognized to date.
Dryer’s (1992) proposal has been influential, and recognizes the following six
macro-areas (see Hammarström and Donohue 2014 for critical discussion):
• Africa
• Australia-New Guinea
• Eurasia
• North America
• Southeast Asia and Oceania
• South America
On this coarse-grained view, our “Western Asia” would be assigned to “Eurasia”,

which also includes e. g. Mongolian, or Mandarin. However, it is evident that the
Semitic languages represented in Western Asia (e. g. Neo-Aramaic) are the north-
easterly outliers of the Afro-Asiatic language family, and are thus historically
rooted in Africa (see in this respect Güldemann’s notion of “Afrabia”, Güldemann,
2018.). More recently, Bickel et al. (2017) propose a finer-grained division, which
recognizes a macro-area “Greater Mesopotamia”. This more closely corresponds
to our Western Asia, though in Bickel et al. (2017), it also includes the entire
Arabian peninsula. But ignoring for a moment the issue of the precise boundaries,4
there are in fact reasons to consider Western Asia a special kind of linguistic area,
which has hitherto not been afforded much attention.
3
See e. g. http://titus.fkidg1.uni-frankfurt.de/click/fourthAct.html.
4
It is worth pointing out that many of the regions traditionally considered to be linguistic
areas (or Sprachbünde) lack clearly identifiable borders (e. g. “the Balkans”), and are
defined in different ways by different scholars.
Introduction 5
The linguistic diversity of the region is, within the broader Eurasian context,
relatively high: four distinct language families5 are represented (Turkic, Semitic,
Indo-European, Kartvelian), and within Indo-European, four branches (Arme-
nian, Iranian, Indic, and Hellenic). But with the exception of Armenian, none of
these groupings are indigenous to the region: all the languages represent outliers
of larger groups, and have close relatives outside the immediate region. The high
level of linguistic diversity has thus arisen secondarily, as it were, through the
regionʼs location at the intersection of several large genetic groupings. This is in
stark contrast to, for example, the linguistic diversity of the Caucasus (Nicholsʼ
1992 canonical example of a “residual zone”), stemming from three indigenous
language families, all of which lack (proven) relatives outside the region itself. In
a number of publications Stilo (e. g. 2005, 2012) looks at the region centred on the
catchment of the Araxes river, considering it to be an “intersection zone”. Among
the most striking phenomena is the areal distribution of adpositional typologies,
which shift from Turkic-type postpositional, across Iranian mixed typologies, to
Semitic prepositional. Haig (2001, 2017) considers eastern Anatolia as a “transi-
tion zone”, with gradual areal shifts across a number of morphosyntactic features,
but also its own set of defining features. Among them is the common Anatolian
OVX word order pattern (where “X” refers to various kinds of non-direct object
arguments, in particular goals, addressees, and recipients, see §3.4.3 below). Like
the mixed adpositional typologies of Iranian, the OVX word order can be con-
sidered a compromise strategy between OV and VO word orders. These kinds of
typologically unusual patterns illustrate that a transition zone is not merely the
sum of the participant language families, but may develop its own profile with
typologically rare structures – rare simply because transition zones themselves
emerge only under very specific, hence globally very rarely occurring, condi-
tions. If large-scale macro-areas are relevant to understanding the distribution of
structural features, then we suggest that the transitional zones that lie between
them also merit close scrutiny, precisely because they involve the comparatively
rare areal contiguity of different language types, yielding configurations that are
unlikely to arise elsewhere.
3. Areal patterns in phonology and morphosyntax across Western Asia
This section discusses a selection of features from phonology and morphosyntax

that we consider are evidence for the role of language contact in shaping the struc-
tures of the languages of the region. In some cases, the phenomena are region-
ally quite restricted, while others span a large cross-section. Though the language
5
We are ignoring some of the more recent arrivals to the region, such as communities of
Chechen speakers in Anatolia.
sample we draw on is by no means comprehensive, it is nevertheless possible to

draw a few more general conclusions regarding the range and constraints of con-
tact-induced change in the languages of the region.
3.1. Language contact: general considerations
Two or more languages spoken in the same region may exhibit structural similari-
ties through (i) mere chance, (ii) inheritance from a common ancestor, (iii) contact
influence, or (iv) a combination of (i)–(iii). We are primarily interested in (iii), but
assigning similarities to contact influence requires caution, and should only be
undertaken after due consideration of other possible causes. First, we need to bear
in mind the histories of related languages spoken outside the region (to the extent
that they are known) in order to formulate a benchmark of comparison against
which the developments in the languages under consideration can be assessed.
As an example of how the facts from related languages can be brought to
bear in assessing contact influence, let us consider the example of North-Eastern
Neo-Aramaic (NENA) and Central Neo-Aramaic (CNA) in the region (Khan, this
volume, chapters 2.5, 3.4, and 4.4). These languages have developed ergative (or
split-intransitive) alignments with their perfective verb forms, matching a broadly
similar alignment profile in the neighbouring varieties of Kurdish. Alignment
systems have often been considered relatively immune to areal influence (Nichols
1992: 181), so in principle, the similarities between Neo-Aramaic and Kurdish
could have occurred independently, and we need solid arguments to make a case
for contact influence. In this case, evidence from the related languages, but outside
the immediate contact region, is extremely relevant. With regard to Semitic, the
Neo-Aramaic development of ergativity is unique within the Semitic language
family as a whole. It is absent in Western Neo-Aramaic in Syria, a close relative of
NENA and CNA outside of the area. This fact greatly strengthens the argument in
favour of assuming contact influence: if it were a solely language-internal devel-
opment, we might have expected traces of it within related languages outside the
geographical contact zone, but this does not seem to be the case. Furthermore,
contact between Kurdish and the Neo-Aramaic speech communities of the region
has been intense and continuous for many centuries, and there is widespread evi-
dence of borrowing in other parts of the grammar (Khan 2007), lending further
credence to the assumption of structual convergence. Considering the histories of
the respective language families is also important in order to establish the direc-
tion of influence. We know that some kind of tense-sensitive ergativity is widely
attested in Iranian languages spoken beyond the contact zone with Neo-Aramaic
(e. g. Pashto, or eastern varieties of Balochi, or Tatic, Haig 2008). Thus the sim-
plest assumption is that ergativity developed in Kurdish through inheritance from
its Iranian ancestors, while its occurrence in Neo-Aramaic is (at least in part)
Introduction 7
due to influence from an Iranian contact language, e. g. precursors of Kurdish, or

Gorani.
A second factor to consider in assessing contact influence is typological proba-
bility. Consider for example the fact that most of the languages of the region have
pre-verbal negation in simple clauses, with the exception of Turkic, and Kumzari
(van der Wal Anonby, this volume, chapter 4.7). In Turkic, negation is via a verbal
suffix, and Turkish is generally suffixal, so post-verbal negation is structurally
pre-figured in the language. We might then be inclined to ignore Turkic, and con-
sider the otherwise prevalent pre-verbal negation to be an areal trait. But in fact,
pre-verbal negation is the most widespread kind of negation across the world’s
languages (Dryer 1988), so the case for areal influence in Western Asia is weak,
and would need to be reinforced through, e. g., evidence of non-random similari-
ties in the forms involved, if it were to be maintained. The presence of post-verbal
negation in Kumzari, on the other hand, obviously requires additional assump-
tions: it does not conform to the typologically more frequent pattern, and is dis-
tinct from Kumzari’s Iranian relatives, which have consistent pre-verbal negation.
Thus contact influence from neighbouring South Arabian languages appears to be
a very plausible causal factor in the emergence of post-verbal placement of nega-
tion markers in Kumzari, as noted in van der Wal Anonby (chapter 4.7, §7).
Several suggestions have been made for the relative ease of “borrowability”
among different categories of morphosyntax (see Matras 2007 on hierarchies of
borrowability, and see below for a proposal in this direction regarding syntax).
While it is often stated that, e. g. derivation is more easily borrowed than inflec-
tion, or clitics are more easily borrowed than affixes, it is useful to identify the
more general principles at work (Haig 2014a). In general, ease of borrowability is
characteristic for items that (i) are perceptually salient (e. g. syllabic, as opposed
to segmental, concatenative as opposed to non-concatenative); (ii) exhibit predict-
able and transparent form-meaning correspondences, with tangible semantics; (iii)
have a greater degree of positional freedom (e. g. are phonologically independent
items, with some versatility in positioning, as opposed to affixes locked into mor-
phological templates).
From these it follows fairly naturally that words are more likely to be borrowed
than clitics, and clitics more likely than affixes. The least borrowable items are
irregular, non-concatenative exponents of an abstract grammatical category, such
as the suppletive forms for the comparative grade of the adjectives good and bad,
or irregular past-tense formations involving Ablaut (give, gave etc.) in English.
Particularly striking evidence for the difficulty of borrowing non-concatenative
(i) and unpredictable (iii) morphology can be seen in the fact that Semitic root-
and-pattern morphology (e. g. broken plurals, or verb stem alternations in Arabic)
is very rarely transferred to native lexical items in contact languages (but see van
der Wal Anonby, this volume, chapter 4.7 on the incorporation of Arabic mor-
phology into Kumzari and Paul, this volume, chapter 4.6, §4.4, for examples of
broken Arabic plurals borrowed from Arabic). At the other end of the spectrum,
a derivational suffix such as Turkic -či (with various vowel-harmonic variants),
meaning ‘occupation associated with the base’ (e. g. saat-či ‘watchmaker’) turns
out to be readily borrowable, and occurs in most of the languages in the region
(e. g. in the Muş dialect of Northern Kurdish mesî-čî ‘fisherman’, from Kurdish
masî ‘fish’, example from Songül Gündoğdu p.c.). The suffix -čî is perceptually
salient, affixal, and regular in form and meaning.
Finally, a useful distinction is drawn by Matras (2007) between matter borrow-
ing, and pattern borrowing. The former involves the borrowing (or copying) of
items together with the phonological substance of the donor language. A clear case
of matter borrowing is found in Turkic varieties spoken in northern Iraq, which
have borrowed a comparative suffix -tär from neighbouring Iranian languages
(Bulut, chapter 4.2, §2.3.1.4). Pattern borrowing involves the borrowing of, e. g.
the structural organization of paradigms, or the relative ordering of morphemes in
a word, or specific principles of form-meaning correspondence, but without bor-
rowing of actual phonological material. An example of pattern borrowing is m-in-
itial reduplication, where a word is reduplicated, but the initial segment replaced
by [m]: Turkish para mara ‘money and stuff’, Laz dadzi madzi ‘thorns and stuff’
(Lacroix, chapter 6.2, §3.4). This is widespread across the region (see e. g. Khan
2016, vol. 2: 99 for NENA, Haig 2001 for other languages of Anatolia). Of course
matter and pattern borrowing are not mutually exclusively. Often, pattern borrow-
ing is accompanied by phonological similarity of the items concerned, as in the
m-segment in the reduplication example just mentioned.
3.2. Phonology
A number of languages of the region have increased their consonantal inventory by
acquiring consonants from other languages by areal diffusion. A particularly clear
case of this is the spread of glottalized consonants, realized as unaspirated stops
or ejectives, across the northern sector of the region, including Armenian dialects,
Northern Kurdish dialects of eastern Anatolia and northwestern Iran, the Neo-Ar-
amaic dialects belonging to the NENA group spoken in this Northern Kurdish
area, and some languages of the Caucasian rim, such as Laz and Ossetic. In Laz
glottalization is a feature inherited from Kartvelian. In Armenian it has considera-
ble historical depth, as shown by the fact that it occurs in Classical Armenian, and
is possibly an inheritance from earlier Indo-European (Martirosyan, chapter 2.2,
§4.1), but it may well have been reinforced by the fact that it is an areal feature.
In Kurdish, Neo-Aramaic and Ossetic it is not an inherited feature and has entered
these languages by areal diffusion. This is clearly demonstrable in the case of
Kurdish and Neo-Aramaic, since dialects of these languages spoken further south
in northern Iraq and western Iran do not have glottalized consonants. The area of
glottalized consonants extends further north into the Caucasus region, where it is
Introduction 9
found in Nakh-Daghestanian languages (e. g. Archi, Lak), Northwestern Cauca-

sian languages (e. g. Kabardian) and South Caucasian languages such as Georgian,
which are not covered by this volume (see Grawunder 2017 on the areal distribu-
tion of phonological features in the Caucasus). The nature of glottalization varies
across the area with regard to its degree of strength. In the languages of the Cauca-
sus it is generally realized as glottal ejection. In languages on the Caucasus rim and
in Armenian, Kurdish and Neo-Aramaic, which are described in this volume, the
glottalization is weaker and is typically realized as the lack of aspiration without
clear ejection. It is noteworthy, however, that dialects of Armenian and Neo-Ar-
amaic that are spoken by migrant communities in Georgia pronounce glottalized
consonants as ejectives (Chirikba 2008: 44–45; Khan 2016, vol. 1: 93–95), which
is a further demonstration of the areal nature of the feature. This also shows that
intensity of contact in the apparent epicentre of the areal feature in the Caucasus
can bring about greater degrees of areal convergence.
The fact that languages and dialects outside of the Caucasus typically realize
the glottalized consonants as unaspirated stops rather than ejectives gives some
insight into how these consonants were diffused. An insightful case study in this
respect is the development of the unaspirated stop phonemes in the Neo-Aramaic
dialects. In Neo-Aramaic dialects that have glottalized consonants fully estab-
lished in their sound inventories, such as the Christian NENA dialect of Urmi,
these consonants occur predominantly in native Aramaic words. In fact it is dif-
ficult to identify any loanword in the Christian Urmi dialect that has a glottalized
consonant in the source language. The glottalized phonemes have rather developed
by internal processes. One such process is the reanalysis of the deaspirated allo-
phones of stops in fricative—stop clusters as unaspirated phonemes, e. g. /xp/ [xp̂ ]
> /xp̂ /.6 Another is the reanalysis of the sequence of an aspirated stop and laryngeal
stop as an unaspirated phoneme, e. g. /t/ + /ʔ/ > /ṱ/. A further process arises from
the fact that in Christian Urmi Neo-Aramaic and neighbouring dialects, the orig-
inal pharyngealization of pharyngealized consonant segments has developed into
a suprasegmental feature that takes a whole word as its domain. In such dialects
an original pharyngealized stop ṭ [ˁṱ] becomes reanalyzed as an unaspirated stop
segment /ṱ/ within the suprasegmental domain of pharyngealization (for details
see Khan 2016, vol. 1: 92–131). The new glottalized phonemes formed by these
internal processes have then spread to other contexts and serve various functional
purposes, such as diminutive sound symbolism, e. g. ṱəlpa ‘eyelash’ (< *təlpa)
(Khan 2016, vol. 1: 180). The basic process, then, is that of phonemicization of
already existing sound patterns within Neo-Aramaic that match or approximate
to those of a contact language. This would have taken place by the perception
of an equivalence between these matching or similar sounds and the glottalized
phonemes of the contact language. Such innovative unaspirated stop phonemes
6
In the transcription unaspirated stops are distinguished by diacritics thus: p̂ , ṱ, k̭.
could then undergo further phonetic change by converging with the phonetic
features of ejectives in languages with such ejective glottalized consonants by a
“perceptual magnet effect”, as Blevins (2017) puts it. As mentioned, dialects of
Northern Kurdish (Kurmanjî) spoken in close proximity to Armenian also possess
unaspirated voiceless stops (in some dialects with ejective characteristics) in their
native lexicon. These elements are not directly inherited from Iranian, but likewise
cannot be easily explained in terms of a spread from loanwords; the number of
clearly identifiable Armenian loans in the everyday lexicon of Kurmanjî is very
small. We see, then, that the presence of perceptually salient segments in a contact
language may act as attractors in guiding the direction of internal changes.
Conversely, languages that have inherited glottalized consonants may undergo
weakening or reduction of this feature, if they are geographically isolated from lan-
guages with it. This is the case in Laz (Kartvelian), where the inherited glottalized
consonants are weakened in comparison to related Georgian (Lacroix, chapter 6.2,
§6.1), presumably due to the isolation of Laz from its Kartvelian relatives, and the
influence of neighbouring Turkish. Nevertheless, Lacroix still posits glottalized
consonants for Laz. Full loss of glottalization is apparently a rare phenomenon
in the Western Asian context. Glottalized consonants appear to be thus (a) fairly
resistant to loss, and (b) prone to spread in contact situations.
Pharyngeal and pharyngealized consonants, which are an inherited feature
of the sound inventories of Semitic languages, have spread by diffusion to some
non-Semitic languages of the region. The process of areal diffusion is not so clear-
cut as that of glottalized consonants. Several non-Semitic languages exhibit phar-
yngeals in loanwords from Arabic. Some Iranian languages, such as Kurdish and
the languages of Kordestan and Kermanshah provinces, have developed innovative
pharyngeals in native lexical items, in particular the unvoiced pharyngeal, which,
for example, is commonly found in these languages in the numeral ‘seven’ (North-
ern Kurdish, Hawrami ħaft, Central Kurdish ħawt). The Semitic inherited pharyn-
geals, unvoiced [ħ] and voiced [ʕ], are found in the Arabic dialects of the region
and in the Central Neo-Aramaic dialects of southeastern Turkey. In the NENA
dialect group situated to the east of the Tigris, however, the inherited pharyngeals
have been mostly lost, especially in many of the dialects in the eastern sector of the
NENA area in the Hakkari mountains of Turkey and northwestern Iran. The pres-
ervation of pharyngeals in Central Neo-Aramaic and their partial preservation in
some NENA dialects is likely to be conditioned by extensive contact with Arabic.
This would, therefore, be a case of language contact preserving sound patterns.
When pharyngeals occur in the aforementioned subgroup of NENA dialects these
are predominantly inherited pharyngeals in the environment of pharyngealized
consonants, e. g. Qaraqosh ṭaʿən [ˤtˤɑːʕən] ‘he bears’, pharyngeals arising from
the debuccalization of pharyngealized consonants, e. g. Jewish Sanandaj ʾaḥra
< *ʾˤaˤtˤrˤaˤ < *ʾaṯrā ‘town’, or innovative pharyngeals resulting from strength-
ening of laryngeals for functional purposes. The latter category includes cases
Introduction 11
of the strengthening of laryngeals to preserve morphological distinctions, e. g.

Qaraqosh 3ms possessive suffix -əḥ < *-əh to prevent the loss of final -h, which
would have rendered the suffix homophonous with the noun plural suffix -ə, e. g.
tor-əḥ (< *tor-əh ox-poss .3 ms ) ‘his ox’ vs. tor-ə (ox-pl ) ‘oxen’. The emergence
of pharyngeals in some Iranian languages possibly also has the functional moti-
vation of strengthening distinctions in paradigms, e. g. Northern Kurdish, Dohuk:
ħaft (< *haft) ‘seven’ to distinguish it clearly from the similar sounding numeral
hašt ‘eight’.7
Glottalized consonants and pharyngeals are cross-linguistically relatively
rare consonants (Maddieson 2013a, 2013b) and their diffusion has resulted in the
enrichment of the consonantal inventories of the languages of the region. A factor
that may have facilitated their “magnet” effect is their salience (Blevins 2017).
This would have conditioned not only their tendency to spread but also their resist-
ance to loss. It is relevant in this respect that the unvoiced pharyngeal [ḥ] dif-
fused more readily than the voiced pharyngeal [ʕ], no doubt because the unvoiced
member of the pair is the more salient due to its higher pitch.
Contact has resulted in loss of consonants of a number of languages of the area.
A clear case of this is the loss of inherited interdental consonants of some Arabic
and Neo-Aramaic dialects. This has brought the phoneme inventories of these dia-
lects closer to those of the non-Semitic contact languages, all of which lack inter-
dentals. The most common process involves merger of the interdentals with other
consonants that have a direct match in the inventories of the contact languages.
These are typically stops or sibilants. In many cases this merger is symmetrical,
e. g. all the interdentals merge with stops (Arabic θ, ð, ð̣ > t, d, ḍ, NENA θ, ð > t,
d) or sibilants (Arabic θ, ð, ð̣ > s, z, ẓ, NENA θ, ð > s, z). Sometimes, however,
there is asymmetry. This is found in a number of NENA dialects. In such cases the
outcome results in the reflex of the unvoiced interdental being weaker than that
of the voiced interdental. In some dialects the unvoiced interdental is preserved
while the voiced one shifts to a stop, e. g. Christian Ankawa xaθa ‘sister’, ʾida (<
*ʾiða) ‘hand’, the unvoiced becomes a sibilant, while the voiced becomes stop,
e. g. Jewish Nerwa xasa (< *xaθa) ‘sister’, ʾida (< *ʾiða) ‘hand’, or the unvoiced
undergoes debuccalization while the voiced becomes a stop, e. g. Jilu +xa ‘sister’
(< *xaθa),8 ʾida (< *ʾiða) ‘hand’. Some unusual outcomes include the shift of the
Arabic interdentals θ, ð, ð̣ to labio-dental fricatives f, v, ṿ in some Arabic dialects
(Procházka, chapter 2.4, §1.1) and the shift in some Jewish Neo-Aramaic dialects
of both the interdentals θ and ð to the lateral l (Khan, chapter 2.5, §5.1, chapter 3.4,
§5.1, chapter 4.4, §5.1.1). The Neo-Aramaic dialects that have this latter outcome
were spoken in areas where in neighbouring Iranian and Turkic languages a /d/
7
Data from Matras, Yaron et al. 2016. The Dialects of Kurdish. Web resource, University
of Manchester. http://kurdish.humanities.manchester.ac.uk/ (accessed 17 March 2018).
8
The superscribed symbol + denotes suprasegmental pharyngealization.
following a vowel or sonorant undergoes lenition, know as “Zagros d”, result-

ing in it being realized as an approximant or as sonorant (Haig, chapter 3.3,
§3.1.1; Mahmoudveysi and Bailey, chapter 4.5, §3.1; Anonby and Taheri-Ardali,
chapter 4.3, §2.1). There is evidence that in the Neo-Aramaic dialects concerned
the two interdentals first shifted to the voiced stop *d before finally becoming a
lateral sonorant /l/. The sonorant /l/ can then be regarded as lenition of the *d.
Such lenition, therefore, is likely to be due to the “perceptual magnet effect” of the
weakened Zagros d, whereby Neo-Aramaic speakers match this perceptually with
the sonorant /l/ in their existing sound inventory. We may conclude that interden-
tals are prone to loss and do not spread by contact. This can be correlated with their
lack of salience (Maddieson 2013b).
Shifts in the realization of consonants without necessary loss to consonan-
tal inventories also result from areal diffusion. A case of this is the palataliza-
tion of the dorsal stops /k/ and /g/, which has occurred in many languages of the
region. This is found, for example, in several Turkic varieties, in particular in
the Turkic spoken in the north of the region in the Azerbaijanian exclave Nakh-
ichevan, in some regions of Georgia, around Tabriz and Urmi in Iran, and in north-
ern Iraq (Bulut, chapter 4.2, §2.1.2.4). Similar palatalization of these consonants
is found in some NENA dialects in Iraq and northwestern Iran (Khan, chapter 3.4,
§5.1, Khan, chapter 3.5, §5.1), in many Iranian languages, including those of
the Caspian region (Stilo, chapter 5, §3.1), various Kurdish dialects of Anatolia,
Iraq and northwestern Iran (chapter 2.1, §1.5, Haig, chapter 3.3, §3.1.1), Ossetic
(Erschler, chapter 6.3, §2.1) and Bakhtiari (Anonby and Taheri-Ardali, chapter 4.3,
§2.1). It is also found in Romeyka (Schreiber, chapter 4.4, §2.1.2). The degrees
of palatalization vary, ranging from stops with palatal offglides to a shift to the
affricates [ʧ] and [ʤ]. In some languages the palatalization is conditioned by adja-
cent high vowels (e. g. in Ossetic and some languages in the Caspian region),
whereas in others it is not conditioned by the environment. Palatalization appears
to have been diffused as an areal feature. This is shown, for example, by the fact
that NENA dialects in Iraq and Iran only exhibit this process in areas where it is
found in the contact languages Turkic and Kurdish (chapter 3.1, §2). The varying
degrees of palatalization can be correlated in some cases with different degrees of
intensity of contact. This is shown, for example, by the case of the NENA dialect
spoken by Christians in the plain of Urmi in northwestern Iran. In varieties of this
dialect spoken in the southern areas of the plain palatalization results in affrication
whereas in the varieties spoken in the north of the plain the dorsal stops are not
fully affricated. This can be correlated with the fact that in the south the NENA
speakers frequently code-switched between NENA and Azeri Turkic, whereas this
was rare in the north. Azeri Turkic exhibits affrication of dorsals. Moreover, the
Mukri Kurdish dialect spoken in the south of the Urmi plain has affrication of
dorsals, but this is not found in the Kurdish spoken in the north of the plain. All
this meant that the speakers of NENA in the south had greater contact with affrica-
Introduction 13
tion of dorsals in the contact languages than speakers in the north (for details see
Khan 2016: 111).
In the vowel systems of the languages of the region a clear case of areal dif-
fusion is the fronting of the high back vowel [uː] to [yː] (transcribed ü below).
Front rounded vowels are generally considered marked in comparison to back
rounded vowels. The only languages of the region to have historically inherited
front rounded vowels are the Turkic languages. Interestingly, among the lan-
guages treated in this volume which have undergone heavy contact influence from
Turkic, none are analysed as having front rounded vowels in their vowel system.
These languages include Romeyka (Schreiber, chapter 6.4), Anatolian dialects of
Arabic (Procházka, chapter 2.4, see also Jastrow 2011), Laz (Lacroix, chapter 6.2),
Northern Kurdish of Anatolia (Haig, chapter 2.3). This is particularly noteworthy
for the Arabic dialects of Siirt, Kozluk and Sason, discussed in Jastrow (2011).
Although the vowel systems (and some parts of morphosyntax) have been exten-
sively restructured under Turkish influence, they apparently do not include front
rounded vowels.
Front rounded vowels are found in the Kurdish dialects in southeastern Anato-
lia (Şemzînan) and northwestern Iran, and in the Bahdini Kurdish dialect of north-
ern Iraq. This results in a pull-chain effect whereby [oː] is raised to [uː] to fill the
gap left by the fronting of original [uː] (Haig, chapter 3.3, §4.1.2). The same pro-
cesses are found in NENA dialects that were in contact with these specific Kurdish
dialects, e. g. +Mawana: xabüša < *ḥabbūšā ‘apple’, ruqe < *rōqē ‘spittle’. The
emergence of this areal feature appears to have come about through processes
of fronting rather than under the influence of Turkic. This can result in partial
fronting outcomes. In the Christian Urmi dialect of NENA, for example, long ū is
not fully fronted but shifts to a diphthong with a palatal offglide: xabuyša ‘apple’
(Khan, chapter 2.5, §5.5.2). In this latter case the perceptual magnet of the Kurdish
front rounded vowel has resulted in only a partial convergence. In the Christian
NENA dialect of Salamas the offglide is pushed up rather than forward, and is
realized as a velar fricative x, e. g. xibuxša < *ḥabbūša ‘apple’ (Khan, chapter 2.5,
§5.5.2). This may be a strategy to create a sound pattern that is more saliently
distinct from u than uy, suggesting that it was a later internal development in this
Neo-Aramaic after a diphthong uy had arisen by contact with the Kurdish fronted
rounded vowel. Fronting may be conditioned by assimilation processes at mor-
pheme boundaries, as in Gorani dialects of western Iran: [ø] occurs in some verb
forms, e. g. [møinɨm] ([<mæ-win-ɨm < *mæ-bin-ɨm], ‘I see’, Mahmoudveysi et al.
2012: 11). An exceptional case of the emergence of front rounded vowels by diffu-
sion from Turkish is Asia Minor Greek of Cappadoccia (not treated in this volume,
see Dawkins 1916), which is frequently cited as a language exhibiting exception-
ally high levels of influence from Turkish. Turkic languages that have been heavily
influenced by Iranian languages, on the other hand, have often lost at least some of
their front rounded vowels. The most extensive loss is found in Turkic varieties in
Iran and the southernmost varieties of Iraq Turkic, Mandali and Khanaqin. Bulut
(chapter 3.5, §2.1.2.3, chapter 4.2, §2.1.2.1) gives examples of the shift of the front
rounded vowels /ü/ and /ö/ to their unrounded counterparts /i/ and /e/ (or [ie]), such
as /sit/ < süt ‘milk’, and [Ieːz] < öz ‘self, own’, or [dek-] < dök- ‘to pour’.
The backing of the vowel a is a feature that has diffused across several lan-
guages in the Anatolian region. In Central Neo-Aramaic spoken in the area of Ṭūr
ʿAbdīn all historical long a vowels have the quality of o, e. g. Ṭuroyo: ḥmoro ‘ass’
(< *ḥmārā). This is a feature also of modern Western Aramaic dialects spoken in
Syria (Arnold 2011) and can be traced to a considerable time depth in Aramaic
in the Levant and other ancient Semitic languages of this region, such as Phoe-
nician and Hebrew. It is likely to have diffused to Central Neo-Aramaic dialects
of Turkey from the Levant at an early period. NENA dialects that were spoken in
Turkey adjacent to Central Neo-Aramaic exhibit some degree of convergence with
this vowel typology. In some NENA dialects of the Bohtan area long a in stressed
syllables shifts to o, e. g. Ruma xmóra ‘ass’ (< *ḥmārā). This may be explained as
the perceptual coupling of long ā in NENA with long ō in Central Neo-Aramaic in
salient stressed syllables but not in non-salient syllables. In other NENA dialects
of the Bohtan cluster, such as Hertevin, a long a is realized as a back [ɑː], i. e.
with a lesser degree of convergence with ō. In Armenian dialects an a shifts to ɔ
in stressed syllables in the area of Cilicia and even to the high vowel u in Svedia
(Martirosyan, chapter 2.2, §6.1). In Kurdish dialects of the area of Mardin a low
central unrounded [aː] of Standard Kurdish is retracted to [ɑː] and, in dialects
further west, has the quality of [ɔː] (Haig, chapter 2.3, §3.1.2).
Although the diffusion of the backing and rounding of ā is not a general feature
of the sound systems of NENA dialects, it can be identified in the paradigm of
the possessive suffixes in all dialects, where it has been exploited as a strategy of
distinguishing otherwise homophonous suffixes. The 2fs suffix is -ax. The histor-
ical form of the 2ms suffix is *-āx and by the normal process of historical pho-
nology the reflex of this in the modern NENA dialects should have been -ax, i. e.
a homophone of the 2fs suffix. In order to resolve this homophony the long *ā of
the historical 2ms form *-āx shifts to /o/, which results in the maintenance of the
distinction between the 2ms -ox and 2fs -ax in the paradigm. This shift of *ā > /o/
in the 2ms suffix -ox, which, as remarked, is not a general feature of all NENA
dialects, can be identified as a convergence with the phonology of neighbouring
languages that is motivated by the needs of the morphological system.9 In such
cases the areal feature is activated beyond the area of general diffusion to enrich
the possibilities of making distinctions between linguistic forms. Another example
of this phenomenon can possibly be identified in the limited diffusion of pharyn-
geals in Kurdish, where its sporadic use in native words in some cases appears
9
Cf. the work of Malkiel (1968, 1976) on the morphological motivations for “irregular”
sound changes in Romance.
Introduction 15
to have a similar function of making clear distinctions in a paradigm, e. g. ḥaft

‘seven’, hašt ‘eight’ (see above).
The provisional conclusion we draw from the Western Asian findings is that
in contact situations marked phonological segments differ in their ease of borrow-
ability, and in the degree to which they are retained in languages that have them
originally. It seems that glottalized consonants can quite readily cross language
boundaries and be incorporated into the native lexicon of the recipient language.
Furthermore, these segments seem to be retained in the donor languages. Inter-
dental consonants, on the other hand, are not borrowed, and are frequently lost in
languages that originally had them. Vowel shifts such as the fronting of rounded
vowels and the backing and rounding of ā diffuse in different degrees. Salience of
sounds is a factor that is conducive to diffusion. Finally, language-internal factors,
such as the need to make paradigmatic distinctions, may also bring about a conver-
gence in specific morphological or lexical items.
3.3. Morphosyntax
3.3.1. The marking of definiteness

A case of areal diffusion of morphology is the spread of the Iranian definite
marking nominal suffix -aka, native to Central and Southern Kurdish and Gorani,
to unrelated languages of the region. This exhibits various degrees of integration
in the recipient languages. In the Turkic dialects of the region (Bulut, chapter 3.5,
§2.4.4.1, chapter 4.2, §2.4.2.2) it is added, as in Kurdish and Hawrami, directly
after the stem, before case and plural suffixes, e. g. šär-äkä-sı-nı (poem-def -
poss .3 sg - acc ) ‘this certain/aforementioned poem of his’ (Sonqorî). Compare
for example Hawrami, where it is followed by number and gender inflection
(Mahmoudveysi and Bailey, chapter 4.5, §4.1.1), e. g. warg-aka (wolf-def . m ) ‘the
wolf’, ađā-kē (mother-def .f ) ‘the mother’, palawar-akē (bird-def .pl ) ‘the birds.’
In NENA dialects, however, it has the fixed form –ăke, which corresponds to the
oblique inflection of the Kurdish particle (-akay) rather than the nominative form
(–aka), e. g. Jewish Sanandaj bela ‘house’, belăke ‘the house’; Christian Sanandaj
besa ‘house’, besăke ‘the house’ (Khan, chapter 3.4, §6.1, chapter 4.4, §6.3.1).
Moreover in NENA it is attached to the right periphery of the noun after plural
suffixes, e. g. Jewish Sulemaniyya barux-awal-ăke (friend-pl - def ) ‘the friends’,
contrast Kurdish dost-ak-ān (friend-def - pl ) ‘the friends’ (Khan 2007: 201–202).
The stress patterns in the NENA dialects reflect the fact that this definite particle
has the prosodic status of an affix rather than a clitic.
Another language that has developed a novel marker of definiteness is the
Iranian language Kumzari (van der Wal Anonby, chapter 4.7), spoken in the
extreme south of the region. The marker is a suffix with the form -ō (qiṣr ‘palace’,
qiṣr-ō ‘the palace’, van der Wal Anonby, chapter 4.7, §4). The origin of the marker
itself is obscure, but it seems likely that the development of systematic marking
of definiteness, otherwise rare in Iranian, was influenced by contact with Arabic,
which generally employs definiteness marking. The languages to the north of the
region lack definiteness markers (Ossetic, Iranian languages of the Caspian, Laz,
NENA dialects in the area of Northern Kurdish). Romeyka, Armenian and Central
Neo-Aramaic, however, have inherited morphological markers of definiteness. A
‘definiteness isogloss’ across the region as a whole thus cannot be drawn, but a
general tendency is that the languages to the southeast of the region are more likely
to mark definiteness than those northward of Lake Van.
3.3.2. Commonalities in pronominal and case morphology

In various Anatolian Arabic dialects, plural pronouns and pronominal inflections
of common gender contain a n element. In such forms one would expect m, which
is a feature of the masculine plural pronouns in other Arabic dialects and Old
Arabic, e. g. Mardin hənne (3 cpl independent pronoun), -hən (3 cpl pronominal
suffix), ʾəntən (2 cpl independent pronoun), -kən (2 cpl pronominal suffix). It
is likely that these were influenced by the form of the corresponding pronouns
in the Neo-Aramaic dialects of the area, which contain n, e. g. Ṭuroyo (Mardin
area) hənne (3 cpl independent pronoun), NENA Hertevin (Bohtan area) ʾaḥniton
(2 cpl independent pronoun), -eḥon (2 cpl pronominal suffix) (Khan, chapter 2.5,
§6.1).
Some NENA dialects spoken in the region of Northern Kurdish have developed
a series of demonstrative pronouns from inherited morphology that resembles very
closely the shape of corresponding demonstratives in Kurdish, e. g. NENA Barwar
(Khan chapter 3.4, §6.1) ʾawwa ‘this’, ʾaw ‘that’ (anaphoric), ʾawaha ‘that over
there’, compared with Kurdish of Amedia (MacKenzie 1961: 82, 174): awa ‘this’,
aw ‘that (anaphoric), awēhē ‘that over there’. In some NENA dialects in contact
with Turkic the masculine singular far deixis demonstrative pronoun has devel-
oped the form ʾo ‘that’ by a process of contraction from *ʾaw and, moreover, it has
supplanted the feminine singular form, with the result that ʾo is now of common
gender. This resembles the form and distribution of its demonstrative counterpart o
in Turkic, which, as in general in Turkic, is genderless (Bulut, chapter 4.2, §2.3.2).
The configuration of demonstrative systems and their syntax converge in
various areas of the region. Several languages in the region of Anatolia and north-
ern Iraq, for example, have a three-way configuration of the demonstrative system
including (i) near deixis, (ii) anaphoric and (iii) far deixis. This is found in North-
ern Kurdish and neighbouring NENA dialects (examples from Kurdish of Amedia
and NENA of Barwar are given above). This three-way distinction of demonstra-
tives is found in Eastern Armenian (Dum-Tragut 2009: 129–130). In other areas of
the region a two-way demonstrative system predominates consisting of near deixis
and far deixis forms.
Introduction 17
A case of the convergence of the syntactic patterns of demonstratives is as

follows. In the Northern Kurdish dialects demonstrative constructions typically
consist of a demonstrative pronoun before a noun and a postposed deictic clitic,
e. g. ew-defter=e ‘that book’, ev-defter=e ‘this book’ (Haig, chapter 2.3, §2.2.3).
This pattern has been replicated by Central Neo-Aramaic, in which demonstra-
tive constructions consist of a preposed definite article (originally an anaphoric
pronoun) and a postposed deictic suffix, e. g. ʾu-malk-ano ‘this king’ (Khan,
chapter 2.5, §6.1).
The patterns of genitive constructions converge across various languages
(Khan, chapter 2.1, §2.2.1). In genitive constructions in noun phrases in the NENA
dialects, for example, the genitive particle is typically suffixed to the head noun,
whereas historically it was attached to the front of the dependent noun, e. g.
(1) Christian Urmi
bet-ət malca
house-gen king
‘the house of the king’
This is likely to be a convergence with the pattern of the Northern Kurdish attrib-
ute marking ezafe, which is suffixed to the head noun, e. g.
(2) a. xani-yê mirov-î
house-ez . m man-obl . m
‘the man’s house’
b. xani-yê wî mirov-î
house-ez . m that.obl . m man-obl . m
‘that man’s house’
In NENA there is even a replication of the oblique case morphology of Kurdish. As
seen in (2) dependent nouns, pronouns, and demonstratives in Northern Kurdish
have oblique case morphology. In Neo-Aramaic the oblique morphology of the
demonstratives has been replicated by the development of an innovative oblique
pronoun through the bonding of the genitive particle with the stem of the demon-
strative resulting in an unitary morphological form, e. g. ms. do < *d- (gen ) + o
( dem ). This is used together with a genitive particle suffixed to the head noun, thus
replicating the Kurdish pattern head noun-ezafe + oblique pronoun:10
(3) C. Urmi
bet-ət do malca
house-gen that.obl king
‘the house of that king’
10
For a detailed treatment of this process see Gutman (2018).
3.3.3. Clitic pronouns indexing verbal objects

One of the most salient features of Iranian languages of western Iran is the high
frequency of clitic pronouns, used in several different functions including adnom-
inal possession, prepositional complement, and various kinds of objects (see e. g.
Haig, chapter 3.2 for Central Kurdish, Mahmoudveysi and Bailey, chapter 4.5,
for Hawrami, Paul, chapter 4.6, §4.9.1 for Persian). When indexing objects, they
commonly cliticize to the finite verb. A number of non-Iranian languages have
converged with this pattern of object marking. Some incipient cases of this can
be identified in Turkic varieties in Iran, in which the use of possessive suffixes
on nouns has been extended to the expression of benefactive and dative pronom-
inal arguments on predicative constructions, e. g. lazım-mız (necessary-poss .1 pl
(is)) ‘we need’, ertebâṭ var-ı (relation existent- poss .3 sg ), literally ‘his contact
(is) existent’, that is: ‘he has contact’ under the influence of Iranian languages
(Bulut, chapter 4.2, §2.4.2.3). In the Neo-Aramaic dialects of western Iran there
has been an extension of the use of possessive suffixes to express pronominal
direct objects in various parts of verbal paradigms (Khan, chapter 4.4, §7.1, §7.2).
In some Armenian dialects in contact with West Iranian languages, possessive suf-
fixes are likewise used on verbs, e. g. Urmi dialect: mɛ ci pṙnɛnk‘-t (one horse let’s.
catch-2 sg ) ‘let’s catch a horse for you’ (Martiryosyan, chapter 2.2, §6.6). Simi-
larly, in Iraq Turkic and Iran Turkic, the actual Iranian pronominal clitics (matter
borrowing) are used to index benefactives, and direct objects on the Turkish verb,
as in yäyipti=šan (eat.pf .3 sg =3 pl ) ‘he ate them’ (Bulut, chapter 3.5, §2.4.4.2).
3.4. Constituent order
3.4.1. Position of copular elements

For clauses in the present indicative with non-verbal predicates, all languages of
the region have an overt, clause-final, obligatory copula element (often clitic in
nature), either as one variant, or as the sole option. The general pattern for such
clauses is schematically provided in (4a) and (4b):
(4) a. house large=is ‘the house is large’
b. this my.son=is ‘this is my son’
For the Iranian languages, an obligatory clause-final copular is an inherited feature
which characterizes most (perhaps all) of contemporary West Iranian, including
Iranian languages outside of the region under consideration. Example (5) from
Gilaki (Caspian region, see Stilo, chapter 5) is illustrative of contemporary Iranian
languages, and is basically matched by all the Iranian languages of the region (see
chapters 2.3, 3.3, 4.3, 4.5, 4.6, 4.7 and 5 for further illustration of Iranian copular
constructions).
Introduction 19
(5) Gilaki, Lāhijāni (Stilo, chapter 5, ex. 01)

i ti pəsér=ə
this your son=cop .3 sg
‘Is this your son?’
Examples from other language groups are as follows:
(6) Armenian (Hamšen dialect, Martirosyan, chapter 2.2, ex. 8)
im ɔnun-s Ašot ä
I.gen name-my PN(masc ) cop .3 sg
‘My name is Ašot.’
(7) Arabic of northern Iraq, dialects east of the Tigris (Procházka,
chapter 3.2, ex. 1)
báyt-a-hu
house-3 sg . f - cop .3 sg . m
‘It is her house.’
(8) Turkic of Iran (Khalaj of Bayâdestân, Bulut, chapter 4.2, ex. 10)
her biri älli metir=dir
every one.poss .3 sg 50 meter= cop .3 sg
‘[…] each one of them is fifty meters (long)’
(9) NENA of Iran, Jewish dialect of Sanandaj (Khan, chapter 4.4, ex. 13)
tat-óx tajə̀r=ye?| mam-ì tajə́r=ye| lá
father-your merchant-cop .3 ms uncle-my merchant-cop .3 ms neg
tat-ì.|
father-my
‘Is your father a merchant?’ ‘My uncle is a merchant, not my father.’
(10) Neo-Mandaic of Khorramshahr, Iran (Khan, chapter 4.4, ex. 18)
man=ye?
who=cop .3 s
‘Who is he?’
(11) Laz (Lacroix, chapter 6.2, ex. 45)
uʃkuri tʃk’om-eri t’u
apple eat-part be.impft .i 3sg
‘The apples were eaten.’ (state)
Note that Laz has an inflected copular verb. Positionally, however, it matches the
clitic copular elements of the other languages considered here.
(12) Romeyka (Schreiber, chapter 6.4, ex. 8)

Ato to kitap temo en.
3sg . n def . art . n book 1sg . poss be.3sg
‘This book is mine.’
The clause-final copula thus appears to be a good candidate for a syntactic trait
that has diffused across the entire region. As mentioned, for the Iranian languages,
a clause-final copula going back to the Old Iranian defective verb ‘be’ is wide-
spread across the family, and its presence in all Iranian languages of the region
is thus a matter of historical inheritance. For the other languages, the obligatory
clause-final copula has emerged through quite varied means. Across Turkic gener-
ally, a zero-copula is widespread for the third person present indicative, but an overt
copula is available as a pragmatically marked option (e. g. in Standard Turkish dIr,
going back to a verb dur- ‘stand’). In the varieties of Turkic investigated here, this
marked option has become regularized, so that most present indicative third person
copular clauses have an overt copula. Thus for Turkic, the (near obligatory) copula
has arisen through relaxing the pragmatic conditions on the old overt copula, so
that its frequency of occurrence increases. As Bulut (chapter 4.2, §2.3.3.1) notes,
the frequent “usage of the copula of the 3rd person may point to the influence of
Iranian languages, where an overt marking is obligatory.” For Semitic languages,
the presence of an overt, clause-final copula has required quite extensive restructur-
ing. The morpheme generally identified as the copula in these languages is mostly
of pronominal origin, e. g. Anatolian varieties of Arabic (Procházka, chapter 2.4,
§2.4.1) and Neo-Aramaic (Khan, chapters 3.4, and 4.4 for NENA, and 2.4 for
Central Neo-Aramaic and NENA). Among other things, Khan discusses different
degrees of grammaticalization of the innovated copula in NENA. In dialects such as
Christian Barwar of northern Iraq, the default position of the copula is clause final,
but it may occur on a clause-internal constituent that is in “narrow focus” (Khan,
chapter 3.4, §8.1). In other dialects, however, such as Jewish Sanandaj of Iran, the
copula element is positionally fixed in clause-final position, and is thus impervious
to pragmatic considerations (Khan, chapter 4.4, §8.1.1). The fixed clause-final posi-
tion matches that of the main contact languages (different varieties of Kurdish), and
it is noteworthy that the NENA dialects with fixed clause-final copulas are generally
those that have undergone the heaviest contact influence. For Romeyka (Schreiber,
chapter 6.4), an overt obligatory copula is part of the Indo-European heritage, but its
clause-final position can be presumably attributed to contact influence from Turkish.
In sum, the obligatory clause-final copula in the present indicative is a common
template across the region. The different languages have converged on this pattern
via varying processes, ranging from major restructuring (re-analysis of pronouns)
in the case of Semitic, or relaxation of pragmatic conditions on existing structures
(Turkic). Although the processes themselves differ, they nevertheless converge
on a common outcome. In this case, the target model has been provided by the
Introduction 21
Iranian languages of the region. We may surmise that copular constructions are
particularly prone to contact influence due to their omnipresence in everyday com-
munication (e. g. in utterances such as ‘what’s this?’, or ‘that’s mine’, ‘he’s at
home’ etc.), and are presumably among the earliest utterance types to be acquired
in multi-lingual settings.
3.4.2. The relative ordering of direct object and verb

The relative order of direct object and verb remains one of the most intensely
researched features in language typology, and is of correspondingly high interest
in areal linguistics. The languages of the region exhibit historically both options:
Semitic is historically VO, while Turkic and Iranian are OV across all histori-
cally attested stages. For the remaining languages, the issue is more controversial:
Greek and Armenian might be considered VO for the earliest records, while old
Kartvelian has been characterized as “free SOV/SVOˮ (Testelec 1998: 236).
In general, the historically attested (or reconstructed) order remains stable in
the modern languages of the region. Thus change is the exception, rather than the
rule. Contemporary Armenian (Martirosyan, chapter 2.2) and Romeyka (Schrei-
ber, chapter 6.4) are difficult to classify, evidently permitting both orders, and thus
seem to continue the somewhat contentious nature of word order in their historical
ancestors; clarifying the issues here would raise complications beyond the scope of
this introduction. The Kartvelian language Laz (Lacroix, chapter 6.2) has shifted
from the apparently free word order of early Kartvelian noted above to a consist-
ent OV order, presumably under Turkic influence. But the most radical examples
of change documented in this volume are found in NENA dialects, in particular
the trans-Zab Jewish dialects of northern Iraq and northwestern and western Iran.
Here the inherited Semitic VO order has completely switched to OV. An example
from the Jewish Sanandaj dialect of NENA (Khan, chapter 4.4, ex. 44) is the
following:
(13) hămər-Ø ke ʾay-brona həl-day brata gb-e.
say.irr - d .3 ms comp that-boy obj - dem . obl girl love.prs - d .3 ms
‘[in order that] he say that the boy loves the girl’
In Neo-Mandaic, both VO and OV are possible, with the choice apparently
dependent on the definiteness and specificity of the object. The contrast is shown
in (14), with an indefinite specific object pre-verbally, and (15) with a definite
object post-verbally (both from Neo-Mandaic of Khorramshahr):
(14) jisr-i tum əḇad-yon
bridge-indf again do.pst -3pl
‘They built another bridge.’ (Khan, chapter 4.4, ex. 28, citing Häberl
2009)
(15) čāre aḇd-etton … qam tā jisər

remedy do.ipfv -2 pl to that bridge
‘You will fix that bridge.’ (Khan, chapter 4.4, ex. 31, citing Häberl 2009)
A contact-induced change from VO to OV is thus clearly documented in NENA,
and partially for Neo-Mandaic. A change in the opposite direction, however, is not
found in the material covered in this volume. Even Kumzari, an (arguably) Iranian
language heavily influenced by neighbouring varieties of Arabic, has largely
retained the Iranian OV word order. However, OV order only obtains with lexical
objects, while pronominal objects are generally post-verbal, evidently replicating
the syntax of Southern Arabian languages. Thus we find that pronominal syntax is
apparently more susceptible to contact-induced change than the syntax of lexical
noun phrases. Typical examples illustrating pre-verbal lexical objects are given in
(16) and (17):11
(16) Kumzari (van der Wal Anonby, chapter 4.7, ex. 14)
dar-ō twākš-um na.
door-the open.impf -1 s neg
‘I will not open the door.’
ṣirx dō-um ba yē.
gold give.impf -1 s to 3s
I will give her gold.’
In sum, the relative ordering of direct object and verb is a comparatively stable
syntactic feature. The only changes attested in the region involve a shift towards
OV, rather than in the other direction. This is a particularly interesting finding in
view of the widespread opinion that changes from OV to VO (as in Germanic) are
in some sense the more natural and frequent kind of word-order change than the
11
The position of pronominal objects in Kumzari is noteworthy because it violates Green-
berg’s Universal number 25, according to which if pronominal objects follow the verb,
then nominal objects likewise do. A possible account might run as follows: the pronom-
inal objects of Kumzari basically appear in the position of the clitic object pronouns
of many West Iranian languages, including Persian, namely immediately following the
verb, or the light verb complement if the verb is a complex predicate, cf. examples
(4) and (5) in van der Wal Anonby (chapter 4.7). Furthermore, the object pronouns of
Kumzari show obvious phonological similarities to the clitic pronouns of other West
Iranian languages. What appears to have happened is that the erstwhile object clitic
pronouns have developed into free pronouns, but have retained their position to the
right of their former hosts. Such a development (clitic pronoun > free pronoun) is obvi-
ously unusual, but given that Greenberg’s Universal 25 is a fairly robust generalization,
violations presumably require exceptional circumstances such as very heavy contact
influence.
Introduction 23
opposite direction (see Gell-Mann and Ruhlen 2011 for a recent statement to this
effect).
3.4.3. OVG (direct object – verb – goal) word order in the region
While word-order typology has focussed on the position of direct objects relative
to the verb, the position of other verbal arguments relative to the verb has received
much less attention (see Hawkins 2008 for discussion). The implicit assumption
is that generally, all objects will occur on the same side of the verb, thus in VO
languages we can expect indirect objects etc. to be post-verbal, and in OV lan-
guages we expect them to be pre-verbal. Languages that violate this expectation
are quite rare, and interestingly, generally appear to be OV languages (Hawkins
2008). A relatively well-known example is Mande (generally classifed as Niger-
Congo, West Africa), where adpositional arguments follow, while direct objects
precede the verb (Nikitina 2011).
In the Western Asian context, a similar phenomenon occurs in the majority of
OV languages: constituents expressing goals of verbs of movement, or of caused
movement (‘put’, ‘place’ etc.) overwhelmingly follow the verb, yielding a char-
acteristic OVG (G=Goal) order across much of the region (Haig 2014b, 2017, to
appear). The generalization that can be drawn is the following:
(18) Phrases expressing goals of verbs of motion and caused motion are
post-verbal, irrespective of the position of the direct object in the
language concerned.
For the VO languages of the region, e. g. the Arabic dialects west of the Tigris
(Procházka, chapter 2.4), this yields an unremarkable and harmonic word order, in
which all manner of verbal complements follow the verb. The following example
from Mosul Arabic illustrates the post-verbal position of a goal (‘to.house’), and
of an indirect object (‘some apples’):
(19) qabǝl-mā y-ġōḥ ʕa-l-bēt ə
štaġa təffāḥ
before-prtcl 3 sg . m -go.ipfv to-def -house buy.pfv .3 sg . m apples.coll
‘Before he went home he bought some apples.’ (Procházka,
chapter 2.4, ex. 29)
In OV languages, however, (18) leads to a disharmonic order, with direct objects
preceding, but goals following the verb. This is illustrated in (20), from the Jewish
Sanandaj dialect of NENA. The direct object ‘one cow’ of ‘buy’ precedes the verb,
while the goal of ‘bring’ follows the verb:
(20) […] xa-dána tórta šăqə̀l-Ø-wa-la.|

[…] one-single cow buy.prs .- d .3 ms - pst - l .3 fs
k-m-e-wa-l-ó ga-béla nòš-ef.|
ind -bring-d .3 ms - pst - l .3 fs - pvp in-house self-poss .3 ms
‘(each family went and) bought a cow. They would bring it to their
home.’ (Khan, chapter 4.4, taken from ex. 58)
The OVG word order illustrated in (20) is found in all the varieties of Neo-Aramaic
that have shifted to OV, and in all the Iranian languages of the region. Example
(21) shows post-verbal position of a recipient, coupled with preverbal order of the
direct object, in Kumzari:
mī’ī dōʾa ba diryīʾīn-an.
fish give.3s . impf to fisher-pl
‘He gives fish to the fishers.’
Turkic varieties of the region likewise show a strong tendency to place goals after
the verb, thus disturbing the otherwise consistently verb-final syntax (see Bulut,
chapter 3.5, §2.4.5.1 for Turkic in northern Iraq, and chapter 4.2, §7.2.4 for Turkic
in Iran). The data from Armenian do not permit a firm conclusion on the order of
goals. Laz and Ossetic, on the other hand, both seem to be outside of the region
where (18) holds. Example (22) from Laz shows that the goal of the verb of caused
motion is preverbal, while (23) shows preverbal goals of motion verbs in Ossetic:
(22) Laz (Lacroix, chapter 6.2, ex. 32)
Xalili-k uʃkui dʒebi-s dol-i-bɣ-am-s
Halil-erg apple pocket-dat pv - mid -tip-th - i 3 sg
‘Halil tips the apples in his pocket.’
(23) Ossetic (Erschler, chapter 6.3, ex. 27b)
dɐrʁɐvš-mɐ sɐw-ɐg išči gošt-mɐ akumulʲator
Darghavs-all go-nmz someone Gosht-all accumulator
a-laš-zɐn
prv -carry-fut .3 sg
‘Someone who is going to Darghavs will take the accumulator to Gosht.’
The areal epicentre for OVG word order appears to be northern Iraq and neigh-
bouring regions of western Iran. In Behdinī Kurdish of northern Iraq, post-verbal
positioning of goals and recipients is a grammatical rule (Haig, to appear), while
in other languages, for example colloquial Persian, or Iran Turkic, it is a statis-
tically preferred option (see Frommer 1981 and Haig 2017 on spoken Persian).
Languages also differ in the type of argument that is treated as ‘goal’. Thus in
Behdinī dialects of Northern Kurdish goals, recipients, addressees of ‘tell’ (gotin),
and final states of change-of-state predicates (‘become, turn into’ etc.) are placed
Introduction 25
after the verb (Haig, to appear), but dialects of Northern Kurdish further north put
addressees of ‘tell’ in front of the verb (cf. Haig, chapter 2.3, Fig. 5). Stilo (2010)
notes for a dialect of Vafsi (Tati, see also Stilo, this volume, chapter 5) that recip-
ients of ‘give’ are overwhelmingly post-verbal, while addressees of ‘say, tell’ are
overwhelmingly pre-verbal.
Although we lack detailed syntactic analysis for many of the languages in this
volume, it is nevertheless possible to formulate some provisional conclusions with
regard to post-verbal constituents. Firstly, they are predominant in the region of
northern Iraq, western Iran, and southeastern Anatolia. The extent and frequency
of post-verbal constituents fades out as one progresses northward and eastward.
In Ossetic and Laz, the phenomenon is only marginal. In spoken Persian they are
very common, but not grammatically obligatory (Haig 2017), and further eastward
(e. g. the easternmost dialects of Balochi, or in East Iranian languages, for example
the Wakhi texts in Obrtelová 2017), they are scarce. Second, there appears to be a
hierarchy of post-posability, approximately as follows:
(24) H ierarchy of post - posability in OV languages of W estern A sia
( those to the left are most frequently post - posed )
goals of (caused) motion > recipients of ‘give’ > addressees of ‘tell’ >
other
More generally, the wide distribution of OVG order (in Iranian, some of Neo-Ar-
amaic, and Turkic) suggests that goal arguments are comparatively susceptible to
“synchronization” with a contact language, leading to a common ordering, while
the linear placement of direct objects is more resistant to change (see Haig 2014b).
3.5. Other issues in contact-induced syntactic change

Another area where areal distribution is evidently relevant is adpositions. In general,
the Semitic languages have all retained their inherited prepositions, regardless of
heavy contact influence. The Turkic languages have likewise largely retained their
postpositions. Iranian languages of the region, however, exhibit all possible kinds
of adpositions, which approximately correspond to a north-south cline. While
Southern and Central Kurdish, Kumzari, Hawrami, and Bakhtiari (and Persian)
are overwhelmingly prepositional, the dialects of Northern Kurdish exhibit a mix
of pre-, post- and circumpositions. The Caspian and most Tatic languages (Stilo,
Chapter 5, §5.7), on the other hand, have a “predominance of postpositions”, as
does Ossetic (see Stilo 2005, and Haig, chapter 2.3). Laz and Armenian are like-
wise predominantly postpositional. Adpositional type is generally slow to change,
so the evident variability across Iranian is presumably the result of long-standing
convergence with languages of different types.
Relative clauses and subordination generally are also domains known to reflect
contact influence (see Gandon 2016). Most of the languages of the region have
post-verbal relative clauses, and this is even found in Iran Turkic, where generally
patterns of subordination have moved closer to the Iranian type (see Khan, chap-
ters 2.1 and 4.1 for an overview of subordination). Ossetic has the most complex
patterns of subordination, and has also pre-verbal participial relative-like construc-
tions (see the nominalized verb ‘go’ in the Ossetic relative clause in (23) above),
perhaps reflecting a shift from the original post-verbal Iranian type towards the
pre-verbal pattern in the Caucasus. Laz is generally head-final in this respect,
again reflecting the overall trend for the languages to the north of the region to
adhere more closely to a consistent head-final type.
3.6. Provisional remarks on ease of syntactic borrowing in the Western Asian

context
The data we have considered above, though not a comprehensive survey, do allow
provisional conclusions regarding the ease with which the linear ordering of clause
constituents can adapt to, or align with, that of a contact language (we have not
included patterns of clause linkage and subordination in this hierarchy, but it seems
evident that they would rank highly, see Haig 2001, Matras 2002 for discussion).
These are summed up in the following hierarchy:
(25) P rovisional hierarchy of ease of contact - induced change in
clause - internal syntax
copular constructions H ighest
order of goal and verb
order of direct object and verb
adpositional order L owest
Support for the hierarchy in (25) comes from the blanket distribution of the shared
copular construction, the almost complete spread of post-verbal goals (except-
ing Laz and Ossetic), the small number of unambiguous cases of object/verb
ordering changes (e. g. the Trans-Zab Jewish dialects of NENA). But even the
languages most heavily affected by contact, such as the aforementioned varie-
ties of NENA, have not abandoned their inherited adpositional type (though of
course some of the contact languages are also predominantly prepositional).
It may in fact turn out that the relative position of object-verb, and adposi-
tional order in (25) will need to be reconsidered; this remains a task for future
research.
Introduction 27
References
Arnold, Werner. 2011. Western Neo-Aramaic. In Stefan Weninger, Geoffrey Khan, Michael
Streck & Janet Watson (eds.), The Semitic languages: An international handbook, 685–
696. Berlin & Boston: De Gruyter Mouton.
Belleli, Sara. 2016. A study on language and folklore in the city of Harsin (Kermānshāh
Province, West Iran). Sketch grammar with texts and lexicon. Naples: University of
Naples dissertation.
Bickel, Balthasar, Johanna Nichols, Taras Zakharko, Alena Witzlack-Makarevich, Kristine
Hildebrandt, Michael Rießler, Lennart Bierkandt, Fernando Zúñiga & John B. Lowe.
2017. The AUTOTYP typological databases. Version 0.1.0 https://github.com/autotyp/
autotyp-data/tree/0.1.0 (accessed 15 December 2017).
Blevins, Juliette. 2017. Areal sound patterns: From perceptual magnets to stone soup. In
Raymond Hickey (ed.), The Cambridge handbook of areal linguistics, 88–121. Cam-
bridge: Cambridge University Press.
Butts, Aaron (ed.). 2015. Semitic languages in contact. Leiden & Boston: Brill.
Chirikba, Viacheslav A. 2008. The problem of the Caucasian Sprachbund. In Pieter Muys-
ken (ed.), From linguistic areas to areal linguistics, 25–94. Amsterdam & Philadelphia:
Benjamins.
Dawkins, Richard. 1916. Modern Greek in Asia Minor. A study of the dialects of Sílli, Cap-
padocia and Phárasa, with grammar, texts, translation and glossary. Cambridge: Cam-
bridge University Press.
Dryer, Matthew S. 1988. Universal of negative position. In Michael Hammond, Edith
Moravcsik & Jessica Wirth (eds.), Studies in syntactic typology, 93–124. Amsterdam &
Philadelphia: Benjamins.
Dryer, Matthew S. 1992. The Greenbergian word order correlations. Language 68(1). 81–
138.
Dum-Tragut, Jasmine. 2009. Armenian. (London Oriental and African Language Library,
14). Amsterdam & Philadelphia: Benjamins.
Fattah, Ismaïl Kamandâr. 2000. Les dialectes kurdes méridionaux: étude linguistique et
dialectologique. (Acta Iranica, 37). Leuven: Peeters.
Frommer, Paul. 1981. Post-verbal phenomena in colloquial Persian syntax. Los Angeles:
University of Southern California dissertation.
Gandon, Ophélie. 2016. La relativisation dans une perspective areale: lʼaire Caucase – Ana-
tolie de lʼest – Iran de lʼouest. Paris: Universite Sorbonne Nouvelle – Paris 3 dissertation.
Gell-Mann, Murray & Merritt Ruhlen. 2011. The origin and evolution of word order. Pro-
ceedings of the National Academy of Sciences of the United States of America 108(42).
17290–17295.
Grawunder, Sven. 2017. The Caucasus. In Raymond Hickey (ed.), The Cambridge hand-
book in areal linguistics, 356–395. Cambridge: Cambridge University Press.
Güldemann, Tom. 2018. Areal linguistics beyond contact, and linguistic areas in Afrabia. In
Tom Güldemann (ed.), The languages and linguistics of Africa. (The World of Linguis-
tics, 11), 448–545. Berlin & Boston: De Gruyter Mouton.
Gutman, Ariel. 2018. Attributive Constructions in North-Eastern Neo-Aramaic. Studies in
Diversity Linguistics 15. Berlin: Language Science Press.
Haig, Geoffrey. 2001. Linguistic diffusion in present-day east Anatolia: From top to bot-
tom. In Robert Dixon & Alexandra Aikhenvald (eds.), Areal diffusion and genetic in-
heritance: Problems in comparative linguistics, 195–224. Oxford: Oxford University

Press.
Haig, Geoffrey. 2008. Alignment change in Iranian languages: A Construction Grammar
approach. Berlin & Boston: De Gruyter Mouton.
Haig, Geoffrey. 2014a. East Anatolia as a linguistic area? Conceptual and empirical issues.
In Lale Behzadi, Patrick Franke, Geoffrey Haig, Christoph Herzog, Birgitt Hoffmann,
Lorenz Korn & Susanne Talabardon (eds.), Bamberger Orientstudien, Band 1. Bam-
berg: Bamberg University Press.
Haig, Geoffrey. 2014b. Verb-goal (VG) word order in Kurdish and Neo-Aramaic: Typolog-
ical and areal considerations. In Geoffrey Khan & Lidia Napiorkowska (eds.), Neo-Ar-
amaic and its linguistic context, 407–425. Piscataway, NJ: Gorgias Press.
Haig, Geoffrey. 2017. Western Asia: East Anatolia as a transition zone. In Raymond Hickey
(ed.), The Cambridge handbook of areal linguistics, 396–423. Cambridge: Cambridge
University Press.
Haig, Geoffrey. To appear. Post-predicate constituents in Kurdish. In Yaron Matras, Ergin
Öpengin & Geoffrey Haig (eds.), Structural and typological variation in the dialects of
Kurdish. London: Palgrave Macmillan.
Hammarström, Harald & Mark Donohue. 2014. Some principles on the use of macro-areas
in typological comparison. Language Dynamics and Change 4(1). 167–187.
Hasselbach-Andee, Rebecca (ed.). In preparation. The Wiley Blackwell Companion to An-
cient Near Eastern Languages. Malden, MA: Wiley-Blackwell.
Hawkins, John. 2008. An asymmetry between VO and OV languages: The ordering of ob-
liques. In Greville Corbett & Michael Noonan (eds.), Case and Grammatical Relations:
Studies in honor of Bernard Comrie, 167–190. Amsterdam & Philadelphia: Benjamins.
Herin, Bruno. 2012. The Domari language of Aleppo, Syria. Linguistic discovery 10(2).
1–52. DOI: 10.1349/PS1.1537-0852.A.412, http://journals.dartmouth.edu/cgi-bin/Web
Objects/Journals.woa/1/xmlpage/1/article/412 (accessed 17 March 2018).
Jastrow, Otto. 2011. Turkish and Kurdish influences on the Arabic dialects of Anatolia. Türk
Dilleri Araştırmaları 21(1). 83–94.
Johanson, Lars & Éva Csató (eds.). 2006. The Turkic languages. London & New York:
Routledge Curzon.
Khan, Geoffrey. 2007. Grammatical borrowing in North-Eastern Neo-Aramaic. In Yaron
Matras & Jeanette Sakel (eds.), Grammatical borrowing in cross-linguistic perspective,
197–214. Berlin & Boston: De Gruyter Mouton.
Khan, Geoffrey. 2016. The Neo-Aramaic dialect of the Assyrian Christians of Urmi. 4 vols.
Leiden & Boston: Brill.
MacKenzie, David Neil. 1961. Kurdish dialect studies I. Oxford: Oxford University Press.
Maddieson, Ian. 2013a. Glottalized consonants. In Matthew S. Dryer & Martin Haspelmath
(eds.), The world atlas of language structures online. Leipzig: Max Planck Institute for
Evolutionary Anthropology. http://wals.info/chapter/7 (accessed 17 March 2018).
Maddieson, Ian. 2013b. Presence of uncommon consonants. In Matthew S. Dryer & Mar-
tin Haspelmath (eds.), The world atlas of language structures online. Leipzig: Max
Planck Institute for Evolutionary Anthropology. http://wals.info/chapter/19 (accessed
17 March 2018).
Malkiel, Yakov. 1968. The inflectional paradigm as an occasional determinant of sound
change. In Winfred P. Lehmann & Yakov Malkiel (eds.), Directions for Historical Lin-
guist, 20–64. Austin: University of Texas Press.
Introduction 29
Malkiel, Yakov. 1976. Multi-conditioned sound change and the impact of morphology on
phonology. Language 52. 757–778.
Matras, Yaron. 2002. Kurmanji complementation: Semantic-typological aspects in an areal
perspective. Sprachtypologie und Universalienforschung / Language typology and lan-
guage universals 55(1). 49–63.
Matras, Yaron. 2007. The borrowability of structural categories. In Yaron Matras & Jeanette
Sakel (eds.), Grammatical borrowing in cross-linguistic perspective, 31–73. Berlin &
Boston: De Gruyter Mouton.
Matras, Yaron. 2012. A grammar of Domari. Berlin & Boston: De Gruyter Mouton.
Nichols, Johanna. 1992. Linguistic diversity in space and time. Chicago: University of Chi-
cago Press.
Nikitina, Tatiana. 2011. Categorial reanalysis and the origin of S-O-V-X order in Mande.
Journal of African Languages and Linguistics 32. 251–273.
Obrtelová, Jaroslava. 2017. Narrative structures of Wakhi oral stories. Uppsala: Studia
Iranica Upsaliensia.
Stilo, Donald L. 2005. Iranian as buffer zone between the universal typologies of Turkic
and Semitic. In Éva Ágnes Csató, Bo Isaksson & Carina Jahani (eds.), Linguistic con-
vergence and areal diffusion: Case studies from Iranian, Semitic and Turkic, 35–63.
London & New York: Routledge Curzon.
Stilo, Donald L. 2010. Ditransitive constructions in Vafsi: A corpus-based study. In An-
drej Malchukov, Martin Haspelmath & Bernard Comrie (eds.), Studies in ditransitive
constructions: A comparative handbook, 243–276. Berlin & New York: De Gruyter
Mouton.
Stilo, Donald L. 2012. Intersection zones, overlapping isoglosses, and ‘Fade- out/Fade-in’
phenomena in Central Iran. In Behrad Aghaei & M. R. Ghanoonparvar (eds.), Iranian
languages and culture: Essays in honor of Gernot Ludwig Windfuhr, 3–33. Costa Mesa:
Mazda.
Testelec, Yakov. 1998. Word order in Kartvelian languages. In Anna Siewierska (ed.), Con-
stituent order in the languages of Europe, 235–256. Berlin & Boston: De Gruyter Mou-
ton.
Weninger, Stefan, Geoffrey Khan, Michael Streck & Janet Watson (eds.). 2011. The Semitic
languages: An international handbook. Berlin & Boston: De Gruyter Mouton.
Windfuhr, Gernot (ed.). 2009. The Iranian languages. London & New York: Routledge
Curzon.
2. Eastern Anatolia and northwestern Iran
2.1. Eastern Anatolia and northwestern Iran:
overview
Geoffrey Khan
The languages covered by the chapters in this section include the Indo-European
languages Armenian and Kurdish and the Semitic languages Arabic and Neo-Ar-
amaic. The varieties of Turkic spoken in the region are treated in Section 4 of this
volume, devoted to western Iran (Bulut, this volume, chapter 4.2). The current
chapter describes the language situation that existed at the beginning of the 20th
century. Some components of the language situation have undergone major changes
since this period. This applies in particular to Armenian and Neo-Aramaic, which
are now spoken only by small isolated communities in the Republic of Turkey. At
the beginning of the 20th century Armenian was spoken throughout the region and
Neo-Aramaic was spoken in numerous communities in southeastern Turkey from
the region of Ṭūr ʿAbdīn (Mardin) eastwards. The language situation of the region
had undergone various evolutions also at earlier periods. Armenian and Neo-Ara-
maic have deeper historical roots, which predate the arrival of Arabic, Kurdish and
Turkic languages. All the languages of the region in the modern period exhibit a
considerable diversity of dialects. The numerous dialects of Armenian are broadly
divided into western and eastern Armenian, with the boundary between the two
coinciding roughly with the border between eastern Turkey and Armenia and Iran.
The Kurdish dialects belong to the northern variety of the language known as
Kurmanji and this can be divided into various subgroups according to geograph-
ical area. The Neo-Aramaic dialects fall into the Central Neo-Aramaic subgroup
spoken west of the Tigris river and the North-Eastern Neo-Aramaic (NENA) sub-
group spoken east of the Tigris. The Arabic dialects are divided into the eastern,
sedentary-type dialects (often called qǝltu dialects, from the form qǝltu ‘I spoke’,
which is a distinctive feature of this group), and the western Bedouin type of dia-
lects.
Numerous contact-induced convergences can be identified in the languages
described here. A selection of these are noted in this introductory chapter. The
languages are attested in a wide range of dialects, classifiable into a variety of
geographical subgroups, and so shared areal features generally relate to specific
areas of the wider region.
32 Geoffrey Khan
1. Phonology
1.1. Loss of Semitic interdental fricatives

Some dialects of Arabic and Neo-Aramaic preserve interdental fricative conso-
nants, which are a characteristic feature of Semitic. In many dialects of these
languages in the region, however, the interdentals have been eliminated to bring
their phoneme inventories more in line with the non-Semitic contact languages,
which lack interdentals. In several of the qǝltu Arabic dialects, for example, the
interdental fricatives (ḏ, ṯ, ḏ)̣ have shifted to stops (d, t, ḍ), sibilants (z, s, ẓ), or
labio-dental fricatives (v, f, ṿ), respectively (Procházka, chapter 2.4, §1.1). In the
Central Neo-Aramaic dialect of Mlaḥso the interdentals ṯ and ḏ shift to sibilants (s,
z), whereas in the North-Eastern Neo-Aramaic (NENA) dialects of the region the
most frequent process is for the interdentals to shift to stops (t, d). In some NENA
dialects the unvoiced fricative ṯ is reduced to zero. In the Jewish dialects of north-
western Iran the interdentals shift to the lateral l.
1.2. Unaspirated stops

Many Armenian dialects of the region have in their phonemic inventory unaspi-
rated unvoiced stops contrasting with aspirated unvoiced stops, which are found
also in Classical Armenian and are possibly an inheritance from earlier Indo-Eu-
ropean (Martirosyan, chapter 2.2, §4.1). Unaspirated stops are also found in the
Kurdish and Neo-Aramaic dialects of the region. Since unaspirated stops are not
a feature of Kurdish and Neo-Aramaic dialects further south outside of the Arme-
nian dialect area, it is highly likely that that they entered Kurdish and Neo-Ara-
maic through contact with Armenian.
1.3. Pharyngeals and pharyngealized consonants

Pharyngeal fricatives (ḥ and ʿ) and pharyngealized consonants are a feature of
Semitic and occur in the Arabic and Neo-Aramaic dialects of the region.
The Central Neo-Aramaic subgroup, spoken in the region of Mardin
(Ṭūr ʿAbdīn), preserves the pharyngeal fricatives ḥ and ʿ most conservatively,
probably due to extensive contact with Arabic, whereas in most NENA dialects
they have been lost. A small cluster of NENA dialects in the region Bohtan on the
western periphery of NENA adjacent to the Central Neo-Aramaic dialect area have
the pharyngeal ḥ in their sound inventory (Khan, chapter 2.5, §5.3). Pharyngeal
fricatives occur in some Kurdish dialects in the area of Mardin. It is possible that
they have entered Kurdish through Arabic or Central Neo-Aramaic.
Pharyngealized consonants occur throughout Arabic and Neo-Aramaic dia-
lects in the region (Khan, chapter 2.5, §5.4). They are found also in Kurdish in
Eastern Anatolia and northwestern Iran: overview 33
the Mardin area, where as just indicated, also pharyngeal consonants have devel-
oped, it seems in association with the pharyngealization of the consonants (Haig,
chapter 2.3, §2.1.2). In the NENA dialects in the eastern half of the region, the
original pharyngealization of the consonantal segments has developed into a
suprasegmental feature that takes a whole word as its domain. In the Christian
dialects where this suprasegmental feature develops, an original pharyngealized
stop ṭ [ˁt] becomes reanalyzed as an unaspirated stop ṱ, which is faciliated by the
existence of unaspirated unvoiced stops in their phoneme inventories.
1.4. h>x
The shift of the laryngeal h > x is a characteristic of the Armenian dialects of the
Van and Urmi areas. In the variety of the Christian Urmi NENA dialect spoken
in Armenia a historical h is shifts to the velar fricative /x/ under the influence of
a similar shift in the Armenian dialects of the region, e. g. xada ‘thus’ (< hada).
1.5. Palatalization
Palatalization of dorsals is found throughout the region. In NENA the process is
particularly common in the southeast of the region and is most advanced in the
Christian Urmi dialect, in which the stops *k and *g shift without conditioning to
affricates č [ʧh] and j [ʤ]. Parallels to such advanced palatalization can be found
in the Azeri Turkic and Kurdish dialects of the Urmi region. In the NENA Chris-
tian Urmi dialect there is complete palatalization of *k in some words resulting in
y, e. g. *dukta > *duyta ‘place.’ A parallel to this can be found in the Armenian
dialects of the Urmi area, in which unaspirated k and g become y and aspirated
k becomes hy before syllable-initial consonants (Martirosyan, chapter 2.2, §6.6).
Kurdish dialects in the Şemzînan area exhibit a lesser degree of palatalization of the
stops g and k, which are realized with a palatal offglide preceding front or central
unrounded vowels, e. g. Standard Kurdish [kɨraːs], Şemzînan [kʸɨraːs] ‘robe’.
1.6. Rhotics
In the Kurdish dialects of the region two types of rhotic are found, viz. trills [r] or
flaps [ɾ]. These can contrast when not in word-initial position, e. g. [pɨr] ‘much’ vs.
[pɨɾ] ‘bridge’ (Haig, chapter 2.3, §2.2). In NENA dialects of the region rhotics are
likewise either trills or flaps, which can contrast. The trills, which are pronounced
with greater muscular tension, are typically also pharyngealized. In some of the
NENA dialects of the Tyare area, in addition to the non-pharyngealized flap and
the pharyngealized trill a non-pharyngealized retroflex ɻ has developed, resulting
in a three-way rhotic opposition, e. g. Ashitha: paɻa ‘layer of dust’, para ‘orna-
mental coin’, paṛa ‘lamb’.
34 Geoffrey Khan
1.7. a>o
The backing of the vowel a is a feature that is shared by many of the languages
in the western area of the region. In Armenian dialects an a shifts to ɔ in stressed
syllables in the area of Cilicia and even to the high vowel u in Svedia (Marti-
rosyan, chapter 2.2, §6.1). In Kurdish dialects of the area of Mardin a low central
unrounded [aː] of Standard Kurdish is retracted to [ɑː] and, in dialects further
west, has the quality of [ɔː] (Haig, chapter 2.3, §3.1.2). In some NENA dialects of
the Bohtan area long a in non-final open syllables shifts to o, e. g. Ruma xmóra
‘donkey’ (< *ḥmārā). In other NENA dialects of the Bohtan cluster, such as Herte-
vin, a long a is realized as a back [ɑː]. In Central Neo-Aramaic spoken further west
in the area of Ṭūr ʿAbdīn all historical long a vowels have the quality of o, e. g.
Ṭuroyo: ḥmoro ‘ass’ (< *ḥmārā).
1.8. Vowel fronting and raising

In the Kurdish dialects in southeastern Anatolia (Şemzînan) a back high vowel
[uː] fronts to [yː] (transcribed here as ü), e. g. Standard Kurdish stur, Şemzînan
stür ‘thick’. By a pull-chain effect [oː] in such Kurdish dialects is raised to [uː] to
fill the gap left by the fronting of original [uː], Standard Kurdish bo, Şemzînan bu
‘for’. Similar fronting of [uː] is found in the Kurmanji dialects of the Urmi region
([uː] > [yː] > [iː]), in the Kurmanji spoken in northern Iraq (Badini dialect, Haig,
this volume, chapter 3.3, §4.1.2) and in Mukri dialects in northwestern Iran ([uː]
> [yː]). These same processes are found in the NENA dialects of the southeast-
ern area of the region, e. g. +Mawana: xabüša < *ḥabbūšā ‘apple’, ruqe < *rōqē
‘spittle’. In the Christian Urmi dialect long ū is not fully fronted but shifts to a
diphthong with a palatal offglide: xabuyša ‘apple’ (Khan, chapter 2.5, §5.5.2).
1.9. w>x
In many Armenian dialects of eastern areas of the region the w of the diphthong
aw shifts to x by a process of fortition before a dental stop or affricate, e. g. eawt‘n
‘seven’ > Mełri ɔ́xtə (Martirosyan, chapter 2.2, §5.1). This process in Armenian
may have been a factor in the fortition of the offglide of long u to x in some NENA
dialects of the area, e. g. Christian Salamas xibuxša < *ḥabbūša ‘apple’ (Khan,
chapter 2.5, §5.5.2).
2. Morphology
2.1. Pronouns
A salient feature of the Anatolian Arabic qǝltu dialects is occurrence of n in several
plural pronouns and pronominal inflections of common gender. In such forms one
would expect m, which is a feature of the masc. plural pronouns in other Arabic
dialects and Old Arabic, e. g. Mardin hənne (3 cpl independent pronoun), -hən
(3 cpl pronominal suffix), ʾəntən (2 cpl independent pronoun), -kən (2 cpl pronom-
inal suffix). It is likely that these were influenced by the form of the corresponding
pronouns in the Neo-Aramaic dialects of the area, which contain n, e. g. Ṭuroyo
(Mardin area) hənne (3 cpl independent pronoun), NENA Hertevin (Bohtan area)
ʾaḥniton (2 cpl independent pronoun), -eḥon (2 cpl pronominal suffix) (Khan,
chapter 2.5, §6.1).
In the Kurdish of the region demonstrative constructions typically consist of a
demonstrative pronoun before a noun and a postposed deictic clitic, e. g. ew-deft-
er=e ‘this book’, ev-defter=e ‘that book’. (Haig, chapter 2.3, §2.2.3). The clitic
may also be attached directly to a demonstrative pronoun when it is used inde-
pendently, e. g. ew-e ‘this one’. This pattern may have influenced the demonstra-
tive constructions in Central Neo-Aramaic, which consists of a preposed definite
article (originally an anaphoric pronoun) and a postposed demonstrative suffix,
e. g. ʾu-malk-ano ‘this king’ (Khan, chapter 2.5, §6.1.).
In most of the Kurdish dialects of the region there are near deixis and far deixis
demonstratives, which are distinguish by the contrast of /v/ and /w/, viz. ev (near
deixis), ew (far deixis). In the Şemzînan area the [v] and [w] are not phonologi-
cally distinct and so there is only one demonstrative, viz. ew. Near deixis is dis-
tinguished by attaching the deictic clitic, e. g. ew-defter=e ‘this book’, ew-defter
‘that book’, ew-e ‘this one’, ew ‘that one’. These constructions and the phonetic
shape of the demonstratives have close replicas in some NENA dialects of the
area, which are innovative and have most likely developed by a process of con-
vergence with Kurdish, e. g. Ashitha ʾawwa (< ʾaw-ha) ‘this one’, ʾaw ‘that one’
(anaphoric). The forms have Aramaic etymologies but have taken on the phonetic
shape of the corresponding Kurdish forms. The same applies to the qəltu Arabic
dialect of Siirt, in which near deixis demonstratives have an innovative form as
a result of convergence with the phonetic shape of Kurmanji, e. g. āva < *hāḏā
(Jastrow 1978, 1: 108).
Some NENA and Arabic dialects of the region have three types of deixis, viz.
(i) near to speaker, (ii) near to hearer, (iii) far from both speaker and hearer, e. g.
NENA Ashitha (i) ʾawwa, (ii) ʾaw, (iii) ʾawaha, Arabic Daragözü (Jastrow 1978,
1: 109) (i) āza, (ii) āk, (iii) ukkā́ . The hearer deixis form (ii) functions also as an
anaphoric pronoun. The far deixis form (iii) is formed by adding a deictic suffix
–ha. In the case of the NENA Ashitha (iii) form, this clitic is added to the (i) form,
36 Geoffrey Khan
which already has a deictic clitic, resulting in the accumulation of two suffixes
(ʾawaha < ʾaw-ha-ha). This three-way distinction of demonstratives is found in
Eastern Armenian (Dum-Tragut 2009: 129–130) and also in some Kurmanji dia-
lects in northern Iraq, which resemble closely the three-way set of NENA demon-
stratives cited above, e. g. Kurmanji of Amedia: awa ‘this’, aw ‘that (anaphoric),
awēhē ‘that over there’ (MacKenzie 1961: 82, 174). Bedir Khan and Lescot (1968:
224) also mention the additional demonstrative particle ha / he, which attaches
to a noun modified by a demonstrative, following the postposed clitic mentioned
above: ev jinik=a ha ‘this woman’.
2.2. Noun phrase
2.2.1. Genitive markers

In some Arabic dialects of the region a head noun is combined with a dependent
noun in a genitive relation by means of a relative particle (Procházka, chapter 2.4,
§4.2), e. g.
(1) Haksöy
sǝʕēb lē baqqāl
owner rel / gen shop
‘The owner of the shop’ (literally: the.owner the.one.of the.shop)
Constructions with such relative/genitive particles are innovative developments
in Arabic but are an archaic feature of Aramaic and continue to be used in the
Neo-Aramaic dialects of the region. In Central Neo-Aramaic dialects the particle
is prefixed to the dependent noun, as in earlier Aramaic, e. g.
(2) Ṭuroyo
ʾi-barθo d-u-malko
art . fs -daughter gen - art . ms -king
‘the daughter of the king’
The synthetic construction with a relative/genitive particle in Arabic dialects such
as Haksöy may have developed by convergence with this Aramaic construction.
Arabic dialects also have an independent genitive pronominal element, e. g.
(3) Qarṭmīn (Jastrow 1978, 1: 125)
əlfaṛas ḏīli-ye
the.horse gen . pro - cop .3 fs
‘The horse is mine’
This also is likely to have developed through convergence with Neo-Aramaic dia-
lects, e. g.
(4) Christian Urmi

ʾo-suysa diyyi=ilə
the-horse gen . pro = cop .3 ms
‘The horse is mine’
Parallel constructions are found also in Kurdish with independent ezafe particles,
e. g.
(5) va defter=na yê min=in
dem . prox notebook=dem . pl ez . pl 1 sg . obl - cop .3 pl
‘These notebooks are mine’
In genitive constructions in noun phrases in the NENA dialects the relative/geni-
tive particle is typically suffixed to the head noun, e. g.
(6) Christian Urmi
bet-ət malca
house-gen king
This is likely to be a convergence with the pattern of the Kurdish attribute marking
ezafe, which is suffixed to the head noun, e. g.
(7) a. xani-yê mirov-î
house-ez man-obl
‘the man’s house’
b. xani-yê wî mirov-î
house-ez that.obl man-obl
‘this man’s house’
In Kurdish dependent nouns, pronouns, and demonstratives are in an oblique case
form. In Neo-Aramaic the oblique of the demonstratives have been replicated by
the development of an innovative oblique pronoun:
(8) C. Urmi
bet-ət do malca
house-gen that.obl king
‘the house of that king’
2.2.2. Adjectives
In many of the Arabic dialects of the region a noun and its attributive adjective are
combined in a genitive-type construction, which is shown by the fact that a defi-
nite head noun does not have the article (Procházka, chapter 2.4, §4.2).
38 Geoffrey Khan
(9) Mardin
bǝnt ǝl-ǝkwayys-e
girl def -beautiful-f
‘the beautiful girl’
This is an innovation in Arabic which is likely to have developed by convergence
with Kurdish, in which a noun is regularly linked to an adjective by the relative/
genitive ezafe particle, e. g.
(10) bajar-ek-î mezin
town-indef - ez . m big
‘a big town’
Some features of the syntax of adjectives in Kurdish appear to have had an impact
on NENA constructions. The use of independent ezafe particles to add additional
dependents to a head noun and to nominalize an adjective have parallels in NENA,
in which a demonstrative pronoun stands in the position of the ezafe, e. g.
(11) Kurdish
bra-yê min ê mezin
brother-ez . m 1sg . obl ez . m big
‘my older brother’
(12) C. Urmi (Khan 2016, 2: 49)
brāt-malca ʾe +
ɟurta
daughter-king dem . fs . big
‘The elder daughter of the king’
Kurdish: yê spî

NENA (C. Urmi): ʾo +xvara
‘the white one (masc. sing.)
2.3. Prepositions
Some grammaticalization pathways of nouns to prepositions that are found in
Kurdish have been replicated in NENA, e. g.
Kurdish: ser ‘head’ li ser ‘on, upon’
NENA (C. Urmi): riša ‘head’ b-rīš ‘on, upon’
2.4. Attention-drawing particle d-

Several languages of the region use a particle dV-, with various different vowel
qualities filling the V position, to intensify an imperative. This is found in Arabic
dialects, e. g. Mardin: dǝ-qūmu ‘stand up (pl.)!’, Harran-Urfa d-iniṭ-ni ‘give
(m.sg.) me!’, du-gul-li ‘tell (m.sg.) me!’, Kurmanji Kurdish, e. g. de bêje ‘go on,
say!’, and Neo-Aramaic dialects, e. g. Ashitha de-pluṭ! ‘go out!’. It is possible that
it has its origin in an Arabic demonstrative particle ḏā/ḏī (Procházka, chapter 2.4,
§6). This is likely to be related to the Arabic near deixis demonstrative hāḏā. The
particle may also be identified as a prefix in the deictic copula of some NENA
dialects of the area, in which it typically expresses near deixis, e. g. Christian Urmi
d-ulə ‘here he is’ (Khan, chapter 2.5, §6.1).
2.5. Verbal constructions
2.5.1. Preverbal particles

All languages of the region express TAM distinctions by affixes attached to verbal
stems. There are conspicuous similarities in the patterning of some of these par-
ticles across the languages. This is particularly the case with regard to the pre-
fixes expressing the indicative present, which are found in Arabic qəltu dialects,
Neo-Aramaic, Kurdish and Armenian, e. g.
Arabic (Azəx): kū-nəktəb ‘we are writing’
Neo-Aramaic (Ṭuroyo) ko-nošeq ‘he kisses’
NENA (Christian Urmi) ci-šatə ‘he drinks’
Armenian (Muš) kə-sirim ‘I love’
Kurmanji di-xum ‘I eat’
It appears that most, if not all, of these preverbal particles are in origin deictic par-
ticles that draw attention to an activity in progress in the present.1 It can be estab-
lished that several of them have developed from a deictic particle with a copula
enclitic, e. g. Arabic k-u < k-uwe ‘it is that’, C. Urmi c-i- < c + i. This probably
also applies to the Ṭuroyo prefix k-o < k-yo or kale-yo. One of the etymologies
proposed for the Armenian kə- is a deictic particle ‘behold’ (Makaev 1977). It is
notable that many of the particles have a similar phonetic shape across the various
languages, although each is likely to be an internal development. In some lan-
1
Matras (2010: 75) considers the parallels among the languages discussed here to be the
result of a common grammaticalization pathway, suggesting that the prefixes “usually
derive from a preposition indicating location or similarity”. This appears unlikely for
the languages of Anatolia, but note that the etymology of the Kurdish indicative prefix
di-, or the functionally equivalent forms da-, a- in other varieties of Kurdish, has not yet
been satisfactorily resolved.
40 Geoffrey Khan
guages, such as Arabic, the progressive is still the basic function of the form. In
most languages the aspectual range has been extended to habitual as well as pro-
gressive (e. g. Ṭuroyo, Kurmanji and many western Armenian dialects). In these
languages a progressive is sometimes specifically marked by additional particles,
e. g. Ṭuroyo kal-ko-nošəq ‘he is kissing’, or in some cases by repeating the parti-
cle, e. g. Armenian Zeyt‘un gɔ g-aṙnum ‘I am taking’. In Southeastern Kurmanji
(Bahdini) a progressive is expressed by combining the present with an ezafe par-
ticle, which functions as a copula in these dialects, e. g. yê di-hêt ‘he is coming’.
In Armenian, Neo-Aramaic and Arabic the irrealis subjunctive is expressed
by the verb without a preverbal particle, whereas in Kurmanji the subjunctive is
expressed by a dedicated particle.
In the eastern Armenian dialects in the east of the region the indicative present
is expressed by other types of constructions, mostly consisting of an infinitive
(sometimes in the locative case), participle or verbal noun and the copula (Mar-
tirosyan, chapter 2.2, §4.2). In NENA dialects east of Bohtan, such constructions
based on the infinitive are used to express the progressive, constructions with the
preverbal particle being largely restricted to the habitual. In the far east of the
region, in northwestern Iran, the usage of constructions based on the infinitive are
extended to include the habitual, in some cases as an alternative to the habitual
expressed by a construction with a preverbal particle, e. g. Christian Urmi, and
in some cases as the regular form for both progressive and habitual, e. g. Jewish
Salamas. The distribution of the construction in the Jewish Salamas NENA dialect,
therefore, exhibits the greatest convergence with eastern Armenian dialects. The
indicative construction based on the copula and infinitive is found also in north-
west Iranian languages belonging to the Caspian and Tatic groups (see Stilo, this
volume, chapter 5) in adjacent areas in the Caspian region in northern Iran and
southern Azerbaijan, where eastern Armenian dialects were also spoken. In some
cases, this is the general construction for the indicative, e. g. North Talyshi (Tatic),
in other cases it has specifically a progressive sense, e. g. Harzani, Kalāsuri
(Caspian) (Stilo, this volume, chapter 5, §6.5.1.3). Preverbal particles are used to
form the indicative in north-west Iranian languages spoken further south (Stilo,
this volume, chapter 5, §6.5.1.1). Standard Turkish spoken in the Republic of
Turkey expresses the progressive by locative constructions based on the infinitive,
but the various non-standard Turkic vernaculars of the region do not exhibit this
feature (Bulut, this volume, chapter 4.2, §2.3.3.2).
In some Armenian dialects other constructions are used to express the pro-
gressive, such as possessive constructions with the verb ‘to have’ (Martirosyan,
chapter 2.2, §4.2.), parallels for which can be found in north-west Iranian lan-
guages (Stilo, this volume, chapter 5, §6.5.8).
2.5.2. Future
The future tense is expressed by a preverbal particle in most languages of the
region. In the Arabic qəltu dialects this is ta-. In Ṭuroyo (Central Neo-Aramaic)
the future is expressed by a preverbal particle with various allomorphs (g, k, gd,
kt), all of which derive historically from a particle with the form *gəd (Jastrow
1985: 148–149). This can also express habitual aspect. In Mlaḥso (Central
Neo-Aramaic) the future particle has the form d-, which is likely to be related
(Jastrow 1994: 52). It is likely that the alveolar element t/d in these forms is a
subordinating particle. In NENA dialects of the region the future preverbal par-
ticle has the form bəd-, which is often contracted to b- or, before vowels, p̂ ṱ- ṱ.
In Kurmanji Kurdish the future is expressed by the preverbal particle (d)ê or wê
followed by the verb with a subjunctive prefix bi- (though in the southeastern
dialects of Kurmanji, the verb lacks this prefix in the future tenses). The Arabic
ta- is derived from the purpose particle *ḥattā ‘so that, in order that’. The NENA
future particle appears to be derived from the volitive expression *bāʿē d- ‘wants
to’. The Kurmanji future marker dê/wê is likely to be a fossilized third singular
form of the verb vîyan / -vê- ‘be necessary, desirable’, still used in many dialects
to express ‘want’, cf. Sorani awē, southeastern varieties of Kurmanji dɨvê(t). A
parallel to the Ṭuroyo future can be identified in the expression ga d- (< *ka d-)
to express the imminent future (‘about to’), which is used in NENA dialects in
northern Iraq (Khan 2008: 583–584) and some of the dialects of Anatolia.
In Standard Kurmanji the future particle dê/wê etc. can be combined with a
negated subjunctive form of the verb to express negated future. However, in the
varieties of Kurmanji spoken in Şemzînan and northern Iraq, this is not possible.
Instead, a negated form of the indicative is used to express the negated future. The
lack of a future particle in negative future constructions is the norm in the qəltu
Arabic dialects (Jastrow 1978, 1: 313) and the NENA dialects.
In some of the languages of the area future verb forms may also be used to
express the habitual aspect. This is the case in Ṭuroyo, NENA and the Kurmanji
dialects in the southeast of the region (known as Badini).
2.5.3. Perfect
The perfect is formed in Arabic qəltu dialects and in Ṭuroyo Neo-Aramaic by
combining a deictic/presentative particle with the past perfective form (Procházka,
chapter 2.4, §3.3; Khan, chapter 2.5, §7). The particle is typically the same, or
similar to, the one used with present verbs to express the present indicative or
present progressive and is a reduced form of a 3 s deictic copula, e. g. Arabic Āzəx
kū-ṣərtu ‘I have become’ (Jastrow 1978, 1: 307), Ṭuroyo ko-qayəm ‘he has arisen’.
Armenian and the NENA dialects east of Bohtan express the perfect by a par-
ticiple and copula. In Armenian this is a perfect participle, e. g. Eastern Armenian
42 Geoffrey Khan
gnac’el em (go.prf . ptcp cop .1 s ) ‘I have gone’ (Dum-Tragut 2009: 222), in NENA
it is expressed by a participle that functions as both a resultative and perfect parti-
ciple, e. g. Christian Urmi xišə-vən (go.res . ptcp cop .1 ms ) ‘I have gone’. In some
Armenian dialects the construction with the perfect participle has come to express
the perfect, Kakʻavaberd (eastern Armenian), xəmal əm ‘I drank’. In Kakʻavaberd
the perfect is now formed by a construction with a resultative participle (Muradyan
1967: 139).
The perfect in Standard Kurmanji appears to be a hybrid between these two
strategies. Most of the paradigm of Standard Kurmanji looks like a simple past
combined with a present 3 s copula, e. g. hat-im-e (come.pst -1 s - cop .3 s ) ‘I have
come’ (cf. hatim ‘I came’) but the 3 s form looks more like a perfect participle +
copula hatiy-e ‘he has come’ (cf. hat ‘he came’) (Thackston 2006: 53).
In Southeastern Kurmanji dialects the perfect is formed by a combination of
the ezafe particle with a past participle, e. g. yê hatî (ez . ms come.pst . ptcp ) ‘he has
come’. In such dialects the ezafe functions as a copula, so this is equivalent to the
strategy of combining the participle with a copula.
The Mlaḥso dialect of Central Neo-Aramaic and the NENA dialects of the
Bohtan area express the perfect by attaching to a past stem a set of person suf-
fixes that are characteristic of the inflection of the present, e. g. Ruma (Bohtan):
qəm-la (perfective < qim rise. pst - erg .3 fs ) ‘she stood up’, qim-a (perfect: qim
rise. pst - direct .3 fs ) ‘she has stood up’; cf. qem-a (present: stand.prs - direct .3 fs )
‘she stands up’.
The western Bedouin subgroup of Arabic dialects express the perfect by
the active participle, which is a common strategy across many Arabic dialects
(Procházka, chapter 2.4, §3.3).
The languages of the region use the perfect not only to express events from
a reference point of the present (resultative or anterior) but also to express the
evidential.
2.6. Copula
All the languages of the region express the copula by an enclitic that is attached
to the predicate. In the Indo-European languages, Armenian and Kurmanji, the
copula is verbal in origin, though phonetically reduced to little more than person
markers. In the Semitic languages, Arabic and Neo-Aramaic, the copula is pro-
nominal in origin. In Arabic the enclitic copula is found specifically in the qəltu
dialects, which share many other areal features with contact languages. In these
qəltu dialects the copula is clearly pronominal (Procházka, chapter 2.4, §4.1). Also
in Central Neo-Aramaic (Ṭuroyo and Mlaḥso) the enclitic copula is clearly pro-
nominal. In NENA dialects, however, the copula reflects some feature of verbal
inflection, which is an innovation that has presumably developed by convergence
with the verbal copulas of Kurmanji and Armenian.
In addition to the enclitic copula, qəltu Arabic and Neo-Aramaic also have
deictic copulas, which draw attention to an entity or a situation. These typi-
cally consist of a deictic pronominal element and an enclitic copula, e. g. Arabic
(Kəndērīb) kūwe (< k + uwe) (Jastrow 1978, 1: 139), NENA Ashitha hole (<
ha-ʾaw=ile) ‘there it is him, there he is’. In Ṭuroyo it consists of an object pronoun
rather than an enclitic copula, e. g. kalé ‘see him, there he is’.
In the Southeastern Kurmanji dialects (Bahdini) the ezafe, originally a relative/
attributive particle, has come to be used as a copula. Unlike the enclitic copulas
just described, the ezafe copula is placed before the predicate, sometimes in an
independent form, e. g. Ergin yê bərsî ‘Ergin is hungry’. This appears to have
developed first in affirmative contingent and locative predicates. Such construc-
tions are replicated in qəltu Arabic and NENA dialects by deictic copula construc-
tions, e. g.
(13) Arabic Qarṭmīn (Jastrow 1978, 1: 140)
əbnu kū qəddām əmm-u
son-his deic . cop .3 ms before mother-his
‘His son is in front of his mother’
(14) NENA Ashitha
bron-e hole qam-yəmm-e
son-his deic . cop .3 ms before-mother-his
‘His son is in front of his mother’
In negative copula constructions, the enclitic copulas are typically attached to the
negators in Arabic, Neo-Aramaic and Armenian, e. g.
(15) Arabic Mḥallami (Procházka, chapter 2.4, §5.3)
ma-na hawn
neg - cop .1s here
‘I am not here’
(16) NENA (Ashitha)
lɛle (< la=ile) laxxa
neg - cop .3 ms here
‘He is not here‘
(17) Eastern Armenian (Dum-Tragut 2009: 215)
Anuš-ě gełec’ik ałǰik ē
Anuš. nom -the beautiful girl.nom cop .3 s
‘Anuš is beautiful girl’
(18) Anuš-ě gełec’ik ałǰik č’-ě
Anuš. nom -the beautiful girl.nom neg - cop .3 s
‘Anuš is not a beautiful girl’
44 Geoffrey Khan
3. Syntax
3.1. Alignment of transitive clauses

The Kurdish dialects of the region exhibit an essentially ergative alignment in
perfective and perfect transitive clauses, in that the verbal inflection of perfective
verbs and the copula of perfect constructions agree with the object of the clause
and the agent has oblique case. The Neo-Aramaic dialects of the region exhibit
some degree of convergence to the alignment patterns of Kurdish. Subjects of
perfective transitive clauses have oblique agreement suffixes in NENA. In many
dialects, moreover, a pronominal object is expressed by an absolutive suffix and
a definite object argument in the clause is cross-referenced by such an absolu-
tive suffix. In some Neo-Aramaic dialects, however, objects are not expressed by
absolutive suffixes and the alignment is closer to an accusative typology than an
ergative one.
The other languages of the region all have accusative alignment.
3.2. Relative clauses

Both dialect groups of Arabic in the region exhibit an innovation in the struc-
ture of relative clauses whereby the head noun is put into attributive annexation
(“construct state”) with what follows. This is presumably due to convergence
with Kurdish ezafe constructions, which is used to join both a head to a genitive
nominal attribute and a head noun to an attributive relative clause (Procházka,
chapter 2.4, §5).
The Southeastern Kurmanji (Bahdini) predicative ezafe functions both as
copula and as a relative copula, e. g.
(19) meriv yê l-mal
man ez . m in-house
‘The man is in the house’ ~ ‘The man who is in the house’
The relative function the ezafe is replicated in qəltu Arabic and NENA by a rela-
tive particle with a clitic copula: Arabic (Qarṭmīn) la-wwe (rel - cop .3 ms ), NENA
(Ashitha): ṱ-ile (rel - cop .3 ms ), e. g.
(20) NENA (Ashitha)
ʾo-naša ṱ-ile gu-bɛθa
that-man rel - cop .3 ms in-house
‘The man who is in the house’
References
Bedir Khan, Emir Djeladet and Roger Lescot. 1968. Grammaire kurde (dialecte kurmanji).
Paris: Maisonneuve.
Dum-Tragut, Jasmine. 2009. Armenian. (London Oriental and African Language Library
14). Amsterdam & Philadelphia: Benjamins.
Jastrow, Otto. 1978. Die mesopotamisch-arabischen Qəltu-Dialekte, vol. 1. Wiesbaden:
Harrassowitz.
Jastrow, Otto. 1985. Laut- und Formenlehre des neuaramäischen Dialekts von Mīdin im Ṭūr
ʻAbdīn. Wiesbaden: Harrassowitz.
Jastrow, Otto. 1994. Der neuaramäische Dialekt von Mlaḥsô. (Semitica Viva 14). Wies-
baden: Harrassowitz.
Khan, Geoffrey. 2008. The Neo-Aramaic Dialect of Barwar, 3 vols. Leiden & Boston: Brill.
Khan, Geoffrey. 2016. The Neo-Aramaic Dialect of the Assyrian Christians of Urmi, 4 vols.
MacKenzie, David Neil. 1961. Kurdish Dialect Studies. London Oriental Series, vols. 9–10.
Oxford: Oxford University Press.
Makaev, E. A. 1977. The origins of the modal particle ke in Armenian. In N. Ya Gabaraev
(ed.), Questions of Iranian and General Philology, 195–200. Tbilisi: Metsniereba pub-
lishing house. [In Russian].
Matras, Yaron. 2010. Contact, convergence, and typology. In Raymond Hickey (ed.), The
handbook of contact linguistics, 66–85. Maldon & Oxford: Blackwell.
Muradyan, H. D. 1967. Kak‘avaberdi Barbaṙə. Yerevan: Academy Press.
Thackston, Wheeler. M. 2006. Kurmanji Kurdish: A reference grammar with selected read-
ings. Unpublished Manuscript, Harvard University. http://www.fas.harvard.edu/~iranian/
Kurmanji/index.html (accessed 15 January 2015).
2.2. The Armenian dialects
Hrach Martirosyan
1. Introduction
The Armenian language is known to us from the fifth century CE onwards thanks
to an unbroken literary tradition, which involves three periods: Classical (5th to
11th centuries), Middle (12th to 16th), and Modern (17th to 21st). Modern Arme-
nian was canonized in the 19th century and consists of two branches: Western
(based on western dialects such as that of Constantinople/Polis) and Eastern
(based on the dialects spoken on the Ararat plain and surroundings). Armenian is
genetically related to Indo-European languages such as Hittite, Sanskrit, Avestan,
Greek, Latin, Gothic, and Slavic.
One usually distinguishes around fifty or sixty modern Armenian dialects, a
number of which are now extinct. The dialects have been classified according to a
phonological principle, namely the development of the consonant system, as well
as a morphological one, that is the formation of the indicative present (these two
features are jointly demonstrated in Table 2). J̌ ahukyan (1972; Djahukian apud
Greppin and Khachaturian 1986: 19–24) offers a multi-feature classification and
on this basis identifies eleven dialectal groups. Armenian is written with its own
script, shown in Table 1. To enhance accessibility, most of the material in this
chapter is provided in a Latin-based transcription.
After a short bibliographical sketch and brief accounts of the problems of the
dialectal archaisms, the classification of the Armenian dialects and the earliest
dialectal isoglosses, we shall turn to the description of a number of dialects from
different dialectal groups.
The Armenian dialects 47
Table 1: The Armenian Alphabet

Arm. scr. Translit. Phonet. Arm. scr. Translit. Phonet.
ա a a մ m m
բ b b յ y j
գ g g ն n n
դ d d շ š ʃ
ե e (j)e ո o (v)o
զ z z չ č‘ t hʃ
է ē ɛ պ p p
ը ə ə ջ ǰ ʤ
թ t‘ tհ ռ ṙ ṙ
ժ ž ʒ ս s s
ի i i վ/ւ v/w v/w
լ l l տ t t
խ x χ ր r r
ծ c ts ց c‘ t hs
կ k k ու u u
հ h h փ p‘ ph
ձ j ʣ ք k‘ kh
ղ ɫ ʁ օ ō ɔ
ճ č tʃ ֆ f f
2. Bibliographical sketch
The foundations of Armenian dialectology were laid by Hrač‘ya Ačaṙyan, whose

Armenian dialectology (1911, cf. 1909), Armenian regional dictionary (1913) and
eleven dialect descriptions form the basic corpus of dialectological data, which are
systematically included, supplemented and evaluated in his fundamental History
of the Armenian language (AčaṙHLPatm 1940, 1951), Complete grammar of the
Armenian language (AčaṙLiak 1955, 1957, 1971), and especially in his crowning
work, the Root dictionary of Armenian (1971–1979).
Besides Ačaṙyan’s works, the following general dialectological research and
handbooks should be mentioned: Patkanov (1869), Yovnanean (1897), Msereanc‘
(1899), Ɫaribyan (1953), A. V. Grigoryan (1957), J̌ahukyan (1972), Simonyan
(1979), Greppin and Khachaturian (1986), G. Mkrtč‘yan (2015). Extensive pho-
nological treatments are given in S. A. Avagyan (1973), Muradyan (1982) and
Vaux (1998), as well as in a series of papers by Weitenberg. General overviews
on some aspects of Armenian dialectology can be found in Weitenberg (2002) and
Hovsep‘yan and Geworgyan (2013). Besides the aforementioned Ačaṙyan (1913),
important dialectological dictionaries are Ačaṙyan (1902), Amatuni (1912) and
HayLezBrbBaṙ 1–7 (2001–2012). The most recent description of all the Arme-
nian dialects with references can be found in Martirosyan (2013). Dialect maps
48 Hrach Martirosyan
can be found in Ačaṙyan (1909, 1911), J̌ahukyan (1972), and Hewsen (2001: 227,
Nr. 218).
3. Dialectal diversity in the Classical period and earlier
The existence of dialectal diversity in the Classical period is much debated. Schol-
ars consider a number of traces of early diversity (Aɫayan 1958; Winter 1966:
205; Kortlandt 1980: 105, 2003: 32; Beekes 2003: 142–143; Clackson 2004–2005:
154). A frequently cited example is dial. *lizu vs. Classical lezu ‘tongue’ (Meillet
1936: 11; Viredaz 2003: 76). The modern dialects preserve important data for the
reconstruction of the oldest history of the language.
All the modern dialects have fully participated in the fixation of the pro-
to-Armenian accent on the (prehistoric) penultimate syllable and the subsequent
apocope (loss of some sounds of the final unaccented syllable). The formation of
the Armenian dialects cannot thus be pushed back beyond the date of apocope. At
a later stage, the accent was retracted back to the penultimate syllable in Eastern
dialects.1 It is certain, however, that Armenian dialect diversity existed in the pre-
historic period (i. e. before the 5th century CE), and the modern dialects have
preserved features which are not present in Classical Armenian.
Issues regarding the origin of the Armenian dialects and their existence in the
classical period, as well as numerous archaic dialectal words and features are dealt
with in several monographs (Ačaṙyan-AčaṙHLPatm1 1940b: 118–122; Ačaṙyan-
AčaṙHLPatm2 1951: 114–141, 324–439; J̌ahukyan 1972, 1985, 1987: 252–256,
274–279; Simonyan 1979) and a number of papers mostly by Armenian scholars.
In these studies, dialectal archaisms are often represented as a preservation of what
has been lost in the corresponding classical forms. This view should be verified
in each individual case. Many deviant features that have been assigned to great
antiquity can, in fact, be easily explained within the framework of recent internal
developments.
The deviant dialectal forms and features are systematically treated in the recent
etymological dictionary of the inherited lexicon, Martirosyan (2010).
1
See Ačaṙyan (1899: 184–186; AčaṙHLPatm2 1951: 371–372); Weitenberg (2002: 148–
152).

The Armenian dialects
Figure 1: Location of Armenian dialects discussed in this chapter

49
4. Classification of Armenian dialects
4.1. The consonant system

The Armenian dialects play an important role in the discussion of two major issues
in Indo-European phonology: glottals and voiced aspirates.
Classical Armenian possesses a three-fold opposition: voiced – voiceless –
voiceless aspirate. Most of the dialects display varying developments of the voiced
and voiceless stops, whereas the voiceless aspirate series is stable everywhere.
The dialects of the Group 6 (T‘iflis, Ardvin, Ararat/Loṙi, Agulis, Kṙzen, etc.)
have retained the system intact. Particularly interesting are the dialects of Group
4 (Sasun, Cilicia, Svedia, etc.) which display the following steps of the so-called
Second/Modern Armenian Sound Shift: voiceless > voiced; voiced > voiceless.
This seems to require an interchange of two features, voiced and voiceless (cf.
Class. Tigran > dial. Dikran), which is impossible from a relative chronological
point of view. Therefore other features have been adduced in the discussion. On
the one hand, the Classical Armenian voiced stops have been interpreted as voiced
aspirates that directly continued the Proto-Indo-European series *bh/*dh/*gh. In
these terms, the original opposition would seem to have been preserved in Group
2 (Karin, Muš, parts of Ararat, etc.). On the other hand, the glottalic character of
the voiceless stops in some dialects has been interpreted as an inherited, Indo-Eu-
ropean, feature. These two issues have been heavily debated, however. Are these
phonological features archaic remnants of Proto-Armenian phonology or they
are recent innovations in Armenian dialects? This basic question still awaits an
answer.2
4.2. The formation of the indicative present

The morphological classification of the Armenian dialects, developed mostly by
Ačaṙyan and Ɫaribyan, is based on the formation of the present indicative. In Clas-
sical Armenian, the present indicative was of a simple synthetic type, e. g. sirem ‘I
love’, mnam ‘I stay’. Armenian gradually used this form to express the subjunctive
and future, while a new present indicative developed in various ways: kə present
formative particle + finite verb (kə sirem, kə sires, etc.), participle [originally
dative-locative case] + copula (sirum em), infinitive + copula (sirel em), infinitive
in locative case + copula (sirelis / sire[li]s em), as well as action noun + copula
(lalman em, cf. CA action nouns in -umn, gen. -man). The development of these
characteristic features dates to around the 11th century. Xotorǰur and Aramo have
preserved the classical present intact.
2
For a discussion and references, see Khachaturian (1983); Pisowicz (1997); Vaux
(1997, 1998: 7–12, 211–241); Weitenberg (2002: 146–151) and Haneyan (2010).
The particles kə/ku, ha and others are also involved in making the present
progressive/continuous forms in western dialects. In the village of Galaduran the
particle ha(y) forms both the simple and continuous presents, and kə is used for
the subjunctive. The particle kə/ku (> gə/gu) is often doubled, e. g. Zeyt‘un gɔ
g-aṙnum ‘I am taking’; verbs starting with a consonant do not distinguish between
the simple and progressive or continuous present (Ačaṙyan 2003: 246–252).
A combination with gɔr (a by-form of the progressive / continuous particle) is
observable in other dialects, such as Polis g-ɛrt‘am gɔr ‘I am going’ (Ačaṙyan
1941: 150–151), Sebastia b‘ɛrɛm gɔr ‘I am bringing’, g-udɛyi gɔr ‘I was eating’
(Ačaṙyan 1911: 227). Some dialects combine kə with other particles: Aslanbek gə
sirim háyɛ ‘I am loving’ (Ačaṙyan 1951: 353), Trapizon g-aṙnim ɛr ‘I was taking’
(Ačaṙyan 1951: 344), Šapin-Garahisar g-əsim dar ‘I am saying’ (Ačaṙyan 1911:
174–175). Sivri-Hisar uses kar, kikɛnan or ɛr/ɛrɛr (Mkrtč‘yan 1995: 212; 2006:
115, 181–184), and Nikomedia uses yɔr (Ačaṙyan 1951: 353).
Hamšen uses the verb unim ‘to have’ as a progressive marker: b‘ɛrim uni (or
b‘ɛrim g-uni, with the same particle gə) ‘I am bringing’ (ClArm. 1sg . prs berem ‘to
bring, carry’), b‘ɛrɛyə g-uni ‘I was carrying’ (ClArm. 1sg .impf berēi); or involves
the infinitive: yɛs ɛguš ä ‘I am coming’ (ClArm. es ‘I’; ek- aor. stem of the supple-
tive verb gam ‘to come’; 3sg copula ē ‘is’), tun ɛguš ä ‘you are coming’ (ClArm.
du sg ‘you’; ek- aor stem of the suppletive verb gam ‘to come’; 3.sg copula ē
‘is’), etc.; or with a subject in the genitive: imə/ims ɛguš ä ‘I am coming’, k‘ugə(d)
ɛguš ä ‘you are coming’, with im ‘my’ and kʻo ‘your’ respectively (Ačaṙyan 1947:
140–141). Note also Arabkir g-ɛrt‘am (ə)nə ‘I am going’ (Ačaṙyan 1911: 216;
1951: 349).
For the material and a discussion on the (progressive) present tense, see espe-
cially G. Geworgyan (1985, 1994, 2010). For a general overview in English, see
Weitenberg (2002: 145–146).
4.3. Dialect classification based on phonology and morphology

The phonological and morphological classifications are jointly demonstrated
in Table 2, which cannot, it must be said, pretend to a high degree of accuracy
and completeness. The main problem here is, of course, that our data concerning
numerous dialects are ambiguous or incomplete.
For the sake of simplicity only the word-initial reflexes are taken into account.
There is a certain discrepancy between various descriptions of the consonant
systems of some dialects, e. g. Svedia, Xarberd and Erznka. It is noteworthy that
Sivri-Hisar and Yozgat‘/Gamirk‘, according to N. Mkrtč‘yan’s data, display k-k-k‘
and therefore belong to the Group 7, that is Van-Ɫarabaɫ, though they are very far
from them, both geographically and linguistically.
In the column “Present indicative” we find the aforementioned particles and
first person singular forms of the following verbs: aɫam ‘to grind’, amačʻem ‘to
be ashamed’, ancʻanem ‘to pass’, aṙnum ‘to take’, asem ‘to say, tell’, arjakem
‘to untie, loosen’, banam ‘to open’, gam ‘to come’, grem ‘to write’, ertʻam ‘to
go’, lam ‘to weep’, lsem ‘to hear, listen’, xaɫam ‘to move, play’, xawsim / xōsim
‘to speak, talk’, xmem ‘to drink’, kardam ‘to shout, call, read’, mnam ‘to remain,
stay’, nstim ‘to sit’, sirem ‘to love’, tam ‘to give’, uze(na)m ‘to will, wish’, utem
‘to eat’.
In the last two columns of the table one can find references to linguistic, eth-
nographical and folklore literature concerning the individual dialect areas. The
dictionaries are given in italics.
4.4. J̌ ahukyan’s multi-feature classification

J̌ ahukyan (1972)3 offers a multi-feature classification and on this basis identifies
the following dialectal groups:
A. Western grouping
I. The Antiok‘ or extreme South-Western group: K‘esab/Svedia, Beylan.
II. (I–III) The Cilicia or South-Western intergroup: Hačən, Maraš/Zeyt‘un.
III. The Asia Minor or Western group: Karin, Šapin-Garahisar, Sebastia, Evdokia,
Marzvan/Amasia, Nor-Naxiǰewan/Crimea, Polis (Constantinople), Sivri-
Hisar, Kyurin, Syolyoz, Malat‘ia, Kesaria, Xarberd/Erznka, Aslanbek, Akn,
Arabkir.
IV. (I–III–VI) The Hamšen or North-Western intergroup: Hamšen, Edesia.
V. (III–VI) The Aṙtial (Transylvania) or extreme North-Western intergroup:
Aṙtial (Transylvania).
VI. The Muš/Tigranakert or South-Central group: Muš, Talvorik/Motkan, Sasun/
Gelieguzan, Tigranakert.
VII. (VI–VIII) The Van or Southern intergroup: Van, Moks, Šatax, Diadin.
B. Eastern grouping
VIII. The Xoy/Maraɫa or South-Eastern group: Xoy, Maraɫa.
IX. The Ararat or North-Eastern group
X. Ararat, J̌ uɫa, Bayazet, Astraxan, Ardvin/T‘iflis.
XI. (IX–XI) The Ɫarabaɫ/Šamaxi or extreme North-Eastern intergroup: Mehtišen,
Ɫarabaɫ, Ɫazax/Kirovabad, Kṙzen, Havarik, Šamaxi, Burdur.
XII. The Agulis/Meɫri group: Agulis, Meɫri.
3
J̌ ahukyan (Djahukian) apud Greppin and Khachaturian (1986: 19–24).

Table 2: Overview of dialect classification
Indo-Eur. *gh *g *k
Classical g(‘) k k‘ Pres. indicative Dial. descr. Ethn. & Folklore
Xarberd/ Gr. 1 g‘/k g k‘ gə sirim, g-uzim Baɫramyan (1960); Andranik (1900); Sargisean (1932);
Dersim Gasparyan (1979), Halaǰyan (1973); Hayk (1959);
Kostandyan (1982) Srapean (1960)
Šapin-Garah. Gr. 1 g‘ g k‘ gə sirɛm, H. S. Xač‘atryan (1985) Tēōvlēt‘ean (1954)
g-ɛrt‘am
Arabkir Gr. 1 g‘ g k‘ sirim gu, Dawit‘-Bēk (1919) Baxtikean (1934);
g-ɛrt‘am ga, P‘olatean (1969)
gɔ g-ɛrt‘am
Sebastia Gr. 1 g‘ g k‘ sirɛm gə, Gabikean (1952) Gabikean (1952: 591–682)
g-ɛrt‘am gə
Akn Gr. 1 g‘ g k‘ gü sirim, g-üdim Maxudianz (1911); Čanikean (1895); Azatean (1943);
Mak‘sudeanc‘ (1910); K‘ēč‘ean & Parsamean (1952)
Gabriēlean (1912)
Aṙtial Gr. 1 g‘ g k‘ gi sirim Hanusz (1887–1889);
Ačaṙyan (1953)
Crimea / Gr. 1 g‘ g k‘ k‘(i)-sirim, gə Ačaṙyan (1925); Patkanov (1875.1);
Nor- lsim J̌ alašyan (2012) P‘ork‘šeyan (1971)
Naxiǰewan
Hamšen Gr. 1/4 g‘/k g k‘ sirim gu, g-udim Ačaṙyan (1947); Bläsing Muradean (1901); T‘oṙlak‘yan (1981,
(1992, 1995, 2007); 1986); Gurunyan (1991); Simonian
Vaux (2000–2001, (2007); JaynHamš 1–4; HamšHamš
2007) (2007); Vardanyan (2009);

Hovannisian (2009)
53
54
Indo-Eur. *gh *g *k
Xotorǰur Gr. 1/2 g‘ k/g k‘ sirim, mənam; Kostandyan (1985); Gawaṙac‘i (1903); Hačean (1907);
k-asin Baɫramyan (1976) YušamXotorǰ (1964)
Erznka Gr. 2/7 g‘/k k k‘ k/gə sirim, Kostandyan (1979) Tēr-Vardanean (1968)
k-udam
Hrach Martirosyan
Karin Gr. 2 g‘ k k‘ sirɛm k/gə, Tomson (1887); H. Lalayean (1892, 1897, 1983);
k-udem k/gə M. Mkrtč‘yan (1952); Mxit‘areanc‘(1901); Malxasyanc‘
Hakobyan (1974: (1958); Hakobyan (1974)
409–437)
Muš Gr. 2 g‘ k k‘ kə sirim, k-aɫam Patkanov (1875.2); Sedrakean (1874); Lalayean (1917a);
Msereanc‘ Melik‘ean (1964); Movsisyan (1972);
(1897–1901);Baɫdasaryan Hovannisian (2001)
T‘ap‘alc‘yan (1958);
Tarōnean (1961)
Alaškert Gr. 2 g‘ k k‘ kə sirim, k-aɫam Madat‘yan (1985) Yovsēp‘eanc‘(1892: 47); Nždehean
(1899–1910)
Ararat Gr. 2 g‘ k k‘ sirə/um ɛm Nawasardeanc‘ (1903);
Kapancjan (1975);
Markosyan (1989)
Nor J̌ uɫa Gr. 2 g‘ k k‘ sirum am, Ačaṙyan (1940a) Eremean (1923, 1930)
moṙanum am;
lalman am,
talman am
Polis Gr. 3 g g k‘ gə sirɛm, Ačaṙyan (1941) Svazlyan (2000)
g-ɛrt‘am

Indo-Eur. *gh *g *k
Trapizon Gr. 3 g g k‘ sirim gu, g-uzim Ačaṙyan (1911: 178) Yovakimean (1967); T‘oṙlak‘y.
(1986); Hovannisian (2009)
Evdokia Gr. 3 g g k‘ gə sirim, Gazančean (1899) Alpōyačean (1952)
g-ɛrt‘am
Marzvan Gr. 3 g g k‘ gə sirim, gə T‘umayean (1930)
banam
Amasia Gr. 3 g g k‘ (nstim ga, Ačaṙyan (1951: 350)
g-ɛrt‘am ga)
[3sg karda-v!]
Kesaria Gr. 3 g g k‘ ga sirɛm, ga Ant‘osyan (1961) Grigorean & Garakēōzean (1963)
g-aṙnum
St‘anoz Gr. 3 g g k‘ g-ɛrt‘am kar N. A. Mkrtč‘yan (2006: Ōtean-Gasbarean (1968)
202–222, 293–294)
Aynt‘ap Gr. 3 g g k‘ – Vaux (1999–2000) Sarafean (1953)
Sasun Gr. 4 k g k‘ gə sirəm, Petoyan (1954) K‘alant‘ar (1895); Tomaschek

g-ɛrt‘am (1896); Karapetyan (1962); Petoyan
(1965); R. Xač‘atryan (1999)
Zeyt‘un Gr. 4 k/g‘ g k‘ gɔ siyim, Mak‘sudeanc‘ (1911), Allahvērtean (1884); Galustean
gɔ-’yt‘ɔm Ačaṙyan (2003) (1934); Guyumčean (1990)
Hačən Gr. 4 k/g‘ g k‘ gə siyiɛm, Ačaṙyan (2003); Pōɫosean (1942)
gə g-aṙnum Gasparyan (1966)
55
56
Indo-Eur. *gh *g *k
Svedia Gr. 4 k/g‘ g k‘ gə sirim, Andreasyan (1967); Svazlyan (1984, 1994);
gy-äṙnim Ačaṙyan (2003); Gyozalyan (2001)
Hananyan (1995)
Aramo Gr. 4 k g k‘ (hay) sirɛym, Ayceamn (1907),
(hay) urt‘ɔum Ɫaribyan (1958: 9–77)
Hrach Martirosyan
K‘abusie Gr. 4 k g k‘ gu sirim, Ɫaribyan (1958: 78)

gɛu urt‘um
K‘esab Gr. 4 k g k‘ ha sirim, Ɫaribyan (1955);
hay urt‘um Č‘olak‘ean (1986)
Beylan Gr. 4 k g k‘ gä sirim, gä Ɫaribyan (1955)
gartɔm
Malat‘ia Gr. 5 k‘ g k‘ sirim a, g-ɛrt‘am Danielyan (1967) Alpoyačean (1961)
a
Nikomedia/ Gr. 5 k‘/g‘ g k‘ gə sirim, g-iãm Ačaṙyan (1898a, Tēr-Yakobean (1960)
Aslanbek (contin. + hayɛ) 1898b); Vaux (2001)
Edesia/ Gr. 5 k‘ g k‘ sirɛm a / gə Ɫaribyan (1958: 146);
Uṙha sirɛm Haneyan (1982)
g-ɛrt‘am a
Tigranakert Gr. 5 k‘ g k‘ gə sirim, Haneyan (1978)
g-uzim
T‘iflis Gr. 6 g k k‘ sirum im, Petermann (1867b); Tēr-Aɫek‘sandrean (1885);
ku sirim Tomson (1890) Aɫababean (1901)

Indo-Eur. *gh *g *k
Ararat/ Gr. 6 g k k‘ sirəm ɛm, Asatryan (1968) T. Geworgyan (1999); Lalayean
Loṙi mnəm əm (1903, 2004)
Agulis Gr. 6 g k k‘ sáyräm əm, Petermann (1867a); Ayvazyan (1984)
n-ɔ́tam əm Sargseanc‘ (1883);
xsɛ́lis əm, Ačaṙyan (1935);
C‘ɫna təsnäs əm Zak‘aryan (2008)
Areš/ Gr. 6 g k k‘ 1sg sirɛlis əm, Tēr-Pōɫosean (1921–
Havarik sirɛli yəm 1922); Lusenc‘ (1982)
2sg sirɛləm əs
3sg sirɛlim i
Kṙzen Gr. 6 g k k‘ aɫas əm vs. Baɫramyan (1961)
əm aɫali
Ardvin Gr. 6 g k k‘ sirɛlis im, Ačaṙyan (1911:
mnalis im 291–292; 1926: 267);
Ɫaribyan (1939:
33–49; 1953: 281–292,
329–342), Alaverdyan
(1968ab).
Ɫaradaɫ Gr. 6 g k k‘ sirilis əm, Ɫaribyan (1953: 281– H. Hovsep‘yan (2009, 1–2)
mnalis əm 304)
D(i)zmar Gr. 6 g k k‘ siriyis əm, Ɫaribyan (1953: 281– H. Hovsep‘yan (2009, 1: 60)
mnayis əm 292)
Meɫri Gr. 6 g k k‘ siris im, mənas Ɫaribyan (1939:
im 83–106); Aɫayan (1954)

57
58
Indo-Eur. *gh *g *k
Karčewan Gr. 6 g k k‘ siríy im / im sirís Ɫaribyan (1939: 59–82);
Muradyan (1960)
Kak‘avaberd Gr. 6 g/k k k‘ xəmúm əm, Muradyan (1967)
ásəm əm;
A. mənáyis im
Hrach Martirosyan
B. mənális im
Astraxan Gr. 7 k k k‘ asəm ɛm, lsəm Ačaṙyan (1911: 82–86); G. G. Geworgyan (1980); Ɫaziyan &
ɛm Ačaṙyan (1951: 333) Vardanyan (2004)
Goris/ Gr. 7 k k k‘ sirum ɛm, Vartapetjan (1962); Lisic‘yan (1969)
Syunik‘ xəmum ɛm Margaryan (1975)
Ɫarabaɫ Gr. 7 k k k‘ sírəm əm, Ačaṙyan (1899); Barxudareanc‘(1883);
xɔ́səm əm Davt‘yan (1966); Barxutareanc‘(1883); Melik‘-
Sargsyan (2013) Šahnazareanc‘(1907 f); Lisic‘yan
(1981); Ɫaziyan (1983); Lalayean
(1897a, 1988); Grigoryan-
Spandaryan (1971); L. Harut‘yunyan
(1991)
Tavuš Gr. 7 k k k‘ sirəm ɛm, Mežunc‘ (1989) Xemč‘yan (2000, 2008)
mnəm ɛm
Burdur Gr. 7 k k k‘ sirum əm, Mkrtč‘yan (1971, 2006:
mənum əm 304)
Šamaxi Gr. 7 k k k‘ A. siräm äm, Baɫramyan (1964) Ɫaziyan & Vardanyan (2004)
mənam am;
B. kyiris ɛm,
mənays ɛm

Indo-Eur. *gh *g *k
y
Šaɫax Gr. 7 k k k‘ k əriys əm, Ɫaribyan (1939: 50–58);
mənayis əm Davt‘yan (1966)
Hadrut‘ Gr. 7 k k k‘ siris əm, mənas Ɫaribyan (1939: 107–
əm 127); Poɫosyan (1965);
Davt‘yan (1966)
Urmia Gr. 7 k k k‘ kyirɛs ɛm/s, 3sg Ɫaribyan (1939: 128–
kyirɛl i; 146; 1953: 342–358);
mənas ɛm/s, 3sg Asatryan (1962)
mənal i
Xoy Gr. 7 k k k‘ üzɛl ɛm, ɛt‘äl ɛm Asatryan (1962)
Maraɫa Gr. 7 k k k‘ üzeli im, Ačaṙyan (1926)
amč‘ənalǝ́ yim
Hin J̌ uɫa Gr. 7 k k k‘ talman am Ačaṙyan (1940a: 27–32)
Van Gr. 7 k k k‘ kə sirem, Ačaṙyan (1952); Sedrakean (1874); Tēr-Sargsencʻ
k-uzem, Tʻuršyan (2018) (1875); Šērenc‘ (1885–1999);
ku kyäm Lalayean (1910, 1914, 1915,
1917b); Sruanjteanc‘ (1978); S.
M. Avagyan (1978); Hovannisian
(2000)
Moks, Šatax Gr. 7 k k k‘ kə sirim, M. H. Muradyan (1962, Yovsēp‘eanc‘ (1892); Šahpazean
kə xaɫam, 1982); Orbeli (2002) (1913); Lalayean 2 (1914)
k-ənc‘nim
59
60
Indo-Eur. *gh *g *k
Ozim Gr. 7/2 k/g‘ k k‘ kə kaṙtəäm, N. E. Hovsep‘yan
k-arc‘kim (1966);
Arewikyan (1967)
Sivri-Hisar Gr. 7 k k k‘ (cont.) k-ɛt‘am N. A. Mkrtč‘yan (1995, PtmSivHisHay (1965)
ɛr / kar, 2006)
Hrach Martirosyan
xɔsim kɛnam
Yozgat‘ / Gr. 7 k k k‘ sirim ka N. A. Mkrtč‘yan (2006) T‘emurčyan (1970)
Gamirk‘
5. Earliest dialectal isoglosses
5.1. Weitenberg’s isoglosses

In a series of articles Weitenberg (1986, 1992, 1993, 1996, 2001) discussed a rel-
atively homogenous set of dialectal isoglosses within a chronological framework
starting from pre-literary times, all of which are anterior to the sound shift h > x
in the Van-Urmia group and the devoicing (b, d, g > p, t, k) which are dated to
approximately the 7th century or later:
● Retraction of the accent to the penultima.
● Diphthongization of stressed o (and perhaps e) in initial position; all dialects
diphthongize any o in absolute anlaut, whereas the dialects having accent
retraction (Ɫarabaɫ, etc.) and the dialects of the Van-Urmia region do so only
in monosyllables (these are marked in Table 3 “no”).
● Monophthongization of aw; if aw (under penultimate accent) is followed by a
dental stop or affricate, the w yields x, e. g. eawt‘n ‘seven’ > Meɫri ɔ́xtə (these
are marked in Table 2 “yes”).
● Ačaṙyan’s Law (fronting of back vowels after voiced obstruents).
● Devoicing b, d, g > p, t, k (7th century or later).
In Weitenberg (1996: 111) these isoglosses are combined in a table in relative

chronological order, from left to right. In addition, the isogloss of present verbs in
-nul vs. ClArm. -anem is represented by the example of tesanem ‘to see’.
5.2. Miscellaneous
There are more of such isoglosses, e. g. the additional -n mostly represented
in the dialects of the old Arc‘ax-Siwnik‘ area, which is a feature originating in
the pre-literary period (H. D. Muradyan 1982: 322–326; Weitenberg 1985) and
monophthongization of the final stressed -ay, which can be traced back to the fifth
century (Weitenberg 1999–2000).
5.3. Extending Weitenbergʼs classification

In Table 3, I present a revised and supplemented version of Weitenberg’s afore-
mentioned table in combination with data from my Table 2. The archaic set of
isoglosses demonstrates an opposition between the South-Eastern periphery
(Ɫarabaɫ/Agulis area) on the one hand, and the Central and Western regions on the
other. As Weitenberg (1996: 112) points out, the Van-Urmia intermediate area may
originally have formed part of the South-Eastern area. Note also the sharp contrast
between Muš and Ɫarabaɫ.
Table 3: Extended classification based on Weitenberg
62
Dialect Phonological Present Accent Diphthong o- ɔxt ‘7’ Ačaṙ. Law Devoicing tesanel
classification indicative retraction
Aṙtial Gr. 1 gi sirim no no yes no no -nul
Arabkir Gr. 1 sirim gu, no no no no no -nal
g-ɛrt‘am ga,
gɔ g-ɛrt‘am
Hrach Martirosyan
Erznka Gr. 1/4 k/gə sirim, no no yes no yes -nal

k-udam
Hamšen Gr. 1/4 sirim gu, no no yes no yes -nul
g-udim
Karin Gr. 2 sirɛm k/gə, no no yes no no -nil
k-udem k/gə
Muš Gr. 2 kə sirim, no no no no no -nul
k-aɫam
Ararat Gr. 2 sirə/um ɛm (yes) (yes) yes no no (-nul)
Nor J̌uɫa Gr. 2 sirum am, no no yes no no -nel
moṙanum am;
lalman am,
talman am
Polis Gr. 3 gə sirɛm, no ABSENT yes no no -nal
g-ɛrt‘am
Hačən Gr. 4 gə siyiɛm, gə no no no no yes -nul
g-aṙnum

Svedia Gr. 4 gə sirim, no no no yes? yes -nul
gy-äṙnim
Edesia/ Gr. 5 sirɛm a / gə no no no no yes -nal

Uṙha sirɛm
g-ɛrt‘am a
Tigranakert Gr. 5 gə sirim, no no no no yes -nal
g-uzim
Ararat/ Gr. 6 sirəm ɛm, yes yes yes no no -nal

Loṙi mnəm əm
Agulis Gr. 6 sáyräm əm, yes (yes) yes yes no -nil

n-ɔ́tam əm
Areš/ Gr. 6 1sg sirɛlis əm, no no yes yes no -nel /

Havarik sirɛli yəm -nal
2sg sirɛləm əs
Meɫri Gr. 6 siris im, yes (yes) yes yes no -nil
mənas im
Karčewan Gr. 6 siríy im / im yes yes yes yes no -(n)ul

sirís
63
64
Kak‘avaberd Gr. 6 xəmúm əm, yes yes yes yes no/yes -(n)il
ásəm əm;
mənáyis im,
-lis im
Ɫarabaɫ Gr. 7 sírəm əm, yes yes yes yes yes -nal
Hrach Martirosyan
xɔ́səm əm
Urmia Gr. 7 kyirɛs ɛm/s, no yes yes yes yes -nel
3sg kyirɛl i;
mənas ɛm/s,
3sg mənal i
Maraɫa Gr. 7 üzeli im, no yes yes yes yes -nel
amč‘ənalǝ́ yim
Van Gr. 7 kə sirem, no yes no yes yes -nal
k-uzem,
ku kyäm
Moks, Šatax Gr. 7 kə sirim, kə no yes no yes yes -nel /
xaɫam, -nil
k-ənc‘nim
6. Dialectal groups
The Armenian dialects are presented in eleven groups in accordance with J̌ ahuk-
yan’s multi-feature classification. The dialects of the Eastern grouping (groups
VIII-XI) have developed and are still spoken in the Republic of Armenia, Arcʻax
(Karabagh / Ɫarabaɫ), as well as Parskahayk‘/Persian Armenia and surroundings
(North West Iran). By contrast, the western dialects (groups I–VII) which had
developed on the territory of the western part of the historical Armenia (that is,
the eastern half of present-day Turkey) for many centuries, with a few exceptions,
are no longer spoken in their original location. This is because of Turkish constant
genocidal policy and especially the Armenian Genocide starting in 1915 which
resulted in extermination of more than one million Armenians and constituted an
amputation of Armenian life, and therefore also an amputation of Armenian dia-
lects. Some surviving groups of the Armenian population were dispersed around
the world. Their descendants have partially preserved their dialects in the Republic
of Armenia, J̌ avaxk‘ (now in Georgia), Russia and other countries.
For these reasons our information on the western dialects is uneven. Many of
them are now almost or completely extinct. Some of them have been described
before the Genocide, others are known by some secondary materials recorded
from the refugees, and for the rest we practically have no information.4
In this survey of the dialects I shall concentrate on the most characteristic fea-
tures of a number of dialects from different groups. When discussing them, I often
present comparative data from dialects of other groups. The data concerning the
phonological and morphological classifications and the aforementioned earliest
isoglosses can be found in Tables 2 and 3 and will be specified here only when
necessary.
A. Western grouping
6.1. Svedia (the Antiok‘ or extreme South-Western group)
Armenians have been living in the northern parts of Syria and Mesopotamia pre-
sumably since the time of Tigran the Great (1st century BCE). There is informa-
tion on migration of some Armenian population to Syria in the 6th century CE.
The Armenian presence in these areas became more significant especially since
the 10th century. Here the Armenians called themselves k‘ist‘inə / k‘isdinɛ, which
reflects the word k‘ristoneay ‘Christian’, and the dialect, k‘ist‘inəg / k‘isdinig /
k‘ist‘inüg ‘language of a Christian, Armenian’.
The Armenian dialects of Syria and Cilicia are very divergent and extremely
interesting. Svedia is the best studied one among them. It was spoken in the Arme-
nian villages of the Svedia district, which is located at the foot of the mountain
4
For a general outline on these issues and the linguistic situation in the Republic of
Armenia, see Weitenberg (2002: 141–143 and 2006).
Musaleṙ/Musa-Dagh. There are three monographs on the dialect of Svedia: Ačaṙyan

(2003), Andreasyan (1967) and Hananyan (1995), based on the Svedian subdia-
lects of Haǰi-Habibli, Yoɫun-Ōluk and Xtrbek, respectively. K‘esab is described in
Č‘olak‘ean (1986). The other (sub)dialects have been poorly studied, mainly by
Ɫaribyan.
Phonology
According to Ačaṙyan (2003: 401), Classical Armenian voiced stops are reflected
in Svedia as voiced aspirates b‘ g‘ d‘ in initial and intervocalic positions and after
ɫ and r. In the descriptions of other subdialects, Andreasyan (1967) and Hananyan
(1995), b d g yielded p t k. Table 4 provides a simplified table of the development
of CA plosives into the dialect of Svedia.
Table 4: Reflex of CA plosives in Svedia

CA Svedia
initial internal final
b/g/d b‘/g‘/d‘ b‘/g‘/d‘, (N)b/g/d b‘/g‘/d‘, (N)b/g/d
p/k/t b/g/d b/g/d b/g/d
p‘/k‘/t‘ p‘/k‘/t‘ p‘/k‘/t‘ p‘/k‘/t‘
One of the most characteristic phonological features of Svedia is the change a > u
in a stressed position, where Beylan and the Cilician dialects have ɔ. As Ačaṙyan
(1951: 364–378, 2003: 349, cf. 10–11) points out, the development a > ɔ > u
logically ends in Syria, which marks the last habitat in the extreme SW corner of
Armenian-speaking areas.
In Agulis, an eastern dialect that is very far from Svedia both geographically
and linguistically, we find a similar change only in monosyllables. In Hamšen and
Akn we find a > ɔ only before a nasal. Table 5 provides examples from these dia-
lects, with the addition of some dialects that do not display a > ɔ, namely Ararat
(an eastern dialect) and Moks/Šatax (a western dialect).
Table 5: Reflexes of CA /a/ in selected dialects

CA gloss Ararat Moks/Šat. Agulis Hamšen Zeyt‘un Svedia
aɫ salt aɫ aɫ aɫ aɫ ɔɫ üɫ
arǰ bear aṙč‘ aṙ/rč ɔṙǰ ayč‘ äṙǰ‘ uṙǰ‘
art cornfield aṙt/d aṙt ɔṙt ayd ɔyd [uɔrt]5
bak courtyard bag päk bɔk – b‘ɔg b‘üg
ban thing b‘an pän bün pɔn b‘ɔn b‘un
bac‘ open b‘ac‘ päc‘ böc‘ pac‘ b‘ɔc‘ b‘uc‘
lar cord lar lar – lar lɔy lur
caṙ tree caṙ caṙ cɔṙ jaṙ jɔṙ juṙ
hac‘ bread hac‘ xac‘ hɔc‘ hac‘ hɔc‘ huc‘
maz hair maz maz mɔz maz mɔz muz
tanj pear tanj tanj tunj dɔnj dɔnj dunj
tasn ten tassə tas tɔ́ssə dasə dɔsə dussə
k‘ar stone k‘ar k‘yar k‘ɔr k‘ar k‘ɔy k‘ur
agṙaw crow akṙav [ak‘ṙav] 6
ákṙav – ag‘ṙɔv äg‘ṙuv
amaṙn summer amaṙ amaṙ ámaṙ(nə) ɔmaṙ amɔṙ amuṙ
ardar righteous art‘ar artär áṙt‘ar art‘/tar ayd‘ɔy ard‘uṙ
arcat‘ silver aṙcat‘ aṙcat‘ áṙcat‘ ayjat‘ ayjɔt‘ aṙjut‘
danak knife d‘anak tänak dä́näk tɔnag d‘anɔg d‘änüg
mangaɫ sickle mangaɫ mang äɫ y
mang äɫ y
mɔngaɫ mangɔɫ mängüɫ
paṙaw old woman paṙav paṙav páṙav baṙav baṙɔb baṙuv
gawazan stick g‘avazan k äväzan
y
– kavazɔn g‘avzɔn g‘yäväzun
hnazand obedient hnazand xnazänd hnázand – hnazɔnd‘ hnazund‘
patuhas punishment – patuxas – – baduhɔs bädihus
5
This is the K‘esab form (Č‘olak‘ean 1986: 197a). The word is not recorded in Svedia
itself.
6
Cf. HAB [Hayerēn armatakan baṙaran] vol. 1: 80a.
Nominal morphology
An important morphological peculiarity is the instrumental marker -um in Svedia,
which is not found in other Armenian (sub)dialects of Syria. K‘esab has -uɔn
(Č‘olak‘ean 1986: 68); K‘abusie has -u for inanimate objects and /genitive + hid
‘with’/ for people (Ɫaribyan 1958: 96).
For K‘abusie, Ɫaribyan (1958: 78–79, 83, 96–99) records a remarkable con-
trast: nominative-accusative indefinite murt‘ vs. definite maurt‘, cf. äs maurt‘ ‘this
man’; the preposition z- in the accusative, as in CA, is found in a definite context.
Note also the preposition i / y- in an allative usage: is i dün g-ürt‘um ‘I am going
home’; y-ambɔur g-ürt‘um ‘I am going to the barn’.
Table 6 provides the paradigms of hac‘, i-stem ‘bread’, and mard, o-stem ‘man,
human’, in Svedia, Zeyt‘un, (Ačaṙyan 2003: 165, 184, 460–461) and K‘abusie
(Ɫaribyan 1958: 98). In the following table n = nominative, acc = accusative, gd
= gentive/dative, abl = ablative, i = instrumental:
Table 6: Nominal paradigms in selected dialects

CA Zeyt‘un Svedia (indef. and def.)
sg n hac‘ hɔc‘ huc‘ huc‘ə
acc (z)hac‘ hɔc‘ huc‘ huc‘ə
gd hac‘-i hac‘-ǝ́ häc‘c‘-ə häc‘c‘ɛn
abl i hac‘-ē hac‘-e häc‘c‘-ɛ häc‘c‘in
i hac‘-i-w hac‘-öv häc‘c‘-um häc‘c‘umə
pl n hac‘-k‘ hac‘-iy häc‘c‘-ir häc‘c‘irə

acc (z)hac‘-s hac‘-iy häc‘c‘-ir häc‘c‘irə
gd hac‘-i-c‘ hac‘-iy-u häc‘c‘-irə häc‘c‘irɛn
abl i hac‘-i-c‘ hac‘-iy-e häc‘c‘-irɛ häc‘c‘irin
i hac‘-i-w-k‘ hac‘-iy-öv häc‘c‘irum häc‘c‘irum
CA Zeyt‘un K‘abusie Svedia

sg n mard mɔyd‘ murt‘ muṙd‘
acc (z)mard mɔyd‘ z-maurt‘ muṙd‘
gd mard-oy mayd‘-o mart‘-ɔn maṙd‘-ə
abl i mard-oy mayd‘-e G + irmɛyn maṙd‘-ɛ
i mard-o-v mayd‘-öv G + hɛyd maṙd‘-um
Verbal morphology
Table 7 shows the present and imperfect paradigms of sirem ‘to love’ in the
Haǰi-Habibli subdialect (Ačaṙyan 2003: 482–490) and grem ‘to write’ in the Xtrbek
subdialect (Hananyan 1995: 125–126).
Table 7: Present and imperfect paradigms, sirem ‘to loveʼ & grem ‘to write’ in Svedia
(Haǰi-Habibli subdialect)
Present indicative Imperfect
CA Svedia 1 Svedia 2 CA Svedia 1 Svedia 2
sg sirem, grem gə sirim gɛu kərim sirēi, grēi gə sirɛr ɛ gɛu kərōr
sires, gres gə siris gɛu kəris sirēir, grēir gə sirɛr gɛu kərɛr
sirē, grē gə sirɛ gɛu kəri sirēr, grēr gə sirir gɛu kərir
pl siremk‘, gremkʻ gə sirinkʻ gɛu kərənk sirēak‘, grēak‘ gə sirɛr inkʻ gɛu kəräyrənk
sirēk‘, grēkʻ gə sirikʻ gɛu kərək sirēik‘, grēik‘ gə sirɛr ikʻ gɛu kəräyrək
siren, gren gə sirin gɛu kərin sirēin, grēin gə sirɛr in gɛu kəräyrən
According to Ɫaribyan (1953: 444–445; 1955: 196, 201–202) and Č‘olak‘ean

(1986: 122), in K‘esab the present indicative is formed with the particle ha or
hay. In the village of Galaduran this particle forms both the simple and continuous
presents. The particle kə is used for the subjunctive. The imperfect is composed of
a participle in -er and the copula. The latter is dropped in the third person singular
form. Table 8 gives the paradigms of kardam ‘to shout, read’ and grem ‘to write’.
Table 8: Present and imperfect verbs in the Galaduran dialect

Present
CA K‘esab CA K‘esab
sg kardam ha gartum grem ha krim
kardas ha gartuɛs gres ha kris
karday ha gartu grē ha kri
pl kardamk‘ ha gartunk‘ gremk‘ ha krink‘
kardayk‘ ha gartäk‘ grēk‘ ha krik‘
kardan ha gartun gren ha krin
Imperfect
sg kardayi ha garter ɛm grēi ha krer ɛm
kardayir ha garter ɛs grēir ha krer ɛs
kardayr ha garter grēr ha krer
pl kardayak‘ ha garter ɛnk‘ grēak‘ ha krer ɛnk‘
kardayik‘ ha garter ɛk‘y grēik‘ ha krer ɛk‘y
kardayin ha garter ɛn grēin ha krer ɛn
Unlike the neighbouring K‘esab, the Beylan dialect forms the indicative present
and imperfect with the particle gä and therefore belongs to the kə-group. The
imperfect is also marked by a postposed particle di throughout the paradigm
excluding the third- singular form.7 Table 9 provides the paradigms of kardam ‘to
shout, read’ and grem ‘to write’.
Table 9: Present and imperfect forms in the Beylan dialect

Present
CA Beylan CA Beylan CA Beylan
sg kardam gä gartɔm grem gä kərim tesanem gä disnum
kardas gä gartɔs gres gä kəris tesanes gä disnus
karday gä gartɔ grē gä kərɛ tesanē gä disnu
pl kardamk‘ gä gartɔnk‘ gremk‘ gä kərink‘y tesanemk‘ gä disnunk‘
kardayk‘ gä gartɛk‘y grēk‘ gä kərik‘y tesanēk‘ gä disnuk‘
kardan gä gartɔn gren gä kərin tesanen gä disnun
Imperfect
sg kardayi gä garti di grēi gä kəri di tesanēi gä disni di
kardayir gä gartɛr di grēir gä kərir di tesanēir gä disnir di
kardayr gä gartɛr grēr gä kərɛr tesanēr gä disnɛr
pl kardayak‘ gä gartink‘ di grēak‘ gä kərink‘ di tesanēak‘ gä disnink‘ di
kardayik‘ gä gartik‘ di grēik‘ gä kərik‘ di tesanēik‘ gä disnik‘ di
kardayin gä gartin di grēin gä kərin di tesanēin gä disnin di
It is easy to notice that gartɔm etc. reflect CA present forms, kardam etc. The
stressed a turns to ɔ, e. g. amaṙn ‘summer’ > amɔṙ, ardar ‘righteous’ > artɔr,
beran ‘mouth’ > pərɔn, caṙ ‘tree’ > jɔṙ. Without the particle gä, the forms are used
for the subjunctive, thus: Beylan gartɔm ‘I may read’ from CA kardam ‘I read,
I am reading’. The conditional and debitive are formed with gə8 and bə / bədə
respectively: gə gartɔm ‘I will read’ and bə / bədə gartɔm ‘I have to read’. Table
10 combines the aorist paradigm of the verb tam ‘to give’ in Middle Armenian
(1901: 333, 2002: 315–316; Ant‘osyan 1975: 213; L. S. Hovsep‘yan 1997: 68–69),
Modern West and East Armenian (Ant‘osyan 1975: 214), Svedia (Ačaṙyan 2003:
494, 498), Aramo (Ɫaribyan 1958: 47) and Zeyt‘un (Ačaṙyan 2003: 243).9
7
Compare Hamšen 2nd-person singular imperfect ending -di, e. g. kie-di gu ‘you were
writing’ (on this see Vaux 2007: 268).
8
Łaribyan (1955: 229) explicitly distinguishes between gä and gə.
9
In the table (and henceforth) CA = Classical Armenian, SWA = Standard Western Arme-
nian.
Table 10: Aorist paradigms in selected dialects

CA Middle SWA Aramo Proto.- Svedia Proto- Zeyt‘un Proto-
Arm. Aramo Sved. Zeyt‘un
sg etu tu-i tu-i ədva *etu-i dva *tu-i dɐvɐ *tu-i
etur tu-ir tu-ir ədvɛy *etu-e(r) dvir *tu-er dɐvɐy *tu-ir
et etu-r, tu-aw ida *etu i-dör *e-tu-r dɔvɔv *tu-aw
eret,
tu-aw
pl tuak‘ tuak‘ tu-ink‘ ədvunk‘ *etua(n)k‘ dvunk‘ *tu-ank‘ dɔvɔnk‘ *tu-ank‘
etuk‘ tuik‘ tu-ik‘ ədväk‘ *etu-ik‘ dväk‘ *tu-ik‘ dɐvɐk‘ *tu-ik‘
etun tuin tu-in ədväyn10 *etu-in dvɛn *tu-in dɐvɐn *tu-in
Syntax of basic clauses

(1) murd-ə k-asi
man-def . art ind -say. prs .3sg
‘The man says’
(Ačaṙyan 2003: 546, line 8 from bottom)
(2) murd-ə k-aṙni z-at
man-def . art . nom . sg ind -take. prs .3 sg def . art . acc .-that
pʻara-n
money-def . art
‘The man is taking that money’
(3) hälivür-ə k-aṙni z-at
old_man-def . art .nom . sg ind -take. prs .3 sg def . art . acc -that
murd-ə
man-def . art . acc . sg
‘The old man is taking that man’
(Ačaṙyan 2003: 548, line 11)
(4) Astuc kə-təsna z-lɛyr
god ind -see. prs .3 sg def . art . acc -mountain
‘The God sees the mountain’
(Ačaṙyan 2003: 533)
10
Printed as ənväyn in Łaribyan (1958: 47), probably a misprint.
6.2. Hamšen / Homšec‘ma (the Hamšen or North-Western intergroup)

There are three basic subgroups of the Hamšen population (Vaux 2000–2001: 48;
2007: 257; Bläsing 2003: 7):
• Western Hemşinli, who live in the Turkish province of Rize, speak Turkish,
and are Sunni Muslim. They preserve a huge amount of words and proper
names from the Armenian dialect of Hamšen (Bläsing 1992, 1995). They are
the descendants of Islamized Armenians from Hamšen.
• Eastern Hamšen/Hemşin(li)/Homše(n)c‘ik‘, who live in the vicinity of Hopa
in the province of Ardvin (also elsewhere in Turkey, Central Asia and Europe),
speak Homšec‘ma (often being unaware that it is Armenian), and are Sunni
Muslim.
• Northern Homšenc‘ik‘, who live in Abkhazia and southern Russia (Krasnodar),
speak Homšec‘ma, and are Christian. They are the descendants of non-Islam-
ized Hamšen Armenians who lived in the provinces of Samsun, Ordu, Giresun
and Trabzon.
The most complete description of the dialect is Ačaṙyan (1947). Texts: Haykuni
(1892) and Dumézil (1964, 1965, 1967, 1986–1987). Hamšen/Homšec‘ma is one
of the most divergent Armenian dialects. Due to its isolated position the dialect
preserves a considerable number of archaisms, both morphological and lexical,
and also developed numerous peculiar innovations.
Phonology
The most characteristic phonological feature of Hamšen Armenian is the change a
> o when followed by a nasal consonant: amis > ɔmis ‘month’, ban > pɔn ‘thing’,
etc.; also in Turkish loans, e. g. tavan > t‘avɔn ‘ceiling’ (Ačaṙyan 1947: 11, 22–24;
Ačaṙyan 1971: 404–406; Bläsing 2003: 9–12). See also §6.1 (Svedia).
Nominal morphology
Table 11 gives the paradigms of hac‘, i-stem ‘bread’ and mard, o-stem ‘man,
human’ in Hamšen (Ačaṙyan 1947: 90–91).
Table 11: Nominal paradigms in CA and Hamšen

CA Hamšen11 CA Hamšen
sg N hac‘ hac‘ Pl hac‘-k‘ hac‘-ɛr
Acc (z)hac‘ hac‘ (z)hac‘-s hac‘-ɛr
GD hac‘-i hac‘-i, -ɛc‘ hac‘-i-c‘ hac‘-ɛr-u
Abl i hac‘-ē hac‘-ä, -ən i hac‘-i-c‘ hac‘-ɛr-ä
I hac‘-i-w hac‘-ɔv hac‘-i-w-k‘ hac‘-ɛr-ɔv
sg N mard mašt mard-kʻ mašt-ɛr

Acc (z)mard mašt (z)mard-s mašt-ɛr
GD mard-oy mašt-u mard-ocʻ mašt-ɛr-u
Abl i mard-oy mašt-ä(mä) i mard-ocʻ mašt-ɛr-ä(mä)
I mard-o-v mašt-ɔv mard-o-v-kʻ mašt-ɛr-ɔv
A number of Classical Armenian words that were mainly used in the plural appear
in the majority of dialects with petrification of the nominative plural form. Table
12 provides a few remarkable relics of the old genitive plural in Hamšen, Zeyt‘un12
and Evdokia13 (see also §6.3. Aṙtial).
Table 12: Relics of old genitive plural in selected dialects

gloss Case CA Hamšen Evdokia Zeyt‘un
eye nom . pl ač‘-k‘ ač‘k ač‘k‘ ɛč‘vɛ
gen . pl ač‘-ac‘ & -ic‘ ač‘ič‘(< -ic‘) ač‘ic‘ ɛč‘ɔc‘ (< -ac‘)
wedding nom . pl harsan-i-k‘ haysnik‘ harsnik‘ haysnək‘
gen . pl harsan-e-a-c‘ haysnɛc‘ harsnɛc‘ haysnic‘
beard nom . pl mō/u/iru-k‘ miruk‘ mɔruk‘ muyuk‘
gen . pl mō/u/iru-a-c‘ mirvac‘ mɔrvac‘ muyvic‘
Another remarkable morphological relic is observable in the paradigm of the word

for ‘star’, CA astɫ. The Hamšen dialect has nom.sg astɛxk/k‘ ‘a star’, which is a
frozen form of nom .pl asteɫk‘, and gen . pl astɫɛc‘ ‘of stars’, cf. CA gen .pl asteɫ-
a-c‘. The gen . sg -ɛc‘ is derived from the genitive plural ending -eac‘, e. g. gen . sg
gaɫnɛc‘ from CA kaɫneac‘, the gen .pl form of kaɫni ‘oak’ (Ačaṙyan 1947: 91–94).
11
Abl. hac‘-ä is indefinite, and hac‘-ən is definite. Other forms: hac‘-ɛc‘-ən, hac‘-ɛn.
12
See Ačaṙyan (1947: 98 and 2003: 188–189) and glossaries, also Martirosyan (2010:
17–18, 98–99, 454–455).
13
Gazančean (1899: 81) (the forms here do not seem phonetically very accurate).
Verbal morphology
Hamšen has the following important peculiarities in verbal morphology:
The CA infinitive ending -(V)l has been replaced by -uš (probably of Turkish
origin) in all four conjugations, e. g. berel ‘to bring’ > pɛruš, ert‘al ‘to go’ > ɛštuš,
etc. (Ačaṙyan 1947: 11, 156–158; Ačaṙyan 1965: 46–47).
In Mala and the villages of Trapizon the verb ‘to have’ is used as a progressive
marker: b‘ɛrim uni or g-uni ‘I am bringing’, b‘ɛrɛyə g-uni ‘I was carrying’. Čanik
uses a different pattern involving the infinitive and 3sg copula: yɛs ɛguš ä ‘I am
coming’, tun ɛguš ä ‘you are coming’, etc.; or with a subject in genitive: imə/ims
ɛguš ä ‘I am coming’, k‘ugə(d) ɛguš ä ‘you are coming’, etc. (Ačaṙyan 1947: 11,
140–141).14
The second-person singular imperfect ending -yd(ə) / -di of Hamšen is compa-
rable with -idi (of Turkish origin) seen throughout the whole imperfect paradigm
in Hačən.15
First person plural imperfect: The Classical Armenian first person plural imper-
fect ending is characterized by the presence of a vowel -a-; thus, for example, in
the e-conjugation we have the following set of endings: -ēi (1sg ), -ēir (2sg ), -ēr
(3sg ), -ēak (1pl ), ‘-ēik‘ (2pl ), -ēin (3pl ). In most dialects of Modern Armenian this
-a- has been analogically eliminated, but some dialects have preserved it intact.
In table 13 I present the Classical imperfect paradigm of the verb utem ‘to eat’
and its corresponding tense forms (imperfect or past subjunctive) in the dialects
of Akn (Ačaṙyan 1911: 223, cf. 227 on Sebastia), Hamšen (Ačaṙyan 1947: 136,
139), and South Eastern Armenian. I also include two paradigms from the region
of Dersim, grem ‘to write’ in Xarberd/Erznka, and k‘ašem ‘to pull, drag’ in Č‘mšk-
acag (Baɫramyan 1960: 22, 30); the latter preserves the -a- intact whereas the
former is innovative.
Table 13: Imperfect paradigms in CA and in selected contemporary dialects

CA Hamšen Akn Č‘mškacag Xarb./Erz. SEA
sg utēi g-udɛyə g-üdɛi gə k‘ašɛi gə g‘ərɛi k-utei
utēir g-udɛyd(ə) g-üdɛir gə k‘ašɛir gə g‘ərɛir k-uteir
utēr g-udɛr g-üdɛr gə k‘ašɛr gə g‘ərɛr k-uter
pl utēak‘ g-udayk‘ə g-üdɛank‘ gə k‘ašɛank‘ gə g‘ərɛink‘y k-uteink‘
utēik‘ g-udɛ́yə g-üdɛik‘ gə k‘ašɛik‘ gə g‘ərɛik‘y k-uteik‘
utēin g-udɛ́yə g-üdɛin gə k‘ašɛin gə g‘ərɛin k-utein
The same 1pl -a- is seen in the aorist of Classical Armenian. Here again, some dia-
lects preserve it intact, cf. e. g. the paradigms of xaɫam ‘to play’ and sirem ‘to love’
14
For a discussion, see Vaux (2007: 263).
15
See Ačaṙyan (1959: 568–569); cf. Vaux (2007: 268).
in Hamšen: xaɫac‘i (1sg ), xaɫac‘ir (2sg ), xaɫac‘ (3sg ), xaɫac‘ak‘ (1pl ), xaɫac‘ik‘
(2pl ), xaɫac‘in (3pl ), sirɛc‘i (1sg ), sirɛc‘ir (2sg ), sirɛc‘ (3sg ), sirɛc‘ak‘ (1pl ),
sirɛc‘ik‘ (2pl ), sirɛc‘in (3pl ) (Ačaṙyan 1947: 128, 131). In old monosyllabic aorist
forms of this dialect we find yet another archaism, the augment e-.16 Table 14 gives
the example of banam, aor stem bac‘- ‘to open’ (Ačaṙyan 1947: 135).
Table 14: Aorist paradigm of ‘open’ in CA and Hamšen

CA act. CA med. Hamšen
sg bac‘i bac‘ay pac‘i
bac‘er bac‘ar pac‘ir
e-bac‘ bac‘aw pac‘aw, ɛ-pac‘
pl bac‘ak‘ bac‘ak‘ pac‘ak‘
bac‘ēk‘, -ik‘ bac‘ayk‘ pac‘ik‘
bac‘in bac‘an pac‘in
A note on Hamšen asuš, aor stem ast- ‘to say’
Classical Armenian asem ‘to say, tell, speak’ displays aorist forms based on ast-
in Hamšen. According to Ačaṙyan (1947: 134–135), the -t- is an epenthesis of a
phonetic nature, cf. almas > almast ‘diamond’, etc. This explanation is not entirely
satisfactory because: (1) it is not clear why the -t- is only found in the aorist; and
(2) we expect to see the reflexes of the aoristic c‘-. I, therefore, propose the fol-
lowing solution.
In Classical Armenian the aorist stem of this verb is asac‘. This subtype dis-
plays both non-syncopated and syncopated aorist forms in Hamšen, namely xaɫac‘i
and xaɫc‘i ‘I played’ (see Ačaṙyan 1947: 130–131). It seems, therefore, likely that
Hamšen asti etc. derive from the syncopated forms *asc‘i etc.; the development
*asc‘i > asti is probably due to dissimilation, which may have been triggered or
reinforced by the -(s)t- epenthesis. For the inclusion of the 3sg .aor . med into the
main paradigm, compare the aforementioned pac‘aw alongside ɛ-pac‘.
Table 15 combines the Classical and Hamšen paradigms with that of Šamaxi /
K‘yärk‘yänǰ, an easternmost dialect (on which see Baɫramyan 1964: 166).
16
For a discussion of this Hamšen archaism, see Vaux (2007: 265–266). It is also present
in a neighboring dialect of Xotorǰur (Kostandyan 1985: 60).
Table 15: The verb ‘to say’ in CA, Hamšen, and Šamaxi
CA Syncope Hamšen Šamaxi
sg asac‘i *asc‘i as-t-i asc‘i, assi
asac‘er *asc‘er as-t-ir asc‘ir, assir
asac‘ *as(a)c‘-aw (med.) as-t-av asɛc‘, asɛc
pl asac‘ak‘ *asc‘ak‘ as-t-ak‘ asc‘ink‘, assink‘
asac‘ēk‘, -ik‘ *asc‘ēk‘, -ik‘ as-t-ik‘ asc‘ik‘, assik‘
asac‘in *asc‘in as-t-in asc‘in, assin
Table 16 combines the aorist paradigms of the irregular verb tam ‘to give’ in
Hamšen (Ačaṙyan 1947: 137) and literary forms of Armenian.
Table 16: The irregular verb tam ‘to give’ in Hamšen and literary forms of Armenian
CA Mid. Arm. SWA Hamšen
sg etu tu-i tu-i dvi
etur tu-ir tu-ir dvir
et etu-r, eret, tu-aw tu-aw dvav, ɛrɛd
pl tuak‘ tuak‘ tu-ink‘ dvak‘
etuk‘ tuik‘ tu-ik‘ dvik‘
etun tuin tu-in dvin

(5) hɔxbɛy-s mɛzigi šad sirɛr gu
uncle-my we. acc much love.impf .3 ms particle
‘My uncle loved us very much’
(Ačaṙyan 1947: 194, line 1)
(6) mɛkʻ unikʻ mɛg varǰadun
we.nom have.1 pl . prs one school
‘We have got one school’
(7) mir gʻɛɫacʻi-kʻ gɔv šad bɛhin gu
we.gen villager-nom . pl cow. acc . sg many keep. prs .3 pl particle
‘Our villagers keep many cows’
(Ačaṙyan 1947: 196 line 17–18)
(8) im ɔnun-s Ašot ä
I.gen name-my PN(masc ) cop .3 sg
‘My name is Ašot’
6.3. The Aṙtial (Transylvania) or extreme North-Western intergroup

Aṙtial (Transylvania)
This dialect had four branches: Poland, Hungary, Suč‘ava and Romania. The
dialect descriptions: Hanusz (1887–1889); Ačaṙyan (1953). In Pisowicz (2003),
one finds a collection of Turkish, Arabic and Persian borrowings in Aṙtial.
Phonology
Voiced aspirates: for a discussion, see Pisowicz (1997: 216–217) and Vaux (1997:
233–234).
Nominal morphology
Important archaisms:
CA accusative definite marker z- (Ačaṙyan 1953: 10, 135–136, 156–157);
CA harsaneac‘ (gen .pl of harsani-k‘ ‘wedding’) > gen harsnic‘, loc harsnec‘n
(see also §6.2. Hamšen);
CA dustr, gen dster ‘daughter’ > d‘usdrə, gen d‘sder (Ačaṙyan 1953: 141–
142).
Table 17 gives he paradigms of hac‘, i-stem ‘bread’ and mard, o-stem ‘man,
human’ in Aṙtial (Ačaṙyan 1953: 136–137).
Table 17: Nominal morphology in Aṙtial

CA Aṙtial CA Aṙtial
sg n hac‘ hac‘ pl hac‘-k‘ hac‘-ɛr
acc (z)hac‘ (əz)hac‘ (z)hac‘-s (əz)hac‘-ɛr
gd hac‘-i hac‘-i hac‘-i-c‘ hac‘-ɛr-u
abl i hac‘-ē hac‘-ē i hac‘-i-c‘ hac‘-ɛr-ɛ
i hac‘-i-w hac‘-ɔw hac‘-i-w-k‘ hac‘-ɛr-ɔw
CA Aṙtial
sg n mard mard‘
acc (z)mard (əz)mard‘
gd mard-oy mard‘-u
abl i mard‘-unmɛ,
mard-oy mard‘-əvɛn
i mard- mard‘-ɔw,
o-v mard‘-əvɔw
Verbal morphology
In the table below, I present the paradigms of sirem ‘to love’ in Aṙtial (Ačaṙyan
1953: 160–164) and Polis (Ačaṙyan 1941: 149–151). The 2sg . impf ending -r is
replaced by -s by analogy with the present. This did not take place in the Polish
subdialect:
Table 18: Verbal morphology in Aṙtial (‘love’)

Present Imperfect
CA Aṙtial Polis CA Aṙtial Polis
sg sirem gi sirim gə sirɛm sirēi gi sirɛi gə sirɛyi
sires gi siris gə sirɛs sirēir gi sirɛis gə sirɛyir
sirē gi sirɛ gə sirɛ sirēr gi sirɛr gə sirɛr
pl siremk‘ gi sirink‘ gə sirɛnk‘ sirēak‘ gi sirɛink‘ gə sirɛyink‘
sirēk‘ gi sirik‘ gə sirɛk‘ sirēik‘ gi sirɛik‘ gə sirɛyik‘
siren gi sirin gə sirɛn sirēin gi sirɛin gə sirɛyin
The aorist (sirec‘i ‘I loved’) has been replaced by the pattern [past participle +
copula], siril im.
(9) B‘axd‘-ə dun gu-g‘a jɛw

husband -def . art . home particle –come.prs .3 sg - and
gnganə hɛd adɛs gə zuruc‘ɛ
wife.gen . dat . def . art . with like.that particle talk. prs .3 sg
‘The husband is coming home and talks to (his) wife like this’ (gu-g‘a < gə/
u-particle + ig‘al ‘to come’; gnganə < gnig ‘wife’; gə zuruc‘ɛ < gə-particle
+ zurucʻel ‘to talk’)
(Ačaṙyan 1953: 223, line 9–10 from bottom)
(10) dɛrdɛr-ə inji asil ɛ
priest-def . art I.dat tell.prf . ptcp cop .3.sg
‘The priest has told me …’
(11) xalp aɫvɛs mə gə dɛsnu mi ak‘ṙav mə
fraudulent fox a particle see. prs .3 sg one crow a
(< gə-particle + dɛsnul ‘to see’)
‘A fraudulent fox is seeing a crow’
(Ačaṙyan 1953: 227, line 13)
6.4. Muš (the Muš / Tigranakert or South-Central group)

The Muš dialect was spoken in the territory of the Armenian historical province of
Turuberan and in south of Ayrarat. During the Russo-Turkish wars, two large groups
of the Muš and Alaškert populations migrated to Aparan and south of the Lake
Sevan, now in the Republic of Armenia. Near Axalk‘alak‘, there are three villages of
immigrants from Xnus. The 1915 Genocide survivors dispersed around the world.
Dialect descriptions: Patkanov (1875, 2), Mserianc (1897–1901), Ačaṙyan
(1911: 116–139), Baɫdasaryan-T‘ap‘alc‘yan (1958), Baɫeš, village of Xult‘ik in
Tarōnean (1961); Alaškert: Aparan/Aragac in Madat‘yan (1985).
Phonology
Fourfold system of plosives: b‘ (in anlaut only) b p p‘. CA h- becomes x in some
subdialects, such as Baɫeš and Xlat‘, and remains intact elsewhere.
In word internal and final positions, CA voiced aspirates become voiceless aspi-
rates in Muš and voiceless unaspirated in Alaškert. In the same positions, CA voice-
less aspirates become voiceless unaspirated in Alaškert but remain intact in Muš.
The dialect also has an initial voiced h‘-, either as prothetic or deriving from
y- (cf. Martirosyan 2010: 764–765).
Nominal morphology
Archaisms: def. accusative z- and locative/allative i /y-; preposed prepositions:
məč‘ mər tan ‘in our house’. The subdialect of Alaškert does not have the z-. Table
19 shows the paradigms of p‘ayt, i-stem ‘wood’ and ji, o-stem ‘horse’ in Alaškert
(Madat‘yan 1985: 76–78).
Table 19: Nominal paradigms in the dialect of Alaškert

CA Alaškert CA Alaškert
sg n p‘ayt p‘ɛd pl p‘ayt-k‘ p‘ɛd-er
acc (z)p‘ayt p‘ɛd (z)p‘ayt-s p‘ɛd-er
gd p‘ayt-i p‘ɛd-i p‘ayt-i-c‘ p‘ɛd-ɛr-u
abl i p‘ayt-ē p‘ɛd-ɛn, -ic i p‘ayt-i-c‘ p‘ɛd-ɛr-ɛn,
p‘ɛd-ɛr-uc
i p‘ayt-i-w- p‘ɛd-ɔv p‘ayt-i-w-k‘ p‘ɛd-ɛr-ɔv
CA Alaškert CA Alaškert
sg n ji j‘i pl ji-k‘ j‘i-an
acc (z)ji j‘i (z)ji-s j‘i-an
gd ji-oy j‘i-u ji-o-c‘ j‘i-an-ɛr-u
abl i ji-oy j‘i-u-c i ji-o-c‘ j‘i-an-ɛr-u-c,
j‘i-an-ɛr-ɛn
i ji-o-v j‘i-ɔv ji-o-v-k‘ j‘i-an-ɛr-ɔv
Verbal morphology
Indicative present: kə sirim ‘I love’, k-aɫam ‘I grind’. Table 20 combines the aorist
paradigm of the verb tam ‘to give’ in Muš (Baɫdasaryan-T‘ap‘alc‘yan 1958: 169),
Sasun (Petoyan 1954: 59), Polis (Ačaṙyan 1941: 149) and elsewhere.
Table 20: Aorist paradigms for tam ‘to give’ in CA and South-Central dialects
CA SWA SEA Muš Sasun Polis
sg etu tu-i tv(ec‘)i təv(ɛc‘)i dväc‘ə duvi
etur tu-ir tv(ec‘)ir təv(ɛc‘)ir dväc‘ər duvir
et tu-aw tvec‘ təvɛc‘ dvəc‘ duvav
pl tuak‘ tu-ink‘ tv(ec‘)ink‘ təv(ɛc‘)ink‘ dväc‘ək‘ (!) duvink‘
etuk‘ tu-ik‘ tv(ec‘)ik‘ təv(ɛc‘)ik‘ dväc‘ək‘ duvik‘
etun tu-in tv(ec‘)in təv(ɛc‘)in dväc‘ən duvin

(12) Sarkis k-ɛrta kə-hasni
Sargis (m. pers. name) particle -go. prs .3 sg particle -reach. prs .3 sg
Mɛlikk‘yand
place name
‘Sargis is going and reaching Mɛlikk‘yand’ (k-ɛrta < kə-particle +, ɛrtal ‘to
go’; kə-hasni < kə-particle + hasnel ‘to reach’)
(Madat‘yan 1985: 155, line 15 from bottom)
(13) Saṙa-i kisur kə-tɛsna Sarkəs-in
Sarah-gen husband’s mother ind -see. prs .3 sg Sargis-acc
‘The mother-in-law (husband’s mother) of Saṙa sees Sargis’ (kə-tɛsna <
kə-particle + tɛsnal ‘to see’)
(Madat‘yan 1985: 155 line 13 from bottom)
(14) Mɔla Nasradin šad b‘ampaṙag maṙt ɛr
Molla Nasreddin much lewd man be.impf .3 sg
‘Molla Nasreddin was a very lewd man’
(Madat‘yan 1985: 158 line 10)
6.5. Van (the Van or Southern intergroup)

Before the 1915 Genocide, the Van dialect and the Moks and Šatax subdialects, as
well as the (sub)dialects of Ozim and Diadin were spoken in territories that histor-
ically belonged to Vaspurakan and Moks, provinces of Greater Armenia. The 1915
Genocide survivors dispersed around the world. Some populations settled in the
Republic of Armenia.
Dialect description: Phonology of Van (Ačaṙyan 1903); Van – in the glos-
sary Moks and Ozim are also included (Ačaṙyan 1911: 140–158; 1952); Šatax
(M. H. Muradyan 1962); Moks (M. H. Muradyan 1982); Sevan region (Baɫramyan
1972, passim); Diadin/Vardenis (V. Xač‘atryan 2004). A very rich glossary of
Moks: Orbeli (2002), ca. 4500 words. Ozim is described in Ačaṙyan (1911: 147–
150), Ɫaribyan (1953: 93–97), N. E. Hovsep‘yan (1966), Arewikyan (1967), and
especially in the unpublished thesis by Hovsep‘yan (1970, non vidi). Texts from
Ozim: Kaycoṙik (1899); Ačaṙyan (1911: 156–158).
M. H. Muradyan (1982) discusses 72 phonological and morphological
isoglosses between Urban Moks, Rural Moks, Ozim, Šatax and Van and concludes
that Rural Moks is closest in relationship to Urban Moks and furthest from Van,
and Šatax takes an intermediate position between Rural Moks and Van. According
to some scholars (Ɫaribyan 1953: 93–95; N. E. Hovsep‘yan 1966), Ozim should
be treated as a distinct dialect. J̌ ahukyan (1972: 135) takes it together with Van.
Phonology
One of the earliest Armenian dialectal isoglosses (see §4.1) is Ačaṙyan’s Law, that
is, the fronting of back vowels after voiced obstruents.17 This feature unites the
Van-Urmia and Goris/Ɫarabaɫ/Kṙzen/Agulis areas. Table 21 provides examples for
both voiced and unvoiced obstruents in the initial and internal positions respec-
tively.
Table 21: Fronting of back vowels after voiced obstruents (Ačaṙyan’s Law)
CA gloss Van Moks/Šat. Urmia Ɫarabaɫ Kṙzen
bak courtyard päk päk päky päk, pɛk –
bah spade päx päx päx päh bäh
barak subtle päräk pärak päräky pä/ɛrak bärak
barj pillow päṙc‘ pärc‘ päṙc‘ pä/ɛrc‘ bärj/c‘
beṙn burden peṙ pɛ/eṙ pɛṙ pɛṙnǝ bɛṙ
beran mouth pɛran pɛran pɛran pɛ́ran bǝran
berd fortress peṙt‘ pirt‘/pert‘ pɛṙt‘ – –
boɫk radish pöxk pöɫk poxky pɔ̈/ɛxk bɔ̈xk
boc‘ flame pöc‘ püc‘ poc‘ – –
burd wool pürt‘ pürt‘ pürt‘ pürt‘ bürd/t‘
pakas less pakas pakas pakas pákas pakas
paɫ cold paɫ paɫ paɫ – –
paṙaw old woman paṙav paṙav paṙav paṙav paṙav
port navel poṙt puṙt poṙt pɔr/ṙt pɔrt
gal to come kyäl kyäl ikyäl kyäl gyöl
gaṙn lamb kyäṙ kyäṙ kyäṙ kyäṙnǝ gyäṙ
garun spring kyärün kyärun kyärün kyärunk‘ gyärunk‘
get river kyet kyit/kyet kyet kyɛt gyɛt
17
For a discussion and literature on Ačaṙyan’s Law, see Vaux (1992) and Martirosyan
(2010: 747).
CA gloss Van Moks/Šat. Urmia Ɫarabaɫ Kṙzen

gort frog kyöṙt kyɔ̈ṙt koṙt kyɛ/ɔ̈ṙt‘nuk gyɔ̈rt
kap tie kap kap kap kap kap
karōt need(y) karɔt karɔt karɔt kárɔt karɔt
ker food ker kir/ker – kɛr- kɛr
kot‘ handle kot‘ kut‘/kot‘ kot‘ kɔt‘ kɔt‘
kut grain kut kut kut kɔt kut
daɫjn wild mint täxc‘ täxc‘ täxc tɛ́(ɫ)snǝ däɫj
das order täs täs täs täs, tɛs däs
deɫ remedy teɫ tɛɫ/teɫ tɛɫ tɛɫ dɛɫ
doɫ tremor töɫ töɫ tɔɫ tɔ̈ɫ dɔ̈ɫ
tasn ten tas tas tassǝ tásǝ tasǝ
teɫ(i) place teɫ tɛɫ/teɫ tɛɫ t(ǝ)ɛɫ tɛɫ
tun house tun tun tun tɔn tu
CA gloss Van Moks Šatax Ɫarabaɫ Kṙzen

ambar storehouse ämbär ambär ambär ámbar ambar
andundk‘ abyss andundk‘ händütk‘ händütk‘y ǝndɔ́xtǝ –
aprank‘ products aprank‘y aprank‘y aprank‘y áprank‘ aprank‘
ardar righteous ärtär artär artär ärt‘är –
aregakn sun arɛkyäk Ɛrɛkyäk arɛkyäk ǝrík‘ynak ärägyäk
arcat‘ silver aṙcat‘ ar/ṙcat‘ aṙcat‘ árcat‘ arcat‘
arǰaṙ cattle äčäṙ ačäṙ ačäṙ áč‘a/ɛṙnǝ aǰäṙ
arǰasp vitriol – arčäp‘s arčäps – –
bambak cotton pämbäk pämbäk pämbäk pɛ́mba/äk bän/mbäk
borbos mold/Schimmel pɛɔrpɛɔs pürpürs pürpös p‘ǝrp‘ɛ́šn- bǝmbɛṙn-
gagat‘n summit kyäkyät‘ kyäkyät‘ kyäkyät‘ – Šm. kyäkyät
gangat complaint kyänkyät kyänkyät kyänkyät kyɛ́ngyät k/gyängyät
dadarel to cease tätrel tätäril tätrel t‘ǝt‘árɛ/il t‘ät‘ärɛl
ddum pumpkin tǝtyüm tǝtyüm tǝtyüm – –
erkank‘ hand-mill ɛrkank‘y ɛrkank‘y ɛrkank‘y árkank‘ arkank‘
kanač‘ green kanač‘ kanač‘ kanač‘ kána(n)č‘ kananč‘
kardal to shout, read kärtäl kartäl kartäl kárt‘ɛ/il kardal
karkut hail karkut karkut karkut kárku/ɔt karkut
margarit pearl markyǝrit märkyǝrit märkyǝrit – mǝrk‘yärit
mardak beam (of wood) märtäk martäk – – –
šaɫgam turnip šäxkyäm šaxkyäm šäxkyäm šaxkam –
urbat‘ Friday urpät‘ urpät‘ urpät‘ ɔ̈rp‘ät‘ urbät‘
paganel to kiss pak‘yel pakynil pakynel – –
The vocalism in Moks and Šatax is worthy of note. In e. g. Van ärtär and Ɫarabaɫ
ärt‘är ‘righteous’ we can reconstruct the following sequence of developments: CA
ardar > *artär > ärtär. Moks and Šatax18 did not share the assimilation in vowel
quality with Van, but preserved the intermediate form artär.
Nominal morphology
There are a number of plural markers, such as (Ačaṙyan 1952: 108–115):
• -er, with monosyllables, e. g. xac‘, pl. xac‘er;
• -ner (1) with words that became monosyllabic after the loss of final -n, e. g.
cük (from CA jukn ‘fish’), pl. ckner; (2) with polysyllables, e. g. axpɛr (cf. CA
eɫbayr ‘brother’), pl. axpɛr-ner;
• -hyter < *-k‘-ter, with polysyllables ending in a vowel or -n, e. g. kyini (< CA
gini ‘wine’), pl. kyini-hyter; maṙan ‘pantry, cellar’, pl. maṙan-hyter;
• -ɛstar, e. g. ikyi, pl. ɛkyɛstar from CA aygi ‘vineyard’, cf. coll. aygestan(i).
An archaism that is preserved in Moks, Šatax and Ozim but lost in Van is the def.
accusative marker z-: əz-caṙ kətric‘ ‘(he/she) cut the tree’; əz-lač höröxkic‘ ‘(he/
she) sent the boy’.
As Ačaṙyan (1952: 13) points out, the ablative in -ic‘ and the accusative/dative
by-forms with animate nouns give Van an intermediate position between western
and eastern dialects.
The ablative of place names is formed with -a, e. g. Vänä ‘from Van’, cf. CA
abl. i Vanay.
Relics of Classical Armenian plural genitive, e. g. Van ceṙ, gen. ceṙac‘ reflect-
ing CA jeṙn (cf. pl. jeṙ-k‘) and gen.pl jeṙ-a-c‘ ‘hand’, respectively. Table 22 pro-
vides the paradigms of hac‘, i-stem ‘bread’ and gini, wo-stem, ea-stem ‘wine’
(Ačaṙyan 1952: 117, 121).
Table 22: Nominal paradigms in the Van or Southern intergroup

CA Van CA Van
sg n hac‘ xac‘ pl hac‘-k‘ xac‘-er
acc (z-)hac‘ xac‘ (z)hac‘-s xac‘-er
gd hac‘-i xac‘-i hac‘-i-c‘ xac‘-er-ac‘
abl i hac‘-ē xac‘-ic‘ i hac‘-i-c‘ xac‘-er-ac‘
i hac‘-i-w xac‘-ɔv hac‘-i-w-k‘ xac‘-er-ɔv
18
See Ačaṙyan (1952: 248) and M. H. Muradyan (1962: 192). The form is corroborated
by the genuine record of Orbeli (2002: 208) in the Moks area in 1911–1912.
CA Van CA Van
sg n gini kyini pl kyini-hy-ter
acc (z-)gini kyini kyini-hy-ter
gd ginw-oy kyinu gine-a-c‘ kyini-hy-ter-ac‘
abl i ginw-o-y kyinu-c‘ kyini-hy-ter-ac‘
i ginw-o-v, kyin-ɔv gine-a-w-k‘ kyini-hy-ter-ɔv
gine-a-w
Verbal morphology
Table 23 combines the paradigms of the verb tam ‘to give’ in Van (Ačaṙyan 1952:
173), Moks (M. H. Muradyan 1982: 173), Šatax (M. H. Muradyan 1962: 148), in
CA and SWA.
Table 23: Verbal paradigms in the Van or Southern intergroup

CA SWA Van Moks Šatax
sg etu tu-i tv(ic‘)i təvə təvi
etur tu-ir tv(ic‘)ir təvir təvir
et tu-aw tvec‘, i-tu(r) i-tu i-tu
pl tuak‘ tu-ink‘ tv(ic‘)ink‘y təvink‘y təvink‘y
etuk‘ tu-ik‘ tv(ic‘)ik‘y təvik‘y təvik‘y
etun tu-in tv(ic‘)in təvin təvin

(15) im xɔr anun Märkyär i
my father.gen . sg name Margar cop .3sg . prs
‘My father’s name is Margar’
(16) Xuṙik xat‘ün nstir ɛr xɔv caṙi tak
Mrs. Xuṙik sit. pst . ptcp be.impf .3 sg cool tree.gen . dat . sg under
‘Mrs. Xuṙik was sitting under a cool tree’
(17) astɫ.er.n anuš p‘ɛlk‘ k-utan
star-nom . pl . def sweet shine ind -give. prs .3 pl
‘The stars are shining sweetly’ (k-utan < kə-particle + tal ‘to give’)
B. Eastern grouping
6.6. Xoy-Urmia (the Xoy / Maraɫa or South-Eastern group)
J̌ ahukyan (1972: 135) here takes one dialect, Xoy/Maraɫa, and identifies four rep-
resentatives of it: Maraɫa, Urmia, P‘ayaǰuk and Xoy. This dialectal area comprises
the territory of the Armenian historical province of Parskahayk‘ (Persian Armenia),
as well as some adjacent regions, in particular Maraɫa (east of Lake Urmia), where
Armenians have lived since the 13th century. In 1828, a considerable part of the
Armenian populations of these regions migrated to the Caucasus, the village of Aza
(Naxiǰewan, on the River Arax), Ɫarabaɫ and elsewhere. Before the First World War
there lived 368 Armenian families in the Maraɫa region, and in 1920 the number
was 200. Nowadays the dialect is also spoken in a few regions of the Republic of
Armenia, namely Artašat, Vayoc‘-jor, Kotayk‘, Hoktemberyan, etc. (Ačaṙyan 1911:
288; 1926: 9–13; 1951: 359; Ɫaribyan 1953: 342–343; Asatryan 1962: 9–11).
Dialect description of Urmia/Xoy in Ačaṙyan (1911: 288–290), Ɫaribyan
(1939: 128–146; 1953: 342–358) and Asatryan (1962). Maraɫa is described in
Ačaṙyan (1926).
Phonology
Before a syllable-initial consonant, the velars k/ky and g/gy become y, while k‘/k‘y
becomes hy, cf. taxtak ‘board, plank’, pl. taxtəynɛr; axč‘iky ‘girl’, pl. axč‘iynɛr,
p‘ɛt‘ägy ‘beehive’, pl. p‘ɛt‘äynɛr; irɛk‘y ‘three’, irɛhy xɛt ‘three times’, etc. This
also holds for words with, e. g., k‘n from gn, such as aganim ‘to put on clothes’ >
xahynɛl (cf. hak‘nɛ/il in the majority of dialects). See also §6.5, Van. On Ačaṙyan’s
Law and the shift h > x, see §§5.1, 6.5 (Van), 6.8 (Ɫarabaɫ).
Nominal morphology
There are three plural markers:
• -ɛr in monosyllables, e. g. ɛc (from CA ayc ‘goat’), pl. ɛc-ɛr;
• -nɛr (1) in words that became monosyllabic after the loss of final -n, e. g. kyäṙ
(< CA gaṙn ‘lamb’), pl. kyäṙnɛr; (2) in polysyllables, e. g. kyäzän (< CA gazan
‘beast’), pl. kyäzännɛr;
• -k‘yɛr (a combination of the Old Armenian plural markers k‘ and -e(a)r) in
polysyllables, e. g. pälis (< CA acc.pl banali-s ‘key’), pl. pälis-k‘yɛr.
In some subdialects one finds the following distribution: -nɛr in words ending in
a vowel, and -k‘yɛr in words ending in a consonant, cf. aṙu-nɛr (aṙu ‘brook’) vs.
karas-k‘yɛr (karas ‘jar’).
In some subdialects the marker -k‘yɛr is used in possessive constructions for
expressing the plurality of the possessor, cf. tun-əs ‘my house’, tən-ɛr-əs ‘my
houses’ and tənə-k‘yɛr-əs ‘our houses’. The paradigms of caṙ, o-stem ‘tree’ and
hogi ‘spirit’ in CA and the Xoy dialect (Asatryan 1962: 77–78) are provided in
Table 24.
Table 24: Nominal paradigms CA and Xoy dialect

CA Xoy CA Xoy
sg n caṙ caṙ pl caṙ-k‘ caṙ-nēr
acc (z-)caṙ caṙ (z-)caṙ-s caṙ-nēr
gd caṙ-o-y caṙ-i(/ə) caṙ-o-c‘ caṙ-nēr-i(/ə)
abl i caṙ-o-y caṙ-ē i caṙ-o-c‘ caṙ-nēr-ē
i caṙ-o-v caṙ-ɔv caṙ-o-v-k‘ caṙ-nēr-ɔv
CA Xoy CA Xoy
sg n hogi xɔk‘i pl hogi-k‘ xɔk‘i-nɛr
acc (z-)hogi xɔk‘i (z-)hogi-s xɔk‘i-nɛr
gd hogw-o-y xɔk‘-u hogw-o-c‘ xɔk‘i-nɛr-i(/ə)
abl i hogw-o-y xɔk‘iy-ɛ i hogw-o-c‘ xɔk‘i-nɛr-ɛ
i hogw-o-v xɔk‘(iy)-ɔv hogw-o-v-k‘ xɔk‘i-nɛr-ɔv
Verbal morphology
The dialect belongs to the l-group; the indicative present is formed as with the
construction participle + copula: üzɛl ɛm ‘I want’, ɛt‘äl ɛm ‘I go’ (Xoy); üzeli im
‘I want’, amč‘ənalǝ́ yim ‘I am ashamed’ (Maraɫa).
Remarkable is Urmia, in which the 3rd singular form has -l vs. -s in the rest
of paradigm, thus: kyirɛs ɛm/s ‘I/you write’, 3sg kyirɛl i ‘he or she writes’; mənas
ɛm/s ‘I/you stay’, 3sg mənal i ‘he or she stays’. It is interesting to compare this
pattern to that of Areš/Havarik (an eastern dialect displaying no consonant shift),
in which the 1st person has -li(s), whereas in the others the -l- is combined with
the participial ending -um: 1sg sirɛlis əm / sirɛli yəm, 2sg sirɛləm əs, 3sg sirɛlim i
(CA sirem ‘to love’).19The paradigms of the present indicative of the verb grem ‘to
write’ in Urmia and Areš/Havarik are provided in Table 25.
Table 25: Verbal paradigms CA and Urmia and Areš/Havarik dialects

CA Urmia Areš/Hav.
sg grem k irɛs ɛm
y
gyirɛlis əm /
gyirɛli yəm
gres kyirɛs ɛs gyirɛləm əs
grē kyirɛl i gyirɛlim i
pl gremk‘ kyirɛs ɛnk‘ gyirɛləm ank‘
grēk‘ kyirɛs ɛk‘y gyirɛləm ak‘
gren kyirɛs ɛn gyirɛləm an
19
For the forms and a discussion, see the relevant dialect descriptions, particularly Asa-
tryan (1962: 7–17), as well as G. Geworgyan (1985, 2010).
The imperfect is formed by addition of the particle ɛr to the present form, but the
3sg form is not distinct here. Thus:
• kyirɛs ɛm ‘I write’ vs. kyirɛs ɛm ɛr ‘I was writing’;
• kyirɛs ɛs ‘you write’ vs. kyirɛs ɛs ɛr ‘you were writing’;
• kyirɛl i ‘he/she writes’ vs. kyirɛs ɛr ‘he/she was writing’.
The imperfect in Areš is composed of -ləm + copula: gyirɛləm ɛy ‘I was writing’,

gyirɛləm ɛyr ‘you were writing’, gyirɛləm ɛr ‘he/she was writing’, etc.
The debitive is formed with the particle ti (< piti), e. g. ti kyirɛm ‘I have to
write’. Remarkable is the by-form tikyi / tikyə. A very interesting feature of this
dialect is the use of a possessive suffix with verbs, e. g. mɛ ci pṙnɛnk‘-t (one horse
let’s.catch-2 sg ) ‘let’s catch a horse for you’. This is due to Persian influence, cf.
dīdam-at ‘I saw you’.

(18) ɛt växkuy Näzär-ə mɛ k‘əšɛr kəngə xɛtə
that coward Nazar-def . art one night wife with
šɛmk‘y-n ə tüs ikyälə
threshold-def . art cop .3 sg out come. prs .3 sg
‘One night, this coward Nazar steps out onto the threshold’ (kəngə < kəniky
‘wife’; ikyälə < ikyäl ‘to come’)
(Asatryan 1962: 170, line 8–9 from bottom)
(19) mɛnk‘y ləsir ɛnk‘y ɛr k‘yɔ zärzändɛli anun-ə
we hear 1 pl . pst . prf your (sg ) fearsome name-def . art
‘We had heard your fearsome name (fame)’ (ləsɛl ‘to hear, listen to’)
(Asatryan 1962: 175, line 5)
(20) axč‘iky-ə an-kəṙɛl cəmɛrüky i
girl-def . art un-cut watermelon cop .3 sg
‘The girl is an uncut watermelon’
(Asatryan 1962: 183, line 9)
6.7. Ararat (the Ararat or North-Eastern group)

This is one of the largest Armenian dialects. It is mainly spoken in the northern
and central parts of the Republic of Armenia, historically the eastern part of the
province of Ayrarat, as well as the territory of Taširk‘, a district in the province of
Gugark‘.
Dialect description: Loṙi (Asatryan 1968), Hoktemberyan region (Baɫdasar-
yan-T‘ap‘alc‘yan 1973), Aštarak (Kapancjan 1975), general (Markosyan 1989).
Dictionary of the Ararat dialect: Nawasardean (1903).
Phonology
Ararat is a typically eastern dialect with indicative present in um (sirə/um ɛm
‘I love’) and penultimate accentuation. The series g/k/k‘ practically remains intact;
some subdialects have voiced aspirates b‘/g‘/d‘ in initial position.
The diphthongs oy and iw both become u or (in e. g. Loṙi) i, e. g. loys ‘light’ >
lus or lis, k‘oyr ‘sister’ > k‘ur or k‘ir; ariwn ‘blood’ > arun or arin, jiwn ‘snow’ >
jun or jin.
Nominal morphology
The paradigms of hac‘, i-stem ‘bread’, and mard, o-stem ‘man, human’ in the
Ararat dialect (Ačaṙyan 1957: 549–551; Markosyan 1989: 109) are provided in
Table 26.
Table 26: Nominal paradigms in the Ararat dialect

CA Ararat CA Ararat
sg n hac‘ hac‘ pl hac‘-k‘ hac‘-ɛr
acc (z)hac‘ hac‘ (z)hac‘-s hac‘-ɛr
gd hac‘-i hac‘-i hac‘-i-c‘ hac‘-ɛr-i
abl i hac‘-ē hac‘-ic‘ i hac‘-i-c‘ hac‘-ɛr-ic‘
i hac‘-i-w hac‘-ɔv hac‘-i-w-k‘ hac‘-ɛr-ɔv
CA Ararat
sg n mard mart‘
acc (z)mard mart‘(-u)
gd mard-oy mart‘-u
abl i mard-oy mart‘-uc‘
i mard-o-v mart‘-ɔv
Verbal morphology
The debitive pəti (< piti) can be both preposed and postposed, in the latter case
being conjugated, e. g. pəti aɫam vs. aɫal (pə)tɛm ‘I have to grind’.
One of the most interesting features of the Loṙi subdialect is the doublet forms
of imperfect, one of which is identical with the CA paradigm and can be derived
from it, though a secondary creation is also considered possible (Asatryan 1968:
24–25, 121–122; J̌ ahukyan 1972: 105). Table 27 gives the relevant forms in CA
and the Loṙi subdialect.
Table 27: Verbal paradigms for the imperfect in CA and Loṙi subdialect
Present Imperfect
CA Loṙi CA Loṙi 1 Loṙi 2
sg berem bɛrəm ɛm berēi bɛrɛi bɛrəm i
beres bɛrəm ɛs berēir bɛrɛir bɛrəm ir
berē bɛrəm a berēr bɛrɛr bɛrəm ɛr
pl beremk‘ bɛrəm ɛnk‘ berēak‘ bɛrɛink‘ bɛrəm ink‘
berēk‘ bɛrəm ɛk‘ berēik‘ bɛrɛik‘ bɛrəm ik‘
beren bɛrəm ɛn berēin bɛrɛin bɛrəm in
Similarly, there are doublet forms for the 3sg .aor . ending in Loṙi, e. g. ber-uc‘ vs.
ɛ-ber ‘he/she brought’. The latter is an archaic relic of CA e-ber. For the complete
aorist paradigm, see §6.8, Ɫarabaɫ.

Ɫɔrmacik Pōɫosə (subdialect of Astapat)
(21) ɫɔrmacik Pōɫos-ə k-asɛr
deceased Pōɫos-def . art ind -say.impf .3sg
‘The deceased Pōɫos was saying’
(22) mɛnk‘ vɛc‘ k‘ur ink‘ hama axpɛr č‘-unink‘
we six sister were but brother neg -have.impf .1 pl
‘We were six sisters but did not have brothers’
(23) t‘ak‘avor-ə gənəm a baɫ-ə
king-def . art go-3sg . prs garden-def . art
‘The king is going to the garden’
(HayŽoɫHek‘ 8, 1977: 443, line 4 from bottom)
6.8. Ɫarabaɫ (the Ɫarabaɫ / Šamaxi or extreme North-Eastern intergroup)

This is one of the largest Armenian dialects. It mainly occupies the territory of the
Armenian historical provinces of Arc‘ax and Utik‘. It is mainly spoken in Republic
of Artsakh/Arcʻax (Nagorno-Karabakh Republic). The population here is now ca.
140,000. Some of the subdialects of Ɫarabaɫ are spoken in surroundings, such as
Goris/Syunik‘, south of the Republic of Armenia.
Dialect description: Ɫarabaɫ proper (Ačaṙyan 1899; Davt‘yan 1966); Goris/
Syunik‘ (Vartapetjan 1962, Margaryan 1975); Tavuš (Mežunc‘ 1989); Šaɫax (Ɫar-
ibyan 1939: 50–58; Davt‘yan 1966); Hadrut‘ (Ɫaribyan 1939: 107–127; Poɫosyan
1965; Davt‘yan 1966). Historical phonology of the dialects of the Syunik‘ and
Arc‘ax areas: (Aɫabekyan 2010). Copious dictionary: (Sargsyan 2013).
Phonology
An archaic set of isoglosses such as accent retraction onto penultimate syllable and
Ačaṙyan’s Law (fronting of back vowels after voiced obstruents)20 demonstrates
an opposition between the South-Eastern periphery (Ɫarabaɫ/Agulis area) and Van,
on the one hand, and the Central and Western regions, on the other. The Van-Urmia
intermediate area may originally have formed part of the South-Eastern area. Note
also the sharp contrast between Muš and Ɫarabaɫ (see section 6). Ɫarabaɫ and Van
also share the devoicing consonant shift, in this case in contrast with Agulis, Meɫri
and Kṙzen.
The evidence for Ačaṙyan’s Law (see the table in 6.5, Van, as well as Marti-
rosyan 2010: 747) corroborates the assumption of A. E. Xač‘atryan (1984: 321–
322) that Ɫarabaɫ pɛ́rp‘ɛl is the regular outcome of Classical Armenian borb- ‘to
inflame’ rather than an archaic reflex of an otherwise unattested e-grade form
*berb-. Similarly untenable is the derivation of Ɫarabaɫ kɛ́ɫɛl and Meɫri gɛ́ɫil ‘to
hide, conceal oneself’ from an archaic *geɫ-. A. E. Xač‘atryan (1984: 321) con-
vincingly argues that these forms rather continue gaɫel ‘to hide’ through Ačaṙyan’s
Law.
Nominal morphology
Morphological archaisms: Ɫarabaɫ gen. tɛ́ṙnə from CA te-aṙn, gen. of tēr < *ti-ayr
‘master, lord’ (Martirosyan 2010: 61); Ɫarabaɫ cɛṙɔk‘ from CA instr . pl jeṙ-a-
w-k‘ ‘by/with hands’ (Davt‘yan 1966: 107). Table 28 shows the paradigm of hac‘
‘bread’ (Davt‘yan 1966: 105, 114–118):
Table 28: Nominal paradigms in Ɫarabaɫ

CA Ɫrb. indef. Ɫrb. def.
sg n hac‘ hac‘ hac‘-ə
acc (z-)hac‘ hac‘ hac‘-ə
gd hac‘-i hac‘-u hac‘-(ə-)ɛn
abl i hac‘-ē hac‘-a hac‘-a-n
i hac‘-i-w hac‘-a-v hac‘-a-v-ə
loc i hac‘-i hac‘-um, hac‘-um-ə,
hac‘u mɛč‘in hac‘ɛn mɛč‘in
20
For Ačaṙyan’s Law in Łarabał see Ałabekyan (2010: 154).
CA Ɫrb. indef. Ɫrb. def.

pl n hac‘-k‘ hac‘-ɛr hac‘-ɛr-ə
acc (z-)hac‘-s hac‘-ɛr hac‘-ɛr-ə
gd hac‘-i-c‘ həc‘-ɛr-ɛ/u həc‘-ɛr-ɛ-n
abl i hac‘-i-c‘ həc‘-ɛr-a həc‘-ɛr-a-n
i hac‘-i-w-k‘ həc‘-ɛr-a-v həc‘-ɛr-a-v-ə
loc i hac‘-s həc‘-ɛr-əm, həc‘-ɛr-əm-ə,
həc‘ɛrɛ mɛč‘in həc‘ɛrɛn mɛč‘in
Verbal morphology
Ɫarabaɫ basically belongs to the um-branch: indicative present sírəm əm (also
in Tavuš), Goris sirum ɛm ‘I love’. Šaɫax and Hadrut‘, however, belong to (l)
is-branch, cf. mənayis əm and mənas əm ‘I stay’, respectively.
CA paradigm of ek-, suppl. aor. of gam ‘to come’ (1sg eki, 2sg ekir, 3sg ekn,
3pl ekin, etc., imper. ek) is preserved in Ɫarabaɫ aor. yɛ́kɛ, yɛ́kɛr, yɛ́kə, imper. yɛk,
etc. (Martirosyan 2010: 249). The CA and Mid.Arm. aorist paradigms of the verb
tam ‘to give’ are shown together with those in the dialects of Šamaxi (Baɫramyan
1964: 129, 162, 167), Kṙzen (Baɫramyan 1961: 151), Hadrut‘ (Poɫosyan 1965:
212) and Ararat/Loṙi (Asatryan 1968: 141) in Table 29.
Table 29: Verbal paradigms in CA and Mid. Arm., and selected dialects
CA Mid. Arm. Šamaxi Kṙzen Hadrut‘ Loṙi
sg etu tu-i tur/vi tə/uvi tuvɛ təvi
etur tu-ir tur/vir tə/uvir tuvɛr təvir
et etu-r, eret, tu-aw u-tur, tur/vav ə-rɛt, təvuc‘ tu/əvav təvuc‘
pl tuak‘ tuak‘ turink‘ tə/uvink‘ tuvɛk‘y təvink‘
etuk‘ tuik‘ turik‘ tə/uvik‘ tuvɛk‘y təvik‘
etun tuin turin tə/uvin tuvɛn təvin

(24) t‘ak‘avɛr-ə həvak‘um a ir yɛrgyir-i imastun
king-def . art gather. ptcp - cop .3 sg . prs his country-gen . sg wise
mart‘-unc‘-ə
man-acc . pl - def . art
‘The king is gathering the wise men of his country’
(Davt‘yan 1966: 263, lines 11–12 from bottom)
(25) min t‘ak‘avɛr mɛhɛt min araz a tɛhnum
one king once one dream cop .3 sg -see. ptcp
‘Once upon a time a king sees a dream’
(Davt‘yan 1966: 265, line 16 from bottom)
(26) šinäc‘i Sirk‘yisin Maral ənəmav min kɔv

villager Sargis-gen . dat Maral name-ins . sg one cow
unɛr
have.3 sg . impf
‘The villager Sargis had a cow named Maral’
(Davt‘yan 1966: 280, line 12)
Abbreviations
1,2,3 first, second, third person masc Masculine

acc Accusative neg Negation
cop Copula nom Nominative
dat Dative pl Plural
def Definite prf . ptcp Perfect participle
def . art Definite article prs Present
gen Genitive pst . ptcp Past participle
impf Imperfective sg Singular
ind Indicative
References
Ačaṙyan, Hrač‘ya Hakobi. 1898a. (Ačaṙean), K‘nnut‘iwn Aslanbēgi barbaṙin [Study of the
dialect of Aslanbēg] Bazmavēp 56(1). 31–35; (2). 83–88.
Ačaṙyan, Hrač‘ya Hakobi. 1898b. (Ačaṙean), K‘nnut‘iwn Aslanbēgi barbaṙin [Study of
the dialect of Aslanbēg] (reprinted from 1898). Venice: S. Lazar. (Usumnasirut‘iwnner
haykakan gawaṙabanut‘ean, 1).
Ačaṙyan, Hrač‘ya Hakobi. 1899. (Ačaṙean), K‘nnut‘iwn Ɫarabaɫi barbaṙin [Study of the
dialect of Ɫarabaɫ] (reprinted from “Ararat” 1899). Vaɫaršapat: Tparan Mayr At‘oṙoy S.
Ēǰmiacni. (Usumnasirut‘iwnner haykakan barbaṙaxōsut‘ean, 3).
Ačaṙyan, Hrač‘ya Hakobi. 1902. (Ačaṙean), T‘urk‘erēnē p‘oxaṙeal baṙer hayerēni mēǰ [Turk-
ish loanwords in Armenian] (Ēminean azgagrakan žoɫovacu, vol. 3). Moscow: Vaɫaršapat.
Ačaṙyan, Hrač‘ya Hakobi. 1903. (Adjarian), Lautlehre des Van-Dialekts. Zeitschrift für ar-
menische Philologie 1. 74–86, 121–138.
Ačaṙyan, Hrač‘ya Hakobi. 1909. (Adjarian), Classification des dialectes arméniens. Paris:
Librairie Honoré Champion.
Ačaṙyan, Hrač‘ya Hakobi. 1911. (Ačaṙean), Hay barbaṙagitut‘iwn: uruagic ew dasawor-
ut‘iwn hay barbaṙneri (barbaṙagitakan k‘artēsov) [Armenian dialectology: A sketch
and classification of Armenian dialects (with a dialect map)]. Moskua: Nor-Naxiǰewan.
(Ēminean azgagrakan žoɫovacu, vol. 8).
Ačaṙyan, Hrač‘ya Hakobi. 1913. (Ačaṙean), Hayerēn gawaṙakan baṙaran [Armenian re-
gional dictionary]. T‘iflis (Ēminean azgagrakan žoɫovacu, vol. 9).
Ačaṙyan, Hrač‘ya Hakobi. 1925. (Ačaṙean), K‘nnut‘iwn Nor-Naxiǰewani (Xrimi) barbaṙi
[Study of the dialect of Nor-Naxiǰewan (Crimea)]. Yerevan.
Ačaṙyan, Hrač‘ya Hakobi. 1926. (Ačaṙean), K‘nnut‘iwn Maraɫayi barbaṙi [Study of the
dialect of Maraɫa]. Yerevan.
Ačaṙyan, Hrač‘ya Hakobi. 1935. (Ačaṙean), K‘nnut‘iwn Agulisi barbaṙi [Study of the dia-
lect of Agulis]. Yerevan: PetHrat (State Press).
Ačaṙyan, Hrač‘ya Hakobi. 1940a. (Ačaṙean), K‘nnut‘iwn Nor-J̌uɫayi barbaṙi [Study of the
dialect of Nor-J̌ uɫa]. Yerevan.
Ačaṙyan, Hrač‘ya Hakobi. 1940b-1951. (AčaṙHLPatm 1–2): Hayoc‘ lezvi patmut‘yun [His-
tory of the Armenian language], vol. 1 (1940), Yerevan: University Press; vol. 2 (1951).
Yerevan: State Press of Armenia (HayPetHrat).
Ačaṙyan, Hrač‘ya Hakobi. 1941. K‘nnut‘yun Polsahay barbaṙi [Study of the dialect of Polis
Armenian]. Gitakan ašxatut‘yunner of Yerevan State University, vol. 19. 19–250.
Ačaṙyan, Hrač‘ya Hakobi. 1947. K‘nnut‘yun Hamšeni barbaṙi [Study of the dialect of
Hamšen]. Yerevan: Academy Press.
Ačaṙyan, Hrač‘ya Hakobi. 1952. K‘nnut‘yun Vani barbaṙi [Study of the dialect of Van].
Yerevan: University Press.
Ačaṙyan, Hrač‘ya Hakobi. 1953. K‘nnut‘yun Aṙtiali barbaṙi [Study of the dialect of Aṙtial].
Yerevan: Academy Press (Hayeren barbaṙner: 1).
Ačaṙyan, Hrač‘ya Hakobi. 1955–1971. (AčaṙLiak), Liakatar k‘erakanut‘yun hayoc‘ lezvi
[Complete grammar of the Armenian language] (Introductory vol. 1955); (vol. 3, 1957);
(vol. 6, 1971). Yerevan: Academy Press.
Ačaṙyan, Hrač‘ya Hakobi. 1971–1979. (HAB), Hayerēn armatakan baṙaran [Armenian
root dictionary], in 4 vols (2nd edn.). Yerevan: University Press. [The original edition:
1926–1935, in 7 volumes, Yerevan: University Press].
Ačaṙyan, Hrač‘ya Hakobi. 2003. K‘nnut‘yun Kilikiayi barbaṙi [Study of the dialect of Cili-
cia]. Yerevan: University Press.
Aɫababean, A. 1901. Hawlabarc‘oc‘ mtawor keank‘ə [Intellectual life of inhabitants of
Havlabar]. T‘iflis: Tparan K. Martiroseanc‘i.
Aɫabekyan, Melada A. 2010. Syunik‘-Arc‘ax tarack‘i barbaṙneri patmakan hnč‘yunaba-
nut‘yun [Historical phonology of the dialects of the Syunikʻ-Arcʻax area]. Hayereni
barbaṙagitakan atlas: usumnasirut‘yunner ew nyut‘er 3, 54–165. Yerevan: “Nairi”.
Alaverdyan, S. A. 1968a. Ardvini barbaṙi hnč‘yunakan bnoroš aṙanjnahatkut‘yunnerə [Pho-
nological peculiarities of the dialect of Ardvin]. Lraber hasarakakan gitut‘yunneri
1968, Nr 9. 83–94.
Alaverdyan, S. A. 1968b. Ardvini barbaṙi jewabanut‘yan mi k‘ani aṙanjnahatkut‘yunnerə
[Some morphological peculiarities of the dialect of Ardvin]. In: PBH 1968, Nr 3. 229–
240.
Aɫayan, Ē. B. 1954. Meɫru barbaṙ [The dialect of Meɫri]. Yerevan: Academy Press.
Aɫayan, Ē. B. 1958. Barbaṙayin hnaguyn tarberut‘yunner hayerenum [Ancient dialectal dif-
ferences in Armenian]. Teɫekagir: hasarakakan gitut‘yunner 1958, Nr 5. 67–80.
Allahvērtean, Y. 1884. Ulnia kam Zeyt‘un. K-Polis: Paɫtatlean/Aramean.
Alpōyačean, Aršak. 1952. (Archag Alboyadjian) Patmutʻiwn Ewdokioy hayocʻ: teɫagrakan,
patmakan ew azgagrakan teɫekutʻiwnnerov. Gahirē: “Nor astɫ” [History of the Arme-
nians in Tokat. With topographical, historical and ethnographical information]. Cairo:
New Star Press.
Alpōyačean, Aršak. 1961. (Archag Alboyadjian or Alpoyačean) Patmut‘iwn Malat‘ioy
hayoc‘: teɫagrakan, patmakan ew azgagrakan [History of the Armenians in Malatia:
with topographical, historical and ethnographical information.] Beirut: Sevan Press.
Amatuni, Sahak Vardapet. 1912. Hayoc‘ baṙ u ban [Armenian word and speech].Vaɫaršapat.
Andranik. 1900. Tērsim: čanaparhordut‘iwn ew teɫagrut‘iwn [Dersim: travel and topogra-
phy]. T‘iflis: Aragatip M. Martiroseanc‘i. (T‘iflisi hayoc‘ hratarakč‘akan ənkerut‘iwn,
188).
Andreasyan, Tigran. 1967. Svediayi barbaṙə (K‘ist‘inəgi lezun) [The dialect of Svedia].
Yerevan: Academy Press.
Ant‘osyan, S. M. 1961. Kesariayi barbaṙə [The dialect of Kesaria]. Yerevan: Academy Press.
Ant‘osyan, S. M. 1975. Xonarhman hamakargi zargac‘man ənt‘ac‘k‘ə. In Hayoc‘ lezvi pat-
makan k‘erakanut‘yun, vol. 2, 159–377. Yerevan: Academy Press.
Arewikyan, a. i. 1967. Ozmi barbaṙə [The dialect of Ozim]. Lraber hasarakakan gitut‘yun-
neri 1967, Nr 5. 66–78.
Asatryan, G. S. 1992. Kontakty armjanskogo s novoiranskimi jazykami [Contacts of Arme-
nian with New Iranian languages]. (Doctoral dissertation). Yerevan: Academy Press.
Asatryan, M. E. 1962. Urmiayi (Xoyi) barbaṙə [The dialect of Urmia (Xoy)]. Yerevan:
University Press.
Asatryan, M. E. 1968. Loṙu xosvack‘ə [The subdialect of Loṙi]. Yerevan: “Mitk‘”, Univer-
sity Press.
Avagyan, Serine Movsesi. 1978. Arčak. Hay azgagrutʻyun ew banahyusutʻyun 8. Yerevan:
Academy Press.
Avagyan, Suren Artemi. 1973. Vimakan arjanagrut‘yunneri hnč‘yunabanut‘yun [Phonol-
ogy of inscriptions] (X-XIV dd.). Yerevan: Academy Press.
Awde, Nicholas (ed.). 1997. Armenian perspectives: 10th anniversary conference of the
Association Internationale des Etudes Arméniennes. Richmond & Surrey: Curzon.
Ayceamn. 1907. Nmoyš mə Aramō giwɫin hay lezuēn [A sample of the Armenian language
of the Aramo village]. Handēs amsōreay 21. 27.
Ayvazyan, Argam. 1984. Agulis: patmamšakut‘ayin hušarjanner [Agulis: historical-cul-
tural memorials] Yerevan: “Hayastan”.
Azatean, T‘. 1943. Akn ew Akncʻikʻ: azgagrakan u kensagrakan yišatakaran [Akn: ethno-
graphical and biographical monument]. Istʻanpul: Tpagr. Y. Abrahamean.
Baɫdasaryan-T‘ap‘alc‘yan, S. H. 1958. Mšo barbaṙə [The dialect of Muš]. Yerevan: Acad-
emy Press.
Baɫdasaryan-T‘ap‘alc‘yan, S. H. 1973. Araratyan barbaṙi xosvack‘nerə Hoktemberyani
šrǰanum [The Ararat subdialects in the district of Hoktemberyan]. Yerevan: Academy
Press.
Baɫramyan, Ṙ. H. 1960. Dersimi barbaṙayin k‘artezə [The dialectal map of Dersim]. Yere-
van: Academy Press.
Baɫramyan, Ṙ. H. 1961. Kṙzeni barbaṙə [The dialect of Kṙzen]. Yerevan: Academy Press.
Baɫramyan, Ṙ. H. 1964. Šamaxii barbaṙə [The dialect of Šamaxi]. Yerevan: Academy Press.
Baɫramyan, Ṙ. H. 1972. Hayereni barbaṙner: Sewana lči avazani xosvacqnerə [Armenian
dialects: the subdialects of the basin of Lake Sevan]. Yerevan: Haykakan petakan man-
kavaržakan institut.
Baɫramyan, Ṙ. H. 1976. Xotrǰuri barbaṙə [The dialect of Xotrǰur] PBH 1976, Nr 3. 95–105.
Barxudareanc‘, Makar Vard./Epis. 1883. Pələ-Puɫi. T‘iflis: Mov. Vardaneani ew Ənk. tpara-
num.
Barxutareanc‘, Usta-Gēorg. 1883. Arazə tarin ktari. Šuši: Tparan M. Mahtesi-Yakobeanc‘.
Baxtikean, S. A. 1934. Arabkir ew šrǰakayi giwɫerə: patmakan-azgagrakan hamaṙōt
tesut’iwn [Arabkir and the adjacent villages]. Pēyrut‘: Tp. “Vahagn” H. Yakobean.
Beekes, Robert S. P. 2003. Historical phonology of Classical Armenian, apud Kortlandt

2003: 133–211.
Bläsing, Uwe. 1992. Armenisches Lehngut im Türkeitürkischen am Beispiel von Hemşin.
Amsterdam & Atlanta: Rodopi.
Bläsing, Uwe. 1995. Armenisch – Türkisch: Etymologische Betrachtungen ausgehend von
Materialen aus dem Hemşingebiet nebst einigen Anmerkungen zum Armenischen, ins-
besondere dem Hemşindialekt. Amsterdam & Atlanta: Rodopi.
Bläsing, Uwe. 2003. Some aspects of Armenian-Turkish language contacts in the Hamshen
area. Annual of Armenian linguistics 22–23. 7–16.
Bläsing, Uwe. 2007. Armenian in the vocabulary and culture of the Turkish Hemshinli. In
Hovann Simonian (ed.), The Hemshin: History, society and identity in the Highlands of
Northeast Turkey, 279–302. London & New York: Routledge.
Čanikean, Y. K. 1895. Hnut‘iwnk‘ Aknay [Antiquities of Akn]. T‘iflis: Tparan M. D. Ṙōtine-
anc‘i. (T‘iflisi hayoc‘ hratarakč‘akan ənkerut‘iwn, 89).
Clackson, James. 2004–2005. Review of Kortlandt 2003 (with an appendix = Beekes 2003).
Annual of Armenian linguistics 24–25. 153–158.
Č‘olak‘ean, Yakob. 1986. K‘esapi barbaṙə [The dialect of Kʻesap]. Haleb.
Danielyan, T’ewan. 1967. Malat‘iayi barbaṙə [The dialect of Malatʻia]. Yerevan: Academy
Press.
Davt‘yan, K. S. 1966. Leṙnayin Ɫarabaɫi barbaṙayin k‘artezə [The dialectal map of the
Mountainous Ɫarabaɫ]. Yerevan: Academy Press.
Dawit‘-Bēk, Melik‘ S. 1919. Arabkiri gawaṙabarbaṙə: jaynabanakan ew k‘erakanakan
usumnasiru‘iwn [The dialect of Arabkir: a phonetic and grammatical study]. Vienna:
Mxit‘arean Tparan. (Azgayin Matenadaran, 79).
Dumézil, Georges. 1964. Notes sur le parler d’un Arménien musulman de Hemşin. Académie
Royale de Belgique: Mémoires de la Classe des Lettres (Bruxelles) 57(4). 3–52.
Dumézil, Georges. 1965. Notes sur le parler d’un Arménien musulman d’Ardala. Revue des
études arméniennes (Paris) n. s. 2. 135–142.
Dumézil, Georges. 1967. Trois récits dans le parler der arméniens musulmans de Hemşin.
Revue des études arméniennes (Paris) n. s. 4. 19–35.
Dumézil, Georges. 1986–87. Un roman policier en arménien d’Ardala. Revue des études
arméniennes (Paris) n. s. 20. 5–27.
Eremean, Aram. 1923. Č‘aharmahali hay žoɫovrdakan banahiwsut‘iwnə [Armenian folk-
lore of Čʻaharmahal]. Vienna: Mxit‘arean tparan (Azgayin Matenadaran, 96).
Eremean, Aram. 1930. P‘šrank‘ner J̌uɫahay ew Hndkahay banahiwsut‘iwnic‘: taɫer ew
žoɫovrdakan erger (17–19rd dar) [Fragments of folklore of the Armenians of Julfa and
India]. Vienna: Mxit‘arean Tparan (Azgayin Matenadaran, 129).
Gabikean, Karapet. 1952. Baṙgirk‘ Sebastahay gawaṙalezui [Dictionary of the dialect of
Sebastia]. Erusaɫēm: St Jacob.
Gabriēlean, M. S. 1912. Aknay gawaṙabarbaṙə [The dialect of Akn]. Vienna: Mxit‘arean
tparan.
Galustean, Gr. H. (Kilikec‘i). 1934. Maraš kam Germanik ew heros Zēyt‘un. New York:
“Koč‘nak” (Maraši hayrenakc‘akan miut‘iwn).
Gasparyan, H. H. 1966. Haǰəni barbaṙə [The dialect of Hačən]. Yerevan: Academy Press.
Gasparyan, H. H. 1979. Dersim: patmaazgagrakan aknark [Dersim: a historical-ethno-
graphical survey]. PBH 1979, Nr 2. 195–210.
Gawaṙac‘i, H. M. H. 1903. Erger, aṙakner, hanelukner, t‘erahawatut‘iwnner… Xotrǰroy

[Songs, fables, riddles, superstitions… of Xotorǰur]. Tp‘xis.
Gazančean, Yovhannēs. 1899. Ewdokioy hayoc‘ gawaṙabarbaṙə [The Armenian dialect of
Tokat]. Vienna: Mxit‘arean tparan.
Geworgyan, Gayane. 1985. Miǰčyuɫayin barbaṙnerə hayerenum [Interdialects in Armenian].
Lraber hasarakakan gitut‘yunneri 1985, Nr 9. 52–60.
Geworgyan, Gayane. 1994. (Gevorkian), The continuous present (շարունակական
ներկայ) in Armenian dialects. Annual of Armenian linguistics 15. 43–62.
Geworgyan, Gayane. 2010. Sahmanakan eɫanaki nerka žamanaki taracakan-gorcaṙakan
bnut‘agirə hayereni barbaṙnerum [Areal-functional characteristics of the indicative
present tense in Armenian dialects]. In Hayereni barbaṙagitakan atlas: usumna-
sirut‘yunner ew nyut‘er 3, 232–295. Yerevan: “Nairi”.
Geworgyan, Geworg G. 1980. Ɫzlar. Hay azgagrut‘yun ew banahyusut‘yun 10. Yerevan:
Academy Press.
Geworgyan, T‘amar H. 1999. Loṙi (Taširk‘ – Joraget). Hay azgagrut‘yun ew banahy-
usut‘yun 20. Yerevan: Academy Press “Gitut‘yun”.
Greppin, John A. C. & Amalya A. Khachaturian 1986. A handbook of Armenian dialectol-
ogy. Delmar & New York: Caravan Books.
Grigorean (Ɫazarukean) Alēk‘s & Sedrak‘ Garakēōzean. 1963. Yišatakaran Ēvērēk-Fēnēsēi.
P‘ariz.
Grigoryan, A. V. 1957. Hay barbaṙagitut‘yan dasənt‘ac‘[A handbook of Armenian dialec-
tology]. Yerevan: University Press.
Grigoryan-Spandaryan, M. M. 1971. Leṙnayin Ɫarabaɫi banahyusut‘yunə [Folklore of the
Mountainous Ɫarabaɫ]. Yerevan: Academy Press.
Gurunyan, Hakob. 1991. Hamšenahay banahyusut‘yun [Folklore of the Hamšen Armeni-
ans]. Yerevan: “Luys”.
Guyumčean, Sanduxt. 1990. Vaɫemi Marašə ew hayerə [The ancient Maraš and the Arme-
nians]. Los Angeles.
Gyozalyan (Geɫuni), Grigor Mat‘ewosi. 2001. Musa leṙan azgagrut‘yunə [Ethnography of
Musa-Leṙ] (Veržine Svazlyan, ed.). Yerevan: Academy Press “Gitut‘yun”.
Hačean, H. M. 1907. Hin avandakan hēk‘eat‘ner Xotorǰroy [Old traditional folk tales of
Xotorǰur]. Vienna: I vans Paštpan S. Astuacacni.
Hakobyan, G. A. 1974. Nerk‘in Baseni azgagrut‘yunə ew banahyusut‘yunə [Ethnography
and folklore of Lower Basen]. Yerevan: “Hayastan”.
Halaǰyan, Geworg. 1973. Dersimi hayeri azgagrut‘yunə [Ethnography of the Armenians of
Dersim]. Hay azgagrut‘yun ew banahyusut‘yun 5. Yerevan: Academy Press.
HamšHamš. 2007. Hamšēn ew Hamšēnahayut‘iwn (gitažoɫovi niwt‘er). Yerevan, Pēyrut‘:
Hamazgayin Vahē Sēt‘ean tparan.
Hananyan, Gaɫt‘uhi Karapeti. 1995. Svediayi barbaṙə (Xtrbeki xosvack‘)[The dialect of
Svedia (Subdialect of Xtrbek)]. Yerevan: University Press.
Haneyan, A. N. 1978. Tigranakerti barbaṙə [The dialect of Tigranakert]. Yerevan: Academy
Press.
Haneyan, A. N. 1982. Edesiayi barbaṙə [The dialect of Edesia]. Hayereni barbaṙagitakan
atlas: usumnasirut‘yunner ew nyut‘er 1. 184–240.
Haneyan, A. N. 2010. Jayneɫ šnč‘eɫnerə hayereni barbaṙnerum [The voiced aspirates in the
Armenian dialects]. (Part 1). In Hayereni barbaṙagitakan atlas: usumnasirut‘yunner
ew nyut‘er 3, 166–231. Yerevan: “Nairi”.
Hanusz, Johann. 1887–1889. Beiträge zur armenischen Dialectologie. Wiener Zeitschrift

für die Kunde des Morgenlandes (WZKM) 1, 1887: 181–197, 281–313; WZKM 2, 1888:
63–70, 124–132, 291–308; WZKM 3, 1889: 38–50.
Harut‘yunyan, Lewon. 1991. Nšxarner Arc‘axi banahyusut‘yan [Fragments of the Arcʻax
folklore]. Yerevan: University Press.
Hayk, Vahē. 1959. Xarberd ew anor oskeɫēn daštə: yušamatean azgayin, patmakan, mša-
kut‘ayin ew azgagrakan [Xarberd and its golden field]. New York.
Haykuni, S. 1892. Trapizōni Zefanoz, Kṙōmila hay giwɫōrēic‘ sovorut‘iwnner u baṙbaṙə
[The customs and dialect of the Armenian villages of Trapizon: Zefanoz and Kṙōmila].
Ararat 25(1). 428–447.
HayLezBrbBaṙ 1–7. 2001–2012. Hayoc‘ lezvi barbaṙayin baṙaran [Dialectal dictionary of
Armenian] (ed. A. Sargsyan et al.), vol.1, 2001 [letters a-z]; vol. 2, 2002 [letters ē-c];
vol. 3, 2004 [letters k-č]; vol. 4, 2007 [letters m-č‘];vol. 5, 2008 [letters p-s]; vol. 6,
2010 [letters v-pʻ]; vol. 7, 2012 [letters pʻ-f]. Yerevan: Academy Press “Gitut‘yun”.
HayŽoɫHek‘ 1. 1959. Hay žoɫovrdakan hek‘iat‘ner [Armenian folk tales]. Yerevan: Acad-
emy Press.
Hewsen, Robert H. 2001. Armenia: a historical atlas. Chicago and London: The University
of Chicago Press.
Hovannisian, Richard G. (ed.). 2000. Armenian Van/Vaspurakan. Costa Mesa, California:
Mazda Publishers. (UCLA Armenian history and culture series, Historic Armenian cit-
ies and provinces, 1).
Hovannisian, Richard G. (ed.). 2001. Armenian Baghesh/Bitlis and Taron/Mush. Costa
Mesa, California: Mazda Publishers. (UCLA Armenian history and culture series, His-
toric Armenian cities and provinces, 2).
Hovannisian, Richard G. (ed.). 2009. Armenian Pontus: The Trebizond-Black Sea commu-
nities. Costa Mesa, California: Mazda Publishers. (UCLA Armenian history and culture
series, ‘Historic Armenian cities and provinces, 8).
Hovsepʻyan, Hovhannes. 2009. Ɫaradaɫi hayerǝ [The Armenians of Ɫaradaɫ. Vol. 1: Azga-
grutʻyun (Suren Hobosyan, ed.); vol. 2: Banahyusutʻyun (Svetlana Vardanyan & Ēst‘er
Xemč‘yan, eds.). Yerevan: Academy Press. (Azgagrakan ew banahyusakan žaṙangutʻ-
yun, 1–2).
Hovsep‘yan, Liana S. 1997. ŽG dari hayeren jeṙagreri hišatakaranneri lezun [The language
of the 13th-century colophons of Armenian manuscripts]. Yerevan. Academy Press &
Matenadaran. (Accessing Armenian colophons, 2).
Hovsep‘yan, Liana & Gayane Geworgyan. 2013. Armjanskie dialekty: obščij obzor [The
Armenian dialects: General overview]. In Yuri B. Koryakov & Andrej A. Kibrik (eds.),
Languages of the World: Relict Indo-European languages of Western and Central Asia,
320–334. Moscow: Academia.
Hovsep‘yan, N. E. 1966. Ozmi barbaṙi mi k‘ani aṙanjnahatkut‘yunneri masin [On a few
peculiarities of the dialect of Ozim]. PBH 1966, Nr 2. 226–238.
J̌ ahukyan, G. B. 1972. Hay barbaṙagitut‘yan neracut‘yun: vičakagrakan barbaṙagitut‘yun
[Introduction to Armenian dialectology: statistic dialectology]. Yerevan: Academy
Press.
J̌ ahukyan, G. B. 1985 (Djahukian), Die archaischen Schichten armenischer Dialektwörter.
In U. Pieper & G. Stickel (eds.), Studia linguistica, diachronica et synchronica Werner
Winter sexagenario anno MCMLXXXIII, 151–160. Berlin & New York: Mouton de
Gruyter.
J̌ ahukyan, G. B. 1987. Hayoc‘ lezvi patmut‘yun: naxagrayin žamanakašrǰan [History of the

Armenian language: preliterary period]. Yerevan: Academy Press.
J̌ alašyan, G. S. 2012. Nor Naxiǰewani barbaṙi bac‘atrakan baṙaran [Defining dictionary of
the Nor Naxiǰewan dialect]. Rostov-na-Donu: “Starye russkie”.
JaynHamš 1–4. 1971–1999. Jayn hamšenakan [Hamšen voice]. Vol. 1 (Sero Xanzadyan,
eds., Yerevan: “Hayastan”, 1971); vol. 2 (A. Zeyt‘unyan, eds., Yerevan: “Sovetakan
groɫ”, 1979); vol. 3 (ed. by Sergey Vardanyan, Yerevan: “Xorhrdayin groɫ”, 1989); vol.
4 (Sergey Galstyan & Lyudvig Karapetyan, eds., Yerevan: “Vernatun”, 1999).
K‘alant‘ar, A. 1895. Sasun. [T‘iflis].
Kapancjan, Grigor. 1975. Istoriko-lingvističeskie raboty [Historical-linguistic studies].
(Vol. 2). Yerevan: Academy Press.
Kapri, Freiherrn M. 1913. Zwei Vorträge über die historische und kulturelle Bedeutung des
armenischen Volkes: mit einem Vorwort des Wiener armenophilen Komitees und einem
Anhang des Patriarchats-Memorandums, der Bevölkerungsstatistik und einer geogra-
phischen Karte von Armenien. Wien: Mechitharisten.
Karapetyan, Eɫiazar Karapeti. 1962. Sasun: azgagrakan nyut’er [Sasun: ethnographic ma-
terials]. Yerevan: State Press of Armenia (HayPetHrat).
Karayan, Sarkis Y. 2000. Demography of Van province, 1844–1914. In Richard G. Hov-
annisian (ed.), Armenian Van/Vaspurakan, 195–208. Costa Mesa, California: Mazda
Publishers.
Karst, Josef (Joseph). 1901. Historische Grammatik des Kilikisch-Armenischen. Strassburg:
Verlag von Karl J. Trübner ([1970] Photomechanischer Nachdruck. Berlin: Walter de
Gruyter & Co.)
Karst, Josef (Joseph). 2002. Kilikyan hayereni patmakan k‘erakanut‘yun [Historical gram-
mar of Cilician Armenian]. (ModArm. translation from Karst 1901 by Dmitri Alivn-
yak). Yerevan: University Press.
Kaycoṙik. 1899. P‘ayal-p‘ašan/aɫan u P‘arušan. Biwrakn 17. 20–21, 119–120, 298.
K‘ēč‘ean, Aṙak‘ēl & Mkrtič‘ Parsamean. 1952. Akn ew Aknc‘ik‘ (initiated and compiled by
Aṙak‘ēl K‘ēč‘ean; composed and edited by Mkrtič‘ Parsamean). Paris: Compatriotic
Association of Eguen.
Khachaturian, Amalya. 1983. The nature of voiced aspirated stops and affricates in Arme-
nian dialects. Annual of Armenian linguistics 4. 57–62.
Kortlandt, Frederik H. H. 1980. On the relative chronology of Armenian sound changes.
In John Greppin (ed.), Proceedings of the First international symposium on Armenian
linguistics (11–14 July, 1979), 97–106. Delmar, New York: Caravan Books.
Kortlandt, Frederik H. H. 2003. Armeniaca: comparative notes – with an appendix on the
historical phonology of Classical Armenian by Robert S. P. Beekes. Ann Arbor: Cara-
van Books.
Kostandyan, D. M. 1979. Erznkayi barbaṙə [The dialect of Erznka]. Yerevan: Academy
Press.
Kostandyan, D. M. 1982. Xarberdi barbaṙi hnč‘yunabanakan ew jewabanakan himnakan
aṙanjnahatkut‘yunnerə [The main phonological and morphological peculiarities of the
dialect of Xarberd]. In Hayereni barbaṙagitakan atlas: usumnasirut‘yunner ew nyut‘er
1, 241–272. Yerevan: Academy Press.
Kostandyan, D. M. 1985. Xotrǰuri barbaṙayin yurahatkut‘yunneri lezvaašxarhagrakan
bnut‘agirə [The linguistic-geographical characteristics of the peculiarities of the Xotrǰur
dialect]. In Hayereni barbaṙagitakan atlas: usumnasirut‘yunner ew nyut‘er 2, 46–68.
Kuzub, Mixayil. 2007. Hamšēnc‘i k‘ristoneaneri hek‘iat‘ayin banahiwsut‘ean diwabanakan

patkerac‘umnerə [The demonological notions of folklore of the Christians of Hamšen].
HamšHamš: 168–180.
Ɫafadaryan, K. G. 1967. Nyut‘er Axalc‘xayi hay hamaynk‘i patmut‘yan veraberyal [Mate
rials on the history of the Armenian community of Axalcʻxa]. PBH 1967, Nr 1. 117–124.
Lalayean, E. 1892. (Lalayeanc‘), J̌awaxk‘i burmunk‘ [Scent of J̌ awaxk‘]. T‘iflis: Tparan
“Arōr” T. Nazarean. (Žoɫovrdagrakan niwt‘er, 2).
Lalayean, E. 1897. J̌ awaxk‘ (reprinted from “Azgagrakan handēs”). T‘iflis: Tparan M. D.
Ṙotineanc‘i.
Lalayean, E. 1897a. Varanda: niwtʻer apagay usumnasirutʻean hamar [Varanda: materials
for future studies]. Azgagrakan handēs 2. 4–244.
Lalayean, E. 1903. Borč‘alui gawaṙ [District of Borčʻalu]. Azgagrakan handēs 10. 113–268.
Lalayean, E. 1910. Vaspurakan: azgagrutʻiwn. Azgagrakan handēs, year 12, book 20. 116–
196.
Lalayean, E. 1912–14. (LalVasp 1–2), Vaspurakan: banahyusut‘iwn (reprinted from
AzgHand). Vol. 1, 1912, T‘iflis: N. Aɫaneanc‘; vol. 2, Šataxi barbaṙov, 1914, T‘iflis:
“Ēsperanto”.
Lalayean, E. 1914. Vaspurakan: hawatkʻ. Azgagrakan handēs, year 15, book 25. 21–60.
Lalayean, E. 1915. (Lalajanc), Vanskij vilajet: Vaspurakan. SMOMPK 44(2). 1–84.
Lalayean, E. 1917a. Muš-Tarōn: azgagrut‘iwn. Azgagrakan handēs, year 16, book 26.
Tʻiflis [1916]. 149–194.
Lalayean, E. 1917b. Vaspurakan: hawatkʻ. Azgagrakan handēs, year 16, book 26, Tʻiflis
[1916]. 195–210.
Lalayean, E. 1983–2004. Erker. Vol. 1, 1983 (J̌ avaxk‘); vol. 2, 1988 (Arc‘ax); vol. 3, 2004
(Borč‘alui gavaṙ). Yerevan: Academy Press.
Ɫaribyan, Ararat Sahaki. 1939. Hayeren barbaṙneri mi nor čyuɫ [A new branch of Armenian
dialects]. Gitakan ašxatut‘yunner Yerewani petakan hamalsarani 11. 23–187.
Ɫaribyan, Ararat Sahaki. 1953. Hay barbaṙagitut‘yun: hnč‘yunabanut‘yun ew jewaba-
nut’yun [Armenian dialectology: phonology and morphology]. Yerevan: Armenian
State External [heṙaka] Pedagogical Institute Press.
Ɫaribyan, Ararat Sahaki. 1955. Hayereni mi k‘ani norahayt barbaṙner [A few newly-found
dialects of Armenian]. In Gitakan ašxatut‘yunneri žoɫovacu (Erevani X. Abovyani an-
van Haykakan petakan mankavaržakan institut) 4, 195–241. Yerevan: Academy Press.
Ɫaribyan, Ararat Sahaki. 1958. Hayereni norahayt barbaṙneri mi nor xumb [A new group
of Armenian dialects]. Yerevan: Academy Press.
Ɫaziyan, Alvard S. 1983. Arc‘ax. Hay azgagrut‘yun ew banahyusut‘yun 15. Yerevan: Acad-
emy Press.
Ɫaziyan, Alvard & Svetlana Vardanyan. 2004. Bak‘vi hayoc‘ banahyusut‘yunə. Yerevan:
“Zangak”.
Lisic‘yan, Step‘an D. 1969. Zangezuri hayerə [The Armenians of Zangezur]. Yerevan:
Academy Press.
Lisic‘yan, Step‘an D. 1981. Leṙnayin Ɫarabaɫi hayerə: azgagrakan aknark [The Armenians
of the Mountainous Ɫarabaɫ: ethnographical survey]. Hay azgagrut‘yun ew banahy-
usut‘yun 12. 7–84.
Lusenc‘, Ašot Ɫukasi. 1982. Areši barbaṙə [The dialect of Areš]. Yerevan: Academy Press.
Madat‘yan, K‘. M. 1985. Alaškerti xosvack‘ə (Aparan-Aragaci tarack‘) [The subdialect of
Alaškert]. Yerevan: Academy Press.
Mak‘sudeanc‘, Mesrop. 1910. Aknay barbaṙi p‘ox aṙac baṙerə ōtar lezuneric‘ [The loan-
words of the dialect of Akn]. Handēs amsōreay. 57–63.
Mak‘sudeanc‘, Mesrop. 1911. Zeyt‘uni barbaṙi jaynakan drut‘iwnə [The phonology of the
dialect of Zeytʻun]. Yušarjan 1911. 319–323.
Malxasyanc‘, Step‘an. 1958. Aṙakner avandut‘yunner anekdotner (Axalc‘xayi barbaṙov)
[Proverbs, traditional stories and anecdotes (in the dialect of Axalcʻxa)]. Yerevan:
Academy Press.
Margaryan, Alek‘sandr S. 1975. Gorisi barbaṙə [The dialect of Goris]. Yerevan: University
Press.
Markosyan, Ṙazmik Ararati. 1989. Araratyan barbaṙ [The dialect of Ararat]. Yerevan:
“Luys”.
Martirosyan, Hrach Kayosiki. 2010. Etymological dictionary of the Armenian inherited lex-
icon. Leiden & Boston: Brill. (Leiden Indo-European Etymological Dictionary Series
8).
Martirosyan, Hrach Kayosiki. 2013.Armj anskie dialekty: xarakteristika otdel’nyx diale-
ktov [Armenian dialects: Description of individual dialects]. In Yuri B. Koryakov &
Andrej A. Kibrik (eds.), Languages of the World: Relict Indo-European languages of
Western and Central Asia, 334–385. Moscow: Academia.
Maxudianz, Mesrob. 1911. Le parler arménien d’Akn (quartier bas). Paris: Librairie Paul
Geuthner.
Meillet, A. 1936. Esquisse d’une grammaire comparée de l’arménien classique. Wien: Me-
chitharisten.
Melik‘-Šahnazareanc‘, Kostandin K. (Tmblač‘i Xač‘an). 1907–1908 (1–2). Zuṙna-tmbla.
Vol. 1, 1907; vol. 2, 1908. Vaɫaršapat: Elēk‘trašarž Tparan M. At‘oṙoy S. Ēǰmiacni.
Melik‘ean, Eɫišē Y. 1964. Hark‘-Xnus. Ant‘ilias, Lebanon: Tparan Kat‘oɫikosut‘ean Hayoc‘
Meci Tann Kilikioy.
Mežunc‘, B. X. 1989. Šamšadin-Diliǰani xosvack‘ə [The subdialect of Šamšadin-Diliǰan].
Yerevan: University Press.
Mkrtč‘yan, Garik. 2015. Barbaṙagitutʻyan himunkʻner [Foundations of dialectology].(Lec-
tures). Yerevan: “Daso Print”.
Mkrtč‘yan, H. M. 1952. Karno barbaṙə [The dialect of Karin]. Yerevan: Academy Press.
Mkrtč‘yan, Nerses Alek‘sani. 1971. Burduri barbaṙə [The dialect of Burdur]. Yerevan:
Academy Press.
Mkrtč‘yan, Nerses Alek‘sani. 1995. Sivri-Hisari gaɫt‘ōǰaxi ew barbaṙi masin [On the com-
munity and dialect of Sivri-Hisar]. PBH 1995, Nr 2. 205–214.
Mkrtč‘yan, Nerses Alek‘sani. 2006. Anatoliayi norahayt hay barbaṙnerə ew banahy-
usut‘yunə [The newly-found Armenian dialects and folklore of Anatolia]. Yerevan:
“Ōrenk‘ ew irakanut‘yun”. (Heɫinakayin hratarakut‘yun).
Movsisyan, Sahak (Bense). 1972. Hark‘ (Mšo Bulanəx). Hay azgagrut‘yun ew banahy-
usut‘yun 3. Yerevan: Academy Press.
Msereanc‘, Lewon. 1897–01. (Mserianc), Ètjudy po armjanskoj dialektologii, 1–2 (on the
dialect of Muš). Moscow: University Press.
Msereanc‘, Lewon. 1899. Hayerēn barbaṙaxōsut‘iwn [Armenian dialectology]. (Armenian
transl. from Russian by Gabriēl Mēnēvišean). Vienna: Mxit‘arean Tparan. (Azgayin
Matenadaran, 30).
Muradean, Y. 1901. Hamšēnc‘i hayer [Hamšen Armenians]. T‘iflis: K. Martiroseanc‘i
tparan (reprinted from Azgagrakan handēs).
Muradyan, H. D. 1960. Karčewani barbaṙə [The dialect of Karčewan]. Yerevan: Academy

Press.
Muradyan, H. D. 1967. Kak‘avaberdi barbaṙə [The dialect of Kakʻavaberd]. Yerevan:
Academy Press.
Muradyan, H. D. 1982. Hayoc‘ lezvi patmakan k‘erakanut‘yun, vol. 1. Hnč‘yunabanut‘yun
[Historical grammar of Armenian: Phonology]. Yerevan: Academy Press.
Muradyan, M. H. 1962. Šataxi barbaṙə [The dialect of Šatax]. Yerevan: University Press.
Muradyan, M. H. 1975. Baṙagitut‘yun [Lexicology]. In Hayoc‘ lezvi patmakan k‘eraka-
nut‘yun, vol. 2, 5–158. Yerevan: Academy Press.
Muradyan, M. H. 1982. Urvagic Moksi barbaṙi [An outline of the dialect of Moks]. In
Hayereni barbaṙagitakan atlas: usumnasirut‘yunner ew nyut‘er 1, 108–183. Yerevan:
Academy Press.
Mxit‘areanc‘, A. 1901. P‘šrank‘ Širaki ambarneric‘ [Fragment from the storehouses of
Širak]. (Ēminean azgagrakan žoɫovacu, vol. 1). Moscow, Alek‘sandrapol.
Nawasardeanc‘, Tigran. 1903. Baṙgirk‘ Araratean barbaṙi [Dictionary of the dialect of Ar-
arat]. T‘iflis: Tparan Movsēs Vardaneanc‘i.
Nždehean, K‘. 1899. Alaškerti banawor grakanut‘iwn [Oral literature of Alaškert]. Az-
gagrakan handēs 4th year, 5th book, 1899, Nr 1. 185–199.
Nždehean, K‘. 1902. Paṙawašunč‘. Azgagrakan handēs 7th year, 9th book. 263–271.
Nždehean, K‘. 1908. Alaškerti banawor grakanut‘iwnic‘ [From the oral literature of Alašk-
ert]. Azgagrakan handēs 11th year, 18th book, 1908, Nr 2. 33–36.
Nždehean, K‘. 1910. Alaškerti banawor grakanut‘iwnic‘ [From the oral literature of Alašk-
ert]. Azgagrakan handēs 12th year, 20th book, 1910, Nr 2. 108–111.
Orbeli, I. A. 2002. Izbrannye trudy v dvux tomax [Selected writings in two volumes], vol. 2.
Yerevan: Academy Press “Gitutjun”.
Ōtean-Gasbarean, Alis. 1968. Patmagirk‘ Ankiwrioy ew Stanozi hayoc‘ [Chronicle of the
Armenians of Ankiwria and Stanoz]. Pēyrut‘: Tp. Tōnikean. [Alice Odian-Kasparian,
The history of the Armenians of Angora and Stanos. Beirut: Doniguian Press].
Patkanov, K. 1869. Izsledovanie o dialektax armjanskago jazyka: filologičeskij opyt [Study
on Armenian dialects]. Sankt-Peterburg: Tipografija Imperatorskoj Akademii Nauk.
Patkanov, K. 1875. Materialy dlja izučenija armjanskix narečij [Materials for the study of
Armenian dialects]. Vol. 1: Govor Naxičevanskij; vol. 2: Mušskij dialekt. Sanktpeter-
burg: Tipografija Imperatorskoj Akademii Nauk.
Petermann, H. 1867a. Über den armenischen Dialect von Agulis nach einer Mittheilung des
Herrn v. Patkanoff (Patkanean). Monatsberichte der Königlich Preussischen Akademie
der Wissenschaften zu Berlin: Sitzung der philosophisch-historischen Klasse vom 26
Nov. 1866. 727–741.
Petermann, H. 1867b. Über den Dialect der Armenier von Tiflis. Philologische und his-
torische Abhandlungen der Königlichen Akademie der Wissenschaften zu Berlin 1866
(gelesen am 16. Mai 1861). 57–87.
Petoyan, Vardan A. 1954. Sasuni barbaṙə [The dialect of Sasun]. Yerevan: Academy Press.
Petoyan, Vardan A. 1965. Sasna azgagrut‘yunə [The ethnography of Sasun]. Yerevan:
Academy Press.
Pisowicz, Andrzej. 1997. Consonant shifts in Armenian dialects during the post-Classical
period revisited. In N. Awde (ed.), Armenian perspectives, 215–230. Richmond & Sur-
rey: Curzon.
Pisowicz, Andrzej. 2003. “Muslim” words in the Polish subdialect of the Ardeal dialect of
Armenian. Annual of Armenian linguistics 22–23. 29–39.
P‘olatean, Andranik L. 1969. Patmut‘iwn hayoc‘ Arabkiri [History of the Armenians of

Arabkir]. New York: Baykar Press (Arabkir Union).
Pōɫosean, Y. P. 1942. Hačəni əndhanur patmut‘iwnə ew šrǰakay Gōzan-Taɫi giwɫerə [Gen-
eral history of Hačən]. Los Angeles: Bozart Press (Pōzačean).
Poɫosyan, A. M. 1965. Hadrut‘i barbaṙə. Yerevan: Academy Press.
P‘ork‘šeyan, X. A. 1971. Nor Naxiǰewani hay žoɫovrdakan banahyusut‘yunə [The folklore
of Nor Naxiǰewan]. Hay azgagrut‘yun ew banahyusut‘yun 2. Yerevan: Academy Press.
PtmSivHisHay. 1965. Grigor Tēr Yovhannēsean (ed.), Patmagirk‘ Sivri-Hisari hayoc‘:
teɫagrakan, patmakan ew azgagrakan [Chronicle of the Armenians of Sivri-Hisar: top-
ographical, historical and ethnographical], (initiated by Mihran Nersēsean). Pēyrut‘:
Tparan Mšak.
Šahpazean, Y. 1913. Bun haykakan steɫcagorcut‘yun [Genuine Armenian creation]. Ɫalat‘ia:
Tparan “Šant‘”.
Sarafean, Gēorg A. 1953. Patmut‘iwn Ant‘ēpi hayoc‘ [Armenian history of Ayntab], vol. 2.
Los Angeles, California.
Sargisean, Yarut‘iwn K‘hn. (Alewor). 1932. Balu: ir sovoroyt‘nerə, krt‘akan u imac‘akan
vičakə ew barbaṙə [Balu: its customs, educational and intellectual state and the dialect].
Cairo: “Sahak-Mesrop”.
Sargseanc‘, Sargis. 1883. Agulec‘oc‘ barbaṙə (zōkeri lezun): lezuabanakan hetazōtut‘iwn
[The dialect of Agulis: a linguistic study]. Vols. 1–2. Moscow: Skoropečatnja O. Ger-
beka.
Sargsyan, Armen Yuriki. 2013. Ɫarabaɫi barbaṙi baṙaran [Dictionary of the dialect of Ɫara-
baɫ]. Yerevan: “Edit Print”.
Sedrakean, Aristakēs. 1874. K‘nar mšec‘woc‘ ew vanec‘woc‘ (minč‘ew ayžm č‘hratarakuac
erger) [Lyre of inhabitants of Muš and Van: unpublished songs]. Vaɫaršapat: Tparan
Srboy Kat‘uɫikē Ēǰmiacni.
Šērenc‘, Gēorg G. 1885–99. (VanSaz 1–2), Vanay saz. Vol. 1. Hawak‘acoyk‘ Vaspurakani
žoɫovrdakan ergeri, hēk‘eat‘neri, aṙacneri ew hanelukneri (teɫakan barbaṙov). T‘iflis:
Tparan Yovhannēs Martiroseanc‘i, 1885. Vol. 2. Hawak‘acoyk‘ Vaspurakani žoɫovrda-
kan ergeri, hēk‘eat‘neri, aṙacneri, hanelukneri, ōrhnut‘iwnneri ew anēck‘neri (teɫakan
barbaṙov). T‘iflis: Tparan K. Martiroseanc‘i, 1899.
Simonian, Hovann H. (ed.). 2007. The Hemshin: History, society and identity in the High-
lands of Northeast Turkey. London & New York: Routledge.
Simonyan, N. M. 1979. Hay barbaṙneri hndevropakan hnabanut‘yunnerə [Indo-European
archaisms of the Armenian dialects]. In Hayoc‘ lezvi hamematakan k‘erakanut‘yan
harc‘er, 188–248. Yerevan: Academy Press.
Srapean, Lewon. 1960. K‘ɫi: Teɫagrakan ew azgagrakan [Kʻɫi: topography and ethnogra-
phy]. Ant‘ilias, Lebanon: Tparan Kat‘oɫikosut‘ean Hayoc‘ Meci Tann Kilikioy.
Sruanjteanc‘, Garegin. 1978–1982. (Srvanjtyanc‘) Erker. Vol. 1: 1978; vol. 2: 1982. Yere-
van: Academy Press.
Svazlyan, Veržine Gaṙniki. 1984. Musa leṙ. Hay azgagrut‘yun ew banahyusut‘yun 16. Ye-
revan: Academy Press.
Svazlyan, Veržine Gaṙniki. 1994. Kilikia: Arewmtahayoc‘ banavor avandut‘yunə [Cilicia:
The oral tradition of Western Armenians]. Yerevan: Academy Press “Gitut‘yun”.
Svazlyan, Veržine Gaṙniki. 2000. Polsahayoc‘ banahiwsut‘iwnə [Folklore of the Armenians
of Polis]. Yerevan: Academy Press “Gitut‘yun”.
Tarōnean, Eduard. 1961. Baɫēši nahangi Xult‘ik giwɫi barbaṙə [The dialect of the Xultʻik
village in the province of Baɫēš]. Ant‘ilias, Lebanon: Tparan Kat‘oɫikosut‘ean Hayoc‘
Meci Tann Kilikioy.
T‘emurčyan, V. S. 1970. Gamirk‘i hayerə [The Armenians of Gamirkʻ]. Hay azgagrut‘yun
ew banahyusut‘yun 1. 7–140.
Tēōvlēt‘ean, Sargis X. 1954. Yušer Šapin Garahisarēn [Memories from Šapin Garahisar].
P‘ariz: “Le soleil”.
Tēr-Aɫek‘sandrean, Gēorg. 1885. T‘iflisec‘oc‘ mtawor keank‘ə [Intellectual life of inhabit-
ants of Tʻiflis]. T‘iflis: Yovhannēs Martiroseanc‘i tparan.
Tēr-Pōɫosean, Gr. 1921–22. Havariki barbaṙə [The dialect of Havarik]. Banber Hayastani
Gitakan Instituti. Vol. 1–2. 160–177.
Tēr-Sargsencʻ, Aristakēs vardapet. 1875. Panduxt vancʻin [The emigrant from Van].
K. Pōlis: Tntesean.
Tēr-Vardanean, Gurgēn. 1968. Erznka-Kamax gawaṙabarbaṙə ew azgagrakan yušer [The
dialect of Erznka-Kamax and ethnographical memories. Erusaɫēm.
Tēr-Yakobean, Yakob. 1960. Partizakə xatutik. Yaweluacabar – Manušak ew ir hēk‘eat‘nerə.
Paris.
Tomaschek, Wilhelm. 1896. Sasun und das Quellengebiet des Tigris: historisch-topogra-
phische Untersuchung. Sitzungsberichte der philosophisch-historischen Classe der
Kaiserlichen Akademie der Wissenschaften. 133.4. Wien.
Tomson, A. 1887. Lingvističeskija izsledovanija [Linguistic studies], vol. 1. S. Peterburg:
Tipografija E. Evdokimova.
Tomson, A. 1890. Istoričeskaja grammatika sovremennago armjanskago jazyka goroda
Tiflisa [Historical grammar of the modern Armenian language of the city of Tiflis].
S.-Peterburg: Tipografija E. Evdokimova.
T‘oṙlak‘yan, Barunak G. 1981. Hamšenahayeri azgagrut‘yunə [Ethnography of the Hamšen
Armenians]. Hay azgagrut‘yun ew banahyusut‘yun 13. Yerevan: Academy Press.
T‘oṙlak‘yan, Barunak G. 1986. Nšxarner Hamšeni ew Trapizoni banahyusut‘yan [Frag-
ments of folklore of Hamšen and Trapizon]. Yerevan: University Press.
T‘umayean, Karapet. 1930. Marzuani gawaṙabarə [The dialect of Marzuan]. Vienna: Mx-
it‘arean tparan.
Tʻuršyan, Harutʻyun. 2018. Baṙaran Vani barbaṙi [Dictionary of Van dialect] (Dr. Haykanush
Mesropyan, ed.). Yerevan: “Gitutyun” publishing house of NAS RA.
Vardanyan, Sergey. 2009. Kronap‘ox hamšenahayeri barbaṙə, banahyusut‘yunə ew er-
garvestə: nyut‘er ew usumnasirut‘yunner [The dialect, folklore and song art of con-
verted Armenians of Hamšen: materials and studies]. Yerevan: University Press.
Vartapetjan, N. A. 1962. Sjunikskij dialekt [The dialect of Syunikʻ]. (PhD dissertation).
Vaux, Bert. 1992. Adjarian’s Law and consonantal ATR in Armenian. In Proceedings of the
Fourth International Conference on Armenian Linguistics, Cleveland State University,
Cleveland, Ohio, September 14–18, 1991, 271–293. (John A. C. Greppin, ed.). Delmar
& New York: Caravan Books.
Vaux, Bert. 1997. The phonology of voiced aspirates in the Armenian dialect of New Julfa.
In N. Awde (ed.), Armenian Perspectives, 231–248. Richmond, Surrey: Curzon.
Vaux, Bert. 1998. The phonology of Armenian. Oxford: Clarendon Press.
Vaux, Bert. 1999–2000. Notes on the Armenian dialect of Ayntab. Annual of Armenian
linguistics 20. 55–82.
Vaux, Bert. 2000–2001. Hemshinli: The forgotten Black Sea Armenians. Journal of Arme-
nian studies 6(2). 47–71.
Vaux, Bert. 2001. The Armenian dialect of Aslanbeg. Annual of Armenian linguistics 21.
31–64.
Vaux, Bert. 2007. Homshetsma: The language of the Armenians of Hamshen. In Hovann H.
Simonian (ed.), The Hemshin: History, society and identity in the Highlands of North-
east Turkey, 257–278. London & New York: Routledge.
Viredaz, Rémy. 2003. The Great Armenian Puzzle is not thorny. Annual of Armenian lin-
guistics 22–23. 59–81.
Weitenberg, Jos J. S. 1985. Additional ‑n in Armenian. Annual of Armenian linguistics 6.
101–106.
Weitenberg, Jos J. S. 1986. Additional h‑, initial y‑ and Indo‑European *y‑ in Armenian.
In Maurice Leroy & Francine Mawet (eds.), La place de l’arménien dans les langues
indo-européennes, 90–101. Louvain: Peeters.
Weitenberg, Jos J. S. 1992. An early dialectal phonetic split in Classical Armenian: the cases
of t‘awt‘ap‘em and p‘lanim. In John A. C. Greppin (ed.), Proceedings of the Fourth
International Conference on Armenian Linguistics, Cleveland State University, Cleve-
land, Ohio, September 14–18, 1991, 295–318. Delmar, New York: Caravan Books.
Weitenberg, Jos J. S. 1993. Initial y- in Classical Armenian and the early development of
Armenian dialects. In L. Hovsepian, N. Parnassian & S. Simonian (eds.), Proceedings
of the Second International Symposium on Armenian Linguistics, 21–23 September
1987, 208–217. Yerewan: Armenian Academy Press.
Weitenberg, Jos J. S. 1996. On the early development of the Armenian dialects. In Dora
Sakayan (ed.), Proceedings of the Fifth International Conference on Armenian Linguis-
tics, McGill University of Montreal, Quebec, Canada May 1–5, 1995, 93–118. Delmar
& New York: Caravan Books.
Weitenberg, Jos J. S. 1999–2000. On the early development of Armenian dialects – 2. The
monophthongization of ay. Annual of Armenian linguistics 20. 1–26.
Weitenberg, Jos J. S. 2001. Thoughts on early Armenian accentuation. Annual of Armenian
linguistics 21. 65–73.
Weitenberg, Jos J. S. 2002. Aspects of Armenian dialectology. In Jan Berns & Jaap van
Marle (eds.), Present-day dialectology: Problems and findings, 141–157. Berlin & New
York: De Gruyter Mouton. (Trends in linguistics: Studies and monographs, 137).
Winter, Werner. 1966. Traces of early dialectal diversity in Old Armenian. In H. Birnbaum
& J. Puhvel (eds.), Ancient Indo-European dialects: proceedings of the Conference
on Indo-European linguistics held at the University of California, Los Angeles, 1963,
201–211. Berkeley & Los Angeles: University of California Press.
Xač‘atryan, A. E. 1984. Jayndarji hin erewuyt‘ner hay barbaṙnerum [Old phenomena of
ablaut in Armenian dialects].In International symposium on Armenian linguistics (Ye-
revan, September 21–25, 1982): Reports, 318–326. Yerevan: Academy Press.
Xač‘atryan, H. S. 1985. Šapin Garahisari barbaṙ (holovumə ew xonarhumə) [The dialect
of Šapin Garahisar: declension and conjugation]. In Hayereni barbaṙagitakan atlas:
usumnasirut‘yunner ew nyut‘er 2, 136–159. Yerevan: Academy Press.
Xač‘atryan, Ṙaisa. 1999. T‘alin. Hay azgagrut‘yun ew banahyusut‘yun 20. Yerevan: Acad-
emy Press “Gitut‘yun”.
Xač‘atryan, V. 2004. Vardenisi (Diadini) barbaṙə [The dialect of Vardenis (Diadin)]. Yere-
van: “Nairi”.
Xemč‘yan, Ēst‘er H. 2000. Tavuš. Hay azgagrut‘yun ew banahyusut‘yun 21. Yerevan:

Academy Press “Gitut‘yun”.
Xemč‘yan, Ēst‘er H. 2008. Iǰewan (Joropʻor). Hay azgagrut‘yun ew banahyusut‘yun 25.
Yerevan: Academy Press “Gitut‘yun”.
Yovakimean, Yovakim (Aršakuni). 1967. Patmut‘iwn Haykakan Pontosi. Pēyrut‘: “Mšak”.
Yovnanean, Ɫewond. 1897. Hetazōtut‘iwnk‘ naxneac‘ ṙamkōrēni vray: usumnasirut‘iwnk‘
ew k‘aɫuack‘ner [Studies on the dialect of ancients: studies and excerpts]. 2. vols. Ven-
ice: S. Lazar. (Azgayin Matenadaran, 23–24).
Yovsēp‘eanc‘, Garegin Sarkawag. 1892. P‘šrank‘ner žoɫovrdakan banahyusut‘iwnic‘
[Fragments from folklore]. T‘iflis: M. Šarajē.
YušamXotorǰ. 1964. Yušamatean Xotorǰuri [Monument of Xotorǰur] (H. Yarut‘iwn v. Hu-
lunean & H. Matt‘ēos v. Hačean, eds.). Wien: Mechitharisten.
Zak‘aryan, Mari G. 2008. Agulisi barbaṙə: ink‘nut‘yan bac‘ahaytum [The dialect of Agulis:
discovery of the identity]. Yerevan: “Iravunk‘”.
2.3. Northern Kurdish (Kurmanjî)
Geoffrey Haig
1. Introduction
This chapter presents an overview of Northern Kurdish, or Kurmanjî (in various

spellings), as it is spoken in eastern Anatolia. The material is largely based on two
joint publications with Ergin Öpengin: Öpengin and Haig (2014), and Haig and
Öpengin (2018), to which the reader is referred for further details. The variety
of Kurmanjî spoken in northern Iraq (Behdinī, under various spellings) is treated
in Haig (this volume, chapter 3.3, §4). For the purposes of this chapter, “eastern
Anatolia” is taken to coincide with the the eastern part of Turkey, extending south-
eastward from a line beginning from Sivas, but excluding the Mediterranean and
Black Sea coastal regions.
Eastern Anatolia is the homeland of the majority of Kurmanjî speakers, but
their traditional settlement region overlaps into Syria, northern Iraq and western
Iran (see Figure 1). Various estimates put the number of Kurmanjî speakers in
Turkey at between 8 and 15 million, but any figures must be treated with caution,
due to differences in definitions and methodologies. Following decades of violence
in the region, there has been a large-scale diaspora from Anatoliaʼs rural areas to
Turkeyʼs larger cities, both within Anatolia (e. g. Van, Diyarbakır, and Mardin) and
in western Turkey (e. g. İstanbul, İzmir), and beyond into Europe. Perhaps as many
as one third of Kurmanjî speakers have left their Anatolian homeland in the last
40 years, but reliable figures are impossible to obtain.
Traditionally, Kurmanjî is classified as a member of the northwest Iranian
branch of the west Iranian languages, within the Iranian branch of Indo-Euro-
pean. Besides Kurmanjî, two other closely-related Iranian languages are spoken
in Turkey, but are not covered in this volume. The first are so-called Şêx Bizinî
dialects, the language of the descendants of southern Kurdish tribes re-settled in
various parts of Anatolia in the 16th century. With the exception of short descrip-
tions in Lewendî (1997), which demonstrate beyond doubt the southern Kurdish
origins of the dialects, further data on these varieties are unavailable to me. The
second is Zazaki, spoken in several locations in central Anatolia (cf. Figure 1, and
Paul 2009 for recent summary of Zazaki). Figure 1 illustrates the locations of the
various varieties belonging to what is traditionally termed “Kurdish”. Just how
one would define the assumed superordinate entity “Kurdish”, and which varieties
should be included in it, are questions that go beyond the scope of this chapter; see
Öpengin and Haig (2014), Haig and Öpengin (2018) for discussion.
Research on Kurdish in Turkey was hampered for decades by exclusory language
politics directed at “non-Turkish” languages (see Haig 2004; Öpengin 2012, 2015;
Northern Kurdish (Kurmanjî) 107
Figure 1: Overview of varieties traditionally considered to be “Kurdish”
Haig and Öpengin (2018) for discussion and references). Between around 1920
and 1990, the only reliable publications on Kurdish spoken in Turkey are Jas-
trow’s sketch of the phonology of the Van dialect (1977), and Ritter’s collection of
spoken Kurdish narratives from the Midyat region (1971 and 1976). A rich body
of material has also been compiled under the auspices of L’Institut kurde de Paris,
though we still await a more structured survey. Very recently, an online-accessible
data-base of Kurdish dialects has been launched (Matras et al. 2016), which pro-
vides the most comprehensive coverage to date of the Kurmanjî varieties spoken
in eastern Anatolia. Although there is considerable dialectal variation within Ana-
tolia, the situation is one of a dialect continuum, with a high degree of mutual
intelligibility across most of the region, except between the farthest southeastern
and northwestern dialects (see §4 below). In this regard, the situation of Kurmanjî
is rather different to that of Neo-Aramaic, likewise spoken in eastern Anatolia and
northern Iraq, but which consists of scattered and distinct local dialects (or perhaps
languages), with low levels of mutual intelligibility (see Khan, this volume, chap-
ters 2.5 and 3.4).
Below I provide a brief outline of the history and current situation of the speak-
ers (Section 1.2), followed by a grammatical sketch of what is termed here “Stand-
ard Kurmanjî”, loosely based on the dialect of Cizre and Botan in southeastern
Turkey (Section 2). In Section three, selected areas of variation are presented,
illustrating the two most divergent dialects in eastern Anatolia, namely southeast-
108 Geoffrey Haig
ern Kurmanjî and western Kurmanjî. Section four sums up the main points of the
chapter.
1.1. History of the Kurmanjî speech community in Anatolia

There are no direct historical records of Kurdish settlement in Anatolia, so esti-
mates of the timing and pathways are correspondingly speculative. Asatrian (2009)
follows MacKenzie (1961b) in assuming that the initial formation of Kurdish
would have occurred in a southwest Iranian environment, namely the northern
areas of Fars in Iran. Northern expansion of the Kurds into what was then Armenia
began, according to Asatrian, based on the first attestations of Kurds in Armenian
texts, between the 8–9 centuries AD and continued over a period of several centu-
ries. Some clues regarding the chronology of these events can be obtained from the
dating of Armenian loan words in Kurdish. The point of origin of this northward
expansion is considered to be “Northern Iraq, Hakkari, southern shore of Lake
Van” (Asatrian 2009: 35).
Haig and Öpengin (2018) suggest that a northwestward expansion of the Kur-
manjî-speaking peoples into a largely Armenian-populated region receives some
support from the areal distribution of certain features in Kurmanjî. The first is
the comparative homogeneity, and relative simplicity (in terms of morphology) of
those Kurmanjî dialects further to the west. The southeastern varieties of Kurmanjî
(cf. e. g. Behdinī in North Iraq (Haig, this volume, chapter 3.3, §4) and Şemzinan
in Turkey, show the richest morphology, the most complex syllable structures,
the most consistent retention of gender and ergativity, all items that can be con-
sidered typically “mature features” (Dahl 2004; Trudgill 2011), of the kind which
reflect a comparatively long period of stable settlement. The other varieties, on the
other hand, all show, to varying degrees, loss of these features, which would be
expected under conditions of mobility and language contact involved in the north-
western expansion of the Kurds (and possible shift to Kurdish among speakers of
other languages such as Armenian, Neo-Aramaic or Arabic, cf. Trudgill 2011, and
McWhorter 2005 for the role of contact in simplification). At any rate, there must
have been a strong Armenian influence on Kurmanjî in these formative stages,
evidenced in the presence of an additional row of unaspirated voiceless plosives
and affricates in Kurmanjî, a feature that is characteristic (to varying degrees) of
all the dialects (cf. Section 3.1).
Several thousand Kurmanjî speakers were forcibly resettled in the Khorasan
province of Iran in the seventeenth century by the Safavids, with the intention of
protecting the eastern borders of Persia from Uzbek and Turkmen tribes (Ober-
ling 2008). They have maintained their language and many customs down to the
present. Although there is little systematic research on their language, my impres-
sion of recordings made available to me by Don Stilo (p.c.) is that they are descend-
ants of speakers from the western Kurmanjî dialect group (see §3.5 below).
In Anatolia, Kurmanjî speakers have co-existed with speakers of Armenian,

Neo-Aramaic, Zazaki, Arabic and Turkish for many centuries, leading to consider-
able mutual influence. In the southeast, Kurdish has left a deep impact on Arabic
and Neo-Aramaic (see Khan, this volume, chapters 2.5 and 3.4), while Armenian
left its mark at least in the phonology (see §3.1). Turkish influence on Kurmanjî
has also been considerable (Haig 2001, 2006, 2007), particularly on Western Kur-
manjî (see §4). However, it is probably first since the founding of the Turkish
Republic in 1923, leading to nation-wide compulsory schooling and military
service, and the large-scale diaspora from rural Anatolia in the second half of the
twentieth century, that Turkish influence on Kurmanjî has become so omnipresent
(see §4). The future of Kurmanjî in Turkey is impossible to predict. But it must
be noted that, despite the seemingly high absolute figures of “Kurmanjî speakers”
today, very few children are exposed to the early rich input that would ensure full
acquisition of the language, and there is still no regular representation of Kurmanjî
in state-controlled primary education.
2. A grammatical sketch of “Standard Kurmanjî”
Although Kurmanjî is spoken across a large area, there is a reasonably widely-ac-

cepted written standard, loosely based on the dialect of the region Cizre and Botan
in southeastern Turkey. It uses a modified version of the Roman alphabet, and is
employed in all manner of publications, including journals, newspapers, litera-
ture, internet publications, chat-rooms etc. Today’s norms are largely based on
the standards established by Celadet Ali Bedir Khan in a series of articles in the
journal Hawar, published in the 1930’s. These conventions were later codified in
Bedir-Khan and Lescot’s Grammaire kurde (dialecte kurmandji), which was pub-
lished in 1970. In this chapter I will present a grammatical sketch of this Standard
Kurmanjî (Standard K.), based on Haig and Öpengin (2018, §3), with examples
provided in standard orthography, but noting additional phonetic detail where nec-
essary. In Section 3, selected issues in regional variation are presented.
2.1. Phonology
2.1.1. Vowels
The basic vowel system consists of eight simple vowel phonemes, five long (or
tense) vowels, and three short, or lax vowels.
Long, or tense vowels: /a/ <a>; /i/ <î>; /e/ <ê>; /o/ <o>; /u/ <û>
Short, or lax vowels: /æ/ <e>, /ʊ/ , /ɨ/ 
110 Geoffrey Haig
Figure 2: Pan-dialectal scheme for the vowel phonemes of Kurdish

(cf. Haig and Öpengin, 2018)
The tense vowels are /i/, /e/, /a/, /u/ and /o/. They are generally realized phonetically
long, particularly in open syllables, and indeed, they are the ones that are stretched
in traditional Kurdish songs. However, vowel length by itself is not phonemically
distinctive in Kurdish. The full vowels occupy approximately the five positions of a
fairly typical five-term vowel system; the other three vowels are more centralized.
Examples from Haig and Öpengin (2018, §3.1.1) are presented in Table 1.
Table 1: The tense vowels of Kurmanjî (from Haig and Öpengin, 2018)
/a:/ <a> /e:/ <ê>
Orthography IPA Gloss Orthography IPA Gloss
agir [a:gɨr] fire êvar [e:var] evening
sar [sa:r] cold sêv [se:v] apple
mal [ma:l] house, home lêv [le:v] lip
mar [ma:r] snake thêr [te:r] satiated
zava [za:va:] bridegroom avê [ave:] water(obl )
/i:/ <î> /u:/ <û>

spî [s pi:]
ә
white bûk [bu:k] bride
bîne [bi:næ] bring! zû [zu:] soon, fast
tarî [tari:] dark(ness) çû [ʧhu:] s/he went
nîne [ni:næ] there isn’t xwesû [xwæsu:] mother-in-law
dîk [di:k] rooster tûj [tu:ʒ] sharp
/o/ <o>
Orthography IPA Gloss
toz [thoz] dust
çok [ʧhok] knee
got [got] said
zozan [zo:zan] alpine summer settlement
koçer [koʧhæɾ] nomad
The two lax vowels vowels /ʊ/ and /æ/are realized more centrally than the tense
vowels. They are less prone to lengthening in open syllables, but are not subject
to elision under the phonological processes to be discussed below. They may also
occur at the end of words. Examples are provided in Table 2:
Table 2: The lax vowels of Kurmanjî (from Haig and Öpengin, 2018)
/æ/ <e> /ʊ/ 
em [æm] we guh [gʊ(h)] ear
dest [dæst] hand kuştin [kʊʃtɨn] kill
ser [sæɾ] head gund [gʊnd] village
dev [dæv] mouth quling [qʊlɨŋ] crane (bird)
re [ræʃ] black xurt [xʊɾt] strong, sturdy
The mid-high, centralized rounded vowel /ʊ/ is sometimes difficult to distinguish

from the mid-high, unrounded /ɨ/, leading to variation across dialects and in the
spelling of some words, e. g. muhacir ~ mihacir ‘refugee’, tucar ~ ticar ‘trader’.
The central vowel /ɨ/ is approximately a mid-high, mid-closed, unrounded
vowel; it cannot occur word-finally (though there are dialectal exceptions;
see §3.1). Haig and Öpengin (2018) distinguish two underlying sources of this
vowel, though both are written with the same symbol in the standard orthog-
raphy: the lexical central vowel, and the epenthetic central vowel. The lexical
version occurs as a stressed vowel in the stems of lexical items; it is not subject
to the deletion processes that affect the epenthetic central vowel.1 Some examples
are provided in Table 3.
Table 3: The lexical central vowel in selected words

Orthography IPA Gloss
mij [mɨʒ] fog, mist
pirç [phɨɾʧ] hair (of head)
dil [dɨl] heart
diz [dɨz] thief
kir [kɨɾ] did
Epenthetic central vowels occur in a number of forms, generally in order to satisfy

constraints on syllable structure. An example is the use of epenthetic vowels in the
infinitive endings of verbs. If we assume that the infinitive ending is [-n], then this
1
When the surrounding consonants are sonorants, as in dimirim ‘I die’, a lexical vowel
may be elided.
112 Geoffrey Haig
may attach directly to a vowel-final past verb stem, such as kêşa- ‘draw, pull’ (note
that some dialects have a different past stem for this verb). The infinitive is thus
kêşa-n. Following a consonant-final stem, however, an epenthetic [ɨ] is inserted to
avoid non-licensed syllable codas; see Table 4 for examples
Table 4: Epenthetic vowels in infinitives

Past stem Infinitive Gloss
kêşa- kêşa-n pull, smoke (cigarettes)
hat- hat-in come
dît- dît-in see
nasî- nasî-n know (a person)
xwend- xwend-in read, study, recite
Vowel epenthesis may also occur in syllable-onset clusters in lexical items, though
there is considerable regional variation here:
bɨlɨnd ~ blɨnd ‘high’ sɨtra:n ~ stra:n ‘song’
bɨlu:ɾ ~ blu:ɾ ‘type of wooden flute’ bɨɾa: ~ bɾa: ‘brother’
dɨre:ʒ ~ dre:ʒ ‘long’ ʃɨkæft ~ ʃkæft ‘cave’
fɨɾotɨn ~ fɾotɨn ‘sell’ zɨma:n ~ zma:n ‘tongue, language’
sɨpi: ~ spi: ‘white’ zɨla:m ~ zla:m ‘man’
Certain inflectional prefixes, and prepositions, consist of a single consonant plus
the central vowel /ɨ/. Examples are:
di- Indicative present
bi- Subjunctive
li ‘at, in’
bi ‘through’
ji ‘from’
In these formatives, the central vowel can also be considered epenthetic rather
than lexical. It tends to be deleted under certain conditions (and in some dialects
these vowels are seldom realized, regardless of phonological conditioning). For
example the preposition ji [ʒɨ] ‘from’ undergoes reduction and devoicing in normal
connected speech: ‘from you’ [ʒɨ tæ > ʃtæ].
2.1.2. Consonants
The consonant phonemes of Kurmanjî are shown in Table 5.
Table 5: The consonant phonemes of Kurmanjî, generalized scheme (Haig and Öpengin,
2018)
bilab. lab.- dent alveol post-alv. pal. vel. uvul. pharyn. glott.
dent.
Plos. ph p b th t d kh k g q ʔ
Fric. vf ʃʒ xɣ ʁ ħ ʕ2 h
Affr. ʧh ʧ ʤ
Nas. m n ŋ
Trill r
Flap ɾ
Approx. w j
Lateral l (dialectally also ɫ)
The most unusual feature of the consonant system is the three-way contrast on the
stops and affricates, which emerged most probably through Armenian influence,
illustrated with examples in Table 6.
Table 6: Three-way contrast on the stops and affricates (Haig and Öpengin, 2018)
voiceless aspirated: [pho:r] ‘hair’ [thæv] ‘together’
voiceless, unaspirated: [po:z] ‘nose’ [tævɨr] ‘hoe, mattock’
voiced: [bo:z] ‘grey-white (of horses)’ [dæv] ‘mouth’
voiceless aspirated: [kha:r] ‘work, matter, concern’ [ʧhɨma:] ‘why’

voiceless, unaspirated: [ka:ɫ] ‘old man’ [ʧæm] ‘stream, brook’
voiced: [ga:v] ‘step, time’ [ʤæm] ‘by, beside’
2
The phonemic status of the pharyngeal sounds in Kurmanjî is controversial. First, they
are most prominently linked to Semitic loan words, though Barry (2017) points to lan-
guage-internal factors that have contributed to the emergence of pharyngeals, and their
spread to native vocabulary (e. g. most dialects have initial [ħ] in the word for the
numeral “7”). Second, the extent to which they are realized is subject to considerable
cross-dialectal variation. Finally, as pointed out by Christiane Bulut (p.c.), in Kurdish
as well as other languages of the region, the corresponding segments can be considered
to be glottal stops produced with a retracted tongue root, rather than fricatives. Given
their prominence in at least some varieties, we include them in Table 2. We also note
that pharyngealization may be a feature that permeates over an entire syllable, rather
than being localizable on a single segment.
114 Geoffrey Haig
All word-initial <r> sounds are trilled, but in other environments the distribution
is not predictable. Examples for trilled and flap <r> are as follows:
Trilled Flap
[pɨr] ‘much, many’ [pɨɾ] ‘bridge’
[kær] ‘deaf’ [kʰæɾ] ‘donkey’
[bɨri:n] ‘to cut’ [bɨɾi:n] ‘wound’
With regard to pharyngeal segments, there is considerable cross-dialect variation,
(see Khan 2008 on pharyngealization as a variant feature of pronunciation, and
Haig and Öpengin, 2018, §4.2.1 for discussion of local variation). Some relatively
widespread examples include [ʕeli:] ‘Ali’; [teʕm] ‘taste’; [pʰeħn] ‘flat’.
2.2. Nominal morphology
2.2.1. Gender and case

Nouns have an inherent two-way gender distinction between masculine and femi-
nine. The difference is reflected formally in the form of the ezafe, and in the form
of the singular Oblique case marker. In the plural, all gender distinctions are neu-
tralized. Gender assignment is partially semantically motivated: words that refer
to human beings and higher animals with a particular sex, such as mehîn ‘mare’
and ap ‘uncle’ are assigned grammatical gender according to their biological sex.
Words that refer to persons, but which are usable with reference to either sex (e. g.
heval ‘friend’) have no lexically fixed gender. Gender assignment with such words
is determined according to the intended reference in a given context (Haig and
Öpengin 2015).
For words denoting inanimate objects, or smaller animals, the principles
of gender assignment are opaque. There are some morphological regularities
accounting for gender, for example nouns created with the derivational suffix -(y)î
are feminine, as are the infinitives of all verbs. In the dialects of Turkey, with the
exception of those close to Behdinī, the default gender for inanimate nouns is
feminine: most loanwords with non-human reference take this gender. In Behdinī,
on the other hand, the default gender is masculine. Dialectal variation in gender is
discussed in Haig and Öpengin (2018, §3.2.2).
There is a two-way case distinction between Direct (unmarked) and Oblique.
Indefiniteness is marked on singular nouns through the suffix -ek, while no ded-
icated definiteness marker exists. A bare noun may thus have either a singular,
definite reading, or a generic, sortal reading, depending on the context. Paradigms
for singular nouns showing their inflectional possibilities are shown in Table 7.
Table 7: Case and indefiniteness in Standard Kurmanjî (Haig and Öpengin, 2018)
Masculine singular Feminine singular
Definite Indefinite Definite Indefinite
Dir. Obl. Dir. Obl. Dir. Obl. Dir. Obl.
gund gund-î gund-ek gund-ek-î jin jin-ê jin-ek jin-ek-ê
The forms for oblique case marking on singular nouns are provided in Table 7.
Note that the expression of the oblique case is suppressed when the noun con-
cerned is followed by the ezafe, and it may be absent with singular masculine
nouns (see below). The oblique case is used in the following syntactic functions:
I. Object of a present-tense transitive verb
II. Goal or Recipient argument immediately following a predicate of motion or
transfer
III. Complement of any adposition (though dropped in certain combinations)
IV. Possessor in an ezafe construction
V. Subject of a past-tense transitive verb
The direct case is used elsewhere.
Zero is common for proper nouns, and for masculine singular nouns particularly
when they have generic reference, in most dialects of Central Anatolia. The fol-
lowing example is from the Kurdish textbook Hînker:
(1) Ez şîr ve-na-xw-im
I milk(m ) prv -ind -drink.prs -1sg
‘I do not drink milk.’
Stem-vowel raising is found in many dialects; it only affects the open, non-rounded
vowels [a, æ], when they are in stressed syllables, and raises them: [a, æ → e:].
For example:
aş êş ‘mill’ ga gê ‘ox’
nan nên ‘bread’ ba bê ‘wind’
baxçe bêxçe ‘garden’ kevir kêvir ‘stone’
bajar bajêr3 ‘town’ zilam zilêm ‘man’
hesp hêsp ‘horse’ xanî xênî ‘house’
şivan şivên ‘shepherd, goatherd’ lawik lêwik ‘boy, son’
welat welêt ‘state, country’ ezman ezmên ‘sky’
ziman zimên ‘tongue, language’
3
In some dialects where stem-vowel raising is not an option for marking the oblique
case, the raised form bajêr ‘town’ has become the unmarked form of the noun, used in
all contexts, implying that the rule existed at earlier stages of the language.
116 Geoffrey Haig
Bare masculine singular nouns only consistently receive a suffixal marking of the
oblique in the Badinan dialects of North Iraq, and in the east of the Hekari region
in Turkey. As noted above, suffixation is regularly and consistently applied to all
masculine singular nouns, and across all dialects, when the NP concerned has a
determiner such as a demonstrative, or the interrogative kîjan ‘which?’, or carries
the indefiniteness suffix -ek. An example with a demonstrative is (2); the presence
of an oblique suffix suppresses stem-vowel raising:
(2) li vî welat-î … (not: *li vî welêt-î … or *li vî welêt …)
in dem .obl homeland-obl . m
‘in this homeland’
See Haig and Öpengin (2018, §3.2.4) for regional variation in the marking of sin-
gular masculine obliques.
2.2.2. Plural number

In Standard K., only nouns in the oblique case are overtly marked for plural,
through the suffix -a(n) (deletion of -n is normal in some dialects), as shown in
Table 8:
Table 8: Plural and case marking in Standard Kurdish

Plural (masc. and fem.)
Definite Indefinite
D irect O blique D irect O blique
jin jin-a(n) jin(-in) jin-a(n)
gund gund-a(n) gund(-in) gund-a(n)
An ending for indefinite direct plural -in is regularly cited in pedagogical works
and is shown in brackets above, but it is only frequently attested in the dialects
of Mardin region, and across the border in Syria. Elsewhere it is rare or lacking
completely.
Nouns in the direct case do not inflect for plural. Such nouns are usually sub-
jects, so plurality is generally reflected in number agreement on the verb:
zarok hat-in ‘the children came’
zarok hat ‘the child came’
There is a tendency in the dialects to the west for the Oblique plural marker to
become a general plural marker, which is used on nouns in the direct case, and also
on the demonstratives; see below on Mereš dialect.
2.2.3. Pronouns
The forms for the personal pronouns are given in Table 9. The “third person” pro-
nouns are basically the distal demonstratives.
Table 9: Personal pronouns in Standard Kurmanjî (cf. Haig and Öpengin 2018)
D irect O blique
SG 1 ez min
2 tu te
3 ew wî (m.) /wê (f.)
PL 1 em me
2 hûn we
3 ew wan
In addition to the personal pronouns, Kurdish has an invariant reflexive pronoun

xwe (dialectally also [xæ], [xo]). It is used for all persons and numbers, both as
a personal pronoun and a possessor, when coreference with the subject of same
clause is intended. In Standard K., the reciprocal pronoun is hev or hevdû, again
used for all persons.
There are two demonstratives, ew ‘that’ and ev ‘this’, with considerable dialec-
tal variation. The standard forms are illustrated in Table 10.
Table 10: Demonstratives in Standard Kurmanjî

D irect O blique
(all gender/numbers) Sg. masc. Sg. fem. Plural
proximate ev vî vê van
distal ew wî wê wan
In addition to the demonstratives, most dialects attach an additional suffixal or

clitic marker to the nouns preceded by a demonstrative: in the singular -e/-a
(regional variants, not gender related) and in the plural -ene / -ana. They only
attach to the noun if it is the final element of the NP; if it is followed by a modifier
in an ezafe construction, then the marker is not used.
(3) (Antep-Adiyaman region, Haig and Öpengin 2018)
Va defter=na yē min=in
dem . prox notebook=dem .pl ez . pl 1 sg . obl = cop .3 pl
‘These notebooks are mine’
118 Geoffrey Haig
2.3. The structure of the NP

The basic structure of a NP in Kurmanjî is the following, where only N(oun) is
obligatory:
Dem Num N-Ez Poss Ez Adj
ev sê kum-ên min yên reš
these three hat-ez . pl 1 sg . obl ez . pl black
‘these three black hats of mine’
Demonstratives have already been illustrated in Table 10. The numerals are given
in Section 2.4. Possessors and descriptive adjectives follow the head, in that order
if both are present, and obligatorily occur with an ezafe (cf. Schroeder 1999 for
discussion of the NP in written Kurdish).
2.3.1. The ezafe construction

The ezafe construction is well-known from Persian, and is found, with certain
variations, in all varieties of Kurdish. It may be either a bound morpheme (suffix
or clitic, with as yet poorly researched dialectal variation in stress assignment) or
an independent particle. Historically it goes back to an Old Iranian demonstrative/
relativizer (cf. Haig 2011). In Standard K., traces of these pronominal origins are
evident in the fact that the ezafe still inflects for gender and number, agreeing
with its head noun in these categories, and in the fact that it occurs as the “free”,
or “demonstrative” ezafe, discussed briefly below. We distinguish between simple
linking ezafe constructions, and free or demonstrative ezafes.
A simple ezafe construction is found with any noun that is modified by an
adjective, as in (4), or with a possessor, as in (5). An ezafe particle attaches to the
head noun, and varies according to gender and number of that noun:
(4) bajar-ek-î mezin
town-indf - ez . m big
‘a big town’
(5) mal-a me
house-ez . f 1 pl . obl
‘our house’
Nouns may also be modified by prepositional phrases, as in (6), or by relative
clauses as in (7) and (8), in which case they also take the ezafe:
(6) dest-ēn min yēn ji ber serma-yē qeliš-ī […]
hand-ez . pl 1 sg . obl ez . pl from adp cold-obl . f split-ptcp
‘my hands which are cracked and split because of the cold’
(7) ev ri-ya ku tu di-d-ī pēši-ya me

dem . prox road-ez . f rel 2 sg ind -give. prs -2 sg front-ez . f 1 pl . obl
‘This road that you make us take’
(Sarman 37)
(8) kur-ē wī yē li welat-ēn xerībī-yē
son-ez . m 3 sg . obl . m ez . m in country-ez . pl foreign.land-obl . f
‘His son (who) is in foreign countries’
Possessors in ezafe constructions take the oblique case, adjectives in ezafe con-
structions remain uninflected. Compare the difference:
gel-ê kurd (people-ez . m Kurdish) ‘the Kurdish people’
welat-ê kurd-an (country-ez . m Kurd-obl . pl ) ‘the country of the Kurds’
If a possessor is coreferent with the same-clause subject, the reflexive pronoun
xwe is obligatorily used in place of a personal pronoun:
(9) Ez li mal-a xwe me / mal-a *min
1sg in house-ez . f refl cop .1 sg / house-ez . f 1 sg . obl
im
cop .1 sg
‘I am at my house’ (lit. I am at self’s house / *I am at my house)
The forms of the ezafe in Standard K. are given in Table 11:
Table 11: Ezafe with the nouns gund ‘village’, bra ‘brother’, jin ‘woman’, çira ‘lamp’
Singular Plural (masc. and fem.)
masculine feminine
Def Indef. Def. Indef. Def. Indef.
gund-ê gundek-î jin-a jinek-e gund-ên / -êt gund-in-e
bra-yê brayek-î çira-ya çirayek-e jin-ên / -êt jin-in-e
bra-yên / -yêt bra-n-e
çira-yên / -yêt çira-n-e
The plural forms with -êt are found mainly in the Behdinī (see Haig, this volume,
chapter 3.3, §4). As mentioned above in connection with gender, the gender dis-
tinction in the ezafe following the indefinite marker -ek tends to weaken, with
considerable uncertainty and inconsistency in the forms. In the spoken language,
an ezafe may be omitted completely following nouns with indefinite -ek, and this
can also be witnessed sporadically in the written language: li ber derê kafeyek
internetê ‘in front of the door of an internet cafe’, with no ezafe following the
initial head noun (Dirêj 2011: 21).
120 Geoffrey Haig
The presence of an ezafe on any noun suppresses the expression of oblique

case on that noun. This is a very crucial fact of Kurmanjî syntax: it means that the
ezafe itself is impervious to the external case of the entire NP. For example:
(10) Gund di nav [çiya-yên bilind] da ye
village adp middle mountan-ez . pl high adp cop .3 sg
‘The village lies between high mountains.’
(11) Li wê herêm-ê [çiya-yên bilind] he-ne
adp dem . obl . m
region-obl . m mountan-ez . pl high existent-cop . pl
‘There are high mountains there.’
The ezafe construction çiyayên bilind remains unchanged, regardless of the syn-
tactic function of the entire NP. Thus in (10), we would expect an oblique case,
because it is the complement of an adposition, while in (11) we would expect
direct case, because it is the subject of an existential clause. But the presence of
the ezafe -yên suppresses any overt expression of case on the noun. Overt case is,
however, expressed when the ezafe construction is introduced with a demonstra-
tive, which always expresses the case of the entire NP:4
(12) Gund di nav [wan çiya-yên bilind] da ye
village in middle those mountain-ez . pl high adp cop .3 sg
‘The village is in between those high mountains.’
(13) [ew çiya-yên bilind] li ser sinor in
those mountain-ez . pl high adp adp border cop .3 pl
‘Those high mountains are on the border.’
Ezafe particles may also occur separated from their head noun. We refer to these as
free, or demonstrative, ezafes. There are two possibilities. First, they may be used
to add additional dependents to an existing simple ezafe construction. They still
agree with the respective head nouns in number and gender:
bra-yê min ê mezin ‘my older brother’
mehîn-a boz a qenc ‘the good grey mare’
gund-ên Qersê ên kevn ‘the old villages of Kars’
Second, they occur as anaphoric elements with the sense of ‘the one …’. In such
contexts, they are prosodically independent, rather than enclitic, and are preceded
by a glide: yê spî ‘the white one (masc. sg.)’; ya te ‘your one (fem. sg.)’; yên mezin
‘the big ones (pl.)’.
4
Interestingly, in Şemzînan (and probably Badinan generally) this sometimes does not
hold, and the demonstrative may actually remain in the direct case: tu ew çiyayêt bilind
dibînî? ‘Do you see those high mountains’, where the demonstrative is in direct case.
In Behdinī Kurdish, and dialects close to it such as Şemzinan (cf. §3.2.1),

the use of ezafes as independent forms has entered the verbal domain, where
they accompany certain kinds of predicates, in particular copular elements (cf.
Haig 2011, and this volume, chapter 3.3, §4), as in (14).
(14) … ez ya bêdeng im
I ez . f silent cop .1 sg
‘I am remaining silent’ (the speaker is a woman)5
Something similar may be found in the Elbistan dialect, discussed below in §3.2.2,
though the origins of these forms are somewhat obscure.
2.4. Numerals
The main numbers, given in Standard Orthography (following Bedir Khan and
Lescot 1970) are as follows:
1 yek 11 yanzdeh, yazdeh 30 sî
2 didu, du 12 dwanzdeh 40 çel, çil
3 sisê, sê 13 sêzdeh 50 pêncî
4 çar 14 çardeh 60 şêst
5 pênc 15 panzdeh 70 heftê
6 şeş 16 šanzdeh 80 heştê
7 heft [ħæft] 17 hevdeh 90 nod, not
8 heşt [ħæʃt] 18 hejdeh 100 sed
9 neh 19 nozdeh 201 du sed û yek
10 deh 20 bîst 1000 hezar
The short forms of 2 and 3 are used when they are quantifiers in a NP: sê zarok
‘three children’. In the western parts of the Kurmanjî speech zone, the typically
Indo-European opaque forms for 11 and 12 have disappeared, and all the numer-
als 11–19 have been regularized along the lines of “10-and-1”, “10-and-2” etc:
dehûyek, dehûdu, dehûsê (cf. Haig 2006). This would appear to reflect contact
influence from Armenian and Turkish, which lack opaque forms for 11 and 12, and
instead have regularly-formed “10–1” etc.
5
From a short story Hirmîka Xirş by Mihemed Selim Siwarî, a writer from the Beh-
dinī-speaking region in North Iraq, published in Antolojiya çirokên kurmancên başûr,
edited by Xelîl Duhokî (Avesta, 2011).
122 Geoffrey Haig
2.5. Adpositions
In Kurmanjî, I distinguish three components of the adpositional system, which can
be combined in various ways: basic prepositions, locational nouns, and postposi-
tional particles.
2.5.1. Basic prepositions

Standard Kurdish has three basic prepositions, and these are reasonably stable in
most dialects (though Southeastern Kurmanjî differs in some respects, cf. §3.3).
Each covers a broad and fairly abstract semantic space, with a spatial core: ji
‘from’, bi ‘by, through’, li ‘at’.
Recall from the discussion of phonology in §2.1 that no word can end with the
short central vowel [ɨ]. From this fact, it is evident that these prepositions do not
constitute phonological words in Kurdish, and are probably best seen as proclitics.
These three prepositions are also unique in that they fuse with a demonstrative
to yield jê (ji + wî/wê), pê (bi + wî/wê) and lê (li + wî/wê) respectively. In some
dialects, the compositional form pê has been reanalysed as a simple preposition
with instrumental meaning, cf. (15) from Karakoçan dialect (field notes from Kar-
akoçan):
(15) ister pê dest-an bi-xw-e ister pê kevčî
want with hand-obl . pl subj -eat:pres - imper .2 sg want with spoon
‘Eat (it) with (your) hands, or with a spoon, as you please’6
Basic prepositions may occur alone, but are more common in combination with
a postpositional particle, or with locational nouns. In addition to the three men-
tioned above, there is also one fixed circumposition, consisting of di … de ‘inside’.
In Standard K., the prepositional element di, unlike the three mentioned above,
cannot occur by itself, but is always accompanied by the postpositional particle de.
In addition to the three “basic prepositions” just discussed, two other preposi-
tions are found throughout the Kurmanjî region, though they differ from the three
just mentioned in that they end in full vowels, and there is no fusion with the prep-
ositional complement: (a) the preposition bê ‘without’; (b) the preposition bo. In
most dialects of Turkey bo can be combined with ji to express benefactive mean-
ings (ji bo). The dialects of the southeast use simply bo, which is also extended
to cover recipient and goal meanings, where it generally replaces the combination
ji … re/ra of Standard K.
6
ister … ister is a modified loan construction from Turkish, based on the Turkish verb
istemek ‘want’
2.5.2. Locational nouns

A number of prepositions are evidently the result of the grammaticalization of
nouns; they can be used both independently and in combination with the basic
prepositions. The commonest are the following:
nav ‘inside’ di nav … de ‘inside’
ber ‘front‘ li ber ‘in front of’
ji ber ‘because of’
ser ‘head’ li ser ‘on, upon, over’
bin ‘bottom’ li bin, di bin … de ‘beneath, underneath’
dû ‘behind’ li dû ‘after’
pişt ‘back’ li pişt ‘behind’
rex ‘side’ li rex ‘next to, on the side’
tenişt ‘side’ li tenişt ‘by side’
Some of these locational nouns also occur in a kind of prepositional stranding
construction, occurring without a complement and post-verbally:
(16) min nan da ber û hat-im
1 sg . obl bread give. pst .3 sg front and come. pst -1 sg
‘I put the food in front (of him) and came’
2.5.3. Postpositional particles

These particles are mostly combined with a preposition; they cliticize to the right-
edge of the entire prepositional phrase, and provide additional meaning compo-
nents to the phrase. However, the resultant meanings are not always transparent,
for example (cf. Bedir Khan and Lescot 1970: 244–258 for a detailed list):
bi … re/ra ‘accompaniment, together with’
di … re / ra ‘through’
ji … re/ra ‘for, to, benefactive/recipient’
(ber) bi … de ‘towards’
In many dialects of central Anatolia (e. g. Dersim and Karakoçan) and the dia-
lects of Armenia, the benefactive/recipient ji … re/ra and the local di … de/da are
reduced to just the respective postpositional element: mi(n)-ra ‘to/for me’, mal-da
‘at home’.
Spatial meanings are also conveyed through directional adverbs, such as: jêr
‘down’, jor ‘up’, xwar ‘down (on the ground)’. Another important element is the
particle -de/da following NPs expressing directionals, when they occur after the
predicate.
124 Geoffrey Haig
(17) hinek av-ê bi-xi-yê da ji_bo_ku

a.little water-obl . f subj -drop. prs -3 sg . obl adp so.that
ne-şewit-e
neg -burn.prs -3 sg
‘Put a little more water in it so that it does not burn.’
2.6. Verbal morphology

Verbs exhibit the typical western Iranian characteristic of having two stems, a
present and a past stem, but the formation of one from the other is not fully pre-
dictable. Certain regularities can be identified, though we will not attempt a clas-
sification here. Table 12 provides a list of frequent verbs:
Table 12: Frequent verbs in past and present stems

Infinitive Past Stem Present Stem Meaning
bûn bû- -b- be
birin bir- -b- take
hatin hat- -(h)ê-/-wer- come
hêlan/hiştin hişt- -hêl- leave
bihîstin bihîst- -bihîz- hear
girtin girt- -gir- grasp, hold
gotin got- -bêj- say
kuştin kuşt- -kuj- kill
rûniştin rûnişt- -rûn- sit
kirin kir- -k- do, make
çûn çû- -ç-/-her- go
jîn/jîyan jî-/jiya- -jî- live
ketin ket- -kev- fall
xwarin xwar- -xw- eat
xwastin xwast- -xwaz- want, request
avêtin avêt- -avêj- throw
dîtin dît- -bîn- see
dan da- -d- give
mirin mir- -mir- die
zanîn zanî- -zan- know
girîn girî- -girî- cry, weep
ajotin ajot- -ajo- drive
barîn barî- -bar- rain
xistin xist- -x-/-xîn- strike, knock
xwandin xwand- -xwîn- read, study
Verbs are quite a small, closed word class in Kurmanjî (probably no more than 150
simplex verbs in regular usage in most dialects). The only moderately productive
derivational process for creating new verbs is a causative suffix, -and, used for
deriving transitive verbs from intransitive present stems: gerîn ‘walk, stroll’ →
gerandin ‘lead’, nivistin ‘sleep’ → nivandin ‘put to sleep’. New verb meanings are
normally created using light verb constructions usually based on kirin, bûn, dan
(see below). Additional verbs may also be derived through the lexicalization of
verb plus a dummy prepositional complement, for example jê birin ‘win’, literally
‘take from him/her’, also ‘erase’; lê xistin ‘beat (a person), lit. ‘strike on him/her’;
lê hatin ‘become’. Incorporation of a pre-verbal element may also yield a new verb
(see below).
2.6.1. Person marking suffixes

Finite verbs take agreement suffixes, indexing the verb for person and number of a
single core argument: the intransitive subject in all tenses, the transitive subject in
present tenses, and the transitive object with past tenses. Table 13 shows the two
sets of person agreement suffixes, one used with forms based on the present stem,
the other for forms based on the past stem. Subjunctive forms based on the past
stem have distinct composite endings with considerable cross-dialect variation,
beyond the scope of this section (see the dialect sketches below for some discus-
sion). Non-verbal predicates take a (slightly) different set of clitic copular endings
given in Table 15.
Table 13: Person agreement suffixes

S tem
P erson
P resent P ast
1sg -(i)m -(i)m
2sg -î / -e (imperative) -(y)î
3sg -e -Ø
1,2,3pl -(i)n -(i)n
TAM and negation prefixes

Verb forms based on the present tense (simple present, subjunctive, imperative,
future) obligatorily take a single prefix, either the neutral simple present prefix
di- (glossed ind icative), or the subjunctive prefix bi-, or a negation prefix na-
(replaces the indicate)7 or ne- (replaces the subjunctive). In imperative forms, and
with preverbal elements combined with light verbs, a prefix may be lacking. In
7
Two verbs negate the present stem with ni-, zanîn ‘know’ and karîn ‘be able’: n izanim /
nikarim ‘I don’t know / I can not’. The verb šiyan ‘be able’, used in Behdinī and
Şemzînan dialects, negates the present stem with ne: nešêm ‘I can not’.
126 Geoffrey Haig
the Behdinī dialects of North Iraq, the subjunctive present forms used to make the
future tense regularly drop the prefix bi-.
Verb forms based on the past stem, however, are not necessarily prefixed. The
simple past tense is basically thus the past stem of the verb plus the appropriate
person agreement markers. In the past tenses, an aspect distinction between pro-
gressive (or imperfective) and simple past is available, signalled by the prefix di-.
Negation of both simple and progressive pasts8 is through addition of the
prefix ne-: ne-hatin ‘they didn’t come’, or ne-di-hatin ‘they weren’t coming’. In
past tenses, agreement patterns vary according to the transitivity of the verbs (cf.
§3.4–3.5 below). Sample paradigms are given in Table 14.
Table 14: Sample verb conjugations

simple simple past simple past past progres- past
present (intrans.) (trans.) sive progressive
(trans.) hatin xwarin ‘eat’ (intrans.) (trans.)
gotin ‘say’ ‘come’ hatin ‘come’ xwarin ‘eat’
1sg ez dibêjim ez hatim min xwar ‘I ate (sth.)’ ez dihatim min dixwar
2sg tu dibêjî tu hatî te xwar tu dihatî te dixwar
3sg ew dibêje ew hat wî (masc.) xwar ew dihat wî (masc.)
wê (fem.) xwar dixwar
wê (fem.)
dixwar
1pl em dibêjin em hatin me xwar em dihatin me dixwar
2pl hûn dibêjin hûn hatin we xwar hûn dihatin we dixwar
3pl ew dibêjin ew hatin wan xwar ew dihatin wan dixwar
Non-verbal predicates in the present indicative receive a set of clitic person

markers or copula forms, see Table 15.
Table 15: Copular person clitics with non-verbal predicates

Following a consonant: Following a vowel:
kurd ‘Kurdish’ birçî ‘hungry’
1sg ez kurd-im ‘I am Kurdish’ ez birçî-me ‘I am hungry’
2sg tu kurd-î tu birçî-yî (reduced to [i:j])
3sg ew kurd-e ew birçî-ye
1pl em kurd-in em birçî-ne
2pl hûn kurd-in hûn birçî-ne
3pl ew kurd-in ew birçî-ne
8
Negation of progressive past in Western Kurmanjî (Adiyaman-Urfa) is na-, as in na-de-
kir-in ‘they were not doing it’.
In Western Kurmanjî, special constructions are found with non-verbal predicates

(cf. §3.4.2.2). For non-verbal predicates in the past tenses, or in subjunctive mood,
the appropriate form of bûn ‘be’ is required.
2.6.2. The verbs çûn ‘go’ and hatin ‘come’

These two verbs have suppletive stems, with regional variation in the choice and
forms of the stems.
Table 16: The verbs çûn ‘go’ and hatin ‘come’

çûn ‘go’ hatin ‘come’
Present -çi- / -her- (imperative) -(h)ê- / -wer- (imperative)
Past çû hat
The Standard K. indicative singular 1sg form of ‘come’ is têm, resulting from a
contraction of *di-hê-m with the typical devoicing of the d- in such contexts (in
some dialects the trace of the stem-initial h- can still be heard), while the negation
is nayêm. In the western dialects of Kurmanjî (WK, see §3.4.2 below), the imper-
ative stem -her- of ‘go’ is also used in the indicative, so for example in Elbistan,
Dersim, Erzurum, and Elaziğ, it is used for all forms of the present stem. Thus
first person indicative present in these dialects is terim (<*di-her-im) ‘I go / am
going’, while negative indicative present is narim (<*na-her-im). In other dialects,
the imperative form is also used to cover subjunctive meanings in the present.
The imperative stem of hatin is also often used in place of the regular subjunctive
(which is bêm in the first singular).
2.6.3. Mood
With the present stem, there is a simple distinction between indicative verb forms,
marked with di-, and subjunctive verb forms, prefixed with bi- or zero in some
dialects.
The subjunctive of the present stem (cf. 18–21) has a wide range of functions,
including clauses with irrealis sense (wishes or orders), and subordinate clauses
expressing possible or intended outcomes. It is obligatory in the complements of
modal predicates such as ‘want’, ‘be able’, ‘be obliged/must’. Some examples
follow (from Bedir Khan and Lescot 1970: 317–321):
(18) Kafir jî b-e
unbeliever add be.prs . subj -3 sg
‘even if (he) is an unbeliever’
128 Geoffrey Haig
(19) Ez di-tirs-im şermisarî û belengazî

1 sg aff -fear. prs -1 sg disgrace and misery
par-a me b-e
fate-ez . f 1 pl . obl be.prs . subj -3 sg
‘I am afraid that disgrace and misery will be our fate’
(20) heke birçî ne, bila bi-xw-in
if hungry cop . pl mod . prt subj -eat. prs - pl
‘If they are hungry, they should eat’
(21) heke pirs-a wan he-b-e,
if question-ez . f 3 pl . obl existent-be. prs . subj -3 sg
bila vê gavê bêj-in
mod . prt this time-obl say. prs . subj - pl
‘If they have a question, they should say so at the time’
Other subordinate clauses may (as in (22) and (23)) or may not be in the sub-
junctive (as in 24–25), depending on the degree of certainty of the proposition
expressed.
(22) Dît ko gotin-a wî rast e
see. pst (3 sg ) compl word-ez . f 3 sg . m . obl right cop .3 sg
‘He saw that what he said was right’
(23) De bêje, te çawan kir
mod . prt say. prs . imper 2 sg . obl how do. pst .3 sg
‘go on, say how you did it’
(24) Ni-zanîbû9 ko çawan bê pere
neg -know. pst (3 sg ) compl how without money
ve-ger-e mal
prv -return. prs . subj -3 sg home
‘He didn’t know how he would return home without any money’
(25) Bi-xwîn-in da ko ho zana bi-b-in.
subj -read. prs - pl so that thus knowledgeable subj -be. prs - pl
‘study, so that you may become knowledgeable’
2.6.4. The future tense

The future tense is expressed via a particle (d)ê or wê, combined with a subjunctive
form of the verb; see (26). It is most frequent in clause-second position, often (but
not always) preceded by the subject as in (27), though it is also possible clause-
9
The verb zanîn ‘know’ usually takes this form for the negated simple past.
initially; see (28) below. The main verb is in the present subjunctive. High-fre-
quency verbs such as gotin ‘say’ generally drop the bi- prefix in the future tense,
as does the verb bûn ‘be’ (exs. from Haig and Öpengin 2018).
(26) ji_bo Xwedē sal-ek-ē ez=ē řožī bi-gir-im
for God year-indf - obl 1 sg = fut fasting subj -keep. prs -1 sg
‘For the sake of God I will fast during one year.’
(27) ew dē mesel-ē ji te re bēž-e
3 sg fut issue-obl . f from 2 sg . obl postp say. subj .prs -3 sg
‘He will tell you the issue.’
(28) wē čawa heval-ēn nexweš derbas bi-bi-n
fut how friend-ez . pl ill prv .pass subj -be. prs -3 pl
‘How will the wounded friends pass?’
(29) ewro ne, dē sibe či-m
today no, fut tomorrow come. subj .prs -1 sg
‘not today, (but) tomorrow I will come’
In Standard K. and contemporary written Kurdish, the future auxiliary can be
retained in negative sentences, in which case the negation marker is the subjunc-
tive negation ‘ne-’. In Behdinī, there is no dedicated negative future; the negative
indicative is used:
Standard K.: Ez ē sibe bi wan re ne-či-m.
Behdinī: Ez sibe digel wan na-či-m ‘I won’t go with them
tomorrow’
2.6.5. The directional –e particle on verbs

A large class of verbs expressing motion (‘go’, ‘come’) or directed action (‘give’,
‘speak’, ‘send’) frequently takes the so-called ‘directional particle’ -e (in most
dialects [-æ]) after the person marker on the verb. MacKenzie (1961a: 197–198)
analyses it as a reduced form of a directional preposition, which has cliticized to
the preceding verb. It is assimilated to the final vowel of verb forms ending in one
of the full vowels (e. g. čû ‘went’). For verbs such as hatin ‘come’, the use of the
directional is almost obligatory (claimed, for example, for Tur Abdin dialect in
Turgut 2012). An example from a traditional text is the following (Bedir Khan and
Lescot 1970: 352):
130 Geoffrey Haig
(30) Se û zarok-ên gund li gur hat-in-e

dog and child-ez . p village at wolf come. pst -3 pl - drct
hev …
together …
‘The dogs and the children of the village gathered together around the
wolf …’
The precise conditions determining its realization remain, however, not fully
understood. It needs to be distinguished from the reduced form of a third person
singular addressee or recipient, which likewise cliticizes to the verb: got=ê ‘said to
him/her’, didin=ê ‘give to him/her’. If such a clitic goal is present, the directional
particle cannot be realized.
2.6.6. Light verb constructions

Like most Iranian languages, Kurdish makes extensive use of complex predicates
consisting of a so-called ‘light verb’ plus some non-verbal element. The most com-
monly used light verbs in Kurmanjî are kirin ‘do, make’, bûn ‘be, become’ and dan
‘give’. The following list is a small selection of widely-used light verb construc-
tions involving a nominal non-verb element (Haig 2002: 22–23):
ava kirin ‘build, establish’ xwedî kirin ‘bring up, raise’
bang kirin ‘call’ bar kirin ‘load, move (house)’
alî(karî) kirin ‘help’ gazî kirin ‘call’
bawer kirin ‘believe’ hez kirin ‘like, love’
fa(h)m/fêm kirin ‘understand’ dest pê kirin ‘start, begin’
şerm kirin ‘be ashamed’ nîşan dan ‘show’
ji bîr kirin ‘forget’ dest avêtin ‘reach for, begin’
guhdarî kirin ‘listen’ dev jê berdan ‘leave alone, cease
doing’
2.7. Syntax of the simple clause
2.7.1. Word order

The word order in pragmatically neutral clauses is SOVG, where “G” stands for
“Goal”, here a cover term for spatial goals of verbs of movement, recipients of
verbs of transfer, and addressees of verbs of speech. However, word order is not
rigidly fixed; direct objects may be fronted for pragmatic purposes, for example.
The position and means of marking of goal arguments (in the broad sense just
defined) also varies; in those dialects which make extensive use of the circum-
position ji … ra (cf. §3.5) for recipients and benefactives, they precede the verb,
yielding SGOV. For recipients with dan ‘give’, however, all dialects usually place
the recipient argument immediately after the verb, in the oblique case but with no
adposition. The southeastern dialects make more extensive use of the post-predi-
cate position, which is almost always coupled with the presence of the directional
particle on the verb (§2.6.5). These dialects also make use of a preposition bo with
some post-predicate recipients and benefactives (see Haig 2015 on post-predicate
goals), in which case no directional particle occurs on the verb.
2.7.2. Alignment and related issues

Kurmanjî has an ergative construction, used with the past tenses of transitive
verbs. Otherwise, the syntax is accusative throughout. The ergative construction
associated with past transitive verb forms has attracted a fair bit of attention in
recent years (Bynon 1979; Dorleijn 1996; Matras 1997; Haig 1998; Turgut 2012;
Haig 2008 for summary discussion), and we will only point out some of the more
salient facts here, and some points of variation across the dialects.
In the ergative construction, the transitive subject takes the Oblique case,
while the direct object is in the Direct case. The verb agrees with the direct object.
However, the order of subject and object remains unchanged. Similarly, the subject,
despite its Oblique case, still controls coreference with reflexive xwe. Throughout
Northern Kurdish, it is subjects only which control reflexive xwe, with exceptions
only regularly found in WK (§3.5). Thus the relation of subjecthood in Northern
Kurdish is quite robust, and largely independent of surface case (cf. Haig 1998 for
discussion). Typical examples (from Thackston 2006: 49) are given in (31–33):
(31) Jinik-ek-ê çay-a me anî
woman-indf - obl . f tea-ez . f 1 pl . obl bring. pst .3 sg
‘A woman brought our tea.’
(32) Wî mirov-î çay anî
dem . m . obl
man-obl . m tea bring. pst .3 sg
‘That man brought tea.’
(33) Gundi-yan tişt-ek ne-got
villager-pl . obl thing-indf neg -say. pst .3 sg
‘The villagers didn’t say anything.’
There are two main areas where the morpho-syntax of Kurdish diverges from what
would be expected from the rules of ergativity as just outlined, namely the agree-
ment on the verb, and the case of the direct object. With regard to verb agreement,
when the transitive subject is plural, and not expressed overtly in the clause, there
is a strong tendency to add plural agreement to the verb, even when the object
is singular. This usage is followed in all varieties of Kurdish, spoken or written,
when the clause with the transitive verb is preceded by an intransitive clause with
the same subject. Characteristically in (34) below, which represents the written
132 Geoffrey Haig
language (the poem Ji Biçukan re, by Cegerxwîn), an intransitive clause precedes

the past transitive verb gotin ‘say’:
(34) Herdu çû-n-e cem rovî Doz-a xwe jê_ra
the.two go. pst - pl - drct to fox case-ez . f self to.him
got-in
say.pst - pl
‘The two of them went to the fox (and) explained( pl ) their case to him.’
A second tendency, found in the dialects of Central Anatolia to the west, is to put the
direct object of a past transitive verb into the Oblique case, rather than the expected
Direct case, leading to a double-oblique construction (with both subject and object
in the Oblique). In the dialect of Muş, this tendency can be regularly observed:
(35) (story told by speaker from Muš)
ez zarok bû-m-e, biçûk bû-m-e, min
1 sg child be. pst -1 sg - perf small be. pst -1 sg - perf 1 sg . obl
girt-in-e …
take.pst -3 pl - perf
‘I was a child, I was young (they) took me …’
See Haig and Öpengin (2018), and Haig (2017: 477–479) for discussion and refer-
ences on deviations to canonical ergativity.
2.7.3. Non-canonical subjects

In the dialects of the south and the east (e. g. Şemzînan and Behdinī), certain pred-
icates take a subject in the Oblique, regardless of tense. Such constructions resem-
ble superficially the ergative construction, but should not be confused with it,
because (a) they are not conditioned by the tense of the verb; (b) the predicates
concerned can be intransitive. Typically such non-canonical subjects occur with
certain predicates of physical sensations, for example min( obl ) sar e ‘I am cold’.
However, not all such predicates have non-canonical subjects, cf. ez( dir ) birčî me
‘I am hungry’. The verb viyan, expressing necessity/desire, also takes an oblique
“wanter”:
(36) min d-vê-t b-çi-m
1 sg . obl ind -be.necessary. prs -3 sg subj -go. prs -1 sg
‘I want to go’
Finally, in expressions of possession the possessor is often in the oblique:
(37) min trimbêl nîne
1 sg . obl car not.existent.3 sg
‘I do not have a car.’
Northern Kurdish (Kurmanjî) 133
In most other dialects, these constructions are not used. Instead, canonical sub-
jects in the Direct case are used, or, in the case of possession, the possessor is the
modifier in an ezafe construction. A remnant of this construction may be found in
many dialects in the expression çav ketin ‘eye fall’, i. e. ‘catch sight of’, where the
‘possessor’ of çav occurs clause-initially, rather than via an ezafe-construction.
The following example from Ritter’s Midyat texts (transcription adapted) is fairly
typical:
(38) waxtê ku sofî çav pê ket …
time-ez . m compl Sufi eye with.him fall. pst .3 sg
‘When the Sufi caught sight of him …’
3. Dialectal variation in phonology and morphosyntax
This section briefly summarizes aspects of regional variation, largely based on

Haig and Öpengin (2018). For more detailed discussion of lexical and phonolog-
ical variation, see Öpengin and Haig (2014), which is based on a comparative list
of lexical items. The two studies just mentioned yield a broad division of Kurmanjî
into three main dialect groups: Southeastern Kurmanjî (SEK), Southern Kurmanjî
(SK), and Western Kurmanjî (WK). Their approximate respective locations are
indicated in Figure 3 below (see §4 for details).
Figure 3: Approx. locations of three main Kurmanjî dialect zones

134 Geoffrey Haig
The situation is best captured in terms of a northwest-to-southeast dialect con-

tinuum, with the two endpoints Southeastern Kurmanjî and Western Kurmanjî as
the most clearly differentiated dialects. In the intermediate region, provisionally
termed here “Southern Kurmanjî” (SK), dialect boundaries are blurred, and fea-
tures of both WK and SEK are found to varying degrees. Probably the most diver-
gent dialect is SEK, which includes Behdinī of North Iraq (see Haig, this volume,
chapter 3.3, §4).
An initial impression of dialectal diversity can be gained by considering the
distribution of the lexical item used to express the English activity verb ‘speak’. In
Kurmanjî, at least eight distinct lexical items (including complex predicates) are
used to express the English verb ‘speak’. They are provided in Table 17:
Table 17: Lexical expressions for SPEAK (numbers in brackets refers to numbering in
Figure 4)
deyn/deng kirin (1)
ştaxilîn (2)
xeber dan (3)
axaftin (4)
peyivîn (5)
qise kirin (6)
qez kirin (7)
şor kirin (8)
The distribution of these eight variants is largely geographically determined, and

is graphically represented in Figure 4. Each triangle represents the location of
a speaker, while the numbers refer to the variant of ‘speak’ which she used in
response to a translation task (see Table 17 for the actual variants). Most of the
data were extracted from the Database of Kurdish dialects (Matras et al. 2016); see
Haig and Bulut (2017) for details of the methodology and the map.
It is evident that SEK fairly consistently uses variant 4 axaftin, while WK
uses fairly consistently 1 deyn kirin. Elsewhere areal trends are clearly visible, but
we also find, for example, that variant 3 xeber dan is widely used throughout the
central region.
An area of morphosyntax where the dialectal divisions mentioned in Figure 3
are also relevant is adpositions, and word order. A particularly clear example con-
cerns the position, and adpositional marking, of the Addressee argument of the verb
‘tell, sayʼ, expressed through the lexeme gotin (with some phonological variants)
in all dialects of Kurmanjî. Three main constructions are associated with this verb,
and are illustrated in examples (39–41). In (39), typical for SEK, the Addressee
is post-verbal, and the verb carries the so-called directional particle (see §2.6.5),
the attenuated remnants of an earlier preposition. In (40), typical for Standard K.
and most of the core of the Kurmanjî speaking zone in Anatolia, the Addressee
Figure 4: Distribution of lexical variants for ‘speak’ (from Haig and Bulut, 2017)
is pre-verbal, and flagged through a circumposition, ji … =ra. Finally, in (41),

typical of WK and the northern peripheries of Anatolia, we find the Addressee
flagged solely through the postpositional clitic =ra, clearly a reduced form of the
circumpositional variant illustrated in (40), via loss of the preposition.
The three examples each represent translations of the sentence ‘She/that
woman told me’, and were extracted from the data for their respective locations
from the Database of Kurdish Dialects (Matras et al. 2016). Note that gotin is a
transitive verb in Kurmanjî, hence the subjects are in the Oblique case in these
past-tense clauses. Figure 5 is a map compiled by the present author to indicate the
areal distribution of these three variants, with each point indicating the location of
a speaker from the Database of Kurdish Dialects (Matras et al. 2016).
(39) Şemzinan (SEK)
ewê got=e min
3 sg . obl . f tell. pst .3 sg = drct 1 sg . obl
‘She told me’
(40) Bingöl (SK)
wê jin-ê ji mi=ra gotibû
3 sg . obl . f woman-obl . f adp 1 sg . obl = adp tell. pprf .3 sg
‘That woman told me’
136 Geoffrey Haig
(41) Elbistan (WK)

wê mi=ra go
3 sg . obl . f 1 sg . obl = adp tell. pst .3 sg
‘She told me’
Figure 5: Distribution of construction types with the verb gotin in Kurmanjî

Key: white=post-verbal, with directional particle (39); grey=pre-verbal, with
circumposition (40); black=pre-verbal, with postposition (41).
Having briefly illustrated the main dialectal divisions within Kurmanjî, I will turn
to some more specific features of phonology and morphosyntax, focussing on fea-
tures where SEK and WK show the most divergent features with respect to Stand-
ard K., and what has been provisionally termed Southern Kurmanjî above. Again,
it must be stressed that Southern Kurmanjî is not a well-defined entity, but essen-
tially covers those areas that are not captured by SEK and WK. Within SK, there
is of course a great deal of finer-grained dialectal differentiation, but it is beyond
the scope of this chapter to cover it.
3.1. Variation in Phonology
3.1.1. Phonological variation in SEK

In SEK of the Şemzinan dialect, there is a process of final-vowel cntralization:
A high front unrounded vowel [i:] is mostly centralized into an [ɨ] in word-final
position. The process affects certain inflectional morphemes, for example Stand-
ard Kurmanjî wî mirov-î ‘that man-obl . m ’ is wi mirow-i in Şemzinan, Standard
K. naç-î ‘neg .go.prs -2 sg ’ is naç-i in Şemzinan. It also affects some lexical items,
e. g. tiji as opposed to Standard K. tijî. However, otherwise a lexical final long [i:]
is preserved, as in spī ‘white’, tarī ‘dark’, or karī ‘a sort of plant’. Note that the
masculine ezafe and the oblique case following the indefiniteness suffix are not
subject to centralization, as seen in the following examples:10
mirov-ek-ī baš (man-indf - ez . m good) ‘a good man’;
xanī-yē mirov-ek-ī (house-ez . m man-indf - obl . m ) ‘a man’s house’
This feature is salient in the eastern half of the SEK dialect zone, but not found in
the western section such as in Dohuk or Amêdî (see Haig, this volume, chapter 3.3,
§4).
Another feature of SEK, particularly of northern Iraq (Haig, this volume,
chapter 3.3, §4.1.2), is the fronting of [u:] towards [y:]. In Behdinī of North Iraq
and in the southeastern dialects of Kurmanjî in Turkey, the fronting process is
accompanied by de-rounding, leading to [i:] in a number of lexical items, e. g.
[xæsi:] ‘mother-in-law’, or [di:ɾ] ‘far’ (Zakho, North Iraq), as opposed to Standard
Kurmanjî [xæsu:] and [du:ɾ]. Examples of fronting of [u:] to [y:], transcribed here
as <ü>, are given in Table 18 (examples from Şemzînan dialect):
Table 18: Vowel fronting in Southeastern Kurmanjî (SEK)

Şemz. Standard K. Gloss
stür stūr ‘thick’
mü mū ‘hair’
tü tū ‘mulberry’
bičük bičūk ‘child’
bük būk ‘bride’
Bilabialization of the voiced labio-dental fricative: Standard K. [v] is systemati-

cally seen as an approximant [w] in Şemzînan, similar to much of Central Kurdish
(Haig, this volume, chapter 3.3, §3.1), so the the contrast between [v] and [w]
10
The final [i:] of participles is centralized when the participle is used predicatively (dis-
cussed in §3.2.1), as in: mala wan a soti ‘their house has burnt down’. Otherwise, par-
ticiples retain the final long vowel.
138 Geoffrey Haig
is neutralized. For example, Standard K. av, şev, çav ‘water, night, eye’ are pro-
nounced aw, şew, çaw in Şemzînan. The process can also be observed in loan-
words, such as vazo ‘vase’ (from Turkish), which is wazo in Şemzînan. The few
occurrences of [v], as in vize viz ‘swirling of flies’ and bive ye ‘it’s dangerous’ in
child language, are onomatopoeic words and the speakers are usually not system-
atic in their pronunciation. In the Behdinī dialect of Dohuk, however, lenition of
Standard K. [v] is not evident, and in fact the [v] in syllable-final position tends
to be devoiced to [f]. Thus in these dialects, an opposition between [v] and [w] is
retained (Haig, this volume, chapter 3.3, §4).
3.1.2. Phonological variation in WK

The phonology of WK diverges from that of Standard K. in several respects. The
most striking is the backing and rounding of Standard K. [a:] to WK [ɔ:], shown
in Table 19.
Table 19: Backing and rounding of /a/ in Western Kurmanjî (WK)

Orthog. Standard K. WK Gloss
av [a:v] [ɔ:v] ‘water’
hatin [ha:tɨn] [hɔ:tɨn] ‘to come’
da [da:] [dɔ:] ‘s/he/it gave’
Standard K. [ɛ] or [æ] is also regularly retracted to a low central unrounded vowel
[æ̱ ] in WK. Thus Standard K. dest ‘handʼ and dev ‘mouthʼ ([dæst], [dæv]) become
[dæ̱ st] and [dæ̱v] respectively.
Turning to the consonants, the Standard K. [b] is lenited via [β] into an approx-
imant [w] in intervocalic, and in some cases, in word-initial and word-final posi-
tions. Note that the phenomenon is restricted to intervocalic position in other
dialects (such as northern part of Kurmanjî speech zone). The process regularly
affects an initial [b-] of verbal stems, when they are preceded by a tense, aspect,
mood, or negation prefix.
Table 20: Lenition of pre-vocalic /b/ in Western Kurmanjî (WK)

Orthog. Standard K. WK Gloss
hebek [ħæbæk] [ħæ̱wæ̱ k] ‘one unit’
seba [sæba:] [sɛwa:] ‘because of’
bīne [bi:næ] [wi:næ̱ ] ‘Bring (it)!’
bibīne [bɨbi:næ] [bɨwi:nɨ] ‘(If s/he) sees (it)’
nebēže [næbe:ʒæ] [mæ̱ we:] ‘Do not say!’
kitēb [kʰɨte:b] [kʰɨte:w] ‘book’
An epenthetic vowel [ɨ] (see §2.1) in a number of Standard K. function words and

inflectional morphemes is regularly a full vowel [æ] in WK (similar to Sorani/
Central Kurdish to which, geographically, WK is the most distant region). For
example, the indicative present suffix di- generally loses its vowel, and is reduced
do [d-], or [t-] in e. g. SEK (the entire morpheme is generally absent in the Mardin
region of Southern Kurmanjî), just as the epenthetic vowel of the basic preposi-
tions is likewise often lost in SEK. In WK, on the other hand, we find the indica-
tive prefix [dæ-], as in dæ-kɨm ‘I doʼ, or the preposition læ for SEK [l(ɨ)] ‘atʼ.
Notice that a pharyngeal [ʕ] is altogether not attested in the data of the Elbistan
variety of WK dialect. That is, the few words which are most prone to the devel-
opment of pharyngeals in Kurdish dialects, such as Standard K. mar ‘snake’, tehl
‘bitter’, čav/čehv ‘eye’, do not contain a pharyngeal phoneme.
3.2. Variation in the Ezafe construction
3.2.1. Ezafe in SEK

With definite nouns, the singular ezafe forms are the same as in Standard K., but
there are some differences in the plural ezafe and elsewhere, summarized below,
see also Haig (this volume, chapter 3.3, §4):
Table 21: The Ezafe in Southeastern Kurmanjî (SEK)

masc fem pl. (masc./fem.)
Definite -(y)ē -(y)a -(y)ēd/-(y)ēt
Indefinite -ī/-ē -e/-a
Demonstrative ezafe yē ya yēt
One of the features distinguishing SEK from Standard K. and the rest of Kurmanjî
is the use of the ezafe as a predicative element, rather than as part of the noun
phrase. This phenomenon is discussed in MacKenzie (1961a: 205–208) and in
Haig (2011); here we will only briefly outline it for Şemzînan (and SEK). Essen-
tially it involves an ezafe which agrees in number and gender with its antecedent,
but does not link that antecedent to some modifier; instead it introduces a verb
phrase. Examples of this kind of usage are given below. (42) illustrates a clausal
expression of possession (realized via the copula in Standard K.).
(42) min du bičūk-ēt he-y
1 sg . obl two child-ez . pl existent-not . analyzed
‘I have two children’
140 Geoffrey Haig
Predicates expressing location require a clause-final copula in Standard K., but in

SEK the ezafe suffices:
(43) pīrežin-ek-ē li bin dīwar-i
old.woman-indf - ez . m at under wall-obl . m
‘An old woman is at the base of the wall.’
In the present tenses of clauses with a nominal or adjectival predicate, the copula
is combined with the ezafe.
(44) kuř-ek yī zīrek=e
boy=def ez . m clever=cop . prs . sg
‘The boy is clever.’
With finite verbs in the present tense, the ezafe expresses a progressive aspect:
(45) ber-ē xū da-yē kičik-a di-bēž-īt
direction-ez . m self give.pst -3 sg . goal girl-ez . f ind -say.prs -3 sg
‘(S/he) looked at her (and saw that) the girl is saying: …’
With past participles as main predicates, it forms the present perfect tense:
(46) hirč-ē yē xū lē da-y
bear-obl . f ez . m self at.him give.pst -ptcp
‘The bear has attacked him.’
(47) pīrežin-ē ser-ē da-na-y-e se
old.woman-obl . f head-ez . m prv -put.pst - ptcp - drct on
ber-ek-ē
stone- indf - obl . m
‘The old woman has put her head on a stone.’
Although it is fairly uncontroversial that these particles are etymologically iden-
tical with the ezafe, they are in fact not fully identical in form with the adnominal
ezafes discussed further above, and there is some inconsistency in the forms used.
In general, there seems to be a tendency for overgeneralization of the form -(y)ē,
regardless of the gender of the antecedent (cf. (46) above). In the Yezidī dialects
of Tur ‘Abdîn discussed in Bailey (2005), there is a similar neutralization of these
tense-ezafes, leading to a uniform -ī.
3.2.2. Ezafe in WK
The ezafe forms and their functions constitute another domain where the WK
dialect diverges remarkably from Standard K. Related forms are below:
Table 22: The Ezafe in Western Kurmanjî (WK)

masc fem pl. (masc./fem.)
Definite -ī / -ē -ē / -ɔ -ē
Indefinite -ī -ē -e
Dem. ezafe ī ɔ ē
With definite nouns the basic ezafe forms are -ī and -ē. The alternative forms -ē
and -ɔ, masculine and feminine respectively, which are parallel to Standard K.
forms, occur rarely and the conditions of their occurrence are not yet clear. The
plural ezafe, as in Mardin, is a reduced form -ē. Thus, theoretically, in some cases
of definite nouns, gender and number distinctions of ezafe are neutralized, illus-
trated in following examples:
ziman-ī/-ē mi ‘my tongue’ (masc. sg.)
mɔl-ē/-ɔ min ‘my home’ (fem. sg.)
sēv-ē mi ‘my apples’ (pl.)
mɔl-ē bɔv-ī te ‘your father’s home/house’ (mɔl ‘home’: fem. sg.; bɔv
‘father’: masc. sg.)
In indefinite nouns, however, the alternative forms are not used at all. Thus, the
ezafe forms in indefinite nouns are the same with Standard K. in singular mascu-
line and plural but differ from Standard K. in feminine, illustrated below:
kečik-ek-ē rindik ‘a lovely girl’ (fem. sg.)
mērik-ek-ī gir ‘a big man’ (masc. sg.)
pisīk-n-e řeš ‘(some) black cats’ (pl.)
The demonstrative or pronominal ezafe forms are substantially different from
Standard K. and other dialects. A three way distinction (singular feminine and
masculine, and plural) is preserved albeit with different forms.
æ̱ v pisīkɔ ɔ min=e ‘This cat (fem) is mine.’
æ̱ v xɔynɔ ī min=e ‘This house (masc) is mine.’
æ̱ v xɔynɔnɔ ē min=in ‘These houses are mine.’
The most distinctive feature of Elbistan WK morphosyntax is the obligatory use of
what appears to be an ezafe, which cliticizes to the subject constituent of certain
types of clauses. The examples below show the construction in copular clauses:
142 Geoffrey Haig
æ̱z-ī/-ē gir=im ‘I (masc./fem.) am big.’

t-ī/-ē gir=æ ‘You (sg. masc./fem.) are big.’
æ̱w-ī/-ē gir=æ ‘She/he (masc./fem.) is big.’
æ̱m-e gir=in ‘We are big.’
hūn-e gir=in ‘You (pl.) are big.’
æ̱w-ē gir=in ‘They are big.’
With negated non-verbal predicates, the ezafe particle attaches to the negation
marker (Kömür 2003: 19):
az nî birçî ma ‘I (male) am not hungry’ (ne+î=nî)
az nê birçî ma ‘I (female) am not hungry’ (ne+ê=nê)
The ezafe forms used with the singular pronouns correspond to the indefinite sin-
gulars (see above), while the plural indefinite ezafe is used only with pronouns of
the first and second person plural. For the third person plural, the definite plural
ezafe is used. These particles introduce gender distinctions into the first and second
person singular of non-verbal clauses.
3.3. Adpositions in SEK

The system of adpositions in SEK differs from that of standard K. in several
respects. Some of these are illustrated in (48), from Haig and Öpengin 2018
(glosses simplified). One of the three basic prepositions of standard K., ji ‘from’,
is only present in SEK in a few formulaic expressions. In SEK, the sense of ‘from’
is covered by li, which also expresses ‘in, at’. Example (48b) illustrates the prep-
osition li in the sense of ‘from’. Benefactives in SEK are expressed through the
preposition bo, rather than the standard K. circumposition ji … ra. This is shown
in (48a). In addition to Standard K. bi ‘with, through’, SEK has also (li)gel or (di)
gel ‘with’, as well as a further circumpositional di … da ‘inside’, which is often
reduced to the postpositional element (cf. 48c).
(48) a. tu hinde šīrē kīwīya nešēy bo min
you some milk.ez . m goat neg .can.2 sg for me
bīni
subj .bring.2 sg
‘Can’t you bring some goat milk for me?’
b. ez dē šīrē kīwīya li kē_derē īnim
I fut milk.ez . m goat from where (subj )take.1 sg
‘Where shall I bring the goat milk from?’
c. du šēx małekē da čēnabin
2 sheikh(pl ) hous.ind . obl in prv . neg .be.pl
‘Two sheikhs in one house can’t be.’
The prepositions li, bi and di are never realized as they are cited here; they are
reduced to the consonantal element when preceding a vowel, and they show
metathesis to il, ib, id preceding a consonant. They are thus realized as enclitics on
whatever element precedes the prepositional phrase. This could be considered part
of a general tendency to tolerate more complex syllable codas in SEK as opposed
to dialects to the north and west. However, it also needs to be noted that simple
prepositions are often completely elided (shown in parentheses), as seen in this
proverb (49):
(49) čūn (ji/li) mirū=ye, hatin (ji/li) xudē
going (from) man=cop .3 sg coming (from) God
‘Going is (from) man, returning (from) God.’
SEK makes use of simple ‘bo X’ construction, as in (48a) rather than Standard K.
circumposition‘ji X re’ for expressing benefactive; ‘(li)gel X’ rather than Standard
K. ‘bi X re’ for comitative. Finally, the common Standard K. postpositional particle
ře/řa exists in SEK only in a circumposition di … řa ‘through’ or its contracted
pronominal form tē řa ‘through it’.
As in Standard K., there are also complex prepositions composed of a simple
preposition and a local noun (cf. §3.5). In such combinations, the basic preposition
is generally dropped, yielding what appears to be a new set of simple prepositions:
(li) se ‘on’ (Standard K. li ser), (li) nik ‘beside’, (li) bin ‘under’.
3.4.1. SEK verbal morphology
3.4.1.1. Stem formation

As noted in §2.6, a number of Standard K. verbs have present stems consisting of
either a bare consonant, or arguably, a consonant plus the short central vowel. In
SEK, these verbs have what we refer to as “heavy” present stems, consisting of the
initial consonant plus a vowel [æ] (orthographically <e>), or in the case of xwarin,
a [o]. This is a feature shared in most of the SEK speech zone as well as in Central
Kurdish. Examples of first person present tense forms of such verbs are in (50):
(50) bi-ke-m bi-be-m
subj -do.prs -1 sg subj -take.prs -1 sg
‘I’ll do’ ‘I’ll take’
di-de-m di-xo-m
ind -give.prs -1 sg ind -eat.prs -1 sg
‘I give’ ‘I eat’
144 Geoffrey Haig
The present and past stem of a number of intransitive verbs (mostly “unaccusative”)
in SEK have an extension -(i)yē (for present) and -(i)ya (for past); see Table 23. Fur-
thermore, in a number of verbs, also shown in Table 23, the present and past stems
of the verbs in SEK/Şemzînan are different from Standard K. and other dialects.
Table 23: Comparison of verb stems in Şemzînan (SEK) and Standard K.

Past Present Infinitive
Infinitive +Gloss
Standard K. Şemz. Standard K. Şemz. +Gloss
hišt- hēla- hēlan ‘leave’ -č- / -her- -č- čūn ‘go’

anī- īna- īnan ‘bring’ -gih- /-gihīž- -geh- gehištin ‘reach’
axivī- axiwt- axiftin ‘speak’ -ē- -hē- hatin ‘come’
ēšīya- ēša- ēšan ‘hurt’ -kišīn- -kēš- kēšān ‘pull’
kišand- kēša- kēšan ‘pull’ -řiž- -řižiyē- řižiyan ‘spill’
řižand- řēt- řētin ‘spill’ -řižīn- -řēž- řētin ‘pour’
avēt- howēt- howētin ‘throw’ -āvēž- -howēž- howētin ‘throw’
ajot- ha(w)jot- ha(w)jotin ‘drive’ -x- -ēx- ēxistin ‘drop’
xist- ēxist- ēxistin ‘drop’ -girī- -girī-/-giriyē- giriyan ‘weep’
3.4.1.2. Preverb incorporation
In Standard K., there is a set of opaque preverbal particles such as hil, řā, da, which
combine with verb stems to create new verbs. In the infinitive, they are usually
written together with the stem as a single item. However, inflectional prefixes such
as negation, or indicative/imperfective, are inserted between the preverb and the
stem, as in Standard K. ra-di-keve ‘goes to sleep’, from raketin ‘go to sleep’. In
SEK, however, negation and imperfective prefixes will often precede these pre-
verbal particles, indicating full lexicalization of preverb+stem and the creation of
a new stem. The same phenomenon is also found in the southernmost dialects of
Southern Kurmanjî, for example around Midyat; see Haig and Öpengin (2018),
ex. (75) and accompanying discussion.
Examples (51a) and (51b) show preverb incorporation in Şemzînan (SEK).
In (51a), the present indicative form of the verb hel-(h)atin ‘preverb-come’
(=‘rise’) has the indicative prefix preceding the preverbal element, and in (51b),
both negation and indicative prefixes precede the preverbal element.11
(51) a. řoj spēdē zū di-helē-t11
sun morning quick ind -rise.prs -3sg
‘The sun rises early in the morning.’
11
The verb form can be analysed as a contraction of di-hel-hē-t (ind - prv -come.prs -3 sg ).
Cf. the corresponding standard K. form hil-t-ê (prv - ind -come.prs .3 sg ).
b. ez heta hēwari žī ne-di-řa-westīya-m

1 sg until evening also neg - ipfv - prv -stand.pst -1 sg
‘I would not stop (working) until evening.’
Furthermore, in some highly lexicalized and frequent light verb constructions, the
negation “prefix” can even occur on the leftmost edge of the verbal complex,
preceding the non-verbal elements of the construction, as shown in (52).12
(52) min ne=ber-ē xū da-yē
1 sg . obl neg =direction-ez . m self give.pst -3 sg .goal
‘I did not look at (him/her/it).’
3.4.1.3. Additional aspect distinctions

As already discussed under §3.2.1, an analytic “present progressive” can be
expressed in SEK by using the ezafe, as illustrated in (53).
(53) ber-ē xo da-yē kičik=a di-bēž-īt
direction-ez . m self give.pst -3 sg .goal girl=ez . f ind -say.prs -3 sg
‘(S/he) looked (and saw that) the girl is saying (something).’
Other dialects of Kurmanjî in Turkey lack this possibility. Similarly, an alternative
present perfect can be created using the ezafe in combination with past participles,
as shown in examples (46) and (47). Note that these two tenses, present progres-
sive and present perfect tense, constructed using the ezafe, are mostly restricted to
affirmative and declarative clauses, as they are in Behdinī (cf. Haig 2011, Haig,
this volume, chapter 3.3, §4). However, in Şemzînan the present progressive does
lend itself to negation, illustrated in (54).
(54) axir tu yē na-xo-y
finally 2sg ez . m neg -eat.prs -2sg
‘But you are not eating!’
3.4.2. Verbal morphology in WK
3.4.2.1. Person marking

The WK person marking system differs from Standard K. in that the copula forms
of 2sg and 3sg are merged in -(y)æ. Similarly, the 2sg and 3sg verbal agreement
suffixes are merged in -i [ɨ]; see Table 24. In this manner, similar to the Mardin
12
In Mêrd. dialect, a similar incorporation of preverbal particles can also be observed,
cf. §4.2 in Haig and Öpengin (2018).
146 Geoffrey Haig
dialect, but unlike Şemzînan dialect of SEK, the person marking distinctions on
verbs is reduced to three levels: 1sg – 2sg /3sg – 1pl /2pl /3pl .
3.4.2.2. Gender marking in the predicate

In WK, an ezafe particle attaches to the subject of non-verbal predicates, as shown
in §3.2.2. A very similar usage also obtains in the present indicative, as in the
following:
(55) a. æw-ī t-er-i
3sg -ez . m ind -go.prs -3sg
‘He goes’
b. æz-ē dæ-gē-m
1sg -ez . f ind -reach.prs -1sg
‘I (female) am arriving.’
(56) a. æz-ē te dæ-pē-m
1sg -ez . f 2sg .obl ind -wait.prs -1sg
‘I (female) am waiting (for) you.’
b. t-ī dar-an xiš dæ-k-æ
2sg -ez . m wood-pl .obl prv .cut ind -do.prs -2sg
‘Are you (male) cutting the wood?’
However, it is yet to be confirmed whether clauses with full verbs in the past
tenses allow for the subject to be further marked by the ezafe forms. It is absent on
a number of past tense sentences in Çapar (2009). We conclude provisionally that
ezafes attach to the subjects of present tense verbs, and to copular constructions
irrespective of the tense, but we await a full account of the conditions on the use
of the ezafe in other verbal constructions.
(57) æz-ē dæ-zɔn-im k=æw-ī læ vir bū
1sg -ez . f ind -know.prs -1sg that=3sg -ez . m in here be.pst .3sg
‘I know that he was here.’
(Çapar 2009: 63)
(58) [pisīk-n-e řeš] geyrɔ-n
cat-indf .pl -ez black roam.around.pst -3pl
‘The black cats roamed around.’
(59) řē-yɔ xa šaš-miš kir
road-ez . f self wrong-mIš do.pst .3 sg
‘I lost my way’
The ezafe forms marking the subject in the present tense and copular construc-
tions apply also to non-pronominal subjects. The resulting forms are (superfi-
cially) identical with oblique marked agents in past tense constructions of Stand-
ard K. and other dialects: Musayī læ viræ ‘Musa-ez . m (is) here’. With plurals, the
ezafe applies regardless of whether the subject carries the plural oblique suffix, as
in (60b), or does not carry it, as in (60a).
(60) a. pisīk-ē šīr væ-dæ-xɔ-n
cat-ez .pl milk prv -ind -eat.prs -3pl
‘The cats are drinking milk.’
b. pisīk-ɔn-ē šīr væ-dæ-xɔ-n
cat-obl .pl -ez .pl milk prv -ind -eat.prs -3pl
‘The cats are drinking milk.’
Note that in this dialect, the plural oblique case suffix has been generalized to
apply to nouns which in Standard K. would be in the direct case, as in (60b).
However, it does not seem to have been fully reanalyzed as a generic plural suffix,
since it does not systematically mark all the plural entities, hence the variation
between (60a) and (60b).
With complex subject noun phrases, the particle occurs at the end of the subject
phrase, as in (61):
(61) a. pisīk-n-e řeš-ē šīr væ-dæ-xɔ-n
cat-indf .pl -ez black-ez .pl milk prv -ind -eat.prs -3pl
‘The black cats are drinking milk.’
b. vī īlag-ɔ-y qilēr=e
this shirt-prox -ez . m dirty=cop .3sg
‘This shirt is dirty.’
The same system apparently also applies to the future tense, according to the
description in Kömür (2003: 18–20). The relevant forms are provided in Table 24.
I assume that in the plural, the relevant clitic is uniform =ê, though this is not
shown in the source. Note also the identical person marking suffixes in the second
and third persons, as mentioned above. Unfortunately, we lack a detailed analysis
of the verbal system of these dialects.
Table 24: Gender marking in Western Kurmanjî verbs (‘send’, present stem şîn-)
P resent F uture
1 sg . m az=î da-şîn-im az=î ku bi-şîn-im
1 sg . f az=ê da-şîn-im az=ê ku bi-şîn-im
2 sg . m tu=yî da-şîn-i tu=yî ku bi-şîn-a
2 sg . f tu=yê da-şîn-i tu=yê ku bi-şîn-a
3 sg . m aw=î da-şîn-i aw=î ku bi-şîn-a
3 sg . f aw=ê da-şîn-i aw=ê ku bi-şîn-a
148 Geoffrey Haig
3.4.2.3. Verbal negation

The negation prefix in past imperfective verb forms is nɔ-, identical with the nega-
tion prefix used in present indicative verbs, as in (62). In this feature, WK differs
from Standard K., which uses the same negation prefix for all past tense verbs, and
a different one for the indicative present. Furthermore, there is a distinct negation
prefix for imperatives, mæ-, as in mæ-wē ‘do not say (it)’.
(62) gɔv-ɔ k=æz-ē læ mereš-ē wū-m min
time-ez that=1sg -ez in place.name-obl be.pst -1sg 1sg .obl
pir sēv nɔ-dæ-xɔr-in
many apple neg .ipfv -ipfv -eat.pst -3pl
‘When I was in Maraş, I would not eat so many apples.’
3.4.2.4. Turkish miş-verb forms in WK

A ubiquitous feature of all the western dialects is the massive influx of Turkish verb
forms based on the Turkish perfect/evidential suffix -mIš, combined with Kurdish
light verbs, for example an(l)amīš kirin ‘understand’ (Tk. anlamış), qapatmīš kirin
‘close’ (Tk. kapatmış). The widespread use of such forms constitutes an important
feature of these dialects as opposed to those of the southeast such as SEK, or SK,
where at least in the speech of older speakers, such forms are rarely used (e. g.
the extensive text material of Ritter, from Midyat region, or that of Nikitine from
Şemzinan (in MacKenzie 1995) contain hardly a single form). But from WK, they
are well attested in older sources (e. g. in the Kurmanjî texts of Le Coq 1903), and
many are firmly established and phonologically adapted, as in (63).
(63) a. min řē-yɔ xa šaš-mīš kir
1 sg . obl road-ez . f self wrong-mIš do.pst
‘I lost my way.’
(Çapar 2009: 63)
b. insɔn-? dayan-miš na-b-ī ki
human-ez stand-mIš neg -be.prs -3 sg ptcl
‘One cannot endure it.’
3.5. Issues in Western Kurmanjî (WK) syntax

Reflexive pronoun: In WK the reflexive pronoun in possessor function is general-
ized to be used in contexts where it is not controlled by a co-referential subject. It
is thus used in much the same way as a 3sg oblique pronoun, as in (64).
(64) a. bɔv-ē xe čū-ye alwistan-ē

father-ez . m refl go.pst -drct 13 place.name-obl . f
‘His/her father has gone to Elbistan.’ (Standard K.: bavê wî …)
b. ferq-a xe či=ye
difference-ez . f refl what=cop .3sg
‘What is its difference?’
A particle ki, homophonous to the particle also used in functions such as rela-
tive particle and subordinating conjunction, expresses the modality of “having the
intention of doing something” (glossed as MOD), illustrated in (65).
(65) Sudi ew ki hata türk baqɔl-ē har-in,
tomorrow 3pl mod until turkish grocery-obl . f go.prs -3pl
ez=jī ki vē=rɔ har-im
1sg =also mod 3 sg . obl . f = postp go.prs -3 sg
‘Tomorrow they will go to the Turkish grocery store, I will also go with her.’
(Çapar 2009: 78)
The ki particle can be used with the subject-marking ezafe, but it cannot be used
with a future tense particle -ē. Note finally that the particle might originate from
the auxiliary use of the verb kirin ‘do’ (present stem: ki-). In Standard K. and
in central areas of Kurmanjî speech zone, as in SK, the conjugated form of the
verb kirin is employed as the auxiliary in expressing the prospective aspect or the
“immediate future”.
The conditionals in WK usually incorporate the Turkish clausal enclitic condi-
tional marker =se to mark the verb of the protasis,14 as in (66). But the conditional
conjunction eger and more widely the ki particle can also start the sentence.
(66) tu hat=se telafon-a mi ke
2sg come.prs =cond phone-ez . f 1sg .obl do.imper .2sg
‘Call me if you come.’
(Çapar 2009: 64)
Note that the ki relative/subordinating particle (Standard K. ku) is formally the
same with the corresponding Zazaki (Haig 2001: 202; Paul 1998) and in all its
functions it is usually a proclitic and reduced to the sole consonantal element.
The Standard K. adhortative particle bila does not exist in WK, a form ma is
used in this function, as in (67).
13
This may be a present perfect formative, widely used in this dialect, rather than the
directional particle. It is impossible to decide in this context (they cannot both be
overtly realized on the same verb).
14
This is observed also for the geographically close Tunceli (Kr. Dersim) Kurmanjî in
Haig (2006).
150 Geoffrey Haig
(67) tēlefon-a Domi ki-m, ma wer-i

phone-ez . f proper.name do.prs -1sg hort come.prs .subj -3sg
‘I shall call Domi so that he comes’
(Çapar 2011: 78)
4. Northern Kurdish in eastern Anatolia: summary of main contact

issues
Northern Kurdish is spoken across most of eastern Anatolia, and has thus been
exposed to contact influence from several different languages: In the southeast, it
has co-existed for centuries with Neo-Aramaic and local varieties of Arabic, while
probably the most important historical contact language in central and northeast-
ern Anatolia would have been Armenian. More recently, Turkish has exerted con-
siderable influence on all varieties of Kurmanjî, through Turkish-language mass
media, compulsory schooling, military service, and large-scale migration to the
main administrative centres, where representatives of the Turkish state tend to be
concentrated.
Areally, Kurmanjî is split across the Mesopotamian zone and the Caspian/Cau-
casian zone, and variation in morphosyntax corresponds broadly to this north/
south divide (Haig 2017). For example, the SEK dialects are firmly within the
Mesopotamian zone, and here we find widespread use of non-canonical sub-
jects with experiencer predicates, modal ‘want’, and expressions of possession
(Haig 2006, 2017), a greater reliance on prepositions (§3.3), and a larger range
of arguments that can occur post-predicatively (Haig 2015). The dialects of the
north and west lack these features. This ties in with the general picture of Semitic
influence in the southeast with a gradual fade-out northwards and westwards. But
not everything fits this picture. It is quite unclear, for example, how areal consid-
erations would be relevant in understanding the Western Kurmanjî constructions
with the ezafe particle in the verbal domain (§3.2, 3.4).
In general, the core areas of Kurmanjî morphology show relatively little evi-
dence of heavy structural borrowing (Haig 2007: 180). Most plausible candidates
for contact-induced developments stem from phonology, lexicon, and syntax. The
following list of candidate features for contact influence is not exhaustive, but
merely illustrates some of those discussed in the literature:
1. Additional series of voiceless obstruents, presumably in part through Armenian
influence (§2.2);
2. Pharyngealization, presumably through Semitic influence, but building on
inherited features of the phonological system (Barry 2017);
3. Close similarities across the vowel systems in the languages of Anatolia
(Haig 2017: 402);
4. Borrowing of Turkish conditional clitic =ise (cf. (66) from WK);
5. Use of Turkish -mIš-verb forms, incorporated into Kurdish complex predi-

cates (cf. (59), (63) from WK);
6. Western Kurmanji dialects: numerals 11–19 follow Armenian pattern, rein-
forced by Turkish (‘11’ dah-u-yek etc. instead of yānzdah etc.), WK only;
7. Strategies for clause linkage (Matras 2002);
8. Common Anatolian clause-final copula construction (Haig 2017);
9. Borrowing of Turkish comparative particle daha;
10. Loss of the rule for reflexive binding with xwe ‘self’ (WK only, see
Haig 2006, §3.5)
Previous research (Dorleijn 1996, Haig 2006, Haig 2007) has tended to focus on

Turkish influence on Kurmanjî. While contemporary spoken Kurmanjî is undoubt-
edly heavily influenced by Turkish, it is important to consider the issue from a
longer-term perspective. If we consider the situation of Kurdish prior to the found-
ing of the Turkish state in 1923, there is little evidence of Turkish influence on
much of Kurmanjî. Original texts recorded as late as the 1960’s by Ritter (1971,
1976, see §5), show few traces of Turkish influence, either in lexicon or mor-
phosyntax. The same holds for most of what I have above termed Southeastern
Kurmanjî (SEK), particularly in the far southeast of the country. There are still
monolingual speakers of Kurdish in this region today, and we can reasonably
assume that this was much more widespread a century ago. The texts compiled
by Nikitine from this region in the early twentieth century reflect reasonably reli-
ably the Kurdish at the time, and illustrate the general paucity of Turkish influ-
ence (see MacKenzie (1995) for a critical edition of one Nikitine’s texts). In what
we have termed Western Kurmanjî (WK), Turkish influence appears more deeply
entrenched, and is evident in the texts of Le Coq (1903). In these texts, provided
by speakers from Zincirli, west of today’s Gaziantep, we already find the reflexive
pronoun used as a general possessive marker, without being subject to the binding
conditions that apply to Standard Kurmanjî, we find the numerals 11–19 in the
Turkish/Armenian form rather than the inherited Iranian form, and a scattering
of Turkish miş-verbforms. But none of these sources show anything approaching
the massive Turkish influence (e. g. in terms of loan words, code-switching, Turk-
ish-influenced syntax) that characterizes the casual speech of many Kurds today.
While the data is still very sketchy, it seems reasonable to assume that up until the
beginning of the twentieth century, Kurmanjî speech communities were able to
foster and preserve their language over a vast region, and ensure unbroken trans-
mission across generations. Levels of loan words in the basic vocabulary (see e. g.
Haig and Öpengin 2014) are also low; despite centuries of co-existence, there are
remarkably few clear cases of Armenian or Neo-Aramaic borrowings in the basic
Kurmanjî lexicon, suggesting that Kurmanjî was not under any particular pressure
from these languages (Kurdish loanwords in Neo-Aramaic on the other hand, are
extremely numerous, see Khan 2007).
152 Geoffrey Haig
Considering today’s situation, the most crucial difference to pre-republican times

is the incomplete childhood acquisition of the full spectrum of grammatical and
lexical oppositions of Kurdish. The destruction of traditional village networks, the
intrusion of Turkish into the domestic sphere via the media, and most importantly,
pre-school and primary school monolingual language policies, means early acqui-
sition of Kurdish is interrupted, with concomittant loss or simplification of lexical
and grammatical structure. Thus much of what is often considered “Turkish influ-
ence” (see e. g. Dorleijn 1996) can also be interpreted as the result of imperfect
acquisition.
5. Short glossed text
The following text is an excerpt from the story ‘The poor man, the snake, the
Jew, and good fortuneʼ, recorded in the 1960ʼs in one of the villages southeast of
Midyat which, at that time, were still inhabited by Yezîdîs. The speaker was the Pîr
of the local Yezîdîs, and the recording was made by a local Kurd, who was collab-
orating with the German Semitist Hellmut Ritter. They subsequently transcribed
and published this and several other texts, together with a German translation, in
Ritter (1976). Unfortunately, Ritter himself passed away in 1971, and the original
magnetic tapes have never been recovered.
These stories represent one of the very few reliable records of spoken Kurmanjî
from this period, and are typical of the oral tradition preserved in e. g. MacKen-
zie (1962), or Blau (1975), and discussed in Turgut (2012). Ritterʼs original tran-
scription is phonetic rather than phonemic, and uses quite idiosyncratic symbols,
making it rather inaccessible. In the version provided below, I have adapted it to
the standard Kurmanjî orthography as outlined in Section 2 above, but the syntax
remains as in the original, and dialectal features are noted where necessary.
(68) roj-ek15-ê kerk-ê16 wî li mil-ê wî
day-indf - obl . f yoke-ez . m 3 sg . obl . m at shoulder-ez . m 3 sg . obl . m
ye
cop .3 sg
‘One day, his yoke is at his shoulder
15
In the original transcription, the indefinite suffix is rendered with -(i)k, but I have stand-
ardized it throughout. Reduction of the indefiniteness suffix is a typical dialect feature
of the Mardin region, where the suffix is realized as [-(ɨ)k], unlike Standard Kurmanjî
[-æk].
16
This word, transcribed in the original as kärkē (with ezafe), is translated into German as
“Holzgabel”, evidently a wooden artefact to enable a person to carry a load of wood on
his or her shoulder. I have not been able to trace it in any of the Kurdish lexical sources
known to me.
û bivr-ê wî li ser dest-ê wî ye

and axe-ez . m 3 sg . obl . m at on hand-ez . m 3 sg . obl . m cop .3 sg
and his axe is in his hand.
(69) ji xwe=ra di-ç-e dar-a çîy-ê.
for self=postp ind -go-prs .3 sg wood-ez . f mountain-obl . m
He is just going to the woods on the mountain.
(70) çû, dîn da-ye vaye mar-ek
go.pst .3 sg sight give.pst .3 sg = drct excl snake-indf
He went, and saw – whatʼs that, a snake
ser-ê xwe der-êxist ba(ng) kiri=yê,
head-ez . m self out-put.pst .3 sg calling do.prf .3 sg =3 sg . obl
has popped out its head and called to him
(71) go kur-o! go ha!
say.pst .3 sg fellow-voc . m say.pst .3 sg yes!
saying: “fellow!”, he said “yes?”
(72) go ka were ez bêj-im=e te
say.pst .3 sg prt come.imper . sg 1 sg say.prs . subj -1 sg = drct 2 sg . obl
He said: “won’t you come, that I may tell you (something)?”
(73) were cem mi(n)!
come.imper . sg to 1 sg . obl
“come to me!”
(74) belengaz goti=yê go ya haywan-ê
poor.man say.prf .3 sg =3 sg . obl say.pst .3 sg excl animal-ez . m
xwedê
god.obl . m
The poor man said to him, saying: “Oh creature of God,
(75) tu mar î û ez insan im
2 sg snake cop .2 sg and 1 sg person cop .1 sg
you are a snake, and I am a human.
(76) ez=ê çawa b-êm=e cem te? […]
1 sg = fut how subj -come.prs -1 sg = drct to 2 sg . obl
How should I come to you?”
(77) belengaz çû cem mêr sekinî
poor.man go.pst .3 sg to snake.obl . m stop.pst .3 sg
The poor man goes up to the snake and waits.
154 Geoffrey Haig
(78) ya haywan-ê xwedê derd-ê te çi

excl animal-ez . m god.obl . m trouble-ez . m 2 sg . obl what
ye,
cop .3 sg
“Oh creature of God, what is your plight?
(79) tu çi ji mi(n) di-xwaz-î? gôti=yê
2 sg what from 1 sg . obl ind -want.prs -3 sg say.prf .3 sg =3 sg . obl
go
say.pst .3 sg
What do you want from me?” (He) said to him:
(80) ka bost-ik-ê ji dûv-ê mi(n)
compl span-indf - obl . f from tail-ez . m 1 sg . obl
jê-bi-k-e!
from.it-subj -do.imper -2 sg
“cut off one span (unit of measure) from my tail!
(81) bost-a xwe bi-gir-e bi dûv-ê mi(n)
span-ez . f refl subj -take.imper -2 sg through tail-ez . m 1 sg . obl
Measure a span across my tail
û jê-ke!
and from.it-do.imper -2 sg
and cut it off!”
(82) bê17 belê tu kêm santîn-k-î
but 2 sg less centimetre-indf - obl . m
jê-k-e
from.it-subj -do.imper -2 sg
but if you cut off one centimetre too little
(83) ez=ê mal-a te xirab bi-k-im!
1 sg = fut house-ez . f 2 sg . obl ruined subj -do.prs -1 sg
I will destroy your house!
(84) bi-hêl-im pirç-a ser-ê te
subj -let.prs -1 sg hair-ez . f head-ez . m 2 sg . obl
bi-waş-e […]
subj -fall_out.prs -3 sg
I will cause the hair of your head to fall out.”
17
This reflects the original, though in Standard K. one might have expected lê belê here.
(85) ê go ya haywan-ê xwedê, ez

ez.m say.pst .3 sg excl animal-ez . m god.obl . m 1 sg
na-wêr-im
neg -dare.prs -1 sg
The other one said: “Oh creature of God, I don’t dare
dûv-ê te jê-k-im
tail-ez . m 2 sg . obl from.it-subj -do.prs -1 sg
to cut off your tail.”
(86) go me-tirs-e baxt-ê xwedê ji
say.pst .3 sg neg -fear.imper -2 sg fortune-ez . m god.obl . m from
te=ra
you=postp
He (the snake) said: “don’t be afraid, the fortune of God is with you.”’
Abbreviations
1 first person mod modality

2 second person neg negation
3 third person obl oblique
add additive perf perfect
adp adposition pl plural
aff affirmative postp postposition
compl complementizer pprf pluperfect
cond conditional pres present
cop copula prf perfect
dem demonstrative prox proximal
drct directional prs present
excl exclamative prt particle
ez ezafe marker prv preverbal particle
f feminine pst past
fut future ptcp participle
hort adhortative refl reflexive
imper imperative rel relative
ind indicative sg singular
indf indefinite subj subjunctive
ipfv imperfective voc vocative
m masculine
156 Geoffrey Haig
References
Asatrian, Garnik. 2009. Prolegomena to the study of the Kurds. Iran and the Caucasus 13.
1–58.
Bailey, Denise. 2005. A comparative study of grammatical relations in Northern Kurdish.
Göttingen: University of Göttingen, unpublished MA thesis.
Barry, Daniel. 2017. The emergence of pharyngeal sounds in Kurmanji Kurdish. Paper held
at the 7th International Conference on Iranian Linguistics, Institute of Asian and African
Studies, Lomonosov Moscow State University, 28–30 August, 2017.
Bedir Khan, Emir & Roger Lescot. 1970. Grammaire Kurde. Paris: Adrien Maisonneuve.
Blau, Joyce. 1975. Le Kurde de ’Amādiya et de Djabal Sindjār: Analyse Linguistique, Tex-
tes Folkloriques, Glossaires. Paris: Klincksieck.
Bynon, Theodora. 1979. The ergative construction in Kurdish. Bulletin of the School of
Oriental and African Studies 42. 211–224.
Çapar, Fidan. 2009. Code-Switching und Sprachkontakt bei kurdischen Migranten aus Maraş.
Frankfurt am Main: Johann Wolfgang Goethe-Universität, unpublished MA t hesis.
Dahl, Östen 2004. The growth and the maintenance of linguistic complexity. Amsterdam &
Philadelphia: Benjamins.
Dirêj, Evdila. 2011. Kurdî Kurmancî. Berlin: Dilop.
Dorleijn, Margreet. 1996. The decay of ergativity in Kurmanci. Tilburg: Tilburg University
Press.
Haig, Geoffrey. 1998. On the interaction of morphological and syntactic ergativity: Lessons
from Kurdish. Lingua 105. 149–173.
Haig, Geoffrey. 2001. Linguistic diffusion in present-day East Anatolia: From top to bot-
tom. In Alexandra Aikhenvald & R.M.W. Dixon (eds.), Areal diffusion and genetic
inheritance: Problems in comparative linguistics, 195–224. Oxford: Oxford University
Press.
Haig, Geoffrey. 2002. Noun-plus-verb complex predicates in Kurmanjî Kurdish: Argument
sharing, argument incorporation, or what? Sprachtypologie und Universalienforschung
/ Language typology and universals 55(1). 15–48.
Haig, Geoffrey. 2004. The invisibilisation of Kurdish: The other side of language planning
in Turkey. In Stephan Conermann & Geoffrey Haig (eds.), Die Kurden: Studien zu ihrer
Sprache, Geschichte und Kultur, 121–150. Schenefeld: EB.
Haig, Geoffrey. 2006. Turkish Influence on Kurmanjî: Evidence from the Tunceli Dialect.
In Lars Johanson & Christiane Bulut (eds.), Turkic-Iranian Contact Areas: Historical
and Linguistic Aspects, 283–299. Wiesbaden: Harrassowitz.
Haig, Geoffrey. 2007. Grammatical borrowing in Kurdish (Northern Group). In Yaron Ma-
tras & Jeanette Sakel (eds.), Grammatical borrowing in cross-linguistic perspective,
Haig, Geoffrey. 2011. Linker, Relativizer, Nominalizer, Tense-Particle: On the Ezafe in
West Iranian. In Yap Foong, Karen Grunow-Hårsta & Janick Wrona (eds.), Nominali-
zation in Asian Languages: Diachronic and Typological Perspectives, vol. 1, Sino-Ti-
betan and Iranian Languages, 363–392. Amsterdam & Philadelphia: Benjamins.
Haig, Geoffrey. 2015. Verb-Goal (VG) word order in Kurdish and Neo-Aramaic: Typolog-
ical and areal considerations. In Geoffrey Khan & Lidia Napiorkowska (eds.), Neo-Ar-
amaic and its linguistic context, 407–425. New York: Gorgias Press.
Haig, Geoffrey & Necle Bulut. 2017. Hearing, speaking, looking and learning: A pilot study
of lexical variation across Kurmanjî Kurdish. Wiener Jahrbuch für Kurdische Studien
5. Special edition: Katharina Brizić, Agnes Grond, Christoph Osztovics, Thomas
Schmidinger & Maria Six-Hohenbalken (eds.), Sprache-Migration-Zusammenhalt:
Kurdisch und seine Diaspora. 99–109.
Haig, Geoffrey & Ergin Öpengin. 2014. Kurdish: A critical research overview. Kurdish
Studies 3. 99–122.
Haig, Geoffrey & Ergin Öpengin. 2015. Gender in Kurdish: Structural and socio-cultural as-
pects. In Marlis Hellinger & Heiko Motschenbacher (eds.), Gender across Languages,
vol. 4, 247–276. Amsterdam & Philadelphia: Benjamins.
Haig, Geoffrey & Ergin Öpengin. 2018. Kurmanjî Kurdish in Turkey: Structure, varieties,
and status. In Christiane Bulut (ed.), Linguistic Minorities in Turkey and Turkic speak-
ing minorities of the peripheries, 157–230. Wiesbaden: Harrassowitz.
Jastrow, Otto. 1977. Zur Phonologie des Kurdischen in der Türkei. Studien Zur Indologie
und Iranistik 3. 84–106.
Khan, Geoffrey. 2007. Grammatical borrowing in North-eastern Neo-Aramaic. In Yaron
Matras & Jeanette Sakel (eds.), Grammatical borrowing in cross-linguistic perspective.
Khan, Geoffrey. 2008. The Neo-Aramaic dialect of Barwar, vol. 1, Grammar. Leiden: Brill.
Kömür, Mehmet. 2003. Taw dilo: Akçadaǧ-Elbistan aǧzıyla Kürt folkoründen seçmeler
[Taw dilo: Selections from Kurdish folklore from the Akçadaǧ-Elbistan dialects]. Istan-
bul: Weşanên Enstîtuya Kurdî ya Stenbolê.
Lewendî, Mehmûd. 1997. Kurdên Şêxbizinî [The Şêxbizinî Kurds]. Bîrnebûn 3. 78–98.
Le Coq, Albert von. 1903 [2003]. Kurdische Texte: Kurmānğí-Erzählungen und -Lieder,
nebst einer Zāzā-Erzählung. Teil I und II. Berlin. [Reprint Amsterdam: APA-Philo Press].
MacKenzie, David. 1961a. Kurdish dialect studies I. Oxford: Oxford University Press.
MacKenzie, David. 1961b. The origins of Kurdish. Transactions of the Philological Society
60(1). 68–86.
MacKenzie, David. 1962. Kurdish dialect studies II. Oxford: Oxford University Press.
MacKenzie, David. 1995. The Kurdish of Mullâ Sa’îd Shamdînânî. The Journal of Kurdish
Studies 1. 1–27.
Matras, Yaron. 1997. Clause combining, ergativity, and coreferent deletion in Kurmanjî.
Studies in Language 21. 613–653.
Matras, Yaron. 2002. Kurmanji complementation: Semantic-typological aspects in an areal
perspective. Sprachtypologie und Universalienforschung / Language typology and uni-
versals 55. 49–63.
Matras, Yaron et al. 2016. The Dialects of Kurdish. Web resource, University of Manches-
ter. http://kurdish.humanities.manchester.ac.uk/ (accessed 10 November 2017).
McWhorter, John H. 2005. Defining Creole. Oxford: Oxford University Press.
Oberling, Pierre. 2008. Khorasan i. Ethnic groups. Encyclopædia Iranica. New York:
Center for Iranian Studies, Columbia University. http://www.iranicaonline.org/articles/
khorasan-1-ethnic-groups (accessed 16 September 2017).
Öpengin, Ergin & Geoffrey Haig. 2014. Regional variation in Kurmanjî: A preliminary
classification of dialects. Kurdish Studies 3. 143–176.
Paul, Ludwig. 1998. The position of Zazaki among West Iranian languages. In Nicholas
Sims-Williams (ed.), Proceedings of the 3rd European Conference on Iranian Studies,
163–176. Wiesbaden: Reichert.
158 Geoffrey Haig
Paul, Ludwig. 2009. Zazaki. In Gernot Windfuhr (ed.), The Iranian languages, 545–586.
London & New York: Routledge.
Ritter, Helmut. 1971. Kurmanci-Texte aus dem Tur’abdin. Teil I. Oriens 21–22. 1–135.
Ritter, Helmut. 1976. Kurmanci-Texte aus dem Tur’abdin Teil II. Oriens 25–26. 1–37.
Schroeder, Christoph. 1999. Attribution in Kurmancî (Nordkurdisch). In Karl Wagner &
Wolfgang Wildgen (eds.), Studien zur Phonologie, Grammatik, Sprachphilosophie und
Semiotik, 43–63. Bremen: Institut für Allgemeine und Angewandte Sprachwissenschaft.
Thackston, Wheeler M. 2006. Kurmanjî Kurdish: A reference grammar with selected readings.
Unpublished Manuscript, Harvard University. http://www.fas.harvard.edu/~iranian/
Kurmanjî/ (accessed 15 January 2015).
Trudgill, Peter. 2011. Sociolinguistic typology: Social determinants of linguistic complex-
ity. Oxford: Oxford University Press.
Turgut, Lokman. 2012. Mündliche Literatur der Kurden in den Regionen Botan und Hekarî.
Berlin: Logos.
2.4. The Arabic dialects of eastern Anatolia
Stephan Procházka
Vernacular Arabic is spoken in a number of non-connected parts of southeastern

Anatolia from the city of Urfa in the west to the city of Siirt in the east.1 The region
thus roughly covers the part of Mesopotamia that lies within the borders of mod-
ern-day Turkey.
1. General remarks
1.1. Genetic affiliation

Vernacular Arabic belongs to the Semitic branch of the Afro-Asiatic language
family. The varieties of Arabic spoken in eastern Anatolia can be divided into
two large groups. The eastern, sedentary-type dialects are spoken mainly in the
provinces of Mardin, Batman, Muş and Siirt, and belong to the Anatolian branch2
of the Mesopotamian dialect group. They are often called qǝltu dialects, from the
verb qǝltu ‘I spoke’. The western dialects, which are of the Bedouin type, form
the northernmost branch of the Arabian dialect group and constitute a sub-group
of the so-called Šāwī Arabic, i. e. the language of the sheep- and goat-breeding
semi-nomads of the Syrian Desert.
1.2. Brief history of the speech community

The history of settlement of eastern Anatolia has yet to be written. The Arab pres-
ence in the region goes back to the Parthian period when they organized border
states at Hatra and Urfa (Edessa). Under the Sasanians members of the Rabīʕa and
Taġlib tribes had settled around Nuṣaybīn (Nasibin) and Beth Arbaye (Morony
2005: 215, 221). Despite this strong presence of Arabs in pre-Islamic times, it can
be assumed that most ancestors of today’s Arabic speakers arrived in the aftermath
of the Muslim conquest of the region in the middle of the 7th century CE. Political
history indicates that the core of the Arabic-speaking sedentary population, i. e. the
speakers of the qǝltu dialects, may have settled in SE Anatolia during the 7th and
early 10th centuries, when the region was under the stable control of Arab dynasties
1
For details see the maps in Jastrow (1978), Procházka (2003: 88) and Procházka (2018)
as well as the lists of villages in Andrews (2002).
2
A detailed classification of Anatolian Arabic is given in Jastrow (1994: 121) and Jas-
trow (1978: 3–23).
160 Stephan Procházka
like the ʕAbbāsids and Ḥamdānids.3 This is corroborated by the close linguis-
tic relationship of the Eastern Anatolian qǝltu Arabic with the dialects spoken in
northern Iraq and, albeit less distinctive, in northern Syria.4 However, over the
centuries many changes due to emigration, displacement, and resettlement took
place. The qǝltu-Arabic-speaking communities, with the exception of Mardin and
its surroundings, have become isolated language islands amidst Kurdish- and —
restricted to the towns— Turkish-speaking majority populations.5 This picture is
clearly reflected in the “dialectal landscape”: the Mardin dialect group is much
more conservative, and thus closer to Northern Iraqi Arabic, than the dialects of
the “outer circle” of Arabic-speaking enclaves (Jastrow 2006–2007: 64). A good
example of how volatile ethnic and religious identities can be is the group called
the Mḥallamīye, who live between Mardin and Midyat.6 Their self-designation
most likely goes back to the name of a Kurdish tribe and they may have been
Christians until around 1600 (Benninghaus 2002: 179). Thus they originally spoke
Aramaic and were Arabicized only after they became Muslim (Jastrow 2004: 99).
Another example of shifting identities in the region is the saying of a man from
Mardin recorded in Grigore (2007: 34): ana tǝrki-ana, abūy kǝrdi-we w ǝṃṃi
ʕarabīye-ye ‘I am a Turk, my father is a Kurd, and my mother is an Arab.’
It has to be added that in all but the remote mountain regions regular, although
often hostile, contacts with the Arab nomads of the Syrian Steppe have always
existed.
In the Harran-Urfa area there are seven important Arabic-speaking tribes (with
numerous sub-tribes) whose presence in the region north of the Euphrates prob-
ably goes back to the very first centuries of Islam (Oppenheim 1939: 222). The
immigration of Arab tribes to the region reached a peak in the late 10th century
(Heidemann 2002: 32), but migration to and from the Jazeera happened up to the
First World War. Many of them were semi-nomads until the middle of the 20th
century and therefore had close contacts with their fellow tribes-people in the
3
Cf. Encyclopedia of Islam (EI), s. v. Anadolu, Ḥarrān, Ḥiṣn Kayfā, Mardin, Siʕird. It is
worth mentioning that the eastern branch of the line of fortifications against the Byzan-
tines, the ṯuġūr al-Ǧazīra, was “only so-called because the ribāṭs there are manned by
the men of Mesopotamia” (EI, s. v. al-T̲h̲ ug̲ h̲ ūr).
4
Local tradition, however, does not always coincide with these assumptions. For exam-
ple, the two Arab tribes in the village of Tillo trace themselves back to immigrants who
settled in the region only in the 14th and 17th centuries, respectively (Lahdo 2010: 26).
5
The more or less complete isolation of all Anatolian Arabs from the rest of the Ara-
bic-speaking world took place only after 1923, when it became almost impossible for
them to cross the borders into Syria and Iraq. This situation only changed in the late
1990s.
6
Tens of thousands of them emigrated to Lebanon after WW I, where they are now called
Mardilli and regarded as Kurds. Kern (2015) presents a very good overview on the dif-
ferent Mḥallami identities.

The Arabic dialects of eastern Anatolia
Figure 1: Location of Arabic dialects discussed in this chapter

161
Syrian Desert —which explains the relatively few linguistic differences between
these groups.7
1.3. Current size and status of the speech community: religious affiliation
The last official census of the Republic of Turkey which gathered data relevant
to languages dates back to 1965. Hence all numbers given in the following are
based on more or less rough estimates. The qǝltu-Arabic speakers of eastern Ana-
tolia number approximately 620,000 people8, and the speakers of Harran-Urfa
Arabic can be estimated to be around 150,0009. The latter are all Sunni Muslims.
Among the other group were significant Christian and Jewish minorities until a
few decades ago. Today, there are no Jews in the region, and many of the Chris-
tians have also left, particularly those who were living in villages or in the smaller
towns.10 Thus the overwhelming majority of the remaining Arabic-speaking pop-
ulation is Sunni Muslim.
1.4. Sociolinguistic situation

In most of the area covered by this article Kurdish is the dominant language of
daily life, and Turkish is the official language (Talay 2006–2007: 180). Today,
almost all of the Arabic speakers have a very high proficiency in Turkish, which
dominates education, business life, and the media. Turkish is also the language
exclusively used for written communication by Anatolian Arabs. Knowledge of
Kurdish has decreased among the younger generation in favor of Turkish, and
depends very much on the geographical setting. Many inhabitants of mixed Ara-
bic-Kurdish villages speak Kurdish and Turkish and thus are trilingual.11 In larger
towns, particularly in Urfa, few Arabic speakers rather than craftsmen or mer-
chants, are fluent in Kurdish. Contacts of Arabs with speakers of Aramaic (par-
ticularly Ṭūrōyō) may have been relatively intense, especially in the region west
of Tur Abdin. Due to the rapid decline of Aramaic in Turkey12, such contacts have
ceased to play any but a very marginal role in the region’s sociolinguistic setting.
7
One has to bear in mind that until the 1920s there were no political borders between
what is today Syria, Iraq, and Turkey.
8
This number is given on the “Ethnologue” website www.ethnologue.com/language/ayp
(accessed 8 July 2016).
9
Cf. Procházka (2003: 76).
10
Cf., e. g. Wittrich (2001: 4).
11
Cf. Benninghaus (2002: 179) for the Mḥallami; and Lahdo (2009), which is a good case
study on the situation in the village of Tillo in the Siirt Province.
12
Only 15,000 speakers of Aramaic were left in Turkey in 2014 (data according to
www.ethnologue.com/language/tru, accessed 14 July 2016).
The Arabic dialects of eastern Anatolia 163
On the whole, both branches of Eastern Anatolian Arabic are still vigorous,
non-endangered varieties of Arabic with the following caveat: dialects spoken in
small enclaves whose speakers have few contacts with other Arabic speakers must
be considered highly endangered. Many Arabic speakers in such places suffer from
both poor economic conditions and increasing Kurdish nationalism and therefore
leave their homes. This is particularly true of Christians but has been reported for
Muslim villages as well.13 But larger communities such as the Arabs of Mardin
may also have difficulties in resisting the social and economic pressures that are
resulting in a gradual shift to Turkish among the young people, many of whom
recent studies have shown lack full proficiency in Arabic (Grigore 2007: 34–36).
Such developments are also observed among the youngest generation of speakers
of Harran-Urfa Arabic.
In general, the prestige of local Arabic is rather low. For many decades minor-
ity languages were completely ignored if not actively suppressed by the Turkish
authorities. This attitude only changed around the year 2000 with an ever-increas-
ing public and academic interest in local spoken languages. These developments
may strengthen the self-confidence of the Arabic communities and foster efforts
towards language preservation among their younger generation. There are increas-
ing activities on internet sites as well as on local radio and TV stations to promote
local varieties of Arabic. Another event which has changed the sociolinguistic
setting is the massive immigration of Syrians who fled from the civil war that
began in 2011. Thus, speakers of Urfa Arabic in particular have begun to adapt
their speech to the dialect of the Syrian migrants and increasingly use typical
words and phrases of Syrian Arabic.
As most Arabic speakers of the region in question are illiterate in Arabic, the
local vernacular is usually not written. There are, however, numerous internet
sites where one can find single phrases, songs, and proverbs in vernacular Arabic
written in Latin script.14
2. Linguistic description
The huge differences between the two varieties of Arabic here under consider-
ation, which are mutually not understandable,15 make it necessary to split many
13
According to Lahdo (2009: 112) the number of Arabic speakers in Tillo decreased from
3,000 to less than 1,000 during the five years from 2000 to 2005.
14
A good example is http://www.mahalmi.com/default.asp?t=23 (accessed 28 Febru-
ary 2013) and the “Dictionary” of the dialect found on http://www.haldeh.com/index.
php?option=com_content&view=category&id=44&Itemid=119 (accessed 8 July 2016).
15
Communication between local Arabs and the Arabic-speaking farm workers from the
Mardin area who annually come to the Plain of Harran for harvest is usually in Turk-
chapters of the following linguistic sketch. For the qǝltu dialects, this article will
mainly focus on the dialect of the region of Mardin, where by far most of the
qǝltu-Arabic speakers live. However, outstanding linguistic features of other than
this region will be referred to.16 For the sake of convenience the western, Bedouin,
dialects will be called Harran-Urfa Arabic, the eastern, sedentary, dialects, qǝltu
Arabic when referring to the whole group, otherwise Mardin Arabic.
2.1. Phonology
2.1.1. Consonants
The following table includes the consonantal phonemes of Mardin and Harran-
Urfa Arabic. Sounds marked by * occur in Mardin Arabic only.
Table 1: Consonant inventory

Bilabial Labio- Inter- Dental- Palatal Velar Uvu- Pharyn- Glottal
dental dental alveolar lar geal
Plosive
voiceless p t ṭ [ᵵ] k q (ʔ)
voiced b d ɡ
Affricate
voiceless č [t͡ʃ]
voiced ǧ [d͡ʒ]
Fricative
voiceless f ṯ [Ɵ] s ṣ[ᵴ] š [ʃ] x ḥ [ħ] h
voiced *v ḏ[ð] ḏ[ð̴
̣ ] z *ž [ʒ] ġ [ɣ] ʕ
Nasal m n
Lateral l ḷ [ɫ]
Trill r ṛ [ᵲ]
Approximant w y
ish. According to informants the situation is different in the easternmost parts of the
province of Urfa. The Arabic vernacular of the regions of Viranşehir and Ceylanpınar
reputedly shows more influence from the neighboring qǝltu dialects.
16
Detailed descriptions of dialects from the whole region are found in Jastrow (1978),
which is the classic monograph on qǝltu Arabic. Other comprehensive studies are
Grigore (2007) on Mardin, Jastrow (1973) on Daragözü, Lahdo (2010) on Tillo, Sasse
(1971) on Mḥallamīye, Talay (2001) and (2002) on Hasköy, Wittrich (2001) on Āzǝx/
İdil, Akkuş (2016) on the Mutki-Sason region.
The sound /ġ/ is only marginal in Harran-Urfa Arabic because it is usually reflected
as /q/ (qanam ‘sheep’)17. There are several sub-groups of qǝltu Arabic where the
interdental fricatives (ḏ, ṯ, ḏ)̣ have shifted to dental stops (d, t, ḍ), sibilants (z, s, ẓ),
or labio-dental fricatives (v, f, ṿ), respectively (Jastrow 1978: 34–39).
A very common phenomenon of qǝltu Arabic is the devoicing of voiced con-
sonants in final position: e. g. ʔīd > ʔīt# ‘hand’, bīḏ ̣ > bīṯ#̣ ‘white (pl )’ (Jastrow
1978: 98); and (Siirt) ǧēmeʕ > ǧēmeḥ# ‘mosque’.18 The same phenomenon exists
in Urfa Arabic, e. g. aṭluʕ> aṭluḥ# ‘I ascend’.
2.1.2. Vowels
Mardin Arabic possesses two short (/ǝ/ and /a/) and five long (/ā/, /ē/, /ī/, /ō/, /ū/)
vowel phonemes as well as two diphthongs, /ay/ and /aw/ (Jastrow 1978: 81–84).
The vowels [ē] and [ō] are often products of the lowering of /ī/ and /ū/ when
preceding a velarized or back consonant: malēḥ < malīḥ ‘good’, ṭyōṛ < ṭyūr ‘birds’
(Grigore 2007: 77). Diachronically seen, the vowel /ē/ has also emerged from /ā/
by the shift of ā > ē conditioned by a short or long i in an adjacent syllable: e. g.
Mardin ǧmēl < ǧimāl ‘camels’.19
Harran-Urfa Arabic has the same five long vowels20 as Mardin Arabic but three
short vowels, /a/, /i/ and /u/. The phonemic status of /i/ and /u/ is weak and only
attested by a handful of minimal pairs.
2.1.3. Syllable structure21

In both groups the following types of syllables occur: Cv̆ , Cv̆ C, Cv̄ , Cv̄ C, CCv̆ ,
CCv̄ , CCv̆ C, and CCv̄ C. The syllables Cv̆ CC, Cv̄ CC,22 and CCv̆ CC are found
among the qǝltu dialects only, since in Harran-Urfa Arabic -CC# is normally
realized as -Cv̆ C#.23 In the qǝltu dialects there is a tendency to articulate initial
CC- with an epenthetic, non-phonemic /(ʔ)ǝ/: e. g. [ǝftáḥ ~ ʔǝftáḥ ~ ftaḥ] ‘open!’
17
Whereas OA /q/ is reflected as /g/: e. g. bugaṛ ‘cows’.
18
The devoicing of /ʕ/ does not occur in Mardin Arabic (Jastrow 1978: 43).
19
For a detailed description of this phenomenon called ʔimāla cf. Jastrow (1978: 65–69).
20
The two long vowels /ē/ and /ō/ are products of the monophthongization of OA /ay/ and
/aw/.
21
The syllable structure of the Anatolian qǝltu dialects is presented in detail by Jastrow
(1978: 85–95).
22
The syllable C1v̆ /v̄ C2C3 does not occur with C3 = m, n, l, r: e. g. *fa-taḥl-ki > fataḥǝlki
‘he opened (for) you’, but xalf-na ‘behind us’ (Jastrow 1978: 88).
23
In the qǝltu dialects only if the last C = m, n, l, r: e. g. šahr > šahǝr ‘month’ (see also
preceding footnote).
(Jastrow 1978: 87).24 This is also true of Harran-Urfa Arabic, but there forms like
trāb ‘soil’ and ghawa ‘coffee’25 can also be heard.
In all Anatolian qǝltu dialects /ǝ/ (< /i/, /u/) is elided in unstressed open sylla-
bles unless this elision causes a cluster of CCC: e. g. Mardin yqēʕǝd ‘he sits down’,
yqēʕdūn ‘they sit down’; but yǝktǝbūn ‘they write’.26 If, however, the vowel /ǝ/
goes etymologically back to /a/, it is not elided (Jastrow 1978: 62): e. g. šǝṛǝbna
‘we drank’ < Old Arabic (= OA) šaribnā).
In Harran-Urfa Arabic too, /i/ and /u/ are elided in unstressed open syllables:
e. g. yākul ‘he eats’, yāklūn ‘they eat’. However, both vowels are retained if they
go back to OA /a/: e. g. zimān < zamān ‘time’. Certain patterns are subject to sound
changes if suffixes are attached. This is especially true for words of the structure
CaCaC, which show regular re-syllabification regardless if the word in question is
a verb or a noun. For instance, ḏạ ṛab ‘he hit’ vs. ḏṛ̣ ibat ‘she hit’; ḥaǧal ‘partridges
(coll )’ vs. ḥǧile ‘partridge’, maras ‘rope’ vs. mrise ‘thick rope’.27
2.2.1. Inflection
As in all Arabic vernaculars there are no case markers in Eastern Anatolian Arabic.
Nouns are inflected according to the three numbers singular, dual, and plural.
The singular is not marked. The suffix of the dual is -ayn (Mardin) and -ēn (Har-
ran-Urfa). The plural is formed by suffixes (in particular -āt and -īn) or by modi-
fication of the word pattern.
Examples from Harran-Urfa Arabic:28

time, instance ram door
sg nōba čabiš bāb
dual nōbtēn čabšēn bābēn
pl nōbāt čbāš būb
24
Only in the Kozluk-Sason group is the epenthetic vowel inserted: e. g. *ftaḥ > fǝtáḥ
‘open!’ (Jastrow 1978: 87).
25
Unless otherwise indicated the examples from Harran-Urfa Arabic are from unpub-
lished own research.
26
Exceptions from that rule are found in the Kozluk-Sason group (Jastrow 1978: 88).
27
Cf. Procházka (2003: 77).
28
In Harran-Urfa Arabic, the plural suffix -īn is almost exclusively used with adjectives
and participles.
2.2.2. Pronouns
The following table shows the independent and bound personal pronouns of
Mardin and Harran-Urfa Arabic (from Procházka 2006–2007: 125 and Jastrow
1978: 127–131; Jastrow 2006: 90).
Table 2: Personal pronouns

independent forms bound forms
Mardin Harran-Urfa Mardin Harran-Urfa
after C—after V after C—after V
1 sg ʔana ʔāni -i/-ni — -yi/-ni -i/-ni — -ya/-ni
2 sg m ʔǝnta ʔinte -ǝk — -k -ak — -k
2 sg f ʔǝnti ʔinti -ki — -ki -ič — -č
3 sg m hūwe huwwa -u — -hu -u — v̄́ 29
3 sg f hīye hiyye -a — -ha -ha – -ha
1 pl nǝḥne ʔiḥna -na — -na -na — -na

2 pl m ʔǝntǝn ʔintu -kum — -kum
-kǝn — kǝn
2 pl f ʔintin -čin — -čin
3 pl m hǝnne huṃṃa -hum — -hum
-ǝn — -hǝn
3 pl f hinne -hin — -hin
A salient feature of the Anatolian qǝltu dialects (with some exceptions in the
Kozluk-Sason group) is the sound n in several plural pronouns as well as in inflec-
tion morphemes of the perfect. As can be seen from the corresponding forms in
Harran-Urfa Arabic, the qǝltu forms (as in hǝnne, bayt-ǝn, ʔǝntǝn, bayt-kǝn) coin-
cide with the respective feminine pronouns whereas Harran-Urfa masculine pro-
nouns have -m instead.30 It should be mentioned that Harran-Urfa Arabic exhibits
gender distinction in all plural forms except the first person in both pronouns and
verbs (see below).
29
The pronominal suffix is indicated by the lengthening and stressing of the final vowel.
30
For the pronouns that show an -n, Aramaic substratum influence has been suggested.
The pros and cons of such a development are discussed in detail in Arnold and Behn-
stedt (1993: 75–79), Diem (1979: 44), Owens (2006: 244–245), Weninger (2011: 749).
2.3.1. Inflection
Finite verbs are conjugated for perfect and imperfect. In qǝltu Arabic two dif-
ferent vowel patterns exist in the basic stem (Form I): CaCaC and CǝCǝC for the
perfect and -CCaC, and -CCǝC for the imperfect. Harran-Urfa Arabic possesses
two stems for the perfect, CCiC and CaCaC (the latter with the phonological vari-
ants CiCaC and CuCaC), and three for the imperfect, -CCaC, -CCiC, and -CCuC.
Table 3: Paradigms for the perfect and imperfect in Mardin and Harran-Urfa
Perfect Imperfect
Mardin Harran-Urfa Mardin Harran-Urfa
1 sg šǝṛǝb-tu šrib-it ʔa-šṛab ʔa-šṛab
2 sg m šǝṛǝb-t šrib-it tə-šṛab ti-šṛab
2 sg f šǝṛǝb-ti šrib-ti tə-šṛab-īn ti-šṛab-īn
3 sg m šəṛəb šrib yə-šṛab yi-šṛab
3 sg f šəṛb-ət širbat tə-šṛab ti-šṛab
1 pl šǝṛǝb-na šrib-na nə-šṛab ni-šṛab

2 pl m šrib-taw ti-šṛab-ūn
šǝṛǝb-tən tə-šṛab-ūn
2 pl f šrib-tan ti-šṛab-in
3 pl m širb-aw yi-šṛab-ūn
šəṛb-u yə-šṛab-ūn
3 pl f širb-an yi-šṛab-in
In the qǝltu dialects, the final -n of the inflection morphemes -īn and -ūn is dropped
when a pronoun suffix is attached: e. g. yišṛabū-hu ‘they drink it’ (Jastrow 2006:
93). In Harran-Urfa Arabic the -n is retained: e. g. yšūfūn-ha ‘they see her’.31
31
In Harran-Urfa Arabic, the final -n of the plural morpheme is elided when an indirect
object pronoun is attached to a participle: e. g. gāylīn ‘they have told’, but gāylī-li ‘they
have told me’.
Imperative
Table 4: Imperatives in Mardin and Harran-Urfa

Mardin Harran-Urfa
2 sg m šṛab ʔišrab
2 sg f šṛab-i ʔišrab-i
2 pl m šṛab-u ʔišrab-aw
2 pl f ʔišrab-in
Mardin and Harran-Urfa Arabic possess a special particle which serves to empha-
size the imperative, dē ~ dǝ- in Mardin and di- ~ du- in Harran-Urfa: e. g. Mardin:
dǝ-qūmu ‘stand up (pl )!’ (Jastrow 1978: 310); and Harran-Urfa di-niṭni ‘give
(m . sg ) me!’, du-gul-li ‘tell (m . sg ) me!’ (see below in §3).
2.3.2. Derivation
In both groups there are nine forms for tri-consonantal roots and two forms for
quadri-consonantal roots that are used for the derivation of verbs. A tenth tri-con-
sonantal form is limited to a very few verbs. Each form has a set of patterns for the
perfect and imperfect base as well as for the active and passive participle.32
A characteristic feature of the dialect of Mardin and its surroundings is that
Form II verbs, which usually have causative and intensifying functions, are fre-
quently used instead of the corresponding Form I verbs if they govern a direct
object in plural:
(1) a. Mardin (Jastrow 1978: 164)
yǝ-qṭaʕ-lu ʕūd mǝ-d-dawm
ipf .3. m . sg -cut.form 1-for.3. m . sg stick.sg from-def -tree
‘He cuts himself a stick from the tree.’
b. yǝ-qaṭṭǝʕ-lu ʕūd-ayn mǝ-d-dawm
ipf .3. m . sg -cut.form 2-for.3. m . sg stick.pl from-def -tree
‘He cuts himself sticks from the tree.’
32
Cf. Jastrow (1978: 146–195); Lahdo (2010: 124–147); Procházka (2003: 81–82).
2.1.3. Verbal modifiers33

The general present tense is not marked in either group, with the exception of some
varieties of the Kōsa subgroup of qǝltu Arabic.34 Though Mardin Arabic does not
use a modifier for the progressive present, most other qǝltu dialects of the region
do.
In the qǝltu dialects the verbal modifier of the progressive present is kū-, as in
kū-yǝšṛab ‘he is drinking’.
(2) Kǝndērīb (Jastrow 1999: 46)
kū-tǝ-qšaʕ-ūn ṣīyād-ana kwā ṭ-ṭāžīye maʕ-i
mod - ipf .2-see- pl hunter-cop .1. sg foc def -greyhound with-1.sg
‘You see that I am a hunter having the greyhound with me.’
In Harran-Urfa Arabic the invariable participle ǧaʕad, sometimes shortened to ǧaʕ
(< OA qāʕid ‘sitting’), is used.
(3) Harran-Urfa (own data)
ǧaʕad a-šṛab ᵊghawa
mod ipf .1. sg -drink coffee
‘I am drinking coffee.’
In the qǝltu dialects the future tense is indicated by particles and modifiers which
go back to the OA conjunction ḥattā ‘so that, in order that’ (Jastrow 1978: 302).
Thus in Mardin one hears ta-nkayyǝf ‘we shall have fun’ and ta-yǝǧi ‘he will come’,
in the dialects around Mardin tǝ-nǝšṛab ‘we shall drink’, and in Siirt/Daragözü
dǝ-yāxev ‘he will take’.35 Imminent future tense can be expressed by a combination
of kū- and tǝ-.
ǝn-nawb īḏa hạwnak-we kū-t-ī-ṛōḥ
then if there-cop .3. m . sg mod - mod - ipf .3. m . sg -go
y-fǝkk-a
ipf .3. m . sg -untie-3.f . sg
‘If he then comes there he will immediately untie it.’
In Harran-Urfa Arabic the future tense is expressed by the auxiliary verb yrīd ‘to
want’: e. g. ʔarīd ašṛab ‘I will drink’ (and ‘I want to drink.’).
33
A very good overview of the complex verbal system is Jastrow (1999), which deals with
the dialect of the village of Kǝndērīb. But most of Jastrow’s findings are true for the
majority of the qǝltu dialects of Anatolia.
34
Jastrow (1978: 300), bǝ-nqūl ‘we say’. Jastrow denies any influence from the analogous
modifier that is found in Syrian Arabic.
35
In the village of Tuzlagözü lē- (< li-/la- ‘in order to’) is used: e. g. lēbqa ‘he will stay’
(Jastrow 1978: 303).
Continuous and habitual past tense is expressed in Mardin town by uninflected

kān (literally ‘he was’) followed by an imperfect: e. g. kān yǝšṛabūn ‘they used to
drink’. In other qǝltu dialects (rural Mardin, Siirt) the modifier kā- ~ ka- is used
together with a finite verb in the imperfect form: e. g. kā-yǝbki ‘he was crying’.
kān fī wēḥǝd m-Mǝḏyāḏ, ka-y-say
be.pf .3. m . sg there.is one from-Midyat mod -ipf .3. m . sg .make
sǝḥǝr
magic
‘There was one from Midyat who used to practice magic.’
In Harran-Urfa Arabic the continuous and habitual past tense is expressed by the
complex predicate čān plus the verb in the imperfect.
čin-na n-ākul čime
be-pf .1. pl ipf .1. pl -eat truffle.coll
‘We used to eat truffles.’
The qǝltu dialects usually express the present perfect by invariable modifiers such
as kǝl-/kǝ- followed by a verb in the perfect. Thus in Mardin Arabic (and other dia-
lects) one finds kǝl-qǝʕdǝt ‘she has sat down’ (Jastrow 1978: 307). Some dialects
(Āzǝx, rural Siirt) use kū- with the perfect, others (Mḥallami/Kōsa) use kūt-.
waḷḷa ǝl-bǝnt mō-d-bayyǝn kūt-haṛab-ǝt
truly def -girl neg - ipf .3. f . sg -be.seen mod -run.off-pf .3. f . sg
‘The girl is not to be seen, she has run off.’

In Harran-Urfa Arabic the present perfect is usually expressed by an active parti-
ciple (cf. Procházka and Batan 2016).
(8) Harran-Urfa (Procházka and Batan 2016: 458)
ʕAli mitʕallim ʕala sōg aṭ-ṭaqṣi.
Ali learn.ap . m . sg on driving def -car
‘Ali has learned to drive a car (and now knows how).’
2.4. Syntax of basic clauses
2.4.1 Copula
A conspicuous feature of all Anatolian qǝltu dialects is the consistent use of a
copula that links the subject and predicate of a sentence without a verb. The “Ana-
tolian copula” consists of enclitic variants of the personal pronoun and is attached
to the predicate.36
(9) Mardin (Jastrow 1978: 27)
bayt-i gbīr-we
house-1. sg big-cop .3. m . sg
‘My house is big.’
(10) Mardin (Grigore 2007: 289)
mērdīn fōq ǝǧ-ǧabal-ye
proper . name on def -mountain-cop .3. f . sg
‘Mardin is on the mountain (top).’
ana Mērdīni-ana
1.sg inhabitant.of.Mardin-cop .1. sg
‘I am an inhabitant of Mardin.’
The dialect of Siirt is an exception in that in propositions the copula is placed
between subject and predicate (ex. 12). In questions, however, Siirt Arabic con-
forms to the other varieties (ex. 13).
(12) Siirt (Jastrow 1978: 132)
āvi l-bǝnt īye malīḥ-a
dem . f . sg def -gril cop .3. f . sg good-f
‘This girl is good.’
(13) Siirt (Jastrow 1978: 132)
ʔǝxt-ok malīḥ-a-ye?
sister-2. m . sg good-f - cop .3. f . sg
‘Is your sister good?’
2.4.2 Nominal attribution

In both groups, the default case of nominal attribution is by asyndetic linkage
of the two (or more) elements. In this case the head noun never gets the defi-
nite article, and feminine heads appear in a special construct state marked by -t.
36
The complete paradigms of three dialects are given in Jastrow (2006: 91).
The grade of determination depends on whether the last element of the phrase is
determined or not (ex. 14). All qǝltu dialects possess a prepositional marker for
syndetic attribution.37 It is, however, rare, being used mainly to avoid the repetition
of the noun (ex. 15).
(14) Harran Urfa (Procházka 2013: 209/3)
miǧann-t [< miǧann-e] aẓ-ẓġāṛ
cemetery-f . constr . st def -small.pl
‘the cemetery of the small (i. e. children)’
(15) Qarṭmīn (Jastrow 1978: 125)
ǝl-faṛas ḏīl-i-ye
def -horse gen . marker -1. sg - cop .3. f . sg
‘This horse is mine.’38
Only in the remote dialects of the Kozluk-Sason region and in Hasköy is analytical
nominal attribution frequent: it may even be regarded as the default case (Jastrow
1978: 126; Eksell 2006: 84).
(16) Hasköy (Talay 2001: 79)
sǝʕēb lē baqqāl
owner gen . marker shop
‘the owner of the shop’
(17) Hasköy (Talay 2001: 79)
ṛēḥān lē mōy
mill.pl gen . marker water
‘water-mills’
In Harran-Urfa Arabic the prepositional marker for substantive or pronominal
attributes is very infrequent. Even for foreign words attribution is asyndetic: e. g.
pīkab-i ‘my pick-up truck’.39
As for adjectival attributes, in the qǝltu dialects definite noun-adjective phrases
do not differ from definite noun-substantive phrases (Grigore and Biţună 2012:
553). This implies that the head noun of the phrase appears in the construct state,
and that the definite article is attached to the following adjective only (whereas in
37
Most forms can be traced back to *ḏī-la, the etymology of which is, however, not undis-
puted. Jastrow (1978: 125) points out that it resembles the OA relative pronoun allaḏī.
But Talay (2001: 78), fn. 23 suggests an Aramaic origin. In the former Jewish dialect
of Siverek the typically Syrian form tabaʕ was used: e. g. mǝ-nǝstaḥi mǝn dīn tabaʕna
‘We are not afraid of our religion.’ (Nevo 1999: 71).
38
Instead of *ǝl-faṛas faṛasi-ye.
39
Rarely hnīt ‘thing’ is attested to express a kind of belonging, e. g., ʔiḥna, ᵊhnīt as-sūg
mā nāxuḏhe ‘We do not buy the one from the market’.
Arabic usually both components take the definite article). In Harran-Urfa Arabic,
the article is usually attached to both components, though with the peculiarity that
feminine nouns appear in the construct state.
(18) Mardin (Jastrow 1978: 124)
bǝnt ǝl-ǝkwayys-e
girl def -beautiful-f
‘the beautiful girl’
bǝqč-ǝt ǝl-wǝsx-a
bale-f . constr . st def -dirty-f
‘the dirty bale’
al-ʕarab awwali balāy al-ghaw-t
def -Arab.coll formerly without def -coffee-f . constr . st
al-murr-a mā-y-ṣīr
def -bitter-f neg - ipf .3. m . sg -become
‘In former times, among the Arabs, nothing happened without bitter
coffee.’
In Harran-Urfa Arabic an adjectival attribute is nearly always linked to its head
noun by the enclitic suffix -in if the head noun is indefinite40 (exs. 21–22). The
same morpheme is also used to link qualifying components other than adjectives
to indefinite nouns (ex. 23). (For relative clauses see below, section 2.5.1).
ʕaǧīy-in ẓaġīr ʕǝmr-u šahaṛ šahr-ēn
child-linker small age-3. m . sg month.sg month.dual
‘a small child whose age is one or two months’
ḥōš-in čibīr
house-linker big
‘a big house’
ṛabīʕ-in miṯl-ak
friend-linker like-2.m . sg
‘a friend like you’
40
This suffix is also called a tanwīn after the term for the indefinite marker in OA.
2.4.3 Negation
Anatolian qǝltu dialects use two different particles to negate perfect and imperfect,
mā and mō. The latter also negates sentences which do not contain a verb.
mō tǝ-ǧi
neg ipf .2. m . sg -come
‘You don’t come.’
mā daxal
neg enter.pf .3. m . sg
‘He did not enter.’
(26) Mardin (Jastrow 1978: 30)41
mō hawn-ana
neg here-cop .1. sg
‘I am not here.’
In Harran-Urfa Arabic, the particle mā is used to negate both perfect and imperfect
verbs. Non-verbal predicates are negated by a combination of m(v̆)- and the per-
sonal pronouns (exs. 29–30).
ʔawwal marra mā ligā-ha
first time neg find.pf .3. m . sg -3. f . sg
‘At the first time he did not find her.’
mā y-āxḏ-ūn-hum ʕa-l-garāye
neg ipf .3. m -take-pl -3. m . pl to-def -village.pl
‘They don’t take them to the villages.’
m-āni čaḏḏāb
neg -1. sg liar
‘I am not a liar.’
mi-hin šēn-āt
neg -3. f . pl bad-pl . f
‘They (f . pl ) are not bad.’
41
In some Anatolian dialects the enclitic pronoun is attached to the negator: e. g. Mḥal-
lami mana hawn ‘I am not here’ (Jastrow 1978).
In both groups the particle lā negates the imperative and the optative: e. g. Mardin:
lā yǝǧi ‘He may not come!’ (Jastrow 2006: 92); Harran-Urfa: lā tibčīn! ‘Don’t
(f . sg ) cry!’
2.4.4 Word order

According to the in-depth study by Dahlgren (1998), in Anatolian qǝltu Arabic
there is a strong tendency towards SV in both dialogues and description (Dahlgren
1998: 194, 205). Unless topicalized, nominal direct objects come after the verb, as
in examples (46) or (71) below.
2.5. Syntax of clause linkage

In both groups, coordinative clauses are often linked by conjunctions such as
(Mardin) w ‘and’, ʔaṃṃa/lākǝn ‘but’, vēya ‘or’. However, unlinked sequences of
clauses are equally frequent.
Subordinate clauses which are dependent on verbs of speaking, belief, and the
like are very often not introduced by a conjunction (cf. also Lahdo 2010: 173). An
example of syndetic linkage is (31).
y-qūl ki42 nǝḥne ṣǝdqān
ipf .3. m . sg -say conj .that 1.pl friend.pl
‘He said that we are friends.’
Both syndetic (exs. 32–33) and asyndetic (ex. 34) linkage is frequently used for
final and causal clauses. Final clauses are often introduced by the proclitic con-
junction ta-.
Maryam ṛāḥ-ǝt ta-tǝ-ṛa bǝnt-a
proper . name go-pf .3. f . sg conj -ipf .3. f . sg -see daughter-3. f . sg
‘Maryam went to see her daughter.’
(33) Harran-Urfa (Procházka 2010: 132/19)
i-taqassal ta-mā y-ḏạ ll bī waḥl-u
ipf .3. m . sg -be.washed conj - neg ipf .3. m . sg -stay in.3. m . sg mud-3. m . sg
‘It is washed in order that no mud remains on it.’
42
This conjunction is a loan from Turkish, but ultimately of Persian origin.
(34) Tillo (Lahdo 2010: 175)

kǝǧ-ǧī-na nǝ-ṭlab bǝnt-ak
mod -come-pf .1. pl ipf .1. pl -ask daughter-2. m . sg
‘We came to ask for your daughter’s hand.’
In Mardin Arabic, causal clauses are often introduced by the conjunction čǝnki
(< Kurdish çunki/çimki or Turkish çünkü), in Harran-Urfa Arabic by ʕaman.
yǝlzǝm a-ṛōḥ a-štari akǝl čǝnki
necessary ipf .1. sg -go ipf .1. sg -buy food conj .because
ǧūʕān-ana
hungry-cop .1. sg
‘I have to buy food because I’m hungry.’
al-qanam ṣār-at zihīd-āt hēne
def -sheep.coll become-pf .3. f . sg few-pl . f here
ʕaman mā ḏạ ll muṭraḥ yi-sraḥ-ūn
conj .because neg stay.pf .3. m . sg place ipf .3-graze-m . pl
‘The (number of) sheep has decreased here, because there is no space left
to graze (them).’
Conjunctions are more or less constantly used in temporal clauses as well as in
conditionals.43
laman nǝzǝl-na mǝ-l-bayt kān
conj .when go.out-pf .1. pl from-def -house be.pf .3. m . sg
yǝ-ǧi salǧ
ipf .3. m . sg .come snow
‘When we left the house it was snowing.’
(38) Mardin (Jastrow 1981: 4/3)
ʕaǧab laman a-mūt w yǝ-flǝs
particle conj .when ipf .1. sg -die and ipf .3. m . sg -go.bankrupt
ʔaš ta-yṣēr duṛum-u?
q .what mod - ipf .3. m . sg -become situation-3. m . sg
‘When I die and he goes bankrupt, what will become of him?’
43
The conditional sentences of Mardin Arabic are described in detail by Grigore (2007:
293–308) and Grigore (2008); for the dialect of Kǝndērīb see Jastrow (1999: 51–53).
(39) Tillo (Lahdo 2010: 173)

waxt lay ṭalaḥ-t mǝ-l-karm aḷḷa kān
conj .when leave-pf .2. m . sg from-def -vineyard God be.pf .3. m . sg
ʕala ṛās-ak
on head-2. m . sg
‘When you left the vineyard God was watching you.’
(40) Mardin (Jastrow 1981: 8/18)
ʔawwal mā ṛa-w-hu mǝn bʕīd yāho kalb ǝbǝn
conj .as_soon_as see-pf .3. pl -3. m . sg from far hey dog son
kalb
dog
‘As soon as they saw him (they said to him), “Hey dog, son of a dog!”’
(41) Mardin (Jastrow 1981: 10/28)
abū-y kǝl-ḥallaf-ni qabǝl la y-mūt
father-1.sg mod -let.swear.pf .3. m . sg -1. sg conj .before ipf .3. m . sg -die
‘My father let me swear before he died.’
(42) Harran-Urfa (Procházka 2010: 129/5)
yōm-in y-ṣīr muṭar y-gūl-ūn
conj .when ipf .3. m . sg -become rain ipf .3-say-m . pl
‘When it rains they say …’
ʕugub-ma mayyit abū-ha ḏạ̄ ll-it-ill-u
conj .after die.ap . m . sg father-3. f . sg remain.ap - f . sg -for-3. m . sg
ibnayye
daughter
‘After her father had passed away, he left behind a daughter.’
ʕugub-ma y-ṣīr ǧubun ʕādēn ši-t-sawwi?
conj .after ipf .3. m . sg -become cheese particle q .what-ipf .3. f . sg -do
‘After it has become cheese, what is she doing?’
gabǝl-ma ti-ǧi l-mayye čin-na ni-zraʕ
conj .before ipf .3. f . sg -come def -water be-pf .1. pl ipf .1. pl -plough
ʕala l-fiddān
on def -ox
‘Before the water (viz. irrigation) came, we used to plough with oxen.’

šnōn-ma t-gūm ᵊt-qassil bēt-he
conj .as.soon.as ipf .3. f . sg -get.up ipf .3. f . sg -wash house-3. f . sg
‘As soon as she gets up, she cleans her house.’
2.5.1 Relative clauses

There are only slight differences between qǝltu and Harran-Urfa Arabic in the
general rules regarding relative clauses. For the most parts, such clauses follow
their head. In the Anatolian qǝltu dialects these clauses are asyndetically linked
to the head noun if this is indefinite. In Harran-Urfa Arabic the relative clause as
a qualifying element is connected to the head noun by the linking suffix -in (see
above, section 2.4.2).
(47) Kənderib (Jastrow 1978: 124)
fī ʕalay-u kǝtābe mō tǝ-nqǝṛi
there.is on-3. m . sg inscription neg ipf .3. f . sg -be.read
‘On it is an inscription that is not readable.’
ʔana ʔa-dawwǝr ʕala kamyōn
1. sg ipf .1. sg -seek on lorry
y-ṭēq yǝ-ḥmǝl ṯāṯ ṭōn-āt qǝrēmīḏ
ipf .3. m . sg -can ipf .3. m . sg -carry three ton-pl tile.pl
‘I am looking for a lorry that can carry three tons of roof tiles.’
hināk šī muṭraḥ-in yi-ṭlaʕ yi-sraḥ
there there.is place-linker ipf .3. m . sg -go.out ipf .3. m . sg -graze
bī-hin
in-3.f . pl
‘There is space where he (can) go and graze them.’
If the head of the relative clause is definite, it is linked to it by the invariable
relative pronoun la/lǝ/lē/lay in the qǝltu dialects and al in Harran-Urfa Arabic.44
Both dialect groups exhibit two salient features which distinguish them from vir-
tually all other Arabic dialects (Retsö 2004: 266 and Retsö 2009: 21): the use of
the construct state with feminine nouns and the non-occurrence of the definite
article on the semantically definite head noun. The latter feature is very common
44
In Harran-Urfa Arabic the relative is identical with the definite article and thus subject
to assimilation to following dentals and sibilants (see ex. 53).
in the qəltu dialects,45 but only occasionally found in Harran-Urfa Arabic. Thus for
the Anatolian qǝltu dialects there are completely identical structures for (definite)
noun-substantive, noun-adjective, and noun-clause attributions.
ʕarab-at la xalf-na t-rīd
car-f . constr . st rel behind-1. pl ipf .3. f . sg -want
ta-t-fūt-na
conj - ipf .3. f . sg -pass-1. pl
‘The car behind us wants to pass us.’
kalb la gabbaṛ-tu fǝ-ḥawš-na
dog rel raise-pf .1. sg in-court-1. pl
‘the dog which I have raised in our court’
ayn-i ḥǝnṭ-ǝt lǝ-tǝ-yǝ-ḥṭī-k
q .where-cop .3. f . sg wheat-f . constr . st rel - fut - ipf .3. m . sg -give-2. m . sg
‘Where is the wheat which he will give to you?’
(53) Harran-Urfa (Procházka 2003: 83)
zlim-t as sōlaf-t-u minu?
man-constr . st 46 rel speak-pf .2. m . sg -3. m . sg q .who
‘Who is the man you spoke with?’
al-ḥunṭ-it al mā t-tiṣawwal
def -wheat-f . constr . st rel neg ipf .3. f . sg -be.soaked
y-ṣīr xubuz-he mitīn
ipf .3. m . sg -become bread-3. f . sg hard
‘Bread from wheat which has not been soaked becomes hard.’
al-ǧild al y-ḥuṭṭ-ūn bī as-samin
def -skin rel ipf .3-put-m . pl in.3. m . sg def -ghee
‘the skin in which they put the ghee’
There is evidence of relative clauses in Harran-Urfa Arabic which contain a head
noun carrying the suffix -in although it is semantically clearly definite.
45
Examples showing the definite article on the head noun are also attested, e. g. from Tillo
ạw ǝl-kurmanč lay kǝǧ-ǧaw mǝn qabǝl ‘these Kurds that came earlier’ (Lahdo 2010:
178).
46
The word zlime ‘man’ has the morphological feminine ending but is, of course, mascu-
line by natural gender.

ghawt-in ištarē-nā-ha
coffee.idf - linker buy-pf .1. pl -3. f . sg
‘the coffee which we have bought’
ʔawwal yōm-in ǧī-na
first day.idf - linker come-pf .1. pl
‘on the first day we were here’
In clauses without a verb, the relative pronoun merges with the copula in qǝltu
Arabic. In Harran-Urfa Arabic again the suffix -in is used to link the two elements:
tǝ-yǝ-ṛa la-wwe ʕa-l-arḏ ̣
fut - ipf .3. m . sg -see rel - cop .3. m . sg on-def -ground
‘He will see the one who is sitting on the ground.’
sint-in inte ṣāyir
year-linker 2.m . sg become.ap . m . sg
‘in the year you were born’
3. Additional notes on contact-phenomena
All Arabic dialects spoken in eastern Anatolia exhibit features that indicate the
influence of the two major languages of the region, Turkish and Kurdish. Neo-Ara-
maic may have played a more significant role in the past, particularly in the region
adjacent to Tur Abdin. But it must be emphasized that the local distribution of con-
tact-phenomena is highly variable. As was pointed out in Procházka (2006–2007:
116), the bulk of idiosyncratic dialectal features is found in “language islands”
(especially in the Kozluk-Sason group). On the whole, the Bedouin-type dialects
of the Harran-Urfa region show much less Turkish (and Kurdish) influence than
the qǝltu dialects.47
Most striking is the Turkish and Kurdish impact on the lexicon of the region’s
Arabic, resulting in a large number of loanwords that constitute up to one-third of
the vocabulary.48 A clear indicator of intensive contact or even bilingualism are the
47
For possible reasons cf. Procházka (2006–2007: 115–117).
48
For the village of Daragözü, see Vocke and Waldner (1982: xliii). Exhaustive lists of the
Turkish and Kurdish loanwords in the dialect of the village of Tillo are found in Lahdo
(2010: 203–214, 217–219).
numerous calques found in phraseology. Frequently found49 is the copying of the

so-called phrasal verbs of Turkish and Kurdish, which consist of the verb ‘to do, to
make’ (Turk. etmek, Kurd. kirin) with a following noun. Examples are sawa yārdǝm
(cf. Turkish yardım etmek) ‘to help (lit. to make help)’, and sawa ǧāmērtīye (cf.
Kurdish camêrtî kirin) ‘to act generously (lit. to make generosity)’ (Talay 2006–
2007: 184). Examples of idiomatic calques: from Harran-Urfa ʔimil ʕēnak! ‘Delight
your eyes!’ which is a calque of the Turkish gözü doldur, literally ‘Fill the eye!’;
and from Tillo bǝqīna fǝ-n-nǝṣṣ ‘we remained helpless’, which is a calque of the
Turkish ortasında kaldık, literally ‘we stayed in the middle’ (Lahdo 2010: 174).
A consequence of the extensive adoption of Turkish and Kurdish vocabulary
is that all dialects spoken in eastern Anatolia have phonemes alien to Old Arabic.
Particularly conspicuous are /p/, /ž/ and /v/ (the latter two are not found in Har-
ran-Urfa), which overwhelmingly occur in loanwords. The situation is different for
/č/ and /g/: in the qǝltu dialects these two phonemes are also more or less restricted
to loanwords,50 whereas in Harran-Urfa Arabic they are the regular reflexes of OA
/k/ and /q/ when in certain vowel settings.
In the Kozluk-Sason group the velarized variants of post-dentals and sibilants
have, under the influence of Kurdish, lost their phonemic status (Talay 2006–2007:
181).51
Morphological changes due to the influence of Turkish and Kurdish are not
numerous. In the isolated Kozluk-Sason group the system of determination and
indetermination has been remodeled according to Kurdish (and less so Turkish)
patterns. This explains the frequent omission of the definite article, especially in
subject position.52 The absence of determination marking can be explained by the
fact that both adstratum languages do not have a definite article. But on the other
hand these dialects exhibit the enclitic indefinite marker -ma which is used exactly
like the Kurdish indefinite marker -ek that is likewise enclitic.
(60) Mutki (Akkuş 2016: 39)
ifi atsūra-ma fo fəstox
there.is bird-idf on roof
‘There is a bird on the roof.’
In the same dialect group, inversion of the unit and decimal positions in the numer-
als from 11 to 19 has taken place. Thus numerals such as ʕašṛa w sǝlāse ‘13’ and
49
This is particularly true of the qǝltu dialects, but less common in Urfa Arabic.
50
In a few roots g is also found in Arabic words as a consequence of the generalization
of assimilated consonants: gǝbǝṛ – yǝgbaṛ ‘to grow’. Cf. Jastrow (1978: 47) for further
examples.
51
For further phonological contact phenomena see Talay (2006–2007: 181–183), and
Lahdo (2010: 191–192).
52
Cf. Jastrow (1973: 91), Talay (2006–2007: 185).
ʕašṛa sǝtte ‘16’ follow the Turkish model (on üç, on altı). Talay (2001: 78, fn. 26)
reports that in the village of Hasköy nominal attribution is sometimes done by
using the Kurdish linking suffix -ē: e. g. šǝllī-yē ṛabīh ‘spring rain’.
Probably the most striking contact phenomenon on the syntactical level is the
consistent use of a copula in all the qǝltu dialects (see above, section 2.4.1). As this
feature is found in Kurdish, Turkish, and Aramaic, but not in other Arabic dialects,
it can certainly be traced back to the influence of these adstratum languages.
As is pointed out by Talay (2006–2007: 183) and Jastrow (2011: 91), there
are many similarities between the tense and aspect categories expressed by verbal
modifiers in the qǝltu dialects (see above, section 2.3.3) and the Turkish verbal
system. The existence of similar categories in many other Arabic dialects clearly
indicates that they are not a direct product of Turkish influence, but the degree of
coincidence is so striking that one can assume that the Arabic verbal system has
developed at least some of these categories by analogy with the Turkish model.
This may be especially true for constructions like ka-tǝ-tǝftaḥ ‘you would have
opened’, which is rather unusual elsewhere53 and morphologically a copy of the
Turkish suffix -ecek-ti, a combination of future and perfect tense markers like the
Arabic ka-tǝ-.
Equally influenced by Turkish is the use in Harran-Urfa Arabic of the partici-
ple to express evidentiality (i. e. like the Turkish miş-perfect). This is very often
found in narrative speech when the speaker refers to an event which was not per-
sonally observed or experienced.54 For example, in a story about a girl who went
mad, the narrator frequently used participles instead of perfect forms:
(61) Harran-Urfa (Procházka and Batan 2016: 464)
gāyil Šēx Mǝṭar ʔǝnṭ-ū-ni hāt-ū-li
say.ap . m . sg proper . name give-imp . m . pl -1. sg give-imp . m . pl -to.1. sg
hdūm-ha! ᵊmnawwš-īn ᵊhdūm-ha …
clothes-3. f . sg hand.ap - m . pl clothes-3. f . sg
minṭī-he gāym-e ti-lbas gāyl-e
give.ap . m . sg -3. f . sg stand.up.ap - f . sg ipf .3. f . sg -put.on say.ap - f . sg
‘Sheikh Mǝṭar said, “Give me her clothes!” They handed him her clothes;
and when he gave them to her, she started to put them on and said …’
Another possible influence of Turkish in Tillo Arabic is that temporal clauses
usually precede the main clause (Lahdo 2010: 174). Grigore (2007: 168) claims
that the particle dǝ-, which emphasizes an imperative (see above, §2.3.1), goes
back to the Turkish interjection haydi ‘let’s go!’ But this is doubtful as Turkish pos-
sesses a distinct suffix to intensify imperatives (-sana/-sene) and the use of haydi
53
In the dialect of the Jews of Kurdistan the same combination of prefixes is found in a
very similar function: e. g. kān t-īmūtūn ‘they had died’ (Jastrow 1990: 66).
54
For a more detailed analysis cf. Procházka and Batan (2016: 464–465).
together with such forms is only optional.55 Somewhat more likely is to assume
influence of the particle de which is widely used in Kurmanjî Kurdish in the same
function (Thackston 2006: 206), e. g. de rûne ‘Do sit down!’56 However, this par-
ticle may well be of Arabic origin (probably a reflex of the OA demonstrative ḏā/
ḏī) as it is also found in various bedouin-type dialects such as Harran-Urfa Arabic
and even Baghdadi Arabic (Erwin 1963: 140). Similar forms are also attested for
Aleppo, e. g. dē ʔǝl-la ‘Come on, tell it to her!’ (Sabuni 1980: 76), and Bahraini
Arabic, e. g. d-rūḥi yumma, trayyagay ‘Go (f ), my dear, and have your breakfast!’
(Holes 2016: 304) for both of which Kurdish influence is less likely.
In summary: With the exception of the Kozluk-Sason group the impact of
Turkish and Kurdish on the region’s dialects is surprisingly low outside of the
lexicon. Kurdish has clearly had less impact on the region’s Arabic than Turkish
despite its longer duration of contact and the fact that Kurdish is still the area’s
dominant everyday language. A possible explanation for this could be that Kurdish
has never had such a dominant role as Turkish in the media and in school educa-
tion, both of which strongly influence all layers of the population.
Urfa Arabic (from the village of Yalnızca)

Speaker: male, born ca. 1965, farmer
Recorded by Stephan Procházka in May 2011
(62) ʔawwali al-mille kull ǝ.byūt-hum ǝ.hnīt-hum ǝ.b=gaḷb
formerly def -people all house.pl -3. m . pl stuff-3. m . pl in=heart
aǧ-ǧarye
def -village
‘In former times the houses and the stuff of the people were all inside of
the village.
(63) ʔalḥaz al ʕind-u gāʕ gām
now rel at-3. m . sg land stand.up.pf .3. m . sg
y-sawwi b=gaḷǝb gāʕ-t-u bēt
3. m . sg -make.ipf in=heart land-f .constr -3. m . sg house
Now, the one who owns land has begun to make a house on his estate.
55
Furthermore, the Turkish haydi is widely attested in the Anatolian dialects as hēdi, ēdi
etc. (Vocke and Waldner 1982: 451), which makes it rather unlikely that it has been
shortened to become a prefix.
56
I owe this suggestion to Geoffrey Haig who also provided the example.
(64) ti.dāġaṭ-at al-mille daḥḥiǧ hīčiḏ al-ḥagg

disperse.pf -3. f . sg def -people look.imp . m . sg so def -truth
The people have dispersed. Look! That’s the truth!
(65) w-aḷḷa ʔawwali kull-u qanam al-bēt at
by-God formerly all-3. m . sg sheep def -house rel
t-rūḥ bī ʔarbaʕīn xamsīn ímy-it
2. m . sg -go.ipf in.3. m . sg 40 50 100-f . constr
dābb-it qanam šī
beast-f . constr sheep there.is
By God, formerly there were all sheep. Whatever house you entered
there were 40, 50, 100 head of sheep.
(66) ʔalḥaz mā.min mā.min alḥaz qanam zihīd
now neg .there.is neg .there.is now sheep few
čiṯīr zihīd bass b=aǧ-ǧbāl šī
very few only in=def -mountain.pl there.is
Now there aren’t. There aren’t. Now there are few, very few sheep, only
in the mountains are there sheep.
(67) hināk b=aǧ-ǧbāl šī qanam ʕaman hināk šī
there in=def -mountain.pl there.is sheep because there there.is
muṭraḥ-in yi-ṭlaʕ yi-sraḥ bī-hin ʔama
place-link 3. m . sg -go.up.ipf 3. m . sg -graze.ipf in-3. f . pl but
hēne mā.min
here neg .there.is
Over there in the mountains, there are sheep, because there is enough
place to graze them. But here aren’t any.
(68) muṭraḥ al bī-he mayye ʔalḥaz mā ḏạ ll
place rel in-3. f . sg water now neg remain.pf .3. m . sg
bī-he qanam
in-3. f . sg sheep
At places where there is water, there no sheep now remain.
(69) kull-u b=aǧ-ǧbāl haḏāk bāʕ-u ʕala
all-3. m . sg in=def -mountain.pl dem . m . sg sell.pf -3. m . pl on
ʔahil aǧ-ǧbile
people def -mountain
All of them are in those mountains. They have sold (them) to the mountain
people.
(70) ʔawwali al-qanam yōm.in y-ḥalb-ūn-ha ʕugub.ma

formerly def -sheep when 3-milk.ipf -m . pl -3. f . sg after.conj
y-ḥalb-ūn-ha al-ḥurma ač-čibīr-e al b=al-bēt
3-milk.ipf -m . pl -3. f . sg def -woman def -old-f . sg rel in=def -house
hiyya ǝ.t-ʕarf
3. f . sg 3. f . sg -know.ipf
In former times, when they milked them, after they have milked them …
the old woman (did it), the one who was at home, she knew it.
(71) ḥurm-it ač-čibīr-e al-uṃṃa haḏīč
woman-f . constr def -old-f . sg def -mother dem . f . sg
t-āxḏ al-ḥalīb w=ʕa=n-nār
3. f . sg -take.ipf def -milk and=on=def -fire
t-fawwr-u ʔawwal fāl
3. f . sg -bring.to.the.boil.ipf -3. m . sg first time
The old woman, the “mother” is the one who takes the milk and first brings
it to the boil on the fire.
(72) ʕugub.ma t-fawwr-u t-xallī
after.conj 3. f . sg -bring.to.the.boil.ipf -3. m . sg 3. f . sg -let.ipf .3. m . sg
t-sawwi dinlenme ta=yu-brud guṭma
3. f . sg -make.ipf rest [Turkish] purp =3. m . sg -cool.ipf a.little
After she has brought it to the boil she lets it rest so it cools down a little.’
Abbreviations
ap Active participle idf Indefinite

coll Collective imp Imperative
conj Conjunction ipf Imperfective
constr . st Construct state m Masculine
cop Copula mod Modifier
def Definite article neg Negation
dem Demonstrative pf Perfective
dual Dual pl Plural
f Feminine purp Purposive
foc Focus q Question particle
form 1, 2 First, second verbal stem rel Relativizer
F ut Future Sg Singular
G en . marker Genitive marker
References
Akin, Cahit, Jastrow, Otto & Shabo Talay. 2017. Nōṛšēn – ein anatolisch-arabischer Dialekt
der Sason-Muş-Gruppe. Zeitschrift für arabische Linguistik 65. 73–95.
Akkuş, Faruk. 2016. The Arabic dialect of Mutki-Sason areas. In George Grigore & Babriel
Bițună (eds.), Arabic varieties: Far and wide. Proceedings of the 11th Conference of
AIDA – Bucharest, 2015, 29–41. Bucharest: Editura Universității din București.
Andrews, Peter Alford (ed.). 2002. Ethnic groups in the Republic of Turkey. Wiesbaden:
Reichert.
Arnold, Werner & Peter Behnstedt. 1993. Arabisch-Aramäische Sprachbeziehungen im Qa-
lamūn (Syrien). Wiesbaden: Harrassowitz.
Benninghaus, Rüdiger. 2002. Identity in question – Some preliminary remarks on the Mḥal-
lami. In Peter Alford Andrews (ed.), Ethnic groups in the Republic of Turkey. Supple-
ment and Index, 179–189. Wiesbaden: Reichert.
Dahlgren, Sven-Olof. 1998. Word order in Arabic. Göteborg: Acta Universitatis Gothobur-
gensis dissertation.
Diem, Werner. 1979. Studien zur Frage des Substrats im Arabischen. Der Islam 56. 12–80.
Eksell, Kerstin. 2006. Analytic Genitive. In Kees Versteegh (ed.), Encyclopaedia of Arabic
Language and Linguistics, vol. 1, 82–85. Leiden & Boston: Brill.
Erwin, Wallace M. 1963. A short reference grammar of Iraqi Arabic. Washington, D.C.:
Georgetown University Press.
Grigore, George & Gabriel Biţună. 2012. Common features of North Mesopotamian Ara-
bic dialects spoken in Turkey (Şırnak, Mardin, Siirt). In M. Nesim Doru (ed.), Bilim
düşünce ve sanatta Cizre, 545–555. Istanbul: Mardin Artuklu Üniversitesi Yayınları.
Grigore, George. 2007. L’arabe parlé à Mardin: monographie d’un parler arabe «périphéri-
que». Bucharest: Editura Universitãţii din Bucureşti.
Grigore, George. 2008. Conditional structures in Mardini Arabic. Zeitschrift für Arabische
Linguistik 49. 63–77.
Heidemann, Stefan. 2002. Die Renaissance der Städte in Nordsyrien und Nordmesopotam-
ien: Städtische Entwicklung und wirtschaftliche Bedingungen in ar-Raqqa und Harrān
von der Zeit der beduinischen Vorherrschaft bis zu den Seldschuken. Leiden & Boston:
Brill.
Holes, Clive. 2016. Dialect, culture, and society in Eastern Arabia, vol. 3: Phonology, Mor-
phology, Syntax, Style. Leiden & Boston: Brill.
Jastrow, Otto. 1973. Daragözü: Eine arabische Mundart der Kozluk-Sason-Gruppe (Sü-
dostanatolien). Grammatik und Texte. (Erlanger Beiträge zur Sprach- und Kunstwis-
senschaft 46). Nürnberg: Hans Carl.
Jastrow, Otto. 1978. Die mesopotamisch-arabischen qǝltu-Dialekte, vol. 1: Phonologie und
Morphologie. Wiesbaden: Steiner.
Jastrow, Otto. 1981. Die mesopotamisch-arabischen qǝltu-Dialekte, vol. 2: Volkskundliche
Texte in elf Dialekten. Wiesbaden: Steiner.
Jastrow, Otto. 1990. Der arabische Dialekt der Juden von ʿAqra und Arbīl. Wiesbaden:
Harrassowitz.
Jastrow, Otto. 1994. The qǝltu dialects of Mesopotamian Arabic. In Jordi Aguadé, Federico
Corriente & Marina Marugán (eds.), Actas del congreso internacional sobre interfer-
encias lingüísticas arabo-romances y paralelos extra-iberos, 119–123. Zaragoza: [no
publisher].
Jastrow, Otto. 1999. Verbformen und ihre Funktionen im arabischen Dialekt von Kǝndērīb
(Südostanatolien). In Norbert Nebes (ed.), Tempus und Aspekt in den semitischen
Sprachen: Jenaer Kolloquium zur semitischen Sprachwissenschaft, 43–53. Wiesbaden:
Harrassowitz.
Jastrow, Otto. 2004. Aramäische Lehnwörter im arabischen Dialekt von Kinderib (Südost-
Türkei). Estudios de Dialectología Norteafricana y Andalusí 8. 99–103.
Jastrow, Otto. 2006. Anatolian Arabic. In Kees Versteegh (ed.), Encyclopaedia of Arabic
Jastrow, Otto. 2006–2007. Where do we stand in the research on the Anatolian qəltu dia-
lects? Romano-Arabica 6–7. 63–69.
Jastrow, Otto. 2011. Turkish and Kurdish influences in the Arabic Dialects of Anatolia. Türk
Dilleri Araştırmaları 21(1). 83–94.
Kern, Laurenz W. 2015. Kurden, Araber, Scheinlibanesen: Die vielschichtige Ethnizität der
Mḥallami. Wiener Zeitschrift für die Kunde des Morgenlandes 105. 189–202.
Lahdo, Ablahad. 2009. Some remarks on language use and Arabic dialects in eastern Tur-
key. Orientalia Suecana 58. 105–114.
Lahdo, Ablahad. 2010. Tillo-Arabic: A dialectological study with emphasis on social-lin-
guistic and language-contact phenomena. Saarbrücken: Lambert Academic Publisher.
Morony, Michael G. 2005. Iraq after the Muslim Conquest, 2nd ed. Piscataway, NJ: Gorgias
Press.
Nevo, Moshe. 1999. Notes on the Judaeo-Arabic dialect of Siverek. Zeitschrift für Arabis-
che Linguistik 36. 66–84.
Oppenheim, Max Freiherr von. 1939. Die Beduinen, vol. 1. Wiesbaden: Harrassowitz.
Owens, Jonathan. 2006. A linguistic history of Arabic. Oxford: Oxford University Press.
Procházka, Stephan & Ismail Batan. 2016. The functions of active participles in Šāwi
Bedouin dialects. In George Grigore & Gabriel Biţună (eds.), Arabic Varieties: Far
and Wide. Proceedings of the 11th International Conference of AIDA – Bucharest, 2015,
457–466. Bucharest: Editura Universităţii din București.
Procházka, Stephan. 2003. The Bedouin Arabic Dialects of Urfa. In Ignacio Ferrando &
Juan José Sánchez Sandoval (eds.), AIDA, 5th Conference Proceedings. Cádiz, Septem-
ber 2002, 75–88. Cádiz: Universidad de Cádiz.
Procházka, Stephan. 2006–2007. Does geographical periphery imply linguistic periphery?
The examples of the Arabic dialects of Cilicia and Urfa in southern Turkey. Roma-
no-Arabica 6–7. 109–132.
Procházka, Stephan. 2010. Genüsse aus der Steppe: Kulturgeschichtliches und Etymolo-
gisches zur Wüstentrüffel nebst einem Text im arabischen Dialekt von Urfa (Südost-
Türkei). Wiener Zeitschrift für die Kunde des Morgenlandes 100. 119–135.
Procházka, Stephan. 2013. Interesting facts on Ancient Mounds – Three Texts in the Bedouin
Arabic Dialect of the Harran-Urfa-Region (Southeastern Turkey). In Clive Holes & Ru-
dolf de Jong (eds.), Arabia and its dialect heritage. Essays in honour of Bruce Ingham,
203–213. Leiden & Boston: Brill.
Procházka, Stephan. 2018. The Northern Fertile Crescent. In Clive Holes (ed.), The histor-
ical dialectology of Arabic: Linguistic and sociolinguistic approaches. Oxford: Oxford
University Press [forthcoming].
Retsö, Jan. 2004. Relative-clause marking in Arabic dialects: A preliminary survey. In Mar-
tine Haak, Rudolf de Jong & Kees Versteegh (eds.), Approaches to Arabic dialects:
A collection of articles presented to Manfred Woidich on the occasion of his sixtieth
birthday, 263–273. Leiden & Boston: Brill.
Retsö, Jan. 2009. Nominal attribution in Semitic: Typology and diachrony. In Janet C. E.
Watson & Jan Retsö (eds.), Relative clauses and genitive constructions in Semitic,
3–33. Oxford: Oxford University Press.
Sabuni, Abdulghafur. 1980. Laut- und Formenlehre des arabischen Dialekts von Aleppo.
Frankfurt: Peter Lang.
Sasse, Hans-Jürgen. 1971. Linguistische Analyse des arabischen Dialekts der Mḥallamīye
in der Provinz Mardin (Südosttürkei). Munich: Ludwig-Maximilians-Universität dis-
sertation.
Talay, Shabo. 2001. Der arabische Dialekt von Hasköy (Dēr-Khāṣ) Ostanatolien. Teil I:
Grammatikalische Skizze. Zeitschrift für Arabische Linguistik 40. 71–89.
Talay, Shabo. 2006–2007. The influence of Turkish, Kurdish and other neighbouring lan-
guages on Anatolian Arabic. Romano-Arabica 6–7. 179–188.
Thackston, Wheeler M. 2006. Kurmanji Kurdish: A reference grammar with selected read-
ings. Unpublished Manuscript, Harvard University. https://sites.fas.harvard.edu/~ira-
nian/Kurmanji/kurmanji_1_grammar.pdf (accessed 20 February 2018).
Vocke, Sibylle & Wolfram Waldner. 1982. Der Wortschatz des anatolischen Arabisch. Er-
langen: W. Waldner.
Weninger, Stefan. 2011. Aramaic-Arabic language contact. In Stefan Weninger (ed.),
The Semitic Languages: An International Handbook, 747–755. Berlin & Boston: De
Gruyter Mouton.
Wittrich, Michaela. 2001. Der arabische Dialekt von Āzex. Wiesbaden: Harrassowitz.
2.5. The Neo-Aramaic dialects of eastern Anatolia
and northwestern Iran
Geoffrey Khan
1. Genetic affiliation
The Neo-Aramaic dialects of Anatolia and northwestern Iran belong to two major
subgroups of Neo-Aramaic. These are the Central Neo-Aramaic (CNA) subgroup
and the North-Eastern Neo-Aramaic (NENA) subgroup.1 The dialect geography
of Neo-Aramaic in the region has undergone radical changes over the last one
hundred years, the most stark result of which has been the displacement of all the
NENA-speaking communities from Turkey (see §2). The following geographical
description, therefore, relates to the situation that existed at the beginning of the
twentieth century, before these major upheavals. The CNA subgroup of dialects
were spoken by Christian communities in southeastern Turkey in the region of Ṭūr
ʿAbdīn, which extends from the town of Mardin in the west up to the boundary of
the Tigris river in the east and north. The main component of this subgroup is the
cluster of dialects of the Neo-Aramaic variety generally known in the academic
literature as Ṭuroyo.2 The main dialect split in Ṭuroyo is between the dialect of
the town of Midyat and the dialects of the surrounding villages. The differences
between the Ṭuroyo dialects are small and they are mutually comprehensible
(Jastrow 1985; Ritter 1990; Waltisberg 2016). In addition to the Ṭuroyo cluster
one other dialect is known to have existed in the CNA subgroup. This is the dialect
of the Christians of the village of Mlaḥso (now Yünlüce), situated near Lice in
northern Diyarbakir province, which is related to Ṭuroyo but exhibits a number of
significant differences (Jastrow 1994a, 2011).
The NENA dialects of the region were spoken to the east of the CNA area
across southeastern Turkey up to Lake Urmi in West Azerbaijan province in north-
western Iran. The boundary between the areas of the two dialect subgroups is
formed by the Tigris river. The NENA dialect area of the region is an extension
of the NENA dialect area of northern Iraq (Khan, this volume, chapter 3.4). As in
northern Iraq, the NENA dialects in the region dealt with in this chapter exhibit
considerable diversity. In southeastern Turkey the vast majority of the dialects
were spoken by Christian communities, with only isolated Jewish communities.
1
The term Central Neo-Aramaic was proposed by Tsereteli (1977) for Ṭuroyo in his
classification of the Neo-Aramaic dialects. The term North-Eastern Neo-Aramaic was
coined by Hoberman (1988: 557). For the basic features of NENA see chapter 3.4.
2
The most commonly used term used by native speakers of Ṭuroyo to refer to their lan-
guage is Ṣurāyt.
The Neo-Aramaic dialects of eastern Anatolia and northwestern Iran 191
In the region of Urmi in northwestern Iran, on the other hand, there were several
NENA-speaking Jewish communities, in particular in the towns of Urmi, Šəno
(official name Ushnuye), Solduz (official name Naghade) and Sablagh (now
Mahabad) (Garbell 1965; Khan 2008), in the area of Salamas north of the Urmi
plain (Duval 1883; Mutzafi 2015), and in adjacent towns that are now situated in
the east of Turkey, such as Başkale and Gawar (official name Yüksekova). The
Jewish dialects of this area are closely related and form a single cluster, which
is referred to here as the J. Urmi cluster. These are an extension of the so-called
trans-Zab cluster of Jewish dialects in Iraq (Khan, this volume, chapter 3.4;
Mutzafi 2008). A few isolated NENA-speaking Jewish communities were found
elsewhere in southeastern Turkey, for example in Challa (Čāl) (Fassberg 2010) in
Hakkâri province and Cizre (Nakano 1973) in Şırnak province, who spoke dialects
very closely related to those of the so-called lišana deni cluster of Jewish dialects
of northwestern Iraq (Khan, this volume, chapter 3.4).
The Christian NENA dialects of the region may be classified into several clus-
ters. These include the following:
i. The Bohtan cluster, spoken in villages in the area that is now the Şırnak and
Siirt provinces of Turkey (referred to below as C NENA Bohtan).3
ii. The Cudi cluster, spoken in villages in the area of the Cudi mountain (Cudi
Daği) that is now in the Şırnak province of Turkey (referred to as C NENA
Cudi).
iii. The Tiyare cluster, divided into Upper Tiyare and Lower Tiyare, spoken in
villages on the western side of what is now the Hakkâri province of Turkey
(referred to as C NENA Tiyare).
iv. The Txuma cluster, spoken in villages lying to the east of the Lower Tiyare
area (referred to as C NENA Txuma).
v. The Hakkari cluster, spoken in a variety of villages in the Hakkari mountains
east and north-east of Tiyare, including villages in the area of lake Van, and
Salamas in northwestern Iran (referred to as C NENA Hakkari).
vi. The cluster of dialects spoken in the far east of Turkey in the areas of
Šamməsdin, Gawar and in the mountains of Tergawar over the border in
northwestern Iran (referred to as C NENA Šamməsdin-Gawar).
vii. The Urmi cluster, which includes varieties of what is best considered a single
dialect spoken by Christians in villages situated on the plain of Urmi and
within the town of Urmi (referred to as C. Urmi).
3
Fox (2002, 2009) uses the regional term “Bohtan” to refer to the dialect specifically of
the villages of Ruma, Shwata, and Borb, which are distinct from other dialects of the
Bohtan cluster in some features. For the sake of clarity, examples from Fox’s data are
cited here under the name of the village Ruma.
192
Geoffrey Khan
Figure 1: Location of Neo-Aramaic dialects discussed in this chapter

The Neo-Aramaic dialects of eastern Anatolia and northwestern Iran
193
Figure 2: Christian tribal areas in the Hakkari region (southeastern Turkey)

194 Geoffrey Khan
2. History of the speech community
The geographical division between the Central Neo-Aramaic subgroup and the
NENA subgroup coincides with the ancient border between the Romans and Par-
thians, and later between the Byzantines and the Sassanians (Kim 2008). The lin-
guistic boundary between the two subgroups also coincides with an early Chris-
tian ecclesiastical division between the Jacobite (Syrian Orthodox) denomination
of the communities west of the Tigris and the Nestorian (Church of the East)
denomination of the communities to the east of the Tigris. The Christianity of
the NENA-speaking communities has become more diverse in recent centuries,
especially after the formation of the Chaldean Church, which is in communion
with Rome, and the activities of Protestant missionaries in the 19th century. There
are only sparse historical records relating to the various Neo-Aramaic-speaking
settlements of the region. Some linguistic aspects of the dialects, however, give
us insights into the history of the communities. The diversity of the NENA dia-
lects, for example, can be interpreted as a reflection of the antiquity of settlement
of the communities. Another factor that is likely to have had a bearing on this
diversity is the fact that many of the NENA-speaking communities had the status
of semi-independent tribes (ʾaširatte). As elsewhere in the NENA subgroup, the
Jewish NENA dialects of the region exhibit major differences in their structure
from the neighbouring Christian NENA dialects. This reflects the differing migra-
tion histories of the communities. The Jews of Urmi, for example, had settled in
the town at an early period whereas the Christians of the area were almost exclu-
sively agriculturalists living in the surrounding villages and only began to settle in
the town in large numbers in the late 19th century.
The Neo-Aramaic-speaking communities underwent a major upheaval during
the First World War in 1915, when they suffered massacres and mass displace-
ment from their homes in an Ottoman-led campaign in southeastern Turkey. No
accurate statistics are available for the total death toll, but it is estimated that as
much as half of the Neo-Aramaic-speaking population perished, either through
violence, disease or starvation, possibly amounting to around 250,000. Some of
the survivors of the Christian communities subsequently returned to their homes
in the Ṭūr ʿAbdīn area and the villages in the vicinity of the Cudi mountain. The
NENA-speaking communities in the remainder of southeastern Turkey, however,
became permanently displaced. The majority settled initially in refugee camps in
Iraq, then subsequently in Iraqi towns, in particular Baghdad and Kirkuk. Some
of the Christians from the Bohtan region fled northwards and found safety in the
Russian empire, eventually settling in the village of Gardabani in Georgia or in
Krosnodar. From 1933 to 1935 about ten thousand refugees from southeastern
Turkey were settled in refugee camps in northeastern Syria, then subsequently in
villages on both sides of the Khabur River. During the Kurdish uprisings in the
second half of the twentieth century there were further upheavals. The villages
in the Cudi region of southeastern Turkey were destroyed in the 1980s and early
1990s and the NENA-speaking population settled in Europe, mainly in France and
Belgium. Many of the speakers of Ṭuroyo left the region, settling in the Turkish
cities, especially Istanbul, or emigrating, mainly to Germany, Sweden and the
USA.
The vast majority of the Jews left the region in the early 1950s and settled in
the newly established State of Israel.
3. Current status of the speech community
The Ṭuroyo-speaking community of the Ṭūr ʿAbdīn region is now considera-

bly depleted. A few isolated members of the community who emigrated subse-
quently built houses back in the homeland, but the general trend is still an inex-
orable migration and now only about 2,500 still live in the original homeland,
mainly in the town of Midyat. The dialect of Mlaḥso, which was documented by
Jastrow (1994a) is now, apparently, extinct.
There are now no NENA-speaking communities in southeastern Turkey. Those
who had remained in the area of the Cudi mountain finally left in the 1980s when
their villages were destroyed in conflict between the Turks and the Kurds. There are
still some Christians living in the Urmi region, almost exclusively now in the town
of Urmi rather than the villages. A large proportion of the Christian community,
however, has left the region. Some have settled in the large Iranian cities, mainly
Tehran. In the 19th and early 20th centuries many Christians speakers of the dialect
of Urmi and other dialects of northwestern Iran and the adjacent region moved
to the Caucasus. These dialects, especially the Urmi dialect, are still spoken by
communities in Georgia and Armenia, and elsewhere in the former Soviet Union
(Tsereteli 1970). A large proportion of speakers have now settled in the USA, in
particular in California, and Australia.
In recent times the communities who settled in the Khabur region have been
under considerable pressure, in particular during the current military conflict in
Syria.
After the Jews settled in Israel in the 1950s there has been a gradual decline in
the number of speakers of the dialects. The dialect of Challa (Čāl) is now extinct
(Fassberg 2010). There are currently very few surviving speakers of the Urmi
cluster of dialects who have a good knowledge of the language. In the 19th century
many families from northwestern Iran, in particular from the region of Salamas,
moved to Tbilisi in Georgia, and subsequently in 1950 were settled by Stalin in
Almaty in Kazakhstan, where several speakers of the dialect can be found today.
196 Geoffrey Khan
4. Sociolinguistic situation
There was a general state of multilingualism across the Neo-Aramaic speech com-
munities of the region. In the Ṭūr ʿAbdīn area speakers of Ṭuroyo often also speak
vernacular Arabic (of the town of Mardin or the Mḥallamī bedouin), Kurdish and
Turkish, the latter being the official state and school language. In addition, Clas-
sical Syriac is used as a liturgical language and also, by some learned members of
the community, as a written language. In Sweden in the 1980s an official written
form of Ṭuroyo in the Roman alphabet was created by Yusuf Ishaq and his collab-
orators (Ishaq 1990; Heinrichs 1990).
In the NENA-speaking area of southeastern Turkey at the beginning of twen-
tieth century before the upheavals there was general bilingualism in Neo-Ara-
maic and Kurdish. There were also Armenian-speaking communities in the region.
There was little exposure to Arabic, though Arabic loanwords within the dialects
suggest that the contact with Arabic was greater at an earlier historical period.
The communities from southeastern Turkey who settled in the Khabur area of
Syria in the 1930s speak Arabic in addition to their native NENA dialects. The
communities of the Khabur generally preserved their individual dialects and did
not develop a koine dialect as was the case among the refugee communities in Iraq
(Talay 2008a).
In northwestern Iran speakers of the NENA dialects had contact with Kurdish
and Azeri Turkish. Nowadays most Christian speakers of NENA who still live in
the area also speak Azeri, the vernacular of the Muslim population in this area, and
also Persian, the official language of Iran, but not Kurdish. There are, however,
numerous Kurdish loanwords in the NENA dialects of this area and their mor-
phology indicates that they form an older historical layer of the lexicon than the
many Azeri words. This indicates that there must have been a more widespread
knowledge of Kurdish in the NENA communities at an earlier period (Khan 2016,
vol. 3: 1–3). Western missionaries who were active among the Christian commu-
nities of the area in the middle of the 19th century developed a literary form of
the Christian Urmi dialect written in Syriac script. This form of literary language
became widely used by learned native speakers and is still used to this day (Murre-
van den Berg 1999).
The NENA speakers who live in the Caucasus speak also Russian, Armenian
or Georgian. In Armenia the Christian NENA speakers live in villages together
with Armenians and there are many intermarriages. Many of the native Armeni-
an-speakers in these villages also speak NENA.
The surviving Jewish speakers of NENA from northwestern Iran who settled
in Israel are fluent in Modern Hebrew, which is now their primary language.
The communities who have survived in Almaty now generally prefer to speak in
Russian, especially the young generations.
5. Phonology
Dialects differ in the degrees of conservatism in phonology. Innovations in

phoneme inventories are typically conditioned by language contact, the less con-
servative dialects eliminating sounds that do not occur in contact languages. In
some cases language contact can facilitate archaic retention. In what follows a
selection of cases will be presented from the consonant system. The term ‘earlier
Aramaic’ is used to refer to what can be assumed to be the ultimate historical form
of features attested in NENA.
5.1. bgdkpt consonants

In earlier Aramaic the stop consonants bgdkpt developed fricative allophones after
vowels, which can be represented *[ḇ], *[ḡ], *[ḏ], *[ḵ], *[p̄], *[ṯ]. In both the
CNA and NENA dialects these fricative allophones became phonemicized, with
the result that minimal pairs are found with stops and fricatives, e. g. CNA Ṭuroyo
kətyo ‘he is’ vs. kəθyo ‘she comes’, NENA Ashitha (C. Tiyare) tela ‘fox’, θela ‘she
came’ (Borghero 2005: 43).
Ṭuroyo in the CNA subgroup is the most conservative dialect in this feature,
since it is the only dialect in the region that has preserved all the original fricative
forms of the bgdkpt consonants (Jastrow 1985: 6–10), e. g.
táwno ‘straw’ < taḇnā
ráġlo ‘foot’ < raḡlā
bóxe ‘he weeps’ < *bāxē
ʾíðo ‘hand’ < *ʾīḏā
káfno ‘hunger’ < *kap̄ nā
tlóθo ‘three’ < *tlāṯā
CNA Mlaḥso is less conservative in that the interdental fricatives *ḏ and *ṯ have
merged with /s/ and /z/ respectively (Jastrow 1994a):
tevnó ‘straw’ < taḇnā
reġló ‘foot’ < *raḡlā
boxé ‘he weeps’ < *bāxē
ʾizó ‘hand’ < *ʾīḏā
nofél ‘he falls’ < *nāp̄ el
tlosó ‘three’ < *tlāṯā
Mlaḥso differs from Ṭuroyo also in the reflex of fricative *ḇ. In Ṭuroyo this reflex
is /w/, which coincides with the reflex of historical *w. In Mlaḥso the reflex of *ḇ
is the labio-dental /v/, which is distinct from the reflex of historical *w:
198 Geoffrey Khan
Ṭuroyo táwno ‘straw’ (< *taḇnā) gáwzo ‘nut’ (< *gawzā)

Mlaḥso tevnó ‘straw’ (< *taḇnā) gawzó ‘nut’ (< *gawzā)
CNA, both Ṭuroyo and Mlaḥso, is distinguished from NENA in the preservation of
the fricatives /ġ/ and /f/ in the bgdkpt series of consonants, probably due to contact
with Arabic, which has these sounds in its phoneme inventory (Jastrow 2015). In
all NENA dialects of the region historical *ḡ has been lost and shifted either to a
laryngeal /ʾ/ or to Ø. The historical development was [ḡ] > /ʿ/ > /ʾ/ > Ø, in which
the velar first shifted to a pharyngeal fricative before weakening to a laryngeal and
zero (Tsereteli 1990). The reflex /ʾ/ is, accordingly, more archaic than the reflex
Ø, e. g.
Hertevin (C NENA Bohtan) mášeʾ (< *maššeḡ; cf. Ṭuroyo mašəġ) ‘he
washes’ (Jastrow 1988: 6)
C Urmi +
pálli (< *mpalləḡ; cf. Ṭuroyo mfaláġ)
‘divide!’
Throughout NENA the reflex of the fricative *ḇ coincides with the reflex of *w,
which is /w/ throughout most of southeastern Turkey, but /v/ in the eastern periph-
ery of the region in northwestern Iran:
Ashitha (C Tiyare NENA) kθawa ‘book’ (< *kṯāḇā)
C Urmi ctava
A historical *p̄ has generally merged with the stop /p/, e. g.
C Urmi nápəl (< *nāp̄ əl; cf. Ṭuroyo nófel) ‘he
falls’
Within NENA of the region, several dialects have preserved the interdental frica-
tives. These are mainly in the C Lower Tiyare and C Txuma clusters, e. g.
Ashitha (C NENA Lower paθəx ‘he opens’, ʾiða ‘hand’ (Borghero
Tiyare) 2005)
In other dialect clusters, however, the interdentals *ṯ and *ḏ have been lost. In
most cases they have merged with the stops /t/ and /d/. This shift is attested across
the Christian NENA dialects of the region from Bohtan and Cudi in the western
periphery to Urmi in the eastern periphery, e. g.
Hassana (C NENA Bohtan) patəx ‘he opens’ (< *pāṯəx),
beta ‘house’ (< *bayṯā),
ʾida ‘hand’ (< *ʾīḏā)
Bēṣpən (C NENA Cudi) patəx, bayta, ʾida
Urmi (C NENA Urmi) patəx, beta, ʾida
In dialects between these two peripheries, the unvoiced interdental *ṯ has under-
gone other shifts. In the Upper Tiyare dialects *ṯ shifts to /š/ in certain contexts,
most of which have in common the existence, either synchronically or diachroni-
cally, of a preceding high front vowel:
Upper Tiyare paθəx ‘he opens’ (< *pāṯəx),
bɛša ‘house’ (< *bayṯā),
mliša ‘full fs.’ (< *mlīṯā),
ʾiða ‘hand’ (< *ʾīḏā) (Talay 2008a: 66–68)
In many dialects in the C Hakkari cluster the interdental *ṯ is weakened to /h/ or
Ø. When its reflex is Ø, adjacent vowels merge or are connected by a glide, e. g.
Baz (C NENA Hakkari) maha ‘village’ (< *māṯā),
xədyuwwa ‘happiness’ (< *ḥidyūṯā)
(Mutzafi 2000)
Jilu (C NENA Hakkari) ma ‘village’ (< *māṯā),
məlkuwwa ‘kingdom’ (< *malkūṯā)
(Fox 1997: 16)
There is a certain degree of lexical and morphological conditioning of this weak-
ening, in that it is a feature of particular lexical items and morphemes. In some
lexical items in these dialects the reflex of *ṯ is a stop /t/, e. g.
Jilu (C NENA Hakkari) patəx ‘he opens’ (< *pāṯəx)
In the Jewish NENA dialects of northwestern Iran the interdentals generally shift
to the lateral /l/, as is the case in the Jewish trans-Zab NENA dialects of Iraq
(chapter 3.4., §5.1):
J. Urmi paləx ‘he opens’ (< *pāṯəx),
ela ‘festival’ (< *ʿēḏā)
In a few words *ṯ in this Jewish dialect cluster has lost its oral articulation and
shifted to /h/. Most words in which this shift is found have suprasegmental phar-
yngealization, e. g.
J. Urmi +
ahra ‘town’ (< *ʾaṯrā),
+
nahale ‘ears’ (< *naṯāṯā)
In a few cases, including a number of common words in the lexicon, the reflex of
*ḏ is /d/:
J. Urmi ida ‘hand’ (< *ʾīḏā),
od ‘he does’ (< *ʿāwəḏ)
In a small group of NENA dialects in the Bohtan cluster on the western periphery
of the region the velar fricative *ḵ shifts to a pharyngeal /ḥ/:
200 Geoffrey Khan
Hertevin (C NENA Bohtan) baḥe ‘he weeps’ (< *bāḵē)

The dorsal stops *k and *g of the bgdkpt series undergo palatalization in many
of the NENA dialects of the region, in some cases resulting in the affricates /č/
and /j/. In most dialects where palatalization occurs, it is not systematic and tends
to be restricted to particular lexical items and phonetic contexts, e. g. Hertevin
(C NENA Bohtan) before long vowels and short front vowels (Jastrow 1988: 4).
The process is most advanced in the varieties of the C. Urmi dialect that were
spoken on the southern Urmi plain, in which affrication of *k and *g is regular in
all contexts. A similar palatalization of dorsals is found in the Kurdish and Azeri
dialects of this area (Khan 2016, vol. 1: 109–112), e. g.
Gulpašan (C NENA Urmi) malča ‘king’ (< *malkā),
jəšra ‘bridge’ (< *gəšra)
In C. Urmi a dorsal *k may palatalize completely to /y/ in a few cases, e. g.
*dukta > duyta ‘place’
5.2. Laryngeal Settings of Unvoiced Stops and Affricates

NENA dialects in the region exhibit both aspirated and unaspirated unvoiced stops
and affricates. The aspirated series are found elsewhere in NENA but the unaspi-
rated series is unique to this region. The following illustrates the two series in the
C. Urmi dialect (the voiced unaspirated consonants are distinguished by a dia-
critic):
unvoiced aspirated unvoiced unaspirated voiced
/p/ /p̂ / /b/
/t/ /ṱ/ /d/
/č/ /č̭ / /j/
/c/ /c̭ / /ɟ/
5.3. Pharyngeals and laryngeals

The pharyngeal consonants *ḥ (unvoiced pharyngeal fricative) and *ʿ (voiced
pharyngeal fricative) have been preserved in CNA:
Ṭuroyo ḥamro ‘wine’ (< *ḥamrā),
ʿafro ‘dust’ (< *ʿap̄ rā)
In most NENA dialects of the region *ḥ has shifted to /x/, e. g.
C. Urmi xmara ‘donkey’ (< *ḥmārā)
In a small group of NENA dialects in the Bohtan cluster on the western periphery
of the region, the reflex of *ḥ is /ḥ/ (Jastrow 1994b; Talay 2008a: 44–45). In such
dialects a velar fricative *ḵ has shifted to the pharyngeal /ḥ/, so the pharyngeal
reflex of *ḥ may have resulted in the development *ḥ > *x > *ḥ:
Hertevin (C NENA Bohtan) ḥmara ‘ass’ (< *ḥmārā)
In all NENA dialects the voiced pharyngeal *ʿ has been weakened to the laryngeal
/ʾ/ (occasionally /h/) or to Ø
Rumta (C NENA Tiyare) maṛʾa ‘illness’ (< *marʿā), beʾe ‘eggs’
(< *bēʿē) (Talay 2008a: 72–78; 2008b)
Qočanəṣ (C NENA Hakkari) bihe ‘eggs’ (< *bēʿē) (Talay 2008a: 79)
C. Urmi +
marra ‘illness’ (< *marʿā), biyyə ‘eggs’
(< *bēʿē) (Khan 2016, vol. 1: 169–172)
In the variety of the C. Urmi dialect spoken in Armenia a historical h shifts to the
velar fricative /x/ under the influence of a similar shift in the Armenian dialects of
the region, e. g.
C. Urmi (Armenia) xada ‘thus’ (< hada) (Khan 2016, vol. 1:
104)
5.4. Pharyngealized consonants

In CNA and the dialects in the western half of the NENA area of the region the
historical pharyngealized consonants of earlier Aramaic *ṭ and *ṣ have generally
been preserved as pharyngealized segments:
Ṭuroyo (CNA) ṭuro ‘mountain’ (< *ṭūrā),
ṣawmo ‘fast’ (< *ṣawmā)
Ashitha (C NENA Tiyare): ṭura, ṣoma
In addition, several of these dialects exhibit pharyngealization of a number of other
consonants, especially sonorants and labials. These were not historically pharyn-
gealized segments, but now have in some cases developed into fully phonemic
consonants that contrast with the non-pharyngealized counterpart. This applies in
particular to the rhotic *r:
Ashitha (C NENA Tiyare) ʾamṛa ‘wool’ (< *ʿamrā),
ʾamra ‘she says’ (< *ʾāmrā)
In some of the Tiyare dialects, in addition to the non-pharyngealized alveolar /r/,
a non-pharyngealized retroflex /ɻ/ has developed, which both can stand in opposi-
tion to the pharyngealized rhotic /ṛ/:
202 Geoffrey Khan
Ashitha (C NENA Tiyare) paɻa ‘layer of dust’ (< *ʿpārā),

para ‘ornamental coin’ (< Turk. para),
paṛa ‘lamb’ (< *parrā) (Mutzafi 2014)
In the NENA dialects in the eastern half of the region, the original pharyngealiza-
tion of the consonantal segments has developed into a suprasegmental feature that
takes a whole word as its domain (marked here by the symbol +). In words with
suprasegmental pharyngealization in Christian dialects, the original pharyngealized
consonant segment *ṭ has become the tense, unaspirated stop segment /ṱ/, which is
distinct from the aspirated stop /t/ (Khan 2013; 2016, vol. 1: 107–139), e. g.
C. Urmi +
ṱuyra ‘mountain’ (< *ṭūrā)
+
soma ‘fast’ (< *ṣawmā)
In the Jewish dialects of northwestern Iran, the historical *ṭ merges with the aspi-
rated stop /t/:
J. Urmi +
tura, +soma
There are differences in the distribution of suprasegmental pharyngealiza-
tion in the lexicon across the NENA dialects that have this feature (Khan 2016,
vol. 1: 75–76), e. g.
C. Urmi +
Mawana (C NENA
Šamməsdin-Gawar)
mata +
mata ‘village’
+
dana dana ‘time’
5.5. Vowels
5.5.1. */ā/ > /o/

The CNA dialects are distinguished from the NENA dialects of the region by the
shift of an original long *ā to /o/:
Ṭuroyo ḥmoro ‘ass’ (< *ḥmārā),
malko ‘king’ (< *malkā)
In most of the NENA dialects of the region, as in NENA dialects in other regions,
an original long *ā has not undergone this quality shift, e. g.
Ashitha (C NENA Tiyare) xmara, malka
C. Urmi xmara, malca
In a few dialects in the C. Bohtan cluster in the north-west periphery of NENA,
which is geographically close to the CNA area, *ā in an open syllable that is not
word-final shifts to /o/.
Ruma (C NENA Bohtan) xmóra ‘donkey’ (< *ḥmārā), pótax ‘he

opens’ (< *pātəx), but málka ‘king’
(< *malkā), pátxa ‘she opens’ (< *pātxā)
(Fox 2009)
These examples show that the quality shift took place after the shortening of an
original long *ā in a closed syllable or an unstressed word-final open syllable,
which do not change quality. The quality shift is, therefore, a later phenomenon
than in CNA.
In other dialects of the Bohtan cluster, such as Hertevin, a long /a/ is realized
as a back [ɑː] (Jastrow 1988: 15).
5.5.2. Vowel raising

In many NENA dialects of the C Hakkari cluster, a short *a vowel is raised to a
higher quality in closed syllables, or syllables that were historically closed:
Jilu (C NENA Hakkari) kelba ‘dog’ (< *kalba),
genawa ‘thief’ (< *gannāwā ‘thief’),
keka ‘tooth’ (< *kakkā),
breta ‘daughter’ (< *brattā),
besima ‘pleasant’ (< *bassima),
šedər ‘he sends’ (< *mšaddər)
(Fox 1997, 2015)
Dīz (C NENA Hakkari) kəlba, ginawa, kika, brita,
bisima, šidəṛ
(Talay 2008a: 140, 240)
The lexical distribution of this vowel-raising differs across the Hakkari dialects,
and is found also in isolated cases in adjacent clusters, e. g.
Ashitha (C NENA Tiyare) genawa, but čaka, čalba, brata, basima,
mšadər
C. Urmi ɟinava, kika, but calba, brata, basima,
+
šadər
In the Jilu dialect there is a productive rule whereby a stressed /a/ shifts to /e/
before a following /e/ through a harmony process (Fox 1997: 18–19):
náša ‘man’ néše ‘men’
In Christian NENA dialects in the eastern area of the region (Hakkari, Šamməs-
din-Gawar, Urmi) an original */ē/ is raised to /i/, e. g.
C. Urmi cipa ‘stone’ (< *kēp̄ ā)
204 Geoffrey Khan
Concomitantly in these dialects the original back rounded vowel */ō/ is raised to
/u/, e. g.
C. Urmi +
naṱura ‘guard’ (< *nāṭōrā)
In these dialects an original */ū/ is fronted to the region of [y], represented here
by /ü/:
+
Mawana (C NENA xabüša ‘apple’ (< ḥabbūšā)
Šamməsdin-Gawar)
This seems to have been induced by a similar fronting of u in Kurdish dialects of
southeastern Turkey and northern Iraq (Haig, this volume, chapter 3.3, §4.1.2.).
The shift of o > u in the NENA dialects can be regarded as a pull-chain effect to
fill the space left by the fronting of u, which raises also the corresponding front
vowel e > i. In C. Urmi long */ū/ is not fully fronted but shifts to a diphthong with
a palatal offglide:
C. Urmi xabuyša
In C. Salamas and C. Gawilan, spoken to the north of Urmi, the */ū/ is pushed
upwards resulting in the fortition of the offglide of the vowel as a velar fricative:
C. Salamas xibuxša
5.6. Diphthongs
The CNA dialects, Ṭuroyo and Mlaḥso, preserve the diphthongs *aw and *ay:
Ṭuroyo/Malḥso gawzo ‘nut’, qayse ‘wood (pl.)’
Also in some of the NENA dialects of the region the diphthongs are preserved, e. g.
Baz, Maha Xtaya (C NENA yawna ‘dove’, layša ‘dough’
Hakkari): (Mutzafi 2000: 298)
Halmun (C NENA Tiyare): mawθa ‘death’, bayθa ‘house’
(Talay 2008a: 145–148)
In many NENA dialects, however, the diphthongs are contracted. The result of the
contraction of the diphthong *aw is /o/, e. g.
Ashitha (C NENA Tiyare) moθa ‘death’ (< *mawṯā)
The diphthong *ay contracts to various vowels across the NENA area, including
/e/, /ɛ/ and /a/. These have a geographical correlation and are characteristic of
specific clusters, the vowel /e/ being found in clusters on the eastern side of the
region, /ɛ/ in the Tiyare cluster in the central area, and /a/ in the western periphery
in the Bohtan and Cudi clusters:
C. Urmi beta ‘house’ (< *bayṯā)

Ashitha (C NENA Tiyare) bɛθa
Harbole (C NENA Cudi) bata
5.7. Syllable structure

The general principles of the syllable structure of the NENA dialects of the region
are similar to those of the NENA dialects of Iraq (see chapter 3.4, §5.5). Vowels in
open syllables are generally long and those in closed syllables are short. Diacritics
are used in the transcription, therefore, only where the length of a vowel deviates
from this principle. The /ə/ vowel, however, is always short. A couple of distinc-
tive features of the dialects of the NENA region will be mentioned here.
In some of the NENA dialects of the region, the process of eliminating short
vowels in open syllables is more advanced than elsewhere. This involves length-
ening short vowels or geminating the following consonants. Such processes are
found in cases where short vowels in open syllables arise due to the elision of
consonants, e. g.
Baz, Aruntus (C NENA Hakkari) táṛa ‘door’ (< *tăra < *tarʿā)
C. Urmi +
tárra
It is also found where a short vowel in an open syllable originates as an epenthetic,
e. g.
Harbole (C NENA Cudi) maplíṭa ‘she brings out’ (< *maplə̆ṭa)
C Urmi +
maplə́ṱṱa
Some dialects add affixed augments to increase the number of syllables in a word.
One purpose of this is to increase monosyllabic words to bisyllabic:
Hertevin (C NENA Bohtan) ʾéttek ‘there is’ (ʾet + augment ek)
Baz, Aruntus (C NENA Hakkari) ʾíni ‘there is’ (ʾi [< *ʾīṯ] + augment ni)
The CNA dialects exhibit a number of differences in syllable structure from NENA
dialects.
In Ṭuroyo long vowels are shortened when the syllable is closed, but in Mlaḥso
the vowel generally remains long, or half long (Jastrow 1994a: 23):
Ṭuroyo Mlaḥso
dóməx [ˈdoːməx] doméx [doˑˈmeːx] ‘he sleeps’
də́mxi [ˈdəmxi] domxí [doˑmˈxiː] ‘they sleep’
206 Geoffrey Khan
In CNA gemination of consonants is weakened after all vowels, unlike NENA

where it is preserved after high front vowels (Khan, this volume, chapter 3.4,
§5.5):
Ṭuroyo Ashitha (C NENA Tiyare)
šato šata ‘year’
ʿezo ʾəzza ‘goat’
Monosyllabic nouns with an initial consonantal cluster are made bisyllabic by
phonemicizing a prosthetic vowel before the cluster, unlike NENA, in which a
epenthetic vowel between the consonants of the cluster is phonemicized:
Ṭuroyo Ashitha (C NENA Tiyare)
ʾádmo də́mma ‘blood’ < *dmā
ʾə́šmo ʾəzza ‘name’ < *šmā
5.8. Stress
In CNA there is a difference in basic stress position between Ṭuroyo and Mlaḥso,
in that Ṭuroyo has penultimate stress where Mlaḥso has word-final stress:
Ṭuroyo Mlaḥso
málko malkó ‘king’
dóməx doméx ‘he sleeps’
The attachment of enclitics does not affect stress position:
Ṭuroyo hárke ‘here’, hárke=yo ‘he is here’
Mlaḥso ʾeskó ‘woman’, ʾeskó=yo ‘it is a woman’
The Christian NENA dialects of the region have basic penultimate stress, as is
the case with the Christian NENA dialects of Iraq (chapter 3.4, §5.6). The Jewish
dialects of northwestern Iran, however, have word-final basic stress:
C. Urmi J. Urmi
béta belá ‘house’
dáməx +
damə́x ‘he sleeps’
In the Christian dialects, an original penultimate stress becomes word-final when
syllables merge due to loss of consonants. This is particularly conspicuous in dia-
lects of the Hakkari cluster, in which the fricative *ṯ is lost between vowels. Such
a process can lead to phonemic stress oppositions:
Jilu (C NENA Hakkari) báxta ‘woman’ (< *báxtā)
baxtá ‘women’ (< *baxtā́ ṯā)
In most of the NENA dialects of the region, as generally in NENA, attachment of

the enclitic copula does not affect the stress position:
C. Urmi basíma + ilə basímələ ‘it is fine’
In Ruma and the related dialects in the Bohtan cluster, however, stress shifts when
a copula is added (Fox 2009: 62):
ṭówa + ile ṭowéle ‘he is good’
6. Nominal morphology
6.1. Pronouns and copula

The independent pronouns in CNA have the following form:
Ṭuroyo (Midyat) Ṭuroyo (villages) Mlaḥso
3ms huwe hiye hiye
3fs hiya hiya hiya
3pl hənne hənnək hiyen
2s hat hat hat
2pl hatu hatu hatun
1s ʾŭno ʾono ʾono
1pl ʾaḥna ʾaḥna ʾeləna
These pronouns have undergone a number of innovations in comparison with the
forms that should be reconstructed as their historical ancestors (Jastrow 1990).
The 3rd person forms, for example, are augmented by additional pronominal suf-
fixes: Ṭuroyo (Midyat) huwe < *hu + e, hiya < *hi + a, hənne < *hen + hen. In
Mlaḥso all the 3rd person pronouns have the same stem (originally that of the 3fs
pronoun) and the number and gender is only distinguished by the augment. The
form of the 1pl pronoun in Mlaḥso, eləna, is originally an accusative form.
The original form of the 3rd person pronouns are used as proclitics with the
function of a definite article:
Ṭuroyo ʾú-malko ‘the king’ < *hū malkā
ʾí-malakθo ‘the queen < *hī-maləkṯā
ʾám-malke ‘the kings’ < *hen-malkē
208 Geoffrey Khan
In CNA the copula is expressed by enclitic forms of the pronouns:

Ṭuroyo (Midyat)
Independent pronoun Enclitic pronominal copula
3ms. huwe =yo
3fs. hiya =yo
3pl. hənne =ne
2ms. hat =hat
2fs. hat =hat
2pl. hatu =hatu
1s ʾŭno =no
1pl. ʾaḥna =na
In the villages there is gender distinction in the 2s:
Ṭuroyo (Midin)
Independent pronoun Enclitic pronominal copula
2ms. hat =hət
2fs. hat =hat
Ṭuroyo has an alternative copula paradigm in which the enclitic pronouns are
attached to the existential particle kit ‘there is’:
Ṭuroyo (Midin)
3ms. kətyo
3fs. kətyo
3pl. kətne
2ms. kəthət
2fs. kəthat
2pl. kəthatu
1s kətno
1pl. kətna
Deictic pronouns are suffixed to the nouns they modify. The paradigm in Ṭuroyo
is as follows:
Near deixis
ms -ano ‘this’ cf. Syriac hānā
fs. -aθe ‘that’ cf. Syriac hāḏe
pl. -ani ‘these’ cf. Syriac hālēn
Far deixis
ms -awo ~ -awok ‘that’ cf. Syriac haw
fs. -ayo ~ -ayək ‘that’ cf. Syriac hāy
pl. -anək ‘those’ cf. Syriac hānōn (m.) hānēn
(f.)
In Ṭuroyo the suffixed deictics are combined with the prefixed definite article, e. g.
ʾu-malk-ano ‘this king’, ʾu-malk-awo ‘that king’ (Jastrow 1985: 39). In Mlaḥso the
definite article is not obligatory in such constructions (Jastrow 1994a: 32).
Ṭuroyo also has a deictic copula which points to the existence of an entity in
the speech situation, formed from the particle ka + pronominal object suffixes
containing the element l. The pronominal copula may be optionally added to some
forms (Jastrow 1985: 122):
Ṭuroyo (Midin)
3ms. kalé ~ kalé=yo ‘There he is’
3fs. kalā́ ~ kalā́ =yo ‘There she is’
3pl. kalə́n ~ kalə́n=ne ‘There they are’, etc.
2ms. kalə́x
2fs. kaláx
2pl. kalóxu
1s. kalí ~ kalí=no
1pl. kalán
The NENA dialects of the region exhibit considerable diversity in the forms of the
personal pronouns and deictic pronouns. There are many parallels with the forms
of the pronouns used in the NENA dialects in Iraq (chapter 3.4, §6.1). Here we
shall focus on the differences between NENA and CNA.
In all NENA dialects the 3rd person singular personal pronouns, i. e. the pro-
nouns that have an anaphoric function, have a prefixed ʾa-. The proto-NENA forms
can be reconstructed as a-hu (3ms) and a-hi (3fs) (Hoberman 1988). These forms
have been preserved in some dialects of the region:
Ṭuroyo (Midyat) Baz, Maha Xtaya
(C NENA Hakkari)
3ms. huwe ʾahu
3fs. hiya ʾahi
The forms ʾahu and ʾahi may ultimately go back to far deictic pronouns *hāhū
and *hāhī, in which the *hā- is a deictic element. These deictic pronouns would
subsequently have taken on the function of anaphoric pronouns. Alternatively, the
ʾa- prefix may have developed by analogy with 1st and 2nd person pronouns, which
also begin with ʾa-.
The reconstructed proto-NENA singular deictic pronouns and the reflexes of
deictic pronouns in a selection of NENA dialects are presented in the table below.
This includes also the anaphoric pronoun, i. e. the 3rd person personal pronoun:
210 Geoffrey Khan
Proto- Hertevin Baz Ashitha C. Urmi J. Urmi

NENA (Bohtan) Aruntus (Tiyare)
(Hakkari)
Near deixis
ms hāḏā ʾohá ʾaha ʾawwa ʾaha ya
fs hāḏī ʾehá ʾaha ʾayya ʾaha ya
Far deixis
ms hāhū ʾawa ʾawaha ʾawaha +
ʾavva o
fs hāhī ʾaya ʾayaha ʾayaha ʾayya o
Anaphoric
ms hāhū ʾahu ʾaw ʾo +
ʾav o
fs hāhī ʾahu ʾay ʾɛ ʾay o
As can been, various innovations have taken place across the dialects. In the selec-
tion of dialects in the table, J. Urmi retains the original configuration whereby
the same pronoun expresses far deixis and anaphora (o is a contraction of *hāhū,
which in J. Urmi is of common gender). Other dialects innovated by distinguishing
the anaphoric pronoun from the far deixis pronoun by the suffixing of the deictic
element –ha to the latter, e. g. C. Urmi +ʾav (anaphoric), +ʾavva (far deixis < *ʾaw-
ha). Some dialects preserved a form of the near deixis pronoun that is derived ulti-
mately from the proto-form, e. g. Baz, Maha Xtaya ʾaha < *hāḏā. Many dialects,
however, replaced the original near deixis form by one based on the stem of the
anaphoric pronoun, by adding the deictic element –ha to this stem, e. g. Ashitha
ʾawwa (< *ʾaw-ha). It is likely that such forms originated as far deixis pronouns
(cf. C. Urmi +ʾavva), but shifted to near deixis forms when a further innovation
occurred in the far deixis forms involving the addition of a second deictic –ha
suffix, as is the case in the Ashitha dialect: ʾawaha (< *ʾaw-ha-ha). Some dialects
such as Hertevin express the distinction between the near deixis and far deixis
form by differences in the contraction of a diphthong (ʾohá ms. near deixis <
*ʾawha, ʾawa ms. far deixis < *ʾawha).
As in the NENA dialects of Iraq, many dialects can express ‘very far’ deixis by
phonological strengthening, e. g.
Hassana (C NENA Cudi)
Far deixis Very far deixis
ms ʾawáha ʾawáʾḥa
fs. ʾayáha ʾayáʾḥa
NENA dialects of the region have an enclitic copula, as in NENA dialects in other
regions (Khan, this volume, chapter 3.4, §7, chapter 4.4, §7.1). These originated
historically as enclitic pronouns, as in Ṭuroyo, but in most dialects they have
undergone morphological change resulting in their assimilation to the inflection
of presentative constructions and the inflections of verbs (Khan 2001, 2012). The

most archaic NENA dialects in this respect are on the western periphery, in which
the 1st and 2nd person copula forms are still recognizable as enclitic forms of the
independent pronouns, e. g.
Hertevin (C NENA Bohtan)
Independent Pronoun Enclitic Copula
3ms ʾahu =ile
3fs. ʾahu =ila
3pl. ʾaḥni =ini
2ms. ʾahət =ihət
2fs. ʾahat =ihat
2pl. ʾaḥniton =əḥton
1ms. ʾana =ina
1fs. ʾana =ina
1pl. ʾaḥnaḥ =əḥnaḥ
As with the NENA dialects of Iraq, most NENA dialects in the region have a deictic
copula (chapter 3.4, §7) in addition to an enclitic copula. A feature of some NENA
dialects of the region is the existence of a near deixis and a far deixis series of
copulas, e. g. in C. Urmi, where the far deixis series is restricted to the 3rd person:
Near deixis Far deixis Enclitic
3ms. dulə velə =ilə
3fs. dula vela =ila
3pl. duna vena =ina
2ms. duvət =ivət
2fs. duvat =ivat
2pl. dutun =itun
1ms. duvən =ivən
1fs. duvan =ivan
1pl. duvax =ivəx
6.2. Pronominal possessive suffixes

The pronominal suffixes of the NENA dialects of the region exhibit a similar range
of forms as is found in the NENA dialects of Iraq (chapter 3.4, §6.1). Here we shall
concentrate on the pronominal possessive suffixes in the CNA dialects and their
relationship to the NENA suffixes.
212 Geoffrey Khan
Ṭuroyo has two series of suffixes. Series I consists of the basic suffixes and series 2
consists of suffixes that are composed of the morpheme -ayð- followed by a suffix
Series 1 Series 2
3ms. -e -ayðe
3fs. -a -ayða
3pl. -ayye -aθθe
2ms. -əx -ayðəx
2fs. -ax -ayðax
2pl. -ayxu -aθxu
1fs. -i -ayði
1pl. -an -ayðan
The first series of suffixes without the augmented morpheme is used predomi-
nantly to express inalienable possession, e. g. with kinship terms and parts of the
body. The second series, with the heavier suffix, is used as a general possessive
suffix. The second series is combined with the definite article, whereas the first
series combines without the article:
ʾem-e ‘his mother’
ʾu-bayt-ayðe ‘his house’
The heavier series derives from the contraction of an originally independent pronom-
inal genitive particle with a noun < *ʾu-bayto-diðe. A similar distinction between
a light and a heavy series of possessive suffixes is found in Mlahso, in which the
inserted element in the heavy series has the form –ez- (Jastrow 1994a: 29).
In NENA dialects a genitive independent pronoun can optionally be used,
typically to express alienable possession, e. g. C. Urmi betət diyyan ‘our house’.
Whereas in NENA this construction has the status of a pragmatic strategy, in CNA
it has become grammaticalized and the genitive pronoun has become morphologi-
cally bonded with the head noun.
6.3. Inflection of nouns and adjectives

The inflection of nouns and adjectives in the NENA dialects of the region follow
the same basic patterns as the NENA dialects of Iraq (chapter 3.4, §6.3). Due to
the loss of intervocalic *ṯ in the Christian dialects of the Hakkari cluster, however,
nominal inflections that originally contained this consonant are contracted, e. g.
Baz, Maha Xtaya šə́na ‘sleep’ < *šə́nṯā
(C NENA Hakkari) baxtá ‘women’ < *baxtā́ ṯā
(Mutzafi 2000: 303) xawá ‘sisters’ < *ḥaṯwā́ ṯā
In the eastern sector of the region, the common nominal plural ending *–ē has
been analogically extended to the feminine plural ending *-āṯā, resulting in the
form –ate or –atə, e. g.
+
Mawana C. Urmi Ashitha
(Šamməsdin-Gawar) (C Tiyare)
naš-e naš-ə naš-e ‘people’
baxt-ate baxt-atə baxt-aθa ‘women’
The CNA dialects exhibit inflections of nouns and adjectives that are comparable
to NENA. This includes the analogical extension of the basic plural ending –e to
the feminine ending in some dialects:
Singular Plural
Ṭuroyo karmo ‘vineyard’ karme
(Midyat) ʾarʿo ‘field’ ʾarʿoθo (< *ʾarʿāṯā)
Ṭuroyo karmo ‘vineyard’ karme

(villages) ʾarʿo ‘field’ ʾarʿoθe
Mlaḥso karmo ‘vineyard’ karme

ʾarʿo ‘field’ ʾarʿose
Some of the Jewish dialects of northwestern Iran spoken to the south of Urmi have
borrowed a definite article suffix from Sorani Kurdish with the form –ăke, e. g.
J. Šəno bela ‘house’, belăke ‘the house’. This is used also in the trans-Zab Jewish
dialects of Iraq (chapter 3.4, §6.1) and western Iran (chapter 4.4, §6.3.1).
6.4. Nominal annexation

In Ṭuroyo the annexation of a head noun to a modifying dependent noun is expressed
by prefixing the inherited Aramaic subordinating particle d to the dependent noun,
as in earlier Aramaic (Jastrow 1985: 44–47). If the nouns of the construction are
definite, they have the definite article (see §6.1):
ʾi-barθo d-u-malko
art . fs -daughter gen - art . ms -king
‘the daughter of the king’
Another construction is used in Ṭuroyo whereby a 3ms pronominal suffix is placed
on the head noun anticipating the dependent noun:
foθ-e d-u-babo
face-his gen - art . ms .-father
‘the face of the father’
214 Geoffrey Khan
This 3ms is invariable and is used also when the dependent noun is feminine sin-
gular or plural:
foθ-e d-i-emo
face-his gen - art . fs .-mother
‘the face of the mother’
In most NENA dialects of the region the subordinating particle d is suffixed to the
head noun and devoiced, e. g.
C. Urmi
bet-ət malca
house-gen king
Such constructions may derive historically from constructions with a fossil-
ized 3ms anticipatory pronoun, as is found in Ṭuroyo, as already suggested by
Nöldeke (1868: 149):
bet-ə-t malca
house-his-gen king
The suffix may be contracted in closely-knit phrases, in particular those express-
ing kinship relations or inalienable possession:
C. Urmi
brūn malca
son king
‘the son of the king’
In the Jewish dialects on the eastern periphery an ay element is often placed
between the two nouns. The etymology of this is likely to be an Aramaic demon-
strative pronoun, but its usage was probably induced by the izafe particle of Iranian
languages in contact (Khan 2008: 176–178):
lel-ət ay-xlula
night-gen dem -wedding
‘the night of the wedding’
As in the NENA dialects of Iraq (chapter 3.4, §6.4), the genitive particle d has
become bonded with the NENA demonstrative pronouns to form genitive allo-
morphs of the pronouns (Khan 2016: 239–242):
C. Urmi
bet-ət d-o naša
house-gen gen -that man
‘the house of that man’
7. Verbal morphology
The basic principles of verbal derivational morphology and inflection in the NENA
dialects of the region are the same as those that have been described for the NENA
dialects of Iraq (chapter 3.4, §7). Here we shall concentrate on some distinctive
features of the NENA dialects of the region and on the verbal morphology of the
CNA dialects.
The three derivational stems of verbs are preserved in the Christian NENA
dialects, e. g.
Ashitha (C NENA Tiyare)
Present Base
Stem I p-θ-x paθəx ‘to open’
Stem II mb-š-l mbašəl ‘to cook’
Stem III m-pl-x mapləx ‘to use’
In some Christian NENA dialects, predominantly in the eastern sector of the
region, the m- prefix of stem II is lost, resulting in a merger of stem II with stem I
in the present base:
C. Urmi
Stem I
p-t-x ‘to open’
Present base: patəx
Past base: ptix-
Resultative participle: ptixa
Imperative: ptux
Infinitive: ptaxa
Stem II
b-š-l ‘to cook’
Present base: bašəl
Past base: bušəl
Resultative participle: bušla
Imperative: bašəl
Infinitive: bašulə
Stem III
m-pl-x ‘to use’
Present base: mapləx
Past base: mupləx
Resultative participle: mupləxxa
Imperative: mapləx
Infinitive: mapluxə
216 Geoffrey Khan
In the Jewish dialects of northwestern Iran the merger of the stems and their inter-
nal vocalic patterns is more advanced than in the Christian dialects of the region.
In J. Urmi, for example, the original stem II has been lost completely and the
vocalic patterns of the surviving two stems have been levelled.
Stem I
+
q-t-l ‘to kill’
Present base: +
qatəl-
Past base: +
qtəl-
Resultative participle: +
qtila
Imperative: +
qtul
Infinitive: +
qatole
Stem II
m-nx-p ‘to shame’
Present base: manxəp
Past base: mənxəp-
Resultative participle: mənxipa
Imperative: mənxup
Infinitive: manxope
In most of the NENA dialects across the region the resultative perfect of both tran-
sitive and intransitive verbs is expressed by combining the resultative participle
with the copula (see chapter 3.4, §7). This construction, however, is not used in
some of the dialects of the Bohtan cluster in the western periphery of the region.
Instead they express the perfect by inflections of the past base, the perfect being
distinguished from the perfective by different inflectional suffixes. In Ruma and
the related dialects described by Fox (2009) the perfective is expressed by the past
base inflected by L-suffixes and the perfect by the past base inflected by D-suf-
fixes (see chapter 3.4, §7 for these inflectional suffixes):
Perfective Perfect
Transitive: grəš-le ‘He pulled’ griš-Ø ‘He has pulled’
grəš-la ‘She pulled’ griš-a ‘She has pulled’
Intransitive: qəm-le ‘He stood up’ qim-Ø ‘He has stood up’
qəm-la ‘She stood up’ qim-a ‘She has stood up’
In Hertevin only the perfect of intransitive verbs can be expressed in this way
(Jastrow 1988: 46–59):
Perfective Perfect
Transitive: greš-le ‘He pulled’ —
greš-la ‘She pulled’
Intransitive: qem-le ‘He stood up’ qem-Ø ‘He has stood up’
qem-la ‘She stood up’ qim-a ‘She has stood up’
In such a system, in which the perfect form is less developed, the function of the tran-
sitive perfective form grəšle is extended to cover also that of the transitive perfect.
In the Jewish dialects of northwestern Iran on the eastern periphery of the
region one finds a system similar to that of Hertevin with a perfect constructed with
S-suffixes on the past base of intransitive verbs. The perfect of transitive verbs,
however, is expressed by a resultative participle with a copula (Khan 2008: 70–80):
Perfective Perfect
Transitive: grəš-le ‘He pulled’ grišé ‘He has pulled’
(< *griša + ile)
grəš-la ‘She pulled’ grəštá ‘She has pulled’
(< *grəšta + ila)
Intransitive: qəm-le ‘He stood up’ qim-Ø ‘He has stood up’
qəm-la ‘She stood up’ qim-a ‘She has stood up’
In many NENA dialects of the region transitive perfective verbs express definite
objects by absolutive nominative D-suffixes, which occur before the L-suffixes,
e. g.
C. Ashitha (Borghero 2005)
griš-a-le griš-i-le
pull.pfv - d .3fs - l .3ms pull.pfv - d .3pl - l .3ms
‘he pulled her’ ‘he pulled them’
In such dialects the D-suffix agrees with definite nominal objects in the clause, as
in the NENA dialects Iraq (chapter 3.4, §7).
Some dialects do not allow any absolutive pronominal objects at all and express
all pronominal objects with accusative L-suffixes. This applies, for example, to the
Ruma dialect (C NENA Bohtan) (Fox 2009: 53). In some dialects that have this pro-
nominal object-marking strategy, adjustments are made to the form of the L-suffix
that marks the subject. A noteworthy adjustment of this kind has been documented
in the Hertevin dialect (C NENA Bohtan) by Jastrow (1988: 61). In this dialect
the L-suffixes remain unadjusted if the subject is third person but are adjusted
by giving them the ending of D-suffixes if the subject is 1st or 2nd person, e. g.
C Hertevin
(a) Perfective stem + L-suffix (A)
ḥze-le ‘he saw’
ḥze-lox ‘you (ms) saw’
ḥze-li ‘I saw’
(b) Perfective stem + L-suffix (A) + L-suffix (O)
ḥze-le-li ‘he saw me’
ḥze-let-ti (< ḥze-let-li, cf 2ms D-suffix –et) ‘you saw me’
ḥze-len-ne (< ḥze-len-le, cf 1ms D-suffix –en) ‘I saw him’
218 Geoffrey Khan
The result is the conversion of the 1st and 2nd person L-suffixes that express the
agentive subject (A) to suffixes that resemble the form of nominative D-suffixes.4
In most NENA dialects of the region the progressive is expressed by con-
structions consisting of the infinitive combined with the copula, which appears
to have developed under the influence of the model of progressive forms based
on the infinitive in Eastern Armenian. This construction is not used, however, in
some of the dialects of the Bohtan cluster on the western periphery, e. g. Ruma and
Hertevin. In these dialects inflections of the present base are used to express the
progressive (Fox 2009: 31, 55; Jastrow 1988: 54).
As in the NENA dialects of northern Iraq, in the NENA dialects of the region
preverbal particles are attached to the present base when it expresses indicative
realis mood. In most dialects of south-eastern Turkey this is ʾi- or y-. In some
dialects the prefix is elided in verbs beginning with a strong consonant but is gen-
erally preserved before verbs beginning with a vowel, e. g.
C. Ruma (Bohtan) (Fox 2009: 47)
y-oxəl ‘he eats’ (realis) oxəl ‘may he eat’ (irrealis)
In dialects in which the progressive is expressed by a construction based on the
infinitive, present forms with such a prefix express the habitual. In dialects that
do not use such a progressive construction, the form with the prefix may express
both the habitual and the progressive. In the NENA dialects of southeastern Turkey
there are traces of a k- prefix, originally, it seems, combined with the prefix ʾi, e. g.
Mazṛa (Txuma) k-i-ʾodax ‘we do’. In the Upper Tiyari dialects this has become
restricted to a few lexical verbs and palatalized to č-, e. g. čiʾin ‘I know’ (< *k-i-
yāḏʿən) (Talay 2008b: 306). In the NENA dialects of northwestern Iran, the k-
element has been preserved to a greater extent. In C. Urmi the indicative prefix
has the form ci- (< k + i), e. g. ci-patəx ‘he opens’. In C. Urmi an original velar *k
has been fronted to the palatal /c/. J. Urmi has preserved k- before verbs beginning
with weak consonants, e. g. k-xəl ‘he eats’ (< k + axəl).
The future is expressed throughout the NENA dialects of the region by a prefix
particle deriving from the volitive expression *bāʿē d- ‘wants to’. This has the
form bəd-, which is often contracted to b- or, before vowels, p̂ ṱ- ṱ.5 In J. Urmi the
future may be expressed also by gbe, the indicative 3ms form of ‘to want’, e. g. gbe
garšen ‘I shall pull’ (Khan 2008: 76).
Some dialects of the region use the particle de- before an imperative to give
immediacy and intensity to the command, e. g. Ashitha (C. Tyare) de-pluṭ! ‘go
out!’ (Borghero 2005: 126).
4
For futher details of the various constructions with transitive verbs in the NENA dia-
lects of the region see Khan (2017).
5
The sound transcribed as /ṱ/ is an unaspirated unvoiced alveolar stop. A /t/ without a
diacritic represents an aspirated unvoiced alveolar stop.
The CNA dialects are similar to the NENA dialects in that their verbal system
is largely constructed on bases that were active and passive participles of earlier
Aramaic, inflected with suffixes corresponding to the D-suffixes and L-suffixes of
NENA. The CNA dialects, however, exhibit a number features that are distinctive
from NENA and these will be the main focus of the following discussion.
As in most NENA dialects Ṭuroyo has three derivational stems, e. g.
Stem I n-š-q ‘to kiss’
Present base: nóšəq-
Stem II mḥ-l-q ‘to throw’
Present base: mḥáləq-
Stem III m-nš-f ‘to cause to dry’
Present base: mánšəf
Unlike NENA, however, Ṭuroyo also has a passive base for each stem:
Stem I Passive f-t-ḥ ‘to open’
Present base: mə́ftəḥ-
Stem II Passive mḥ-l-q ‘to throw’
Present base: miḥā́ ləq-
Stem III Passive m-nš-f ‘to cause to dry’
Present base: mitánšəf-
These passive bases are derived historically from passive and middle derivational
patterns that contained a t prefix in earlier Aramaic (cf. the so-called ʾeṯpeʿel,
ʾeṯpaʿʿal and ʾettap̄ ʿal forms in Syriac). The t is preserved only in stem III.
The past bases are inflected with L-suffixes and D-suffixes. The forms of these
in Ṭuoryo are as follows (Jastrow 1985, 128–129):
L-suffixes D-suffixes
3ms. -le -Ø
3fs. -la -o
3pl. -ːe (after consonant),6 -i
-lle (after vowel))
2ms. -ləx -ət
2fs. -lax -at
2pl. -xu (after consonant) -utu
-lxu (after vowel)
1ms -li -no
1fs. -li -ono
1pl. -lan -ina
6
I. e. the final consonant of the verbal base is geminated.
220 Geoffrey Khan
The L-suffixes are used with transitive verbs and the D-suffixes with intransitive
and passive verbs. The distribution of the suffixes, therefore, is based on transi-
tivity and voice. Intransitive verbs nearly all belong to stem I, where they have a
past base that is distinct in form from the past base of active stem I verbs. The two
bases are derived historically from different participle forms of earlier Aramaic
(transitive < the passive participle *CCīC and intransitive < the verbal adjective
*CaCCīC). The category of intransitive includes unaccusative verbs and also
verbs with non-agentive experiencer subjects (e. g. ‘to hear’, ‘to know’) (Furman
and Loesev 2015):
Stem I Active transitive
Past base: nšə́q-le ‘He kissed’
nšə́q-la ‘She kissed’
Stem I Intransitive
Past base: dáməx-Ø ‘He slept’
damíx-o ‘She slept’
The dialect of Mlaḥso has a system of derivational stems that is similar to that of
Ṭuroyo but Mlaḥso exhibits significant differences from Ṭuroyo in the distribution
of the inflectional suffixes that are added to past bases. In Mlaḥso the distribution
of the suffixes correlates with verbal aspect rather than transitivity and voice. The
D-suffixes are used on past bases to express the resultative perfect of both intran-
sitive and transitive verbs. This construction is attested only in stem I on past bases
equivalent to the intransitive past base of Ṭuroyo (Jastrow 1994a: 33, 66):
n-f-q (stem I) ‘to go out’
nafiq-Ø ‘He has gone out’
nafiq-o ‘She has gone out’
s-y-m (stem I) ‘to do’
saym-Ø ‘He has made’
saym-o ‘She has made’
L-suffixes are used on transitive, intransitive and passive verbs to express a per-
fective aspect. This configuration of D-suffixes and L-suffixes has parallels with
the verbal system of some NENA dialects in the Bohtan cluster. Neither Ṭuroyo
nor Mlaḥso have constructions consisting of the resultative participle + copula or
infinitive + copula to express the resultative perfect and progressive respectively.
This differs from most of the NENA dialects of the region, but, again, coincides
with the verbal system of dialects in the NENA Bohtan cluster.
Ṭuroyo expresses 3rd person pronominal objects of transitive verbs by absolu-
tive D-suffixes as in many NENA dialects, e. g.
grəš-Ø-li
pull.pst - d .3 ms - l .1 s
‘I pulled him’
griš-o-li
pull.pst - d .3 fs - l .1 s
‘I pulled her’
griš-i-li
pull.pst -d .3 pl - l .1 s
‘I pulled them’
The profile of Ṭuroyo with regard to pronominal object-marking is, therefore,
similar to that of many NENA dialects. Mlaḥso, by contrast, does not allow any
absolutive pronominal objects and all pronominal objects are expressed by accusa-
tive L-suffixes or separate accusative prepositional phrases (Jastrow 1994a: 54–56).
A point of difference between Ṭuroyo and NENA is that in transitive clauses
verbs do not have pronominal suffixes agreeing with a definite object argument
(Hemmauer and Waltisberg 2006: 32, 35) and as a result there is no absolutive
agreement in verbs derived from past bases:
g-nəšq-o ʾu-zlām
fut -kiss.prs -d .3FS art . ms -man
‘She will kiss the man’
nšəq-le ʾi-aθto
kiss.pst - l .3 ms art . fs -woman
‘He kissed the woman’
Ṭuroyo expresses the progressive and perfect by combining prefixes with the
present and past base respectively, e. g. kal-ko-nošəq ‘he is kissing’, ko-qayəm-Ø
‘he has arisen’ (Jastrow 1985: 146–150). The future is expressed in Ṭuroyo by
attaching to the present base a preverbal particle with various allomorphs (g,
k, gd, kt), all of which derived historically from a particle with the form *gəd
(Jastrow 1985: 148–149). This can also express habitual aspect. In Mlaḥso the
future particle has the form d-, which is likely to be related (Jastrow 1994a: 52).
The particle *gəd may be compared to the particle ket- which is used in the Herte-
vin (C NENA Bohtan) with the sense of ‘to be able’.
A distinctive feature of Ṭuroyo that is not found in the NENA dialects of the
region is the construction used to express the two pronominal objects of ditransi-
tive verbs. In these constructions the pronominal indirect object is expressed by an
L-suffix and the pronominal direct object is expressed by an enclitic pronoun that
has the same form as the copula. This expression of the pronominal direct object
is restricted to the 3rd person:
222 Geoffrey Khan
Ṭuroyo (Jastrow 1985: 142)
gd-oba-l-ləx=yo
fut -give.prs - d .1fs -l .2 ms =3s
‘I shall give him/her/it to you’
Parallel constructions are found in some of the Christian NENA dialects of the
Mosul plain (chapter 3.4, §7).
8. Syntax of basic clauses
8.1. Copula clauses

The syntax of copula clauses in NENA has been described in chapter 3.4., §8.1.
Here we shall only make the observation that, as in the NENA dialects of Iraq, the
moveability of the copula from its default position on the predicate differs across
the dialects. The general observation to be made is that in the Christian NENA
dialects of the region the copula may be moved and placed after constituents other
than the predicate in order to express the prominence of the constituent. In the
Jewish dialects of northwestern Iran, however, the copula is fixed on the predicate,
as is the case with the trans-Zab Jewish dialects of Iraq:
(1) J. Urmi (Khan 2008: 316)7
dad-óx +
tajə̀r=ile?| amon-ì +
tajə́r=ile| lá
father-your merchant=cop .3 ms uncle-my merchant=cop .3 ms NEG
dad-ì.|
uncle-my
Contrast C. Urmi, where such movement is possible (Khan 2016, vol. 2: 296):
(2) Yósəp +xòr-an=ilə.| là,| Yònan=ilə +
xór-an.|
Yosəp friend-our=cop .3ms neg Yonan=cop .3 ms friend-our
‘Yosəp is our friend.’ ‘No, Yònan is our friend.’
The movement of the copula from the predicate position to mark the prominence
of other constituents in a clause is possible also in Ṭuroyo (3a–b) on the western
periphery of the region:
7
In these examples intonation group boundaries are marked by |, the nuclear stress of the
intonation group is marked by a grave accent and non-nuclear stress is marked by acute
accents.
(3) a. aḥún-ux hàrke=yo.|

brother-your.ms here=cop .3 s
My brother is here.
b. lò,| ḥòθ-i=yo hárke,| ló aḥùn-i.|
No sister-my=cop .3 s here not brother-my
‘No, my sister is here, not my brother.’
For other contexts where the copula is moved in Ṭuroyo see Waltisberg
(2016: 97–99).
In the NENA dialects the copula is negated by prefixing to it the negative
particle la, e. g. C. Ashitha lɛle ‘he is not’ (< la + ile). In the Jewish dialects of
northwestern Iran, the stem of the copula is different when negated, e. g. J. Urmi
ile ‘he/it is’, lewe ‘he/it is not’. In Ṭuroyo the negator of verbs is lo, but this is not
used with the enclitic pronominal copula. Rather the negative copula is expressed
by an alternative paradigm consisting of the negative existential particle lat and
the enclitic copula, e. g. –yo ‘he/she/it is’, latyo ‘he/she/it is not’.
8.2. Verbal clauses

The description of the ordering of constituents in the verbal clauses of the NENA
dialects of Iraq (chapter 3.4, §8.2) apply also to the NENA dialects of the region with
which this chapter is concerned. As in Iraq, the main typological split is between
the Jewish trans-Zab dialects in northwestern Iran and the Christian dialects. The
Jewish trans-Zab dialects of the region exhibit the same type of word order as
those of Iraq, the most distinctive feature of which is the regular placement of the
direct object before the verb. The Christian NENA dialects of the region exhibit
a more flexible word order with the placement of the direct object after the verb
being the unmarked order, as is the case with the Christian NENA dialects of Iraq.
Flexible order with the unmarked positioning of the object after the verb is also
the typology of the ordering of constituents in Ṭuroyo (Waltisberg 2016: 249–254).
In Ṭuroyo verbs are negated by the particle lo, which contracts to lə in closed
syllables. It is placed before all preverbal particles, including the future/habitual
particle (Jastrow 1992: 29):
(4) lo ko-doʿər-Ø
neg ind -return.prs - d .3 ms
‘He is not returning’
(5) lə-g-šoməʿ-Ø
neg - fut -hear. prs - d .3 ms
‘He will not her’/‘He does not hear (habitual)’
In Ṭuroyo the negator cannot be used with the imperative. A prohibition is expressed
by negating the irrealis present, e. g. ḥər! ‘Look’, ló-ḥayrət! ‘Do not look (ms.)!’
224 Geoffrey Khan
In the NENA dialects of the region verbs are negated by the particle la, e. g.
(6) C. Ashitha (Tiyare)
la-šate-Ø
neg -drink.prs - d .3 ms
‘He does not drink’.
In most NENA dialects the future particle cannot be used with the negator, so a
negated future is identical to the present habitual, e. g.
(7) C. Ashitha (Tiyare) (Borghero 2005: 129)
la-šate-Ø
‘He will not drink’.
In dialects that express the habitual by a preverbal particle ʾi-/y, the sequence la +
y contracts to le, e. g.
(8) C. Halmun (Tiyare) (Talay 2008b: 329)
le-šate-Ø
In C. Urmi in northwestern Iran, the negation of the habitual prefix ci- is le,
e. g. ci-šatə ‘he drinks (habitual)’, le-šatə ‘he does not drink/he will not drink’
(Khan 2016, vol. 1: 286).
In the J. Urmi dialect the future particle b- may optionally be retained in nega-
tive clauses and is placed before the negator, e. g.
(9) J. Urmi (Khan 2008: 85)
b-la šate-Ø
fut - neg drink.prs - d .3 ms
In NENA dialects on the western periphery of the region, in the Bohtan and Cudi
clusters, the negator cannot be used with an imperative form and prohibitions are
expressed by combining the negator with the irrealis present, e. g.
(10) C. Umra (Bohtan) (Hobrack 2000: 78)
la-maḥk-ütən!
neg -speak.prs - d .2 pl
‘Do not speak (pl.)!’
In dialects in the eastern sector of the region the negator is used with the impera-
tive, e. g.
(11) J. Urmi (Khan 2008: 85)

la-+qtul
neg -kill.imp
‘Do not kill!’
9. Syntax of clause linkage
9.1. Coordinating linkage

The means of coordinating linkage of main clauses in the NENA dialects of the
region corresponds to the typology of the dialects of Iraq (chapter 3.4, §9.1). It
may be asyndetic or marked by a particle. When marked by a particle, this is gen-
erally either (i) a clausal connective particle or (ii) a particle marking topical prom-
inence. Examples with these two types of particle from C. Urmi are as follows:
(12) a. ʾannə miyya +
maxdur-e=na +
ʾallu
They water(pl.) pour. prog - poss .3 pl = cop .3 pl on.him
ʾu-bərrak̭=əna.
and-run.prog = cop .3 pl
‘They pour the water over him and run away.’
(Khan 2016, vol. 2: 401–404)
b. malca +ʾavva bəxšal-u=lə baxt-u=da
king that pound.prog - poss .3 ms = cop .3 ms wife-his=top
laxma marcux-u=la.
bread soften.prog - poss .3 ms = cop .3 fs
‘The king pounds that (garlic) and his wife softens the bread.’
(Khan 2016, vol. 2: 405–412)
Asyndetic linkage is reported to be the norm in some dialects on the western
periphery of NENA in the Bohtan cluster (Fox 2009: 110).
Ṭuroyo links coordinated clauses either asyndetically or with a connective par-
ticle (Waltisberg 2016: 261–268).
9.2. Subordinating linkage

The general subordinating particle d of earlier Aramaic is widely used in Ṭuroyo
in relative clauses, clauses that are complements of verbs and clauses that are
complements of adverbials:
(13) ʾi-barθo d-rḥim-o-li
the-girl rel -love.pst -d .3 fs - l .1 s
‘the girl that I have fallen in love with’
(Jastrow 1985: 49)
226 Geoffrey Khan
(14) ko-baʿ-no d-oðaʿ-no

ind -want.prs -d .1 s comp -know.prs - d .1 s
‘I want to know’
(Waltisberg 2016: 287)
(15) ḥzeli d-nafəq-Ø tarte-kurfe šaʿuθe ġălăbe
see.pst -l .1 s comp -leave.pst -d .3 ms two-snakes yellow very
šafire
beautiful
‘I saw that two very beautiful yellow snakes came out.’
(16) ʾi-naqqa d-ko-nəfq-o … Faṭəme … ko-nəfq-o
the-time comp -ind -leave.prs - d .3 fs Fatima ind -leave.prs -3 fs
‘When it (the army) goes out, Fatima goes out.’
Asyndetic linkage is also used in these contexts, especially between a main verb
and an irrealis complement clause (Waltisberg 2016: 280–283). In some cases
the d in Ṭuroyo is replaced by subordinating particles from languages in contact
such as ki from Kurdish and, more commonly, ʾanno and ʾənnahu from Arabic
(Waltisberg 2016: 290).
In the NENA dialects of the region the Aramaic subordinating particle d in rel-
ative and adverbial clauses generally has the form of a suffix -ət on the head item:
(17) Ashitha (C NENA Tiyare) (Borghero 2005: 366)
yarx-ət ʾaθe-Ø
month-rel come. prs - d .3ms
‘The month that is coming (i. e. next month)’
(18) C. Urmi (Khan 2016, vol. 2: 497)
ʾe-+dan-ət zamər-Ø-va ʾana +
šamm-ən-va
that-time-rel sing.prs - d .3 ms - pst I sing.prs - d .1 ms - pst
‘When he sang, I listened.’
In some dialects the original Aramaic subordinator is frequently replaced by inno-
vative subordinating particles or particles taken from languages in contact. This
process is particularly advanced in the dialects of northwestern Iran. In C. Urmi,
for example, the particle k̭ at, which derives historically form k̭ a ‘to’ + d, originally
expressing purpose, is now used to introduce direct complement clauses and rel-
ative clauses. This may be in phonetic imitation of the Kurdish subordinator ka:
(19) ʾe baxta k̭ at pyay=əla laxma

that woman rel bake.prog = cop .3 fs bread
bədray-u=la +
ʾal-de-mazrak̭
put.prog - poss .3 ms = cop .3 fs on-that-cushion
‘The woman who is baking bread puts it on the baking cushion.’
(Khan 2016, vol. 2: 454)
(20) +
bədday=ələ k̭ at +ʾAxiqar mət-lə.
knows.prog = cop .3 ms that Axiqar die.pst .- l .3 ms
‘He knows that Axiqar died.’
(Khan 2016, vol. 2: 487)
In J. Urmi the Iranian particle ki frequently replaces original d in these contexts,
e. g.
(21) une naše ki-la-ayol=u mar-ay-i.
those people rel - neg -know.prog = cop .3 pl deontic -know.prs - d .3 pl
‘Those people who do not know, let them know.’
(Khan 2008: 355–359)
(22) mar=e ki-+ms-e brat-ox
say.prog = cop .3 ms com -can.prs - d .3 ms daughter-poss .2 ms
mamqe-Ø-la.
cause.speak-d .3 ms - l .3 fs
‘He says that he can make your daughter speak.’
(Khan 2008: 366)
In some dialects a subordinating particle is omitted in some constructions. This
is the norm across CNA and NENA dialects of the region in direct complement
clauses of verbs (irrealis and realis) and relative clauses.
In most NENA dialects of the region the protasis of conditional clauses is
introduced by the original Aramaic conditional particle ʾən:
(23) C. Urmi (Khan 2016: 516–523)
ʾən-ʾavə-Ø xaya, palɟ-ət malcuyt-i
if-be.irr - d .3 ms alive half-gen kingdom-poss .1 s
b-yavv-ən-na k̭ at-ux
fut -give.prs .-d 1 ms -l .3 fs to-poss .2 ms
‘If he is alive, I shall give you half of my kingdom.’
In the Jewish dialects in the northwestern Iran, the conditional particle is regularly
replaced by the Iranian particle agar:
228 Geoffrey Khan
(24) J. Urmi (Khan 2008: 378)

agar ād mənn-i xoiš ode-t, ana
if you from-poss .1 s request make.irr - d .2 ms I
g-de-n
ind -come-d .1 s
‘If you ask me, I shall come’
Also in Ṭuroyo, on the western periphery of the region, the Aramaic particle is
replaced by a particle loaned from a contact language, either Arabic (e. g. ʾənkan,
ʾiðakan) or Kurdish (ʾagar) (Waltisberg 2016: 318–341):
(25) Ṭuroyo
w-iðakan ʾu-zlam-ano hawxa=yo, hano latyo
and-if the-man-this thus=cop .3 ms this cop . neg .3 ms
nošo
man
‘and if this man is thus, he is not a man.’
10. Concluding remarks
In the region of southeastern Turkey and northwestern Iran the major linguis-
tic boundary in Neo-Aramaic is between the CNA and NENA subgroups at the
Tigris river. CNA, especially Ṭuroyo, is more archaic than the NENA dialects in
some aspects of its phonology and morphology. Some of the NENA dialects in the
Bohtan cluster on the western periphery of the NENA area exhibit more parallels
with CNA than NENA dialects elsewhere. These features are likely to have arisen
by convergence rather than a common ancestry, since the parallels are often only
partial (e. g. the distribution of the *ā > o shift in Ruma). Within the NENA dia-
lects of the region there is considerable diversity. The Tiyare cluster on the whole
is the most archaic in its phonology and morphology. The dialects in the eastern
sector of the region are more innovative. The most innovative is the cluster of
Jewish dialects in northwestern Iran.
The lexicon of the CNA and NENA dialects contain many loanwords from the
languages in contact. Kurdish has had particular impact on the lexicon in all dia-
lects. In CNA there are also numerous loanwords from Turkish and the local Arabic
dialects. The Turkish influence on the lexicon intensified in the twentieth century
(Tezel 2015). In the NENA dialects of northwestern Iran there are many loanwords
from Azeri Turkish and Persian. These, however, appear to be a relatively recent
acquisition, since the majority of them are not morphologically integrated into
dialects, unlike Kurdish loanwords, which are generally morphologically adapted
and represent an older lexical layer in the dialects. In the dialects of northwestern
Iran calques of Iranian and Azeri phrasal verbs containing a noun and a light verb
are particularly common, e. g. C. Urmi sohbat ʾavəd ‘he converses’ (literally: he
makes conversation; cf. Persian sohbat kardan), +k̭ aravul ɟarəš ‘he guards’ (liter-
ally: to pull guard; cf. Azer. qarovul çəkmək) (Khan 2016, vol. 1: 457–458).
11. Glossed texts
11.1. Ṭuroyo
A dream (Jastrow 1985, 264–265)
The transcription and marking of prosody is taken from Jastrow’s presentation of
this text.
(26) fa b-ú-zabn-ā́ no – kə́t-wa ḥðó ʾə́šm-a faṭə́me
and at-dem -time-this there.is-pst one.fs name-her Faṭima
kə́t-way-la nxiróyo ʾə́šm-e génč-xalìl-aġa.
there.is-pst - l .3 fs lover name-poss .3 ms Genč-Xalil-Aġa
At that time there was a woman whose name was Faṭima. She had a lover
whose name was Genč Xalil Aġa.
(27) fa-hā́ no ʾí-ʿaskar d-tərkíya mbá-lle
and-this def -army gen -Turkey take.away.pst - l .3 pl
zbə́ṭ-ṭe maḥát-te b-í-ʿaskarìye ʾu
seize.pst - l .3 pl place.pst - l .3 pl in-def -army and
mbá-lle.
take.away.pst - l .3 pl
The Turkish army took this man away. They seized him, they put him in the
army and took him away.
(28) mbá-lle l-í-ʿaskarìye ʾu fayíš-o
take.away.pst - l .3 pl to-def -army and remain.pst - d .3 fs
faṭə́me táne ko-ḥə́lm-o b-lályo
Faṭima alone ind -dream.prs .- d .3 fs at-night
ʾu b-imómo b-ú-nxiroy-áyða
and in-day about-def -lover-poss .3 fs
They took him to the army and Faṭima remained alone. She dreamt night
and day of her lover.
230 Geoffrey Khan
(29) yáwmo m-áy-yawme qayím-o mə́l-la

day from-def -days stand.pst - d .3 fs say.pst - l .3 fs
yā́ -de ʾəmm-o mə́n=yo? ʾəmm-o
voc -mother say.prs - d .3 fs what=cop .3 s say.prs - d .3 fs
b-rámšəl ḥzé-li ḥə́lmo ʾəmm-o bárθ-i
at-last.night see.pst - l .1 s dream say.prs - d .3 fs daughter-my
ʾu-ḥəlm-áyðax hòw-e xér ʾamál-li
def -dream-poss .2 fs be.prs - d .3 ms good say.imp - l 1 s
mə́n-ḥze-lax?
what-see.pst - l .2 fs
One day she said ‘Mother!’ She says ‘What is it?’ She says ‘Last night I
dreamt a dream.’ She says ‘My daughter, your dream may be good. Tell
me, what did you dream?’
(30) ʾə̀mm-o yā́ -de ḥzé-li ʿáyno ftíḥ-o
say.prs - d .3 fs voc -mother see.pst - l 1 s spring open. pass - d .3 fs
b-í-dərt-áyðan ʾu ʿam-gab-è-d-í-ʕayn-ā́ θe
in-det -courtyard-poss .1 ps and with-by-poss .3 ms - gen - det -spring-this
kə́t-wa dáwmo d-ḥabúšo ʾu-b-í-dawm-ā́ θe kə́t-wa
there.is-pst tree gen -apple and-in-det -tree-this there.is-pst
tré kə́rfe kóme mʿálq-e ʾu b-í-dawm-ā́ θe
two snakes black hung.ptcp - pl and in-det -tree-this
yā́ -de kə́t-wa tré ḥabúše ʾu kə́t-wa
voc -mother there.is-pst two apples and there.is-pst
tré-kəkwe me-d-í-šmayo b-í-dawm-ā́ θe
two-stars from-gen - det -sky in-det -tree-this
She says “Mother, I saw a spring that opened up in our courtyard and by
this spring there was an apple tree, and in this tree two black snakes were
hanging, and in this tree, mother, there were two apples, and there were two
stars from the sky in this tree.’
11.2. North-Eastern Neo-Aramaic

C. Urmi dialect
The loan of a cooking pot (Khan 2016, vol. 4, 66)
An acute accent (v́ ) marks non-nuclear stress. A grave accent (v̀ ) marks the nuclear
stress of an intonation group. A vertical line (|) marks an intonation group bound-
ary.
(31) xá-yuma +
málla +
Nasràdən| bərrə̀xš=ələ| bəšk̭ál=ələ
one-day mullah Nasradin go.prog = cop .3 ms take.prog = cop .3 ms
+
k̭ usárta déna mən-švàv-u.| màr=ǝlə|
cooking.pot loan from-neighbour-poss .3 ms say.prog = cop .3 ms
hál-li xá-dana +
k̭ usàrta| +
báyy-ən
give.imp = l .1 s one-single cooking.pot want. prs - d .1 ms
bášl-ən ɟáv-o bušàla.| +k̭ usárta| +
ɟúrta
cook . irr - d .1 ms in-poss .3 fs stew cooking.pot big
lə̀t-li.| bəšk̭ál-o=lə
there.is.no-l .1 s take.prog - poss .3 fs = cop .3 ms
mày-o=lə +
k̭ usárta| bušála bašùl=ələ,|
bring.prog - poss .3 fs = cop .3 ms cooking.pot stew cook.prog = cop .3 ms
labùl-o=lə, |
yáv-o=lə mə̀drə| k̭ à|
take.prog - poss .3 fs = cop .3 ms give.prog - poss .3 fs = cop .3 ms again to
švàva.| ʾína tré +k̭ usaryay-sùrə| mattúy=əl
neighbour but two pots-small. pl put.prog = cop .3 ms
ɟàv-o.|
in-poss .3 fs
One day mullah Nasradin goes and takes a cooking pot as a loan from
his neighbour. He says ‘Give me a pot, I want to cook stew in it. I do not
have a big pot.’ He takes the pot and brings it back, cooks stew, takes it
and gives it back to the neighbour, but he puts two pots in it.
(32) švàva| màr=ǝlə| ʾáha tré +k̭ usaryàtə| k̭ àm
neighbour say.prog = cop .3 ms this two cooking.pots why
muyy-é=vət?| mə̀r-rə| +
k̭ usárt-ət
bring.ptcp . ms - poss .3 pl = cop .2 ms say.pst - l .3 ms cooking.pot-gen
dìyyux| də̀l-la| tré xínə mə̀nn-o.|
gen .2 ms give.birth.pst - l .3 fs two others with-poss .3 fs
yáv=əl k̭ àt-u| ʾáv-ət basìma,|
give.prog = cop .3 ms to-poss .3 ms be.irr - d .2 ms pleasant
bitày=ələ.|
come.prog = cop .3 ms
The neighbour says ‘This is two pots, why have you brought them?’ He
said ‘Your pot has given birth, two others are with it.’ He gives this to
him ‘Thank you’ and comes back.
(33) ʾé-šabta xìta| +málla +Nasrádən bərrə́xš=əl mə̀drə.|
that-week other mullah Nasradin go.prog = cop .3 ms again
màr=ələ| +
maxlèta,| xa-+k̭ usárta buš-+ɟùrta
say.prog = cop .3 ms pardon one-cooking.pot more-big.fs
+
byáy=əvən.| +
málla +Nasràdən| +ʾáyn-u
want.prog = cop .1 ms mullah Nasradin eye-poss .3 ms
pə́lt=əva +
ʾal-xa- k̭ usart-ət
+
švàvə.| nàk̭ša
fall.ptcp . fs = cop . pst .3 s on-one-cooking.pot-gen neighbours plan
232 Geoffrey Khan
ɟríš=əva| +
bayyí-va ʾày mayyí-va-la|
draw.ptcp . ms = cop . pst .3 s want. prs .3 ms - pst that.3fs bring. prs - pst - l .3 fs
k̭ à-de| +
k̭ usárt-u yuvv-à-lə| tré-xinə=da
for-that cook.pot-poss .3 ms give.pst - d .3 fs - l .3 ms two-other.pl =also
súrə mə̀nn-o.| xə́š-lə mə̀r-rə|
small.pl with-poss .3 fs go.pst - l .3 ms say.pst - l .3 ms
+
byáy=ən xa-+k̭ usárta +
ɟùrta,| k̭ át ʾə́t-li
want.prog = cop .1 ms one-cooking.pot big.fs comp there.is-l 1 s
ʾàrxə| bàšl-ən| xùrrac ɟáv-o.|
guests cook.irr - d .1 ms food in-poss .3 fs
The next week mullah Nasradin goes again. He says ‘Excuse me, I want a
bigger pot. The eye of mullah Nasradin had fallen on one of the pots of the
neighbour. He had made a plan, since he wanted to bring back that one. For
that (reason) he gave his pot as well as two others with it. He went and said
‘I want a big pot to cook food in—I have guests.
(34) yáv-o=lə ʾe-+k̭ usárta buš-šap̂ ə́rta
give.prog - poss .3 fs = cop .3 ms that.fs -cooking.pot most-beautiful.fs
+
ɟùrta| k̭ á +málla +Nasràdən.| +málla +
Nasrádən
big.fs to mullah Nasradin mullah Nasradin
mày-o=lə| mattúy-o=lə bèta.|
bring.prog - poss .3 fs = cop .3 ms put.prog - poss .3 fs = cop .3 ms house
šváva bəxzày=ələ| xà-yuma,| trè-yumə,| xà-šabta,|
neighbour see.prog = cop .3 ms one-day two-days one-week
trè-šabay,| xà-yarxa,| trè-yarxə| +
k̭ usárta lìtən.|
two-weeks one-month two-months cooking.pot there.is.not
ba-dàx ví-la ʾa-+k̭ usárta?| mù ví-la
but-how be.pst - l .3 fs this-cooking.pot what be.pst - l .3 fs
+
k̭ usárta?|
cooking.pot
He gives the biggest and most beautiful pot to mullah Nasradin. Mullah
Nasradin brings it back and puts it in the house. The neighbour sees that
one day, two days, one week, two weeks, one month, two months (pass)
but there is no pot. ‘But what has become of the pot? What has become
of the pot?’
(35) bərrə́xš=ələ +
ʾal-+tárr-ət +màlla,| mára +
màlla
go.prog = cop .3 ms to-door-gen mullah say.prog mullah
ṱ-ávə-t basìma| +k̭ usárta mùyy-o=vət
fut -be.irr - d .2 ms well cooking.pot bring.ptcp - poss .3 fs = cop .2 ms
ʾátən| cmá-+dana k̭ àm ʾadíyya,| +bayy-ə̀n-na.| mùt
you some-time before now want.prs - d .1 ms - l .3 fs what
+
k̭ usárta?| mə́rrə +
k̭ usárta ʾat-muyy-á-lux
cooking.pot said.pst - l .3 ms pot you-bring.pst - d .3 fs - l .2 ms
m-cə́s-lan k̭ at-bašl-ə́t-va xùrrac ɟáv-o.| ʾà|
from-with-l .1 pl comp -cook.irr - d .2 ms - pst food in-poss .3 fs this
mə̀rrə, | +
k̭ usàrta, | +
k̭ usárta mə̀t-la. |
mə́r-rə
say.pst - l .3 ms cooking.pot cooking.pot die.pst - l .3 fs say.pst - l .3 ms
ba-dàx c-óy-a mə́t-la?!| mə́rrə bas-dàx
but-how ind -be. prs - d .3 fs die.pst - l .3 fs say.pst - l .3 ms but-how
c-óy-a +
k̭ usárta yàdl-a?!|
ind -be. prs - d .3 fs cooking.pot give.birth.irr - d .3 fs
He goes to the door of the mullah and says ‘Mullah, if you please, I want a
pot that you took some time ago.’ ‘What pot?’ He said ‘A pot that you took
from me to use to cook food in.’ He said ‘Ah, the pot, the pot has died.’ He
said ‘How could it have died?!’ He said ‘How could a pot give birth?!’
Abbreviations
Abbreviations not found in the Leipzig glossing conventions include:

d = D-suffix (i. e. direct, nominative pronominal suffix)
l = L-suffix (i. e. oblique pronominal suffix)
References
Borghero, Roberta. 2005. The Neo-Aramaic dialect of Ashitha. Cambridge: University of

Cambridge, unpublished PhD thesis.
Duval, Rubens. 1883. Les dialectes néo-araméens de Salamâs: textes sur l’état actuel de
la perse et contes populaires publiés avec une traduction française. Paris: F. Vieweg.
Fassberg, Steven. 2010. The Jewish Neo-Aramaic dialect of Challa. Leiden & Boston:
Brill.
Fox, Samuel E. 1997. The Neo-Aramaic dialect of Jilu. Semitica Viva, Bd. 16. Wiesbaden:
Harrassowitz.
Fox, Samuel E. 2002. A Neo-Aramaic dialect of Bohtan. In Werner Arnold & Hartmut Bob-
zin (eds.), Sprich doch mit deinen Knechten Aramäisch, wir verstehen es! 60 Beiträge
zur Semitistik. Festschrift Für Otto Jastrow zum 60. Geburtstag, 165–180. Wiesbaden:
Harrassowitz.
234 Geoffrey Khan
Fox, Samuel E. 2009. The Neo-Aramaic dialect of Bohtan. Piscataway, NJ: Gorgias Press.
Fox, Samuel E. 2015. A > e raising in the Jilu dialect of Neo-Aramaic. Aramaic Studies 13.
161–173.
Furman, Yulia & Sergey Loesev. 2015. Studies in the Ṭuroyo verb. In Geoffrey Khan &
Lidia Napiorkowska (eds), Neo-Aramaic and its linguistic context, 1–28. Piscataway,
NJ: Gorgias Press.
Garbell, Irene. 1965. The Jewish Neo-Aramaic dialect of Persian Azerbaijan: Linguistic
analysis and folkloristic texts. Janua Linguarum. Series Practica 3. Berlin & Boston:
De Gruyter Mouton.
Heinrichs, Wolfhart. 1990. Written Turoyo. In Wolfhart Heinrichs (ed.), Studies in Neo-Ar-
amaic, 181–188. Harvard Semitic Studies 36. Atlanta, GA: Scholars Press.
Hemmauer, Roland & Michael Waltisberg. 2006. Zum relationalen Verhalten der Verbalf-
lexion im Ṭuroyo. Folia Linguistica Historica 27(1–2). 19–59.
Hoberman, Robert. 1988. The history of the Modern Aramaic pronouns and pronominal
suffixes. Journal of the American Oriental Society 108. 557–575.
Hobrack, Sebastian. 2000. Der neuaramäische Dialekt von Umra (Dere-Köyü). Laut- und
Formenlehre. Texte. Glossar. Erlangen-Nürnberg: Friedrich-Alexander-Universität, un-
published MA thesis.
Ishaq, Yusuf M. 1990. Turoyo: From spoken to written language. In Wolfhart Heinrichs
(ed.), Studies in Neo-Aramaic, 189–199. Harvard Semitic Studies 36. Atlanta, GA:
Scholars Press.
Jastrow, Otto. 1988. Der neuaramäische Dialekt von Hertevin (Provinz Siirt). Semitica
Viva 3. Wiesbaden: O. Harrassowitz.
Jastrow, Otto. 1990. Personal and demonstrative pronouns in Central Neo-Aramaic: A com-
parative and diachronic discussion based on Ṭūrōyo and the Eastern Neo-Aramaic dia-
lect of Hertevin. In Wolfhart Heinrichs (ed.), Studies in Neo-Aramaic, 89–103. Atlanta:
Scholars Press.
Jastrow, Otto. 1992. Lehrbuch der Ṭuroyo-Sprache. Semitica Viva 2. Wiesbaden: Harras-
sowitz.
Jastrow, Otto. 1994a. Der neuaramäische Dialekt von Mlaḥsô. Semitica Viva 14. Wies-
Jastrow, Otto. 1994b. Neuentdeckte aramäische Dialekte in der Türkei. In C. Wunsch (ed.),
XXV. Deutscher Orientalistentag. Vorträge. München, 1991, 69–74. Deutsche Morgen-
ländische Gesellschaft: Zeitschrift der Deutschen Morgenländischen Gesellschaft, Sup-
plement 10. Stuttgart: Franz Steiner Verlag.
Jastrow, Otto. 2011. Ṭuroyo and Mlaḥsô. In Stefan Weninger, Geoffrey Khan, Michael
Streck & Janet Watson (eds.), The Semitic languages: An international handbook, 697–
707. Berlin & Boston: De Gruyter Mouton.
Jastrow, Otto. 2015. Language contact as reflected in the consonant system of Ṭuroyo.
In Aaron M. Butts (ed.), Semitic languages in contact, 234–250. Leiden & Boston:
Brill.
Khan, Geoffrey. 2001. Quelques aspects de l’expression d’”être” en néo-araméen. In Anaïd
Donabédian (ed.), Langues de diaspora. Langues en contact, 139–148. Faits de Langues
Revue de Linguistique 18. Paris: Ophrys.
Khan, Geoffrey. 2008. The Jewish Neo-Aramaic Dialect of Urmi. Gorgias Neo-Aramaic
Studies. Piscataway, NJ: Gorgias Press.
Khan, Geoffrey. 2012. Remarks on the historical development of the copula in Neo-Ara-
maic. In Federico Corriente, Gregorio del Olmo Lete, Ángeles Vicente & Juan-Pablo
Vita (eds.), Dialectology of the Semitic languages. Proceedings of the IV Meeting on
Comparative Semitics, Zaragoza 9–11 June, 2010, 25–31. Sabadell: AUSA.
Khan, Geoffrey. 2013. Phonological emphasis in North-Eastern Neo-Aramaic. In Jean Léo
Léonard & Samia Naïm (eds.), Base articulatoire arrière: Backing and backness, 111–
132. Munich: Lincon Europa.
Khan, Geoffrey. 2016. The Neo-Aramaic Dialect of the Assyrian Christians of Urmi. 4 vols.
Khan, Geoffrey. 2017. Ergativity in Neo-Aramaic. In Jessica Coon, Diane Massam & Lisa
Travis (eds.), Oxford Handbook of Ergativity, 873–899. Oxford: Oxford University
Press.
Kim, Ronald. 2008. The subgrouping of Modern Aramaic dialects reconsidered. Journal of
the American Oriental Society 128(3). 505–531.
Murre-van den Berg, Heleen L. 1999. From a spoken to a written language: The introduc-
tion and development of literary Urmia Aramaic in the nineteenth century. Leiden:
Nederlands Instituut voor het Nabije Oosten.
Mutzafi, Hezy. 2000. The Neo-Aramaic Dialect of Maha Khtaya d-Baz: Phonology, Mor-
phology and Texts. Journal of Semitic Studies 45(2). 293–322.
Mutzafi, Hezy. 2008. Trans-Zab Jewish Neo-Aramaic. Bulletin of SOAS 71(3). 409–431.
Mutzafi, Hezy. 2014. The three rhotic phonemes in Ṭyare Neo-Aramaic. Aramaic Stud-
ies 12. 168–184.
Mutzafi, Hezy. 2015. Christian Salamas and Jewish Salmas: Two separate types of Neo-Ar-
amaic. In Geoffrey Khan & Lidia Napiorkowska (eds.), Neo-Aramaic and its linguistic
context, 289–304. Piscataway, NJ: Gorgias Press.
Nakano, Aki’o. 1973. Conversational texts in Eastern Neo-Aramaic (Gzira dialect). Study
of Languages and Cultures of Asia and Africa, A Series, no. 4. Tokyo: Institute for the
Study of Languages and Cultures of Asia and Africa.
Nöldeke, Theodor. 1868. Grammatik der neusyrischen Sprache am Urmia-See und in Kurd-
istan. Leipzig: T.O. Weigel.
Ritter, Helmut. 1990. Ṭūrōyō: Die Volkssprache der syrischen Christen des Ṭūr ʿAbdīn. C:
Grammatik. Stuttgart: Franz Steiner.
Talay, Shabo. 2008a. Die neuaramäischen Dialekte der Khabur-Assyrer in Nordostsyrien:
Einführung, Phonologie und Morphologie. Wiesbaden: Harrassowitz.
Talay, Shabo. 2008b. The Neo-Aramaic dialects of the Tiyari Assyrians in Syria: With
special consideration of their phonological characteristics. In Geoffrey Khan (ed.),
Neo-Aramaic Dialect Studies, 39–63. Piscataway, NJ: Gorgias Press.
Tezel, Aziz. 2015. The Turkish lexical influence on Ṣūrayt/Ṭūrōyo: A preliminary selection
of examples. In Geoffrey Khan & Lidia Napiorkowska (eds.), Neo-Aramaic and its
linguistic context, 69–99. Piscataway, NJ: Gorgias Press.
Tsereteli, Konstantin. 1970. Die Assyrer in der UdSSR. In Ruth Stiehl & Hans Erich Stier
(eds.), Beiträge zur alten Geschichte und deren Nachleben: Festschrift für Franz
Altheim, 2, 375–385. Berlin & Boston: De Gruyter Mouton.
Tsereteli, Konstantin. 1977. Zur Frage der Klassifikation der neuaramäischen Dialekte.
Zeitschrift der Deutschen Morgenländischen Gesellschaft 127. 244–253.
236 Geoffrey Khan
Tsereteli, Konstantin. 1990. The velar spirant ġ in Modern East Aramaic dialects. In Wolf-
hart Heinrichs (ed.), Studies in Neo-Aramaic, 35–41. Harvard Semitic Studies. Atlanta:
Scholars Press.
Waltisberg, Michael. 2016. Syntax des Ṭuroyo. Semitica Viva 55. Wiesbaden: Harrassowitz.
3. Northern Iraq
3.1. Northern Iraq: overview
Geoffrey Khan and Geoffrey Haig
1. Introduction
This section treats the main languages spoken in today’s northern Iraq. Politically,
most of this region is part of the Kurdistan Region within Iraq, established in 2005
under a local administration, the Kurdish Regional Government, with its capital
in Erbil. Historically, the region is part of what is traditionally known as Mesopo-
tamia, geographically encompassing the lower catchment areas of the Euphrates
and Tigris rivers. Across millennia, this is a region characterized by multilingual-
ism. Four thousand years ago, the Semitic languages Akkadian and Amorite, and
the genetically unclassified Hurrian and Sumerian were spoken here,1 and today’s
northern Iraq is likewise multilingual, though the constellation has shifted: Semitic
languages continue to be spoken in the region (Arabic and Neo-Aramaic), and
while no descendants of Sumerian and Hurrian have survived, Turkic and Iranian
languages have been added to the mix. Over the last century the language situa-
tion has undergone further changes due to the displacement of certain population
groups. This applies in particular to the Christian and Jewish communities who
speak Neo-Aramaic, a large proportion of whom now live in a diaspora outside of
Iraq, and also to the non-Islamic groups of Kurdish speakers, such as the Yezidis,
many of whom have been forced to leave their homeland.
Todayʼs languages are genetically affiliated to three different language families:
Semitic (Arabic, Neo-Aramaic), Iranian (Sorani and Kurmanjî Kurdish, Gorani)
and Turkic (Turkman). This section contains chapters on Arabic (Procházka),
Neo-Aramaic (Khan), Iranian (Haig) and Turkic (Bulut), the main spoken lan-
guages of northern Iraq. Each chapter covers the structure, and gives a survey
of dialectal varieties. In keeping with the aims of the volume, we do not treat
the principal official language of Iraq, Modern Standard Arabic, but focus on the
(often poorly documented) local varieties. In the rest of this introduction, we touch
on a number of striking similarities among the languages of this region, several of
which may have arisen through language contact.
1
For a historical perspective on the languages of Iraq see the contributions in Postgate
(2007).
238 Geoffrey Khan and Geoffrey Haig
2. Phonology
Several dialects of the Semitic languages of the region have lost interdental con-
sonants (ṯ and ḏ), generally by shifting the series to stops (t, d) or sibilants (s, z)
(Procházka, chapter 3.2, §2.1.1; Khan, chapter 3.4, §5.1). This brings the conso-
nant system closer to that of Iranian and Turkic, which do not have interdentals.
In several dialects of Sorani Kurdish a /d/ following a vowel or sonorant under-
goes lenition resulting in assimilation or loss of oral articulation, the so-called
‘Zagros d’ (Haig, chapter 3.3, §3.1.1), likewise attested in several Iranian lan-
guages of western Iran (see chapter 4.1, §2.1). In Jewish Neo-Aramaic dialects
spoken east of the Great Zab river (known as trans-Zab dialects) historical inter-
dentals *ṯ and *ḏ shift to the sonorant /l/ (Khan, chapter 3.4, §5.1). An intermediate
stage of development of this appears to have been the shift of the interdentals to
the voiced stop *d. The sonorant /l/ can then be regarded as lenition of the *d. Such
lenition, therefore, can perhaps be considered to be a convergence with that of the
Sorani Kurdish ‘Zagros d’.
The assimilation of /l/ to a stem-final /n/, which is a feature of derivational
suffixes beginning with /l/ in the Turkic dialects of the region (Bulut, chapter 3.5,
§2.2) has a parallel in the Neo-Aramaic dialects, in which inflectional verbal suf-
fixes beginning with /l/ assimilate to a stem-final /n/ (Khan, chapter 3.4, §7.0).
Palatalization of the dorsal stops /k/ and /g/ is a feature that is found in the
Kurdish, Turkic and Neo-Aramaic dialects of the region (Haig, chapter 3.3, §3.1.1;
Bulut, chapter 3.5, §2.1.1; Khan, chapter 3.4, §5.1). A certain degree of areal cor-
relation with regard to this feature can be identified in the various languages. The
predominance of the palatalization of these consonants in the northern varieties of
Turkic in the region, for example, corresponds to the situation in the Neo-Aramaic
dialects, in which it is found mainly in dialects spoken north of Arbil (e. g. Barwar).
The existence of unaspirated voiceless stops in the consonantal system of Kur-
manjî Kurdish in the north of the region (Haig, chapter 3.3, §4.1.1) has induced in
NENA dialects of the area the development of unaspirated voiceless stops, which
are not a historical feature of Aramaic (Khan, chapter 3.4, §5.1). The unaspirated
voiceless stops themselves probably entered Northern Kurdish through Armenian
(Martirosyan, chapter 2.2, §4.1; Haig, chapter 3.3).
In Turkic of the region there is a general tendency for a laryngeal /h/ to be
added before initial vowels and for an initial laryngeal stop /ʾ/ in loanwords to
be replaced by /h/ (Bulut, chapter 3.5, §2.1.1.2). This feature has spread to some
Neo-Aramaic dialects in the east of the region (Khan, chapter 3.4, §5.5). Develop-
ments in the back vowels of Kurmanjî Kurdish dialects in the north of the region
have been replicated in some Neo-Aramaic dialects, especially in the north-east of
the region. This involves the fronting of tense back rounded vowels ([uː] > [yː])
and the raising of close-mid back vowel [oː] towards [u] (Haig, chapter 3.3, §4.1.2;
Khan, chapter 3.4, §5.4).
Northern Iraq: overview 239
3. Morpho-syntax
The Sorani Kurdish suffixed marker of definiteness –eke is used in the Jewish
trans-Zab Neo-Aramaic dialects (Khan, chapter 3.4, §6.1) and in the Turkic dia-
lects of the region (Bulut, chapter 3.5, §2.4.4.1). There is, however, a difference in
grammatical integration. In the Turkic dialects it is added directly after the stem,
before case and plural suffixes, as in Kurdish, whereas in Neo-Aramaic it is added
after plural suffixes.
Constructions in the Kurdish noun phrase containing the genitive/attributive
marker known as ezafe (Haig, chapter 3.3, §3.2.1.1, §4.2.1) appear to have acted as
the model for developments in the Neo-Aramaic and Arabic dialects. In Neo-Ar-
amaic the genitive marker has become an affix on the head noun of genitive con-
structions, in accordance with the Kurdish model, e. g. C. Barwar bron-ət malka
son-gen king ‘the son of the king’. In Sorani Kurdish the ezafe constructions con-
sists of an invariable affix –i on the head noun, as in the Neo-Aramaic example just
cited. In Kurmanjî, however, the ezafe is inflected for gender and number, and the
dependent noun has oblique case-marking: kuř-ē pāšā-yī ‘son-ez . sg . m king-obl ’.
A replication of this oblique case-marking is identifiable in Neo-Aramaic in some
constructions, e. g. C. Barwar bron-ət d-o malka son-gen gen -that king ‘the son
of that king’. In origin the Neo-Aramaic -ət morpheme is a pronominal possessive
suffix + attributive marker < *-eh-d. A similar construction is found in some Arabic
dialects of the region, e. g. šəkl-u l-əlwalad appearance-his gen -young_man ‘the
appearance of the young man’ (Procházka, chapter 3.2, §2.4.2).
Several features of the verbal system of Kurdish have been replicated in
Neo-Aramaic, Arabic and Turkic (Matras 2010: 75; Haig 2017). One conspicu-
ous example of such areal features is the development of preverbal particles in
Neo-Aramaic (Khan, chapter 3.4, §7.0) and Arabic (Procházka, chapter 3.2,
§2.3.3). The preverbal particles in these Semitic languages appear to have a
native etymology, but their distribution and function corresponds closely to the
Kurdish model. This is seen most clearly in the development of preverbal parti-
cles expressing the indicative present, and particles expressing the future. There is
close functional correlation between these particles in Neo-Aramaic and Kurdish.
In Neo-Aramaic dialects in the Sorani-speaking area, for example, the indicative
present particle is used to express also the future, as in Sorani. In the Kurman-
jî-speaking area the Neo-Aramaic future particle is also used to express a habitual,
as in the Kurmanjî model. There are parallels in Neo-Aramaic also to the Kur-
manjî construction in which the ezafe particle is combined with the present indic-
ative to express the progressive (Haig, chapter 3.3, §4.2.2), e. g. Neo-Aramaic C.
Ankawa də-k-garəš (he is) one that pulls, i. e. ‘He is pulling’. It is noteworthy that
neither Neo-Aramaic nor Arabic has replicated the Iranian subjunctive pre-verbal
particle. Some dialects of Neo-Aramaic use a Turkic suffix on imperative verbs,
e. g. C. Barwar pluṭ-gən! ‘go out (sing.)!’ (Khan, chapter 3.4, §7.0), which cor-
responds to the Turkic imperative modifying suffix {-ginen} (Bulut, chapter 3.5,

§2.3.2.3).
Another innovative feature of the Neo-Aramaic and Arabic verbal systems that
has developed through contact with Iranian is a perfect form that is distinct from
the past perfective. Neo-Aramaic dialects exhibit various perfect forms, most of
which have clear structural correlates in Kurdish. These include constructions con-
sisting of a participle and copula (e. g. C. Barwar grišɛle ‘he has pulled’ < griša
+ ile pull.ptcp + cop .3 ms ) and constructions consisting of an invariable preverbal
particle placed before the past perfective form (e. g. J. Arbel lā grəšle ‘he has
pulled’ particle pull.pst .3 ms ) (Khan, chapter 3.4, §7.0). Arabic exhibits the latter
type of perfect construction, e. g. ʔaš kū-ǧāb what particle -bring.pst .3 ms ‘What
has he brought?’ (Procházka, chapter 3.2, §2.3.3). Turkic dialects in the region
have also developed an innovative perfect construction consisting of a participle
and copula on the Iranian model, which is in the process of replacing the old Turkic
perfect containing the morpheme {-mIš} (Bulut, chapter 3.5, §2.4.2). There has
also been borrowing of actual morphemes in the verbal domain. For example, the
Sorani Kurdish post-verbal aspectual clitic -ewe, -eve basically meaning ‘again,
backʼ, has been borrowed into Kurmanjî Kurdish and Neo-Aramaic.
There are innovative developments in the expression of pronominal comple-
ments of verbs in some of the languages of the region. The Turkic dialects, for
example, actually borrow the pronominal clitics of Sorani, attaching them to the
inflected Turkish verb to express recipients and benefactives, on the model of
Iranian (Bulut, chapter 3.5, §2.4.4.2). Nominative forms of clitic pronouns are
used to express the direct object in ditransitive constructions in both Arabic and
Neo-Aramaic, e. g. Mosul Arabic ʕaṭētū-k=uwwa give.pst .1 s -2 ms =3 ms . nom ‘I
gave it to you’ (Procházka, chapter 3.2, §2.2.2), Neo-Aramaic C. Qaraqosh kewi-
ləḥ=ina give. prs .3 pl - l .3 ms =3 pl . nom ‘They give them to him’ (Khan, chapter 3.4,
§7.0).
All languages of the region have enclitic copulas. In the Arabic dialects
these are enclitic nominative pronouns, which are a clear innovation in relation
to earlier attested forms of Arabic. The Neo-Aramaic copula is likewise histor-
ically an enclitic pronoun, but in most dialects it has acquired some degree of
verbal inflection due to convergence with the Iranian verbal enclitic copula. In all
West Iranian languages, clause-final enclitic copulas, etymologically going back
to the Old Iranian verbal copula, are present. Thus these elements are clearly an
inherited feature of Iranian, and presumably provided the model for the devel-
opment of overt copulas in the neighbouring Semitic languages of northern Iraq.
However, the influence has gone both ways, with the Kurmanjî Kurdish dialects
of the region exhibiting an additional innovative copular construction, used in e. g.
clauses expressing spatial location. The copula particle in these constructions is
clause-medial, and is etymologically related to the Ezafe particle, hence ultimately
of pronominal origin (Haig, chapter 3.3, §4.2.2).
Northern Iraq: overview 241
The alignment of Neo-Aramaic in clauses with past perfective verbs exhibits

various features of ergativity, although no dialect has totally canonical ergativity.
This is an innovation that has arisen by convergence with the Iranian model of
ergativity, likewise restricted to clauses based on past-tense stems of the verbs.
Particularly noteworthy is the manner in which verbal agreement patterns are
reversed in the accusative (present, or imperfective) and ergative (past, or perfec-
tive) structures respectively. In Neo-Aramaic, agreement with the agent of tran-
sitive clauses is expressed by one set of verbal affixes in the present tenses (the
D-suffixes), while the object can be expressed by another set (the L-suffixes). In
the past tenses, this is reversed. For example, in the Jewish NENA dialect of Sule-
maniyya (Khan, chapter 3.4, §7.0) we have garš-á-lu pull.prs -3. fs (D)-3. pl (L)
‘she pulls themʼ, while in the past tense (with stem-vowel alternation) we have
gərš-á-lu ‘they pulled herʼ. In Sorani in the same region, verbal suffixes index
the transitive subject in the present, and the object is indexed via mobile pronom-
inal clitics, but in the past this is reversed: present tense nā=t=nēr-īn neg =2 sg .
cl =send.prs -1 pl ‘we don’t send you’ versus past ne=t=nerd-īn ‘you didn’t send
us’. Although the NENA and Sorani systems do not match completely (in NENA
the ergative agreement patterns often extend to certain intransitives, and the argu-
ment marking in Sorani past transitives is more complex than suggested here)
the overall pattern of a tense-based reversal of functions for two sets of person
markers is comparable. It seems unlikely that a complex and cross-linguistically
rare pattern of this nature would have arisen independently in two unrelated lan-
guages (but see also the Kartvelian “inversion”, described for Laz in Lacroix,
(chapter 6.2).
Several of the Neo-Aramaic dialects of the region have developed a predomi-
nantly verb-final clausal syntax in conformity with that of Iranian and Turkic dia-
lects. This applies in particular to the Jewish trans-Zab dialects (Khan, chapter 3.4,
§8.2). All languages of the region, regardless of genetic affiliation and regardless
of the position of the direct object, place the goal of a verb of movement after the
verb, regardless of the position of the direct object. As a result, the Jewish trans-
Zab dialects of NENA, the Iranian languages, and the Turkic languages of the
region all converge on a typologically unusual combination of OV (direct object-
verb) and VG (verb-goal) word order, for which Iranian languages appear to have
provided the model. Similarly, all languages of the region favour finite construc-
tions for subordination, including relativization. While this is to be expected for
the Iranian and Semitic languages, the Turkic languages of northern Iraq have
adapted towards the Iranian and Semitic patterns, making widespread use of finite
subordination. Notably, however, the Turkic languages of northern Iraq have
retained more non-finite subordination than, for example, the Turkic languages of
Iran (Bulut, chapter 3.5, §2.5.1).
References
University Press.
Matras, Yaron. 2010. Contact, convergence, and typology. In Raymond Hickey (ed.), The
Handbook of Contact Linguistics, 66–85. Malden & Oxford: Blackwell.
Postgate, John Nicholas (ed.). 2007. Languages of Iraq. Ancient and modern. Cambridge:
British School of Archaeology in Iraq.
3.2. The Arabic dialects of northern Iraq
Stephan Procházka
1. Background to the language and the speech community
The region covered by this chapter comprises the Arabic-speaking parts of Iraq
north of the city of Širqāṭ and thus roughly corresponds to those areas of the
current administrative District of Niniveh (Nīnawā) which are situated along and
west of the Tigris River. The focus of the linguistic description lies on the contem-
porary dialect of the district’s capital Mosul (al-Mawṣil), which is a predominantly
Arabic-speaking city.
Vernacular Arabic belongs to the Semitic branch of the Afro-Asiatic language
family. Within the continuum of Arabic dialects, most varieties spoken in northern
Iraq belong to the so-called Tigris branch of the Mesopotamian group. The rural
Bedouin-type dialects (see below) form a distinct subgroup within Mesopotamian
Arabic and exhibit many features in common with the dialects of Arabia. Some
of the Arabic spoken close to the Syrian border constitutes a different layer of
Bedouin Arabic: it is of the Šammar type1 and hence also belongs to the Arabian
dialect group. An overview of the main localities discussed in this chapter is pro-
vided in Fig. 1.
1.1. Brief history of the speech community

The presence of Arab tribes in pre-Islamic Mesopotamia is well-documented and
can be traced back to the Parthian period and earlier. By the end of the Sassanian
Empire the whole region west and south of Iraq was probably dominated by Arab
pastoralists. The sedentary Arabs, whose settlements were situated mainly along
the Euphrates, were culturally assimilated and often bilingual or even trilingual
as they could speak Aramaic and Persian, the then dominant languages. The lin-
guistic situation changed with the Muslim conquest of the region in the middle of
the 7th century CE, which fostered massive Arab immigration from the Arabian
Peninsula.2 In the following centuries Arabic gradually replaced Aramaic and—
to a lesser extent—Persian and Kurdish. The details of the Arabicization of the
northern part of Iraq are largely unknown (Holes 2007: 123). What can be said
with certainty is that it was a relatively slow process and that it only marginally
1
Named after the large tribal confederation whose members live in northern Najd, the
Syrian Desert, and parts of the Jazeera.
2
For a detailed description of the Arab presence in pre- and early Islamic Iraq cf. Morony
(2005: 214–253).
affected the mountainous regions east of the Tigris and the Sinjar Mountains west
of Mosul. Today, in both regions Kurdish is the dominant language, with scattered
enclaves of Arabic- and Neo-Aramaic-speaking small towns and villages.3
In Iraq, as well as in northeastern Syria and parts of southeastern Anatolia,
two layers of vernacular Arabic can be differentiated. The older layer consists
of sedentary-type dialects which are usually labeled qǝltu dialects (after the verb
qǝltu ‘I said’) and which are mainly spoken in towns and ancient settlements. The
Bedouin-type dialects (also called gǝlǝt dialects after the same verb) form a more
recent layer because its speakers gradually infiltrated Iraq from the 13th century
onwards.4 Thus, “for a long time the Arabic spoken in the province of Mosul con-
sisted only of qǝltu dialects, spoken along with varieties of Neo-Aramaic” (Abu
Haidar 2004: 2, Fn.5). Until their exodus in 1950–1951, the Jews of some larger
towns in Kurdistan (e. g. Arbil, ʕAqra, and Kirkūk) spoke a distinct qǝltu dialect
which constituted the only autochthonous Arabic vernacular in that region (Jastrow
1990: 5–6). Encouraged by the Arabicization policy of the Baathist regime, during
the 1970s and 1980s Arabic speakers from other parts of Iraq settled in Kurdish
and Turkoman speaking areas of the country. But after 2003 many of them left the
region and today both vernacular and standard Arabic play an ever-decreasing role
in the area controlled by the Kurdistan Regional Government.
1.2. Current size, status, and religious affiliations of the speech community
No up-to-date statistical data regarding population structures of Iraq are available,
but from reliable sources5 the number of Arabic speakers in the region can be esti-
mated to be between 2.5 and 3 million people. As a vernacular of the official lan-
guage of Iraq, the local Arabic possesses a higher status than the other languages
of the region. This is particularly true for the dialect of the city of Mosul, which
can be regarded as the prestige variety of the whole area and which constitutes
without doubt the most important qǝltu dialect with regard to number of speakers
(cf. Jastrow 1979: 36).
The overwhelming majority of the Arabic-speaking population of the governo-
rate are Sunni Muslims; however, some of the speakers of the gǝlǝt and Šammar
dialects belong to Shia Islam (e. g. in the town of Rabīʕa close to the Syrian border).
Christians of different confessions used to constitute as much as 15 per cent of the
area’s population, but their number drastically decreased in the aftermath of the
3
Cf. the detailed map “Languages in the area of al-Mauṣil” of the Tübinger Atlas des
Vorderen Orients (TAVO: A VIII 11).
4
For the underlying historical developments and their impact on the linguistic setting of
Iraq cf. Holes (2007: 130–134).
5
Inter-Agency Information and Analysis Unit, Ninewa Governorate Profile, downloaded
from www.iauiraq.org/gp/print/GP-Ninewa.pdf (accessed 01 April 2013).

The Arabic dialects of northern Iraq
245
Figure 1: Locations of Arabic dialects discussed in this chapter

2003 war, which brought in its wake hostile attitudes towards non-Muslim com-
munities.6 A few Arabic speakers are Yazidis, who are concentrated in the Arab vil-
lages of Bǝḥzāni and Baʕšīqa, 20 km northeast of Mosul (Jastrow 1978: 24), and
in the town of Tall Kayf.7 The conquest of large parts of northern Iraq, including
Mosul, in the summer of 2014, by the ultra-Islamist forces of the so-called Islamic
State has had a massive impact on the Christian and Yezidi inhabitants. Most if not
all of them were killed or expelled from the regions controlled by the extremists.

Due to the multi-ethnic character of the region, bilingualism of Kurdish and Arabic
as well as of Turkoman and Arabic is a relatively widespread phenomenon. In most
places, particularly urban settings, where Arabic speakers rarely know Kurdish,
vernacular Arabic is the dominant language of communication, trade, and busi-
ness. Given its status and its large number of speakers, northern Iraqi vernacular
Arabic is, on the whole, definitely not endangered. However, this does not include
the varieties of the displaced Christian communities, whose linguistic identity is
definitely at risk because of their diaspora. The Jewish dialects of the region have
already become nearly extinct, being now spoken only by a few old people, mostly
living in Israel. Northern Iraqi vernacular Arabic is rarely written except in infor-
mal usage (especially by younger people in social media or text messages) or in
collections of local proverbs, songs, fairy tales, and the like.
2. Linguistic description
As was mentioned, this article mainly covers the Arabic of the city of Mosul,
called Maṣlāwi. But observations will also be made about characteristic features
of related dialects of the region. There are no data available for the northern Iraqi
gǝlǝt and Šammar dialects, so they must be left out of consideration. As was
pointed out by Jastrow (1979: 54–55), Maṣlāwi is a very typical variety of Iraqi
Arabic in general, and of the Tigris group in particular. The city with its approxi-
mately two million inhabitants is situated in the center of the qəltu-speaking area
and is the source of linguistic diffusion into its surroundings. As there are only
6
Most of the area’s Christians left Iraq for Europe and America; but tens of thou-
sands fled to politically stable Iraqi Kurdistan (source: webpage of the German NGO
“Gesellschaft für bedrohte Völker” www.gfbv.de/pressemit.php?id=2958&stayInside
Tree=1 (accessed 26 March 2013).
7
ar.wikipedia.org/wiki/%D8 %AA%D9 %84 %D9 %83 %D9 %8A%D9 %81 (accessed
25 March 2016).
The Arabic dialects of northern Iraq 247
minor differences between Muslim and Christian Maṣlāwi speakers,8 we have not
indicated the religion of the speakers of the examples given below.
2.1. Phonology
2.1.1. Consonants
The following table includes the consonantal phonemes of Mosul Arabic.
Table 1: The consonantal phonemes of Mosul Arabic

Bilabial Labio- Inter- Dental- Palatal Velar Uvu- Pharyn- Glottal
dental dental alveolar lar geal
Plosive
voiceless p t ṭ [ᵵ] k q (ʔ)
voiced b d ɡ
Affricate
voiceless č [t͡ʃ]
voiced ǧ [d͡ʒ]
Fricative
voiceless f ṯ [Ɵ] s ṣ [ᵴ] š [ʃ] x ḥ [ħ] h
voiced ḏ [ð] z ġ [ɣ] ʕ
ḏ ̣[ð̴]
Nasal m n
Lateral ḷ [ɫ]
Trill (r)
Approximant w y
A salient feature of Mosul Arabic and of (almost all) other Tigris branch dialects is
that /r/ is only a marginal phoneme since in most words the two Old Arabic (= OA)
phonemes /r/ and /ġ/ have merged into /ġ/9 (Jastrow 1979: 38), e. g. baqaġ ‘cows’
(< baqar), ġǝǧǝl ‘foot’ (< riǧl), daġǝb ‘path’ (< darb).10 In the sequence v̆ġC, /ġ/
8
Cf. Jastrow 2004: 142; this was confirmed by my informants. The dialect of the Jews,
however, significantly deviated from Muslim and Christian Maṣlāwi.
9
The sound /r/ is often found instead of /ġ/ in numerals. It is also heard in some lexical-
ized words like rǝkǝb ‘he mounted (a horse)’ and farraġ ‘he emptied’ (Jastrow 1979:
38).
10
Examples, for which no source is given, are from own research with informants and
unpublished BA/MA-theses from the University of Vienna, particularly by Hiam Wadie
and Lea Bäumler.
is frequently elided, resulting in the lengthening of the preceding vowel: e. g. qōṣa
(< qǝġṣa) ‘loaf (of bread)’. The sound shift r > ġ is an ancient Iraqi feature and
discussed in detail by Blanc (1964: 20–25).
In the dialects spoken in the western part of northern Iraq, /r/ and /ġ/ are inde-
pendent phonemes: Rabīʕa rǝḥtu ‘I went’; ʔarnab ‘rabbit’ (Abu Haidar 2004: 5);
al-Baʕʕāǧ11 rǝǧǝl ‘foot’ (Talay 1999: 68). The same was true for all the Jewish
dialects of Kurdistan (Jastrow 1990: 6).
Most dialects of the region possess the three interdental fricatives /ḏ/, /ṯ/, /ḏ/.̣
Exceptions are Bǝḥzāni, where they have shifted to sibilants (Jastrow 1978:
34–38), and Rabīʕa where they have become dental stops: e. g. hāda ‘this’, mǝtǝl
‘like’, ḥaḍḍ ‘luck, fate’ (Abu Haidar 2004: 4). Concerning the sounds /p/, /g/, and
/č/, see below, section 3. A common feature of Mosul Arabic is the devoicing of
consonants in word-final position; this applies particularly to /b/ and /d/, but not
to fricatives such as /ʕ/ and /ġ/ (Jastrow 1979: 41). Abu Haidar (2004: 6) reports
examples in the dialect of Rabīʕa of the elision of /ʕ/: for instance mōrǝf ‘I don’t
know’ (< mā aʕrǝf).
2.1.2. Vowels
Mosul Arabic has two short (/ǝ/ and /a/) and five long (/ā/, /ē/, /ī/, /ō/, /ū/) vowel
phonemes (Jastrow 1979: 39). Diphthongs are monophthongized in Mosul but
occur in Rabīʕa: mawt ‘death’, bayt ‘house’ (Abu Haidar 2004: 9). Frequently
found are the shifts ū > ō and ī > ē when the vowel precedes a velarized or back
consonant: ṣōġa ‘picture’ (< ṣūra); baṭṭēxa ‘sugar melon’ (< baṭṭīxa).
Diachronically seen, in many words an original vowel /ā/ has shifted to /ē/ or
to /ī/ under the influence of /i/ and /ī/, respectively, in the same word. This pho-
nological process (called ʔimāla by the Arab grammarians) is a characteristic of
all qǝltu-dialects. Examples: ǧawēmǝʕ ‘mosques’ (< ǧawāmiʕ); fanīǧīn ‘cups’ (<
fanāǧīn)12 (Jastrow 1979: 40). In Mosul Arabic this sound shift also happens when
a suffix that contains a long or short /i/ is attached to a noun: e. g. bǝstān ‘garden’,
bǝstēn-či ‘gardener’ (Blanc 1964: 47).
The feminine suffix has two allomorphs, -i and -a, which depend on the preced-
ing consonant (Jastrow 1979: 40): The allomorph -a appears after velarized and
back consonants, and -i in all other cases. Examples are: bēḏạ ‘egg’, mēlḥa ‘salty’,
fāġa ‘mouse’, ǧēǧi ‘hen’, lēli ‘night’, and ǧāmūsi ‘female buffalo’.
11
The qəltu Arabic speaking inhabitants of the town are of the Xawētna tribe whose dia-
lect belongs to the Euphrates branch.
12
In Rabīʕa only the shift ā > ē occurs: e. g. dakēkīn ‘shops’ (and not *dakīkīn; Abu Hai-
dar 2004: 9).
2.1.3. Syllable structure

The possible syllables are Cv̆ , Cv̄ , Cv̆ C, and Cv̄ C. The syllable CCv̆ /v̄ - is usually
realized with the epenthetic, non-phonemic vowel /ǝ/, which traditionally has been
transcribed as ǝ: e. g. ǝnġōḥ ǝnzūr xālna ‘we are going to visit our uncle’.13 The
sequence Cv̆ CC#14 is always changed to Cv̆ CǝC# (Jastrow 1979: 41), e. g. ḏǝ̣ hǝġ
‘noon’, mǝlǝḥ ‘salt’, qǝġǝn ‘horn’ (but qəġnu ‘his horn’).
If, due to inflection or the attaching of suffixes, a syllable of the type Cǝ
becomes unstressed, the vowel is in most cases elided: šǝ́ġǝb ‘he drank’, *šǝġǝ́btu
> ǝšġǝbtu ‘I drank’. A typical feature of the qǝltu dialects is that, if a pronominal
suffix is attached to a noun or verb, elision of the final vowel is prevented by
moving the stress onto the word’s last syllable: mašā́ kǝl ‘problems’ – mašākǝ́lǝm
‘their problems’; qǝṣṣa ‘story’ – qǝṣṣǝ́ti ‘my story’; yāxǝḏ ‘he takes’ – yāxǝ́ḏu ‘he
takes it’ (in contrast to yāxḏu ‘they take’).
2.2.1. Inflection
There are no case markers in northern Iraqi vernacular Arabic. There is masculine
and feminine gender, the latter usually marked by the morpheme -a/-i. Nouns can
generally be inflected according to the three numbers singular, dual, and plural:
the singular is not marked; the suffix of the dual is -ēn; and the plural is formed by
suffixes (in particular -āt) or by the internal reshaping of the word pattern.
Table 2: Number inflection on nouns

time, instance door path
sg kaġġa bāb daġǝb
dual kaġġtēn bābēn daġbēn
pl kaġġāt ǝ
bwāb ǝ
dġūb
In Rabīʕa and Mosul the unit nouns of collectives often have the ending -āyi: e. g.
nǝǧmāyi ‘a star’, kǝbbāyi ‘a patty of crushed wheat and minced lamb’, bayḍāyi ‘an
13
In the former Jewish dialects of Kurdistan an initial CC- was realized as CǝC-. In
certain verbal patterns it also occurred without an epenthetic vowel: e. g. šṛab ‘drink!’
Examples of CǝC are zǝbōʕ ‘week’, šǝrǝbtu ‘I drank’, and nǝnām ‘we sleep’ (Jastrow
1990: 32).
14
If the two consonants are identical, only one is pronounced: C1vC2C2 > C1vC2. Only in
the dialect of the village of Bǝḥzāni syllables ending on -CC# are found (Jastrow 1979:
41).
egg’ (Abu Haidar 2004: 10). In Mosul most of these forms are used besides those
ending in -i/-a; thus one also hears nǝǧmi star’ and bēḏạ ‘egg’.
2.2.2. Pronouns
The following table shows the independent and bound personal pronouns of Mosul
Arabic, based on Procházka (2006–2007: 125) and Jastrow (1979: 42–43).
Table 3: Pronouns (independent and bound) of Mosul Arabic

independent forms bound forms after C — after V
1 sg ʔana -i/-ni — -yi/-ni
2 sg m ʔǝnta -ak — -k
2 sg f ʔǝnti -ki — -ki
3 sg m huwwa (hīnu) -u — -nu
3 sg f hīya -a — -ha
1 pl nǝḥna -na — -na

2 pl ʔǝntǝm -kǝm — -kǝm
3 Pl hīyǝm -ǝm127 — -hǝm
12715
The form hīnu for 3m . sg seems to have become obsolete as only huwwa is heard in
contemporary speech (cf. also Jastrow 1979: 42). The independent forms are used
mainly in the subject position, but they (or shortened variants of them) can also
indicate a second pronominalized direct object: ʕaṭētū-k-uwwa ‘I gave it (masc.) to
you’, ʕaṭētū-k-īya ‘I gave it (fem.) to you’ (Jastrow 1979: 43); mā-nǝʕṭī-ki-hīyǝm
‘we do not give them to you’ (Jastrow 2004: 143/8); sallamtū-lǝm hīyǝm ‘I handed
them over to them’ (Jastrow 2004: 143/7).
2.3.1. Inflection
There is a twofold system for the inflection of finite verbs: the suffix-based conju-
gation for the perfect, and the mostly prefix-based conjugation for the imperfect.
The inflectional affixes are attached to the perfect and imperfect stem, respec-
tively. Two sets of vowel patterns exist for the basic stem (also called Form I):
CaCaC and CǝCǝC for the perfect, and -CCaC and -CCǝC for the imperfect.
15
In Rabīʕa -ǝn is used (as in Anatolia): baytǝn ‘their house’ (Abu Haidar 2004: 11).
Table 4: Perfect of šǝġǝb ‘to drink’

sg pl
1 ǝ
šġəb-tu ǝ
šġəb-na
2 m ǝ
šġəb-ət ǝ
šġəb-təm
2 f ǝ
šġəb-ti
3 m šəġəb
šəġb-u
3 f šəġb-ət
Table 5: Imperfect of šǝġǝb ‘to drink’

sg pl
1 ʔa-šġab nə-šġab
2 m tə-šġab
tə-šġab-ūn
2 f tə-šġab-īn
3 m yə-šġab
yə-šġab-ūn
3 f tə-šġab
The final –n of –īn and –ūn is dropped when a pronominal suffix is attached: e. g.
yəqšaʕūn ‘they see’, but yəqšaʕū-k ‘they see you (m . sg )’.
Table 6: Imperative of šǝġǝb ‘to drink’

sg pl
m ǝ
šġab ǝ
šġab-u
f ǝ
šġab-i
There exists an enclitic particle =ze16 which is used to emphasize imperative

forms: e. g. qūmī-ze ‘Stand up (f . sg )!’.
2.3.2. Derivation
The derivation of verbs is limited to nine forms for triconsonantal roots and two
for quadriconsonantal roots. Each form has a set of patterns for the perfect and
imperfect base as well as for the active and passive participle.
16
This appears to parallel the use of a clitic particle īš in Iraq-Turkic, which may be added
to finite verbs (including imperatives) to impart a sense of emphasis. In Iraq-Turkic,
it can be plausibly attributed to borrowing of the Kurdish clitic particle =īš / =žī; cf.
Bulut, this volume, chapter 3.5, §3.4.3.
2.3.3. Verbal modifiers

A set of verbal modifiers is used in Maṣlāwi Arabic and other sedentary dialects
of the region17 to express future tense as well as the durative, habitual, perfect, and
progressive. The last is expressed by the prefix qa- (qad- with 1 sg ):18 e. g. qa-yfat-
tlūnu b-ǝs-sōq ‘they are showing him around the market’ (Jastrow 1979: 47);
ən-nǝswān qa-yəxbəzūn xəbəz ‘the women are baking bread’. Progressive present
tense is also indicated by the invariable prefix kē-, e. g. ġēḥǝt ǝl-pāča kē-tǝṭlaʕ ‘the
smell of the pāča is ascending’.19
In Mosul continuous or habitual past tense is indicated by inflected forms of
kān ‘to be’ followed by an imperfect: e. g. kənna nəxbəz ‘we used to bake bread
~ we were baking bread’ (Jastrow 1979: 47). In Rabīʕa, the same is expressed by
the prefix ka-: e. g. ka-tǝḥlǝb ǝn-naʕǧēt ‘she used to milk the ewes’ (Abu Haidar
2004: 10).
The present perfect aspect is expressed by the invariable proclitic kǝn- fol-
lowed by a verb in the suffix conjugation: e. g. w-nǝḥna kən-ʕaǧanna l-ʕaǧīn
w ḥaḏḏ̣ ạ ġnānu ‘and we have already kneaded the dough and have prepared it’
(Jastrow 1979: 72/34); mā kǝn-ǝġšǝʕt ǝš-šōbak? ‘Haven’t you seen the rolling pin
yet?’ (Jastrow 1979: 70/29).
In the dialects of the Jews of Kurdistan, the verbal modifier kū was used for
both present and past time events: ʔaš kū-trīd? ‘What do you want?’; ʔaš kū-ǧāb
‘What has he brought?’ (Jastrow 1990: 63–67).
Future tense is indicated by the invariable ġayyǝḥ: e. g. ʔaš ġayyǝḥ tǝʕmalūn
ġada b-ǝl-lēl? ‘What will you (pl ) do tomorrow evening?’; ġayyəḥ tǝzzawwaǧ ‘she
will marry’. Jastrow (1979: 47) mentions two other modifiers, də- and kawi (dǝ-):
e. g. balki də-yəbʕaṯ ‘maybe he will send’; kawi ʔaġōḥ ~ kawi d-aġōḥ ‘I shall go’.
In the village of Bǝḥzāni tǝ- is found: e. g. tīqūl (< *tǝ-yqūl) ‘he will say’ (Jastrow
1978: 303).
2.4. Syntax
2.4.1. The syntax of copula clauses

The only known dialects of northern Iraq which use a copula in expressions “A is
B” are all spoken east of the Tigris. Among them is the dialect of the mainly Yazidi
17
An exception is the dialect of the Xawētna – probably because of Bedouin influence
(Talay 1999: 178).
18
The prefix goes back to the participle qāʕid ‘sitting’.
19
This modifier is not found in Jastrow’s texts, but in the texts published by Socin 1882–
83 (see also Jastrow 1979: 47, Fn.33). According to my informants it is typical of Chris-
tian Maṣlāwi speakers.
village of Bǝḥzāni and the former Jewish dialects, which also used an enclitic
copula.
(1) Bəḥzāni (for details see Jastrow 1978: 135)
báyt-a-hu
house-3 sg . f - cop .3 sg . m
‘It is her house.’
(2) Kurdistan Jews (Jastrow 1990: 37 f)
b-əl-bēt-nəḥne
in - def -house-cop .1 pl
‘We are at home.’
2.4.2. Nominal attribution

There are different constructions for nominal attribution in the dialect of Mosul.
One possibility is direct asyndetic attribution which requires the construct state of
the head noun that is morphologically marked only with feminine nouns (exam-
ples 3a and b). A less common way of expressing the same meaning is by an antic-
ipatory pronoun plus the preposition l- ‘for’ (example 3c).
(3) Mosul (Jastrow 2004: 142/5)
a. sīyār-t ǝl-pōlīs ǧǝ-tt ʕalē-na
car-f . constr . st def -police come-pf .3 sg . f to-1 pl
‘The police car came to us.’
Mosul (own data)
b. šəkl əl-walad
appearance.constr . st def -young_man
‘the young man’s outer appearance’
Mosul (Jastrow 1979: 49)
c. šəkl-u l-ǝl-walad
appearance-3 sg . m for-def -young_man
‘the young man’s outer appearance’
Extremely frequent is analytical attribution by means of the so-called genitive
marker māl or mālǝt. In this case the whole expression can be definite even if the
head noun is morphologically marked as indefinite.20 (4) and (5) are two examples
of definite expressions with an indefinite head noun, whereas (6) and (7) are exam-
ples of expressions with definite head nouns.
(4) Mosul (Jastrow 2004: 143/7)
20
There is evidence for an analogous usage in the Christian Baghdadi dialect (Blanc
1964: 125–126) and in the Jewish dialect of Hīt (Khan 1997: 93).
ʔaxaḏ-u ʔaqwāl mālǝt-na

take-pf .3. pl testimony.pl gen . marker -1 pl
‘They recorded our testimonies.’
(5) Mosul (Jastrow 2004: 143/7)
daftar xǝdma māl zōǧ-i
notebook service gen . marker husband-1 sg
‘my husband’s service record’
(6) Mosul (Jastrow 2004: 143/18)
asbāb ǝl-xurūǧ mālǝt-i mǝn ǝl-ʕirāq
reason.pl def -exit gen . marker -1 sg from Iraq
‘my reasons for leaving Iraq’
(7) Mosul (Jastrow 2004: 144/33)
ǝl-kǝġši māl-u
def -paunch gen . marker -3 sg . m
‘his paunch’
An indefinite expression with an indefinite head noun is:
(8) Mosul (Jastrow 2004: 144/28)
ʕḏạ̄ m māl ġanam
bone.pl gen . marker sheep
‘bones of sheep’
2.4.3. Direct objects

Besides asyndetically expressed direct objects, a second way to mark a direct
object if the latter is a definite noun is by linking the noun to the predicate by the
preposition l-21 preceded by an anticipatory pronoun that agrees in number and
gender with the object. This construction, which is found in several other Semitic
languages (Khan 1984), is commonly regarded to be an Aramaic substratum in
(northern) Iraqi Arabic (Blanc 1964:130; Contini 1999: 105; Weninger 2011: 750).
In the dialect of Mosul it is very frequent and often in free variation with construc-
tions without marking the direct object:22
(9) Mosul (own data)
a. ǝ
mbēḥa qšǝʕ-tū-nu l-abū-yi
yesterday see-pf .1 sg -3 sg . m for-father-1 sg
21
The preposition is sometimes omitted (Jastrow 1979: 49).
22
There are many instances of it in the texts which Jastrow 1979 recorded from a Muslim
woman. In Jastrow 2004, recorded from a Christian woman, this construction is much
rarer.
or:7
b. ǝ
mbēḥa qšǝʕ-tu ʔabū-yi
yesterday see-pf .1 sg father-1 sg
‘Yesterday I saw my father.’
2.4.4. Negation
Several particles are used in different situations for negation. They all immediately
precede the word to be negated. The particle mā is the most frequent way to negate
finite verbs, both perfect and imperfect.
(10) Mosul (Abu-Haidar 2002: 6)
ḥamāt-i kǝll ʕǝmġ-a mā dāġ-ǝt-l-i
mother-in-law-1 sg all life-3 sg . f neg turn-pf .3 sg . f -to-1 sg
bēl
attention
‘All her life my mother-in-law didn’t pay any attention to me.’
kaġġt-ēn sallam-tu ʕalayy-a w mā sallam-ǝt ʕalayy-i
time-dual greet-pf .1 sg on-3 sg . f and neg greet-pf .3 sg . f on-1 sg
‘Twice I greeted her and she didn’t greet me.’
mā n-ṭīq nə-dfaʕ əl-īǧāġ
neg ipf .1 pl -can ipf .1 pl -pay def -rent
‘We cannot pay the rent.’
Occasionally mā is also used to negate active or passive participles:
ʔa-šūf mā lēbǝs qāṭ
ipf .1 sg -see neg wear.ap . m . sg suit
‘I see you are not wearing a suit.’
The particle lā negates the imperative, as in (14), and occasionally also an optative
(15). When followed by kān it expresses the speaker’s wish that an event should
not have happened, as in (16).
lā tǝ-nṭǝy-ǝm ahammǝyyi ha-l-qadd!
neg ipf .2. m . sg -give-3 pl attention dem - def -extent
‘Don’t give them so much importance!’

lā yǝ-qšaʕ-ōn yōm ġāḥa
neg ipf .3-see-pl day rest
‘May they never see a day of rest!’
lā kān sāhaġ-təm l-nǝṣṣ ǝl-lēl
neg be.pf .3 sg . m stay.awake-pf .2. pl to-middle def -night
‘You shouldn’t have stayed up until midnight!’
The particle mū negates a non-verb (17). Sometimes, in admonitory speech, it
precedes an imperfect, shown in (18):
mū ǝsm-a salwa maġt axū-nu
neg name-3 sg . f Salwa wife brother-3 sg . m
‘His brother’s wife is not called Salwa.’
mū tǝ-ǧ-ōn šǝbʕān-īn
neg ipf .2-come-pl replete-pl
‘You (pl ) must not arrive feeling full!’
2.4.5. Syntax of clause linkage

Coordinative clauses are mostly linked by conjunctions, particularly w ‘and’, bass/
lākǝn ‘but’, and wǝlla ‘or’. Subordinate clauses which are dependent on verbs of
speaking, of belief, and the like are very often not introduced by a conjunction,
shown in (19–20). Less frequently, the same may be true of final clauses, as in
(21).
(19) Mosul
qāl mā yǝ-ǧi
say.pf .3 sg . m neg ipf .3 sg . m -come
‘He said that he will not come.’
(20) Mosul
mā n-ṣaddǝq ġayyǝḥ yǝ-zzawwaǧ-ūn
neg ipf .1 pl -believe fut ipf .3-marry-pl
‘We don’t believe that they will marry.’
(21) Mosul (Jastrow 1979: 70/33)
lǝmmǝn-mā nǝ-ǧi nǝ-ʕlǝq-u
when-prtcl ipf .1 pl -come ipf .1 pl -light-3 sg . m
‘When we come to light it.’
Final clauses are frequently marked by the verbal modifier dǝ-.

(22) Mosul (Jastrow 1979: 75/42)
w yǝ-ǧi yǝ-ġlī-nu yǝ-šġab-u
and ipf .3 sg . m -come ipf .3 sg . m -boil-3 sg . m ipf .3 sg . m -drink-3 sg . m
dǝ-yǝ-staġīḥ
mod - ipf .3 sg . m -recover
‘To recover, he boiled it and (then) drank it.’
(23) Mosul
ǧā baʕǝd sani dǝ-y-āxǝḏ ḥǝṣṣǝt-u
pf .3 sg . m .come after year mod - ipf .3 sg . m -take share-3 sg . m
‘After one year he came to take his share.’
Conjunctions are frequently found in temporal and causal clauses as well as in
conditionals. Many temporal conjunctions consist of a preposition followed by a
particle mā. Numbers (24–29) are examples of subordinated temporal clauses. An
example of a temporal clause without a subordinating conjunction is (30).
(24) Mosul (Jastrow 1979: 70/33)
nǝ-šʕǝl ᵊb-baṭn-u ḥatta y-ṣīġ aḥmaġ
ipf .1 pl -light in-belly-3 sg . m until ipf .3 sg . m -become red
‘We heat its inner part until it gets red-hot.’
(25) Mosul (Jastrow 1979: 72/35)
n-ṣǝff-u … īlǝmmǝn-mā nǝ-nzal lī-ǧawwa
ipf .1 pl -line.up-3 sg . m until-prtcl ipf .1 pl -descend to-below
‘We put them in a row … until we reach the bottom line.’
(26) Mosul
lǝmmǝn ṣāġ ǝḏ-̣ ḏǝ̣ hǝġ ġāḥ ʕǝnd
until pf .3 sg . m -become def -noon pf .3 sg . m -go at
ǝl-kabābči
def -kebab_seller
‘When it was noon he went to the kebab seller.’
(27) Mosul (Jastrow 1979: 72/39)
baʕd-mā yǝ-stawi y-ǧīb-ūn ṭaḥīn
after-prtcl ipf .3 sg . m -be.cooked ipf .3-bring-pl flour
‘After it has fully cooked they bring flour.’
(28) Mosul
ʕǝqǝb-mā yǝ-stawi y-qūm-ūn
after-prtcl ipf .3 sg . m -be.cooked ipf .3-stand_up-pl
y-ḥǝṭṭ-ū-l-u ṃāy
ipf .3-put-pl -for-3 sg . m water
‘After it has fully cooked, they put water into it.’
(29) Mosul
qabǝl-mā y-ġōḥ ʕa-l-bēt ə
štaġa təffāḥ
before-prtcl ipf .3 sg . m -go to-def -house buy.pf .3 sg . m apples.coll
‘Before he went home he bought some apples’
(30) Mosul (Jastrow 1979: 68/24)
w tǝ-ǧi s-samak-i w y-ǧǝġġ-ū-ha y-ṣīd-ū-ha
and ipf .3 sg . f -come def -fish-f and ipf .3-pull.out-pl - ipf .3-hunt-
3 sg . f pl -3 sg . f
‘When the fish comes, they pull it out and catch it.’
The following is an example of a causal clause.
(31) Mosul
mā štaġē-tu ṭamāṭa mbūġ kān-ǝt ᵊkṯīġ
neg buy-pf .1 sg tomatoes.coll because be-pf .3 sg . f much
ġālīy-i
expensive-f
‘I did not buy tomatoes because they were too expensive.’
Conditional clauses are introduced either by ʔiḏa (32) or by lō (33).
(32) Mosul (Jastrow 1979: 70/28)
yǝ-tmāsak akṯaġ iḏa ṣṭabar-na ʕalē-nu
ipf .3 sg . m -cohere much.ela if be.patient-pf .1 pl on-3sg.m
šwayya
little
‘It (the dough) coheres better if we are a little bit patient with it.’
(33) Mosul
lō qšəʕ-tī-nu salləm-ī-l-i ʕalē-nu
if see-pf .2. f . sg -3 sg . m greet-imp .2. f . sg -for-1 sg on-3sg . m
‘If you (f . sg ) see him, give him my regards!’
Relative clauses follow the head noun and are introduced by the particle ǝlli if the
head noun is definite.
(34) Mosul
ᵊn-ġōḥ maʿā-hǝm ʿǝnd xālǝt-na ǝlli tǝ-skǝn
ipf .1 pl -go with-3. pl at aunt-1 pl rel ipf .3 sg . f -live
b-ǝmḥall-ǝt ǝl-ǧūlāq
in-quarter-f . constr . st toponym
‘We go with them to our aunt, who lives in the Ǧūlāq neighborhood.’
3. Additional notes on contact-phenomena
We may assume that linguistic interaction of speakers of Kurdish, of Turkoman

(and, until the 1920s, of Ottoman Turkish), and of Bedouin and sedentary Arabic
is a matter of everyday life in most of the towns and villages of the region in ques-
tion. As far as vernacular Arabic is concerned, these contacts have mainly influ-
enced the lexicon, particularly terms of material culture, such as food, housewares,
tools, and the like.23
A direct result of these borrowings is the establishment of sounds in the ver-
nacular which were originally unknown to Arabic, particularly /p/, /g/, and /č/.
Examples of words with these sounds are pāča ‘stew made of the head, feet, and
neck of an animal’, parda ‘curtain’, lagan ‘metal wash basin’, glāṣ ‘glass’, ṣūč
‘fault’, čāy ‘tea’.24
In the dialect of the Iraqi Xawētna (who mainly reside in al-Baʕʕāǧ), the sounds
/č/ and /g/ are not only found in foreign loans, but also due to the influence of the
Bedouin dialects, where they go back to the OA consonants k and q (Talay 1999:
22, 31). Talay (1999: 19, 22) emphasizes the strong Bedouin element in Southern
Xawētna Arabic, which certainly includes its Iraqi branch.
We may assume that calques are numerous and that there is a good deal of
mutual influence in phraseology. A prominent example is the phrase ‘What’s the
news?’ (35), which is an exact calque of its Turkish equivalent.25 In-depth studies
on the lexicon of the languages involved could certainly detect many other such
loan translations.
(35) a. Mosul
ʔaš aku mā-ku?
q .what there.is there.is.not
23
Examples can be found in the texts published by Jastrow 1979 and 2004 as well as in
the dictionary by Bakrī 1972.
24
As can be seen from the examples, these sounds are also found in loanwords taken from
languages from outside the region, particularly English.
25
Jastrow (1990: 13) states that the Turkish of the Ottoman culture had more influence on
the Jewish Arabic dialects than did the local Azeri (also known as Turkoman, see Bulut,
this volume, chapter 3.5).
b. Turkish
ne var ne yok
q .what there.is q .what there.is.not
‘What’s the news?’
The influence of Iranian and Turkic languages on the morphology26 and syntax27 of
Northern Iraqi Arabic seems to be marginal or even non-existent. One reason for
this might be that—at least in cities like Mosul—Arabic speakers are rarely bilin-
gual, a fact which prevents far-reaching influence on morphological and syntacti-
cal structures. There are, however, no extended corpora of recordings of unmon-
itored everyday conversations and it is entirely possible that such texts would
reveal a good deal more contact phenomena than has hitherto been discerned.
A possible influence of Kurdish in the dialect of the village of Bǝḥzāni28 is
the shift of the three interdental fricatives /ḏ/, /ṯ/, /ḏ/̣ to the sibilants /z/, /s/, /ẓ/,
which resembles very much the substitution of Arabic loanwords in Kurdish (and
Turkish). Whether or not the analogous shift to dental stops in Rabīʕa is the result
of a contact with Syrian sedentary dialects is hard to say. The town is very close to
the Syrian border, though all adjacent dialects have interdentals. Hence one would
have to assume that this shift has been caused by one of the urban Syrian prestige
varieties (Damascus, Aleppo). This is not impossible as in the very same dialect
the typical Syrian verbal modifier b- is also found: e. g. b-tākǝl ‘you eat’ (Abu
Haidar 2004: 10).
4. Glossed text sample
The text was provided by a male speaker of Mosul Arabic, born approximately in
1942. The recording was made by Hiam Wadie in May 2004, who also supplied an
initial rough gloss, later modified for the present purposes.
(36) ǧā š-šēṭān ʕa=l-maṣlāwi
come.pf .3 m . sg def -devil on=def -Maṣlāwi
w=qal=l-u
and=say.pf .3 sg . m = to -3 sg . m
‘The devil came to the Maṣlāwi and said to him,’
26
An example of influence on word formation is the productive use of the Turkish suf-
fixes -li, -lik, -siz, and -çi (cf. Masliyah 1996).
27
For an Aramaic influence on syntax see above, section 4.3.
28
For examples see Jastrow 1978: 36–38.
(37) ʔa-ġīd a-tšārǝk maʕā-k bǝ=hāḏa l-mūsǝm

1 sg -want.ipf 1 sg -share.ipf with-2 sg . m in=dem . m . sg def -season
bǝ=z-zaġǝʕ
in=def -crop
‘“This season I want to share the crop with you.”’
(38) qal=l-u l-maṣlāwi ʔahlan w=sahlan bī-k
‘say.pf .3 sg . m =to-3 sg . m def -Maṣlāwi welcome in-2 sg . m
‘The Maṣlāwi said, “Welcome!”’
(39) wǝ=ttafaq maʕa š-šēṭān ǝz-zaġǝʕ
and=agree.pf .3 sg . m with def -devil def -crop
yǝ-t.wazzaʕ bēn-āt-ǝm
3 sg . m -pass .devide.ipf between-pl -3 pl
‘And he came to an agreement with the devil that the planted crop will be
divided among them (as follows):’
(40) ǝl-fōq-āni mǝn ǝz-zaġǝʕ lǝ=l-maṣlāwi
def -above-nmlz from def -crop to=def -Maṣlāwi
wǝ=t-tǝḥt-ēni lǝ=š-šēṭān
and=def -under-nmlz to=def -devil
‘The upper part of the planted crop for the Maṣlāwi and the lower part for
the devil.’
(41) w=wāfaq ǝš-šēṭān w=ġāḥ zaġaʕ
and=agree.pf .3 sg . m def -devil and=go.pf .3 sg . m plant.pf .3 sg . m
ǝl-maṣlāwi ḥǝnṭa
def -Maṣlāwi wheat
‘The devil agreed and the Maṣlāwi went to plant wheat.’
(42) w=lǝmmǝn ṣāġ ǝl-ḥaṣād ǧā
and=when become.pf .3 sg . m def -harvest come.pf .3 sg . m
š-šēṭān dǝ=y-āxǝḏ ḥǝṣṣǝt-u
def -devil purp =3 sg . m -take.ipf share-3 sg . m
‘When it was (the time) of the harvest, the devil came to take his share.’
(43) ġǝšǝʕ ǝl-maṣlāwi kǝn=axaḏ ǝs-sǝnbǝl
see.pf .3 sg . m def -Maṣlāwi aux =take. pf .3 sg . m def -ears
w=xallā=l-u t-tǝbǝn
and=leave.pf .3 sg . m =to-3 sg . m def -straw
‘He saw that the Maṣlāwi had taken the ears and had left him the straw.’
(44) ǝš-šēṭān axaḏ=l-u ṣafni

def -devil take. pf .3 sg . m =to-3 sg . m meditation
w=qal=lu lǝ=l-maṣlāwi
and=say.pf .3 sg . m = to -3 sg . m to= def -Maṣlāwi
‘The devil pondered and (then) said to the Maṣlāwi,’
(45) ǝt-ġīd nǝ-t.šārǝk marrt=ǝl-lǝx ʕa=s-sani
2 sg . m -want.ipf 1 pl - recp .share time=def -other on=def -year
lǝ-ǧdīd-i
def -new-f . sg
‘“Do you want us to share another time the next year?”’
(46) wǝ-l-maṣlāwi ġadd ʕalē-nu

def -Maṣlāwi come.back.pf .3 sg . m on-3 sg . m
w=qal=l-u ʔē mā=yǝ-nšǝt ʔana
and=say.pf .3 sg . m =to-3 sg . m yes neg =3 sg . m -ask. ipf 1 sg
ḥāḏǝ̣ ġ
to.be.ready.ptcp
‘The Maṣlāwi responded to him, “Yes, why not? I’m ready!”’
(47) w=ǝš-šēṭān ʕǝmǝl nafs-u šāṭǝġ
and=def -devil make.pf .3 sg . m refl -3 sg . m clever
ʕa=l-maṣlāwi w=qal=l-u
on= def -Maṣlāwi and=say.pf .3 sg . m =to-3 sg . m
‘The devil fancied himself to be more clever than the Maṣlāwi and said to
him,’
(48) ba.lākin ha-l-marra ʔana ʔa-ġīd āxǝḏ
but dem - def -time 1 sg 1 sg -want.ipf 1 sg .take.ipf
ǝl-fōq-āni mǝn ǝz-zaġǝʕ w=lǝ-k ǝǧ-ǧaww-ēni
def -above-nmlz from def -crop and=to-2 sg . m def -under-nmlz
‘“But this time I want the upper part of the planted crop and you will
get the lower part.”’
(49) qǝbǝl ǝl-maṣlāwi w=ġāḥ ǝš-šēṭān
accept.pf .3 sg . m def -Maṣlāwi and=go.pf .3 sg . m def -devil
‘The Maṣlāwi accepted and the devil went away.’
(50) w=ǧā baʕǝd sani dǝ=y-āxǝḏ
and=come.pf .3 sg . m after year purp =3 sg . m -take.ipf
ḥǝṣṣǝt-u mǝn ǝz-zaġǝʕ
share-3 sg . m from def -crop
‘After a year (the devil) came to take his share of the crop.’
(51) ġǝšǝʕ ǝl-maṣlāwi kǝn=zaġaʕ baṣal

see.pf .3 sg . m def -Maṣlāwi aux =plant. pf .3 sg . m onions
‘He saw that the Maṣlāwi had planted onions.’
(52) ǝl-maṣlāwi ʔaxaḏ ḥǝṣṣǝt-u ǝt-taḥt-ēnī-yi
def -Maṣlāwi take.pf .3 sg . m share-3 sg . m def -under-nmlz - f . sg
ǝl-baṣal w=baqa lǝ=š-šēṭān bass ǝl-ǝʕġūš
def -onions and=remain.pf .3 sg . m to=def -devil only def -leaves.pl
‘The Maṣlāwi took his share, the lower part, the onions, and for the devil
only the leaves remained.’
Abbreviations
ap Active participle mod Modifier

coll Collective neg Negation
constr . st Construct state nmlz Nominalizer
cop Copula pass Passive
def Definite article pf Perfective
dem Demonstrative pl Plural
dual Dual prtcl Particle
ela Elative ptcp Participle
f Feminine purp Purposive
gen . marker Genitive marker refl Reflexive
ipf Imperfectiv sg Singular
m Masculine
References
Abu-Haidar, Farida. 2002. Negation in Iraqi Arabic. In Werner Arnold & Hartmut Bobzin
(eds.), Sprich doch mit deinen Knechten aramäisch, wir verstehen es!”. 60 Beiträge zur
Semitistik. Festschrift für Otto Jastrow zum 60. Geburtstag, 1–13. Wiesbaden: Harras-
sowitz.
Abu Haidar, Farida. 2004. The Arabic of Rabīʿa: A qǝltu dialect of north-western Iraq. In
Martine Haak, Rudolf de Jong & Kees Versteegh (eds.), Approaches to Arabic dialects:
Blanc, Haim. 1964. Communal dialects in Baghdad. Cambridge: Center for Middle Eastern
Studies of Harvard University.
Contini, Riccardo. 1999. Le substrat araméen en néo-arabe libanais : Préliminaires à une
enquête systématique. In Marcello Lamberti & Livia Tonelli (eds.), Afroasiatica Terg-
estina: Papers from the 9th Italian Meeting of Afro-Asiatic (Hamito-Semitic) Linguis-
tics, Trieste, 23–24 April, 1998, 101–122. Padova: Unipress.
Holes, Clive. 2007. Colloquial Iraqi Arabic. In John Nicholas Postgate (ed.), Languages of
Iraq, ancient and modern, 123–134. British School of Archaeology in Iraq.
Jastrow, Otto. 1978. Die mesopotamisch-arabischen qǝltu-Dialekte, vol. 1, Phonologie und

Morphologie. Wiesbaden: Steiner.
Jastrow, Otto. 1979. Zur arabischen Mundart von Mossul. Zeitschrift für Arabische Linguis-
tik 2. 36–75.
Jastrow, Otto. 1990. Der arabische Dialekt der Juden von ʿAqra und Arbīl. Wiesbaden:
Harrassowitz.
Jastrow, Otto. 2004. Jüdisches, christliches und muslimisches Arabisch in Mossul. In Mar-
Khan, Geoffrey. 1984. Object markers and agreement pronouns in Semitic languages. Bul-
letin of the School of Oriental and African Studies 47. 468–500.
Khan, Geoffrey. 1997. The Arabic Dialect of the Karaite Jews of Hīt. Zeitschrift für arabis-
che Linguistik 34. 53–102.
Masliyah, Sadok. 1996. Four Turkish suffixes in Iraqi Arabic: -li, lik, siz and -çi. Journal of
Semitic Studies 41. 291–300.
Morony, Michael G. 2005. Iraq after the Muslim Conquest, 2nd ed. Piscataway, NJ: Gorgias
Press.
Procházka, Stephan. 2006–2007. Does geographical periphery imply linguistic periphery?
The examples of the Arabic dialects of Cilicia and Urfa in southern Turkey. Romano-
Arabica 6–7. 109–132.
Socin, Albert. 1882–1883. Der arabische Dialekt von Mossul und Märdin. Zeitschrift der
Deutschen Morgenländischen Gesellschaft. 36: 1–53, 238–277; 37: 293–318.
Talay, Shabo. 1999. Der arabische Dialekt der Khawētna. 1. Grammatik. Wiesbaden: Har-
rassowitz.
Weninger, Stefan. 2011. Aramaic-Arabic language contact. In Stefan Weninger (ed.), The
Semitic languages: An international handbook, 747–755. Berlin & Boston: De Gruyter
Mouton.
Additional references
Abu-Haidar, Farida. 1989. Maintenance and shift in the Christian Arabic of Baghdad.
Zeitschrift für Arabische Linguistik 21. 47–62.
Eksell, Kerstin. 2006. Analytic Genitive. In Kees Versteegh (ed.), Encyclopaedia of Arabic
Holes, Clive. 1995. Community, dialect and urbanization in the Arabic-speaking Middle
East. Bulletin of the School of Oriental and African Studies 58. 270–287.
Jankowski, Henryk. 1989. Word order in Iraqi Arabic. Lingua Posnaniensis 32–33. 109–
112.
Jastrow, Otto. 1981. Die mesopotamisch-arabischen qǝltu-Dialekte, vol. 2, Volkskundliche
Texte in elf Dialekten. Wiesbaden: Steiner.
Jastrow, Otto. 1983. Tikrit Arabic verb morphology in a comparative perspective. Al-Ab-
hath 31. 99–110.
Jastrow, Otto. 1988. Zum arabischen Dialekt der Juden von Arbīl. Zeitschrift für Arabische
Jastrow, Otto. 1989. Notes on Jewish Maṣlāwi. Jerusalem Studies in Arabic and Islam 12.
282–293.
Jastrow, Otto. 1990. Die arabischen Dialekte der irakischen Juden. In Werner Diem & Ab-
doldjavad Falaturi (eds.), XXIV. Deutscher Orientalistentag vom 26. bis 30. September
1988 in Köln. Ausgewählte Vorträge, 199–205. Stuttgart: Steiner.
Jastrow, Otto. 1991. Brotbacken: Ein Text im arabischen Dialekt der Juden von Mossul in
memoriam Ezra Laniado. Zeitschrift für Arabische Linguistik 23. 7–13.
Jastrow, Otto. 1992. Der arabische Dialekt der Juden von Kirkuk. Jerusalem Studies in
Arabic and Islam 15. 240–254.
Jastrow, Otto. 1993. Geschichten aus Sǝndōṛ. Zeitschrift Für Arabische Linguistik 25. 161–
177.
Jastrow, Otto. 1994. The qǝltu dialects of Mesopotamian Arabic. In Jorge Aguadé, Federico
Corriente & Marina Marugán (eds.), Actas del congreso internacional sobre interferen-
cias lingüísticas arabo-romances y paralelos extra-iberos, 119–123. Zaragoza.
Jastrow, Otto. 2001. Three anecdotes in Jewish Maslawi. In Judith Rosenhouse & Ami
Elad-Bouskila (eds.), Linguistic and cultural studies on Arabic and Hebrew. Essays
presented to Professor Moshe Piamenta for his eightieth birthday, 61–70. Wiesbaden:
Harrassowitz.
Jastrow, Otto. 2005. Linsen mit Reis – und doch keine mžaddaṛa! Jüdisch-arabische
Kochrezepte aus Mossul. In Thomas Bauer & Ulrike Stehli-Werbeck. (eds.), Alltag-
sleben und materielle Kultur in der arabischen Sprache und Literatur: Festschrift für
Heinz Grotzfeld zum 70.Geburtstag, 223–232. Wiesbaden: Harrassowitz.
Jastrow, Otto. 2006. Where do we stand in the research on the Anatolian qeltu dialects?
Romano-Arabica 6–7. 63–69.
Jastrow, Otto. 2007. Iraq. In Kees Versteegh (ed.), Encyclopaedia of Arabic Language and
Linguistics, vol. 2, 414–424. Leiden & Boston: Brill.
Krotkoff, Georg. 1991. Remarks on the Iraqi-Arabic lexicon: problems and characteristics.
In Alan S. Kaye (ed.), Semitic studies in honor of Wolf Leslau on the occasion of his
eighty-fifth birthday, November 14th, 1991, 886–889. Wiesbaden: Harrassowitz.
Lecerf, Jean. 1967. Regards sur l’état présent des études sur les dialectes arabes de Bagdad.
In Atti del terzo Congresso di studi arabi e islamici, Ravello 1966, 443–459. Naples:
Istituto Universitario Orientale.
Levin, Aryeh. 1994. The role of the Mesopotamian qǝltu dialects in the history of the Arabic
dialects of Iraq. In Dominique Caubet & Martine Vanhove (eds.), Actes des premières
journées internationales de dialectologie arabe de Paris. Colloque international tenu à
Paris du 27 au 30 janvier 1993, 317–332. Paris: Publications Langues’O.
Levin, Aryeh. 2000. The ʾimāla in the modern qǝltu dialects. In Abderrahim Youssi, Fouzia
Benjelloun, Mohamed Dahbi & Zakia Iraqui-Sinaceur (eds.), Aspects of the dialects of
Arabic today. Proceedings of the 4th Conference of the International Arabic Dialectol-
ogy Association (AIDA), Marrakech, Apr. 1–4, 2000, 98–104. Rabat: Amapatril.
Mansour, Jacob. 1985. The particle baqa in the Judaeo-Arabic dialect of Baghdad. Zeitschrift
für Arabische Linguistik 14. 62–75.
Mansour, Jacob. 2001. The particles ʾaš/ʾeš and ʾášu in the Jewish Arabic of Baghdad.
Masliyah, Sadok. 1997. The diminutive in spoken Iraqi Arabic. Zeitschrift für Arabische
Melcer, Ioram. 1995. The analytical genitive in the Jewish-Baghdadi Arabic dialect.
Müller-Kessler, Christa. 2003. Aramaic ‘k’, lyk’ and Iraqi Arabic ‘aku, māku: the Mesopo-
tamian particles of existence. Journal of the American Oriental Society 123. 641–646.
Retsö, Jan. 1987. Copula and double pronominal objects in some Semitic Languages.
Retsö, Jan. 2004. Relative-clause marking in Arabic dialects: a preliminary survey. In Mar-
a collection of articles presented to Manfred Woidich on the occasion of his sixtieth
Rubin, Aaron D. 2005. Studies in Semitic grammaticalization. Winona Lake: Eisenbrauns.
Salonen, Erkki. 1980. On the Arabic dialect spoken in Širqāṭ (Assur). Helsinki: Suoma-
lainen Tiedeakatemia.
Zemer, Hila. 2008. The copula in the Christian Arabic dialect of Baghdad – possible origins.
In Stephan Procházka & Veronika Ritt-Benmimoun (eds.), Between the Atlantic and
Indian Oceans: Studies on contemporary Arabic dialects. Proceedings of the 7th AIDA
Conference, 505–514. Vienna & Münster: LIT.
3.3. The Iranian languages of northern Iraq
Geoffrey Haig
1. Introduction
The Iranian languages currently spoken in northern Iraq can be assigned to three
main groups: Behdinī, the local name for the varieties of Northern Kurdish (or
Kurmanjî) spoken in Iraq; Sorani Kurdish (used here synonymously with Central
Kurdish), and Gorani.1 The latter subsumes Hawrāmī of the Halabja region,
together with a cluster of other dialects that go under various names. All three
groups are traditionally classified as belonging to the northwestern branch of
Iranian. The approximate locations of Northern and Central Kurdish are provided
in Fig. 1, while Gorani varieties are shown in Fig. 4 below.
Both Behdinī and Sorani are unanimously considered varieties of Kurdish,
while the nature of the relationship between Kurdish and Gorani remains a matter
of controversy. As will become apparent, at least in terms of morphology, the
various varieties of Gorani diverge from Sorani and Behdinī rather systematically,
implying a historically more distant relationship. From the perspective of compar-
ative Iranian philology, then, a distinction is justified. Culturally and in terms of
subjective identity perceptions among the speech communities, on the other hand,
there are reasons for including Gorani within a broader socio-cultural notion of
“Kurdish” (see Haig and Öpengin 2014 on the concept of “Kurdish”).
Among the three groups, the most important in terms of prestige, degree of
standardization, media representation, and number of speakers, is Sorani (Mac-
Kenzie 1961, 1962), spoken by around three million speakers in Iraq, with a further
three million in neighbouring regions of Iran, and further north to the shores of
Lake Urmi (also spelled Urmiye). The least-well documented languages are the
various varieties of Gorani. In particular, very little reliable information is availa-
ble on the fragmented groups of dialects spoken westward of the Hawrāmī region,
variously referred to as Kakaʼī, Šabak, Sarlī, or Bāĵaɫānī (see §5). The majority of
Sorani and Behdinī speakers are Sunnite Muslims, with the exception of the Ezidi
communities among the Behdinī speakers, while Gorani speakers are generally
affiliated with heterodox, or non-Islamic, religious beliefs.
1
There is little agreement with regard to the spelling conventions for language names,
and this overview makes no claims to consistency. Generally the form that is most wide-
ly-used in English publications has been chosen, avoiding diacritics and non-standard
characters, but in the case of lesser-known varieties, a form more closely representing
a transcription is chosen.
268
Geoffrey Haig
Figure 1: Approximate locations of Northern and Central Kurdish varieties mentioned in this chapter
The Iranian languages of northern Iraq 269
This chapter gives a brief overview of the main structural features of each, together
with some comparative notes. Regrettably, since the pioneering work of MacKen-
zie on Kurdish dialects of Iraq, undertaken in the 1950ʼs, and Blauʼs study of
Behdinī (1975), very little research on the vernaculars has been undertaken, and
the coverage here is correspondingly uneven.
2. Background to the speech communities
When compared to Neo-Aramaic, the degree of early linguistic attestation of the

Iranian languages of Iraq is meagre. For Kurdish generally, textual attestation does
not extend farther back than around the 16th century. Nor can the predecessor of
Kurdish be equated with any of the historically attested Western Middle Iranian
languages. Standard accounts of the history of the Kurds (McDowall 2004) locate
Kurdish tribes in Mesopotamia from the outset of the Islamic expansion in the
seventh and eighth centuries, prior to the establishment of larger Kurdish princi-
palities in the Jazira in the 10th and 11th centuries (McDowall 2004: 23; see also
Haig, this volume, chapter 2.3, §1.1). Up until the 19th century, the destiny of
the Kurdish groups in the region was shaped by the shifting fortunes of Ottoman/
Iranian rivalry, with the majority of Kurdish tribes supporting the former, while
never entirely abandoning their independence. Two major groupings can be iden-
tified: the Ardalan principality, and the Baban Kurds. The Ardalan controlled
large amounts of territory both eastward and westward of the Zagros mountains in
the fourteenth and fifteenth centuries, with their centre located at Senna (todayʼs
Iranian city Sanandaj). Their allegiance lay initially with the Iranian rulers, though
they shifted to the Ottomans over the 16th and 17th centuries, and thus retained
some influence in Ottoman-dominated regions of todayʼs northern Iraq. The Arda-
lans are associated with the Gorani language, and with the Ahl-i Haqq (or Yaresan)
religion, which accounts for the presence of groups affiliated with these beliefs
in todayʼs northern Iraq (see §5). There is a considerable corpus of poetry in the
written koiné form of Gorani, dating back to the 14th century (MacKenzie 2002),
while written prose is almost completely lacking. The other major tribal power,
rivals to the Ardalan, was the Baban (Bābān) confederacy, centred around Sule-
maniyya, which became a centre of Kurdish cultural and linguistic development.
Primarily allied with the Ottomans, they maintained a dominant role in todayʼs
northern Iraq up until the 19th century, when the Babans were largely disbanded
by the Ottomans during their efforts at reformation. This effectively put an end to
the Kurdish emirates as major political and military actors in the region (McDow-
all 2004: 47). The language of the Babans was primarily Sorani.
Among the most fascinating aspects of northern Iraqʼs linguistic landscape
is the admixture of heterodox Islamic, and non-islamic groups concentrated in
the region north and east of Mosul. Along with the Christian and Jewish NENA
270 Geoffrey Haig
communities (see Khan, this volume, chapter 3.4), the region also hosts Northern
Kurdish-speaking Ezidī communities as well as a range of Yaresan-affiliated reli-
gious groups, and others exhibiting what Leezenberg (1994) refers to as “pre- and
non-islamic belief elements”. However, their histories remain shrouded in mystery,
as they “developed outside the major centers of the Islamic world” (Leezenberg
1994), and tend, through necessity, to maintain a low profile. The ravages of the
Islamic State, which effectively occupied the city of Mosul from 2014–2017, but
which had been persecuting non-orthodox religious groups for several years pre-
viously, has now largely obliterated much of the historical diversity of this region.
Sorani and Behd. are both widely spoken in their respective regions, are widely
used in broadcasting and internet media, are acquired by children as a first lan-
guage, and are represented in the education system of the Autonomous Kurdish
Region of Iraq, though to differing degrees. Of the two, Sorani has the highest
prestige, and the best-established written tradition (which uses a modified version
of the Arabic script). Behd., on the other hand, has a more troubled status: it is
the less prestigious variety of Kurdish in Iraq, thus ranges behind both Sorani
and Arabic in terms of overall status, nor does it profit from the emergent stand-
ardisation of Kurmanjî, which is based on a Roman-alphabet writing system and
propagates a standard form that differs quite significantly from Behd. (see Haig
and Mustafa, in press). The Gorani varieties, on the other hand, have virtually no
official status, and must be considered highly endangered.
3. Sorani (Central Kurdish)
Sorani, also known as Central Kurdish (CK) refers to a range of dialects spoken in
Iraq and the bordering regions of Iran (see Fig. 1 in Haig, this volume, chapter 2.3).
It has been quite extensively documented and analysed in grammatical descrip-
tions and dictionaries, and has a well-established written standard language, using
a modified version of the Arabic script, based on the dialect of the northern Iraqi
city of Sulemaniyya (see Hassanpour (1992: Ch.8) on the emergence of standard-
ized Sorani). A structural overview of Sorani, based on the Sulemaniyya dialect,
is provided in McCarus (2009), while Öpengin (2016) provides a comprehensive
linguistic analysis of the dialect of Mukri, spoken near the city of Mahabad in
West Iran. The most detailed treatment from the perspective of dialectal variation
remains MacKenzie (1961 and 1962), covering several Central Kurdish dialects
spoken in northern Iraq. This overview focusses on the spoken vernacular, rather
than the standard language, and draws primarily on MacKenzie (1961 and 1962),
McCarus (2009), and Öpengin (2016).
3.1. Phonology
3.1.1. Consonants
According to MacKenzie (1961: 1), the Sulemaniyya dialect has 28 consonant
phonemes, shown in Table 1.
Table 1: Sulemaniyya Sorani consonant phonemes

Labial Alveo- Post- Palatal Velar Uvular Pharyn. Glott.
dental alv.
Stop pb td kg q (ʔ)
Affricate ʧʤ
Nasal m n ŋ
Fricative fv sz ʃʒ xɣ ʕħ h
Tap/Trill ɾ/r
Approximant
Lateral l ɫ
Glide w j
Sorani lacks the phonemic three-way stop distinction typical of Northern Kurdish
(voiceless aspirated / voiceless non-aspirated / voiced, see Haig, this volume,
chapter 2.3). Pharyngealized segments outside of the pharyngeal fricatives are not
included as phonemes here, although regularly realized in Arabic loans and in
varying degrees in the inherited lexicon. An important feature of Sulemaniyya
Kurdish is the so-called ‘Zagros dʼ. This refers to a general instability of /d/
when following a sonorant, except /r/. The result may be the lenition of /d/ to a
vocalic segment, or assimilation to a preceding sound, or loss: birīndar [bɾi:nna:ɾ]
‘woundedʼ; bad [baə] ‘badʼ. In other environments, /d/ is preserved: dast [dast]
‘handʼ, kird [kɨrd] ‘didʼ. It is noteworthy that the stem initial /d/ of one of the
most frequent verbs, dān ‘giveʼ, is regularly lenited, even when not preceded by a
sonorant. Presumably this is due to the fact that most inflected forms of this verb
involve a sonorant preceding the stem (e. g. a-y-da-m ind -3s.P-give.prs-1s [ɛ͡i.jɛm]
‘I give itʼ (MacKenzie 1961: 3), and the resulting lenition has generalized to all
forms of this verb. Dialectal variation in phonology affects mostly allophonic real-
ization rather than the underlying phonemic system. Of note is the realization of
/k, g/ as the affricates [ʧ, ʤ] respectively, and the fronting of the original affri-
cates to [tɕ, dʑ] (alveo-palatal affricates), notable in Bingird and Piždar dialects
(MacKenzie 1961: 24–25). In fact palatalization of /k, g/ before front vowels is
widespread across Sorani, and yields affricates in many of the dialects north of
Sulemaniyya (e. g. in Mukri, Öpengin 2016).
272 Geoffrey Haig
3.1.2. Vowels
According to McCarus (2009: 591), Sulemaniyya Central Kurdish has the nine
vowel phonemes provided in Fig. 2.
Figure 2: Vowels of Sorani
MacKenzie (1961: 1) gives a slightly different account of the vowels, which

includes a mid front rounded [ø], and only one short unrounded central vowel.
The first could be analysed as [w] plus a front vowel (as in McCarus 2009), but
see Öpengin (2016) for a different view. The central unrounded [ɪ] in Fig. 2, not
included in MacKenzie (1961), is mainly found in Arabic loans such as [ɪmtɪḥān]
‘examinationʼ (McCarus 2009: 591), and is thus placed in brackets here. The
vowel shown here as [a] has a broad range of realizations, most frequently as
schwa (before glides within the same syllable), [æ], [ɛ] before [j] in the following
syllable [hɛ.jə] ‘there isʼ, and low central [a] adjacent to pharyngealized [ṣ], as in
[šaṣt] ‘sixtyʼ (McCarus 2009: 591). In keeping with the conventions used below
for Behdinī and those of Öpengin (2016) for the Mukri dialect of Central Kurdish,
but unlike MacKenzie (1961) and McCarus (2009), I transcribe it here with <e>.
Syllable onsets are relatively complex for a West Iranian language; one might
conjecture Semitic influence here. According to McCarus (2009: 593), any com-
bination of two consonants is permitted in syllable onsets, though certain restric-
tions obtain (e. g. stops are only licensed as second consonants when the first is
also a stop): ktēb ‘bookʼ, tfeŋ ‘rifleʼ, xrāp ‘badʼ. However, these complex onsets
can generally be analysed as resulting from elision of underlying [ɨ], still audible
in careful speech. Nevertheless, it is striking that Sorani speakers evidently have
no difficulty articulating such complex onsets, which are undoubtedly the norm
in natural speech. CCC-onsets are not reported (the cases of CCC-clusters in
McCarus (2009: 593) either involve an intervening syllable boundary, or an initial
glide). Syllable-internal vowel combinations are avoided within individual lexical
items, but may arise through combinations at phrase level.
3.2. Morphosyntax
Although Central Kurdish is regularly considered a ‘dialectʼ of ‘Kurdishʼ, the dif-
ferences between Central and Northern Kurdish in the realm of morphology are
considerable, and heavily impede mutual intelligibility (Öpengin and Haig 2014;
Haig and Öpengin 2018). In contrast to Northern Kurdish, most of Central Kurdish
(i) lacks grammatical gender on nouns; (ii) lacks morphological exponence of
structural case; (iii) has a suffixed marker of definiteness -eke; (iv) has a dis-
tinct paradigm of clitic personal pronouns, not derivable from the free pronouns;
(v) has an affixal passive marker. In the Sulemaniyya dialect, the passive marker
attaches to the present stem of the verb, and then secondarily differentiates tense:
kuštin ‘killʼ (infinitive) > kuž- (present stem) > kuž-rē- (passive, present), kuž-rā-
(passive, past). The realization of passive morphology varies considerably across
Central Kurdish (MacKenzie 1961: 118–119).
From differences (i) and (ii), it follows that the Ezafe particle is undifferen-
tiated for gender; it is also undifferentiated for number. Structural case relations
(S, A, P, and possessive) are carried via word-order (basically SV, AOV, N-pos-
sessor within the NP), rules of agreement morphology that are sensitive to gram-
matical relations, and the clitic pronouns. Other case relations (local, non-core
arguments such as instruments or comitatives etc.) are expressed through a rich
inventory of pre- and circumpositions. As in most of West Iranian, verbs consti-
tute a small lexical class, which is primarily expanded through complex predi-
cate-formation (non-verbal element plus a light verb, for example kirdin ‘doʼ, dān
‘giveʼ, būn ‘be, becomeʼ, birin ‘takeʼ, kewtin ‘fallʼ, hātin ‘comeʼ). The non-verbal
element may be transparently related to a lexical item that occurs in other con-
texts (e. g. teslīm kirdin ‘surrender, hand overʼ, or it may be a particle of uncertain
word-class membership, as in hāɫ wāsīn ‘hang upʼ, with the particle hāɫ indicating
approximately ‘upwardʼ).
3.2.1. Nominal morphology

From a broader Iranian perspective, one of the most striking features of Sorani
nominal morphology is the suffixal expression of definiteness. While indefinite-
ness suffixes are widespread across west Iranian, only a small pocket of Iranian
languages spoken in northern Iraq and neighbouring regions of Iran have developed
definiteness suffixes in addition to the indefiniteness markers. Table 2 (adapted
from McCarus 2009: 598) gives an overview of the morphological marking of
definiteness.
274 Geoffrey Haig
Table 2: Indefiniteness and definiteness in Sorani

generic , under - I ndefinite D efinite P roxal / distal
specified for number demonstrative
sg pyāw ‘man, menʼ pyāw-ēk ‘a manʼ pyāw-eke em / ew pyāw-e

‘the manʼ ‘this/that manʼ
pl pyāw-ān ‘menʼ pyāw-ek-ān em / ew pyāw-ān-e
‘the menʼ ‘these/those menʼ
This definiteness suffix, with reflexes across Gorani and in Southern Kurdish, is
of considerable interest from a diachronic perspective. Cross-linguistically, defi-
niteness markers (often termed articles) are often the result of grammaticalization
of some earlier independent deictic element (typically a demonstrative, see e. g.
Skrzypek 2010). But for Sorani, no plausible candidate source element for such
a grammaticalization process is available. Rather, it appears the suffix goes back
to a derivational suffix, widely attested in the Western Middle Iranian languages
Middle Persian and Parthian; see Jahani (2015) and Haig (in press) for discus-
sion. It is nevertheless worth noting that the definiteness suffix is not equivalent
to, for example, the definite article in English: it is not used with unique refer-
ents, like the sun. Furthermore, it is not even consistently present on nouns that
have discourse-recoverable referents. The available original texts have numerous
examples of non-suffixed nouns occuring in contexts where one would expect a
definiteness suffix (for example ‘the letter’ in MacKenzie’s narrative text (1962:
10). I would therefore hesitate to refer to this marker as a ‘definite article’. This is
an area requiring much more research.
In combination with demonstratives, NPs take an additional stressed -é. When
demonstratives are used without complements, they too take the demonstrative
marker (the clitic/suffix distinction is not addressed here), as shown in Table 3.
Table 3: Sorani pronouns, personal and demonstrative

personal demonstrative
1 st pers . 2 nd pers . 3 rd pers . ‘thatʼ ‘thisʼ

sg min tō ew ew-e em-e
pl ēme ēwe ew-ān ew-ān-e em-ān-e
As mentioned, Sorani also has an additional set of clitic personal pronouns. Section
3.2.4 below is dedicated to the morphosyntax of clitic pronouns.
3.2.1.1. Ezafe constructions

NPs are generally head-initial, with attributes following, but demonstratives and
numerals precede the head noun. Post-nominal attributes are linked to the head by
an Ezafe-clitic, of which two kinds can be distinguished:
The phrasal Ezafe =ī (ez )

This is used to freely combine phrasal elements to a complex NP. The depend-
ent (second element) can be a nominal phrase of any category (AP, NP, PP, or
pronoun). The head may carry the indefinite suffix -ēk (cf. Table 3), i. e. the pres-
ence of an attribute does not preclude indefiniteness of the head. The phrasal Ezafe
can be repeated when the head is accompanied by more than one attribute, as in
(4) below:
(1) le xošī=y ew-e
from pleasure=ez that-dem
‘from the pleasure of thatʼ (MacKenzie 1961: 62)
(2) tūtik-ēk=ī pičkole
dog-indf = ez small
‘a little dogʼ (MacKenzie 1961: 63)
(3) xānū=y ēme
house=ez 1 pl
‘our houseʼ (MacKenzie 1961: 63)
(4) kič-ēk=ī ʤwān=ī čwārde sāl
girl-indf = ez beautiful=ez fourteen year
‘a beautiful girl of fourteen yearsʼ (MacKenzie 1961: 63)
The compound Ezafe =e (cez )

This element is used in frequent and partially lexicalized combinations, but also
for freer and evidently not lexicalized combinations. I nevertheless refer to it as
a compound ezafe, following McCarus (2009: 613), for mainly formal reasons:
First, the entire construction may be marked with the definiteness or indefinite-
ness suffixes -eke, or -ēk, while in none of the examples I have encountered is the
head noun itself inflected for definiteness or indefiniteness. There are thus restric-
tions on internal inflection, which are strongly suggestive of compound status.
However, it must be noted that many of the examples cited in MacKenzie (1961:
64) are semantically incongruous for compounds; more research is required on
this topic.
(5) kič=e jwān-eke
girl=cez pretty-def
‘the pretty girlʼ (McCarus 2009: 613)
276 Geoffrey Haig
(6) kilk=e ker-ēk

tail=cez donkey-indf
‘a donkey’s tailʼ (MacKenzie 1961: 65)
(7) ew xēwet=e sewz-e
that tent=cez green-dem
‘that green tentʼ (MacKenzie 1961: 64)
3.2.2. Verbal morphology

As in other West Iranian languages, each verb has two stems, called here the past
and present stem respectively. Examples of stems for common verbs are provided
in Table 4. The infinitive, the traditional citation form for Kurdish verbs, is formed
from the past stem via the addition of -(i)n. In natural speech, infinitives are only
rarely attested, barring certain lexicalized forms.
Table 4: Past and present stems of frequent verbs in Sorani

G loss I nfinitive P ast stem P resent stem
go čūn / řōīštin čū- / řō- č-

come hātin hāt- (h)ē-
be būn bū b-
give dān dā- de-
do, make kirdin kird- ke-
see dīn dī- bīn-
fall keftin, kewtin kewt, keft- kew-
say wutin wut- ɫē-
eat xwārdin xwārd- xō-
Each stem forms the base for a particular set of TAM and negation affixes, con-
stituting a number of distinct paradigms. Very roughly, those based on the past
stem are associated with past time reference, and to some extent with past irrealis
modality, while present stems form indicative present, future, and various kinds of
unrealized (or non-asserted) forms, typically in complement clauses governed by
verbs such as ‘hope’, ‘want’, ‘intend’ etc.
Predicates are obligatorily indexed for person and number of one argument.
Three distinct paradigms can be identified, shown in Table 5. Generally, the
indexed argument is the S or A, but with transitive verbs in the past tense, verbal
affixes may index the P (if it is not otherwise expressed in the clause), or under
specific conditions, an indirect participant, cf. §3.2.4, ex. (18), or may be simply
default third person singular if the P is overtly present in the clause; at this point,
details of person marking vary considerably across and even within dialects, see
Öpengin (2016) for detailed discussion. The copular clitics attach to non-verbal
predicates in the present tenses, but are also used with certain verb forms, most
notably participles (see Table 7 below).
Table 5: Verbal affix and copular person markers in Sulemaniyya Sorani

verbal affixes
C opular clitics
present stem past stem
sg 1 -im -im =im

2 -ī(t), -y(t) -ī(t) =ī
3 -ē(t)2 -Ø =e
pl 1 -īn -īn =īn
2 -in -in =in
3 -in -in =in
3.2.2.1. Verb forms based on the present stem

Finite verb forms based on the present stem of the verb require one of the six
mutually exclusive present-tense TAM and negation prefixes, though in impera-
tives, the prefix may be dropped. The appropriate form of the present stem person
markers (cf. Table 5) is suffixed to the stem. The prefixes are shown in Table 6.
Table 6: TAM and negation prefixes with the present stem

TAM/Negation (abbreviation) Example with present stem of wutin ‘sayʼ
indicative (ind ) e- e-ɫē-m ‘I say, am sayingʼ
negated (neg ) nā- nā-ɫē-m ‘I donʼt sayʼ
subjunctive (subj ) bi- bi-ɫē-m ‘that I sayʼ
subjunctive negated ne- ne-ɫē-m ‘that I not sayʼ
(subj . neg )
imperative (imp ) bi- bi-ɫē (2sg ), bi-ɫē-n (2pl )
with consonant-final stem (e. g. ‘goʼ): bi-č-e (2sg ),
bi-č-in (2pl )
imperative negated (imp . neg ) me- me-ɫē, me-ɫē-n
2
According to MacKenzie (1961: 90), the “euphonic” -t of the second and third person
singular is realized before vowel-initial clitics, or before a pause, but otherwise left
unrealized.
278 Geoffrey Haig
The form of the present indicative is an important isogloss within Central Kurdish,
with Sulemaniyya and Wārmāwa dialects having e-, but elsewhere de- (the former
is adopted for the standardized written language). The forms of the other TAM/
negation prefixes is fairly consistent across all dialects, and have obvious cognates
throughout most of West Iranian. All TAM/negation prefixes are potential hosts for
pronominal clitics, and may thus be separated from the verb stem:3
(8) e=mān=ewē bi-řō-yn
ind =1 pl =be.desirable.prs .3 sg subj -go.prs -1 pl
‘We want to goʼ (McCarus 2009: 620)
(9) bi=y=gir-in
subj =3 sg =hold.prs -2 pl
‘Hold it!ʼ (MacKenzie 1961: 93)
Indicative forms do not distinguish progressive/immediate from habitual senses.
Unlike Northern Kurdish, Sorani lacks a dedicated future marker, with the present
tense used in future contexts, when a reasonable degree of certainty of fulfillment
is implied. The subjunctive verb forms are used for dependent clauses, particularly
following predicates of desire (as in (8) above), ability, and obligation, and more
generally in independent clauses to express events that are not asserted, but are
portrayed as possible, hypothetical, or desired. These may be introduced by modal
particles such as bā ‘letʼsʼ, or an expression such as ḥāyfe ‘itʼs a pityʼ:
(10) bā mināl=ī tō bi-bīn-im
mod child=ez 2 sg subj -see.prs -1 sg
‘Let me see your childrenʼ (MacKenzie 1961: 106)
(11) hāyf=e bi=y=kuž-īn
pity=cop .3 sg subj =3 sg =kill.prs -1 pl
‘Itʼs a pity that we should kill itʼ (MacKenzie 1961: 77)
3.2.2.2. Verb forms based on the past stem

Unlike the present stem, a number of verb forms based on the past stem are unpre-
fixed, including the simple past. However, the past stem is also the base for a
participle, in Sulemaniyya of the form -uw (on MacKenzieʼs (1961: 97) analysis),
e. g. hāt-uw ‘come.pst - ptcpl ʼ, which in turn forms the base for a large number of
secondary verb forms involving copular forms of the person markers (see Table 5).
3
The ability of TAM and negation prefixes (or perhaps proclitics) to host pronominal
clitics is an important isogloss distinguishing Central Kurdish from Southern Kurdish
and Gorani; in the latter, these prefixes are not possible hosts for pronominal clitics.
The analysis of McCarus (2009) and MacKenzie (1961) differ on a number of

points, which cannot be disentangled here. I will only present a selection of those
verb forms, following MacKenzie (1961), which appear to be uncontroversial,
while referring to the sources for more details. Table 7 provides an overview of the
six paradigms introduced below.
I Indicative forms based on the past stem

Ia Simple past
Consists of the past stem plus the past-stem person markers from Table 6.
Ib Past Imperfective
Identical to the simple past, but with the addition of the prefix e- (with dialectal
variants, as discussed above).
Ic Perfect
Based on the participle Past stem -uw, to which the enclitic forms of the copular
person indexes are added.
Id Pluperfect
Based on the past stem, to which an inflected simple past form of the verb būn ‘be’
is attached. Following a stem-final consonant, an epenthetic vowel [ɨ] intervenes
between verb stem and the form of būn (this epenthetic vowel is lacking in dialects
other than Sulemaniyya).
II Subjunctive and conditional forms based on the past stem

IIa The past conditional (MacKenzie 1961: 97)
Based on the simple past form of the verb, to which the subjunctive prefix bi- is
added, and the suffix (clitic?) -āye following the person markers.
IIb Perfect conditional (MacKenzie 1961: 99–100)
Identical to the pluperfect, except that the form of the verb būn ‘be’ which is
attached to the verb is the present subjunctive, not the simple past.
Table 7: TAM forms based on the past stem, illustrated with the verb hātin ‘come’
I ndicative N on -I ndicative
S imple P st . I mpf . P erf . P luperf . C onditional P erf . S ubj .
1 sg hāt-im e-hāt-im hāt-uw-im hāt-i-bū-m bi-hāt-im-āye hāt-i-bim
2 sg hāt-ī(t) e-hāt-ī(t) hāt-uw-ī(t) hāt-i-bū-ī(t) bi-hāt-ī(t)-āye hāt-i-bī(t)
3 sg hāt e-hāt hāt-uw-a hāt-i-bū bi-hāt-āye hāt-i-bē(t)
1 pl hāt-īn e-hāt-īn hāt-uw-īn hāt-i-bū-īn bi-hāt-īn-āye hāt-i-bīn
2/3 pl hāt-in e-hāt-in hāt-uw-in hāt-i-bū-in bi-hāt-in-āye hāt-i-bin
280 Geoffrey Haig
The uses of the different TAM forms conform approximately to the expectations
conveyed by the traditional labels, but see MacKenzie (1961: 133–140) for more
details.
3.2.3. Word order in the simple clause

Word order in Sorani is largely identical to that of Northern Kurdish, both within
the NP and the simple clause: SV, AOV. Like Behdinī, Sorani also makes exten-
sive use of post-predicate goals, generally flagged through a clitic particle =(y)e,
attached to the verb and glossed drct , for the goals of motion verbs. This particle
is probably the historical reflex of an older preposition that flagged the goal NP
itself, but it is now realized as a clitic to the verb:
(12) būk=yān hēnā=ye māl=ewe
bride=3 pl bring.pst = drct home=asp
‘They brought the bride back homeʼ (MacKenzie 1962: 62)
The directional particle is absent, however, when the post-predicate goal is gov-
erned by the prepositions be and bō:
(13) haɫ=ī girt be āsmānā
upwards=3 sg take.pst into sky
‘She took him up into the skyʼ (MacKenzie 1962: 46)
(14) kem=tān čū-w-in bō henār
which=2 pl . poss go.pst - prf -2 pl for pomegranates
‘Which of you went for the pomegranates?ʼ (MacKenzie 1962: 52)
Despite the placement of most goal arguments after the verb, the placement of
direct objects is fairly consistently in pre-verbal position, so that Sorani can still be
characterized as OV. Fronting of an object to a clause-initial position is possible,
but post-posing an object after the verb is scarcely attested in the available texts.
3.2.4. Clitic personal pronouns

The clitic forms of the personal pronouns are a feature common to most of contem-
porary West Iranian, with a few exceptions such as Northern Kurdish, Zazaki, and
Sangesari (Windfuhr 2009: 33). The forms are provided in Table 8.
Table 8: Clitic forms of personal pronouns in Sorani

1 st pers . 2 nd pers . 3 rd pers .
sg =(i)m =(i)t =ī / =y
pl =mān =tān =yān
Within the languages conventionally assigned to ‘Kurdish’ (Haig and Öpengin

2014), the presence versus absence of clitic pronouns is a major isogloss, with
Sorani, Gorani, and Southern Kurdish exhibiting them, while Kurmanjî and Zazaki
lack them. There is a narrow belt of overlap, e. g. the dialect of Surči in North Iraq
(MacKenzie 1961: 222), but on the whole, Kurdish seems either to make abundant
use of the clitic pronouns, or abandon them entirely. In Sorani, they are used in the
following five functions, labeled A-E.
A
Pronominal possessor, where they attach to the possessed NP:
birā-k-ān=im brother-def - pl -1 sg ‘my brothers’ (MacKenzie 1962: 6)
B
Pronominal complement of adpositions:
legeɫ=tān with=2 pl ‘with you(pl )’ (MacKenzie 1962: 10)
C
Direct object of a verb in the present tense:
attaches to the right edge of the first available stress-bearing constituent of the VP
(often a TAM or negation prefix on the verb, e. g. (11) above):
(15) be kuř-e pāšā=y nā-de-yn
to son-ez king=3 sg neg -give . prs -1 pl
‘We are not giving her to the Pashaʼs sonʼ (MacKenzie 1962: 12)
D
The A of a past-tense transitive verb:
attaches to the first stress-bearing constituent of the VP. Three possible hosts,
including the verb itself, are illustrated in (16a–c):
(16) a. min seg-eke=m ne-kušt
1 sg dog-def =1 sg neg -kill.pst (3 sg )
‘I didn’t kill the dog’
b. min ne=m=kušt
1 sg neg =1 sg =kill.pst (3 sg )
‘I didn’t kill (it)’
c. min kušt=im
1 sg kill.pst (3 sg )=1 sg
‘I killed (it)’ (Haig 2017)
If a direct object is fronted to the left of a subject, e. g. for pragmatic purposes, it
cannot host the A-clitic. Because the subject itself is by definition outside of the
VP, in a clause where the object precedes the subject, neither can host the clitic,
which would then typically go to the verb, as in the following:
282 Geoffrey Haig
(17) kuř-ēk=mān seg bird-uw-yat=ī

boy-indf =1 pl . poss dog take.pst - prf -3 sg =3 sg (A- clitic )
‘a boy of ours, the dog took (him)ʼ (MacKenzie 1962: 38)
E
The Indirect Participant (the human, non-affected, non-agentive, non-core argu-
ment of a transitive or intransitive clause, typically a recipient, a benefactive, a
‘wanterʼ, or an external possessor):
If not linked to an adposition, or blocked by another argument-indexing clitic (see
§3.2.4.1), clitic placement is similar to the rule above (first stress-bearing constit-
uent of the VP) but may include subjects:
(18) a. řāw=im bi-der-ewe
quarry=1 sg subj -give.prs .2 sg = asp
‘Give me back my quarryʼ (MacKenzie 1962: 8)
b. min žin=im nā-wē
1 sg woman=1 sg neg -be.desirable.prs .3 sg
‘I don’t want a wife’ (MacKenzie 1962: 6)
c. heyās ū ḥesen memend=im be hīč le des
Heyas and Hasan Mamand=1 sg for nothing from hand
der-čū-n
prv -go.pst -3 pl
‘Heyas and Hasan Mamand have been lost to me for nothingʼ
(MacKenzie 1962: 36)
It needs to be noted that the so-called clitic pronouns are not merely the reduced
form of the corresponding free pronouns (see Table 4). In fact, only in functions
A–C above can they be substituted by the corresponding form of the free pro-
nouns. In function D, they are agreement markers, rather than pronouns, so the
label is actually misleading in these contexts, though I will continue to use it as a
cover term for members of the paradigm.
3.2.4.1. Clitic placement and displacement

The syntax of pronominal clitics is the most complex and theoretically chal-
lenging aspect of Sorani syntax. Considerations of space preclude a full treat-
ment of this topic (see Öpengin 2016, in press, for detailed analysis of close-
ly-related Mukri), but some of the more important principles will be briefly
illustrated here. The complications involve almost exclusively clitics in the “Indi-
rect Participant” function E above. In some cases, they are the complements of
prepositions.
Because the pronouns concerned are clitics, their placement is determined by
the interaction of distinct principles, including rules on clitic placement (e. g. sec-
ond-position), constraints on clitic stacking within a word, and faithfulness prefer-

ences that favour complements being adjacent to their heads.
In general, we expect the complements of adpositions to occur adjacent to the
adposition concerned, as in legeɫ=tān ‘with you(pl )’, i. e. respecting a basic faith-
fulness requirement. However, there are a number of contexts where pronominal
clitics that are adpositional complements are required to move away from their
syntactic heads and be hosted by another element. This is almost obligatory with
some prepositions, when the clause concerned also includes an overt direct object,
and it is not already hosting another argument-indexing clitic:4
(19) čeŋ suʼāl-ēk=it lē bi-ke-m
some question-indf =2 sg to.abs subj -do.prs -1 sg
‘(that) I put some questions to youʼ (McCarus 2009: 617)
The preposition here is the multi-functional le ‘from, for, toʼ, which transforms
into its so-called absolute form lē when its complement is a pronominal clitic. This
rule affects certain prepositions in Sorani: le > lē ‘to, from, forʼ; be > pē ‘by, withʼ,
=(y)e >(y) ē (remnant of a goal-flagging preposition, which now occurs solely as
a clitic hosted by verbs when a goal argument immediately follows the verb; it is
glossed drct in the examples). Thus compare be min bi-ɫē ‘say to me!ʼ (with an
independent pronoun as complement of the preposition be) vs. pē=m bi-ɫē! (same
meaning), with a pronominal clitic complement (MacKenzie 1961: 123). Return-
ing now to (19), we find that the pronominal clitic is not realized locally on its
preposition, but on the preceding direct object. Crucially, the preposition remains
in its absolute form even when its pronominal complement is realized elsewhere.
Leftward displacement of prepositional complements is generally blocked in
past-tense transitive constructions, because here the clitic pronoun which indexes
the A takes precedence, and will be hosted by the direct object or other constit-
uent of the VP (illustrated above), if available. If a past transitive construction
also includes an adpositional argument, with a pronominal complement, then that
pronominal complement is frequently displaced, but not to a preceding element.
Instead, it is indexed via a verbal agreement affix. This is illustrated in (20), where
the verbal suffix expresses the person value of the displaced complement of the
preposition:
(20) selām=yān lē kird-īn
greetings=3 pl .A to.abs do.pst -1 pl
‘They(A) greeted usʼ, lit. ‘did greetings for/to usʼ (McCarus 2009: 618)
4
The constraint on clitic stacking concerns clitics indexing (or bearing) a verbal argu-
ment; in the context of this rule, Indirect Participants act like verbal arguments. How-
ever, an argument indexing clitic can attach to a possessor-indexing clitic. There is also
a general constraint against the repetition of phonetically identical pronominal clitics.
284 Geoffrey Haig
Not all adpositional phrases partake in this kind of construction; it is most wide-
ly-used with those adpositions governing what we have loosely termed an Indirect
Participant, typically a benefactive, recipient, or external possessor.
If the adpositional complement is third person singular, the appropriate verbal
agreement affix in the past tense is zero, so no overt indexing of the Indirect Partic-
ipant occurs, except indirectly, through the absolute form of the preposition itself:
(21) kič […] dergā=y lē kird=uwe
girl door=3 sg .A for.abs do.pst - prf (3 sg )
‘The girl(A) […] opened the door for himʼ (MacKenzie 1962: 30)
The prepositions bō ‘for’ and ɫegel ‘with’ differ from those just discussed in that
they allow the leftward displacement of a pronominal complement, but lack a cor-
responding absolute form:
(22) mesʼele=y eḥme-y _bēɣān=im bō b-ēn-ē
story-ez Ahmed=ez _carefree=1 sg for subj -bring.prs -3 sg
‘(That he may) bring the story of Ahmed the Carefree for meʼ (MacKenzie
1962: 56)
3.2.5. Subordinate clauses

Sorani is consistently right-branching, with all types of subordinate clause follow-
ing main clauses, and relative clauses following their heads. Like Kurdish gener-
ally, Sorani almost entirely lacks non-finite syntax. There are thus no infinitival
constructions directly comparable to English complements in -ing, or to+infini-
tive. Modal verbs such as wīstin ‘be desirable, necessaryʼ, and twānīn ‘be ableʼ,
take complement clauses with verbs in the present subjunctive, and no comple-
mentizer. Other complement-taking verbs, such as zānīn ‘knowʼ generally do not
use a complementizer between main and complement clause. A complementizer
ke is available for relative clauses, restrictive and unrestrictive, as in selāḥedīn,
ke dinye=y girt ‘Saladin, who conquered the worldʼ. Restrictive relative clauses
may also be introduced through an Ezafe particle, on the condition that the head
noun is marked as definite, either through the definiteness suffix -eke, or through a
demonstrative determiner, or is itself pronominal (MacKenzie 1961: 132–133). In
this case, they lack any complementizer; this is a further difference between Sorani
and Northern Kurdish. Postposed relative clauses, on the other hand, require the
complementizer ke, regardless of any other factors.
(23) ew-e=y tō dī-w-it-e
that-dem = ez 2 sg see.pst - prf =2 sg .A-3 sg
‘That which you have seen.ʼ (MacKenzie 1962: 133)
In Bingird and Piždar, non-restrictive relative clauses are introduced by her ke. In
these dialects, the particle agar is used in a variety of clause types as a comple-
mentizer:
(24) de-zān-ē agar kuř=ī N. nī=ye
ind -know.prs -3 sg compl son=ez (proper name) neg =cop .3 sg
‘(when he read it) he knows he is not the son of N.ʼ (Bingird dialect,
MacKenzie 1962: 132)
3.2.6. Alignment
As in many West Iranian languages, in Sorani the morphosyntax associated with
past transitive verbs is distinct from that of other clause types in the language. This
has often been referred to generally as “ergativity”, though it is not particularly
meaningful to characterize an entire language in these terms, particularly Sorani;
see Dabir Moghaddam (2012) for extensive discussion. In what follows, I will
present the main outline of the system, without committing to any particular clas-
sification of the language as a whole.
There is no morphological case in Sorani, so the entire alignment discussion
centres on patterns of agreement (or more neutrally, on indexing). The general
pattern in Sorani is the obligatory indexing of S or A. In all environments except
past transitive clauses, the person indexing is via one of the verbal suffixes shown in
Table 6. In past transitives, however, the A is obligatorily indexed through the appro-
priate pronominal clitics provided in Table 8. In this function, the clitic is best con-
sidered an agreement marker, rather than any kind of pronoun (see Samvelian 2007;
Haig 2008; Öpengin 2016). One argument in favour of this analysis is the presence
of the clitic in constellations where a pronoun would not normally be expected, for
example in sequences of same-subject clauses such as the following (glosses simpli-
fied), where a pronoun would normally be omitted in the second and third conjuncts:
(25) a. Kuř beyānī zū heɫstā
boy morning early rise.pst (3 sg )
b. čūe lāy pāšā wut=ī
went side.of king say.pst (3 sg )=3 s .A
‘The boy rose early in the morning, went to the king, and saidʼ
(MacKenzie 1962: 56)
(26) šew pel=ī kuř=yān girt kird=yān=e
evening arm=ez boy=3 pl .A take.pst do.pst =3 pl .A= drct
perde=we
curtain=asp
‘In the evening they took the boy by the arm and put him behind the
curtainʼ (MacKenzie 1962: 520)
286 Geoffrey Haig
Perhaps even more telling is the presence of the A-clitic in subject relativization,
where we would normally not expect any pronoun, as in the example mentioned
above: selāḥedīn, ke dinye=y girt ‘Saladin, who conquered the worldʼ. Unlike
free pronouns, which are preferably dropped in such contexts (even in a pro-
noun-happy language like English), the clitic pronouns indexing a past-tense A
cannot be omitted, but are required in all past tense transitive clauses. The other
objection to considering them pronouns is that they occur in the presence of a
coreferent subject NP. In short, the clitic pronouns in past-A functions are evi-
dently better analysed as agreement markers, and most contemporary scholarship
converges on this point.
The rules determining placement of the clitics have been discussed in §3.2.4.1
above. With regard to the verbal agreement affixes, with past tense transitive
clauses they may index an object that is otherwise not present in the clause; oth-
erwise the verb carries default third person agreement (zero). Alternatively, the
agreement suffix may be co-opted to index an indirect participant; see (20) above.
3.2.6.1. Experiencers as Non-Canonical Subjects

The past transitive construction just discussed shows obvious similarities with
clauses based on a set of (basically intransitive) experiencer-type predicates (e. g.
of desire, physical perception and sensation, but also of possession). Here too the
experiencer or possessor is obligatorily indexed via a pronominal clitic, adhering
to similar clitic placement principles as those obtaining for past A clitics. Exam-
ples of this type of construction include the following:
(27) he=m=bū ‘I had’ (existent=to.me=it.was)
(28) tīnū=m=e ‘I am thirsty’ (thirst=to.me=it.is)
The verb wīstin ‘be desirable, necessary’ (present stem -ewē-) indexes the desirer
through an obligatory pronominal clitic, and the ‘desiredʼ though a verbal affix,
regardless of tense:
(29) e=y=ewē-m ‘he wants meʼ (ind =3 sg =be.desired.prs -1 sg , MacKenzie
1961: 105)
In Wārmāwa dialect this construction is largely substituted by a similar construc-
tion based on non-verbal predicate, the Turkic loan gerek:
(30) gerek=im=e ‘I want, need’ (gerek=1 sg = cop .3 sg )
The experiencers and possessors in this kind of construction are syntactically not
distinct from other subjects, and exhibit the kinds of semantics widely associated
with so-called Non-Canonical Subjects (cf. Onishi 2002) cross-linguistically, so
it is reasonable to refer to the experiencers in constructions such as (28–30) as
Non-Canonical Subjects. The Non-Canonical Subjects associated with Central

Kurdish experiencer and possessive predicates are most probably retentions of
archaic structures in Iranian, and close parallels can be found in many contempo-
rary West Iranian languages, though they have disappeared in the northernmost
varieties of Kurmanjî Kurdish (Haig 2006; Haig and Öpengin 2018).
4. Northern Kurdish of North Iraq: Behdinī (Behd.)
The varieties of Northern Kurdish (NK) spoken in northern Iraq are referred to
by their speakers as Behdinī. According to the classification of Öpengin and Haig
(2014), Behdinī is part of the “Southeast” dialect group within Northern Kurdish,
a group that spills over into neighbouring regions of Turkey such as Hakkari, and
eastern Syria. Behdinī shares most of its morphosyntax with the better-known
Northern Kurdish dialects of Turkey, and much of the description of NK found in
Haig (this volume, chapter 2.3) also applies to Behdinī. In order to avoid repetition,
this section focuses on those aspects of Behdinī that distinguish it from the rest of
NK. My account draws largely on MacKenzie (1962) and Blau (1975), supple-
mented with information from a native speaker of Dohuk, currently living in Zakho.
4.1. Phonology
4.1.1. Consonants
Table 9: The consonant phonemes of Behdinī
bilab. lab.- dent. alv. post- pal. vel. uvul. pharyn. glot.
dent. alv.
Plos. ph p b th t ṭ d kh k g q ʔ
Fric. vf sz ṣẓ ʃʒ xɣ ʁ ħʕ 5
h
Affr. ʧ ʧʤ
h
Nas. m n ŋ
Trill r
Flap ɾ
Approx. w j
Lateral l (dialectally also ɫ)
5
As pointed out by Christiane Bulut (p.c.), in Kurdish as well as other languages of the
region, these segments can be considered to be glottal stops produced with a retracted
tongue root, rather than fricatives.
288 Geoffrey Haig
As discussed in Haig (this volume, chapter 2.3) the most notable feature of the
Northern Kurdish consonants is the presence of an aspirated vs. non-aspirated
distinction on the voiceless stops and affricates. However, MacKenzie (1961: 30)
notes that the Akre dialect of Behd., and the varieties spoken by the surround-
ing Zēbārī tribal confederation, lack this additional series, bringing their systems
closer to that of Sorani. MacKenzie does, however, include the pharyngeals /ṭ ṣ
ẓ/ in the consonant system of Behd., stating that they are identical in quality with
the corresponding sounds of Arabic. They are retained in words of Arabic origin,
such as ṣebr ‘patience’, or ṭeyr ‘bird’, but also occur in the native vocabulary of
Behd., e. g. ṣār ‘coldʼ, ṭārī ‘dark(ness)ʼ, where they trigger backing of the follow-
ing vowel (MacKenzie 1961: 36). Minimal pairs, or near-minimal pairs, demon-
strating the phonemic status of the pharyngeals include include tā ‘feverʼ vs. ṭā
‘branchʼ, or bez ‘suetʼ vs. peẓ ‘sheepʼ (MacKenzie 1961: 35–36). Blau (1975: 28)
does not include the pharyngeals in her analysis of the Sinjarī dialect. In general,
the phonemic status of the pharyngeal consonants in Northern Kurdish remains
a matter of debate, and although they are undoubtedly perceptually salient and
constitute a feature of native pronunciation of the language, the functional load of
pharyngeals remains limited in any variety of Northern Kurdish.
In more westerly varieties of NK, an inherited sequence [xw] is retained, or
rendered as a labialized fricative[xʷ], while in Behd. it is generally reduced to a
velar fricative [x] (Haig and Öpengin 2018).
Behd. Standard Kurm. Gloss
xārin xwārin ‘to eat’
xāndin xwāndin ‘to read, studyʼ
xo xwe ‘self’
xē xwē ‘salt’
4.1.2. Vowels
The analysis of the Behd. vowel system is hampered by differences in the tran-
scription conventions across different sources, and in standardized NK orthogra-
phy. This presentation aims for a compromise solution between the widely-used
orthography of NK, and the transcription of MacKenzie (1961, 1962). Table 10
gives an overview of the main differences, and establishes the symbols used in
this description:
Table 10: Transcription of vowels for Behd.

IPA MacKenzie (1961) Stand. NK orthography This chapter
[ỵ] û (ü) ü
[uː] ô û ū
[ʊ] u u u
[oː] (ō, for Šēxān dialect, 1961: 39) o o
[ɛ~æ~a] a e e
[ɑ~ɔ] ā a ā
[e:] ē ê ē
All varieties of Behd. exhibit basically the same eight-vowel system that charac-
terizes Northern Kurdish generally (see Haig, this volume, chapter 2.3). MacKen-
zie (1961: 33) assumes the system for the Akre dialect shown in Fig. 3.
Figure 3: Vowel system of Behd., variety of Akre
Vowel length is not by itself distinctive, though [i, y, u, e, ɑ] are generally pro-
nounced longer and tenser than other vowels, particularly in open syllables. The
main difference between Behd. and the rest of NK involve a set of changes, called
here the Behd. Vowel Shift, and originally outlined in Blau (1975: 33). The changes
affect the tense back rounded vowels:
1. The closed back rounded [uː] is fronted to [yː], and in many dialects, deroun-
ded to [iː], thus merging with inherited [iː]. For example, the standard Kurdish
minimal pair [ʃuːr] ‘swordʼ vs. [ʃiːr] ‘milkʼ are both realized as [ʃiːr] in Dohuk
and Zakho.
2. The close-mid back vowel [oː] raises towards [u]. Thus standard Kurmanjî
[mɨroːv] ‘personʼ is [mɨruv], and [goːt] is [gut] ‘saidʼ in Zakho. This latter
process is also noted for Šemzinan Kurmanjî (Haig and Öpengin 2018), but it
does not apply as regularly as the first one.
290 Geoffrey Haig
The full change [uː → iː] appears to be relatively recent. MacKenzie (1961), based
on fieldwork from the 1950ʼs, noted the fronting of [uː→ y] for Akre, and assumed
[yː] to be the norm, but does not mention systematic derounding. He notes (1961:
40) that some speakers of Amediye and Zakho occasionally “confuseˮ [yː] with
unrounded [iː], implying that derounding was at best sporadically attested at that
time. Similarly, Blau (1975) based on fieldwork from the late 1960ʼs, notes only
fronting of [uː] for Amediye, but not derounding. More recently, Haig and Mustafa
(2016) note the systematic shift of original [uː] to [iː] for most of the region sur-
rounding Dohuk. It also occurs in loanwords such as [siːk] ‘marketʼ from Arabic
suq. Zakho has apparently gone further than most other dialects in that deround-
ing also applies to the third person singular past of ‘beʼ, which is [buː] in Stand-
ard Kurdish, but in Zakho [biː]. However, in no dialect is the past tense of ‘goʼ
affected, which generally contains [uː]. This is presumably because the [uː] here is
a secondary development from [oː].
Haig and Mustafa (2016) also oberve that the Ezidi speakers of the town of
Šarya east of Dohuk retain [uː] (e. g. [buːk], as opposed to [biːk] ‘brideʼ of the
Sunnite speakers of Dohuk). This ties in in with Blauʼs (1975) findings on the
Ezidi speakers of Sinjar region, which likewise retain inherited [uː]. In the speech
of an elderly male Ezidi speaker from Ba’adra, a predominantly Ezidi-inhabited
township some 40km. from Dohuk, we also find no evidence of [iː] (e. g. the form
for ‘everyone’ is [hæmuː], rather than Dohuk [hæmiː]). Thus while the Behd.
vowel shift is still only poorly understood, it seems possible that it is utilized as a
marker of religious boundaries; it certainly belongs to the kind of phonetic vari-
ants which Meyerhoff (2011: 26) labels as stereotypes, i. e. those that speakers are
aware of, and can meta-linguistically comment on, hence are good candidates for
markers of identity. However, the absence of the Behd. Vowel Shift among Ezidi
speakers may also reflect distinct origins of the groups concerned; this remains to
be investigated.
Table 11: Examples for the Behd. vowel shift

Gloss Stand. Kurm. Ezīdī of Šarya Šemzinan Dohuk Zakho
‘hairʼ muː muː myː miː miː
‘quick, soonʼ zuː zuː zyː ziː ziː
‘brideʼ buːk buːk byːk biːk biːk
‘satʼ ruːniʃtiː ruːniʃtiː ryːniʃti riːniʃtiː riːniʃtiː
‘wasʼ buː buː buː buː biː
Other differences from the dialects of Turkey include the lowering of [æ] towards
[a], e. g. [bas] ‘enoughʼ, and the backing of [a] to [ɑ] as in [bɾɑː] ‘brotherʼ.
4.2. Morphosyntax
Most of the features that set Behd. apart from other dialects of Northern Kurdish
concern aspects of the predicate. Nominal morphology, on the other hand, is very
close to Standard, or Botan, Kurmanjî. In the interest of brevity only those points
are discussed where Behd. differs from the rest of Northern Kurdish (Haig, this
volume, chapter 2.3). Some of the points mentioned are illustrated in the short text
excerpt provided in §4.3 below.

The gender distinction in the singular is consistently observed in Behd., both in
the form of the Oblique case: -ī (masc. sg.) vs. -ē (fem. sg.), and the forms of the
Ezafe: =yē (masc. sg.) vs. =yā (fem. sg.). The oblique case of masculine singular
nouns, often lacking or marked through Ablaut in other varieties of NK, is consist-
ently marked on the noun via the suffix -ī: min gut=e zelām-ī ‘I(obl ) said=to the
man-obl ʼ. The plural form of the Ezafe is -ēt, rather than the -ē(n) of most of NK.
The paradigm of personal pronouns is identical to the rest of NK, except for the
second person plural in Surčī and Akre, where the forms engo (Surčī, both direct
and oblique), hung or hing (Akre, Amadiye, direct; (hun)ngu, (hu)nge oblique) are
found.
4.2.2. Verbal morphology

In the person and number agreement paradigm, most varieties of Behd. resemble
Sorani in that they distinguish the first person plural from the second and third
person plural (-īn first person plural versus -in second and third person plural),
though in Zakho this additional distinction is also reported absent, at least for
some speakers. Elsewhere in NK, there is a single form -in for all persons in the
plural. All varieties of Behd. preserve a historical final -t on the agreement suffix
for the third person singular on the present tenses (MacKenzie 1961: 182). In
Surčī, Amediye, Akre and Dohuk the sufix is -īt, as in Dohuk čē-nā-b-īt prv -
neg -be.prs -3 sg ‘It doesnʼt happenʼ. In Zakho and Gullī we find -it (t-č-it ind -go.
prs -3 sg ‘he goesʼ, Zakho, MacKenzie 1962: 362). Elsewhere in NK, the ending is
reduced to -e. Thus we can recognize three grades of weakening of 3 sg present: -īt
(Dohuk), -it (Zakho), -e (most NK dialects of Turkey). Finally, most of Behd. has
so-called ‘heavyʼ present stems for the verbs kirin ‘do, makeʼ dān ‘giveʼ, and birin
‘bringʼ (ke-, de- and be- respectively), as opposed to k-, d- and b- in the rest of NK,
though again, Zakho seems to pattern more like the rest of NK.
In certain contexts (generally subjunctive, or at least semantically irrealis),
Behd. verbs take an ending-itin in the third person singular:
292 Geoffrey Haig
(31) zelām-ek ne hinde qewet jī dē šē-t

man-indf neg that strength add fut can.prs -3 sg
wan dār-ā bi dest-ē xo hil-kēš-itin
those.obl tree-pl . obl with hand-ez . m refl prv -pull.prs -3 sg
‘(even) a man not that strong will be able to pull those trees out with his
handʼ (Unger 2014)
The origin and function of this formative remain unclear. MacKenzie (1961:
91) notes an alternative “optionalˮ third singular verbal suffix -tin in Arbil and
Rewandiz dialects of Sorani, noting only that it occurs “most commonly in pauseˮ.
This is presumably related to the Behd. form, but does little to explain its function.6
The present indicative suffix, in NK generally of the form di-, is reduced in
Behd. to [t-], and is transcribed as such here. The future tense in Behd. differs
from the rest of NK in that the main verb lacks the subjunctive prefix bi-. Thus
the future is created through a clause-initial dē (as in NK generally, cf. Haig, this
volume, chapter 2.3, §2.3.4); this particle occurs generally in second position, but
in the absence of an overt subject NP, may be clause initial) plus the inflected bare
present stem of the verb. Functionally, the Behd. future has a much broader range
of application than the label ‘future’ implies. It is frequently used for present habit-
ual senses, as in the narrative provided in MacKenzie (1962: 361–362), where the
speaker recounts the means of livelihood of his father (a charcoal-seller), alter-
nating between simple indicative and future tenses to describe actions which evi-
dently recur regularly, including in the past:
(32) l wērē dē dār-ā kom k-in
there fut wood-pl . obl gathering do.prs -3 pl
‘there (they) gather the woodʼ (MacKenzie 1962: 362)
My impression is that the future tense is used with much greater frequency in
Behd. than in the westerly varieties of NK, but this has not been systematically
investigated.
Unique to Behd. is a past habitual, also used in a conditional sense, based on a
particle dā, with similar positional constraints to future dē, plus the non-prefixed
present stem. For example, an account of earlier traditional wedding celebrations
(Haig and Mustafa 2016) begins with the following:
6
Geoffrey Khan (p.c.) points out that in NENA dialects in the Behdini area a –ni or –in
augment is often added to verbal endings (of all persons) in pause, e. g. garəš ‘he pulls’
> garəšni (pause), garšet ‘you (ms.) pull) > garšetin (pause). There is no clear Semitic
etymology for this. There may be a link here, but the Behd. -in is restricted to third
persons only.
(33) zelām dā hē-n […]

man cond come.prs -3 pl
‘the men would / used to come […]ʼ
This form is also used for irrealis forms with non-past semantics; see Unger (2014).
A further feature distinguishing Behd. from the rest of NK is the very frequent use
of the aspectual clitic -ewe ‘back, againʼ, presumably reflecting the areal proxim-
ity to Sorani.
The final feature discussed here is likewise unique to Behd. within NK, and
indeed within the entirety of Kurdish. Behd. has developed a present continuous,
based on the Ezafe particle (indicating gender and number of the subject) and the
present indicative form of the verb. Example (34) contrasts the use of this form,
indicating immediacy of the event, with the simple indicative present (35), where
a general present tense is implied:
(34) tu yē cotī t-ke-yī
2 sg ez . m ploughing ind -do.prs -2 sg
‘Are you ploughing? (right now)ʼ
(35) ev čend sal=e tu cotyarī-yē lē
this how_many year=cop .3 sg 2 sg cultivation-obl . f on.it
t-ke-yī
ind -do.prs -2 sg
‘How many years is it that you are cultivating it?ʼ
This usage is presumably related to the use of the Ezafe with non-verbal predi-
cates, particularly with expressions of location, and with participles, described in
MacKenzie (1961: 205–208), and Haig (2011). Finally, as in Sorani, Behd. makes
extensive use of non-canonical subjects (in the oblique case) with predicates of
possession, desire, and some physical sensations (heat, cold, but not hunger and
thirst). The following example illustrates an oblique subject of a predicate of
perception;7 note also in this example the use of the ezafe particle following the
oblique subject, which is typical for non-verbal predicates expressing temporary
states in Behd:
7
In Dohuk, to express ‘hear’ a different construction is used, involving a complex pred-
icate consisting of gūlē combined with the light verb būn. In the present tense, the
subject is treated canonically (i. e. is in the direct case). Thus ‘I (female) hear the sound
of the children’ is: ez a dengē pičīkā gūlē dibim (Baydaa Mustafa, p.c.). Remarkably, in
the past tense the subject goes into the oblique case, indicating that this expression is
treated as overall transitive, despite the intransitive light verb būn ‘be’.
294 Geoffrey Haig
(36) min=ē gū lē
1 sg . obl = ez . m . s ear at.it
‘I hear itʼ (spoken by a male speaker, hence the masc. sg. form of the
Ezafe, MacKenzie (1961: 206), dialect of Akre)
4.3. Short text in Behdinî Kurdish

The following text is an excerpt from a traditional tale, told by a male Ezidi
speaker, approximately 60 years old at the time. The text was recorded in 2011
during fieldwork in the township of Baʼadra, undertaken by the author together
with Ergin Öpengin.
(37) der-ā mirin lē he-b-itin
place-ez . f death in.it existent-cop . prs . subj -3 sg
ez lē nā-b-im
1 sg in.it neg - cop . prs -1 sg
‘(He says:) I won’t stay at a place where death (mortality) isʼ
(38) hēj_ko ne-čū māl-ā xo
before neg -go.pst (3 sg ) house-ez . f refl
‘before he went back to his homeʼ
(39) ū išāret-ek bo xo dā-nā […]
and sign-indf for refl prv -put.pst (3 sg )
‘and placed a sign for himselfʼ
(40) dā b-zān-im išāret-ek wā he=ye
cond subj -know.prs -1 sg sign-indf such existent=cop .3 sg
‘(saying) that I know there is such a signʼ
(41) wāxtē čū hevāl ū bend-ēt xo helā-n8
when go.pst (3 sg ) for and comrade-ez . pl refl leave.pst -3 pl
‘when he departed and left his friends and comradesʼ […]
(42) cāmēr-ek-ī got=ē te xēr=e
man-indf - obl . m say.pst (3 sg )=to.him 2 sg . obl goodness=cop .3 sg
bāb-o
father- voc . m
‘a man said to him: “what are you doing, old fellow?”ʼ
8
The verb is helān ‘leaveʼ, which in other dialects has the infinitive hištin. Note the reg-
ular number agreement withe the object, typical of the canonical ergative construction
in Behd.
(43) dē kēwe č-ī?

fut where go.prs -2 sg
‘where are you going?ʼ (note the use of the future tense)
(44) go bi xādē ez dē č-im=e
say.pst (3 sg ) by God I fut go.prs -1 sg = drct
Qub-ā Felek-ē
dome-ez . f Heaven-obl . f
‘he said: “By God, I am going to Heavenʼs Dome”ʼ […]
(45) yāʼnī dē č-īt=e Qub-ā Felek-ē
so fut go.prs -3 sg - drct dome-ez . f Heaven-obl . f
‘so that is, he would go to Heavenʼs Domeʼ
5. Varieties of Gorani in northern Iraq
5.1. Overview
Following Mahmoudveysi et al. (2012), I use Gorani (spelled Gūranī in Bailey
2016, and Gurāni in McKenzie 2002) as a cover term for a group of West Iranian
languages, with a probable historical epicentre in the mountainous Hawromān
region of western Iran. The best-described, and morphologically most complex,
variety of Gorani is Hawrāmī, which has a reasonably well-established written
standard, based on the variety of Pāve in Iran. The historical forerunners of Gorani
spilled westward from their mountainous origins into todayʼs Iraq, and are cur-
rently still spoken by a geographically scattered and socio-culturally diverse group
of speech communities, beginning in the Iraq-Iran border region between Halabja
and Xanaqīn, and extending northwest towards Mosul. According to MacKenzie
(2002), the original speech zone of Gorani in this region was much larger, and
todayʼs scattered Gorani islands thus represent the remnants of what must have
been a more extensive contiguous region.
The most reliable and accessible description of any variety of Hawrāmī remains
MacKenzie (1966). Recently, documentation and analysis of local varieties of
Gorani spoken in Iran have been published as part of language documentation pro-
jects (Mahmoudveysi et al. 2012; Mahmoudveysi and Bailey 2013; Bailey 2016).
The varieties of Gorani spoken in Iraq, on the other hand, remain very poorly doc-
umented. Bailey (2016) summarizes much of the existing scholarship on Gorani,
and this section largely adopts her conventions. Bailey identifies the following
varieties as belonging to Gorani (see the map in Fig. 4 for locations).
296 Geoffrey Haig
Hawrāmī
As mentioned, this is the name given to those varieties spoken in the mountainous
Hawrāmān (Awromān) regions of western Iran and eastern Iraq. It is commonly
differentiated into Hawrāmān-i Taxt, the varieties of the high mountain regions,
and Hawrāmān-i Luhon, the region of valleys. Varieties of Hawrāmī are spoken
in Iraq in and around the city of Halabja, close to the Iranian border. They are not
treated in this chapter; see Mahmoudveysi and Bailey, this volume, chapter 4.5.
Kandūlayī
Kandūlayī refers to the variety of Gorani spoken in a complex of three villages
some 50 km. north of Kermānšāh (see Fig. 4). This variety is close to to Hawrāmī
of Pave.
Zardayāna
The variety spoken in the village of Zarda, spoken in a village about 100 km. north
of Sar Pol-e Zahāb in Iran. It is documented in Mahmoudveysi and Bailey (2013).
Gawřaĵūyī
The variety spoken only in the village of Gawřaĵu, located about 10 km. west of
the township of Gahvāre in West Iran. It has been described in Mahmoudveysi et
al. (2012), and Bailey (2016). The dialect of Gawřaĵu is relatively isolated from
the rest of Gorani, and has been heavily influenced by surrounding varieties of
Southern Kurdish. This may explain why it has lost many of the identifying fea-
tures of Gorani (see below).
Bāĵaɫānī
This is a term loosely identified with the speech of the Bāĵaɫān tribes, spoken
in several locations just east of Mosul, but also villages near Xanaqīn and Koy
Sanjak to the southeast near the Iranian border. The Bāĵaɫān are sometimes sub-
sumed under the Šabak (see below), but I follow Bailey (2016), basically adhering
to MacKenzie (1956), in maintaining a difference between the two groups, which
seems to be more in line with their self-perceptions. Along with the language (here
considered a variety of Gorani), the Bāĵaɫān are known for their religious hetero-
doxy.
Šabakī
The speakers are socially and linguistically closely aligned with the Bāĵaɫān, and
are originally located east of Mosul on the Nineveh plains. The claim that Šabakī
is a mixture of Turkish, Arabic and Kurdish is not substantiated by the available
data, which suggest that it is a variety of Gorani, very close to Bāĵaɫānī (Bailey
2016: 643).
Kākayī
The name Kākayī is generally applied to groups belonging to the Yaresan or Ahl-e
Haqq religious community, around the towns of Tōpzāwa near Kirkūk, Xānaqīn
and Arbil (Bailey 2016: 644). The language is often referred to with the term Mačo
(3sg present of the verb ‘sayʼ).
Figure 4: Names and locations of Gorani speaking localities

(adapted from Bailey 2016: 9–10)
Key to Abbreviations used in Fig. 4:

B (Bēwyānī) B1: Sarqizil, Bardī ʿAlī Xwārū/Žūrū, Say Miṣafā, Gōřī Ginūž, Kānī Šīrna;
B2: Bēwyānī Gawra/Bičūk and about another fifteen villages; B3: Bēwyān, Dūšamnān
(their main places in this area), Sar Pol-e Ẕahāb, Dāraka, Qałama, Šāy Tōtyā, Barxu
Bārānī Xwārū/Žūrū, Miǰūryānī Ambar/Awbar, Tangī Ḥamām, Sarqałā
Bǰ (Bāǰalānī) Bǰ1: several villages near Al-Mawṣil (members of the tribe are also present
in Xānaqīn, Kalār, etc.); Bǰ2: Kānī Māz, Taqtaq, Sē Girdkān, Qāmīš, (Kōya (Koy Sanǰaq),
Dukān)
G: Zarda and Gawraǰū villages
H (Hawrāmān area) Center: Pāve and Nowsūd
K (Kākayī) K1: Xānaqīn, Mēxās, Qarāmīn, Malā Řaḥmān, Dārā, Qałama, ʿAlī Bāpīr,
Řamazān, Tapa Čarmē, Ḥāǰī Miṣṭafā; K2: Kirkūk (districts Ḥayy ʿAskarī, Usarāʾ al-
Mafqūdīn etc.); K3: around Tōpzāwa (center of the Kākayī), ʿAlī Sarāy, Ġarǰa Kōyī,
Maʿtīq, Dāquq, Zaqar; K4: near Arbīl: Sufaya, Wardak, Kabarlū, Tūlaban, Gazakān
Kn (Kandūla) three villages
298 Geoffrey Haig
Ř (Řōžbayānī) Ř1: Furqān; Ř2: Čamčamāł; Ř3: Arbīl; Ř4: Kirkūk and Laylān (near
Kirkūk)
Šb (Šabak) Šabak villages near Al-Mawṣil
ŠrB (Šaraf Bayānī) villages around Bamō
Šx (Part of Šēxānī) Šx1: villages near Qādir Karam: Qašqa, Wēła, Šawak, etc.; Šx2: Kānī
Māz, Taqtaq, Sē Girdkān, Qāmīš, (Kōya (Koy Sanǰaq), Dukān)
Z (Zangana) Z1: Qādir Karam; Z2: Bakragařa, Sipasar and many more (in Xānaqīn,
Kalār, Kifrī (between Kalār and Ṭūz Xurmātū), Arbīl and surrounding area and in the sur-
roundings of Al-Mawṣil); Z3: Ṭūz Xurmātū
5.2. Gorani and Kurdish

The relationship of Gorani, as delineated above, to Kurdish is quite complex. Most
speakers refer more or less collectively to all the Iranian languages of Iraq as some
form of Kurdī, and Leezenberg (1994: 3) states that the Gorani speakers he met in
Iraq all considered themselves “Kurdsˮ. Unsurprisingly, centuries of close contact
between speakers of Gorani and speakers of Central (and in some places Northern)
Kurdish have yielded convergence in phonology, but also in syntax and lexicon.
MacKenzie (2002) considers the phonology of Gorani basically identical to that
of Sorani (cf. §3.1). Nevertheless, from a historical Iranianist perspective there are
good reasons for distinguishing Gorani from Central and Southern Kurdish, and
most scholars of Iranian languages continue to do so.
Verb paradigms are among the more reliable means of assessing historical relat-
edness. Sample paradigms of the present indicative are provided in Table 12. For
Hawrāmī of Luhon (Iran, MacKenzie 1966: 37) and Bāĵaɫānī (Šabakī, according to
the speaker) of the village of Arpačī, a few kilometres east of Mosul (MacKenzie
1956: 424) the verb is ‘sleep’ and for “Mačoˮ of Topzawa (Leezenberg 1994), the
verb is ‘see’. The corresponding person agreement suffixes from Zardayāna of
West Iran (Mahmoudveysi and Bailey 2013: 43) are also provided. As a compar-
ison with Kurdish, the corresponding present-tense suffixes from Sorani Kurdish
(Table 5 above) are included.
Table 12: Present indicative verb paradigms in Gorani

G orani K urdish
H awr . B aj ./ Š abakī M ačo Z ard . S orani
‘sleep’ ‘sleep’ ‘see’
1 sg m-us-u m-ōs-ī me-wīn-ü [møjny] -ī/y -im
2 sg m-us-ī m-ōs-ī me-wīn-ü -ī -īt, -yt
3 sg m-us-o m-ōs-ō me-wīn-o -o -ē(t)
1 pl m-us-me m-ōs-mē me-wīn-im -mē -īn
2 pl m-us-de m-ōs-ē me-wīn-de -dē -in
3 pl m-us-ā m-ōs-ān me-wīn-ā -ān, -ānē -in
A number of points of interest emerge from this comparison. First, the syncre-
tism of first and second singular forms in all varieties of Gorani outside of Hawr.
Second, the phonological shape of the person markers in Gorani, which are both
relatively internally consistent, and exhibit obvious differences to the correspond-
ing paradigms from Sorani.
Table 13 below provides a summary of ten morphological features common to
most of Gorani, but lacking in Kurdish. The features are selected from the extensive
comparative tables provided in Bailey (2016: 648–668), and include only those
features that (i) are present in at least three of the varieties identified as Gorani;
and (ii) are lacking in Northern, Central and Southern Kurdish. Thus features such
as the clitic =ewe ‘back, againʼ, which is characteristic of all Iranian languages of
Iraq (and has even spread to non-Iranian languages), are not included. Likewise, a
feature such as a first person plural pronominal clitic with the form =mā(n), with
cognates throughout much of West Iranian, are of little value in distinguishing
Gorani from Kurdish, and are hence not included.
Table 13 is far from exhaustive, and should only be considered as a tenta-
tive and partial set of candidates for morphological innovations that set Gorani
apart from its Kurdish relatives. Not all features are of equal significance. The
presence of a feature such as the -g(i)n- present stem of ‘fallʼ in both Hawrāmi,
and geographically quite distant Bāĵaɫānī, is highly significant; such a feature is
hardly likely to have been independently borrowed, nor does it seem likely that
all intervening languages had this feature, and then dropped it. It would therefore
be more likely to consider it an inheritance from a presumed common ancestor
of these two languages, i. e. some (branch of) proto-Gorani. It will be noted that
Gawřaĵūyī (abbreviated Gawr.) differs from the rest of Gorani on several counts,
mostly exhibiting the corresponding forms from Southern Kurdish. This con-
firms the overall impression that Gawr. is the most heavily Kurdicized variety of
Gorani yet documented. A question mark indicates lack of relevant data; I have
300 Geoffrey Haig
collapsed Baileyʼs two columns “Bāĵaɫānī” and “Bāĵaɫānī/Šabakī” into a single

column.
Table 13: Lexical and morphological features of Gorani not attested in Kurdish
Gawr. Haw. Kand. Zard. Kāk. Baj./Šab.
At least one demonstrative – + – + ? +
pronoun stem contains -ēd-
(Bailey 2016: 651, 654)
Reflexive pronoun stem w- (Bailey – (ištān) + + + + hē
2016: 655)9 (<*wē?,
cf. hūn
‘bloodʼ)
3sg present indic. verbal suffix rare + + + + +
with a back rounded vowel (Bailey
2016: 657)
2pl imperative contains a -d- – + + + ? –
(Bailey 2016: 659)
unstressed present indicative + + + + + +
prefix m+unrounded, central/
open, vowel (e. g. m(ɨ)- (Haw.,
MacKenzie 1966: 32), or ma- else-
where (Bailey 2016: 656).10
9
According to Sara Belelli (p.c. 17. 09. 2016), a reflexive pronoun wiž (possible vari-
ant viž) is present in most ‘Laki-Kermānshāhi’ varieties (e. g. Bisotun, Chehr, Harsin,
Pāyravand: see Fattah, 2000: 291 and Belelli, 2016: 65–66). Depending on how one
defines “Southern Kurdish”, this feature may not qualify. It is also attested in varieties
of Lekī.
10
One might conjecture on possible influence of the Persian mī- prefix here. Two facts
militate against this assumption: (i) the considerable differences in vowel quality and
stress patterns (evident in the assimilation of the prefix vowel to stem-initial vowels in
Gorani); (ii) the histories of the speech communities: in order for bound verbal mor-
phology to be affected in this way, there would have to have been prolonged heavy
contact and bilingualism with Persian across the entirety of Gorani. This seems unlikely
given the locations and history of the Gorani speech communities. Sara Belelli (p.c.
17. 09. 2016) notes the presence of m(a)- (normally accompanied by a clitic =a attach-
ing to the element immediately preceding the verbal form) in most ‘Laki-Kermānshāhi’
dialects (e. g. Bisotun, Chehr, Harsin, Pāyravand). This morpheme is also typical of
Laki dialects (see, for instance, Lazard, 1992: 218); see Fattah (2000: 371–372), Belelli
(2016: 99–100). Thus the reservations that apply to the reflexive feature also apply
here.
Gawr. Haw. Kand. Zard. Kāk. Baj./Šab.

1sg past intransitive verbal suf- - (-im) + + + + – (-īm)
fixes contain -n- (Bailey 2016:
658)
Third person clitic copula, present + + + + + +
tenses contain -n- (Bailey 2016:
661)11
‘whatʼ+-š (the interrogative word - + + + + +
‘whatʼ carries a final sibilant, e. g.
Zard. čiš, Bailey 2016: 665)
-g(i)n- stem for present tense of - + + (mangū) - - +
‘fallʼ (e. g. magnō ‘it will fallʼ,
Baj., MacKenzie 1956: 422)
3sg present indic. ‘sayʼ: māč+o/ū - + + + + +
Obviously we are still far short of a true comparative study of Gorani and Kurdish,
but these few facts should suffice to demonstrate that despite the scattered and iso-
lated nature of the Gorani speech communities in northern Iraq, they have retained
a core of common morphological and lexical features that distinguish them from
the Kurdish spoken by the surrounding speech communities.
5.3. Notes on syntax

The variety of Hawrāmī spoken in and around Halabja is very close to that of
Pāve, thus has retained gender and case on nouns. Elsewhere, however, gender
appears to have been lost. For the variety of Baj. described in MacKenzie (1956),
an oblique case is apparently maintained:
(46) ī zelām-a-y činī hē=t b-er-e
this man-dem - obl with refl =2 sg . poss subj -bring.prs . imp -2 sg
‘Bring this man with you!ʼ (MacKenzie 1956: 426, glosses added)
However, no other clear example of an oblique-marked direct object was availa-
ble in MacKenzieʼs material, which makes this example doubtful. Similarly, no
evidence for an oblique case can be found in the scanty material of Leezenberg
(1994). The following example is provided by Leezenberg from Mačo of Topzawa
11
Along with a copula in =ān, Gawr. also has a 3sg copula -ē, which is actually attested
much more frequently than =ān (261 tokens of =ēn versus 15 tokens of =ān in Bailey’s
Gawr. corpus, Denise Bailey p.c.). This could be interpreted as further indication of the
heavy Kurdification of Gawr.
302 Geoffrey Haig
(unfortunately all the examples of direct objects in Leezenbergʼs material are

indefinite, so the possible effects of DOM cannot be ruled out):
(47) Ew zilām-ē me-wīn-ō
3 sg man-indf ind -see.prs -3 sg
‘He sees a manʼ
All varieties of Gorani use pronominal clitics to index an A in past transitive con-
structions, with the A-clitic hosted by the first constituent of the VP, as described
for Sorani in §3.2.4:
(48) etū čēš=it wāt bene=šān
2 sg what=2 sg .A say.pst (3 s ) to=2 pl
‘What did you say to them?ʼ (Baj., MacKenzie 1956: 431)
Unlike Central Kurdish, the third person singular pronominal clitic is =(i)š in
Gorani; otherwise the paradigm of pronominal clitics is cognate with the forms
given in Table 8 above.
Abbreviations
abs Absolutive indf Indefinite

add Additive m Masculine
asp Aspect mod Modal
cez Compound Ezafe neg Negation
compl Complementizer obl Oblique
cond Conditional pl Plural
cop Copula poss Possessive
def Definite prf Perfect
dem Demonstrative prs Present
drct Directional particle prv Preverbal particle
ez Ezafe pst Past
f Female refl Reflexive
fut Future sg Singular
imp Imperative subj Subjunctive
ind Indicative voc Vocative
References
Bailey, Denise. 2016. A grammar of Gawraĵū Gūrānī. Göttingen: Universität Göttingen

dissertation.
Belelli, Sara. 2016. A study on language and folklore in the city of Harsin (in Kermānshāh
province of West Iran. Sketch grammar with texts and lexicon. Naples: LʼOrientale
University PhD dissertation.
Blau, Joyce. 1975. Le Kurde de ’Amādiya et de Djabal Sindjār: Analyse Linguistique, Tex-
tes Folkloriques, Glossaires. Paris: Klincksieck.
Fattah, Ismaïl Kamandâ. 2000. Les dialectes kurdes méridionaux. Étude linguistique et di-
alectologique. Leuven: Peeters.
Haig, Geoffrey. 2006. Turkish influence on Kurmanji: Evidence from the Tunceli dialect. In
Lars Johanson & Christiane Bulut (eds.), Turkic-Iranian contact areas: Historical and
linguistic aspects, 283–299. Wiesbaden: Harrassowitz.
approach. Berlin & New York: Mouton De Gruyter.
Haig, Geoffrey. 2011. Linker, Relativizer, Nominalizer, Tense-Particle: On the Ezafe in
West Iranian. In Foong Yap, Karen Grunow-Hårsta & Janick Wrona (eds.), Nominaliza-
tion in Asian languages: Diachronic and typological perspectives, vol. 1, Sino-Tibetan
and Iranian Languages, 363–392. Amsterdam & Philadelphia: Benjamins.
University Press.
Haig, Geoffrey. In press. Debonding of inflectional morphology in Kurdish and beyond. In
Songül Gündoğdu, Ergin Öpengin, Geoffrey Haig & Erik Anonby (eds.), Current issues
in Kurdish linguistics. Bamberg: Bamberg University Press.
Haig, Geoffrey & Baydaa Mustafa. In press. Language choice and patterns of usage among
Kurdish speakers of Dohuk: An empirical intergenerational study. In Songül Gündoğdu,
Ergin Öpengin, Geoffrey Haig & Erik Anonby (eds.), Current issues in Kurdish linguis-
tics. Bamberg: Bamberg University Press.
Haig, Geoffrey & Ergin Öpengin. 2014. Kurdish: A critical research overview. Kurdish
Studies 3. 99–122.
Haig, Geoffrey & Ergin Öpengin. 2018. Kurdish in Turkey: Structure, variety, and status. In
Christiane Bulut (ed.), Linguistic minorities in Turkey and Turkic speaking minorities
of the peripheries, 157-230. Wiesbaden: Harrassowitz.
Hassanpour, Amir. 1992. Nationalism and language in Kurdistan 1918–1985. San Fran-
cisco: Mellen Research University Press.
Jahani, Carina. 2015. On the definite marker in Modern Spoken Persian. Paper held at the
Sixth International Conference on Iranian Linguistics (ICIL6), Ilia State University,
G. Tsereteli Institute of Oriental Studies, Tbilisi / Georgia, 23–26 June, 2015.
Lazard, Gilbert. 1992. Le dialecte laki dʼAleshtar (kurde méridional). Studia Iranica 21.
215–245.
Leezenberg, Michael. 1994. The Shabak and the Kakais: Dynamics of ethnicity in Iraqi
Kurdistan. ILLLC Technical Reports Series X. Amsterdam: Institute for Language,
Logic, Language, and Computation.
MacKenzie, David. 1956. Bāĵalānī. Bulletin of the School of Oriental and African Studies
18(3). 418–435.
MacKenzie, David. 1961. Kurdish dialect studies I. Oxford: Oxford University Press.
MacKenzie, David. 1962. Kurdish dialect studies II. Oxford: Oxford University Press.
MacKenzie, David. 1966. The dialect of Awroman (Hawrāmān-ī Luhōn). Grammatical
sketch, texts, and vocabulary. København: Det Kongelige Danske Videnskabernes Sel-
skab.
MacKenzie, David. 2002. Gurāni. Encyclopædia Iranica 6.4, 401–403. New York: Center
for Iranian Studies, Columbia University. http://www.iranicaonline.org/articles/gurani
(accessed 28 August 2016).
304 Geoffrey Haig
Mahmoudveysi, Parvin, Denise Bailey, Ludwig Paul & Geoffrey Haig. 2012. The Gorani
Language of Gawraĵū, a village of West Iran. Texts, grammar and lexicon. Wiesbaden:
Reichert.
Mahmoudveysi, Parvin & Denise Bailey. 2013. The Gorani Language of Zarda, a village of
West Iran. Texts, grammar and lexicon. Wiesbaden: Reichert.
McCarus, Ernest N. 2009. Kurdish. In Gernot Windfuhr (ed.), The Iranian Languages, 587–
633. London & New York: Routledge.
McDowall, David. 2004. A modern history of the Kurds, 3rd edn. London: Tauris.
Öpengin, Ergin. 2016. The Mukri variety of Central Kurdish. Grammar and Texts. Wies-
baden: Reichert.
Öpengin, Ergin. In press. Accounting for the combination of clitic and affix person mark-
ers in Central Kurdish. In Songül Gündoğdu, Ergin Öpengin, Geoffrey Haig & Erik
Anonby (eds.), Current issues in Kurdish linguistics. Bamberg: Bamberg University
Press.
Öpengin, Ergin & Geoffrey Haig. 2014. Regional variation in Kurmanjî: A preliminary
classification of dialects. Kurdish Studies 3. 143–176.
Samvelian, Pollet. 2007. A lexical account of Sorani Kurdish prepositions. In Stefan Muller
(ed.), Proceedings of the 14th International Conference on Head-Driven Phrase Struc-
ture Grammar, 235–249. Stanford: CSLI Publications.
Skrzypek, Dominika. 2010. Between a demonstrative and an article. The status of -in in Old
Swedish. Folia Scandinavica 11. 145–162.
Unger, Christoph. 2014. Badini Kurdish modal particles dê and da: Procedural semantics
and language variation. Kurdish studies 3. 177–200.
3.4. The Neo-Aramaic dialects of northern Iraq
Geoffrey Khan
The Neo-Aramaic dialects of northern Iraq are modern vernacular forms of

Aramaic, a Semitic language, which has a documented history in the Middle East
of over 3,000 years, the earliest inscriptions being datable to approximately 1,000
B.C.E. The dialects in Iraq belong to the North-Eastern Neo-Aramaic (NENA)
subgroup of Neo-Aramaic.1 This is a highly diverse subgroup of over 150 dialects
spoken by Jews and Christians originating from towns and villages east of the
Tigris River in northern Iraq, southeastern Turkey and western Iran. The NENA
subgroup is distinct from three other subgroups of Neo-Aramaic. These include
the western subgroup spoken by Christians and Muslims in the villages of Maʿlula,
Baxʿa and Jubʿadin in the region of Damascus, the Ṭuroyo subgroup, spoken by
Christians in the Ṭūr ʿAbdīn region of southeastern Turkey (see Khan, this volume,
chapter 2.5), and Mandaic, spoken by Mandaeans in the cities of Ahwaz and Khor-
ramshahr in Iran (see Khan, this volume, chapter 4.4). None of these subgroups is
as diverse as NENA.
The NENA dialects are not direct descendants of any of the earlier literary
forms of Aramaic, although they exhibit close affinities to Syriac and Jewish
Babylonian Aramaic. The dialects rather have their roots in a vernacular form of
Aramaic that existed in antiquity in the region of northern Mesopotamia, which
differed from the vernacular underlying the literary languages of Syriac to the
west and Jewish Babylonian Aramaic to the south. This is shown by the fact that,
although exhibiting numerous innovations, they are more conservative than Syriac
and Jewish Babylonian Aramaic in some features (Khan 2007; Fox 2008). Some of
the dialects, moreover, have preserved lexical items of apparently Akkadian origin
that do not appear in dictionaries of the earlier forms of literary Aramaic.2 Struc-
tural differences among the NENA dialects are likely to reflect, to some extent,
migrations of communities in the northern Mesopotamian region.
Within NENA itself one may identify a number of subgroups on the basis of
linguistic structure and lexicon. There is a fundamental split between the dialects
spoken by the Jews and those spoken by the Christians. This applies even to cases
where Jewish and Christian communities lived in the same town, such as Koy
Sanjak and Sulemaniyya.
1
The term was coined by Hobermann (1988: 557).
2
For Christian dialects see Krotkoff (1985) and Khan (2002a: 515, 2017: 262), and for
Jewish dialects Sabar (2002: 12).
306 Geoffrey Khan
Within Jewish NENA dialects of Iraq two main subgroups are clearly identifi-
able: (1) the so-called lišana deni subgroup, which was spoken in the northwest of
Iraq mainly in Dohuk province in locations to the west of the Great Zab river, such
as Zakho, Dohuk, Amedia, Betanure, and Nerwa. (2) Dialects spoken in locations
east of the Great Zab river in the Arbīl and Sulemaniyya provinces, e. g. Rustaqa,
Ruwanduz, Koy Sanjak, villages of the plain of Arbel (Irbīl),3 the village of Dobe
which is on the western bank of the Great Zab, Ḥalabja and Sulemaniyya to the
east, and as far south as Khanaqin on the Iranian border. This subgroup is generally
referred to as trans-Zab (following Mutzafi 2008b). In addition, there was a small
cluster of dialects in the region of Barzan, located in Iraq between these two areas,
which exhibit a linguistic profile that is transitional between the two main sub-
groups (Mutzafi 2002, 2004b). The Jewish trans-Zab subgroup continues across
the border into Iran (see Khan, this volume, chapters 2.5 and 4.4).
The divisions among the Christian NENA dialects of Iraq on structural and
lexical grounds are not so clear-cut. One may, nevertheless, identify subgroups of
dialects with distinctive features. These include the following: (1) the dialects of
the Mosul plain in Nineveh province (e. g. Qaraqosh, Alqosh, Telkepe, Barətla), (2)
the dialects in the far north close to the Turkish border in Dohuk province mainly
west of the Zab (e. g. Aradhin, Barwar [= Barwari Bala], Nerwa, Derigne) and (3)
the dialects east of the Zab in Arbīl and Sulemaniyya provinces (e. g. Ankawa,
Shaqlawa, Bədyal, Koy Sanjak, Sulemaniyya). As can be seen, subgroups (2) and
(3) correspond broadly to the Jewish lišana deni and trans-Zab subgroups respec-
tively. In southeastern Turkey several dialects were spoken that are closely related
to the dialects of the Iraqi Christian subgroup (2), especially in the Tiyare region
(Khan, this volume, chapter 2.5). Subgroup (3) has one outlying dialect in western
Iran (Sanandaj) (Khan, this volume, chapter 4.4). These three Christian subgroups
in Iraq are distinguished for convenience of presentation,4 but it must be stressed
again that this division is very broad-brush. The subgroups exhibit considerable
internal diversity and the boundaries between them are not always clear. Fig. 1
and Fig. 2 below show the locations of the Jewish NENA dialects and the Chris-
tian NENA dialects respectively. In the following presentation the Iraqi Jewish
subgroups are referred to as IJ1 (=lišana deni) and IJ2 (= trans-Zab), and the Iraqi
Christian subgroups as IC1, IC2 and IC3.
3
The Jews in the town of Arbel itself spoke Arabic (Jastrow 1990).
4
For other classifications see Socin (1882: v), Duval (1896: 125), Maclean (1901: ix–xi)
and Tsereteli (1977; 1978).

The Neo-Aramaic dialects of northern Iraq
Figure 1: Locations of the Jewish NENA dialects mentioned in this chapter

307
308
Geoffrey Khan
Figure 2: Locations of the Christian NENA dialects mentioned in this chapter

The Neo-Aramaic dialects of northern Iraq 309
2. History of the speech community
The Christian and Jewish Aramaic-speaking communities in northern Iraq have

deep historical roots in the region. This is reflected by the linguistic background
of the NENA dialects, which, as remarked above, are shown by a number of fea-
tures that originate in vernacular Aramaic in antiquity when Akkadian was still a
living language. The Christians are descendants of communities that converted to
Christianity in the first half of the 1st millennium C.E. (Fiey 1965). Until the end
of the 1st millennium C.E. Aramaic was spoken more widely by Christians and
Jews throughout Iraq. By around the 10th century the communities in the centre
and south of the country had largely become Arabic-speaking. In the course of the
Middle Ages many of the Christians of central Iraq migrated to the north of the
country to avoid forced conversion with its consequent arabicization. We know, for
example, that around the eleventh century a large proportion of the Christian pop-
ulation of Tikrit migrated northwards and settled in the Mosul region, many in the
town of Qaraqosh (Khan 2002a: 2–3). In the Timurid invasions of the fourteenth
century the Christian communities of central and southern Iraq were decimated,
which would have hastened the end of any vestiges of Aramaic in this region.
However, the Christian communities in the mountains in the north of the country
and in villages on the nearby plain who fled to the mountains were largely spared
the ravages of these invaders. There are only scant historical records concerning
the Jews of northern Iraq before the modern period. Mann (1931: 216) publishes
letters datable to the beginning of the sixteenth century that mention villages in the
highlands of northern Iraq that had Jewish communities. Some of these had been
abandoned by their Jewish population by the twentieth century or the number of
Jewish inhabitants was considerably reduced, reflecting the fact that the Jewish
population of the region declined after the sixteenth century. One reason for this
reduction is likely to have been the forcible conversion of some Jews to Islam,
especially in the nineteenth century in some areas (Soane 1912: 186).
The momentous events of the twentieth century brought about the destruction
and displacement of a large number of the Aramaic-speaking communities of the
region. The decisive event for the Christians was the First World War, during which
the Christians in the mountains of what is now southeastern Turkey and the Urmi
region in northwestern Iran sided with Russia against the Ottoman government.
As a result, their villages were destroyed by the Turkish army and they were either
massacred or driven from their homes (Gaunt 2006; Rockwell 1916). A large pro-
portion of the survivors settled in refugee camps, villages and towns in Iraq. Some
of the Aramaic-speaking communities in northern Iraq are, therefore, relatively
recent immigrants from southeastern Turkey and northwestern Iran. The founda-
tion of the State of Israel in 1948 was the decisive event for the Aramaic-speaking
Jews of the region, since shortly thereafter the entire Jewish population of Iraq
emigrated to the new state, except for a few isolated individuals, most of whom
310 Geoffrey Khan
converted to Islam. In recent decades in Iraq there has been further displacement
of the Aramaic-speaking Christian communities. In the late 1970s and early 1980s,
for example, many of the numerous Christian villages in northern Iraq and south-
eastern Turkey were destroyed by the Iraqi and Turkish armies during the Kurdish
uprisings. Further displacements of Christian communities from the villages of the
Mosul plain have taken place recently. These recent events have driven many of
the Christian communities to leave Iraq and settle elsewhere, mainly in Europe,
North America and Australia.
3. Current status of the speech community
As remarked, the NENA subgroup is extremely diverse and the size of the speech
community of individual dialects varies considerably. Some dialects are now
reduced to a handful of final speakers. This applies in particular to the Jewish dia-
lects, all of which are now highly endangered and will not survive much beyond
the next two decades. Several Jewish dialects have recently become extinct, e. g.
Nerwa in 2012, Sandu in 2010, Challa in 2007, Shahe in 2000, Bədyal in 1998
(Mutzafi 2014). Some dialects of small Christian communities in Iraq are also
highly endangered, e. g. the Christian dialect of Bədyal. In general, however,
Christian dialects are generally less endangered than Jewish ones. It is not possible
to give precise statistics for individual dialects, but some of the larger ones have
several thousand speakers, taking into account speakers in migrant communities.
Among Christian Aramaic-speakers from Iraq a particularly widely spoken dialect
is a koine that developed in the towns after the merging of various refugee com-
munities after the First World War (Odisho 1988: 19–38).
The Aramaic-speaking Christians of Iraq belong to various church denomi-
nations, including the Assyrian Church of the East, Ancient Church of the East,
Chaldean, Syriac Orthodox and Syriac Catholic.
Before the disturbances of the First World War the majority of Christians were
agriculturalists living in villages. After the settlement of refugees in the towns
many adopted various urban professions. Most of the Aramaic-speaking Jews were
town-dwellers, who were small traders, goldsmiths, tailors, weavers and dyers.
Some of the traders were shopkeepers while others were pedlars who hawked their
wares around the surrounding countryside. Some of the Jews who remained in
villages down to the twentieth century, such as the communities of the villages of
Betanure, Shukho and Sandu, were agriculturalists (Mutzafi 2008a, 2014).
All speakers of Aramaic who were resident or originated in northern Iraq are bilin-
gual and in many cases trilingual. The Christians of the Mosul plain region speak
Arabic in addition to their local Aramaic dialects. They have been exposed to both
local Arabic vernaculars and to the literary Arabic of the Iraqi education system. In
many cases Arabic has had a major impact on their Aramaic speech, particularly in
peripheral dialects such as Qaraqosh, where code-switching between Aramaic and
Arabic is common. Aramaic-speakers elsewhere are generally fluent in Kurdish,
the language of the majority of the surrounding Muslim population, and those who
have been through the Iraqi education system know Arabic. In some areas Arama-
ic-speakers spoke Turkoman, especially Turkoman converts to Christianity.
Although the spoken NENA dialects have their roots in antiquity, they were
not committed to writing in any systematic way until the seventeenth century, from
which period onwards there are extant texts written by Christians (Mengozzi 2002;
Mengozzi and Braida 2011) and Jews (Sabar 1976).
The literary form of Christian NENA that emerged in the seventeenth century
was used by Chaldean Christians in the region of Mosul down to modern times
(Macuch 1976). Another type of Christian literary NENA emerged in the nineteenth
century in Urmi, in northwestern Iran. This was originally developed by western
missionaries to that region in order to propagate Bible translations in the vernacular,
but subsequently gave rise to a very variegated literature (Murre-van den Berg
1999). This form of written NENA was based on the spoken Christian dialect of
the Urmi region, but came to be used widely by Christians in Iraq. Within the last
few decades there has been an increase in the production of literature written in
this form of Christian literary NENA. Furthermore education through the medium
of the literary language has, in recent years, undergone considerable development
in northern Iraq. All forms of literary NENA written by Christians are influenced
by the classical Syriac language, in its orthography, grammatical structure and
lexicon.
The written form of the Jewish NENA dialects, by contrast, is not influenced
by a classical form of Jewish Aramaic. It has continued to be produced down to
modern times, especially to commit to writing traditions of oral literature (Aloni
2014). No systematic form of education system in Jewish NENA developed in the
communities. Within Israel, in recent decades, however, the communities have
organized various cultural events in an attempt to keep their spoken dialects alive.
These include theatre plays, poetry readings and other types of entertainment.
312 Geoffrey Khan
5. Phonology
The phonology of the NENA dialects of northern Iraq exhibit some historical
developments that are common to the majority of the NENA dialects and some
that are particular to individual dialects. In many cases such developments have
been stimulated by language contact. Dialects differ in the degrees of conserva-
tism in the phonology, the less conservative dialects eliminating a greater degree
of sounds that do not occur in contact languages. In what follows a selection of
cases will be presented from the consonant system. The term ‘earlier Aramaic’ is
used to refer to what can be assumed to be the ultimate historical form of features
attested in NENA.
5.1. The consonant group bgdkpt

vowels, which can be represented *[ḇ], *[ḡ], *[ḏ], *[ḵ], *[p̄], *[ṯ]. In the NENA
dialects these fricative allophones became phonemicized, with the result that
minimal pairs are found with stops and fricatives, e. g. C. Qaraqosh (IC1):5 šata
‘year’—šaθa ‘fever’; guda ‘wall’—guða ‘churn’. In the NENA dialects there is a
general uniformity of the reflexes of the historical fricatives *[ḇ], *[p̄] and *[ḵ].
These reflexes are as follows:
*[ḇ] > /w/ C. Qaraqosh (IC1) sawa (< *sāḇā) ‘grandfather’
*[p̄] > /p/ C. Qaraqosh (IC1) kipa (< *kēp̄ ā) ‘stone’
*[ḵ] > /x/ C. Qaraqosh (IC1) baxə (< *bāḵē) ‘he weeps’
In some dialects in southeastern Turkey /x/ has subsequently shifted to the pharyn-
geal /ḥ/ (Khan, this volume, chapter 2.5, §5.1).
A shared innovation of NENA that distinguishes the subgroup from the neigh-
bouring Ṭuroyo subgroup of Neo-Aramaic is the loss of the voiced velar fricative
[ḡ]. In the NENA dialects of Iraq the reflexes of this are generally either the laryn-
geal stop /ʾ/ or zero. The historical development was *[ḡ] > /ʿ/ > /ʾ/ > Ø,6 in which
the velar first shifted to a pharyngeal fricative before weakening to a laryngeal and
5
The abbreviations C. and J. are used to denote Christian and Jewish dialects respec-
tively. Unless otherwise indicated, the data in this chapter are taken from the pub-
lished descriptions of NENA dialects. These include Khan (2002a C. Qaraqosh), Khan
(2008 C. Barwar), Khan (1999 J. Arbel), Khan (2004 J. Sulemaniyya), Talay (2001
C. Nerwa), Coghill (2003 C. Alqosh), Greenblatt (2011 J. Amedia), Mutzafi (2004a
J. Koy Sanjak), Mutzafi (2008a J. Betanure), Napiorkowska (2015b C. Diyana-Zari-
waw). Some material is taken from the data of the NENA database project gathered by
G. Khan, E. Coghill and R. Borghero.
6
The velar fricative *[ḡ] was an allophone of /g/ in earlier periods of Aramaic, and so is
written in square brackets. The modern reflexes /ʿ/ and /ʾ/, by contrast, are phonemes.
zero (Tsereteli 1990). The reflex /ʾ/ is, accordingly, more archaic than the reflex Ø.
The distribution of /ʾ/ differs across the Iraqi NENA dialects:
C. Qaraqosh (IC1): šraʾa (< *šrāḡā) ‘lamp’, raʾəš (< *rāḡeš) ‘he wakes’,
ləʾma (< *laḡmā) ‘jaw’, paləʾ (< *pāleḡ) ‘he divides’, palʾa (< pālḡā) ‘she
divides’
C. Barwar (IC2): šraya, ṛayəš, pela (< *paḡlā) ‘radish’, păle, păla
C. Diyana-Zariwaw (IC3): šreya, ṛayəš, lama, păle, păla
J. Amedia (IJ1): šraʾa, raʾəš, peʾla, paleʾ, palʾa
J. Koy Sanjak (IJ2): zoʾa (< *zōḡa) ‘pair’, pella, ṛăʾəš, păle, palya
The pharyngeal is preserved in a few words in some Jewish dialects in pharynge-
alized environments, e. g. J. Amedia l-ʿoya, J. Koy Sanjaq loʿa (< *ḷə-ḡawwāyā)
‘inside’.
A considerable diversity is exhibited by the NENA dialects of Iraq in the
reflexes of the interdental fricatives *[ṯ] and *[ḏ]. These are preserved in most of
the Christian dialects of IC1 and IC2, e. g.
C. Qaraqosh (IC1): xaθa (< *ʾaḥāṯā) ‘sister’, ʾiða (< *ʿēḏā) ‘festival’
C. Barwar (IC2): xaθa ‘sister’, ʾeða ‘festival’
In the IC2 dialect of Nerwa the unvoiced /θ/ has merged with the voiced /ð/, e. g.
C. Nerwa (IC2): xaða ‘sister’, ʾeða ‘festival’
Some Christian dialects in the far northwestern periphery of IC1 and IC2 in and
around Zakho and adjacent to the Tigris replace the interdentals with stops or
sibilants, e. g.
C. Bidaro: xata ‘sister’, ʾeda ‘festival’
C. Peshabur: xasa ‘sister’, ʾeza ‘festival’
Dialects on the eastern periphery of IC2 and most dialects in IC3 to the east of the
Zab have replaced the interdentals with stops or sibilants, e. g.
C. Xərpa (IC2): xasa ‘sister’, *ʾiza (< *ʾīḏā) ‘hand’
C. Bədyal (IC3): xata ‘sister’, ʾida ‘hand’
In some Christian dialects the unvoiced interdental is replaced by a sibilant and the
voiced interdental by a stop, but never the reverse, e. g.
C. Sulemaniyya (IC3): xasa ‘sister’, ʾeda ‘festival’
Among the Christian IC3 dialects there are, however, some archaizing dialects that
preserve interdentals, in some cases only the unvoiced one, e. g.
C. Ankawa (IC3): xaθa ‘sister’, ʾida ‘hand’
314 Geoffrey Khan
Both interdentals are preserved in the speech of Christians in Hawdiyan (northeast

Iraq) who originate from Para and Shapat in southeastern Turkey.7
There is some conservatism of the interdentals among the Jewish dialects of
the IJ1 group, e. g.
J. Dohuk (IJ1): xaθa ‘sister’, ʾiða ‘hand’
Most dialects of IJ1, however, exhibit some degree of replacement of interdentals
with either stops or sibilants. If there is conservatism, it is of the unvoiced inter-
dental, e. g.
J. Betanure/J. Amedia (IJ1): xaθa ‘sister’, ʾida ‘hand’
J. Nerwa (IJ1): xasa ‘sister’, ʾida ‘hand’
J. Zakho (IJ1): xasa ‘sister’, ʾiza ‘hand’
The Jewish trans-Zab IJ2 group of dialects have the distinctive feature of the shift
of interdentals to the lateral /l/, e. g.
J. Arbel (1J2): mala (< *māṯā) ‘village’, ʾila (< *ʾīḏā) ‘hand’
In some dialects the velar stops /k/ and /g/ are palatalized to the affricates /č/ [ʧ ~
ts] and /j/ [ʤ ~ dz]. This is found predominantly in the northeastern sector of the
region in the IC3 group, e. g. C. Diyana-Zariwaw čalba (< *kalba) ‘dog’, jare (<
gare) ‘roof’, C. Bədyal čạlo (< *kalo) ‘bride’, julpa (< *gulpa) ‘wing’. Some dia-
lects also in IC2 exhibit some degree of palatalization in these consonants before
unrounded vowels, e. g. C. Barwar kyasa [ˈcyæːsæ] (< *kasa) ‘stomach’, gyawṛa
[ˈɟyæwræ] (< gawra) ‘man’.
In the C. Bədyal dialect in the northeastern sector of the region in the IC3
group the alveolar stops *t and *d are palatalized to the affricates /č/ [ʧ] and /j/ [ʤ]
before high vowels, e. g. C. Bədyal beta ‘house’, beči (< *beti) ‘my house’, ʾida
‘hand’, ʾiji (< *ʾidi) ‘my hand’.
In dialects in the north of the region in the area where Kurmanji Kurdish is
spoken, there are several Kurmanji loanwords with unaspirated unvoiced stops,
e. g. C. Barwar (IC2) ṱanga ‘saddle girth’ (where ṱ = unaspirated) < Kurmanji
teng. In these NENA dialects some unaspirated stops have developed in NENA
words, usually where a voiced stop is followed by an adjacent laryngeal stop, e. g.
C. Barwar ṱ-ile ‘that he is’ < d-ʾile.
5.2. Pharyngeals
The pharyngeal consonants *ʿ (voiced pharyngeal fricative) and *ḥ (unvoiced
pharyngeal fricative) generally undergo the following shifts in NENA dialects:
7
Lidia Napiorkowska (pc.).
*ʿ > /ʾ/ or Ø
*ḥ > /x/
The development of the voiced pharyngeal *ʿ is parallel to that of *[ḡ] and the
distribution of the reflexes /ʾ/ and Ø in the dialects of Iraq is the same as described
above for the reflexes of *[ḡ], e. g.
C. Qaraqosh (IC1): zraʾa (< *zrāʿā) ‘cultivation’, daʾər (< *dāʿer) ‘he
returns’, bəʾta (< *bēʿtā) ‘egg’, šaməʾ (< *šāmeʿ) ‘he hears’, tarʾa (< tarʿā)
‘door’
C. Barwar (IC2): zraya, dayəṛ, bita, šăme, tăra
C. Diyana-Zariwaw (IC3): zṛaya, ṱayǝr, bita, šăme, taṛa
J. Amedia (IJ1): pqaʾa (< *pqāʿā) ‘explosion’, daʾər, beʾta, šameʾ, tarʾa
J. Koy Sanjak (IJ2): beʾe (< *bēʿē) ‘eggs’, daʾər, beta, šăme, tăra
The development of the unvoiced pharyngeal *ḥ to the velar fricative /x/ is an
innovation of the NENA subgroup and is exhibited by all dialects of Iraq, e. g.
C. Barwar (IC2): xmara (< *ḥmārā) ‘ass’, xaθa (< *ʾaḥāṯā) ‘sister’
In the far northwest close to the Turkish border the secondary shift of /x/ > /ḥ/ is
found in the dialect of Derabun, e. g. ḥata (< *xaθa < *ʾaḥāṯā) ‘sister’. The shift
has also been applied to cases of /x/ that are reflexes of *[ḵ], e. g. baḥe (< *baxe <
*bāḵē) ‘he weeps’. This is a feature of a cluster of NENA dialects in southeastern
Turkey (Khan, this volume, chapter 2.5, §5.1), from where the Derabun commu-
nity originated.
In some of the NENA dialects of Iraq the original voiced and unvoiced pharyn-
geal are preserved in words containing emphatic, i. e. pharyngealized, consonants,
including /q/. This is found predominantly in the groups IC1, IC3, IJ1 and IJ2, but
not in the Christian dialect group IC2, e. g.
C. Qaraqosh (IC1): ṭaʿən (< *ṭāʿen) ‘he lifts’, raḥoqa (< *rāḥōqā) ‘distant’
C. Sulemaniyya (IC3): ṱaʿəl (< *ṭāʿel) ‘he plays’, ḥaruqa (< *rāḥōqā)
‘distant
J. Zakho (IJ1): ʿapṣa (< *ʿapṣā) ‘gallnut’, raḥuqa (< *rāḥōqā) ‘distant’
J. Koy Sanjak (IJ2): təʿna (< *ṭaʿnā) ‘load’, ḥănəq (< *ḥāneq) ‘he chokes’
This preservation has come about by a process of ‘long distance consonant agree-
ment’ (Khan 2013). As can be seen above, in J. Koy Sanjak (IJ2), the preservation
of the pharyngeal /ʿ/ entailed the neutralization of the pharyngealization of the
emphatic *ṭ resulting in plain /t/ (Mutzafi 2004a: 28–33) and in C. Sulemaniyya
the pharyngealization is lost from the *ṭ resulting in an unaspirated plain /ṱ/.
316 Geoffrey Khan
In some of the NENA dialects of Iraq the emphatic pharyngealized consonants *ṭ
and *ṣ of earlier Aramaic have been preserved as pharyngealized segments. This
is found predominantly in dialects in groups IC1, IC2 and IJ1 in the western areas
for the region, e. g.
C. Qaraqosh (IC1): ṭina (< *ṭīnā) ‘mud’, ṭarpa (< *ṭarpa) ‘leaf’, ṣəpra (<
*ṣeprā) ‘sparrow’, ṣliwa (< *ṣlīḇā) ‘cross’
The pharyngealization of the emphatic consonants in such dialects tends to spread
to adjacent segments in ways that are similar to what has been documented in
Arabic dialects. As in some Arabic dialects (Hoberman 1989; Watson 1999), the
spreading tends to be restricted to segments in adjacent syllables and is blocked by
high front vowels (Khan 2013).
In dialects in groups IC3 and IJ2, which were spoken in northeastern areas of
the region, the pharyngealization of the historically emphatic consonants *ṭ and *ṣ
tends to be weakened.
In the Jewish dialects of IJ2 /ṭ/ and /ṣ/ are weakly pharyngealized or lose their
pharyngealization altogether. The pharyngealization is neutralized in particular
when the word contains a pharyngeal consonant, e. g. J. Koy Sanjak (1J2) təʿna
(< *ṭaʿnā) ‘load’. This process of development seems to be due to convergence with
the behaviour of pharyngealization in the Kurdish dialects described by Margaret
Kahn (1976: 49–52). In Kahn’s analysis a pharyngealized consonant in Kurdish is
the result of the presence of an underlying pharyngeal segment in the word. Such
words can also be realized with the pharyngeal on the phonetic surface in place
of the pharyngealized consonant. Kahn found this type of pharyngealization in
Kurmanji Kurdish dialects north of Urmi. It is significant, however, that today it
does not seem to exist in the Sorani dialects of Iraq or Iran. The feature is found
in the Jewish NENA dialects that were in Sorani Kurdish areas (i. e. 1J2 group in
Iraq and the Jewish dialects of western Iran, see Khan, this volume, chapter 4.4).
It is possible, therefore, that this typology existed at an earlier historical period
in Sorani Kurdish and the NENA dialects converged with this typology and pre-
served it whereas it was lost in Sorani Kurdish.
In many of the Christian dialects in the northeastern region of Iraq (IC3) the
pharyngealization of the historical emphatic consonants *ṭ and *ṣ is totally lost
in some words, e. g. C. Diyana-Zariwaw (Napiorkowska 2015a) seda (< *ṣaydā)
‘hunt’. Reflections of the presence of the original emphatic phonemes have,
however, been preserved in a number of features. These include the lack of aspi-
ration of the original *ṭ after loss of emphasis, e. g. xaṱṱe (< *ḥaṭṭē) ‘wheat’, and
backing and/or lowering of adjacent vowels, e. g. xɒsa (< *ḥāṣā) ‘back’, sɔma
[ˈsˁo̱ːma] (< *ṣawmā) ‘fast (noun)’.
Pharyngealization is also associated with a group of non-historical emphatic
consonants that includes the sonorants /ṛ/, /ḷ/ and the labials /ḅ/, /ṃ/, /p/,̣ also /ẉ/.
This group developed in certain words, in some cases, it seems, to fulfil the func-
tional need of creating semantic contrasts, e. g.
(1) J. Amedia (IJ1)
ṃḷele ‘he filled’ mlele ‘it sufficed’
ḅaza ‘hawk’ b-aza ‘she will go’
bəṛke ‘pool’ bərke ‘his knee’
The phonemic contrast between the plain and emphatic rhotics /r/ : /ṛ/ is particu-
larly common in the NENA dialects. In a few cases the number of rhotic phonemes
has proliferated to three. In Iraq this is found, for example, in the dialect of C.
Barətla (IC1), which has an emphatic /ṛ/, a plain tap /r/ and a retroflex /ɽ/ (Arnold
2002; Mole 2015):
paɽe < *pārā ‘bran, husks left after sieving’
paṛe < *parre ‘lambs’
pare < *pāre ‘money’ (< Kurd. pāra)
5.4. Vowels
Various shifts have taken place in the quality of vowels across the NENA dialects
of Iraq. These include the following.
*/ē/
An original *ē has shifted to /i/ in word-medial position in a few dialects. These
include C. Qaraqosh (IQ1) on the southwestern periphery and some Christian dia-
lects in the northeastern sector in group IC3, e. g. C. Qaraqosh (IC1), C. Diyana-
Zariwaw (IC3) riša (< *rēšā) ‘head’.
In C. Qaraqosh */ē/ in a final open unstressed syllable undergoes centralization
to /ə/, e. g. xmarə (< *ḥmārē) ‘asses’.
*/e/
The reflex of original short */e/ is generally a centralized /ə/, e. g. C. Qaraqosh
(IC1) šaqəl (< šāqel) ‘he takes’. In many dialects /ə/ is also the reflex of a short-
ened */ī/, e. g. C. Qaraqosh xləmta (< *ḥlīmtā) ‘thick’, but C. Barwar xlimta.
*/u/, */ū/
In several dialects in the northeastern sector of the region there is a tendency to
front */u/ and */ū/ to the region of [ʏ] or [ʉ], represented here as /ü/, e. g. C. Diya-
na-Zariwaw ʾürxa (< ʾurḥā) ‘road’, barüza (< *barrūzā) ‘dry’.
318 Geoffrey Khan
*/ō/
In some Christian dialects in the northeast of the region (IC3) *ō is raised to /u/,
e. g. C. Diyana-Zariwaw (IC3) muxa (< *mōḥā) ‘brain’, C. Bədyal (IC3) šạduṛe
(< *šadore) ‘to send’. In such dialects the vowel sometimes undergoes both raising
and fronting, e. g. C. Bədyal xüna (< *ʾaḥōnā) ‘brother’.
*/a/ and */ā/

The distinction between the historical short */a/ and long */ā/ phonemes has been
lost. These collapsed into a signal phoneme /a/, the length of which is determined
by syllable structure. In some dialects this /a/ phoneme underwent qualitative
changes in certain contexts.
In some dialects, predominantly in the northern sector of the region, /a/ is
raised to /e/ before the laryngeals /ʾ/ (< */ʿ/) and /h/, e. g. J. Amedia (IJ1) ṭeʾna (<
*ṭaʿnā) ‘load’, sehra (< *sahrā) ‘moon’. This has occurred in dialects that sub-
sequently lost the laryngeals, e. g. C. Barwar (IC2) ṭena ‘load’, sera ‘moon’, C.
Diayana-Zariwaw (IC3) zdeta (< *zdaʿtā) ‘fear’. In some of these dialects there is
raising of /a/ also before an originally geminated consonant in a few lexemes, e. g.
C. Barwar (IC2) genawa (< *gannāḇā) ‘thief’, C. Diayana-Zariwaw ginawa; cf.
C. Qaraqosh (IC1) and J. Amedia (IJ1) ganawa.
An /a/ in final unstressed open syllables undergoes backing in C. Telkepe
(IC1), e. g. tamɒ [ˈtæːmɒ] (< *tāmā) ‘there’ (Coghill 2008).
*/aw/
The diphthong */aw/ contracts in most dialects to /o/, e. g. C. Qaraqosh (IC1)
moθa (< *mawṯa) ‘death’. In dialects in the northeastern sector of the region this
/o/ may undergo fronting, e. g. C. Diyana-Zariwaw jöza (< gawzā) or raising, e. g.
C. Bədyal muta ‘death’. In some dialects in the northern sector of the region this
diphthong has been preserved, e. g. C. Barwar mawθa ‘death’.
*/ay/
The diphthong */ay/ contracts in most dialects to either /e/ or /ɛ/, e. g. C. Qaraqosh
(IC1) beθa (< *bayṯā) ‘house’, C. Barwar bɛθa. Some dialects in the far northwest
do not monophthongize the diphthong completely, e. g. C. Bidaro beyta ‘house’.

The onset of syllables may consist of a single consonant C or a cluster CC (with
certain constraints, see below), e. g. (syllable boundaries are marked by a dot):
(2) C. Barwar (IC2)
bɛ.θa ‘house’
smo.qa ‘red’
A syllable in principle does not have a vowel as its onset but always begins with at
least the laryngeal /ʾ/, e. g. C. Barwar ʾá.na ‘I’, ʾá.ti ‘you’, ʾi.lána ‘tree’. In some
trans-Zab Jewish dialects (IJ2) an initial /ʾ/ shifts to /h/, e. g. J. Sulemaniyya hīt
‘there is’ (< ʾīt), heka ‘where?’ (< ʾeka).
There are differences across the dialects of Iraq with regard to the presence of
the laryngeal glottal stop /ʾ/ in clusters in the syllable onset. Some dialects, pre-
dominantly in groups IC1 and IJ1, tolerate /ʾ/ as the first or second segment of the
initial cluster, e. g.
C. Qaraqosh (IC1): ʾsi.ra ‘tied’, rʾi.ša ‘awake’
J. Amedia (IJ1): ʾwi.da ‘done’, rʾi.ša ‘awake’
Dialects of other groups tend not to tolerate /ʾ/ in initial clusters and elide them in
these circumstances.
The cluster is sometimes pronounced without being broken by an epenthetic
vowel. This is generally the case if the first or second segment is a sibilant or
sonorant continuant, e. g. C. Barwar šte.li ‘I drank’, pli.ṭɛ.le ‘he has gone out’.
Otherwise clusters tend to be broken by an ultrashort epenthetic vowel, e. g. C.
Barwar kpi.na [kəpiːna] ‘hungry’. An epenthetic prosthetic vowel is sometimes
pronounced before a cluster, especially if the first segment is a sonorant or sibilant,
e. g. C. Qaraqosh nqa.ša [ənqaːʃa] ‘beating’, ski.na [əskiːna] ‘living’. An alterna-
tive strategy of treating a cluster of two stops is to elide the first of them. This is
found in C. Bədyal, e. g. pina (< *kpina) ‘hungry’, tāna (< *ptāna) ‘plough’.
Across the NENA dialects several monosyllabic nouns have been made bisyl-
labic by geminating a consonant after an epenthetic vowel breaking an initial con-
sonantal cluster, e. g.
(3) C. Barwar
šə́m.ma ‘name’ < *šmā
də́m.ma ‘blood’ < *dmā
Monosyllabicity is tolerated to a greater extent in other grammatical categories.
Historical gemination of consonants has been lost after the low vowel /a/ but is
generally preserved after high vowels, especially /ə/, e. g.
(4) C. Barwar
kaka ‘tooth’ < *kakkā
ʾəzza ‘goat’ < *ʿəzzā
The sequence VCCCV in word internal position is found in certain inflections of
verbs. It may be syllabified VCC.CV or VC.CCV. An epenthetic vowel is usually
inserted between the CC cluster in the coda of VCC or the onset of CCV, e. g.
(5) C. Barwar
mapə́l.xa ~ map.lə́xa ‘she uses’ < maplxa
320 Geoffrey Khan
The rhymes of syllables are in principle bimoraic with a VV or VC structure. As

a result, vowels in open syllables are generally long and those in closed syllables
are short, e. g. C. Barwar ʾi.ði [ʔiːðiː] ‘my hand’, qim.li [qimliː]. In the transcrip-
tion, therefore, in this and other chapters on Neo-Aramaic the default length of
vowels is left unmarked by diacritics, i. e. a vowel without a diacritic in an open
syllable is long and a vowel without a diacritic in a closed syllable is short. The
only exception is the vowel /ə/, which is always short. Diacritics are used only
when the length of the vowel deviates from this principle. The surface phonetic
duration of vowels is conditioned by a number of other factors, such as stress and
speed of utterance. Some cases of phonetically short vowels in open syllables arise
from the presence of an underlying glottal. This is seen clearly in a conservative
dialect such as C. Qaraqosh (IC1), in which the glottal stop /ʾ/ alternates with
zero, e. g.
(6) C. Qaraqosh
daʾ.ra [daʔraː] ~ dă.ra [daraː] ‘she returns’
tar.ʾa [tʰarʔaː] ~ tăr.a [tʰaraː] ‘door’
In the alternants in which the glottal stop has been dropped the vowel length
remains unchanged, which can be interpreted as reflecting the continued presence
of the glottal at an underlying level.
5.6. Stress
The basic position of word stress in most NENA dialects in Iraq is on the penul-
timate syllable, e. g. C. Qaraqosh (IC1) béθa ‘house’, k-šáqəl ‘he takes’. Clitics,
such as the copula, are not taken into account in stress placement, e. g.
(7) C. Qaraqosh
béθan=ilə
house.our=cop .3ms
‘it is our house’
Stress is generally not placed on particles affixed to verbal forms, which can result
in the placement of the stress before the penultimate syllable of the word. This
lack of stressing of affixes is a property of clitics. It is not the case, however, that
the stress is completely unaffected by the affixes, since it moves to the end of the
verbal stem. I shall continue, therefore, to refer to these elements as verbal affixes
rather than clitics, e. g.
(8) C. Qaraqosh
k-šaqə́l-wa-lhə
ind -take.prs .3 ms - pst - l .3 pl
‘he used to take them’
In the trans-Zab Jewish dialects of the IJ3 group the basic position of stress is on
the final syllable of the word, e. g. J. Sulemaniyya belá ‘house’. This is likely to
be due to convergence with Kurdish. As in other NENA dialects, stress is not in
principle placed on clitics or verbal affixes, e. g.
(9) J. Sulemaniyya
a. laxxá=ya
here=cop .3 fs
‘she is here’
b. gărə́š-wa-lu
pull.prs .3 ms - pst - l .3 pl
‘he used to pull them’
6.1. Pronouns
The NENA dialects of Iraq have various series of third person pronoun. A dis-
tinction may be made between deictic pronouns, which point to a referent, and
anaphoric pronouns, which signal that a referent is identifiable in the context (typ-
ically the discourse history) without pointing to its location. A syntagmatic distinc-
tion may be made between independent pronouns and adnominal pronouns that are
combined with a noun. Deictic pronouns typically distinguish between near deixis
and far deixis. With regard to gender and number, third person pronouns distin-
guish masculine singular, feminine singular and common plural.
There are differences across the NENA dialects of Iraq with regard to the
degree to which these various types of third person pronouns are expressed by
separate morphological forms. Some dialects are lacking distinct forms for some
of the slots, as can be seen in the table below, which presents the independent third
person pronouns from a selection of dialects:
322 Geoffrey Khan
Table 1: Independent third person pronouns in Iraqi dialects of NENA

C. Qaraqosh C. Barwar C.Diyana- J. Amedia J. Arbel
(IC1) (IC2) Zariwaw (IC3) (IJ1) (IJ2)
Near deixis
3 ms ʾaða ʾawwa ʾawwa ʾayya ʾiyya
3 fs ʾaði ʾayya ʾayya ʾayya ʾiyya
3 pl ʾanə ʾanna ʾanne ʾanna ʾanne
Far deixis
3 ms ʾawa ʾăwaha ʾawó ʾawạhạ ʾo
3 fs ʾaya ʾăyaha ʾayó ʾayạhạ ʾo
3 pl ʾane ʾănaha ʾanó ʾanạhạ ʾanne
Anaphoric
3 ms ʾahu ʾaw ʾo ʾo ʾo
3 fs ʾahi ʾay ʾe ʾe ʾo
3 pl ʾanhən ʾani ʾan ʾan ʾoni
In this sample of dialects, J. Arbel (1J2) has a much smaller inventory of morpho-
logical forms than the other dialects. It lacks gender distinctions in the singular
forms and also lacks a formal distinction between far deixis and anaphoric forms
in the singular.
The C. Qaraqosh dialect preserves archaic forms of the singular near deixis
pronouns with an interdental (ʾaða, ʾaði) and so exhibits heterogeneity in the
morphological bases of the singular near deixis and far deixis forms. The other
dialects in the table, by contrast, have lost these archaic forms by forming the
near deixis pronoun base on the analogy of those of far deixis and anaphoric
pronouns.
Some dialects have developed further morphological distinctions to express
various forms of ‘intensive’ deixis, such as ‘very far deixis’ or ‘contrastive deixis’,
e. g.
(10) a. C. Barwar
3 ms . far ʾăwáha, very far ʾăwaʾha
b. C. Peshabur (Coghill 2008: 97–98)
3 ms far ʾăwáḥa, very far ʾăwáaḥa, extremely far ʾăwaʾḥa
c. C. Diyana-Zariwaw
3 ms far ʾawó, contrastive ʾawóha, ʾawóhani
The anaphoric pronouns must in principle always be bound to a referent in the
discourse, and so have not developed into definite articles. The Jewish trans-Zab
dialects (IJ2) have borrowed a definite article suffix from Sorani Kurdish with the
form –ăke, e. g. J. Arbel bela ‘house’, belăke ‘the house’. The form –ăke corre-
sponds to the oblique form of the Kurdish definite article (-ăkay) rather than the
nominative form (–ăka).
The first person singular independent pronoun is of common gender and has the
form ʾana across all dialects of Iraq.
Differentiation of gender is expressed by distinct forms of the second person
singular independent pronoun in many dialects in the northwestern sector of Iraq
in the IC1 and IJ1 groups:
IC1: 2ms. ʾahət, 2fs. ʾahat (C. Qaraqosh); 2ms. ʾayət, 2fs. ʾayat (e. g. C. Alqosh,
C. Telkepe, C. Baṭnaya, C. Tisqopa, C. Karamlesh)
IJ1: 2ms. ʾahət, 2fs. ʾahat (J. Betanure, J. Dohuk, J. Nerwa, J. Zakho)
Similar forms are found sporadically in the Christian dialects of northeastern Iraq,
e. g.
IC3: 2ms. ʾayət, 2fs. ʾayat (e. g. C. Ankawa, C. Sulemaniyya)
In other dialects a second person singular form of common gender is used, the
most widespread forms being ʾat and ʾati.
There are no gender distinctions in the independent first and second person
plural pronouns, just as there is no gender distinction in the 3rd person plural
demonstrative pronouns. The most widespread forms of the first person plural
independent pronoun are ʾaxni and ʾaxnan. Some dialects have innovative forms
based on the analogy of the 2pl. pronoun, e. g. ʾatxan on the analogy of 2pl. ʾatxun
(J. Arbel, J. Ruwanduz, J. Rustaqa). The most widespread form of the 2pl. pronoun
is ʾaxtun, with various analogical innovations attested in individual dialects, e. g.
IC1 ʾaxnutən (C. Baṭnaya)
IC2 ʾaxtum (C. Nargezine-Xarjawa, C. Shōsh-u-Sharmə), ʾaxtaxum
(C. Nargezine-Xarjawa), ʾaxtoxum (C. Shōsh-u-Sharmə, C. Xərpa),
ʾaxtutən (C. Tən)
IC3 ʾaxtun (C. Shaqlawa), ʾaxnoxən (C. Sulemaniyya)
IJ1 ʾaxtoxun (J. Betanure, J. Dohuk, J. Zakho)
1J2 ʾatxun (J. Arbel, J. Ruwanduz, J. Rustaqa, J. Ruwanduz); ʾaxnəxun (J. Koy
Sanjak), ʾaxnăxun (J. Sulemaniyya); ʾatoxun (J. Dobe)
6.2. Pronominal suffixes

Most dialects have a single set of pronominal possessive suffixes for singular and
plural nouns, although historically the suffixes had different forms according to
whether the noun stem was singular or plural. Some dialects in IC1, however,
have preserved a formal distinction in part of the paradigms. One such dialect is C.
Qaraqosh, which has two forms of suffixes for the plural persons:
324 Geoffrey Khan
Table 2: Pronominal possessive suffixes in C. Qaraqosh

On singular nouns On plural nouns
3 ms -əḥ -əḥ
3 fs -aḥ -aḥ
3 pl -hən -ehən
2 ms -ux -ux
2 fs -ax -ax
2 pl -xun -exun
1s -i -i
1 pl -an -enan
Many dialects that have only one set of suffixes generalize the use of forms that
were originally attached to plural nouns. This applies especially to the 3pl. and
2pl., in that they use forms related to –ehən and –exun in the paradigm above, e. g.
3pl. –eyhən (C. Mar-Yaqo), –ehən (C. Baṭnaya, C. Mangesh, C. Umra
d-Shish), -ɛhən (C. Aradhin), -ɛyḥən (C. Peshabur)
2pl. -ɛxu (C. Barwar), -exun (J. Nerwa), -exün (C. Bədyal)
Variation across the NENA dialects of Iraq is found in particular in the 3rd person
suffixes. Examples are given below of a selection of 3ms. forms:
3ms. -eh (C. Baṭnaya, C. Ankawa)
-əḥ (C. Qaraqosh, C. Alqosh, C. Karamlesh, C. Mangesh, C. Peshabur)
-e (C. Telkepe, C. Aradhin, C. Barwar, C. Dere, C. Hamziye, C. Bədyal,
J. Betanure, J. Nerwa, J. Zakho)
-eu (J. Arbel, J. Sulemaniyya)
-u (C. Umra d-Shish, C. Zariwaw)
In C. Sulemaniyya the gender distinction in the 3s has been lost and the original
3ms form -e now is used with common gender reference. This matches the col-
lapse of gender distinction in the demonstrative pronouns in this dialect (e. g. ʾo
goṛa ‘that man’, ʾo baxta ‘that woman’).

In all NENA dialects nouns are either masculine or feminine in gender. Most mas-
culine nouns have the inflectional ending –a, e. g. C. Barwar sus-a ‘horse’. This,
however, is not a gender marker, but rather a nominal inflection of unspecified
gender, since a certain group of feminine nouns also end in –a, e. g. C. Barwar
ʾɛn-a ‘eye’. A large proportion of feminine nouns have marked feminine inflec-
tional endings with one of the variant forms –ta and –θa, or phonetic develop-
ments of these, e. g. C. Barwar sus-ta ‘mare’, ṣlo-θa ‘prayer’, šən-θa ‘sleep’, J.
Arbel ṣlo-la ‘synagogue’ (< *ṣloθa), šən-da (< *šənθa). Some loanwords are left
without being morphologically adapted and do not have these inflectional endings.
All loanwords, however, are assigned a gender for agreement purposes, even if
they are from languages that do not distinguish gender, such as Sorani Kurdish
(Khan 2004: 177–180, 2009: 180–184).
The plural of nouns is marked by a variety of inflectional endings, which
replace the aforementioned singular endings. The most common ending is –e (or
phonetic variants thereof), which is the unmarked plural marker, as shown by the
fact that loanwords that are inflected with a plural marker are generally given this
ending. It is used on many masculine nouns with singulars in –a, on some feminine
nouns ending in –a or –ta and on loanwords ending in zero in the singular, e. g.
C. Barwar kep-a (masc.) ‘stone’, kep-e ‘stones’, ʾilan-a (fem.), pl. ʾilan-e ‘trees’,
nun-ta (fem.) ‘fish’, nun-e ‘fish (pl.)’, čangəl (fem. sing.) ‘fork’ (< Kurd. çengel),
čangal-e ‘forks’. There is no clear correlation, therefore, between the unmarked
plural ending –e and the gender of the noun.
Nouns ending in –a in the singular may form plurals by various other endings,
which may be categorized as marked plurals, e. g.
(11) C. Barwar
ʾumra (masc.) ‘church’ ʾumrane ‘churches’
ʾaqla (fem.) ‘leg’ ʾaqlaθa ‘legsʼ
xona (masc.) ‘brother’ xonăwaθa ‘brothers’
bɛθa (masc.) ‘house’ bɛθwaθa ‘houses’
Feminine nouns ending in –ta (or variants thereof) form plurals by an assortment
of plural endings, e. g.
(12) C. Barwar
karta ‘load’ karaθa ‘loads’
qalθa ‘basket’ qalaθa ‘baskets’
susta ‘mare’ susyaθa ‘mares’
As can be seen, there is some correlation between these marked plural endings and
the gender of the noun. Some of the marked plural endings are associated, more-
over, with particular semantic classes of nouns. The ending –ăwaθa, for example,
in C. Barwar is commonly used with kinship terms, and the ending –ane is used
predominantly with inanimate nouns.
In dialects on the Mosul plain (IC1) the Arabic plural ending -at is sometimes
added to feminine nouns with singulars ending in –a. Most, but not all, such nouns
are loanwords, e. g.
326 Geoffrey Khan
(13) C. Qaraqosh
fárda (f.) ‘load’ fárdat ‘loads’
hə́nna (f.) ‘thing’ hə́nnat ‘things’
ʾilána (f.) ‘tree’ ʾilánat ‘trees’
Some nouns, mostly those with the bisyllabic pattern CVCCa in the singular,
form plurals by reduplicating the final consonant, e. g. C. Barwar təlpa ‘eyelash’,
təlpape (eyelashes).
Adjectives are inflected for gender in the singular and inflect for the plural
by the unmarked plural ending –e. Unlike nouns, adjectives do not have marked
plural endings e. g.
(14) C. Barwar
basim-a (ms.) ‘pleasant’
basim-ta (fs.)
basim-e (pl.)

A noun may be modified by another noun in an annexation construction. In most
dialects this is formed by the general subordinating particle d, which has been
inherited from earlier Aramaic. Some dialects preserve the original syntax of the
particle and sometimes prefix it to the dependent noun, e. g.
C. Qaraqosh (IC1): bšala d-našə ‘the cooked food of people’ (Khan 2002a:
207)
Even in such dialects, however, there is a tendency to suffix the particle to the head
noun, e. g.
C. Qaraqosh (IC1): ḥukiṯəd sawi ‘the story of my grandfather’
The suffixing of the particle to the head noun is the norm in most dialects. In such
dialects the d is generally devoiced to t:
C. Barwar (IC2): bronət malka ‘the son of the king’ (Khan 2008: 397)
J. Arbel (1J2): babət Yosəf ‘the father of Joseph’ (Khan 1999: 168)
In some dialects the ending -ət of the head noun is contracted or elided, especially
in phrases expressing kinship relations or inalienable possession, e. g.
C. Barwar (IC2): brōn mami ‘the son of my uncle’ (Khan 2008: 400)
In the Jewish trans-Zab dialects on the east periphery of the region the subor-
dinating particle is generally omitted and the head noun has its normal nominal
inflectional vowel:
J. Sulemaniyya (IJ2): šəmma brona ‘the name of the boy’ (Khan 2004: 192)
The genitive particle d remains prefixed to demonstrative pronouns in the depend-
ent phrase in dialects that normally mark annexation by a -ət suffix on the head
phrase and even in IJ2 dialects like J. Sulemaniyya that do not normally use a gen-
itive particle in annexation. There are grounds for interpreting these as reflecting
the bonding of the particle with the pronoun to form a genitive allomorph of the
pronoun. This is shown by the fact that it may be combined with a head noun with
-ət in dialects that use this suffix:
(15) C. Barwar (IC2) (Khan 2008: 399)
ʾahwalt-ət d-o naša
condition-gen gen -that man
‘the condition of that man’
(16) J. Sulemaniyya (IJ2) (Khan 2004: 192)
bela d-oni
house gen -those
‘the house of those (people)’
In some dialects an analytical type of genitive construction is used in which the
dependent noun is introduced by an independent genitive particle. This is the case,
for example, in J. Arbel (IJ2), which uses the particle ʾot in this function (< ʾo 3s
pronoun + annexation suffix t). The head noun in such constructions may option-
ally have the annexation suffix -ət:
(17) a. J. Arbel (IJ2) (Khan 1999: 224)
bšəlmane ʾot-ʾArbel
Muslims gen -Arbel
‘the Muslims of Arbel’
b. jiran-ət ʾot hulaʾe
neighbours-gen gen -Jews
‘the neighours of the Jews’
In the Jewish trans-Zab dialects an Iranian ezafe particle is occasionally used in
annexation constructions, e. g.
J. Sulemaniyya (IJ2): maktab i hulaye ‘the school of the Jews’ (Khan 2004:
192)
328 Geoffrey Khan
The NENA dialects exhibit a root and template type of verb morphology, which
is characteristic of Semitic languages. A root is a lexical morpheme consisting
of a non-concatenative sequence of radicals. This sequence of radicals is merged
with non-concatenative templatic patterns consisting of vowels and, in some cases,
additional consonants, that express inflection and derivation of the lexical root.
Verbal roots consist of either three (triliteral) or four radicals (quadriliteral).
Inflections of verbs in NENA are formed from a series of inflectional bases.
Representative examples of these from the C. Qaraqosh dialect are as follows:
Table 3: Inflectional bases of the verb ‘to open’ in C. Qaraqosh

Stem I
p-θ-x ‘to open’ (triliteral)
Present base: paθəx
Past base: pθix
Resultative participle: pθixa
Imperative: pθox
Infinitive: pθaxa
The present base is derived historically from the active participle of earlier
Aramaic, and the past base and resultative participle from the erstwhile passive
participle. The finite present and past forms of earlier Aramaic (yiqṭol and qṭal
respectively) have been lost.
NENA preserves some of the derivational verbal morphology of earlier
Aramaic, whereby derived forms of verbs are created by morphologically more
complex sets of inflectional base patterns. The dialects have either one or two
derived sets of inflectional bases. C. Qaraqosh has two derived patterns, which
may be designated stem II and stem III, the simple pattern being stem I:
Table 4: Verbal stem II in C. Qaraqosh

q-d-m ‘to present, propose’
Present base: mqadəm
Past base: mqudəm
Resultative participle: mqudma
Imperative: qadəm
Infinitive: qadomə
Table 5: Verbal stem III in C. Qaraqosh

r-k-x ‘to soften’
Present base: markəx
Past base: murkəx
Resultative participle: murkxa
Imperative base: ʾarkəx
Infinitive: ʾarkoxə
The main function of the derived verbal stems is to express increases of transitiv-
ity. In earlier Aramaic there were derived stems that expressed a decrease in the
transitivity of the basic stem I, but these have been lost in NENA (Göransson 2015;
Khan 2015). C. Qaraqosh preserves the most archaic forms of the inflectional and
derivational patterns of the verbs, with the greatest morphological heterogeneity.
In most other NENA dialects various degrees of analogical levelling have taken
place within the set of inflectional bases of the derivational stems and across der-
ivational stems. This levelling is most advanced in dialects in the northeast of
Iraq, in the groups IC3 and IJ2. In many Christian dialects of IC3 and all Jewish
dialects of IJ2 the stem II derivation has been lost (Mutzafi 2004c). In some of the
Jewish IJ2 dialects on the northeastern periphery of Iraq the levelling of the vowel
patterns across the two surviving derivational stems is total, e. g.
Table 6: Stem I, J. Sulemaniyya

p-l-x I ‘to open’
Transitive Intransitive
Present base: păləx păləx
Past base: pləx plix
Resultative participle: pəlxa plixa
Imperative: păləx păləx
Infinitive: păloxe păloxe
Table 7: Stem II, J. Sulemaniyya

m-rd-x II ‘to boil’
Transitive Intransitive
Present base: mardəx mardəx
Past base: mərdəx mərdix
Resultative participle: mərdxa mərdixa
Imperative: mardəx mardəx
Infinitive: mardoxe mardoxe
330 Geoffrey Khan
It can be seen that stem I and stem II have identical vowel patterns. In the present,
imperative and infinitive the vocalism of stem II has been generalized to stem I.
In the past and resultative participle the vocalism of stem II, which is typically
causative transitive in function, has been transferred to transitive verbs of stem I.
Vice-versa, the original vocalism of the past and resultative participle of stem I is
preserved in intransitive verbs and this has been extended also to stem II to express
a detransitivized, anticausative.
The present and past inflectional bases are inflected for person and number by
two sets of suffixes, referred to here as D-suffixes and L-suffixes, which indicate
the grammatical relations of verbal arguments in the clause.
D-suffixes are historically ‘direct’ clitic pronouns agreeing in number, gender
and person with the nominative subject of a clause in the original nominative—
accusative alignment system of Aramaic. L-suffixes are historically prepositional
phrases consisting of the dative preposition l- and a pronominal suffix.
The forms of the suffixes in J. Sulemaniyya are as follows:
Table 8: D- and L-suffixes, J. Sulemaniyya

D-suffixes L-suffixes
3 ms -Ø -le
3 fs -a -la
3 pl -i -lu
2 ms -et -lox
2 fs -at -lax
2 pl -etun -lăxun
1 ms -na -li
1 fs -an -li
1 pl -ex -lan
The D-suffixes are morphologically less marked than the L-suffixes. Historically
the third person D-suffixes contain an expression only of number and gender, the
person marker of the third person being zero. In the case of the 3ms form, also the
number and gender marking is zero. The first and second person D-suffixes have
developed from a coalescence of number, gender and person markers. The L-suf-
fixes, on the other hand, all contain pronominal elements that syncretize person,
number and gender, as well as a relational element l-, which is historically a dative
preposition.
The /l/ of the L-suffixes assimilates to a stem-final sonorant /n/ or /r/ in most
NENA dialects, e. g. C. Barwar (IC2) kpinne ‘he became hungry’ (< kpin-le). In
some Christian dialects in the northeast of the region this assimlation as been
extended to other consonants, e. g. C. Diyana-Zariway (IC3) ptəxxe ‘he opened’
(< ptəx-le).
In J. Sulemaniyya, in the northeastern periphery of Iraq, there is ergative align-

ment in clauses with verbs derived from the past base. Clauses with verbs derived
from the present base, on the other hand, have nominative–accusative alignment.
The following examples show how an intransitive subject (S) aligns with a tran-
sitive subject (A) in clauses with present base verbs but aligns with the transitive
object (O) in clauses with past base verbs. In each case the common marker is
the 3fs D-suffix on the verb. The case-marking of the arguments of the clause is
expressed by the cross-referencing D-suffixes and L-suffixes on the verb. Verbs
derived from the past base express, in principle, the past perfective:
(18) J. Sulemaniyya
Present base
a. brat-i barux-e garš-á-lu
daughter-my friend-pl pull. prs - d .3 fs - l .3 pl
‘My daughter (A) pulls the friends (O).’
b. brat-i samx-a
daughter-my stand. prs - d .3 fs
‘My daughter (S) stands.’
Past base
c. barux-e brat-i gərš-á-lu
friend-pl daughter-my pull.pst - d .3 fs - l .3 pl
‘The friends (A) pulled my daughter (O).’
d. brat-i smix-a
daughter-my stand.pst -d .3fs
‘My daughter (S) stood up.’
This type of ergative alignment is found also in NENA dialects over the border
in western Iran (Khan, this volume, chapter 4.4, §7.1). As can be seen, there is an
inversion of agreement of the two sets of suffixes with grammatical arguments in
present base and past base verbs. For present base verbs, a D-suffix cross-refer-
ences the A argument and an L-suffix cross-references the O argument, whereas in
past base verbs, this is reversed, a D-suffix cross-references the O argument and an
L-suffix cross-references the A argument. In intransitive clauses with both present
base and past base verbs a D-suffix cross-references the S argument:
(19) Alignment of verbal agreement, according to stem and transitivity
a. V.prs —D-suffix(A)—L-suffix(O)
b. V.pst —D-suffix(O)—L-suffix(A)
c. V.prs —D-suffix(S)
d. V.pst —D-suffix(S)
332 Geoffrey Khan
Constructions such as (18c), in which the A argument is expressed by the histor-

ically dative L-suffix, are active ergative constructions rather than passive con-
structions.8 The suffixes that cross-reference the O argument in both present-base
constructions (like 18a) and past-base constructions (like 18c) are only present if
the O is definite.
The type of alignment of J. Sulemaniyya is, in fact, what Dixon (1994: 71)
refers to as split-S rather than canonical ergativity. D-suffixes on the past base
occur only when the subject of the intransitive verb is an undergoer of the action.
An ‘undergoer’ subject is one that is affected by a change in configurational state
or location. When the subject is agentive or is the source of the action (e. g. the
source of the production of sound), an intransitive verb has an L-suffix like transi-
tive verbs, e. g. ṣrəx-le ‘He shouted’ (Khan 2004: 300).
In most NENA dialects in Iraq, however, a hybrid form of alignment is found
in verbs formed from past bases. In this type of alignment, which may be called
‘extended ergative’ (Doron and Khan 2012), the marked L-suffixes are used to
express both transitive and intransitive subjects, e. g.
(20) C. Barwar
a. xawr-ăwaθ-i griš-a-la brati-i
friend-pl -my pull.pst - d .3 fs - l .3 pl daughter-my
‘My friends pulled my daughter.’
b. brati qim-la
daughter-my rise.pst - l .3 fs
‘My daughter rose.’
This type of case-marking should not be analysed as nominative–accusative, since
the transitive clauses exhibit some characteristic features of ergativity (Doron and
Khan 2012: 230–233). This is likely to have been the more archaic type of con-
struction in NENA. The more canonical type of ergative alignment found in the
eastern periphery of NENA in J. Sulemaniyya and Iran appears to be an innovation
in NENA, induced by a greater degree of convergence with the ergative construc-
tions of Kurdish (Khan 2016).9
The present base, which is historically an active participle, is used in NENA
dialects with a variety of functions, in many cases with the addition of auxiliary
particles. The unmarked form without auxiliary particles is widely used to express
irrealis. Present indicative and future are formed from the present base by means
of added particles, e. g.
8
For the arguments in favour of this analysis see Doron and Khan (2012) and Khan
(2016).
9
For more details about ergativity in NENA see Coghill (2016).
(21) C. Barwar
3 ms paθəx ‘(that) he open’ (irrealis)
paθəx-wa ‘(that) he opened’ (irrealis past)
ʾi-paθəx ‘he opens’ (indicative habitual)
ʾi-paθəx-wa ‘he used to open’ (indicative habitual past)
bəd-paθəx ‘he will open’ (future)
bəd-paθəx-wa ‘he would open’ (future in the past)
The indicative particles that are used in the dialects of the region include ʾi-, as in
C. Barwar, and k-, e. g. C. Qaraqosh (IC1) k-paθəx. The ʾi- is likely to be a reduced
form of the copula.
Several dialects in Iraq use a particle with the form qam- prefixed to the
present base to express the perfective past.10 In some dialects it has undergone
various degrees of phonetic lenition, e. g. C. Barwar qəm-, C. Qaraqosh kəm-,
C. Sulemaniyya tam-. This construction was used in place of constructions formed
from the past base to avoid using D-suffixes to express pronominal objects on past
perfective verbs formed from the past base. This is the case in particular when the
pronominal object is first or second person.11 The present base allows the use of
L-suffixes to express the direct pronominal object, e. g.
(22) C. Barwar1
a. griš-la
pull.pst - l .3 fs
‘she pulled’
b. qəm-garš-a-li
pst -pull. prs - d .3 fs - l .1 s
‘she pulled me’
Most NENA dialects do not allow the accumulation of two pronominal object
suffixes on ditransitive verbs. If a verb has a pronominal direct object suffix, a
pronominal indirect object must, in many dialects, be expressed by an independent
prepositional phrase:
(23) C. Barwar
yawəl-le ṭlal-i
give. prs .3 ms - l .3 ms to-1 s
‘he gives it to me’
10
For the historical background of this particle see Fassberg (2015).
11
For the motivation for the repair mechanism see Doron and Khan (2012) and Khan
(2016).
334 Geoffrey Khan
In some dialects in the group IC1 on the Mosul plain, such as C. Qaraqosh (Khan
2002a: 143–144) and C. Telkepe (Coghill 2010), and also dialects further east
in the IC3 group, such as C. Shaqlawa, present base verbs may express the indi-
rect pronominal object by an L-suffix and the direct pronominal object by a form
that resembles an enclitic copula (see below) and can be interpreted as a enclitic
pronoun (Khan 2012a). This expression of the pronominal direct object is restricted
to the third person:
(24) C. Qaraqosh
kewi-ləḥ=ina
give. prs .3 pl - l .3 ms = pro .3 pl
‘They give them to him’
The NENA dialects in Iraq have various types of copula. These include deictic
copulas, which draw attention to a referent or to a situation, and neutral copulas
which express simple predication. Paradigms of the deictic and the simple copula
in C. Qaraqosh and C. Barwar are as follows:
Table 9: Copulas in NENA dialects of Iraq

Deictic copula Simple copula
C. Qaraqosh C. Barwar C. Qaraqosh C. Barwar
3ms kilə hole =ilə =ile
3fs kila hola =ila =ila
3pl kina holɛ =ina =ilɛ
2ms kiyət hot =iyət =iwət
2fs kiyat hot =iyat =iwat
2pl kiyetu hotu =iyetu =iwitu
1ms kiyən hon =iyən =iwən
1fs kiyan hon =iyan =iwan
1pl kiyax hox =iyax =iwəx
The simple copula is generally attached to the predicate as an enclitic, though in

some dialects it has a degree of freedom of movement.
A characteristic feature of many of the NENA dialects of Iraq is the expression
of the resultative perfect by combining the resultative participle with a copula, e. g.
C. Qaraqosh kilə griša ‘he has pulled’
C. Barwar grišɛle (= griša + ile)
Unlike clauses that have past base verb forms, resultative perfect constructions
have nominative–accusative alignment in most dialects, with the copula and par-
ticiple agreeing with the subject in transitive clauses. In (25) J. Sulemaniyya (IJ2)
on the eastern periphery of the region, however, a hybrid type of alignment is

found, whereby the copula agrees with the subject but the participle agrees with
the object, e. g.
(25) J. Sulemaniyya (IJ2)
ʾo-gora baxt-ăke grəšta-aw-ye
that-man woman-the pull. ptc . fs - pro .3 fs - cop .3 ms
‘That man has pulled the woman’
In several dialects in the northeastern sector of the region, resultative perfects are
formed by a fossilized, uninflected form of the copula, e. g. lā (< 3fs copula –ila).
This is found, for example, in C. Bədyal (Table 10), as shown in the paradigm
below. In the third person, lā is used with the participle alone, the distinction
between 3ms, 3fs and 3pl being expressed by the gender and number inflection of
the participle. In the first and second persons a person marker is required, so an
inflected form of the enclitic copula is added to the participle:
Table 10: Resultative perfect in C. Bədyal (IC3)

3 ms lā griša ‘he has pulled’
3 fs lā grəšta ‘she has pulled’
3 pl lā griše ‘they have pulled’, etc.
2 ms lā grišewət
2 fs lā grišewat
2 pl lā grišütün
1 ms lā grišewən
1 fs lā grišewan
1 pl lā grišewax
In some Jewish dialects of IJ2, a resultative perfect is formed by combining a fos-

silized copula with a past base verb form, e. g.
(26) J. Arbel (IJ2)
lā grəš-le ‘he has pulled’
lā grəš-la ‘she has pulled’
lā grəš-lu ‘they have pulled’, etc.
In the J. Rustaqa dialect (IJ2), originally spoken on the northeastern periphery of
the region, the resultative perfect exhibits an additional distinctive feature, viz.
the inflection of the past base with D-suffixes rather than L-suffixes in intransitive
clauses. Intransitive past perfective verbs are inflected with L-suffixes:
336 Geoffrey Khan
Table 11: Resulative perfect in J. Rustaqa (IJ2)

Intransitive past perfective: qim-la
rise.pst - l .3 fs
‘She rose’
Transitive perfect: grəš-la
pull.pst - l .3 fs
‘She pulled’
Intransitive perfect: lā qim-a
rise.pst - d .3 fs
‘She has risen’
Transitive perfect: lā grəš-la
cop pull.pst - l .3 fs
‘She has pulled’
In many NENA dialects of the region the perfect verbal form is used not only to
express the resultative perfect but is also used to express the evidential or an event
in the remote past. In some dialects it is used as a narrative form, especially in
folktales (Khan 2012b).12
The majority of NENA dialects in Iraq express the present progressive by con-
structions containing the copula. Many combine the copula with the infinitive to
perform this function. The original form of this construction contained the locative
preposition b- ‘in’, which is still used in some dialects in some circumstances. This
seems to have arisen under the influence of Eastern Armenian, which exhibits a
similar construction for the progressive (see Martirosyan, this volume, chapter 2.2,
§4.2), e. g.
(27) Eastern Armenian, Ardvin
mnalis im
stay.inf . loc cop .1 s
‘I am staying’
The preposition b- is, however, frequently dropped and this is the regular situation
in many dialects, e. g.
(28) J. Betanure (IJ1)
ʾile bə-graša ~ ʾile graša ‘he is pulling’
ʾila bə-graša ~ ʾila graša ‘she is pulling’
ʾilu bə-graša ~ ʾilu graša ‘they are pulling’
12
The use of one and the same form for resultatives, evidentials, and as a special narrative
form for use in folktales, is clearly reminiscent of the range of functions associated with
the verbal suffix -mIş in Turkish.
This construction is not used in some dialects along the southern periphery of the
region. Instead progressive constructions are used in which the copula is combined
with a verb form derived from the present base, e. g.
(29) C. Qaraqosh (IC1)
kilə k-garəš ‘he is pulling’
kila k-garša ‘she is pulling’
kina k-garši ‘they are pulling’
In the eastern sector of the region many dialects use a fossilized uninflected form
of the copula in progressive constructions. In some cases this is combined with a
present base verb form, e. g. J. Arbel lā gărəš ‘he is pulling’, J. Dobe nā gărəš,
C. Xərpa ho k-garəš, Shosh u-Sharmən nə-k-garəš, C. Bədyal ma-k-garəš (Mutzafi
2004c; Coghill 2008). In some dialects the fossilized copula is combined with the
infinitive. Since the infinitive is not inflectable for gender, number or person, these
must be marked by inflected enclitic copulas in all slots of the paradigm beyond
the 3ms, e.g.
Table 12: Progressive constructions with copula, J. Rustaqa (IJ2) (Khan 2002b: 407)
3 ms lā garoša ‘he is pulling’
3 fs lā garoše=la ‘she is pulling’
3 pl lā garoše=lu ‘they are pulling’, etc.
2 ms lā garoša=wet
2 fs lā garoša=wat
2 pl lā garoša=wetu
1 ms lā garoša=wena
1 fs lā garoša=wan
1 pl lā garoša=wex
In C. Ankawa (IC3) the present progressive is expressed by combining the particle

də- with present base verbs, e. g. də-k-garəš ‘he is pulling’. This particle may be
a phonetic variant of the fossilized copula. Alternatively it may be identified as
a subordinating particle which presents the verb as complement of a zero copula
‘(it is) that he pulls’ (Borghero 2015). Indeed, the constructions with an explicit
invariable form of the copula described above may have a similar cleft syntactic
structure, e. g. J. Arbel lā gărəš ‘it is (that) he pulls’.
In some dialects the imperative is strengthened by prefixing the particle da-,
də-, which is a loan from Kurdish, e. g. J. Arbel da-holli! ‘give me!’ (Khan 1999:
282–283), C. Barwar də-pluṭ! ‘go out (sing.)!’ (Khan 2008: 179). C. Barwar also
uses suffixes of Turkic origin added to the imperative, e. g. pluṭ-gən! ‘go out
(sing.)!’ (Khan 2008: 179).
338 Geoffrey Khan
8.1. Copula clauses

The default position for the simple basic copula in predications is after the predi-
cate, generally as an enclitic. This applies to both equative clauses, identifying the
referent of the subject with that of a nominal in the predicate, or ascriptive clauses,
which ascribe a property to the referent of the subject, e. g.
(30) C. Barwar (IC2)
a. mamux šwawan=ile
uncle.your neighbour.our=cop .3 ms
‘Your uncle is our neighbour.’
b. mamux kpinɛle (< kpina=ile)
uncle.your hungry=cop .3 ms
‘Your uncle is hungry.’
In many dialects the copula is moveable. In C. Barwar, for example, it is some-
times placed before the predicate in equative clauses, e. g.
(31) C. Barwar
mamux ʾile šwawan
uncle.your cop .3 ms neighbour.our
‘Your uncle is my neighbour.’
In such dialects the copula may be cliticized to components of the clause that are
in narrow focus, e. g. in (32), in which the referent of the subject of the clause is
in narrow focus, the remainder of the clause being presuppositional. The narrow
focus, represented by upper case below, is expressed prosodically by the nuclear
stress of the intonation group.13
(32) C. Barwar
a. màmux=ile šwáwan.|
uncle.your=cop .3 ms neighbour.our
‘YOUR UNCLE is our neighbour.’
b. màmux=ile kpína.|
uncle.your=cop .3 ms hungry
‘YOUR UNCLE is hungry.’
In some dialects the positioning of the copula is more rigidly fixed and regularly
occurs as an enclitic on the predicate irrespective of the type of the predication
13
In the examples the intonation group boundary is represented by |, the nuclear stress of
the intonation group by a grave accent (v̀ ) and non-nuclear stresses by acute accents (v́ ).
or the information structure of the clause. This applies in particular to the Jewish
dialects in IJ2, e. g.
(33) J. Sulemaniyya
a. šwáwan tujā̀ r=ye.|
neighbour.our merchant=cop .3 ms
‘Our neighbour is a merchant.’
b. šwáwan jəhyà-ye.|
neighbour.our tired=cop .3 ms
‘Our neighbour is tired.’
c. šwàwan tujā́ r-ye.|
‘OUR NEIGHBOUR is a merchant.’
d. šwàwan jəhyá-ye.|
‘OUR NEIGHBOUR is tired.’
The copula is negated by prefixing to it the negative particle la, e. g. C. Barwar lɛle
‘he is not’ (< la + ile), J. Sulemaniyya la-y ‘he is not’ (< la-y). In some cases, the
negative copula has a different stem from the positive copula, e. g. J. Arbel (IJ2)
lewe ‘he is not’ (cf. ile ‘he is’), C. Bədyal (IC3) liwe ‘he is not’ (cf. ile ‘he is’).
In most dialects of the region the negated copula is placed before the predicate:
(34) C. Barwar
bábi lɛ́le làxxa.|
father.my neg . cop .3 ms here
‘My father is not here.’
In the Jewish dialects of the IJ2 group, however, it is placed after the predicate,
e. g.
(35) J. Sulemaniyya
tatí laxxá là-y.|
father.my here neg - cop .3 ms
‘My father is not here.’
8.2. Verbal clauses

The predominant order of the core arguments of intransitive verbal clauses (S and
V) is SV in all dialects. The core arguments of transitive clauses (A, V and O) is
AVO in most dialects, but AOV in the Jewish trans-Zab dialect group (1J2), e. g.
a. šwawan bəd-ʾaxəl-Ø laxma
neighbour.our fut -eat. prs - d .3 ms bread
‘Our neighour will eat bread.’
340 Geoffrey Khan
J. Sulemaniyya (IJ2)
b. šwawan ləxma k-xəl-Ø
neighbour.our bread ind -eat. prs - d .3 ms
‘Our neighbour will eat bread.’
A prepositional phrase expressing an indirect object or some other complement of
the verb is, however, normally placed after the verb in all dialects:
(37) C. Barwar
a. bəd-tafq-əx b-šwawan
fut -meet. prs - d .1 pl on-neighbour.our
‘We shall meet our neighbour.’
J. Sulemaniyya
b. doq-ex b-šwawan
hold. prs - d .1 pl on-neighbour.our
‘We shall hold onto our neighbour.’
A nominal without a preposition that expresses the goal of a verb of movement is
normally placed after the verb in all dialects, including those of the IJ2 group, e. g.
(38) C. Barwar
a. zil-lan bɛθa
go.pst - l .1 pl home
‘We went home.’
J. Sulemaniyya
b. zil-ex bela
go.pst - d .1 pl home
‘We went home.’
In all dialects the predominant order of clause components may be changed for
pragmatic purposes.
Verbal clauses are negated by the negative particle la, e. g.
šwawan la xil-le laxma
neighbour.our neg -eat. pst -L.3 ms bread
‘Our neighour did not eat bread.’
The future particle is not used when the verb is negated, e. g. C. Barwar bəd-ʾaxəl
‘he will eat’, la-ʾaxəl ~ la-y-axəl ‘he will not eat’. The verb in the second of the
negated alternatives (y-axəl) has the habitual prefix.
In all dialects except some in the southwestern periphery of the region on the
Mosul plain (the IC1 group) the imperative form may be negated, e. g. C. Barwar
(IC2) la pluṭ! ‘Do not go out.’ In C. Qaraqosh (IC1) a prohibition cannot be
expressed by negating an imperative, but rather the negative particle is com-
bined with an irrealis verb form, e. g. la palṭət! ‘Do not go out!’ (Khan 2002a:
351).
In dialects that express the progressive or perfect by a prefixed fossilized
copula particle lā, the negator is placed after this particle, e. g.
(40) J. Arbel (Khan 1999: 115)
lā la-ḥqe-li
cop neg -speak.pst - l .1 s
‘I have not spoken.’
In the Jewish on the eastern periphery dialects when la negates a perfect formed
by the combination of a resultative participle and a copula auxiliary, the negator is
placed before the participle rather than the auxiliary, e. g.
(41) J. Sulemaniyya (Khan 2004: 105)
a. la smiqa-y
neg redden.ptc - cop .3 ms
‘It has not become red.’
b. smoqa la-y
red neg - cop .3 ms
‘It is not red.’

The coordinative linkage of main clauses may be asyndetic or marked by a par-
ticle. When marked by a particle, this is generally either (i) a clausal connective
particle or (ii) a particle marking topical prominence.
The first strategy is performed by the Aramaic connective particle w or the
loaned Arabic particle fa-. The usual practice is to place the particle before the last
in a chain of sequential clauses, e. g.
(42) C. Qaraqosh (IC1)
ʾrə́q-lə baθr-èhə,| kəm-madə̀r-Ø-hə|
run.pst - l .3 ms after-them, pst -return. prs - d .3 ms - l .3 pl
ʾu-θí-lə l-bèθa.|
and-come.pst - l .3 ms to-house
‘He ran after them, returned them and came home.’ (Khan 2002a: 463)
In general, the particle fa marks a greater boundary in the discourse than does w.
In a narrative, speakers use fa as a means of signalling the onset of the main stages
in the progression of the events (Khan 2002a: 465, 2008: 918).
Particles marking topical prominence that are used in clausal connection
include the Aramaic particle ʾap and the Kurdish enclitic –ži or phonetic variants
342 Geoffrey Khan
thereof (e. g. –iš, all glossed below as top ). This is typically combined with a
clause initial subject, which has an identifiable referent, e. g.
(43) C. Barwar (IC2), (Khan 2008: 920)
zìltɛ-la ṱóṱo| mǝ́r-a màlka| …
go. ptcp .- cop .3 fs old.woman said.pst - l .3 fs king
ʾáp-ʾawwa šaqə́l-Ø-la ðá-kista qa-dày.|
top -he take. prs - d .3 ms - l .3 fs one-purse for-obl .3 fs
‘The old woman went and said “Oh King …” Then, he takes a purse for
her.’
(44) J. Sulemaniyya (IJ2) (Khan 2004: 401)
xól-la hyè-Ø-wa.| bába bràt-ăke-š|
indeed-not come.pst - d .3 ms -back father daughter-the-top
m-ṣálma komùl-ew| mad-hī́ twa-le zbə̀n-ne.|
from-face blackness-his what-exist . pst - l .3 ms sell. pst - l .3 ms
‘He indeed he did not come back. The father of the girl, out of his shame
sold what he had.’
The use of topical prominence particles to mark clausal connection is not used in
C. Qaraqosh and many dialects of IC1 on the western periphery of the region.

The general subordinating particle d inherited from earlier Aramaic is widely used
to introduce subordinate clauses of various types in the Christian dialects of Iraq
and the IJ1 group of Jewish dialects. These include the following:
9.2.1. Relative clauses

Some dialects preserve the archaic construction of earlier Aramaic in which the
particle is attached to the beginning of the modifying relative clause:
(45) C. Qaraqosh (IC1), (Khan 2002a: 475)
ʾáwa qə́ṭma d-k-pàyəš-Ø-wa.
that ash rel - ind -remain.prs -d .3 ms - pst
‘the ash that was left’
In many dialects the particle is suffixed to the head noun and devoiced:
(46) C. Barwar (IC2), (Khan 2008: 952)
ʾɛ-brat-ət maluš-i-wa-la jullə
that-girl-rel dress.prs - d .3 pl - pst - l .3 fs clothes
‘the girl whom they dressed in clothes’
In the Jewish IJ2 dialect J. Arbel the particle generally combines with an anaphoric
pronoun rather than being attached directly to the head noun:
(47) J. Arbel (IJ2), (Khan 1999: 388)
kré ʾot-loš-ì-lu
suits dem . rel -wear.prs - d .3 pl - l .3 pl
‘the suits that they wore’
9.2.2. Clausal complements of adverbs and prepositions

ʾə́mma d-g-yaʾl-í-wa l-íta
when comp - ind -enter. prs - d .3pl - pst to-church
šalx-í-wa čəròxə.|
take.off. prs - d .3 pl - pst shoes
‘When they entered a church, they took off their shoes.’
In many dialects the particle is suffixed to the adverbial head:

(49) C. Barwar (IC2), (Khan 2008: 982)
ʾíman-t y-aθ-ɛ́-wa dwáṛa qúrba
when-comp ind -come. prs - d .3 pl - pst pasture near
l-màθa,| y-asq-í-wa berìye.|
to-village ind -ascend. prs - d .3 pl - pst milkmaids
‘When they came to the pasture land near the village, milkmaids would
come up.’
9.2.3. Irrealis complements of verbs

ʾáxni g-báʾ-ax d-šák-ax l-ànə.|
we ind -want. prs - d .1pl comp -complain. prs - d .1pl about-them
‘We want to complain concerning them.’
9.2.4. Factual complements of verbs

ðə́ʾ-la d-íla niyə́t-tə xrùta.|
know.pst -L.3 fs comp -cop .3 fs intention-their bad
‘She knew that their intention was bad.’
344 Geoffrey Khan
In some dialects the d is occasionally replaced by a particle from a contact lan-

guage, as in the following factual complement clause:
k-ə́ð-i bəʾánnahu k-ína tíwə ʾánə
ind -know. prs - d .3 pl comp ind - cop .3 pl settled dem
suràyə.|
Christians
‘They know that (= Arabic biʾannahu) the Christians have settled there.’
The replacement of d by loaned particles is the norm in the IJ2 dialects on the
eastern periphery of the region:
(53) J. Sulemaniyya (IJ2), (Khan 2004: 414–415)
bába ʾó-brona ga-ləbl-á-le ḥajì-ye-le.|
father dem -boy rel -take.pst - d .3 fs - l .3 ms haji-cop . pst - l .3 ms
‘The father of the boy who (= Kurdish ga/ka) took her away was a ḥāji.’
(relative clause)
(54) J. Sulemaniyya (IJ2), (Khan 2004: 440)
kăy-én-wa ga-ʾó bratá ʾəl-do-bróna
know. prs - d .3 pl - pst comp - dem girl obj - obl . dem -boy
gb-à. |
love. prs - d .3 fs
‘They knew that the girl loved the boy.’ (complement clause)
In some dialects a subordinating particle is omitted in various constructions. There
is a greater tendency for this in direct complement clauses of verbs (irrealis and
realis) and relative clauses. There is also a geographical correlation, in that asyn-
detic direct complement constructions are more common in the dialects of the
central and eastern peripheries than the IC1 group.
The original Aramaic conditional particle ʾən is used in the protasis of condi-
tional constructions in many dialects of the region, e. g.
(55) C. Barwar (IC2), (Khan 2008: 1004)
ʾən-pálš-əx mən-dáni qaṭl-ì-lən.|
if-fight. prs -d .1 pl with-obl . dem . pl kill. prs -d .3pl .- l .1pl
‘If we fight with them, they will kill us.’
In the IJ2 dialects the particle is regularly replaced by the Kurdish conditional
particle ʾagar:
(56) J. Sulemaniyya (IJ2), (Khan 2004: 436)
ʾágar šabbát là doq-étun,| xafàd kəxl-í.|
if sabbath neg keep. prs - d .2 pl trouble eat. prs - d .3 pl
‘If you do not keep the Sabbath, they will be in trouble.’
In some of the dialects on the Mosul plain, the conditional particle is replaced by
an Arabic conditional particle:
kā́ n k-šaql-í-hə dàha,| ʾəm-madə́r-Ø-hə
if ind -take. prs - d .3 pl - l .3 pl now fut -return. prs - d .3 ms - l .3 pl
m-bàθər.|
from-after
‘If (= Arabic kān) they take them now, he will bring them back.’
10. Concluding remarks
In general, the dialects in the IC1 group exhibit greater archaism than other NENA
dialects in Iraq. This applies especially to C. Qaraqosh on the southwestern periph-
ery. This archaism is reflected, for example, in the preservation of interdental con-
sonants and also pharyngeal consonants in some contexts and heterogeneity in the
morphology of paradigms of pronouns and verbal forms. Also some aspects of
syntax in IC1 can be regarded as representing archaic features of NENA, such as
the extended ergative alignment and predominant placement of the direct object
argument after the verb. Dialects of other groups exhibit various degrees of inno-
vation in these features, the most innovative being the trans-Zab Jewish dialects in
group IJ2. One factor in the archaism of the IC1 group may have been the greater
exposure of these dialects to Arabic. In many of the archaic features listed above,
for example, parallels can be found in Arabic dialects, which may have had a con-
servative influence. The innovations of other dialect groups are no doubt due to a
greater exposure to Kurdish rather than to Arabic. In the most innovative group,
IJ2, this convergence with Kurdish included the development of canonical ergative
alignment in dialects in the eastern periphery of Iraq such as J. Sulemaniyya.
The lexicon of the NENA dialects of the region has been extensively influ-
enced by the languages with which they have been in contact. This applies even to
the morphologically and syntactically archaic dialects of the IC1 group. Kurdish is
a major source of loanwords. This applies in particular to the Christian dialects of
the IC2 group and the Jewish dialects. There is an old layer of Kurdish loanwords
also in the IC1 group. In the IC1 group numerous Arabic words are used by speak-
ers. Most of these, however, are recent loans, as demonstrated by the fact that most
have not been morphologically adapted. Many, indeed, are likely to be the result
of code-switching rather than lexical borrowing. The lexicon of all dialects also
has some loanwords of Turkic origin, originating mainly either through contact
with Turkoman or Ottoman Turkish. In the trans-Zab Jewish dialects calques of
Kurdish phrasal verbs containing a noun and a light verb are particularly common,
e. g. J. Arbel bāng ʾol ‘he invites’ (literally: he makes invitation; cf. Kurdish bang
kirdin) (Khan 1999: 109).
346 Geoffrey Khan
11. Glossed text sample
C. Barwar dialect (Khan 2008: 1568)
The scorpion and the snake

(| indicates an intonation group boundary. A grave accent (v̀ ) indicates the nuclear
stress in the intonation group. Acute accents (v́ ) indicate non-nuclear stresses in
intonation groups).
(58) ʾíθ-wa xa-nàša| ʾáwwa zíl-le zràʾa.|
exist - pst one-man he go.pst - l .3 ms cultivate.inf
zǝ́ryɛ=le xàčča|, xàrθa,|
cultivate.ptcp . ms = cop .3 ms little then
brōn-dìye| θé-le šǝ́nθ-e dìye,|
son-obl .3ms come.pst - l .3 ms sleep-his obl .3 ms
qəm-madmǝ̀x-Ø-le.|
pst -put.to.sleep.prs - d .3 ms - l .3 ms
ʾíθ-wa xáčča míya ṣlàya,| šaqìθa.|
exist - pst little water descend.prog channel
qəm-maθé-Ø-le tàma.| dmíx-le kəs-dày
pst -bring.prs - d .3 ms - l .3 ms there sleep.pst - l .3 ms by-obl . dem . fs
šaqíθa.| ʾo-xéna zràyɛ=le.|
channel dem -other cultivate.prog = cop .3 ms
xárθa ʾǝ́θyɛ=le xǝ́zyɛle ʾína
then come.ptcp . ms = cop .3 ms see.ptcp . ms = cop .3 ms behold
l-ay-gōt-hòdəx t-šaqíθa| ʾíθ-wa ða-pə̀qqe.|
on-dem . fs -side-farther gen -channel exist - pst one. f -frog
‘There was a man, who went to cultivate. He cultivated a little, then his
son became sleepy and he laid him down to sleep. There was some water
flowing, a channel. He took him there. He went to sleep by that channel
while the other one (his father) was cultivating. Then he came back and he
saw that on the farther side of the channel there was a frog.’
(59) m-ày-gota| pə́qqe šwír-ra l-a-gōt-hódəx
from-dem . fs -side frog jump.pst - l .3 mfs to-dem . fs -side-farther
t-šaqìθa.| klè-la,| síq-la
gen -channel stand.pst - l .3 fs ascend. pst -l .3 ms
ðá ʾaqǝ́rwa xáṣ-a dìya.|
one.f scorpion back-3 fs obl .3 fs
síq-la xáṣ-a dìya|
ascend-l .3 fs back-3 fs obl .3 fs
ʾu-mašur-á-la b-a-gōt-míya
and-transfer. prs -d .3fs - l .3 fs at-dem .3 fs -side-water
ʾáθya làxxa.| brōn-do-náša

come. irr . d .3 fs here son-obl . dem . ms -man
dmìxɛ=le támma,| ʾaqǝ́rwa ṣlé-la
sleep.ptcp . ms = cop .3 ms there, scorpion descend.pst - l .3 fs
m-xā́ ṣ-t pə̀qqe,| θéla wír-ra
from-back-gen frog come.pst - l .3 fs enter.pst - l .3 fs
xo-réšə do-yàla.|
under-head obl . dem . ms -child
‘From one side the frog jumped to the other side of the channel. It stood
still and a scorpion mounted onto its back. It mounted its back and it (the
frog) took it across to this side of the water so that it came here. The son
of that man was sleeping here. The scorpion got down from the back of the
frog and went underneath the head of the child.’
(60) ʾu-šqíl-le xá-kepa báb-e dìye,|
and-take.pst - l .3 ms one-stone father-3 ms obl .3 ms
bábə do-yàla,| mǝ́r-e qaṭl-ǝ́n-na
father obl . dem . ms -boy say.pst - l .3 ms kill. prs - d .1 ms - l .3 fs
yáha ʾaqǝ́rwa dìya.| ʾáyya pə́qqe zíl-la
dem . fs scorpion now. dem . fs frog go.pst - l .3 fs
muθy-á-la ʾaqǝ́rwa m-ʾay-gōt-hódəx mìya.|
bring.pst -d . fs .- l .3 fs scorpion from-dem .3 fs -side-farther water
muθy-á-la láxxa ta-t-nɛs-á-le
bring.pst -d . fs .- l .3 fs here to-comp -bite.irr - d .3 fs - l .3 ms
bròn-i,| brón-i màyəθ-Ø.| qaṭl-ǝ́n-na
son-my son-my die. prs - d .3 ms kill. prs .- d .1 ms - l .3 fs
yáha ʾaqǝ̀rwa.| xárθa xá-ga xéta klè-le.|
dem . fs . scorpion then one-time another stand.pst - l .3 ms
‘His father, the father of the boy, took a stone and said “I’ll kill that
scorpion right now. This frog has brought the scorpion from the other side
of the water. It has brought it here to bite my son, my son will die. I shall
kill that scorpion.” Then he refrained.’
(61) mǝ́r-e škíra šǝ́mm-ux ya-ʾàlaha.|
say.pst - l .3 ms praised.ptcp name-your voc -God
ʾən-ʾáti bắy-ət brón-i máyəθ-Ø
if-you want. prs - d .2 ms son-my die.irr - d .3 ms
b-àyya ʾúrxa,| ʾoð-ǝ́t-la ʾáyya pə́qqe bằlam,|
in-dem . fs way make.irr - d .2 ms - l .3 fs dem frog boat
ʾázəl-Ø m-day-góta xètə šaqíθa,|
go.irr - d .3 ms from-obl . dem . fs other.fs channel
šaql-á-la ʾaqǝ́rwa l-xáṣ-a dìya,|
take.irr - d .3 fs - l .3 fs scorpion on-back-3 fs obl .3 fs
348 Geoffrey Khan
máθy-a l-àyya gota,|

bring.irr - d .3 fs to-dem . fs side
ʾu-ʾáθy-a ṭla-t-nɛs-á-le brón-i màyəθ-Ø,|
and-come.irr - d .3 fs to-comp -bite.irr - d .3 fs - l .3 ms son-my die. irr - d .3 ms
xlàp-ux ya-ʾálaha.| hár ʾəjbóna dìyux=ile.|
sake-your voc -God just will obl .2 ms = cop .3 ms
‘He said “Praise be your name, oh God, if you will that my son die by this
means, that you make this frog into a boat to go from the other side of the
channel and take a scropion on its back, bringing it to this side, for it to bite
my son that he should die, let it be as you wish (Literally: It is for your
sake) oh God, it is your will.”’
(62) rapé-Ø-le ʾo-kèpa| là qəm-qaṭǝ́l-Ø-la
throw.prs - d .3 ms - l .3 ms dem . ms -stone neg pst -kill. prs - d .3 ms - l .3 fs
ʾaqǝ́rwa.| ʾo-xìš-le| píš-le
scorpion dem . ms -go.pst - l .3 ms remain.pst -l .3 ms
zráya zràya.| xá-palgət sàʾət,|
cultivate.prog cultivate.prog one-half-gen hour
xá-saʾət xèna,| ʾáw lɛ̀le ráḥat.|
one-hour other dem . ms neg . cop .3 ms comfortable
mǝ́r-e brōn-díyi hàr hóle míθa.|
say. pst - l .3 ms son-obl .1 s already deic . cop .3 ms dead.ms
ʾaqǝ́rwa nìstə=l-le.| zìlɛ=le|
scorpion bite.ptcp . fs = cop .3 fs - l .3 ms go.ptcp . ms = cop .3 ms
mùklə=l-la táwre t-wɛ́wa zráya.|
stop. ptcp . ms = cop .3 ms - l .3 pl oxen rel - pst . cop .3 pl cultivate.prog
šqíl-le brón-e dìye| mǝ́r-e
take.pst - l .3 ms son-3 ms obl .3 ms say.pst - l .3 ms
šaql-ǝ́n-na jănàze díye| nabl-ǝ́n-na bɛ̀θa.|
take. prs - d .1 ms - l .3 fs coffin obl .3 ms take. prs - d .1 ms - l .3 fs home
‘He threw away the stone and did not kill the scorpion. He went off and
continued to cultivate. Half an hour, an hour later, he was not comfortable.
He said “My son is already dead. The scorpion has bitten him”. He went
and stopped the oxen, which were ploughing. He took his son and said “I’ll
take his coffin and carry him home”.’
(63) dǝ́rya ʾíð-e b-brōn-dìye.| brōn-díye
put. ptcp . ms hand-3ms on-son-obl .3 ms son-obl .3 ms
qìm-wa-le.| mǝ́r-e ha-báb-i
rise.pst - pst - l .3 ms say-l .3 ms voc -father-my
mò bắy-ət?| mǝ́r-e čù-məndi bắy-ən.|
what want. prs .- d .2 ms say.pst -l .3 ms neg -thing want. prs - d .1 ms
prìq-li| ṱ-áz-əx bɛ̀θa.|

finish.pst - l .1 s fut -go. prs - d .1 pl home
xír-re xo-réšə do-yàla.| ʾíθ-wa
look.pst - l .3 ms under-head obl . dem . ms -boy exist - pst
xá-boy-ət xùwwe.| xá-xuwwe plíṭa táma bắye-Ø
one-hole-gen snake one-snake leave. ptcp there want-d .3 ms
nayǝ́s-Ø-le ṭla-brṓn dó fallā̀ ḥ.| ʾáyya ʾaqǝ́rwa
bite-d .3 ms - l .3 ms acc -son obl . dem . m peasant dem . fs scorpion
θìθa| gu-réšə do-bòya|
come. ptcp . fs in-head obl . dem . ms -hole
qṭìltə=l-le,| qəm-qaṭl-á-le ʾo-xùwwe.|
kill. ptcp . fs =cop .3 fs - l .3 ms pst -kill. prs - d .3 fs - l .3 ms dem . ms -snake
mǝ́r-e ʾən-ʾána haw-ǝ́n-wa qṭílə-lla
say.pst - l .3 ms if-I-be.irr - d .1 ms - pst kill. ptcp - l .3 fs
ʾăyáha ʾaqǝ̀rwa,| ʾáwwa xúwwe ṱ-awé-Ø-wa
dem . fs scorpion dem .3 ms snake fut -be.prs -d .3 ms - pst
qṭíla ṭla-bròn-i.| šqíl-le brōn-dìye|
kill. ptcp . ms . acc -son-my take.pst - l . ms son-obl .3 ms
zíl-le bɛ̀θa.|
go.pst - l .3 ms home
‘He put his hand on his son, his son had woken. He said “Ah, father, what
do you want?” He said “I don’t want anything. I have finished. Let’s go
home.” He looked under the head of the boy. There was a snake’s hole. A
snake had come out of there wanting to bite the son of that peasant. The
scorpion had come to the opening of the hole and killed it, it killed the
snake. He said “If I had killed that scorpion, that snake would have killed
my son”. He took his son and went home.’
Abbreviations

D = D-suffix (i. e. direct, nominative pronominal suffix)
L = L-suffix (i. e. oblique pronominal suffix)
350 Geoffrey Khan
References
Aloni, Oz. 2014. The Neo-Aramaic speaking Jewish community of Zakho: A survey of the
oral culture. Saarbrücken: Lap Lambert.
Arnold, Werner. 2002. Velarisiertes Ṛ und retroflexes Ɽ im neuostaramäischen Dialekt von
Baɽəṭle. Mediterranean Language Review 14. 1–8.
Borghero, Roberta. 2015. The present continuous in the Neo-Aramaic dialect of ʿAnkawa
and its areal and typological parallels. In Geoffrey Khan & Lidia Napiorkowska (eds.),
Neo-Aramaic and its linguistic context, 187–206. Piscataway, NJ: Gorgias Press.
Coghill, Eleanor. 2003. The Neo-Aramaic Dialect of Alqosh. Cambridge: University of
Cambridge dissertation.
Coghill, Eleanor. 2008. Some notable features in North-Eastern Neo-Aramaic dialects of
Iraq. In Geoffrey Khan (ed.), Neo-Aramaic Dialect Studies, 91–104. Piscataway, NJ:
Gorgias Press.
Coghill, Eleanor. 2010. Ditransitive constructions in the Neo-Aramaic dialect of Telkepe.
In Andreĭ Lʹvovich Malʹchukov, Martin Haspelmath & Bernard Comrie (eds.), Studies
in ditransitive constructions: A comparative handbook, 221–42. Berlin & Boston: De
Gruyter Mouton.
Coghill, Eleanor. 2016. The rise and fall of ergativity in Aramaic: Cycles of alignment
change. Oxford studies in diachronic and historical linguistics 21. Oxford: Oxford Uni-
versity Press.
Dixon, Robert M. W. 1994. Ergativity. Cambridge studies in linguistics 70. Cambridge:
Cambridge University Press.
Doron, Edit & Geoffrey Khan. 2012. The typology of morphological ergativity in Neo-Ar-
amaic. Lingua 122. 225–240.
Duval, Rubens. 1896. Notice sur les dialectes néo-araméens. Mémoires de la Société de
Linguistique de Paris 9. 125–135.
Fassberg, Steven. 2015. The origin of the periphrastic preterite Kəm/qam-Qāṭəlle in
North-Eastern Neo-Aramaic. In Geoffrey Khan & Lidia Napiorkowska (eds.), Neo-Ar-
amaic and its linguistic context, 172–186. Piscataway, NJ: Gorgias Press.
Fiey, Jean Maurice. 1965. Assyrie chrétienne: Contribution á l’étude de l’histoire et de
la géographie ecclésiastiques et monastiques du nord de l’Iraq, 2 vols. Recherches
subliées sous la direction de l’Institut de Lettres Orientales de Beyrouth, T. 22–23, 42.
Beyrouth: Imprimerie catholique.
Fox, Samuel E. 2008. North-Eastern Neo-Aramaic and the Middle Aramaic Dialects. In
Geoffrey Khan (ed.), Neo-Aramaic Dialect Studies, 1–18. Piscataway, NJ: Gorgias Press.
Gaunt, David. 2006. Massacres, resistance, protectors: Muslim-Christian relations in East-
ern Anatolia during World War I. Piscataway, NJ: Gorgias Press.
Göransson, Kathrin. 2015. Causative-inchoative alternation in North-Eastern Neo-Ara-
maic. In Geoffrey Khan & Lidia Napiorkowska (eds.), Neo-Aramaic and its linguistic
context. Piscataway, NJ: Gorgias Press.
Greenblatt, Jared R. 2011. The Jewish Neo-Aramaic dialect of Amadiya. Studies in Semitic
Languages and Linguistics, v. 61. Leiden & Boston: Brill.
Hoberman, Robert. 1989. Parameters of emphasis: Autosegmental analyses of pharyngeali-
zation in four languages. Journal of Afroasiatic Languages 2(1). 73–98.
Jastrow, Otto. 1990. Der arabische Dialekt der Juden von ʿAqra und Arbīl. Semitica Viva
5. Wiesbaden: Harrassowitz.
Kahn, Margaret. 1976. Borrowing and variation in a phonological description of Kurdish.
Michigan: University of Michigan dissertation.
Khan, Geoffrey. 1999. A Grammar of Neo-Aramaic: The dialect of the Jews of Arbel. Lei-
den & Boston: Brill.
Khan, Geoffrey. 2002a. The Neo-Aramaic dialect of Qaraqosh. Studies in Semitic Lan-
guages and Linguistics 36. Leiden & Boston: Brill.
Khan, Geoffrey. 2002b. The Neo-Aramaic dialect of the Jews of Rustaqa. In Werner Arnold
& Hartmut Bobzin (eds.), Sprich doch mit deinen Knechten Aramäisch, Wir verste-
hen es! 60 Beiträge zur Semitistik, Festschrift für Otto Jastrow zum 60. Geburtstag,
395–410. Wiesbaden: Harrassowitz.
Khan, Geoffrey. 2004. The Jewish Neo-Aramaic dialect of Sulemaniyya and Ḥalabja. Stud-
ies in Semitic Languages and Linguistics, 44. Leiden & Boston: Brill.
Khan, Geoffrey. 2007. The North-Eastern Neo-Aramaic dialects. Journal of Semitic Studies
52(1): 1–20.
Khan, Geoffrey. 2008. The Neo-Aramaic dialect of Barwar. 3 vols. Leiden & Boston: Brill.
Khan, Geoffrey. 2009. The Jewish Neo-Aramaic dialect of Sanandaj. Piscataway, NJ: Gor-
gias Press.
Khan, Geoffrey. 2012a. Remarks on the historical development of the copula in Neo-Ara-
maic. In Federico Corriente, Gregorio del Olmo Lete, Ángeles Vicente & Juan-Pablo
Vita (eds.), Dialectology of the Semitic languages. Proceedings of the IV meeting on
comparative Semitics Zaragoza 11/6–9/2010, 25–31. Sabadell: AUSA.
Khan, Geoffrey. 2012b. The evidential function of the perfect in North-Eastern Neo-Ara-
maic dialects. In Rebecca Hasselbach & Na’ama Pat-El (eds.), Language and nature.
Papers presented to John Huehnergard on the occasion of his 60th birthday, 219–228.
Studies in Ancient Oriental Civilization 67. Chicago: University of Chicago.
Khan, Geoffrey. 2013. Phonological emphasis in North-Eastern Neo-Aramaic. In Jean Léo
Léonard & Samia Naïm (eds.), Base articulatoire arrière. Backing and backness, 111–
132. Munich: Lincon Europa.
Khan, Geoffrey. 2015. Causative constructions in Neo-Aramaic (Christian Urmi dialect). In
Lutz Edzard (ed.), Arabic and Semitic linguistics contextualized: A Festschrift for Jan
Retsö, 506–530. Wiesbaden: Harrassowitz.
Press.
Khan, Geoffrey. 2017. Studies in the lexicon of Neo-Aramaic. In Yağmur Heffron, Adam
Stone & Martin Worthington (eds.), At the dawn of history. Ancient Near Eastern stud-
ies in honour of J.N. Postgate, 255–269. Winona Lake: Eisenbrauns.
Krotkoff, Georg. 1985. Studies in Neo-Aramaic lexicology. In Ann Kort & Scott Morschauer
(eds.), Biblical and related studies presented to Samuel Iwry, 123–134. Winona Lake:
Eisenbrauns.
Maclean, Arthur John. 1901. A dictionary of the dialects of vernacular Syriac as spoken
by the eastern Syrians of Kurdistan, northwest Persia, and the plain of Mosul. Oxford:
Clarendon Press.
Macuch, Rudolf. 1976. Geschichte der Spät- und Neusyrischen Literatur. Berlin & Boston:
De Gruyter Mouton.
352 Geoffrey Khan
Mann, Jacob. 1931. Texts and studies in Jewish history and literature. The Abraham and
Hannah Oppenheim Memorial Publications. Cincinnati: Hebrew Union College Press.
Mengozzi, Alessandro. 2002. Israel of Alqosh and Joseph of TelKepe. Corpus Scriptorum
Christianorum Orientalium. Louvain: Peeters.
Mengozzi, Alessandro & Emanuela Braida. 2011. Religious poetry in vernacular Syriac
from Northern Iraq (17th–20th centuries): An anthology, vol. v, 627–628. Corpus
Scriptorum Christianorum Orientalium. Louvain: Peeters.
Mole. 2015. The three /r/s of Baṛtəḷḷa. In Geoffrey Khan & Lidia Napiorkowska (eds.),
Neo-Aramaic and its linguistic context, 110–129. Piscataway, NJ: Gorgias Press.
Murre-van den Berg, Heleen L. 1999. From a spoken to a written language: The introduc-
tion and development of literary Urmia Aramaic in the nineteenth century. Leiden:
Nederlands Instituut voor het Nabije Oosten.
Mutzafi, Hezy. 2002. Barzani Jewish Neo-Aramaic and its dialects. Mediterranean Lan-
guage Review 14. 41–70.
Mutzafi, Hezy. 2004a. The Jewish Neo-Aramaic dialect of Koy Sanjaq (Iraqi Kurdistan).
Semitica Viva 32. Wiesbaden: Harrassowitz Verlag.
Mutzafi, Hezy. 2004b. Two Texts in Barzani Jewish Neo-Aramaic. Bulletin of the School of
Oriental and African Studies 67(1). 1–13.
Mutzafi, Hezy. 2004c. Features of the verbal system in the Christian Neo-Aramaic dia-
lect of Koy Sanjaq and their areal parallels. Journal of the American Oriental Society
124(2). 249–264.
Mutzafi, Hezy. 2008a. The Jewish Neo-Aramaic dialect of Betanure (province of Dihok).
Semitica Viva 43. Wiesbaden: Harrassowitz.
Mutzafi, Hezy. 2008b. Trans-Zab Jewish Neo-Aramaic. Bulletin of the School of Oriental
and African Studies 71(3). 409–431.
Mutzafi, Hezy. 2014. Jewish Neo-Aramaic. Encyclopedia of the Jews of the Islamic World.
Napiorkowska, Lidia. 2015a. A diachronic perspective on emphasis in Christian Diyana-Za-
riwaw. In Geoffrey Khan & Lidia Napiorkowska (eds.), Neo-Aramaic and its linguistic
context, 130–144. Piscataway, NJ: Gorgias Press.
Napiorkowska, Lidia. 2015b. A Grammar of the Christian Neo-Aramaic dialect of Diya-
na-Zariwaw. Leiden & Boston: Brill.
Odisho, Edward Y. 1988. The sound system of Modern Assyrian (Neo-Aramaic). Semitica
Viva 2. Wiesbaden: Harrassowitz.
Rockwell, William Walker. 1916. The pitiful plight of the Assyrian Christians in Persia and
Kurdistan described from the reports of eye-witnesses. New York: American Committee
for Armenian and Syrian Relief.
Sabar, Yona. 1976. Pəšaṭ Wayəhî BəŠallaḥ. A Neo-Aramaic Midrash on Beshallaḥ (Exodus).
Introduction, Phonetic Transcription, Translation, Notes and Glossary. Wiesbaden: Har-
rassowitz.
Sabar, Yona. 2002. Jewish Neo-Aramaic Dictionary. Semitica Viva 28. Wiesbaden: Har-
rassowitz.
Soane, Ely Bannister. 1912. To Mesopotamia and Kurdistan in disguise; with historical
notices of the Kurdish tribes and the Chaldeans of Kurdistan. London: J. Murray.
Socin, Albert. 1882. Die neu-aramäischen Dialekte von Urmia bis Mosul. Tübingen:
H. Laupp.
Talay, Shabo. 2001. Grammatikaliche Anmerkungen und Texte zum neuaramäischen Dia
lekt von Nerwa (Nordirak). Mediterranean Language Review 13. 1–37.
Tsereteli, Konstantin. 1977. Zur Frage der Klassifikation der neuaramäischen Dialekte.
Tsereteli, Konstantin. 1978. The Modern Assyrian language. Moscow: Nauka.
Tsereteli, Konstantin. 1990. The velar spirant Ġ in Modern East Aramaic dialects. In Wolf-
Scholars Press.
Watson, Janet. 1999. The directionality of emphasis spread in Arabic. Linguistic Inquiry
30(2). 289–300.
3.5. Iraq-Turkic
Christiane Bulut
1. The Turkman of Iraq
The Turcophone populations of Iraq call themselves Turkman, and their language
Turkmanja.1 The Turkman and their language Turkmanja, or Iraq-Turkic, must be
distinguished from the Turkmen language, part of the Northern Oghuz group (see
the classification in Table 1 in Bulut, this volume, chapter 4.2, §1), spoken in the
Republic of Turkmenistan, in Northeast Iran and in North Afghanistan. Iraq-Turkic
is spoken by groups sandwiched between the Kurdish regions to the north and east
and Arabic-speaking areas to the south. The speech communities are situated in a
number of separate areas, villages and towns within a belt stretching from Tal‘afer
in the northwest to Ba'adra in the southeast (see Fig. 1).2 Most larger cities, such as
Kirkuk and Arbil, have a mixed, mainly Turkic-Kurdish population. An important
group of Turkman is also found in Baghdad. The migration in recent years, which
was caused by the Arabicization policy of the Baath-Party and subsequent wars
and civil wars, has considerably changed the ethnic map of Iraq. Moreover, many
villages and traditional environments of the Turkman, such as the old quarters
within the citadel of Kirkuk, have been destroyed in recent years.
Established in the aftermath of WWI, Iraq is one of the new nation states of
the Middle East whose borders cut through the traditional areas of settlement of
various ethnic groups, its multi-ethnic and multi-lingual character partly also
reflecting its Ottoman or Iranian heritage. The Turkman of Iraq form the third
largest community in Iraq, after Arabs and Kurds; yet, they have been denied the
official status of a minority. Due to the repressive policy against the minorities
that commenced with the foundation of the new nation state Iraq and was enforced
under the regime of the Baath Party after 1968, it is difficult to give precise popula-
tion figures. In the late 1970s Buluç (1980) estimated that about 750,000 Turkman
were living in Iraq. Based on various statistics from pre-colonial data and the last
official census conducted in Iraq in 1957, Fischer (1993) arrived at an estimated
number of 600,000 for the year 1989, which then amounted to 3.3 % of the total
population of Iraq (18.27 million in 1989). This would imply that of today’s total
1
Like other Turkic-speaking communities of the larger region, the Turks of Iraq pre-
viously used the ethnonym Turk, while referring to their language as Turki; in 1959
the military junta introduced the names Turkman and Turkmanja, which nowadays are
widely accepted among representatives of the speech community.
2
This Turkman belt is sometimes called “Turkman Eli”.
Iraq-Turkic 355
population of 31.13 million, roughly one million and thirty thousand belong to the
Turkman minority.3
Nowadays Iraq is a predominantly Muslim country, with about 65 % Shiites
and 32 % Sunnis, and one may assume roughly the same distribution of confes-
sions among the Turkmans. After the collapse of the Baath regime, though, a
number of religious minorities, Sufi orders or sects began to re-surface, one of
which is the Bektashiyye. Recent sources claim that this Shii-Alevi Sufi order has
between 250,000 to 700,000 followers in the Turkman Belt, with centers in the
cities of Mosul, Tal‘afer, Kirkuk, Tuzkhurmatı and Arbil.4
1.1. Historical background

Iraq displays many features characteristic of a transitional area, a fact that also
shaped the prevailing linguistic situation. After the delimitation of the sphere of
influence of the Ottoman and the Safavids during the 16th century, major exchanges
of populations still occurred in Mesopotamia, which remained part of a disputed
border region. Whenever they could gain the upper hand, both Ottomans and Safa-
vids had reliable tribes moved into the area. Shah Ismail relocated a considera-
ble number of Shii tribesmen from the region of Maraghe in Azerbaijan to Iraq.
Under Sultan Murad IV (1623–1640) and other Ottoman rulers, Anatolian Turks
were settled in this region.5 Strategic settlement continued during the 18th century,
when Nader Shah (1733–1743) had Shii Turkic tribes brought into the region. At
the same time, the Ottomans tried to fortify the Turkic stronghold that lay behind
the notoriously unreliable Kurdish principalities by relocating sympathetic set-
tlers from Anatolia in the area. Much of the peculiar crescent-shaped form of the
Turkman belt coincides with the environment of the Great Trunk Road from Mosul
to Baghdad, which implies strategic movements of local populations in Ottoman
times. Toponyms such as Shahseven (i. e. the confederation which formed the mil-
itary backbone of the Qajar state in the 18th and 19th centuries) indicate that, in
later times as well, Turkic tribes adhering to the Shii denomination settled in Iraq.
For the Turkic dialects of Iraq these movements of populations indicate that
there was constant influence both from the West and the East, which continued
after the formation of the most important written Turkic languages of the region,
Ottoman Turkish and Azeri Turkic. It also explains the fact that the map of
3
Recent sources from Azerbaijan and Turkey mention much higher figures; Pashayev
(2003: 25) claims that there are about 2.5 to 2.6 million Turkmans living in Iraq, while
Turkish media even talk about 3 to 3.5 million. According to a more recent source, the
population figure must be at least 2,500,000 (Saatçi 2018: 332).
4
Source: the Turkish newspaper ZAMAN, 21 August, 2011.
5
See Marufoğlu (1998: 57).
356
Christiane Bulut
Figure 1: Locations of the Iraq-Turkic varieties in this chapter

Iraq-Turkic 357
Iraq-Turkic dialects is a patchwork rather than a continuum—which is, of course,

also true for Anatolian and Iran-Turkic dialects.

Three major languages are spoken in the so-called Turkman belt: Iraq-Arabic, till
recently the official language for all of Iraq; Kurdish, which due to the develop-
ments on the political scene has gained increasing importance during the last two
decades and consequently became the second official language in June 2004; and
Iraq-Turkic, which is not used for official purposes. The prestige the latter enjoys is
relatively low;6 thus, it is rather unusual for native speakers of Arabic or Kurdish to
learn Turkic. Most speakers of Iraq-Turkic, who are at the bottom of this hierarchy,
are tri-lingual in Arabic, Kurdish and Turkic. Due to a cultural orientation towards
Turkey, many also have active command of Turkish of Turkey; additionally, the
influence of Turkish media (especially programmes broadcast on satellite TV) has
steadily grown during the last twenty years, and a number of private schools, with
teaching in Turkish and backed by Turkish institutions, also operate in the region.
Thus, diglossia in Iraq-Turkic and Turkish of Turkey is a widespread phenomenon.
With the displacement of parts of the Iraqî Turkman population during the Gulf
Wars and the civil war in Iraq, different constellations of intra-Turkman diglos-
sia have also arisen. Nowadays, many speakers of Turkman speak a mixed idio-
lect displaying various regional features. Despite its low prestige, Iraq-Turkic is
also used as a regional language of communication by smaller minorities, such as
the Armenians around Mosul and Neo-Aramaic speaking Christians. Iraq-Turkic
never developed a written standard of its own. The works on history or folklore, or
literary texts comprising short stories and very rarely even novels which appeared
in Iraq and in Turkey during the last fifty years are written in Standard Turkish—
either in Arabic, Kurdicized Arabic or Latin script.7 In 1997, the Iraqi Turkman
Congress adopted a Declaration of Principles, Article Three of which states that:
‘The official written language of the Turkman is Istanbul Turkish, and its alphabet
is the new Latin alphabet.’8
Narrative texts representing the spoken language are rare: Two older collections
of folklore texts, the doctoral theses by Choban Khıdır Haydar (1979) and Hussin
Shahbaz Hassan (1979), have remained unpublished. Buluç (1973/74), who also
wrote some articles on the local varieties of Mandali (1975) and Khanaqin (1979),
6
Yet Iraq-Turkic ranks higher in prestige than smaller minority languages such as Arme-
nian, Gorani, and Neo-Aramaic (see Haig, this volume, chapter 3.3, and Khan, this
volume, chapter 3.4, for the latter two).
7
For a survey on modern Turkman literature see al-Bayati (1970).
8
Türkmeneli İşbirliği ve Kültür Vakfı. “Declaration of Principles of the Turkman Con-
gress”. http://www.iraqiturkman.org.tr/turkmen27.html (accessed 25 November 2011).
358 Christiane Bulut
has published a small sample text of Iraq-Turkic from the Tal‘afer region. Differ-
ent written versions exist of the most famous folklore text, the romantic novel Arzı
ile Qamber, which the Kirkuk lawyer Ata Terzibashı recorded in the 1960s.
2. Linguistic features
Iraq-Turkic, or Iraqi Turkman are collective terms for a number of transitional

dialects and regional varieties of predominantly West-, South- or Central-Oghuz
character, which the local populations nowadays call Turkmanja. Being closely
related to Iran-Turkic or South-Azerbaijanian, Iraq-Turkic also shares a consid-
erable amount of features with Anatolian-Turkish; consequently, characteristic
isoglosses demonstrate affiliations both with the eastern and western neighbours.
The following description will present characteristics of Iraq-Turkic classi-
fied according to the linguistic subsystems of phonology, morphology, syntax and
lexicon. With regard to their supposed origin, items pertaining to these subsys-
tems will be categorized as (i) genuine Turkic inventory; as (ii) so-called “areal
features”, indicating phenomena which appear throughout genetically unrelated
languages in a given area; or as (iii) language contact-induced developments
which reflect the interaction between structurally different languages. Moreover,
cross-references indicate the similarities between the spoken varieties of Turkic in
the historical area of East Anatolia, West Iran and Iraq.
Two short sample texts are provided at the end of this chapter, which illus-
trate some of the linguistic items in context; Text 1 was recorded forty years ago
with an old gentleman in Tuzkhurmatı, a predominantly Shii settlement 55 miles
south of Kirkuk.9 The informant who provided Text 2 was born in 1961 in Yengi
Tis‘în Mahallesi in Kirkuk; he holds a university degree, and is fluent in Iraqi
Turkman, Turkish, Arabic and Sorani Kurdish. In the following, quotations from
these sample texts are referenced by a number indicating the text and number of
the respective sentence. When referring to other recordings, I will just indicate the
place of origin. In addition, the folklore materials mentioned in Section 1.2 were
consulted; examples from these texts are quoted with the abbreviation (CKH 1979:
number of page/line) for Choban Khıdır Haydar (1979) and (HSH 1979: number
of text/line) for Hussin Shahbaz Hassan (1979).
9
I am grateful to Suphi Saatçi, who shared some of his older recordings of Iraqi Turkman
with me.
Iraq-Turkic 359
2.1 Phonology
2.1.1. Consonant inventory

With regard to morphology and lexicon, Iraqi Turkman dialects are quite similar
to each other; yet, they differ considerably with regard to phonology. The most
important differentiating items are the realization of the old velar /ŋ/ (an Intra-Tur-
kic isophone) and the so-called ‘fronting’ in the articulation of certain stops and
affricates (a phonological isogloss in the region).10
East-West division of Iraq-Turkic dialects – realisation of velar /ŋ/ > [y] or /ŋ/ >
[w] as an intra-Turkic isophone
Vertically the Iraqi Turkman varieties can be divided into two groups, using the
realization of the old velar /ŋ/ in the possessive suffixes and the possessive type
of the conjugation of the 2nd person singular and plural as a morpho-phonological
indicator. Around Arbil, and basically also in Kirkuk, the old velar nasal /ŋ/ has
developed into [w] or [u]; these varieties belong to the so-called w-group. Most
dialects in the west of the Turkman belt, on the other hand, display a development
of velar /ŋ/ > [j], representing the y-group. Thus, the pronunciation of the posses-
sive of the 2nd person singular is seniy ‘yours’, and qiziy ‘your daughter’ in dialects
of the y-group, and senuw ‘yours’ qizuw ‘your daughter’ in the w-group. Due to
the mixed background of the population in bigger cities such as Kirkuk, both types
of pronunciation may also exist parallel. The speaker from Kirkuk who produced
Text 2, for instance, uses forms pertaining to the w-group, such as [geʃdoʊ] ‘you
(2 sg ) went through’, and [thʊtsaʊ] ‘if you (2sg ) hold’. Tuzkhurmatı belongs to the
y-group, as forms such as [ɑlardij] ‘you should take’ (1/10) or [gøndɛrdij] ‘you
have sent’ demonstrate.
In Iraq-Turkic, some lexical items with irregular distribution also display a
development of old velar nasal /ŋ/ into [w] or [u], see examples such as: soŋra,
sõra ‘later’ <*soŋıra, gewil ‘heart’<*göŋül, yalγuz ‘alone, lonely’ <*yalıŋus, and
[øgʏndɛ] ‘in front of’ (2/9) < ög ‘front’ <*öŋ. Across the varieties of the area some
few items have consequently preserved the velar /ŋ/ as [ŋ] or even [ŋg], see, for
instance, pronunciations of ‘wool’ as [yŋ], [yŋg], [jyŋ], [ıŋ].
10
Stilo (1994) describes this phonological development for Azerbaijan and Transcauca-
sia; see also Haig, this volume, chapter 3.3, and Khan, this volume, chapter 3.4 for
similar phenomena in Kurdish and Neo-Aramaic.
North-South division of Iraq-Turkic dialects -‘Fronting’ in the articulation of stops

and affricates as an areal isophone
In this process, the velar stops /k/ and /g/ are palatalized to [kj] and [gj], or, in the
extreme case, affricated to [ʤ] or [ʧ], while the original affricates /ʤ/ and /ʧ/ may
be fronted to [dz] and [ʦ/tç] etc. Thirdly, the place of the former velar stops /k/ and
/g/ may be taken by the post-velar or uvular stops, which also shift farther to the
front; thus /q/ > [k] and /ɢ/ > [g] also occur.11
Fronting is more predominant in the north-eastern dialects around Arbil, and
less explicit towards the South. In the city of Kirkuk, where populations of different
origins mix, it is present to a lesser extent. The sample Text 2 displays a relatively
strong tendency to fronting, which is characteristic of the neighborhood of Yengi
Tis‘în. While /g/ is mostly palatalized (/g/ > [g, gj, j])—but, to my experience, never
affricated to [ʤ]—, /k/ displays palatalized (/k/ > [kj]) and additional affricated
allophones (/k/ > [kç, ç]) as in [kçɛrkjʏç] ‘Kirkuk’ (2/1), or [kçɛrɛm] ‘instance, time’
(2/6). Consequently the unvoiced affricate /č/ has a fronted allophone [ts] (/č/ > [ts]),
as in [tsoɣ] ‘very, much’ (2/1), [ˈoxʊtsɨ] ‘singer’ (2/6), or [gɛtsɛr] ‘it passes’ (2/7).
The voiced affricate /ĵ/ is pronounced [dz] in most instances (/ĵ/ > [dz]), see exam-
ples such as [gedzɛ] ‘night‘ (2/2). Counterexamples, such as [iʧɛn] ‘drinking’
(2/4), and [ɛglɛnʤɛ] ‘entertainment’ (2/3) obviously reflect the omnipresent influ-
ence of Turkish; note that the same speaker may also produce fronted variants in
connection with the same lexical items, as, for instance, in [æglɛndza] (2/7).
The text from Tuzkhurmatı, on the other hand, does not display fronting—nor
do the adjacent Iran Turkic varieties. Thus, Kirkuk seems to establish the south-
ernmost point of the fronting area. Note also that fronting is a so-called areal phe-
nomenon which occurs in the pronunciation of genetically unrelated languages in
North Iraq, SE-Anatolia and West Iran (see Haig, this volume, chapter 3.3, §3.1.1).
In varieties of Arabic spoken in Iraq, the word for ‘lawyer’ vakil, for instance,
is sometimes pronounced as [wætçil], the Kurdish numerals [ʧɒhɒr] ‘four’ and
[pɛnʤ] ‘five’ appear as [ʦɒːr], and [penʣ] etc. (see Haig, this volume, chapter 3.3,
§3.1.1 on fronting in Iraqi varieties of Kurdish).
Note that back [q] and [ɢ] do not take part in the fronting process, see examples
such as [qɑja] ‘rock’ (1/2), or [qæræ] ‘black’. The velar fricative ġayn [γ], which in
Turkic appears in word-internal/-final position in combination with velar vowels,
is preserved. Word-final /-k/ in some instances becomes [-x]: yemax ‘food’, but
it also has the voiced allophone [γ], in forms such as [jɛmɑγ]. Unvoiced stops in
11
As the phonology of Iraq-Turkic is heavily influenced by the surrounding languages
where /q/, /k/, /γ/ and /g/ are different phonemes, one cannot interpret these elements as
allophones of the Turkic phoneme /k/ (which they would be according to the Turkolog-
ical tradition).
Iraq-Turkic 361
word final position are sometimes pronounced as unaspirated, see [bøjʏg̊] < böyük
‘large’.
2.1.1.1. Copied elements

Pronunciation of ‘ayn [ʕ], bilabial [w] and the pharyngeal fricative [ħ]
Characteristic dialect markers of Iraqi Turkman and Kurdish varieties from Iraq
are copied or contact-induced pronunciations of Semitic phonemes, such as the
retracted tongue root glottal stop ‘ayn [ʕ], the pharyngeal fricative [ħ], the bilabial
[w], and the uvular plosive [q], see examples such as [saʕɑd̥] ‘time’ (2/7) or [jaʕne]
‘that is’ (2/11), [sɑjjɑħ] ‘traveler’ (1/5), [aħmæd] nom. prop. (1/8), or [hær ħalda]
‘usually’ (2/4), [wɑxɪd] ‘time’ (1/10), and [tæqriːßan] ‘roughly, approximately’
(2/8). Text 2 demonstrates that most of these items appear in combination with
lexical items loaned from Arabic, for which speakers from Kirkuk tend to chose
the Arabic pronunciation.
In other varieties of Iraq-Turkic, the distribution of the copied items is not
always true to the original; speakers may pronounce ‘ayn instead of a normal
glottal stop (hamza), while the pharyngeal fricative [ħ] replaces etymological [h].
This also happens in connection with Kurdish or Turkic roots. A special case,
though, is the Turkic lexeme hepsi ‘all’, which speakers from Kirkuk always pro-
nounce as [ħapsi], with word-initial [ħ-].
As the ‘borrowed’ phonetic items do not establish phonemes, the degree of
their realization may vary. In the region around Kirkuk, where the influence of
Arabic is stronger, the pharyngeal fricative [ħ] and the retracted tongue root glottal
stop ‘ayn [ʕ] are more clearly pronounced than in the dialects around Arbil in the
northern parts of the Turkman belt.
2.1.1.2. Unstable word-initial segments

In Arabic, empty syllable onsets are not tolerated, and there is a similar tendency
in Kurdish, leading to the insertion of prothetic h-elements (pharyngeal or glottal)
in etymologically vowel-initial words. This rule of the contact languages coin-
cides with a Turkic tendency across the whole area to pronounce an additional [h-]
before word-initial vowels. Yet, Turkic items such as his ‘soot’ (cf. Turkish: is),
and hörmaγ ‘knit’ (Turkish: örmek), may have preserved an older [h-].12 In loan-
words like helbet ‘naturally’ or hešye ‘things, belongings’ (a plural of the Arabic
šay ‘thing’), on the other hand, the secondary or prothetic nature of [h] is obvious.
Another irregular phonotactic process—which, in varying distribution, also
occurs in the dialects of eastern Anatolia and western Iran—is the elision of the
12
On word-initial h- in Turkic see Doerfer (1981/1982).
word-initial /y-/ before the vowels /ı/, /i/ and /ü/.13 In words such as ilan ‘snake’,
il ‘year’, (corresponding to Turkish yılan, yıl), the word-initial semi-vowel [y]
and the following central-back unrounded vowel /ı/ are replaced by [i-], while igit
‘hero’ (corresponding to Turkish yiğit) has obviously lost its initial /y-/. Similarly,
word-initial /yü-/ may be present as /ü-/, as in üz ‘face’ (but yüz ‘hundred’), üng
‘wool’, üskek ‘high’, or üzük ‘ring’ (compare Turkish yüz, yün, yüksek, yüzük).14
As word-initial sound groups such as yı-, yi-, or yü- are alien to the Iranian
contact languages, the reason for this substitution may again be language contact
influence. On the other hand, some dialects have preserved older forms like yıγla-
max ‘to weep’, which elsewhere became aγlamak. Moreover, prothetic [y-] also
occurs, in forms such as yê:v ‘house’ for ev, or yeriš- ‘to reach’ for eriš- etc.
2.1.2. Vowel inventory

2.1.2.1. The Turkic vowel inventory
The normal Turkic vowel system consists of eight (+ 1) vowel phonemes, each of
which can appear in the stem of Turkic words. They are symmetrically distributed:
two unrounded front vowels /i/ and /e/, and two unrounded back vowels /ı/ and /a/.
Each unrounded vowel phoneme corresponds to a rounded vowel in the same posi-
tion, namely /ü/, /ö/, /u/ and /o/. The preservation of the ninth vowel, the so-called
closed /ê/, is characteristic of the whole area, e. g. the form [jeːv] ‘house’. As a
rounded allophone of /a/, [ɒ] mainly appears in the vicinity of labialized conso-
nants such as /b/ and /v/ [v, w], as in words such as [ʔarwɒd] < avrat ‘woman’
(2/10) or [bɒba] ‘father’ (1/8). In most varieties of the area Iraq, West Iran/Azer-
baijan and East Anatolia, certain words, such as, for instance, böyük [bøjʏg] ‘large’
(2/6), and gözäl [gɶzɛl] ‘nice, beautiful’ (2/1) have preserved the older vowel /ö/,
which has developed into /ü/ in Turkish.
2.1.2.2. Copied items in the vowel inventory

In addition to the Turkic vowel inventory, most Turkic varieties of the area have
adopted new vowel qualities after the model of Iranian or Semitic languages. Short
/a/ in Arabic and Persian loanwords is represented by front half-open to open [a,
æ], as in examples such as [mæθælæn] < maṯalan ‘for instance, [baʕas] < baʕż
‘some’ (2/2 and 2/3) or [bæs] < bas ‘enough, however’.
In Turkic stems [æ] is in some instances an allophone of /e/, see examples
such as [dɛlɪg] ‘hole’ (1/3), but also [dælɪg] (1/6); yet, as in the neighboring
13
Most of the lemmata quoted here begin with y- in Old or Middle Turkic.
14
Word-initial /u-/ corresponding to /yu-/, on the other hand, is very rare; HSH (1979:
438) lists a single example: ut- ‘to swallow’, corresponding to Turkish yut-.
Iraq-Turkic 363
Iran-Turkic dialects, examples for [æ] representing /a/—even in combination

with uvular consonants such as /q/—also appear, see, for instance, [qæræ] < qara
‘black’.15
On the whole, Iranicization of the vowel system and the resulting disturbances
of vowel harmony (as the main principle of Turkic phonotactics) are less explicit
than in the adjacent dialects of Iran.
2.1.2.3. Delabialization of the Turkic front rounded vowels /ü/ and /ö/ as a contact-
induced phenomenon
In the southern varieties of Mandali and Khanaqin, the front rounded vowels /ü/
and /ö/ are substituted by their unrounded counterparts [i] and [e] or [ie], see exam-
ples such as /ü/ > [i, ı] in [sit] < süt ‘milk’, and /ö/ > [Ieː, eː] in [Ieːz] < öz ‘self,
own’ or [dek-] < dök- ‘to pour’. Delabialization of /ü/ and /ö/ is a widespread phe-
nomenon, which points to the influence of neighbouring Semitic and Iranian lan-
guages. Thus, across the Turkman belt, East Anatolia and Iran speakers may also
centralize or delabialize /ü/ > [u] ~ [ı] and /ö/ > [o] ~ [e] due to the interference of
these contact languages.
2.1.3. Phonotactic processes, assimilations

2.1.3.1. Shortening or elision of vowels: an Intra-Turkic phonological process
Shortening or even complete elision of the high/close vowel in the first syllable
occurs, if this vowel is surrounded by two of the followings sounds: stops, such as
[kh] and [th], which mostly display aspiration as a secondary feature, the unvoiced
affricate [ʧ], the alveolar fricatives /s/, /ʃ/ or /z/, or the velar fricative [x]. Compare
[ʧxandæ] or [ʧxar] (both (1/1), which represent contracted forms of the verb stem
čıx- ‘to come out’. Shortening or elision of high vowels in a word-initial syllable
occurs across the Turkic varieties from North Azerbaijan, Iraq, and West Iran.16
15
Across the area, an assimilation of the vowel quality to the famous ‘nine consonants’ –
as in Ottoman Turkish – does not occur. These ‘nine (Arabic) consonants’ are: the phar-
yngeal fricative /ḥ/ and the glottal stop ‘ayn /ʕ/, the pharyngealized stops /ḍ/ and /ṭ/, and
the fricatives /ṣ/ and /ẓ/, the uvular plosive /q/ and the fricatives /ġ/ and /ḫ/.
16
The phenomenon obviously is related to the internal structure of the syllable: unvoiced/
fortis consonants, especially when combined with aspiration, are longer than voiced,
that is, in most instances, lenis consonants. To avoid overlong syllables, the fortes seem
to acquire or borrow some of the length of the vowel, which consequently becomes
shorter.
2.1.3.2. Metathesis and rhotacism

Throughout the area, different forms of metathesis, lambdacism or rhotacism appear.
Consequently, stems and morphemes containing /l/ or /r/ are most susceptible to
various kinds of metathesis or assimilations (see below, Morpho-phonology). In most
East Anatolian, Iranian and Iraq-Turkic dialects, the word topraq ‘ground, soil’, for
instance, is pronounced [torpax] or [torpaγ], compare [thorpɑɣə] ‘its soil’ in (1/2).
Moreover, the consonant group /-lq-/ is pronounced as [x], both in Turkic or copied
lemmata, see /-lq-/ > [x]: in [qax] < qalq- ‘get up’, or [xax] < xalq ‘people’ (1/4).
2.1.3.3. Regressive assimilation /b/ > [m]

Throughout the Turkic dialects of Iraq and Iran, nasals trigger assimilation of the
labial consonant [b-]. This type of assimilation affects the pronoun of the first
person singular, which is ben in older Turkic, but in most instances [mæn] or [mɛn]
in Iraq and Iran. Accordingly, the dative and the accusative of the demonstra-
tive pronoun bu may have alternative forms such as [muna] instead of buna and
[munu] instead of bunu. Other examples for this type of assimilation are [mınʤɨɣ]
‘pearl’ < boncuq and [mın-] ‘to mount (a horse etc.)’ < bin-.
2.2. Morpho-phonology: assimilations at morpheme borders

Progressive assimilation is very frequent. As a rule, suffix-initial /l-/ assimilates
to stem-final nasal, (/-nl-/ > [-nn-]). The abstract suffix {+lIK}, for instance,
appears as [+nIK]; e. g. [hormannıγ]—with additional h-prothesis—for ormanlıq
‘a wooded place’. Similarly, /-nd-/ is assimilated to [-nn-]; the ablative morpheme
changes from {+DAn} to [+nAn]. Accordingly, the ablative of the demonstrative
pronoun is pronounced [munnan] < bundan. The locative suffix, however, remains
{+DA} across Iraq-Turkic varieties. Regressive assimilation of word-final [-r] to
the [l-] of the plural suffix {-lAr} is also characteristic of Turkic dialects of the
area. Thus, for instance, olurlar ‘they become’, is pronounced [olullar] in most
Iranian and Iraq-Turkic dialects.
2.3. Morphology
2.3.1. Case morphology

Case morphology displays the typical southwest Oghuz characteristics that appear
across Turkish and Azeri or Iran-Turkic dialects. Genitive, dative and locative
{+(n)In, +(y)A, +DA} show no conspicuous features. Like in Iran-Turkic or Azeri,
the possessive accusative is {+n}, in contrast to Turkish {+nI}. Assimilation of
the ablative has been described above. The instrumental case, which is {+(I)nAn}
Iraq-Turkic 365
in most East Anatolian and Iran-Turkic varieties, has the form {+(y)dan}. In the
equative case {+cA} the pronominal [n] does not appear, e. g. in forms such as
[ardıʤa] ‘following/behind him’ (< back:poss .3 sg . equ , confer the corresponding
Turkish form with -n-: ardınca); an exception is the equative of the demonstrative
pronouns, see [ondza] ‘that much’, Text (2/11).
2.3.2. Verb morphology

2.3.2.1. Focal present and aorist17
Focal present is functionally similar to a present progressive, while what is tra-
ditionally termed “aorist” in Turkology is a form covering iterative, habitual, or
potential action. Basically, Iraq-Turkic reflects the so-called Azeri type of present
paradigms, which it shares with other Turkic varieties from Iran/Azerbaijan, and
some eastern Anatolian dialects: The focal present is based on a morpheme {-Ir};
thus, it differs from the aorist in {-Ar} only with regard to its high/close vowel. In
practice, though, many of the positive forms of present and aorist have effectively
merged. As a rule, monosyllabic stems ending in vowels display the same forms
in the aorist and the present paradigm—with the exception of the 3rd person singu-
lar, which may be marked by {-(I)r} and an additional element {-i} for the focal
present, and simply by {-i} for the aorist. Secondly, stem-final liquids /-r/ or /-l/
tend to merge with the /r/ of (mainly, but not exclusively) the aorist marker, while
the distinctive vowel is dropped, see, for instance, forms such as värräm (< ? ver-
ir-äm or ver-är-äm) in sample Text (1/9, 1/11).
Thirdly, the distinctive vowel may not be pronounced due to a general tendency
to shorten or omit the vowel of middle syllables. Moreover, individual speakers
may use different variants of the same aorist or present form, plus copied para-
digms from Turkish. Consequently, it is difficult to predict the correct forms and,
in some instances, also to analyze them as either present or aorist.
Text 2, for instance, refers to customs and traditions, and one may assume that
it is aorist-based. Characteristically, the text displays a mixture of different forms,
namely the regular form of the aorist of the 3rd person singular: gäč-är ‘pass’ (2/7),
but also a special variant of the aorist of the third person singular (verb stem +
{-i}), found in (2/6) gäl-i ‘comes’ (≈ gäl-är), and (2/9) yıγıš-i ‘come together’ (≈
yıγıš-ar). The full form of the third person plural aorist occurs in: oturullar (2/5) (<
otur-ur-lar) ‘sit.pl ’, tsalallar (2/5) (< čal-ar-lar) ‘play.pl (music)’, and baxallar
(2/10) (< bax-ar-lar) ‘watch.pl ’; in other variants of the same form, though, the
syllable containing the differentiating vowel is missing, confer: düzällär (2/4) (<
düzäl-är-lär) ‘prepare.pl ’, gätillär (2/4) (< gätir-är-lär) ‘bring.pl ’, gällär (2/7) (<
gäl-är-lär) ‘come.pl ’, and oturlär (2/7) (< otur-ar-lar) ‘sit.pl ’. The third singular
17
The following examples are provided in a normalized orthography.
qal-ır (2/4) ‘remains’ and the 3rd plural oturullar (2/5) (< otur-ur-lar) ‘sit.pl ’ seem
to present forms of the focal present.
The overlaps in the forms, and the fact that the meaning of both present types
is not always kept neatly apart (one may find present were one expects aorist, and
vice versa), creates the impression that either a focal present is not fully devel-
oped, or that the two have merged into one. This may be due to the language
contact situation, as the surrounding languages (Iranian, Semitic) just have one
morphologically simple present. The negative forms of present and aorist, on the
other hand, are morphologically distinct, see, for instance, the negative forms of
the first person singular of the focal present ged{-mıräm} ‘I don‘t go (now)’, vs.
the aorist ged{-mäm} ‘I never go’.
2.3.2.2. A non-evidential perfect

As in the adjacent varieties of Iran and Anatolia, the perfect displays a mixed
paradigm based on the old morpheme {-mIš}, and the ‘new’ perfect marker in
{-(y)Ib}, which goes back to the gerund in {-(y)Ib} and the copula {+DIr} < durur
‘stand.3 sg ’.18 The most frequent paradigm consists of {-mIš} for first persons and
{-Ip} for second and third persons, or {-mIš} in the first and second persons and
{-Ip} for third persons, while Altunköprü, for instance, displays the new perfect
marker in {-Ip} in all persons. In contrast to the Turkish perfect in {-mIš}, the
perfect occurring in the area (East Anatolia, West-Iran/Azerbaijan and Iraq) is
strictly resultative and has no marked inferential or evidential qualities.
2.3.2.3. The imperative of the second person singular in {-ginen}

As in most Turkic languages, the bare verbal stem functions as the imperative
of the second person singular. Combinations with an additional suffix {-ginen},
which appear across the dialects of the area under observation, express a some-
what milder or more polite form of request or demand.
2.3.2.4. Potential and impossibility: {-abil-}, {-abilme-}

As is characteristic of the area, the potential verb form {-abil-} has the negation
{-abilme-}, e. g. gäl-äbil-mä-m (1/9) ‘I will not be able to come.’
18
According to Bodrogligeti (1968: 30), the complex forms in {-(y)IbDIr} may have
been inspired by the model of the Persian perfect, which—in contrast to the Turkic
zero-morpheme—does apply an explicit morphological element to mark the 3rd person
singular on the surface. Thus, the complex form gälübtur ‘has come’ in the glossary of
the Esfahan anonymous, for instance, would be a structural copy of the Persian âmade
ast (come:passive participle + copula 3sg ).
Iraq-Turkic 367
2.4. Morpho-syntactic elements
2.4.1. The question particle {mI}

In the entire area of eastern Anatolia, western Iran and Iraq, the question particle
{mI} does not appear. Questions are marked instead by rising intonation and a
lengthening of the last syllable—the so-called drawl.
2.4.2. Non-finite verbal forms in dependent clauses

While most non-finite verbal forms have disappeared in the varieties spoken in
Iran, Iraqi Turkman displays a rich inventory of gerunds, verbal nouns and partici-
ples—which, of course, has important implications for the structure of dependent
clauses.
The inventory of participles and verbal nouns comprises: the present parti-
ciple in {-(y)En}, which, like in Iran and Azeri-Turkic may function as a nomen
agentis, see, for instance išlä-yän ‘worker’, or bâlıxtut-an ‘fisherman’; the aorist
participles in {-r/-mEz}, appearing in lexicalized units exclusively; the perfect
participle in {-mIš}; the so-called future participle in {-(y)EcEK}; and the verbal
nouns in {-DIK} and {-mEK}.19 The participle in {-(y)En} and the verbal noun
in {-DIK} also form the basis of complex temporal gerunds (see below, chapter
on syntax) The most frequent gerund markers are: {-yE}, {-(y)Ip}, {-EnI < -ElI},
{-(y)IncE}, {aorist+ -kEn}, and {-dIKčE}.
2.4.3. An additional pronominal paradigm based on bilä

As in Iran Turkic, Azeri and certain East Anatolian varieties, the pronominal
system consists of the analytic pronouns (sg : män, sän, o; pl : biz, siz, olar), while
reflexive and emphatic pronouns are based on the stem öz ‘self’ (+poss + case ).
Moreover, Turkic varieties of Iraq and West Iran display a third pronominal par-
adigm based on the stem bilä, which always combines with possessive and case
markers. Thus, bilä-sin-ä (< bilä + poss .3 sg + dat ), in a given context, would
mean ‘to him/her’. The examples (1a-b), taken from sample Text 1, are identical
in meaning but differ with regard to the pronominal form: In (1a), the pronominal
form consists of the personal pronoun of the second person singular in the dative
case (sänä = you.dat ). In (1b), the pronominal form consists of the stem bilä, {-y}
or {-w} for the poss 2 sg , and the dative marker.
19
In the northeastern varieties of Iraq-Turkic, the latter has a unique form in {-maγ}.
(1) a. (1/9)
para värräm sänä
money give.aor .1 sg 2 sg . dat
b. (1/11)
para värräm bilä-y-ä
money give.aor .1 sg bilä-poss .2 sg - dat
‘I will give you money’
2.4.4. Copied morpho-syntactic units

2.4.4.1. The specifier in {+AkA}
The copying of bound morphemes is generally taken as an indicator for far-reach-
ing processes of language contact-induced change. One such loaned element is
the specifier in {+AkA}, which combines with noun stems and means ‘the afore-
said, the one already mentioned in the context’ and may also be used to mark the
head of a restrictive relative clauses, see example (2) below. {+AkA} is a global
copy of the Kurdish or, originally, Gorani definite article (cf. Haig, this volume,
chapter 3.3), from where it must have spread as far as Sonqor Turkic and the
Turkic varieties of the Îl-e Qashqâ’î in Southwest Iran (see Bulut, this volume,
chapter 4.2). The copied element is well integrated into Turkic noun morphology.
It is attached directly to the stem, precedes the case suffixes, and is even subject
to palatal harmony:
(2) (HSH 1979: 10/13)
o oγlanakâ ki mämläkät xâxıydı …
that boy.spec conj home right.poss .3 sg . cop . pst .3 sg
‘The boy whose territorial right it was …’ (cf. Turkish: memleket hakkı
olan oγlan …)
2.4.4.2. Global copies of Iranian enclitic personal pronouns

In spoken varieties of Modern Persian and in Southern Kurdish, clitic pronouns are
extremely frequent. Consequently the Turkic varieties that are in intensive contact
with these languages, such as, for instance, southern varieties of Iraqi Turkmen
and in Sonqori reflect copies of Iranian clitics. These clitics can be attached to the
verb after the personal ending, representing the dative, as in allam-it ‘I will buy
(for) you’ in example (3), or the direct object, in example (4).20
20
See Buluç (1975: 183), on the dialect of Mandali.
Iraq-Turkic 369
(3) almæ âlläm=it

apple buy.opt =2sg .dat
‘I will buy you an apple/apples.’
(4) yäyipti=šan
eat.pf .3sg =3pl .acc
‘he has eaten them’
In a construction appearing in the Turkic varieties from Iraq and Central Iran,
Turkic possessive suffixes have replicated structural properties of Iranian pronom-
inal clitics. The Iranian clitic pronouns are frequently used to index a benefactive
or recipient, as in the following example from Sorani Kurdish: hata mumkīn=mān
abē ‘if possible=1 pl may.be’, i. e. ‘if it may be possible for us’, where the clitic
first person plural pronoun indicates a benefactive (MacKenzie 1961). Especially
in combination with the borrowed Arabic noun lâzım ‘necessary’ (which are
applied to paraphrase the verb ‘to need’), Turkic possessive suffixes appear in a
position of equivalence to such Iranian pronominal clitics, and denote the needer/
wanter. Thus the Turkic phrases such as lâzım-ım (dır) ‘I need’ in example (5)
below, or lazım-mız ‘we need’ appear to replicate Central Kurdish21 for ‘I want’,
garak=(i)m-a (‘necessary=1 sg is’), see MacKenzie (1961: 105), where (interest-
ingly) the clitic combines with a copy of the Turkic adjective gerek ‘necessary’.
(5) (HSH 1979: 10/72)
källäsi lâzım-ım dı.
head.poss .3 sg necessary-poss .1 sg cop .3 sg
I need his head. (literally: ‘his head is necessary for me’)
2.4.4.3. {-iš}
An element {-iš} can be attached to finite verbs in the Mandali variety. Buluç
(1972=1975: 182) gives various examples for combinations of {-iš} with verb
forms in the imperative, the aorist, the past in {-DI}, and perfect in {-mIš}. The
suffix {-iš} denotes an intensive mode of the respective verb forms, in the sense of
‘to do something with certainty’, e. g. ôxi! ‘read!’, ôxıš! ‘make sure that you read’.
In all probability, {-iš} is a copy of the Kurdish clitic {-iš} ‘also, even’ [Kurmanjî:
jī], which, in Central Kurdish (cf. MacKenzie 1961: 128) may be suffixed to either
a nominal or a verbal form.
21
Modern Persian, on the other hand, expresses ‘to need’ with a formation based on lâzem
‘necessary’ and the verb for ‘to have’, dâran; ‘I need’ would thus be translated as lâzem
dâram, literally ‘I have necessary’.
2.4.4.4. Comparative in {+tar} and superlative {+tarin}

The Kurdish morphemes {+tar} ‘more …’ and {+tarin} ‘most …’ may appear in
combinations with Turkic nouns, e. g. forms such as irax-tär ‘farther’ and čox-
tærin ‘most’, where the Turkic stem čox ‘much, many’ combines with the Iranian
superlative morpheme {-tarin}.22
2.4.5. Copied word order properties

2.4.5.1. The dative/directive in postverbal position
Northern Kurdish has a twofold strategy to mark an object to which the action
expressed in the verb is directed. As a morphological marker, this object takes
the oblique case; syntactically, it is marked by the postverbal position it occupies
within the sentence, cf. the following two constructed examples from Northern
Kurdish (see also Haig, this volume, chapter 2.3):
(6) a. Ez pere dıdım te.
I money give.prs .1sg you.obl
‘I give you money.’
b. Kur-ê mun dıçe mektebê.
son-ez my go.prs .3sg school.obl
‘My son goes to school.’
Central Kurdish also regularly places goals after the verb, but there is no oblique
case in Central Kurdish, hence no case marking of the goal. Often, however, the
goal is marked through a preposition, e. g. wazīr kāɣazī bird bō kuř ‘the vizier
took the letter to the boy’ (MacKenzie 1962: 54). Colloquial Modern Persian,
another potential contact language, may also use the postverbal position to indi-
cate the directive. According to the norms of the standard language, directives
should be introduced by the preposition be ‘to’, compare: (Be) maktab mîravam
(to school I.go). However, in colloquial Persian the form mîram maktab (I.go
school) is very widespread. Like Kurdish, Iraq-Turkic applies both case marking
(dative) and word order to indicate directive, cf. (1) above. This kind of double
coding is redundant: In contrast to Kurdish, where the oblique case both denotes
the direct and the indirect object, the indirect object sänä/biläyä in example (1)
above is explicitly specified by its dative case. Consequently, the additional syn-
tactic constraint—postverbal position of the dative—has to be interpreted as a
copy of structural features from the contact languages.
22
I observed these copies of Iranian comparative and superlative morphemes in recent
recordings with bi-lingual Turkic-Kurdish speakers from the Arbil region; as they exist
alongside with the regular Turkic particles daha ‘more’ and en ‘most’, it is difficult to
judge to what extent they are conventionalized.
Iraq-Turkic 371
The same word order rule applies to verbal complexes paraphrasing ‘to begin
to …’. The finite form of the aspect verb bašla- ‘to begin’ combines with a lexical
verb in the form of a non-finite or “nominalized” verb form (the verbal noun in
{-mEK}, glossed as vn 1, in distinction to other verbal nouns, vn 2), which is
marked by case (dative), and, additionally, by its postverbal position, as in (7):
(7) (HSH 1979: 2/33)
ħâmsı bašlâdılar gülmäγä.
all begin.pst .3 pl laugh.vn 1. dat
‘They all began to laugh.’
2.4.5.2. Structural copies of ezafe constructions

With regard to word order properties, Turkic genitive-possessive constructions
with the structure {head/possessor:gen —modifier/possessum:poss } are diamet-
rically opposed to the Iranian or Semitic type, where the possessum precedes the
possessor {possessum—possessor}. Some Iranian languages additionally connect
possessum and possessor by a binding element, the so-called ezâfe. Bi-lingual Tur-
kic-Kurdish speakers sometimes combine Turkic lexemes using Iranian structural
properties, as in: âd-e xästäxâne (name-ez hospital), for Turkic xastaxâne-nin âdı
(hopital. gen name.poss ) ‘the name of the hospital’.
2.5. Syntax
To express dependent clauses most Turkic languages use non-finite verb forms. As
a rule, adverbial clauses are based on gerunds, while nominalized verb forms such
as verbal nouns and participles are used to express relative clauses or complement
clauses.23 Turkic varieties under strong influence of Indo-European or Iranian (or,
to a lesser extent, also Semitic) languages have abandoned these principles of
Turkic syntax. The dependent clause is based on a finite verb form. It may be con-
nected to the main clause by a conjunction, which expresses the semantic relation
between main clause and dependent clause.
2.5.1. Dependent clauses following Turkic principles of nominalization

While Iran Turkic varieties have nearly completely restructured their strategies of
clause combining according to Iranian models, East Anatolian varieties nowadays
display most of the non-finite subordinate verbal forms used in Standard Turkish.
The position of Iraq-Turkic lies somewhere in between. Non-finite subordinate
forms and replicated finite patterns are both used with roughly equal frequency.
23
See Johanson (1990: 199–200).
It should be noted, however, that the functional distribution of non-finite sub-

ordinate forms appearing in Turkish and in Iraq-Turkic in some cases different.
The verbal noun in {-DIK} (glossed with vn 2), for instance, displays most of the
syntagmatic and functional properties it has in Turkish. As a non-finite subordi-
nate verbal form it constitutes the basis of both absolutive and attributive relative
clauses. In absolutive position {-DIK} may combine with possessive suffixes as
agent markers, and additional case suffixes. In combinations with the instrumen-
tal {+instr }, though, it forms complex gerunds which are characteristic of Iraq.
On the other hand, the verbal noun does not appear in Turkic-style complement
clauses.
2.5.1.1. Temporal clauses

Besides gerunds in {-yE}, {-(y)Ip}, {-EnI (< -ElI)}, {-(y)IncE}, {aorist+ -kEn},
and {-dIKčE}, morphologically complex forms based on the present participle
in {-(y)En} and the verbal nouns in {-DIK} and in {-mEK} function as temporal
gerunds.
In combination with the suffix in {+deg},24 the present participle in {-(y)En}
indicates simultaneousness, as, for instance, in: İrân’ın ičinä gir-än-däg (Iran-gen
into enter-part . equ ) ‘As we entered Iran …’ (HSH 1979: 17/27). Across the greater
area of East Anatolia, Iran and Iraq, complex gerunds derive from the so-called
present participle in {-(y)An}, which combines with the case suffixes of the loca-
tive in {+DA}, or the ablative in {+DAn} and the postposition sonra ‘later’. The
resulting forms {-(y)Anda} and {-(y)AnNAn (sonra} express temporal relations
of simultaneousness (‘as, when’) or posteriority (‘after’); see, for instance (gün)
čıx-an-da (come out-part - loc , Text 1/1) ‘when the sun rises’, or män gäl-än-nän
sora (come:- part - abl after) ‘after I came’ (HSH 1979: 18/73). In example (8)
below, the aspect of simultaneousness (already contained in gid-än-dä) is stressed
by an additional expression of time, halat ‘situation, stage’, which syntactically
functions as the head of a relative clause:
(8) (Own field notes)
mäktäb-e gid-än halæd-da, (…) bašla-rık Arabca
school- dat go- part situation- loc begin- aor .1 pl Arabic
oqu-maγ-a.
study.vn 1. dat
‘When we go to school, we begin to study Arabic.’
The verbal nouns in {-mEK} (vn 1) and {-DIK} (vn 2) combine with possessive
suffixes (as agent marker) and the instrumental case in {+(y)dan (< *ilen)} or the
24
Probably the equative suffix; the postposition deg/değin ‘up to’ would require a dative
on the head.
Iraq-Turkic 373
postposition birlä ‘with’ to form temporal clauses, such as, for instance čal-mâγ-
ı-ydan (set.in.motion-vn 1- poss 3 sg - instr ) ‘as soon as he pressed …’, oyna-mâγ-ı
birlä (play-vn 1- poss 3 sg with) ‘while he was still playing’, or ye-diğ-i-yden (eat-
vn 2- poss 3 sg - instr ) ‘as soon as (they) ate, in example (9) below:
(9) (HSH 1979: 3/141)

otı yediğiydän o qardäšlärı här
greens.acc eat. vn 2. poss .3 sg . instr dem her.brothers all
altısı ceyran oldular,
six gazelle become.pst .3 pl
‘As soon as they ate the greens, all her six brothers became gazelles, …’
2.5.1.2. Relative clauses

Iraq-Turkic relative clauses based on non-finite/nominalized verb forms, such as
the present participle in {-(y)En} and the verbal noun in {-DIK}, resemble those
of Turkish. In example (10) below, the verbal noun appears in absolutive position,
fully marked with possessive suffix encoding the agent and case suffix anchoring
it in the matrix clause:
(10) (CKH 1979: 124/143)
Get bu de-digim-i eläginän!
go.imp this say-vn 2. poss .1 sg - acc do. imp
‘Go and do what I told you!’
It may also be part of a genitive-possessive construction with the agent of the sub-
ordinate form in the genitive, as in example (11):
(11) (HSH 1979: 10/50)
bunların dedigläri ‘aynän köskünde bir
they.gen say.vn 2- poss .3 pl just_like breast.poss .3 sg . loc a
hä:ritä kimin etti
map like do.pst .3 sg
‘In his heart he took notes like a map of the things they had said’.
Example (12) demonstrates that the subordinate form in {-(y)En} has preserved its
verbal potential to take direct objects:
(12) (CKH 1979: 37/15)
Bu Allah’ın rähmätin istiyäni belä taptadı, …
this God.gen mercy.poss .3 sg . acc want.part . acc so beat_up.
‘So much did he beat the one who had asked for God’s mercy, …’
The subjunctors in {-(y)En} and in {-DIK} can also be attributed to a head;

{-(y)En} signals co-reference of the head and the agent of the non-finite subjunc-
tor (or its possessor), see example no. (13):
(13) (HSH 1979: 18/76)
Adam var öz sulbunnan änän evlatta
man exist own seed.poss .3 sg . abl derive.part offspring.loc
munca ħürmät yôxtu
this_much respect not.exists
‘There are men who don’t see this much respect in their own offspring.’24
If the head co-refers with the object (Acc, Dat etc.) of the relative clause, a verbal
noun in {-DIK} is used, as in (14):
(14) (HSH 1979: 10/134)
Bä:li, ‘ayni tânı-dıγ-uv adam-am.
Indeed same know-vn 2. poss .2 sg man.cop .1 sg
‘Indeed, I am just the person you know.’
Examples (10) to (14) demonstrate that Turkmanja applies Turkic structures ren-
dering relativization (pre-positive relative clauses based on non-finite subordinate
forms) that are structurally very close to their Turkish counterparts.
2.5.2. Syntactic patterns imitating Indo-European strategies of clause

combining
2.5.2.1. Right-branching relative clauses
Relativization may also follow Indo-European patterns, with post-posed relative
clauses based on finite verb forms. Restrictive relative clauses often copy Iranian
types; frequently the head is introduced by a demonstrative pronoun, the under-
lying pattern being {dem + head (+ spec ) + conj }, see examples (2) and (15).
Eventually, copied markers of specificity may be attached to the head, such as, for
instance, the Kurdish unit {+Aka} in example (2).
(15) Yerišdıx o tâcir-e ki bizi
reach.pst .1 pl that merchant-dat conj 1 pl . acc
göndärmıšdı Kermânšâna.25
send.plup .3 sg Kermanshah.dat
‘We reached again the merchant who had sent us to Kermanshah.’
25
Literally: ‘in whose offspring coming from their (‘his‘) own seed is not this much of
respect.’
26
My transcription after the original recording; see also HSH 1979: 17/40.
Iraq-Turkic 375
2.5.2.2. Temporal clauses with the conjunctor ki

A structurally similar pattern forms temporal clauses. It may consist of a demon-
strative pronoun introducing a noun, an optional marker of specificity, and the con-
junction ki, i.e: {(dem ) + noun (+ spec ) + conj }. In spoken Persian and Kurdish,
this pattern is frequently applied to indicate a temporal relation between a main
clause and subordinate clause:
(16) (HSH 1979: 5/216)
Oγlan ki qâpı sä:si ešitti vêrdi
Boy conj door sound.poss .3 sg hear.pst .3 sg give.pst .3 sg
fizä:hi.
cry
‘As soon as the boy heard the sound of the door, he cried out:’
2.5.2.3. Complement clauses

As a rule, Turkic-style complement clauses are based on non-finite verb forms,
which are syntactically the complement of verba dicendi et sentiendi, such as, for
instance, ‘to say’, ‘to hear’, ‘to see,’ or ‘to know’. Turkmanja, though, uses com-
plement clauses containing a finite verb form, which may be introduced by the
loaned conjunction ki ‘that’, see examples no. (17), (18) and (19) below:
(17) (CKH 1979: 1/8)
Ešitti ki Sultân Murâd zuhur_edip, …
hear.pst .3 sg conj Sultan Murad appear.ger
‘She heard that Sultan Murad had appeared, …’
(18) (CKH 1979: 37/9)
Nä deyipsiz seleyin!
What say.pf .2 pl say. imp .2 pl
‘So tell (us) what you (Pl.) said!’
(19) (HSH 1979: 5/89)
Bildi ki bu qızčı ediri bu iši.
learn.pst .3 sg conj this girl.posp do. prs .3 sg this task.acc
‘He (the Jew) understood that he (the young man) does it for the girl’s sake.’
2.5.2.4. Analytic expressions of modality

The notion of finality (intention or purpose/aim to which the action is directed) can
be expressed by non-finite subordinate forms, such as, for instance, a verbal noun
in {-mAQ} combined with the postposition için, see (20, where it is reduced to
-čı), where the postposition and the verbal noun have undergone fusion:
(20) (HSH 1979: 12/48)

qäːrri salılla’ bunu öːldürmäːči
wife send.prs .3 pl he.acc kill.vn 1. posp
‘They send a woman to kill him.’
More frequently used, though, is a construction imitating Iranian models, where a
finite verb in the optative/subjunctive, as in example (21) below, or in the impera-
tive, as in example (22) below, replaces the non-finite subordinate form:
(21) (1/8)
Gälmäzsiz mäni aparasız o däligä?
come.aor . neg .2 pl me bring.opt .2 pl dem hole.dat
“Won’t you (2pl ) come along to guide me to this cave?”
(22) (Tuzkhurmatı)
Bu kärvän yixti sirħät etsin.
this caravan stop.pst .3 sg rest make. imp .3 sg
‘The caravan stopped to rest.’
The optative/subjunctive or the imperative are typically used in a dependent verb
after impersonal modal expressions, such as gäräk or lâzım ‘(it is) necessary, one
has to’, as in gäräk äl degmäsin (necessary hand touch:imp . neg .3 sg ) ‘his hand
must not touch’, and after modal verbs, such as istä- ‘to want’, or qoy- ‘to let’ (see
also Bulut 2000).
The possibility/impossibility suffix in {-Ebil-} ‘can, to be able to’, and
{-Ebilme-} ‘cannot, not be able to’, which in Turkish attach directly to the main
verb, may sometimes be carried by a dummy verb et- ‘to make/do’, distinct from
the main verb. In this manner, the modal expression is split into two parts, the
dummy verb plus modal suffix, e. g. edäbil- ‘can, to be able to’, and a lexical
verb in the optative/subjunctive or imperative. Thus, instead of a compact form
expressing inability, such as, for instance, yiγla-yabil-mir (weep-imposs - prs .3 sg ),
the following is found: ed-äbilm-iri yiγla-sın (make-pot . neg - prs .3 sg weep.imp -
3 sg ) ‘he cannot weep’.
2.6. Lexicon
2.6.1. Turkic items

Turkic vocabulary displays a mixture of distinctively Azeri and Turkish features,
reflecting the transitional situation between Anatolian and Azerbaijanian/West
Iranian dialects.27 Some lexical items overlap, such as the eastern dal ‘back’ and
27
The distribution of isoglosses is irregular; note that many Azeri isoglosses also appear
in Anatolian dialects.
Iraq-Turkic 377
its western counterpart arka, both of which are found in Iraq. Others have a mark-
edly western flavour, such as the verb sev- ‘love’, and eyi ‘good, nicely’ against
the eastern yaxčı. Yet, Iraq-Turkic shares a large proportion of its lexicon with
more eastern, that is: Iran-Turkic, varieties: adverbs, such as bu tay ‘this side’,
qanšär ‘opposite side’, harda/harada ‘where’, indi ‘now’, bıtov ‘all, completely’,
kimi(n) ‘like, till’; nouns, such as bulaγ ‘spring, well’, bala ‘child’, toy ‘wedding’,
küräkän ‘bridegroom’, nänä ‘mother’, ušaγ ‘child’, ämcäk ‘breast’, and verbs,
such as apar- ‘bring away’, axtar- ‘seek’, tap- ‘find’, and išlä- ‘work’. Pronominal
forms based on the stem bilä- are characteristic of the area of Iraq and West Iran.28
In some instances, the same Turkic root appears with different derivation suf-
fixes, such as suvar- (like in Iran-Turkic), instead of the western sula- ‘irrigate’.
Exceptional is the formation of to ‘to speak together’: While Turkish uses a recip-
rocal form of kon- ‘put’, viz. konuş-, Azeri varieties display the same derivation
based on a different stem, danıš-; in Iraq-Turkic the basic stem is sele-/söyle- ‘to
speak’; seleš- is a typical Iraq-Turkic isogloss. Due to the influence of Standard
Turkish, there are also a number of recent loans of Modern Turkish origin, of
which Bayatlı (1996: 407) has presented a survey. In general, such loans are more
frequent in the written language.
2.6.2. Foreign items

With regard to nouns and adverbs, the lexicon of Iraq-Turkic displays a relatively
high number of loans or calques of the Arabic and Kurdish contact languages;
compared to East Anatolian dialects, there is also more Persian vocabulary. Loaned
nouns are integrated into the Turkic morpho-syntax by means of Turkic possessive
suffixes and/or case morphology, see, for instance, complex forms such as nizik-i-
nda ‘in its neighborhood’, where a the Kurdish loaned noun nizik combines with
the Turkic possessive 3sg suffix and the locative suffix.
Most compound verbs are a mixture of loaned and Turkic material. They typ-
ically consist of a loaned nominal element and a calque of the light verb by its
Turkic equivalent, e. g. ħäz et- ‘love, like’, xılas elä- ‘to finish’ and the corre-
sponding Kurdish compound verbs ħez kırın or xılas kırın. Another strategy for
the formation of secondary verbs is the combination with the Turkic denominal
verb suffix {+lA}. A great number of verbs originating from such formations are
characteristic of Iraqi Turkman, as, for instance, färäħlä- ‘to be glad’, derived of
the Arabic noun faraħ ‘joy, gladness.
Characteristic idiomatic expressions are found across different languages of
the area, e. g. nänädän ol- (mother:A bl become) ‘to be born’, cf. Behdinî Kurdish
ji dayik būn ‘from mother become’, or yadımdan čıxdı (remembrance:poss .1 sg .
abl go out:pst .3 sg ) ‘I do not remember’.
28
On pronominal systems of the area see Bulut (2003).
Sample texts in a narrow (not strictly phonemic) transcription
Sample text 1: Tuzkhurmatı

(1/1) irɑn janɨnda bʏr dɑɣ wɒr, gyn
iran side.poss .3 sg . loc indef mountain exists sun
ʧxandæ dɑɣɨn ʏstʏnnɛn ʧxar.
rise.ger mountain. gen above rise.aor .3 sg
‘There is a mountain close to (the border of) Iran, and when the sun rises
it rises above that mountain.’
(1/2) bʊ dɑːɣda olan qɑjalær bajas qɑjaˬdeː,
this mountain.loc be. part rock. pl white rock. cop
ɑːltɨ da gøːk thorpɑɣə.
down.poss .3 sg too green soil. cop .3 sg
‘The rocks that are on this mountain are white, with green soil underneath.’
(1/3) O dɑɣɨn ɪʧɪndɛ dɛ bɪr dɛlɪg wɑr, khɪmsɛ
that mountain inside also a hole exists nobody
girmɛz ɪʧɪnɛ.
go inside.aor . neg .3 sg inside
‘Inside this mountain is a hole/cave which nobody enters.’
(1/4) Nɛ bılsən xax nɛ wɑr o dɛlɪg ɪʧɪndɛ?
what know.imp .3 sg people what exists that hole inside
‘So how should people know what is inside this cave?’
(1/5) bʏr gyn bɪr æfɛ̃ndi jældeː dʕuzxurmɑtə
one day a gentleman come.pst .3 sg Tuzkhurmatı
dolanɨrdə, sɑjjɑħ bɪr
walk around.impf .3 sg traveler a
‘One day a gentleman came and walked around Tuzkhurmatı, a traveler,’
(1/6) æfɛ̃ndi dedɪ: “bʊrda bʊ dælɪg næː ˬde?” dedlɛ̥r:
gentleman said here this whole what.cop .3 sg say.pst .3 pl
“wɒlla ßɒwa, o qaraʔʊl dɑɣəˬde dɪʔɛllɛr,
by God, father, that Qaravul mountain.3 sg . cop say.aor .3 pl
‘and he said: “What is this hole over there?” They answered: “By God,
good man, people say that this is Mount Qaravul,
Iraq-Turkic 379
(1/7) o dɑːɣda bɪr dælɪgˬdʏr, qurd

that mountain. loc a hole. cop .3 sg wolf
dælɪgəˬdɪ.” æfɛ̃ndi dede: “Gælmɛssɪz mɛnɪ
hole.poss .3 sg -is gentleman said come.neg . aor .2 pl me
aparasəz o dɛlɪgɛ?”
bring.opt .3 pl that hole.dat
and this is a cave in the mountain, a wolf’s hole.” The gentleman said:
“Won’t you (2pl ) come along to guide me to this cave?”’
(1/8) khɪmsɛ ʤɔwab vermɛdə. ßʏr gyŋ gældə
no_one answer give.neg . pst .3 sg one day came
dedɪ: “Gælmæssɛnəz̥ aħmæd bɒba, jæ, sæn
said come.neg . cond .2 pl Ahmad father yeah you
mɛnɪ apar o dælɪgɛ!”
me bring.imp .2sg that hole.dat
‘No one answered. One day he came and said: “If you (2pl ) don’t come
along, Ahmad, my man, yeah, you will bring me to this cave!”’
(1/9) mɛn dɛ dedɪm øznæ: “wɒlla mɛm ɪʃim
I also say.pst .1 sg himself.dat by God 1 sg . gen work.poss .1 sg
wɑr, gælɛbɪlmæːm!” dedə: “para vɛrrɛm
exists come.imposs . aor .1 sg say.pst .3 sg money give.aor .1 sg
sɛnɛ.”
you.dat
‘I said to him: “Honestly, I have work to do, I can’t come!” He said: “I will
give you money.”’
(1/10) dedɪm: “eːj, ʧox eːj! para verɪrsɛ, tɛz
say.pst .1 sg Eh very good money give.cond .3 sg quickly
ɑlardij ja, o wɑxɪd̥.”
take.aor . pst .2 sg yeah that time
‘I said (to myself): “Excellent, if he offers you money, you should have
accepted quickly that time.”’
(1/11) “næ verɪm sɛnɛ? b ɪr dænɛ vɛrrɛm
What give.vol .1 sg you.dat one piece give.aor .1 sg
bɪlɛjɛ, b ɪr sɑrɨ lira, osmanlɨ lirasə.”
you.dat one yellow Lira Ottoman Lira.poss .3 sg
‘“And what do I intend to give you? I will give you one, one golden Lira,
an Ottoman Lira.”’
Sample text 2: Kirkuk

(2/1) Kçɛrkjʏçdɛ toj tsoɣ gɶzɛl ʃejlɛr ˬdə, bıs toj …
kirkuk.loc wedding very nice thing.pl . cop we wedding
‘There were many beautiful things at a (traditional) wedding in Kirkuk,
we …’
(2/2) Toj ħalatɨndæ, mæθælæn tojdɛn bır
wedding condition.poss .3 sg . loc for instance wedding.abl one
gedzɛ ʔawwal beʃ gyn dʼ olʊrdı
night early five day also be.impf .3 sg
jɑdɨma gjælleː
mind.poss .1 sg . dat come.pst .3 sg
‘When there was a wedding, the first night before the wedding — now I
remember that it could also be five days (earlier) —’
(2/3) nɛdzɛ, ɛvlɛnmɑx gynʏnnɛn bʏr gyn
however marrying day.poss .3 sg . abl one day
ʔawwal gedzæse ɛglɛnʤɛ ʃiklındɛ
early night.poss .3 sg entertainment form.poss .3 sg . loc
olʊrdə, baʕas.
be.impf .3 sg some
ʻhowever, the night one day before the marriage day mostly was
(celebrated) in the form of a feast.’
(2/4) mæθælæn baʕas ʔajləlɛrɛ ˈqɑlɨr iʒgi
for_instance some family.pl . dat remain.prs .3 sg alcohol
iʧɛn iʒgi dʏzɛllɛr, tsɑlɨxʧə gjætıllɛr (…).
drink.part drink prepare.aor .3 pl musician bring. aor .3 pl
‘Usually it is up to the (respective) families, so, for instance, those who are
drinking alcohol will offer drinks, and they invite musicians.’
(2/5) otʊrʊllar hær ħalda bɑxtsadaː, jænʼ hɑʔʊʃdæ, tsɑlallar.
sit.prs .3 pl usually garden.loc that.is yard.loc play.aor .3 pl
‘Usually they sit in the garden or the yard and play.’
Iraq-Turkic 381
(2/6) bıʼ kçɛrɛm ˈgɶzɛl ˈoxʊtsɨlar gjælıʼ, bøjʏg

for once good singer.pl come.aor .3 sg great
oxʊtsəlarɨməsdan, ʕabdıl wɑħıd deː, gɶzɛl gɶzɛl
singer.pl . poss .1 pl . abl ‘Abdu’l-Wâhid also, very good
oxʊtsəlarɨməs wɑr.
singer.pl . poss .1 pl exists
‘Sometimes there are good singers, some of our great singers, such as
‘Abdu’l-Wâhid29 — we have some really fine musicians.’
(2/7) gjællɛ̜r otʊrlar, saʕɑd̥ bırˬıkjı jɛdze
come sit.aor .3 pl time one_or_two night
gɶzɛl bır æglɛndza wɑxɨd gɛtsɛr (…).
nicely a entertainment time pass.aor .3 sg
‘People come and sit down, and the entertainment will nicely last till one
or two at night.’
(2/8) Bæs ındı bʊ zamɑn daː tæqriːßan bo
however now this time also roughly this
ˈʔɑdɛt ondza qɑlˈmadə.
custom that.much remain.neg . pst .3 sg
ʻHowever, this tradition has not remained exactly the same.ʼ
(2/9) Toj gynʏnnɛn bʏr gyn ʔawwæl geˈdzæsə
wedding day.poss .3 sg . abl one day early night.poss .3 sg
bʏr gɶzɛl qɑpɨ øgʏndɛː ɨʃ xlar
ɨ
one nicely door front.poss .3 sg . loc light.pl

jɑnar, aqraba qonʃʊlar jiɣıʃeː.
burn.aor .3 sg relatives neighbor.pl come.together.aor .3 sg
ʻNow, one lights the lamps in front of the door, and neighbors and friends
gather there the night before the wedding day.ʼ
(2/10) Hær_ħalda tsoxlʊx ɛrkɛk qɨsmə, ʔarwɒd qɨsmə
usually majority man folk.poss .3 sg woman folk.poss .3 sg
eːvlɛrdan bɑxallar, dɑmlardan bɑxallar.
house.pl . abl watch.aor .3 pl roof.pl . abl watch.aor .3 pl
‘Usually, it’s menfolk (down there), while the women are watching from
the roofs or the houses.’
29
Probably Abdülvahid Küzecioğlu (born 1925 in Mousallah, a Turkman quarter inside
the walls of Kirkuk Castle, died 29th of June 2007).
(2/11) jaʕneː, bısdɛ wɑr taxriːbæn jɑ ondza qɨs

that is 1pl . loc exists approximately yeah that.much girl
qɨsmɛ, ʔarwɒd̥ qɨsmɛ ɛrkɛklɛrdɛn ʃej
folk.poss .3 sg woman folk.poss .3 sg man. pl . abl thing
edmɛz, jaʕnı gırıʃmɛz ondza ojnda
do. neg . aor .3 sg that is enter. recipr . aor .3 sg that.much dance. loc
ʃejdɛ.
thing. loc
‘That’s roughly what things are like with us: Girls and women’s folk
don’t have business with men, they don’t mingle with all this dancing
and so on.’
Abbreviations
abl Ablative neg Negation

acc Accusative obl Oblique
aor Aorist opt Optative
cond Conditional part Participle
conj Conjunction pf Perfective
cop Copula pl Plural
dat Dative plup Pluperfective
dem Demonstrative posp Postposition
ez Ezafe poss Possessive
gen Genitive pot Potential
ger Gerund pst Past
imp Imperative recipr Reciprocal
impf Imperfective sg Singular
imposs Impossible spec Specificity
indef Indefinite vn 1 Verbal noun (Infinitive)
instr Instrumental vn 2 Verbal noun (nominalized participle)
loc Locative vol Voluntative
References
al-Bayati, Mehdi. 1970. Die Anfänge der Prosaliteratur bei den Irak-Türkmenen. Mainz:
Mainz University Dissertation.
Bayatlı, Hidayet Kemal. 1996. Irak Türkmen Türkçesi [The Turkmen Turkic of Iraq]. 664.
Ankara: Atatürk Kültür, Dil ve Tarih Yüksek Kurumu Türk Dil Kurumu Yayınları
Bodrogligeti, A. 1968. On the Turkish Vocabulary of the Iṣfahan Anonymous. Acta Orient.
Hung. Tomus XXI. 15–43.
Buluç, Sadettin. 1973/74. Tellafer Türkçesi Üzerine [On Tellafer Turkic]. Türk Dili Araştır-
maları Yıllığı Belleten, 1973–74, 49–57. Ankara: TDK.
Iraq-Turkic 383
Buluç, Sadettin. 1975. Mendeli (Irak) Ağzının Özellikleri [The particularities of the Men-
deli (Iraq) dialect]. I. Türk Dili Bilimsel Kurultayına Sunulan Bildirler 1972, 181–183.
Ankara.
Buluç, Sadettin. 1979. Hanekin (Irak) Türk Ağzı Üzerine [On the Turkic dialect of Hanekin
(Iraq)]. I. Milletler Arası Türkoloji Kongresi, İstanbul 1973, 600–603.
Buluç, Sadettin. 1980. Irak Türk Ağızlarının Bazı Ses Özellikleri [Some phonological par-
ticularities of the Iraq Turkic dialects]. I. Milli Türkoloji Kongresi Tebliğler, İstanbul
1978. 54/1–54/4. İstanbul: Kervan Yayınları.
Bulut, Christiane. 1999. Klassifikatorische Merkmale des Iraktürkischen. Orientalia Sue-
cana, vol. 48. 5–27.
Bulut, Christiane. 2000. Optative constructions in Iraqi-Turkman. In Aslı Göksel & Celia
Kerslake (eds.), Studies on Turkish and Turkic Languages. Proceedings of the IXth In-
ternational Conference on Turkish Linguistics, Lincoln College, Oxford, August 12–14,
1998 (Turcologica 46.), 161–169. Wiesbaden: Harrassowitz.
Bulut, Christiane. 2003. Pronominal systems in the transitional varieties of eastern Anato-
lia, Iraq and Western Iran. In Sumry Özsoy (ed.), Proceedings of the Tenth International
Conference on Turkish Linguistics, 321–335. Istanbul: Boğaziçi Yayınları.
Bulut, Christiane. 2016. Convergence and variation in the Turkic varieties of Iran: Exam-
ples from Qashqâ’î. In Éva Á. Csató, Lars Johanson, András Róna-Tas, & Bo Utas
(eds.), Turks and Iranians: Interactions in Language and History. Proceedings of the
Gunnar Jarring Conference, Uppsala 2006. (Turcologica 105), 235–282. Wiesbaden:
Harrassowitz.
Doerfer, Gerhard. 1981. Materialien zu türkisch h- (I). Ural-Altaische Jahrbücher. Neue
Folge. Bd. 1. 93–141.
Doerfer, Gerhard. 1982. Materialien zu türkisch h- (II). Ural-Altaische Jahrbücher. Neue
Folge. Bd. 2: 138–168.
Doerfer, Gerhard. 2006. Irano-Turkic. In Lars Johanson & Christiane Bulut (eds.), Tur-
kic-Iranian Contact Areas. (Turcologica 62), 93–113. Wiesbaden: Harrassowitz,
Fischer, Reinhard. 1993. Die Türkmenen im Irak. Berlin: Freie Universität Berlin, unpub-
lished MA Thesis.
Hassan, Hussin Shahbaz. 1979. Kerkük Ağzı [The dialect of Kirkuk]. Istanbul: İstanbul
Üniversitesi Edebiyat Fakültesi Türk Dili ve Edebiyat Bölümü dissertation.
Haydar, Choban Khıdır. 1979. Irak Türkmen Ağızları. Metinler – İnceleme – Sözlük [The
dialects of Iraqi Turkmen. Texts – Investigation – Dictionary]. Istanbul: İstanbul
Üniversitesi Edebiyat Fakültesi Türk Dili ve Edebiyat Bölümü dissertation.
Johanson, Lars. 1990. Subjektlose Sätze im Türkischen. In Bernt Brendemoen (ed.), Altaica
Osloensia. Proceedings from the 32nd Meeting of the Permanent International Altaistic
Conference, 193–218. Oslo: Universitetsforlaget.
MacKenzie, David Neil. 1961. Kurdish Dialect Studies, vol. I. Oxford: Oxford University Press.
MacKenzie, David Neil. 1962. Kurdish Dialect Studies, vol. 2. Oxford: Oxford University
Press.
Marufoğlu, Sinan. 1998. Osmanli Döneminde Kuzey Irak [North Iraq in the Ottoman pe-
riod]. İstanbul: Eren Yayınları.
Pashayev, Gazanfar. 2003. Azerbaijanis in Iraq – Little known people. Azerbaijan Interna-
tional. Spring (11.1). 24–25.
Saatçi, Suphi. 2018. The Turkman of Iraq. In Christiane Bulut (ed.), Linguistic Minorities in
Turkey and Turkic-speaking Minorities of the Periphery. (Turcologica 111), 331–343.
Wiesbaden: Harrassowitz.
Sümer, Faruk. 1952. Bayatlar. İstanbul Üniversitesi Edebiyat Fakültesi Türk Dili ve Edebi-
yatı Dergisi IV(4). 373–417.
Online resource:
Türkmeneli İşbirliği ve Kültür Vakfı. “Declaration of Principles of the Turkman Congress”.
http://www.iraqiturkman.org.tr/turkmen27.html (accessed 25 November 2011).
4. Western Iran
4.1. Western Iran: overview
Geoffrey Khan
1. Preliminary remarks
This section includes chapters on Turkic, Iranian and Semitic languages of western
Iran. Some of the languages discussed in these chapters are spoken across a wide
geographical area, which extends beyond the confines of western Iran. This applies
in particular to chapter 4.2 on Turkic of Iran and chapter 4.6 on Persian. The lan-
guages described in these last two chapters extend also into the area covered by
section 3 (eastern Anatolia and northwestern Iran). Chapter 4.3 is on Bakhtiari,
an Iranian language spoken in the Zagros mountains of southwestern Iran in the
four provinces of Khuzestan, Chahar Mahal va Bakhtiari, Lorestan and Esfahan.
The description is based for the most part on the documented dialect of Masjed
Soleymān. Chapter 4.5 concerns the Iranian language Hawrami, the heartland of
which is the Hawrāmān region in the northern Zagros mountains in Kermanshah
province. This is a variety of the Iranian language Gorani, the geographical area
of which includes northern Iraq (see Haig, this volume, chapter 3.3 on Gorani in
Iraq). The description of Hawrami presented here is based on the documented
dialects of Pāwa and Nawsūd. Chapter 4.4 concerns the Neo-Aramaic dialects of
the region. These include dialects belonging to the subgroups of North-Eastern
Neo-Aramaic (NENA) and Neo-Mandaic. The NENA dialects were spoken until
the middle of the twentieth century in the Kordestan and Kermanshah provinces
and the Neo-Mandaic dialects were spoken in the towns of Ahvāz and Khorram-
shahr in Khuzestan province. These Neo-Aramaic dialects are now highly endan-
gered and very few speakers are left in western Iran. The descriptions are based
on fieldwork carried out in diaspora communities outside Iran. Chapter 4.7 is a
description of Kumzari, a language exhibiting a mixture of features from Western
Iranian and Semitic spoken in the Musandam governorate of Oman. This profile
of Kumzari has arisen through the mixing of migrant communities from Iran and
South Arabia.
The languages described in this section exhibit a variety of contact-induced
features. These can be classified broadly into features arising through convergence
with Persian, the national standard language of Iran, and those that are shared
areal features across the languages of western Iran arising through the geograph-
ical contiguity of various language communities. Standard Persian has had some
impact on all languages of the region, but this appears to be more superficial on
386 Geoffrey Khan
the languages of the areas of Kordestan and Kermanshah provinces (Hawrami

and NENA) than on languages spoken in other areas of Iran (Turkic, Bakhtiari,
Neo-Mandaic).
Since Turkic and Persian were spoken also in the region of section 3 (eastern
Anatolia and northwestern Iran), in the following overview reference will occa-
sionally be made to contact-induced features in languages described in section 3.
2. Phonology
2.1. Zagros d
In several languages of the region a d in post-vocalic position undergoes a process
of lenition. This has a variety of outcomes across the languages. In the variety of
Hawrami described in chapter 4.5 it is realized as an interdental approximant. In
several varieties of Hawrami spoken outside Hawraman the weakened d is real-
ized as a semivowel /y/. In the speech of migrants from Hawraman in Kerman-
shah it is realized as /y/ or as a lateral /l/ (Mahmoudveysi and Bailey, chapter 4.5,
§3.1). In Bakhtiari d after a word-internal vowel or glide is realized as an inter-
dental approximant or in some dialects as a sonorant r (Anonby and Taheri-Ardali,
chapter 4.3, §2.1). In some Turkic varieties in the Zagros area postvocalic d is
weakened to the sonorant /r/ or sometimes /y/ in combination with front vowels
(Bulut, chapter 4.2, §2.1.2.4). In NENA dialects in the Kordestan and Kerman-
shah provinces a postvocalic d has developed into a lateral /l/ (Khan, chapter 4.4,
§5.1.1). The lateral appears both where there is historically a voiced interdental
*ḏ or an unvoiced interdental *ṯ, e. g. Jewish Sanandaj ʾila < *ʾiḏā ‘hand’, bela <
*bayṯā ‘house’. An intermediate stage of development appears to have been *ṯ > d,
*ḏ > d, whereby both interdentals became a voiced stop d. This intermediate stage
is attested in some NENA dialects of northwestern Iran, e. g. Jewish Urmi ida <
*ʾiḏā ‘hand’, ade < *ʾāṯē ‘he comes’. The lateral /l/ would have, therefore, been
the outcome of a lenition of the stop d.
2.2. Pharyngeals and pharyngealization

Several languages of the region exhibit the use of pharyngeal consonants. In
some languages these are largely restricted to loanwords from Arabic, e. g. Turkic
(Bulut, chapter 4.2, §2.1.1.2) and Neo-Mandaic (Khan, chapter 4.4, §5.2.2). In the
languages of Kordestan and Kermanshah provinces they are also found in native
lexical items. In Hawrami this applies to the unvoiced pharyngeal, which is found
in words of Iranian etymology, e. g. ħaft ‘seven’ (Mahmoudveysi and Bailey,
chapter 4.5, §3.1). In the NENA dialects both unvoiced and voiced pharyngeals
that are native to Aramaic are preserved in the environment of pharyngealized con-
Western Iran: overview 387
sonants, or at least those that were pharyngealized at some earlier historical period
(Khan, chapter 4.4, §5.2.1). Indeed in both NENA and Neo-Mandaic the original
pharyngealized coronal consonants ṭ [tˁ] and ṣ [sˁ] now tend to be pronounced
without pharyngealization. In the NENA dialects pharyngeals occasionally occur
as a reflex of a historical interdental *ṯ in a pharyngealized environment, e. g.
ʾaḥra < *ʾˤaˤhˤrˤaˤ < *ʾaṯrā ‘town’.
In the NENA dialects there is a pharyngealized lateral phoneme ḷ [lˁ], which is
stable and kept distinct from its plain counterpart l (Khan, chapter 4.4, §5.3.1). The
pharyngealized lateral is an areal feature and found also in Hawrami (Mahmoud-
veysi and Bailey, chapter 4.5, §3.1).
Kumzari has a series of pharyngealized/velarized consonants that have their
origin in Semitic, but have spread to words of Iranian etymology, e. g. pānḍa
‘fifteen’. The language has the unvoiced Semitic pharyngeal ḥ but not the voiced
pharyngeal (van der Wal Anonby, chapter 4.7, §2).
2.3. Laryngeals
In the Turkic varieties of the region a non-etymological laryngeal h has developed
at the beginning of many words that historically began with a vowel, e. g. helämı-
jälär ‘they don’t do’ (< elämiyirlär), or in loanwords that would have begun with
a vowel without the added h, e. g. helbet (< Arabic albatte) ‘naturally’, or häqiq
(< Persian < Arabic ʿaqîq) ‘agate-stone’ (Bulut, chapter 4.2, §2.1.2.3). This phe-
nomenon appears to be related to the development of the word-initial glottal stop
to h- in Jewish NENA dialects in the region, e. g. J. Sanandaj hezəl ‘he goes’ (<
ʾezəl) (Khan, chapter 4.4, §5.2.1).
In Bakhtiari an intervocalic laryngeal stop develops into h in some dialects,
e. g. sāʔat > sāhat ‘hour’ (Anonby and Taheri-Ardali, chapter 4.3, §2.3).
2.4. Realization of q
The realization of the postvelar or uvular q as a fricative, which is characteristic
of Persian (Paul, chapter 4.6, §3.2), has been replicated in a number of other lan-
guages in the area. This is found in Turkic varieties of Iran, not only in Persian
loanwords but also in Turkic lexical forms, mostly in Sandhi assimilations after
voiced consonants or vowels, e. g. bu qayanası [bo ɣejanasɨ] ‘this (was) her moth-
er-in-law’, oqu- [oxu-] ‘to read’ (Bulut, chapter 4.2, §2.1.1). In Neo-Mandaic q is
often realized as a voiced uvular fricative [ʁ] after a vowel in a stressed syllable
(Khan, chapter 4.4, §5.1.2). A similar realization of postvocalic q is found in the
Jewish Urmi dialect of NENA (Khan 2008: 20). In the Jewish NENA dialects of
the region q after a vowel or w, it is occasionally realized as an unvoiced uvular
fricative (Khan, chapter 4.4, §5.1.1), e. g. Jewish Sanandaj qoqé [qoːˈχeː] ‘pots’.
388 Geoffrey Khan
2.5. Palatalization
Turkic varieties in the region exhibit the palatalization of the velar stops k and g to
[kj] and [gj], or, in a more advanced form, affrication to [ʤ] or [ʧ], in which case
the original affricates [ʤ] and [ʧ] are fronted to [dz] and [ʦ/tç]. This is found in par-
ticular in the Turkic spoken in the north of the region in the Azerbaijanian exclave
Nakhichevan, in some regions of Georgia, around Tabriz and Urmi in Iran, and in
northern Iraq (Bulut, chapter 4.2, §2.1.2.4). Similar palatalization of these conso-
nants is found in the Christian NENA dialects of the Urmi region (chapter 2.5, §5.1).
It is also documented in Bakhtiari (Anonby and Taheri-Ardali, chapter 4.3, §2.1).
2.6. Reduction of high vowels

In the Turkic of the region high vowels are shortened or completely elided in
word-initial unstressed syllables, e. g. [thIkɛːrde] < tikärdı́ ‘was sewing’ (Bulut,
chapter 4.2, §2.1.2.2). A similar reduction of short high vowels can be identi-
fied in the neighbouring Jewish NENA dialects, e. g. [ʃʰɪ̥ ˈma] < šəmá ‘heaven’
(Khan, chapter 4.4, §5.5.1), and also in Hawrami, e. g. wít ‘he slept’ and náwt
‘he did not sleep’ (Mahmoudveysi and Bailey, chapter 4.5, §3.2). In the Northern
Kurdish variety of Behdinī, a similar process can be observed in the formation
of the present indicative. In most of Northern Kurdish, the present indicative is
formed with a prefix [dɨ-], but in North Iraq, the vowel is regularly elided (Haig,
chapter 3.3, §4.1).
2.7. Breaking of long vowels

In Bakhtiari long ē sometimes breaks to the sequence ie, e. g. zēne > ziene ‘woman’
(Anonby and Taheri-Ardali, chapter 4.3, §2.1). This process of breaking of the
vowel is found also in Neo-Mandaic, no doubt induced by contact, e. g. bieṯā (<
*bēṯā < *bayṯā) ‘house’ (Khan, chapter 4.4, §5.4.2).
3. Morphology
3.1. Demonstrative pronouns

In the languages of the region the system of demonstrative pronouns marks two
levels of deixis, near and far. This contrasts with the demonstrative systems in dia-
lects of Iranian, Turkic and Semitic in the regions of Anatolia and Iraq, discussed
in sections 2 and 3, which generally have more levels of deixis, typically near, far
and very far, or near the 1st person, near the 2nd person and far.
There is also some degree of convergence of the form of demonstratives across
the languages of the region. This applies, for example, to the form of the demon-
strative pronouns of the Neo-Aramaic dialects (Khan, chapter 4.4, §6.1.1). The
NENA singular far deixis demonstrative pronoun ʾo ‘that’ resembles its counterpart
in Turkic o (Bulut, chapter 4.2, §2.3.2). It is also relevant to note that the gender
distinction of this demonstrative has broken down in the NENA dialects. This may
also have been induced by contact with Turkic, which has no gender distinctions.
The NENA near deixis pronoun ʾiya, which is attested in C. Sanandaj and also in
some NENA dialects in Iraq, closely resembles the discontinuous near demonstra-
tive ī … a of Hawrami, e. g. ī hanār=a ‘prox .dem .adj pomegranate=dem . m ’ (‘this
pomegranate’) (Mahmoudveysi and Bailey, chapter 4.5, §4.3.5). Compare also the
Kumzari near demonstrative ya (van der Wal Anonby, chapter 4.7, §4). These are
not loanwords in the NENA dialects but rather the result of the shaping of the
native morphological material in imitation of corresponding forms in languages
in contact.
3.2. The marking of definiteness

The Kurdish definite-marking nominal suffix -aka has spread to many languages
of the region. In the NENA dialects it has the form –ăke, which corresponds to
the oblique form of the Kurdish definite article (-ăkay) rather than the nomina-
tive form (–ăka), e. g. Jewish Sanandaj bela ‘house’, belăke ‘the house’; Christian
Sanandaj besa ‘house’, besăke ‘the house’ (Khan, chapter 4.4, §6.3.1). The suffix
is found also in Turkic (Bulut, chapter 4.2, §2.4.2.2), e. g. šär-äkä-sı-nı (poem-
def - poss .3 sg - acc ) ‘this certain/aforesaid poem of his’ (Sonqorî) and Bakh-
tiari (Anonby and Taheri-Ardali, chapter 4.3, §3.1.1), e. g. hast=eke ‘the bone’.
In Hawrami it is inflected for number and gender (Mahmoudveysi and Bailey,
chapter 4.5, §4.1.1), e. g. warg-aka ‘wolf-def . m ’ ‘the wolf’, ađā-kē ‘mother-def .f ’
‘the mother’, palawar-akē ‘bird-def .pl ’ ‘the birds.’
Kumzari has a definite-marking suffix with the form -ō, e. g. qiṣr ‘palace’,
qiṣr-ō ‘the palace’. Unlike in the other languages in this section the definite-marker
in Kumzari is attached also to attributive adjectives that modify a definite noun,
e. g. qēṣarit-ō jwān-ō ‘the good date’ (van der Wal Anonby, chapter 4.7, §4, §11).
3.3. The marking of indefiniteness

Indefinite-marking suffixes on nouns are found in many Iranian languages of the
region, e. g. Persian mard-ī (Paul, chapter 4.6, §4.5.1), Bakhtiari (Anonby and
Taheri-Ardali, chapter 4.3, §3.1.1), e. g. doʋdar=ey ‘a girl’, Hawrami (Mahmoud-
veysi and Bailey, chapter 4.5, §4.1.1), e. g. kuř-ēw ‘a son’. These constructions
may be combined with the cardinal numeral ‘one’ in some of these languages, e. g.
colloquial Persian yek mard-ī ‘a man’ (Paul, chapter 4.6, §4.5.1) and Bakhtiari
(Anonby and Taheri-Ardali, chapter 4.3, §3.1.1), e. g. ya doʋdar=ey ‘a girl’. The
cardinal may be suffixed in place of the indefinite-marker -ew in Hawrami, e. g.
390 Geoffrey Khan
dēq-ya ‘a sorrow’. Kumzari has an indefinite-marking suffix with the form -ē, e. g.
qiṣr ‘palace’, qiṣr-ē ‘a palace’ (van der Wal Anonby, chapter 4.7, §4).
Turkic (Bulut, chapter 4.2, §2.3.2) and NENA dialects (Khan, chapter 4.4,
§6.3.1) of the region use only the cardinal numeral ‘one’ to express indefiniteness.
Neo-Mandaic borrows the Iranian pattern cardinal numeral + noun + indefinite
suffix along with the Iranian morphemes ya ‘one’ and -i, e. g. ya tājer-i one mer-
chant-indf ‘a merchant’ (Khan, chapter 4.4, §6.3.2).
3.4. Copulas
All languages of the region have enclitic present indicative copulas. There is a
close, and in some cases complete, correspondence between the inflection of such
enclitic copulas and the inflectional endings of present verbs. In some languages
the 3rd person forms of the copula, or at least the 3rd singular forms, are distinct
from the present verbal inflection, e. g. Hawrami (Mahmoudveysi and Bailey,
chapter 4.5, §4.3.5), standard Persian (Paul, chapter 4.6, §6.5), Turkic (Bulut,
chapter 4.2, §2.3.3.1), the Christian NENA dialect of Sanandaj (Khan, chapter 4.4,
§7.1) and neo-Mandaic (Khan, chapter 4.4, §7.1). There is evidence of diachronic
development in some dialectal varieties of these languages resulting in a conver-
gence between the enclitic copula and the present verbal inflection also in the 3rd
person. This applies, for example, to colloquial Persian and the Jewish NENA
dialects of the region. Persian also has an independent stressed form of copula in
addition to an enclitic present copula (Paul, chapter 4.6, §6.5). An independent
copula also exists in Neo-Mandaic, which is more frequently used than the enclitic
copula. It is obligatorily used when the predicate is a nominal (Khan, chapter 4.4,
§7.1). The Neo-Mandaic enclitic copula is restricted to locative or adjectival pred-
icates (Khan, chapter 4.4, §8.1.1).
3.5. Pre-verbal indicative markers

The Iranian languages and Neo-Aramaic of the region share the common feature
of using a preverbal particle to mark the indicative form of the present:
Persian: mī-
Hawrami mi-, ma-
Bakhtiari e-
NENA k-, g-
Neo-Mandaic qa-, qə-
The Iranian languages also mark the irrealis subjunctive by a preverbal particle
before the present verb. There is no particle that corresponds to this in the Neo-Ar-
amaic dialects:
Persian: be-
Bakhtiari be-
Hawrami: bi-
NENA Ø-
Neo-Mandaic Ø-
Kumzari expresses the realis by a d infix before the inflectional ending. The irrealis
is expressed by the absence of this infix (van der Wal Anonby, chapter 4.7, §5.3).
4. Syntax
4.1. Progressive constructions

The pattern of some progressive constructions have spread across linguistic bound-
aries.
One example of this is the progressive construction formed by the reduplica-
tion of the verbal stem reported to exist in Hawrami (Mahmoudveysi and Bailey,
chapter 4.5, §4.3.6), e. g. řamāy ma-ram-ūna ‘I am now running’. This pattern is
found also in the Jewish NENA dialects of the region, e. g. găroše gărəš ‘he is
pulling’ (Khan, chapter 4.4, §7.1).
In Persian the progressive is expressed by combining the verb ‘to have’ with
the indicative present dāram mī-ravam ‘I am going’ (Paul, chapter 4.6, §6.11).
Similar constructions are found in some Iranian languages of the Caspian region.
In such languages the auxiliary verb related etymologically to the Persian verb
‘to have’ is what Stilo in chapter 5 on the languages of the Caspian region calls
a ‘locative be-verb’ (§6.5.8.1). This is evidently its original meaning. So the con-
struction expresses location within an action. The Persian construction has been
replicated in some Jewish NENA dialects (Khan, chapter 4.4, §7.1), e. g.
(1) hite k-e
poss -3 ms ind -come.prs .3 ms
‘he is coming’
It is also relevant here to mention the Bakhtiari indicative preverbal particle e-.
According to Anonby and Taheri-Ardali this is used with some verbs to express
specifically progressive aspect, e. g. daʋn-om ‘I run’ vs. e-daʋn-om ‘I am running’.
This e- particle is homophonous with the Bakthiari 3s copula clitic and this may
indeed be its origin. If so, this would resemble the progressive construction
attested in the Christian NENA dialect of Sanandaj, which is formed by combining
a copula with a present verb (Khan, chapter 4.4, §7.1), e. g.
392 Geoffrey Khan
(2) garš-en=yen
pull.prs - d .1 ms = cop .1 ms
‘I am pulling’
4.2. Perfect constructions

The pattern of some constructions of the perfect have spread over linguistic bound-
aries in the region.
The perfect construction consisting of a participle and copula that is a feature of
Persian, e. g. rafte-am ‘I have gone’ (Paul, chapter 4.6, §6.8), Hawrami (Mahmoud-
veysi and Bailey, chapter 4.5, §4.3.6) and Kurdish dialects of the region, has been
replicated in the Jewish NENA dialects zila=yena ‘I have gone’ (Khan, chapter 4.4,
§7.1). Also in the Turkic varieties of Iran this pattern of perfect is replicated in con-
structions consisting of a gerund in {-(y)Xb} and copula, replacing earlier Turkic
perfect constructions with the morpheme {-mXš} (Bulut, chapter 4.2, §2.3.3.2).
Another pattern of perfect construction consists of a fossilized invariable 3s
copula element combined with a past perfect form. This is attested in the Christian
NENA dialect of Sanandaj, e. g.
(3) g-i-grəš-le
ind - cop .3 ms -pull.pst - l .3 ms
‘he has pulled’
The same construction can also be identified in the perfect of Bakhtiari (Anonby
and Taheri-Ardali, chapter 4.3, §3.3.2), e. g.
(4) xaʋsid-om-e
sleep.pst -1 s - cop .3 s
‘I have slept’
The formation of a present perfect from the combination of a copula and a past
verb is attested in Early Judaeo-Persian (Paul, chapter 4.6, §6.10), e. g.
(5) nibišt hest
write.pst .3 s cop .3 s
‘he has written’
4.3. The syntax of pronominal clitics

In Persian, and the Iranian languages of the region, the series of pronominal clitics/
suffixes that are attached to nouns to express a possessor is also used to express
pronominal direct or indirect objects of verbs, e. g. Persian (Paul, chapter 4.6,
§4.9.1):
(6) a. barādar=at
brother=2s
‘your brother’
b. mī-bīnam=at
ind -see.
prs .1 s =2 s
‘I see you’
c. goftam=at
say.pst .1 s =2 s
‘I told you’
In some Iranian languages of the region such as Persian and Bakhtiari there is gen-
eralized accusative alignment and these pronominal clitics express the objects of
both present and past verbs. In other Iranian languages, such as Hawrami, where
there is ergative alignment, the clitics express the object only on present verbs.
The functional range of the pronominal clitics in Turkic varieties in Iran has
been extended from possessive clitics on nouns to clitics expressing benefac-
tive and dative on predicative constructions, e. g. lazım-mız (necessary-poss .1 pl
(is)) ‘we need’, ertebâṭ var-ı (relation existent- poss .3 sg ), literally ‘his contact
(is) existent’, that is: ‘he has contact’ under the influence of Iranian languages
(chapter 5.1, §7.2.3).
Neo-Mandaic, which has accusative alignment with all verbal forms, uses the
series of possessor pronominal clitics to express the pronominal objects of past
verbs and of 1st and 2nd person present verbs. 3rd person present verbs express
the pronominal object by oblique clitics consisting of the oblique, originally
dative, preposition l- and pronominal suffix (Khan, chapter 4.4, §7.2). In Classi-
cal Mandaic the ancestor of the Neo-Mandaic present forms, i. e. active partici-
ples, could express pronominal objects with either possessor suffixes or oblique
l-suffixes also when the subject was 1st or 2nd person (Nöldeke 1875: 391). This
suggests that the Iranian-type of pronominal objects have gained ground in the
Neo-Mandaic present, most likely through convergence with the Iranian model.
The NENA dialects, which have accusative alignment with present verbs, nor-
mally use oblique l-suffixes to express the accusative pronominal object of present
verbs. There is not, therefore, a correspondence between the possessor suffixes on
nouns and those expressing verbal objects as in Iranian languages. In the Jewish
NENA dialects of western Iran, however, unlike NENA dialects elsewhere, the
possessor suffix is used to express the pronominal object of present verbs with 1st
person singular subjects (Khan, chapter 4.4, §7.1), e. g.
(7) Jewish Sanandaj (Khan 2009: 154–155)
gărəš-n-ef
pull.pres -1 s - poss .3 ms
‘I (ms) pull him’
394 Geoffrey Khan
This could also be interpreted as convergence towards an Iranian-type of expres-

sion of pronominal object. As in Neo-Mandaic this convergence has taken place in
verbs with non-3rd person subjects.
4.4. Oblique case marking of nouns

Hawrami makes a distinction between nominative and oblique case in nouns.
The contexts in which the oblique case is used include (i) the complement of an
adposition, (ii) the dependent element in a genitive (ezāfe) construction, (iii) the
argument expressing the object of a present tense verb and (iv) the agent of a past
transitive clause (Mahmoudveysi and Bailey, chapter 4.5, §4.1.1). In the Jewish
NENA dialects of the region oblique case marking has developed in demonstra-
tive pronouns by bonding the genitive particle d with the pronoun, e. g. do ‘that
(oblique)’ < d gen + o dem (for this process see Khan 2015, 2016, vol. 1: 215–217;
Gutman 2018). Such oblique demonstratives are used not only as complements of
adpositions and in genitive constructions in noun phrases, which are the historical
contexts of the use of the genitive particle d in Aramaic, but also occasionally as
the direct object of present verbs, e. g.
(8) J. Sanandaj (Khan 2009: 157)
do gărəš
dem . obl pull.prs .3 ms
‘he pulls him’
This extension of the contexts in which the oblique pronoun is used is likely to be
the result of convergence with Iranian oblique constructions.
4.5. Postverbal position of goal constituent

A shared feature of the languages of the region is the placement of the constituent
expressing the goal of an action after the verb. This applies also to the Turkic of
Iran, in which a constituent with dative case expressing a goal occurs in postverbal
position. This deviation from the regular verb-final syntax of Turkic languages in
other regions must have developed by convergence with the Iranian languages of
the area (Bulut, chapter 4.2, §2.4.2.4).
4.6. Alignment
The Iranian language Hawrami has ergative alignment in clauses with past per-
fective and present perfect verbal forms. Ergativity is found in the same construc-
tions also in the Jewish NENA dialects of Kordestan and Kermanshah provinces,
with distinct inflectional endings marking the subject of transitive and intransi-
tive verbs. With the exception of the Jewish dialect of Sulemaniyya and Ḥalabja
across the border in Iraq (Khan 2004), in other NENA dialects the subject of tran-
sitive and intransitive past perfective and perfect verbs have the same inflectional
markers. It is likely that the ergativity of the Jewish dialects of western Iran arose
by convergence with the neighbouring ergative Iranian languages.
Persian and Bakhtiari have accusative alignment. Neo-Mandaic is likewise
accusative in its alignment. This is due to the fact that it has been in contact with
Iranian languages with accusative alignment, such as Persian and Bakhtiari, rather
than ergative Iranian languages.
4.7. Adverbial temporal clauses

Many languages of the region exhibit a common syntactic pattern of adverbial
temporal clauses that stand in sentence-initial position. In this pattern the subor-
dinating particle (glossed below as sub ) of the adverbial clause is placed imme-
diately before the verb, which is preceded by one of its complements (typically
subject or object), e. g.
(9) Bakhtiari (Anonby and Taheri-Ardali, chapter 4.3, §4.3.2)
ya modat=ey ke godašt
a while=indef sub pass.pst .3sg
‘When a while passed, …’ (= ‘after a while …’)
(10) Turkic (Bulut, chapter 4.2, §2.5.2.2)
o šoxm ke elämišdik äkärdik.
that plough sub do. pluprf .1 pl sow. imp .1 pl
‘After we had ploughed (the fields), we used to sow.’
(11) NENA, Christian Urmi (Khan 2016, vol. 2: 490)
+
búsra k̭ at-parə̀m-va,| …
meat sub -slaughter.prs .3 ms - pst
‘When meat was slaughtered, …’
Similar constructions with the subordinator shift to preverbal position in the lan-
guages of the Caspian region are described by Stilo in his chapter in this volume
(§8.2.4).
4.8. Evidential
The languages of the region exhibit a certain tendency to extend the use of the
perfect to the expression of the evidential. Kumzari has a more extensive gram-
maticalized system of evidentiality, which classifies the information source as
firsthand sensory, non-firsthand reportive, or non-firsthand inferred (van der Wal
Anonby, chapter 4.7, §8).
396 Geoffrey Khan
5. Turkic and NENA
Some of the distinctive features of Turkic described in chapter 4.2 have been rep-
licated in NENA dialects of the Urmi region in the West Azerbaijan province in
northwestern Iran, but not in the NENA dialects further south in the Kordestan
and Kermanshah provinces. These include the palatalization of dorsal consonants,
which has already been mentioned in §2.5 above.
Another phonological feature is the development of suprasegmental pharyn-
gealization in the NENA dialects of the Urmi area (chapter 2.5, §5.4), which must
have had a Turkic model. It is important to note that vowel harmony is in the
process of breaking down in the Turkic of Iran, resulting in phenomena such as the
delabialization of front rounded vowels (Bulut, chapter 4.2, §2.1.2.1). The model
of the NENA harmonic systems, therefore, is likely to have been the Turkic of
earlier periods.
A syntactic feature of the NENA of the Urmi region that has developed due to
convergence with Turkic is the frequent use of infinitives and verbal nouns in sub-
ordinate adverbial clauses (Khan 2008: 288–290, 2016, vol. 2: 241–247), which
replicates the syntax of Turkic gerund constructions. Bulut, however, reports
that in the Turkic of Iran today such subordinate gerund constructions have been
largely replaced by subordinate constructions with finite verbs under the influence
of Persian (Bulut, chapter 4.2, §2.5.1.1). Here again, therefore, it appears that the
convergence of the NENA dialects with Turkic must have taken place at an earlier
period when gerunds were still widely used in Turkic. The lexicon of the NENA
dialects of Urmi has a large proportion of Turkic loanwords (Khan 2008: 383–385,
2016, vol. 3: 1–3). In some cases these words are no longer used in the contem-
porary Turkic of Iran, but can be traced to Ottoman Turkish (Khan 2016, vol. 3:
2). In addition to lexical, there are some material borrowings of morphology. The
Jewish Urmi NENA dialect, for example, has borrowed the Turkic instrumental
case ending -inan (for this form of the ending in the Turkic of Iran see chapter 4.2,
§2.3.1.3) as a phrasal coordinating particle (Khan 2008: 234–235).
A feature relating to verbal aspect is the functional levelling of progressive and
non-progressive present in Turkic. Bulut (Bulut, chapter 4.2, §2.3.3.2) describes
how in some varieties of Turkic the distinction between the present progressive
(“the focal present” in her terminology) and the aorist, i. e. the non-progressive
present (habitual, iterative, intention) has been lost. In some varieties this is due to
a formal merger of the aspectual verbal affixes and in some varieties the functional
range of the progressive form has been extended. This phenomenon in Turkic may
have had an impact on the NENA dialects of the Urmi area, in which the originally
progressive verbal form has come to be used with a non-progressive function.
This is found frequently in the Christian Urmi (Khan 2016, vol.2: 185–198) and
Jewish Urmi (Khan 2008: 274–278) dialects. In the Jewish Salamas dialect this is
regularly the case.
References
Gutman, Ariel. 2018. Attributive constructions in North-Eastern Neo-Aramaic. Studies in

Diversity Linguistics 15. Berlin: Language Science Press.
ies in Semitic Languages and Linguistics 44. Leiden & Boston: Brill.
Khan, Geoffrey. 2008. The Jewish Neo-Aramaic dialect of Urmi. Gorgias Neo-Aramaic
Studies. Piscataway, NJ: Gorgias Press.
gias Press.
Khan, Geoffrey. 2015. Domains of emphasis, syllable structure and morphological bound-
aries in the Christian Urmi Dialect of Neo-Aramaic. In Geoffrey Khan & Lidia Nap-
iorkowska (eds.), Neo-Aramaic and its linguistic context, 145–161. Piscataway, NJ:
Gorgias Press.
Nöldeke, Theodor. 1875. Mandäische Grammatik. Halle: Buchhandlung des Waisenhauses.
4.2. The Turkic varieties of Iran
Christiane Bulut
1. Historical background and socio-linguistic situation
For the varieties of Turkic spoken all over western and central Iran (including
some pockets in eastern Iran) and their close relatives in Azerbaijan, eastern Ana-
tolia and Iraq no fully developed classification or Turcological model exists. The
linguistic map of the region has many blank spots, and a comparative study of the
whole area is still a desideratum — although a number of older and more recent
studies describe the Turkic varieties of individual regions, or inside the borders of
the respective nation states.1 Even terminology is still a matter of debate: Iran-Tur-
kic is called Turkî by most of its speakers, while the Turks of Iraq call themselves
Turkman, and their language Turkmanja. Publications from Azerbaijan use expres-
sions such as, for instance, Azeri (dialects) of Iraq, and Azeri (dialects) of Iran, or
South Azeri — often with political implications. Azerbaijani, on the other hand,
denotes the official language of the Republic of (North-)Azerbaijan. In Turcolog-
ical literature, closely related dialects in Turkey and Iraq generally are referred to
as eastern Anatolian or Iraq-Turkic/-Turkman dialects, respectively.
The Turkic varieties of Azerbaijan, Iran (with the exception of Khalaj and
certain dialects of Turkmen spoken in north-eastern Iran near the border of Turk-
menistan2), Turkey proper, eastern Anatolia and Iraq represent the western or Sal-
juqian branch of the Oghuz language group (see Table 1 below). The members of
this branch are genetically related and have shared major historical and linguistic
developments. During the 11th century, for instance, their speakers followed the
Saljuq’s migration from Central Asia into Iran, Iraq and Anatolia – thus causing
a first Oghuz split; for roughly four hundred years thereafter they formed part of
the tribal confederations dominating the greater area of Azerbaijan, eastern Ana-
tolia, Iran and Iraq, where some predecessor of ‘Turkî’, an Oghuz variety heavily
spiced with Iranian elements, was used as a language of communication.
Due to their common historical background, the Turkic varieties spoken in
present day Iran are very similar to each other and to the adjacent dialects west
1
Intensive research on dialectology has been carried out in North Azerbaijan (Republic
of Azerbaijan), see Širaliev (1983) on North Azerbaijani dialects, and the Dialect Atlas
(İslamov, M. İ, Aghajev, Ä. G., et. al. 1990). For a comprehensive survey of charac-
teristic features of Iran-Turkic varieties and models for their classification see Doerfer
(2006) and a very interesting model developed by Windfuhr (Enc. Iranica 2006).
2
Namely Turkmen (called North Oghuz in Doerfer 2006); Khorâsân-Turkic (or, accord-
ing to Doerfer 2006, East Oghuz) denotes a group of transitional dialects.
The Turkic varieties of Iran 399
of Iran, with characteristic features and isoglosses reaching down into Iraq and
eastern Anatolia. The official or standard language to which these Turkic varie-
ties are most closely related is Azerbaijani of the Republic of Azerbaijan, while
Turkish of Turkey has developed differently for at least 500 years.
All Turkic varieties of this region display different degrees of Iranicization. In
Iran proper, Iranicization of phonology, syntax and lexicon is extremely strong.
This facilitates mutual intelligibility despite some minor regional variation.3
The following survey (Table 1) lists the main representatives of the Oghuz lan-
guage group according to Doerfer’s (2006) classification (marked with an aster-
isk *), which is based on geographical criteria; it also considers the historical
affiliation of different Oghuz groups with the Saljuqs’ movement to the West and
their relation to Turkî, the predecessor of present day Azeri, Iraq and Iran-Turkic.
Figure 1 indicates the locations of the groups mentioned in this chapter.
Table 1: The Oghuz language group

Name *Geographical Historical Spoken in
classification affiliation
according to
Doerfer (2006)
Turkmen *North Oghuz non-Salju- Republic of Turkmenistan, border
qian regions of NE Iran
Khorâsân *East Oghuz transitional Khorâsân province in NE Iran
Turkic dialects,
mainly
Saljuqian >
Turkî
Azerbaijanian *Central Oghuz Saljuqian > Republic of Azerbaijan,
Turkic/Azeri Turkî Azerbaijan province in NW Iran;
exclaves Galûgâh, Lotfâbâd,
Daragaz (Khorâsân); Terekeme in
Kars province etc.
Qashqâʾî and *South Oghuz or Saljuqian > SW Iran (South of Qazvîn/Zanjân:
related dialects Afshâr Turkî Central province, Sonqor, prov-
inces of Esfahan, Shirâz, Kerman;
exclave: Qabul/Afghanistan.
3
The influence of the mainstream Oghuz varieties is very strong and has even invaded
Khalaj, the only non-Oghuz member of the Turkic languages of central Iran: Nowadays
Khalaj is a strongly Oghuzicized and Iranicized language containing some relics or
fragments of an underlying older Turkic linguistic system.
Name *Geographical Historical Spoken in

classification affiliation
according to
Doerfer (2006)
East Anatolian transitional area Saljuqian > SE and NE Anatolian dialects;
dialects between Turkî Urum of Georgia
Azeri and Turkish
Iraq Turkic transitional area Saljuqian > Turkman belt in Iraq
between Central, Turkî
South and West
Oghuz
Turkish of *West Oghuz Saljuqian > Turkey and periphery: Balkans,
Turkey and Turkî Gagauz
related dialects
1.1. Formative factors in the recent history of settlement in western Iran

In the early 16th century, new political and religious powers began to emerge in
the Middle East, namely the Ottomans (espousing Sunni Islam) in the West, and
the Safavids, their main competitor, implementing the Shiʿa of the Twelve Imams
as their state religion in Iran. A highly disputed border cut through the area of
settlement of the local Turkic speaking populations, which laid the grounds for
the diverging development their language — a second major split in the Oghuz
language group, into Ottoman and Azeri.
Initially, the Safavid rulers of Iran resided in Tabriz in Azerbaijan (the old
capital under the Aq- and Qaraqoyunlu); later the political centre moved to Qazvîn,
still in the northwest of the country, and then to central-western Esfahan (already
an important centre under the Saljuqs).
Although the borders to the West (the Ottoman-Safavid border) had become
more defined, the fluctuation and the dislocation of especially nomadic elements
of the population inside Iran was considerable. Two events in the recent history of
Iran caused further major changes in the landscape of settlement. Many tribes and
communities were displaced during the raids of the Afghan Ghilzay tribes in 1722.
In the subsequent restitution period under Nader Shah (1736–1747), the political
centre shifted to Khorasan. While Nader Shah attempted to fortify his new strong-
hold in the Northeast, Turkic and Iranian tribes from all over the country were
exposed to deportation or forced settlement, among them Kurds, Turks and Lurs
from the Province of Fars. The establishment of the Kurdish Zand dynasty (1750–
1779) took place in Shiraz in the extreme Southwest. There, tribes returning from
the exile in Khorasan met with other followers of the Zand; among those were also
Turkic-speaking communities from the northwest, the area of Azerbaijan proper.
Moreover, most established rulers used to send the tribes all over the country,
locating them in sensitive areas at the frontiers, re-locating unruly elements in
marginal regions, or forming powerful tribal confederations to back up their polit-
ical influence.
Thus, the relocation of populations between the 16th and 18th centuries depicted
in Fig. 2 are highly characteristic of the history of settlement in Iran. Uprisings of
tribes requiring severe measures such as deportation or forced settlement are by
no means a phenomenon of the remote past; they still took place during the first
decades of the 20th century in the region of Iran and Anatolia.
1.2. Linguistic situation in present day Iran

Nowadays the linguistic situation of Iran is still a multi-faceted mosaic of different
languages, including the Turkic, Iranian, Semitic, and Kartvelian language fami-
lies, with different constellations of bi- and multilingualism prevailing across the
area. Consequently, traces of contact-induced linguistic change can be detected
throughout the region.
Given the political dominance that Turkic dynasties exerted over the greater
region of Iran for roughly nine centuries (the Qâjâr lasted till 1925), the relatively
low esteem in which Turkic languages were held is somewhat surprising. Ever
since the arrival of the Saljuq Turks, Modern Persian, the language of the admin-
istration and literature, had maintained its place as the most prestigious language.
In recent times, it has also gained considerable influence as a means of commu-
nication, the language of the media and of education — a development that was
accelerated under the Pahlevis, who adopted the concept of linguistic homogeneity
as an important means of nation building. For nearly a millennium, Turkic has
been spoken widely, although it never reached the status of a prestige or standard
language.
Traditionally, Turkic had also developed into an supra-regional language of
communication among smaller communities of speakers of minority languages
(such as, for instance, the Armenians of eastern Anatolia, Iraq and Iran; or Kurds,
Georgians, and Armenians in the Province of Esfahan etc.), and it is used as a
means of communication among people of different ethnic backgrounds in certain
social and professional groups, such as, for instance, the Bâzâris of Tehran, and,
far away from our area of concern, the street vendors in Moskow.
1.2.1. The speakers of Turkî

Most Turkic-speaking inhabitants of Iran adhere to the Shiʿi denomination of Islam
(also called Shiʿa of the Twelve Imams, or Jaʿfariyya). They call themselves Turk
or Turkî-zabân ‘turcophone’, and their Turkic language Turkî/Torkî. The expres-
sion Azer/Azerî is used only in the northwest, in the Iranian province of Azer-
402
Christiane Bulut
Figure 1: Locations of the Turkic varieties mentioned in this chapter

The Turkic varieties of Iran
Figure 2: Re-location of nomadic tribes between the 16th and 18th centuries
403
404
Christiane Bulut
Figure 3: Summer and winter locations of Qashqâʾî tribes

baijan. Outside Azerbaijan, speaking Turkî in public is considered shameful: The

usage of a typical low-prestige language implies that the speaker is uneducated,
boorish, impolite or obstinate.
Due to wide spread bi- and multi-lingualism, and the reservations some people
feel about admitting that they know and use Turkic in some areas of life, it is
extremely difficult to estimate the number of actual speakers; and it is even more
difficult to figure out the number of Iranians with some competence in Turkic.
Moreover, not all the regions where Turkic is spoken, either as the dominant local
language or as an inter-ethnic means of communication, have been fully explored.
Doerfer (2006: 93–95) gives some population figures for certain groups of
Turcophone Iranians: Azerbaijan (including the exclaves Galûgâh, Lotf-âbâd
and Daragaz in Khorâsân): over 12 million; South Oghuz (Qashqâʾî and related
dialects): no figure mentioned; Khorâsân (East Oghuz): more than 2 million;
Turkmen: 500,000 speakers; Khalaj: (explored in the late 1960s) 28,000. Similarly
Boeschoten (1998: 13, probably based on older estimates of Doerfer) assumes that
there were 13 million speakers of Turkî in Azerbaijan, 570,000 in the Qashqâʾî
region, 400,000 in Khorasan, plus 500,000 Turkmen and 28,000 Khalaj4, that is:
14,498,000 speakers of Turkic languages, or about a quarter of the total number of
inhabitants of Iran, estimated at 62 million in 1998. The population of Iran today
population is roughly 80 million. Given the strong population increase in the rural
or tribal areas where the speakers of Turkic languages reside, that would imply
that Iran hosts 19 million speakers of Turkî. This figure does not include pockets
of Turkic which have still not been explored.5 The number of people who use
Turkî as a second or third language or as a regional means of communication (and
who display different levels of active or passive command of the language) must,
however, be much higher.
1.2.2. Bi- and multi-lingualism

Nowadays, most speakers of Turkî are at least bilingual. The education system,
improved means of communication and transport, and, last but not least, the
omni-present media promote the spreading of Fârsî/Modern Persian as the official
language. Thus, mono-lingual speakers of Turkî can only be found among the
women of the generation over sixty in regions where Turkî is the dominant lan-
4
Assuming that the Khalaj linguistic community has shared the average growth in
population figures, Boeschoten (1998) should be corrected to 35,226 out of a total of
78,868,711 according to the recent official population statistics of Iran (July 2011).
As the basic estimate of 28,000 was made in the late 1960s/early 1970s, at present the
number of Khalaj speakers rather would be over 40,000.
5
CIA World Factbook mentions 68 millions inhabitants, of which 24 % (16,320,000) are
Turcophone.
guage, as, for instance, in Azerbaijan proper, and in rural and tribal areas with pre-
dominantly Turcophone populations. And even these mono-lingual women have
some passive command of Persian, acquired through contact with their children
or grandchildren, or with the media. Men with the same social background are
mostly bi- or multilingual, as they have served in the military and are exposed to
other languages through work or in their social networks. Normally, they display
active command of Persian, plus maybe one or two regional languages of commu-
nication.
The social contexts of bi- and multilingualism are very diverse. In Sonqor in
the Province of Kordestan, for instance, everyone has some active command of
Persian, while Turkî is used as a home language and for trade in the city; the regional
language of communication, though receding in recent times, is Kolyâ’î-Kurdish.
Armenians from the Central Province or the Esfahan Province speak Armenian,
Persian and Turkî, but sometimes additionally also varieties of Kurdish. Kalhor
Kurds in the eastern Hamadan Province use Kurdish as a home language, Turkî as
the regional language of communication, and speak Persian perfectly well.
1.2.3. Written language

The written language under the Saljuqs was predominantly Persian. Yet, a few
literary texts were also composed in olga-bolga varieties of Oghuz, or, beginning
with the mid-15th century, in the Chaghatay tradition. In the 16th century, poetry
in Turkî or “Azeri” starts to develop. This written language of Iran is sometimes
called Ajam Turkic.6
Although the Constitution of the Islamic Republic grants legal rights for pub-
lishing in minority languages7, relatively few books in Turkic language are pub-
lished in Iran. These books mostly contain poetry, folklore texts, collections of
fairy tales, religious treaties or research on local history. Publishing concentrates
around Tabriz, and to lesser extend, also Qom and Shirâz. In most instances, the
Turkic language is represented by an extended form of the Arabic alphabet, which
was introduced by the journal VARLIK in the 1980s. The linguistic standard is
either the Tabriz variety or Turkish of Turkey, which is considered to be “correct
Turkic”.
6
After the former term lingua turcica agemica, which Raphael du Mans coined in the 2nd
half of the 17th century, see Johanson (1985: 145).
7
Paragraph §15 of the Constitution of the Islamic Republic reads as follows (my transla-
tion): “The common official and literary language of the people of Iran is Fârsî. But the
use of the local or tribal languages (zabân-e maḥallî va qavmî) in printed matters and
in communication of the respective groups, as well as in teaching of their literature in
school, [as a second language] besides Fârsî, is free.”
2. Linguistic features
With the exception of Turkmen and the non-Oghuzic elements in Khalaj, Iran-Tur-
kic varieties are quite similar to one another. They share most of their structural
features and their lexicon, and a considerable proportion of elements copied from
Modern Persian, which additionally contributes to mutual intelligibility. To exem-
plify some of the characteristic developments in the Turkic varieties in context,
the appendix contains two short sample texts. Text 1 from Sonqor displays a
number of individual traces in phonology and morphology, while sample text 2
from Bayâdestân in the Central Province represents main-stream Turkî. Quota-
tions from these texts are marked by a number indicating the text and sentence
number. When referring to texts collected in other regions, I will also indicate the
place of origin in brackets.
2.1. Phonology
With regard to their phonological behaviour, Iran-Turkic varieties are considerably
diverse; consequently, they “sound” different. In most cases, the pronunciation is
heavily influenced by the languages in contact of the neighbouring area. Speakers
from Iraq or Sonqor have a heavy Kurdish “accent”; the Qashqâʾî pronunciation
imitates the local Luri or Shirazi dialects; speakers from the North and the Tabriz
region display heavy “fronting”, which is a typical areal feature. Strangely out of
place appears the so-called Qipchaq isogloss, a morpho-phonological assimila-
tion well-described for North Azerbaijani, which re-surfaces in the varieties of the
Khalaj of Bayâdestân in central Iran (see §5.1.4).
Traditionally phonetic or phonological copying8 seems to have been the result
of convergence in smaller regional units or of the influence of smaller regional
languages of communication, but now it has been widely replaced by replication
from the prestige or standard languages that the media propagate.
2.1.1. Consonant inventory

Across the region the basic consonant inventory of the Turkic varieties displays
• five pairs of voiced/unvoiced stops and affricates, namely: the bilabial stops
p/b, the alveolar stops t/d, the velar stops k/g, the post-velar/uvular stops q/ɢ,
and the affricates č/c [ʧ/ʤ].
• four pairs of voiced/unvoiced fricatives: the labio-dental fricatives f/v, the alveolar
sibilants s/z, and the post-alveolar sibilants š/ǰ [ʃ/ʒ], and the velar fricatives x/ɣ.
• a voiceless glottal fricative h.
8
Based on the code-copying model developed by Johanson (1993), contact-induced lin-
guistic processes are perceived as code-copying.
• two nasals: the bilabial m, and the alveolar n.

• two liquids: the alveolar approximant r and the lateral approximant l.
• one glide: the palatal approximant y [j].
The voiced post-alveolar sibilant ǰ [ʒ] is a foreign element that does not appear in
genuine Turkic lexical items. Moreover, Turkic words do not begin with c- [ʤ], ɣ-,
l-, m-, r-, and z-; the appearance of word-initial n- is restricted to interrogatives such
as, for instance, (nimä) nä ‘what’, niyä ‘why’, näcä ‘how’, and näčä ‘how much’.
Similar h- (< older q-) appears in interrogatives haçan ‘when’, harda ‘where’, and
hangu ‘which’. Word-initial m-, as, for instance, in the personal pronoun 1st singu-
lar män (< bän ‘I’) or the accusative of the demonstrative pronoun (munu < bunu
‘this’:Acc) has originated through regressive assimilation of b- to the following n.
The distribution of initial stops is complicated. According to current theories in
Turcology, Ancient Turkic had t-, k- and b-, with a tendency of the Oghuz branch
to develop lenes variants of t- > d-, and k- > g-; p- rarely occurs word-initially
(Doerfer 2006: 97). In present day varieties of Oghuz Turkic the occurrence of
lenes or fortes allophones of k- or t- varies across different regions, which makes
it unpredictable. Most Azeri or Iran-Turkic dialects prefer word-initial fortes in
Turkic lemmata such as, for instance, tik- (corresponding to Turkish dik-) ‘to
sew’, tök- (instead of Turkish dök-) ‘to pour’, or köč- (for göç-) ‘to nomadize’ (but
mostly gäči for käçi ‘goat’), while degil ‘not’, gör- ‘to see’, or git- ‘to go’ have
lenes in both languages. In combination with back vowels, the old lenes seem
to appear more frequently; the pronunciation of these elements, though, is rather
that of a media lenis (a devoiced d pronounced with strong pressure, resembling
the emphatic (pharygealized) Arabic ḍ), see examples such as dʕuz ‘salt’, or dʕaš
‘stone’. On the other hand Iran-Turkic or Azeri varieties display word-initial b-
in a number of lexical items beginning with p- in Turkish, see examples such as
barmaq (in contrast to Turkish parmak) ‘finger’, or p- in words beginning with
b- in (Standard) Turkish, as in pox (cf. Turkish box) ‘shit’. Other Turkic roots vary
between p- and b-, such as, for instance, piš- ~ biš- ‘to cook’.
Turkic [q] is an allophone of /k/ that appears in +back syllables and occurs in
complementary distribution with [k], compare qul ‘slave’ and kül ‘ashes’. Most
Turkic varieties of the target area have preserved word initial q-, see, for instance
sample text (1/3) [qabər] qabr ‘grave’, and (2/6) [qoɲʃʊsɨnən qızenıñ ] qonšusunun
qızının ‘his neighbour’s daughter’s’; in some regions q- may also be pronounced
as postvelar voiced stop [ɢ], which often happens in instances of Sandhi assimila-
tion. Note that Modern Persian pronunciation does not differentiate between the
voiced velar fricative /ġ/ and the unvoiced postvelar stop /q/, as both have merged
into a unique phoneme pronounced [ɣ]. Copies of this merger can be found across
Iran-Turkic varieties; especially lexical copies from Persian imitate the Persian
pronunciation, see, for instance, qazâ ‘food’, which also appears with initial [ɣ-]
as [ɣæzɒ]. Even word-initial /q-/ in Turkic stems may be pronounced as [ɣ-],
mostly in Sandhi assimilations after voiced consonants or vowels, see the follow-
ing examples from Bayâdestân in central Iran: [xɑlam ɣɨzınıj] < halam qızının ‘my
cousin’s’, [bo ɣejnanasɨ] bu qaynanası ‘this (was) her mother-in-law’, [dɞ ɣojmʏʃ]
de qoymuš ‘and has put’, but also [qojær ͜ əmɨʃ] qoyarmıš ‘one (obviously) used to
put’ (Bayâdestân). In word-internal position [q] quite regularly becomes [x], see
examples such as oqu- [> oxu-] ‘to read’, aqar [> axar] ‘it flows’. Especially in
copies of Persian lexical items word-internal /q/ is often pronounced as [-ɣ-], see,
for instance, the sample text 1 from Sonqor: [mothæɣıd], from Persian (< Arabic)
motaqâʿed ‘convinced’ (1/3), and [bɒːɣı ɣɑlmi] bâqî qalmıš, a mixed copy of the
Persian bâqî mândan ‘to remain’, in (1/4). In word-final position of mono-syllabic
words /q/ mostly becomes –x, as in yoq [> jox] ‘non-existent’, but it also displays
the voiced allophone [ɣ], see forms such as [jɛmɑɣ] ‘food’ (Iraq Turkic) or [ɣɑtɨːɣ
elɛ-] qatıq elä- ‘to prepare dried yogurt’ (Qashqâʾî).
Originally, the members of the pairs of voiced/unvoiced stops p/b, t/d, and
k/g did not form separate phonemes in Turkic. Due to the high percentage of bi-
and multi-lingual speakers in the region, and the fact that a major proportion of
the lexicon has been copied from other languages (such as, for instance, Modern
Persian) where /p/, /b/, /t/, /d/, /k/ and /g/ are distinct phonemes, the situation
has changed in present day Iran-Turkic. There is a practical need to differentiate
between Turkic bul! ‘find!’ and the Persian pul ‘money’, gäč ‘plaster’ and käč
‘crooked’ (< gağ, kağ, both loans from Persian appearing across Iran-Turkic vari-
eties), or Persian kâl ‘unripe’ and Turkic qal! ‘stay!’
Table 2: Inventory of consonants (allophonic variants and non-phonemic consonants in

loanwords are in square brackets)
Bi- Labio- Alveolar Postal- Palatal Velar Postvelar/ Pharyn Glottal
labial dental veolar uvular geal
Stop p b t d k g q ɢ [ʕ] [ʔ]
Nasal m n [ɲ] [ŋ]
Fricative [ß] f v s z ʃ ʒ x ɣ [χ ʁ] [ħ] h
Affricate ʧ ʤ
Approximant/ [w] j
glide
Trill/liquid r
Lateral l
approximant/
liquid
2.1.1.1. Secondary articulation

Aspiration of stops: k, t, p
The unvoiced stops k, t, p are heavily aspirated in syllable initial position and
word-finally before a pause, see examples such as [thɛphø] ‘hill’, [kheː] ‘that’,
(both: 1/1) and [bækh,] ‘beg, prince’ (1/3). The so-called “fronting” (see §4.5) only
occurs in the varieties situated north of a line drawn from Qazvin to Zanjan.
Long consonants in intervocalic position

Another characteristic feature shared by the Turkic varieties of the region of Ana-
tolia, northern Azerbaijan, Iraq, and western Iran is the appearance of long con-
sonants. Lengthening mostly relates to stops and alveolar fricatives such as [ss]
and [šš] in intervocalic position, as, for instance, in the following examples from
Qashqâʾî: [ʔæʃʃɛg] ‘donkey’, [ɑʃʃɑːɣi] ‘downwards’, and [ıssı] ‘hot’. The numer-
als which regularly display long consonants are [ıkkhe] ‘two’, [jedde] ‘seven’ (also
pronounced as [jeʔde]), [sækkhız] ‘eight’, [doqqʊz] ‘nine’, and [otthʊz] ‘thirty’.
2.1.1.2. Copied elements

Pronunciation of ‘ayn [ʕ], bilabial [w] and the pharyngeal fricative [ħ]
Turkic varieties in contact with Semitic languages or Kurdish dialects —such
as, for instance, Iraq Turkic (see the entry on Iraq Turkic: Bulut, this volume,
chapter 3.5), Sonqorî, a Turkic variety surrounded by Kolyâ’î, Feylî and Gorani
varieties, and, to some extend also the Turkic of the Turcophone Kalhor Kurds
in the eastern Hamadan Province—display copied or contact-induced pronunci-
ations of Semitic phonemes, such as the voiced pharyngeal fricative ‘ayn [ʕ], the
unvoiced pharyngeal fricative [ħ], the bilabial [w], and the uvular plosive [q], see
examples from Sonqorî, such as [mæħɛll] ‘place’ (1/2), [ħætthɒ] ‘till, up to’, [wɑːr]
‘exists’, [wæthæn] ‘homeland’ (1/4), and also [sæʕɑd̥] ‘time’.
Note that these copied phonetic items do not have the status of phonemes;
consequently the degree of their realization may vary, and their distribution is not
always true to the original.
2.1.2. Vowel inventory

Turkic languages display a basic inventory of 8 vowel phonemes in highly sym-
metrical distribution: two high/close vowels, namely front i [i] and central ı [ɨ],
and two low/open-mid vowels, namely front ä [ɛ] and back a [ɑ]. Every unrounded
vowel has a rounded counterpart in roughly the same position, forming four pairs:
i [i]/ü [y], ı [ɨ]/u [u], ä [ɛ]/ö [ø], and a [ɑ]/o [ɔ]. Additionally, most varieties of the
region have preserved a ninth vowel phoneme — the so-called closed ė [e].
Table 3: The Turkic vowel inventory

high / close
i ü ı u
front (ė) back
ä ö a o
low / open
The distinctive primary length of older stages of Turkic has been lost, although one
comes across long vowels which coincide with old length (in words such as, for
instance, [ɑːd] ‘name’ or [oːd] ‘fire’, examples from Iraq Turkic).
2.1.2.1. Contact-induced changes

Iranicization
Iranian or Semitic vowel systems, on the other hand, display three distinct vowel
qualities, each of which appears in pairs of phonemes distinguished by length
(namely i/î, a/â, u/û). Modern Persian — at present the most influential contact
language — has lost distinctive length; the 6 vowel phonemes are now a/â, e/i
(< i/î), and o/u (< u/û), while the old quantitative opposition /a/ vs. /â/ has partly
developed into a qualitative opposition of front /a/ [a, æ] vs. back /â/ [ɑ, ɑː, å:].
These additional qualities of /a/ have been copied into Turkic. They appear in
copies of Arabic or Persian words, see examples such as: [ælʔɒn] < Persian <
Arabic alʾân (sample text 1/4) ‘now’, [zæmɒn-iː] < Persian zamân ‘time-poss .3 sg ’
(1/1), [dærs] < Persian < Arabic dars ‘lesson’ (2/1), [ɒdæmɨj] < Persian < Arabic
âdam ‘man, human being.gen ’ (2/4). In Turkic roots the front half-open [æ] some-
times appears as a very open realization of the vowel phoneme /e/, see examples
such as [bækh] bäg ‘lord, prince’ (1/3), and [gæːlın] < gälin ‘bride’ (2/6).
In Turkic stems, thus, [æ] is in most instances an allophone of /e/, while it
represents a realization of short /a/ in Iranian and Semitic roots. With regard to the
Turkic phonological principle of palatal harmony, [æ] is of course problematic, as
it is situated on the borderline between front and back vowels. Labialized [å], on
the other hand, clearly functions as an allophone of Turkic [a].
Delabialization
Another characteristic feature of the so-called Iranization of Turkic vowel systems
is delabialization of the front rounded vowels /ö/ and /ü/. In most instances /ö/
is transformed into its unrounded counterpart [e] (as in öz > [ez] ‘self’, böyük >
[bejek] ‘big, grand’, gör- > [ger-] ‘to see’), while /ü/ becomes [ı], as in süt > [sid
‘milk], or yüŋg > [jıŋg] ‘wool’. Especially in the vicinity of the glide [j] or palatal
[g’] and [k’], [i] < [e] < /ö/ occurs, e. g. böyük > [bijek] ‘big’, or čökäläk > [ʧɪkɛlɪg]
‘fresh cheese’. Delabialization is more or less conventionalized in the southern
varieties of Iraq Turkic (Bulut 1999), in most Qashqâʾî varieties, in Khalaj in
central Iran (Doerfer 1988), in the Afshar varieties of Qabul and in the dialect of
ʿAlî Qûrchî south of Arak (see Doerfer and Hesche 1989: 28–29).
In irregular distribution — mostly as spontaneous variants of labialized forms —
delabialized pronunciations appear across the whole region, in south-eastern Ana-
tolian dialects with a strong Kurdish adstratum, or with bilingual Turkic-Persian
speakers.9
2.1.2.2. Intra-Turkic phonotactic processes

Shortening or elision of high vowels in the first syllable
Shortening or even complete elision of a high/close vowel in the first syllable
occurs, if this vowel is surrounded by two of the followings sounds: stops such as
/k/ [k, kh] and /t/ [t, th], which mostly display aspiration as a secondary feature, the
unvoiced affricate č [ʧ], and the alveolar fricatives /s/, /š/ [ʃ] or /z/, and the velar
fricative [x], e. g. [ʃIkɪl] < šäkil ‘form’, [tsixarɪ] or [tsxar] < čıxar(ı) ‘he/she comes
out’ (Iraq), [ʊʃʃɒːx] < ušax ‘child’, [thIkɛːrde] < tikärdı ‘was sewing’ (Sonqor),
[tʔtalla’] < tutarlar ‘they hold’ (Bayâdestân), and, with a Persian root, [bIʃtærsi] <
bištar.poss .3 sg ‘most of it’ (Esfahan).
This intra-Turkic phonological process, which appears across the Turkic vari-
eties from northern Azerbaijan, Iraq, and western Iran obviously is related to the
internal structure of the syllable: unvoiced/fortis consonants, especially when
combined with aspiration, are longer than voiced, that is, in most instances, lenis
consonants. To avoid overlong syllables, the fortes seem to acquire some of the
length of the vowel, which consequently becomes shorter.
Metathesis and rhotacism

Throughout the region, different forms of metathesis, lambdacism or rhotacism
appear, as in examples such as [torpax] < topraq ‘ground, soil’, [jarpax] < yapraq
‘leaf’, [ʧɪlpax] < čıplaq ‘naked’, [ırɛlı] < iläri ‘forward’, [kørpʏ] < köprü ‘bridge’,
etc. Consequently, stems and morphemes containing /l/ or /r/ are most susceptible
to various kinds of metathesis or assimilations (see below, morpho-phonology).
The consonant group /-lq-/ is pronounced as [x], both in Turkic or in loaned items,
e. g. /-lq-/ > [x]: in [qax] < qalq- ‘get up’, or [xax] < xalq ‘people’.
2.1.2.3. Contact-induced phonotactic developments

Instability of initial y-
A typical Azeri isogloss is the loss of word-initial y- before high/close front vowels
such as, for instance, i, ı, and ü. Across Turkî varieties of the region there is a ten-
9
Doerfer makes similar observations regarding Paradomba (Esfahan province), see
Doerfer and Hesche (1989: 28–29).
dency to delete old y- in front of i, as in igit ‘hero’ (corresponding to Turkish yiğit)

and itik (< yitik) ‘lost’, or to replace the sound group yı- by i-, as in ilan ‘snake’,
il ‘year’(< Turkish yılan, yıl). The same varieties may display variation, e. g. in
forms like yüng ~ üng ‘wool’ used interchangeably in Qashqâʾî. More unusual is
the development of yo- > o-, as in yoxarı ‘upwards, which has the form oxarı in
most Qashqâʾî varieties. In other instances, initial y- is preserved, see examples
such as yıɣ- ‘collect-’, yıγlamax ‘to weep’ (Sonqorî), or yit- ‘to loose’. Note that
word-initial sound groups such as yı-, yi-, or yü- are alien to the contact languages,
such as Modern Persian, where in genuine Iranian words only ya- or yä- occur.
Initial h-
Across the Turkic varieties of the region initial h- is problematic. In some lexical
items, such as his ‘soot’ (cf. Turkish: is), hör- ‘knit’ (Turkish: ör), or hürk- ‘shy’
(Turkish: ürk-), it coincides with older Turkic h-, which has been lost in other
Turkic languages. There are also numerous examples of conventionalized prothetic
h- in connection with foreign words, such as helbet (< Arabic albatte) ‘naturally’,
or häqiq (< Persian < Arabic ʿaqîq) ‘agate-stone’. In other instances spontaneous
h-prosthesis occurs, e. g. härwär-äkä ‘the woman’ (< Arabic ʿawrat-spec ) from
Sonqor, or helämıjälär ‘they don’t do’ (< elämiyirlär) from Bayâdestân. Prothetic
h- may have originated under the influence of contact languages such as Arabic
or Kurdish, which do not display word-initial vowels. Turkic varieties in different
language-contact situations completely avoid h-, e. g. äm ‘too, also’ (< Iranian
ham), or är (< Iranian här) in the Urum variety of Georgia.
2.1.2.4. Replicated areal features

Fronting
One of the most characteristic distinctive features in the pronunciations of the
Turkic varieties of the region is the so-called “fronting” (Stilo 1994), a phonologi-
cal process denoting palatalization of the velar stops /k/ and /g/ to [kj] and [gj], or,
in a more advanced form, affrication to [ʤ] or [ʧ], in which case the original affri-
cates [ʤ] and [ʧ] are fronted to [dz] and [ʦ/tç], see Bulut, this volume, chapter 3.5,
§2.1.1. This areal phenomenon is found in the pronunciation of genetically unre-
lated languages across Iraq, Anatolia, Azerbaijan and Iran. It is most prominent
in the Turkic varieties of the Azerbaijanian exclave Nakhichevan, in Qarabagh, in
some regions of Georgia, around Tabriz and Urmia in Iran, and in northern Iraq
(Bulut, this volume, chapter 3.5).
Zagros-d: /-d-/ > [-r-], [-j-]

In Kurdish and Goranî dialects of the Zagros area, intervocalic -d- is unstable (see
Mann 1930: 96). Sonqorî, a Turkic dialect of this region, has copied the instability
of -d- as an areal phonological feature. The so-called “Zagros-d” becomes [-r-]
with back and front vowels, and sometimes [-j-] in combination with front vowels.
This happens in connection with copied lexical items such as âdam [ɑræm]
(< Persian < Arabic), ‘man’, and adaqlı [araxli] ‘fiance’, as well as with genuine
Turkic roots, such as odun [urın] ‘firewood’, or with the copula past 3sg in idi [irı]
‘was’, and present 3sg in {+ri} < {+dir} ‘is’, see sample text (1/2), or verb forms
such as gedärdi [gijærdı] ‘he/she went along’.
To a lesser extent instability of intervocalic /-d-/ also appears in Qashqâʾî in
the southern Zagros area. In connection with the verb stem ged-, for instance,
intervocalic /-d-/ may either be pronounced as [d], or it may be replaced by the
glottal stop [ʔ] or the palatal glide [j], as the following forms based on the verb
root ged- ‘to go’ demonstrate: [geʔıllær], [gejllær], [gejıllär], all three forms being
pronunciations of gedirler ‘they go’; see also Bulut (2016: 250–251).
South-western /b/ > [-v-] ~ [-w-]

The most characteristic areal feature in the phonology of Qashqâʾî varieties are
secondary diphthongs or long monophthongs, which appear with foreign and
genuine Turkic words. In intervocalic position, especially in combination with
open vowels such as /a/ or /o/, /b/ becomes > [v], or even [w], while /v/ devel-
ops into the voiced bilabial [w]. In a second step, the resultant sound sequences,
such as [awa], [awä], or [uwa] etc. may be contracted to diphthongs such as [oa],
[ou], or develop into a long monophthong, such as, for instance, [oː]; e. g. /-b-/ >
[-v-] in [ʊrdıvehɛʃd̥] < ‘Ordibehešt’ (Iranian month); /-b-/ > [-v-] > [-w-] in [owa]
< oba ‘camp group’, or [owolarına] ‘camp group’.poss .3 pl . dat ; /-aba-/ > [-oa-]
in [boam] < babam ‘my father’; /-ava-/ > [-ʊwa-], [-oʔa-], [-oa-], [-oha-], or [-oː-]
in pronunciations such as [dʊwɒr] < davar (< *tabar) collective ‘sheep and goats’,
[doʔar], [dohɑːr], or [doːrları] < davar.poss .3 pl . A special case is [thoussɒn]
< tâbestân (<*tâwessân < *tâwestân) ‘summer’, which is conventionalized as an
item of the Turkic lexicon. Speakers of Qashqâʾî usually do not connect this to its
Persian source word tâbestân, which they pronounce [thɒbesdɑn].
2.2. Morpho-phonology
2.2.1. Assimilations on morpheme boundaries: an Intra-Turkic feature

The following types of consonant assimilations are found across the Turkic varie-
ties of the region.
2.2.1.1. Assimilation of consonant groups containing the liquids /r/ and /l/
/-nl-/ > [-nn-]: Suffix-initial /l-/ is subject to various types of progressive assimi-
lation on morpheme boundaries. The plural suffix {+lar}, for instance, may assim-
ilate to stem final /-n/ as in [ʤejrɑnnær] < ceyranlar ‘gazelles’ (Iraq), [mɛhmɒn-
nɛr] < mehmanlar ‘guests’ (Sonqor), [sɑzımənnɛr] < sazemanlar ‘organizations’

(Esfahan), and pronominal forms, such as [ʊnnar] < *onlar ‘they’, or [bʊnnar] and
[mʊnnar] < *bunlar ‘these’ (Esfahan).
/-šl-/ > [-ʃʃ-]: Suffix-initial /l-/ may also assimilate to preceding /-š-/, as in
[dɪʃʃɛre] < dišläri ‘her teeth’ (Sonqor). /-sl-/ > [-ss-] occurs stem internally, too,
see forms such as [æssæn] < aslan ‘by no means’ (Bayâdestân, Esfahan).
/-nl-/ or /-rl-/ > [-ll-]: The reverse process, namely regressive assimilation to
/l-/ after stems ending in /-n/ or /-r/ is also frequent; e. g. /-nl-/ > [-ll-] in [ørtsɨllær]
< örtsünler ‘they should cover’ (Iraq); /-rl-/ > [-ll-] in [verɪllɛr] < verirlär ‘they
give’ (Khalaj of Bayâdestân), or in [vɑllɛrə] < vɑrları ‘they own/possess’ (Esfa-
han).
/-ld-/ > [-ll-]: Combinations of /-l-/ and /-d-/ may display progressive assim-
ilation at morpheme boundaries; e. g. [qɑllɨre] < qaldırır ‘he picks up’ (Iraq), or
[ollʊ] < oldu ‘he/she became’ (Esfahan).
2.2.1.2. Assimilation of consonant groups containing the nasal /n/

/-nd-/ > [-nn-]: This type of assimilation occurs stem-internally in Turkic and
Persian roots; e. g. [kɪnnɛ] < künde ‘dung cakes’ (Bayâdestân), [tænnʊr] < tandur
‘oven’ (Khalaj of Bayâdestân), and [zɛnnıgısɛnnɛ] < zändegisindä ‘in her/his life’
(Sonqor) — as well as at a morpheme boundary, as in, for instance, [sɑllɑnnə] <
sallandı ‘was pending’ (Khalaj of Bayâdestân). /-nd-/ > [-nn-] is frequent in com-
binations with the case suffixes of the locative and ablative. There is, however, a
great deal of variation: In Iraq, the initial /d-/ of the locative is often preserved,
e. g. forms such as [ʔɑttanda] ‘while throwing’, while the ablative mostly exhib-
its assimilation [ɑlt'͜ jɑʃɨnnan sora] < altı yašından sora ‘after the age of six’.
Often, assimilated forms of the locative appear alongside non-assimilated ones, as
in [jɑnɨnda] < yanında ‘by his side’, and [ɑʧanna] < ačanda ‘while opening’, both
pronounced by the same speaker.
2.2.1.3. Assimilation of alveolar stops to sibilants

/-zd-/ > [-zz-], /-st-/ > [-ss-]: Similarly, the alveolar stop /d/ and its unvoiced coun-
terpart /t/ assimilate to the preceding sibilants /s/ and /z/. This type of assimila-
tion, which is also frequent in spoken Persian, may occur stem-internally with
Iranian roots; e. g. [dœsse] < Persian daste ‘bunch, set’ (Sonqor), [bıs] < Persian
bist ‘twenty’ (Esfahan). With Turkic lexical items, a similar assimilation may also
appear at a morpheme boundary; e. g. [bɒʃærmɛzzım] < bašarmazdım ‘I did not
succeed’ (Sonqor).
2.2.1.4. The Qıpchaq isogloss (-> Regional feature)

A special type of assimilation of the plural suffix {+lAr} > {+DAr} appears with
speakers of the so-called Khalaj tribe in Bayâdestân10 region. The initial l- of the
plural suffix becomes > [d-] after stem-final –z, as in [øzdær] < özler ‘they them-
selves’, and > [t-] after stem-final -š and -t, as in [dıjɛrmıʃtɛr] < diyärmišlär ‘they
used to say’, or [zatthær] < zatlär > ‘things’. This kind of /l-/ > [D-] assimilation
is generally associated with the Qipchaq language group; yet, it also occurs fre-
quently in North Azerbaijanian dialects (see Širaliev 1983: 104, 130 etc.). For
Khalaj in central Iran Doerfer (1988: 60) quotes a single example of this type,
namely: tuzla > tuzda ‘with salt’.
2.2.2. Realisation of velar /ŋ/ as [ŋ, y, w] as an intra-Turkic isophone

The development of the old velar /ŋ/ in stem internal position is not predictable.
Across the region /ŋ/ has the realization [ŋ] in a number of lexical items, such as,
qaraŋlux ‘darkness; cf. Older Turkic qaraŋgu ‘dark’, but Turkish and North Azer-
baijan qaranlıq, that is /ŋ/ > [n]; yäŋi (< yaŋı) ‘new’, yüŋg/üŋg etc. (< yuŋ) ‘wool’.
In other instances /ŋ/ has become [ɣ], as in yaɣluz/yalɣuz (< yal(ı)ŋuz) ‘alone’.
Iraq Turkic also displays /ŋ/ > [w], as in gewil ‘heart’ (< göŋül), or /ŋ/ > [g], as in
< ög ‘front’ (< öŋ).
Different realizations of old /ŋ/ > [ŋ, y, w] in various different morpho-pho-
nological contexts constitute a prominent feature that distinguishes the Turkic
varieties of the region, e. g. in the genitive suffix {+nIŋ}, or /ŋ/ > [ŋ, y, w], the
possessive suffixes of the second person singular and plural {+(I)ŋ/{+(I)ŋIz}, and,
consequently also in copula/the personal suffixes of the possessive and (later also)
the pronominal conjugation form. The shift /ŋ/ > [w] appears more frequently in
the northern part of the region, in North Azerbaijani dialects and in South Azerbai-
jan (i. e. the Iranian province of Azerbaijan with the capital Tabriz), and in north-
eastern varieties of Iraq Turkic. The southernmost representative of the w-group
seems to be the small Turkic enclave of Sonqor, north of Kermanshah in western
Iran. The development /ŋ/ > [y] is more characteristic of the central and southern
varieties, such as the dialects of Bayâdestân in the Hamadan and Saveh provinces
and the varieties around Esfahan, and also the Qashqâʾî varieties in the province
of Fars in south-west Iran11. The Safî Khânî variety of Qashqâʾî, however, has
preserved /ŋ/ as [ŋ], see Bulut (2016).
10
One of the Oghuz-speaking Turkic tribes in the valley of Bayâdestân between Sâveh
and Hamadan in central Iran also call themselves Khalaj. Their genealogical relation to
the speakers of the archaic Khalaj language is not clear.
11
That {y < * ŋ} is also found in Urfa in eastern Anatolia may be due to the presence of
the Bayat tribe in this region.
2.3. Morphology
Turkic morphology is agglutinative and right-branching; derivation and inflec-
tion rely on suffixes. Postpositions or case markers combined with postpositions
denote location.

2.3.1.1. Plural
The plural morpheme {-lAr} is added to nominal stems before possessive and case
suffixes. It is subject to certain assimilations, see §2.2.1.1.
2.3.1.2. Possessive suffixes

The possessive suffixes are: 1sg {+(I)m}/1pl {+(I)mIz}, 2sg {+(I)ŋ}/2pl {+(I)
ŋIz}, and 3sg {+(s)I}/3pl {+lArI}, with considerable phonological variation,
assimilations and contractions in the forms of the 2nd persons.12 Basically, present
day varieties either display combinations of high unrounded vowel /I/ + y > [+Iy]
or high rounded vowel + v/w > [+Uw], compare forms such as ad-uw/ad-ıy ‘your
name’, äl-üw/äliy ‘your hand’. For the geographical distribution of this character-
istic morpho-phonological change see §2.2.2 above. In most Qashqâʾî varieties
in the province of Fars in south-western Iran the possessive suffixes of the 2nd
persons are {+ıŋ}~ [+ıj] in the singular and {+ıŋız} or [+i:z] in the plural, while
Safî Khânî displays old [ŋ] in all instances. An intermediary stage with palatalized
[ɲ] also appears.13 Presumably, the phonological development is {+ıŋız} > {+ıɲız}
or {+ıjız}, with palatalization of /ŋ/ due to the surrounding close front/central
vowels [i] and [ı], followed by a contraction to > [+i:z].
12
The basic forms of the Oghuz possessive suffixes are 1sg {+(U)m}/1pl {+(U)mUz}
~{+(I)muz}, 2sg {+(U)ŋ}/2pl {+(U)ŋUz} ~ {+(I)ŋuz}, 3sg {+(s)I}/3pl {+lArI}. Doer-
fer (2006: 102) supposes that the forms of the second persons go back to a mixture of
{+(U)ŋ}/{+(U)ŋUz}, with old /ŋ/ > [n], [ŋ], or [ɲ], and {+(U)G}/{+(U)GUz} (only
in inscriptions, see Gabain 1941: 97), where old /G/ became [v], [y], [g], [ɣ], or [ŋ].
Note that the genitive < *{+nUŋ} shows the same phonological varieties without being
traced back to “old G” < *{+nUG}. Writing /ŋ/ as <G> may have been a graphic con-
vention; see also the representation of /ŋ/ as sağır kaf in Arabic script.
13
According to Tuna’s notation (1984), palatalized [ɲ] seems to be the most frequent
realization of old */ŋ/ in Abivardi.
2.3.1.3. Case morphology

Across the region case morphology displays the typical southwestern Oghuz
characteristics, with the following basic forms: Genitive {+(n)Iŋ}, dative {+(y)
A}, non-specific accusative {+Ø}/specific accusative {+(n)I}, locative {+DA},
and ablative {+DAn}.
The genitive {+(n)Iŋ} exhibits a number of morpho-phonological variants:
{+(n)ıŋ} in varieties which preserve the velar nasal, see Qashqâʾî [xɒnɨŋ qɑqɑsı]
xân-ıŋ qaqa-sı ‘the brother of the Khan’, or [ezənıŋ] öz-ü-nüŋ ‘his/her own’; in
the central dialects which display /ŋ/ > [y] it has the forms {+nıy} ~ {+(n)Iŋ}
(in sandhi before k-, g- and q-) ~ {+(n)In}, e. g. [zaːnij] zan-iy ‘the woman’s’,
[kæslærıŋ kı] kæs-lær-ıŋ kı ‘of the ones who’ (Bayâdestân), and both [khæsən]
käs-in ‘someone’s’ and [ɒdæmɨj] âdam-ın ‘the guy’s’ in (2/4); or {+(n)In} as in
Turkish of Turkey, in the northern Iran-Turkic varieties and in Iraq Turkic. Espe-
cially in connection with personal names, genealogical/kinship terms or indefi-
nite pronouns, the possessor in a genitive-possessive construction often does not
display genitive marking; e. g. härkäs šärbäti (everyone.Ø lemonade.poss .3 sg )
instead of härkäs-in šärbäti (everyone. gen lemonade.poss .3 sg ) ‘everyone’s lem-
onade’ in example (24) below; hala-m qız-ı ‘my cousin’ (aunt-poss .1 sg .Ø daugh-
ter-poss .3 sg ), instead of hala-m-ın qız-ı (aunt-poss .1 sg .- gen daughter-poss .3 sg ),
from Bayâdestân, or Aħmad ʾärwår-ı [Ahmad.Ø wife-poss .3 sg ] ‘Ahmad’s wife’,
from Sonqorî. Note that heavily Iranicisized varieties, such as, for instance,
Sonqorî, have a strong tendency to replace the Turkic genitive-possessive con-
struction by the Persian ezâfe-constructions, see sample text 1 in the appendix.
The possessive dative is {+(n)A}, that is: [+A] after consonant, and [+nA] with
pronominal /n/ in combination with the possessive of the 3rd person, as in baš-ı-na
(head-poss .3 sg - dat ) ‘on his/her head’.
The marked accusative in {+(n)I} has the possessive forms {+n} ~ {+nI} (like
Turkish of Turkey).
With the exception of Iraq Turkic (where the locative suffix remains {+dA}),
the locative assimilates to preceding nasals: {+DA} > [+nA]; when in combination
with the possessive of the 3rd person pronominal suffix, n appears, as in göz-däri-
n-nä (< göz-läri-n-dä eye-poss .3 pl -n-loc ) ‘in their eyes’. In connection with the
ablative, the same assimilation {+DAn} > [+nAn] is found across the region. In
Sonqorî, the ablative morpheme has lost its final /–n/, see bannâ-sin-nä (building-
poss .3 sg - abl ) ‘of their buildings’, see sample text no. (1/1).
Across the region the instrumental case is {+(I)nAn}, as in sample text (2/3):
[ʤınnɛr ͜ ənan] cin-lär-ınan ‘with the jinns’, while, as a rule, the form {+(y)dan} is
characteristic of Iraq.14 The equative case is based on {+cA}; pronominal [n] does
not appear; cf. forms such as [dɒlɨʤa] dal-i-ca (< back-poss .3 sg - equ ) ‘behind
14
Sporadically one also comes across an instrumental case in {+(y)dan} in Iran, as, for
him/his back’ (Bayâdestân). An exception is the equative of the demonstrative

pronouns, e. g. forms like [ondza] ‘that much’. The morpheme of the terminal case
(‘up to, till’) is {+(y)AcA}, as in [jerɛʤɛ] yer-äcä; ‘up to the place’ (Bayâdestân).
2.3.1.4. Comparison
Comparison of adjectives is expressed analytically by the adverbs daha ‘more’
and lap (restricted to Azerbaijan) or än ‘most’ preceding the adjective. The Turkic
suffix {+rAK} has survived in most regions of Azerbaijan proper and in Sonqorî.
Parallel, many varieties use the Iranian comparative suffix {+tär} and the superla-
tive suffix {+tärin}, e. g. Sonqorî čox-raq and čox-tär ‘more’. In the South, Iranian
comparative suffixes have nearly completely replaced their Turkic counterparts.
2.3.2. Pronouns
The personal pronouns are 1sg män, 2sg sän, 3sg o, 1pl biz, 2pl siz, 3pl olar
(~ ular etc.); the reflexive or emphatic pronoun is based on öz ‘self’. In contrast
to Turkish, there are only two demonstrative pronouns, namely bu, Pl. bular ‘this’
(with an emphatic form ha bu ‘this one here’), and o, Pl. olar ‘that’. While the
indefinite pronoun is mostly bir ‘one’, Sonqorî and Ben use a special form bice
(see 1/1).
Across Turkic varieties of Iraq, and northwestern, central and southern Iran
(including Khalaj, see Doerfer (1988: 107) a defective pronominal paradigm
based on the stem bilä- appears.15 This always combines with possessive suffixes
and case markers, as, for instance, in bilä-mız-ä (bilä + poss .1 pl + dative) ‘for/to
us’, or postpositions, as in bilä-si ’čün (bilä + poss .3 sg + posp ) ‘for him/her’. It
neither appears as an unmarked case or nominative, nor in the function of adjec-
tival öz ‘own’. bilä is restricted to object positions of dative, accusative, ablative
and instrumental, and combinations with the postposition ičin ‘for’; it does not
combine with genitive suffixes to form possessive pronouns (see also Bulut 2003).
instance, in the variety of the city of Ben, in the North-east of Chaharmahal and Bakh-
tiari Province.
15
The lexical meaning of this stem is unclear; the best guess is Soper (1996), who main-
tains that bilä goes back to the adverb belä ‘so’, which indeed may combine with pos-
sessive suffixes.
Table 4: Pronominal paradigms

Personal Personal:D at Possessive Reflexive Remoteness/
modesty:D at
1 sg män mänä mänım özüm/özım bilämä (‘to me’)
2 sg sän sänä säniy/sänuw öziy/özüw biläyä (‘to you’)
3 sg o ona onij/onuy özi/özü biläsinä (‘to him/
her/it’)
1 pl biz bizä bizim özümüz bilämizä (‘to us’)
2 pl siz sizä siziy/sızuw özüyiz/özüwüz biläyızä/biläyüzä
(‘to you’)
3 pl olar olara olarıy özläri/özdäri bilälärinä
(‘to them’)
2.3.3. Verb morphology

Turkic verbs express diathesis, time/aspect/mood, possibility and agent by suffixes
attached to the stem in this order.
2.3.3.1. Copula
Across the area, the heterogeneous forms of copula and possessive suffixes func-
tion as a distinctive feature of the Turkic varieties, see Table 5 below.16 While the
copula suffixes of the first and second persons derive from the personal pronouns,
the third persons are either unmarked {-Ø}, or take the form of a suffix in {-DIr/-
DUr etc.} that goes back to the verb dur- ‘to stand’. Frequent usage of the copula
of the 3rd person may point to the influence of Iranian languages, where an overt
marking is obligatory. The plural suffix {-lAr} quite regularly displays regressive
assimilation of word-final [-r] to the [l-]; forms like olurlar ‘they become’, are
pronounced [olullar] in most Iranian and Iraq-Turkic dialects.
16
In the tables below, <X> indicates +/- rounded high vowels such as {X = u, ü, ı, i}; 
represents unrounded high vowels {I = i, ı}, rounded high vowels {U = ü, u}, and
<A> open unrounded vowels{A = a, ä}.
Table 5: Copula forms

Old Standard East Standard Iraq Sonqor Bayâdestân Qashqâʾî
Anatolian Turkish Anatolia/ Azeri (Safi
Turkic Diyarbakir Khânî)
1 sg -vAn, -Am -(y)Xm -(y)Am -(y)Am -(y)Am -(y)Am -(y)Am -(y)Am
2 sg -sAn -sXn -sAn -sAn -sAn -sA ~-saw -(y)Ay -(y)Aŋ
~-say17
3 sg -Durur -Ø/-DXr -Dır -DXr -DX -Ø/-dI/-rI -Ø/-dX(r) -Ø/-dI(r)
> -DUr
1 pl -(v)Uz -(y)Xz -(y)ıx -(y)XK -(y)Xx/ -(y)AK -(y)XK -(y)AK
(y)Xɣ
2 pl -siz -sXnXz -sız -sXnXz -sXz -sIz -(y)Xːz -(y)AŋIz
3 pl -DUr -(DXr)lAr -(Dır)lär -(DXr)lAr -(DX)lAr -(DI)lAr -DX(lA(r)) -DI(lA(r))
(-lar)
Especially in the varieties of central and south-western Iran, the personal suffixes
of the possessive and pronominal type have become more similar to each other in
the 1st and 2nd persons and are only differentiated by a high/close (in possessive suf-
fixes) or an open (in pronominal suffixes) vowel. This differentiation has already
collapsed in some of the Central dialects, where — due to internal assimilation to
/y/—the 2nd persons may display identical pronunciations, see Table 6 below.
Table 6: Personal suffixes of the possessive and pronominal type/South and Central
1st singular 2nd singular 1st plural 2nd plural
possessive {-Im}, {-Xm} {-Iŋ} > {-iy} {-Ik}, {-Xk} {-IŋIz} > {-I:z}
type
pronominal type/ {-Am} {-Aŋ} {-Xk}, {-Ak} {-AŋIz}, {-I:z}
copula > {-Ay) > {-ey}
> {-iy}
2.3.3.2. Tense-mood-aspect
Focal present and aorist
Across the region, most present paradigms reflect the Azeri type: The morpheme
of the focal present, i. e. progressive present, is {-Xr}, which differs from the
aorist in {-Ar} only with regard to its high/close vowel. Stem-final vowels are
17
In Kifri, Tuzkhurmati etc.
normally deleted; thus, for instance, išlä- ‘to work’ forms a focal present išl-ir(i)
‘he is working’, and an aorist išl-är ‘he (usually) works’.18
Deviations from this model are frequent. In varieties that display a morpholog-
ically different focal present, high vowels such as {X = u, ü, ı, i} may also appear
in the aorist, their distribution being similar to that of Turkish of Turkey.
Sonqorî uses a focal present in {-(X)(y)ou(r)}19, e. g. issyouräm (also con-
tracted: isiyoum) ‘I want’, bılowsä: ‘you know’, iliyou ‘he/she is doing’, issyourax
‘we want’, bılousez ‘you (pl ) know’, and issiyouseːz ‘you (pl ) want’, and issiyou-
lar ‘they want’. Consequently, high/close and open vowels appear in the aorist,
as in the following examples from sample text 1: dɑnɨʃıllɛː speak.aor .3 pl (1/1),
dıˈjɛllɛː say.aor .3 pl (1/2) ılɛllɛː make.aor .3 pl (1/5). In other varieties, relics of a
present construction in {*-Iyori} occur alongside the more frequent forms based
on {-Ir}, e. g. giriyöm (< gör-iyür-äm) ‘I see’, and bilmiyüräm ‘I don’t know’, both
from the Bayat variant of Fars, or qalıyür ‘he stays’ from the Khalaj of Bayâdestân.
Due to a tendency across the region to shorten or delete word-internal sylla-
bles, the distinctive feature of the present or aorist morpheme ({-Xr} vs. {-Ar}) is
often blurred. Moreover, in the 1st person singular and plural the vowel of the aorist
is regularly omitted after stems ending in /-r/ or /-l/, e. g. vur-r-am (< vur-ar-ram)
‘I beat’ or allam (< al-ar-am) ‘I take’; see also Bulut, this volume, chapter 3.5,
§2.1.2.1.
Personal suffixes of the present

The personal suffixes of the present represent the so-called pronominal type of
conjugation. Thus, the agent markers of the first and second persons of the present
are identical with the copula forms, with the same phonological variation, espe-
cially in the second persons. Across the area, the suffix of 1st person singular is
{-Am}, as in bilir-äm ‘I know’; the 1st person plural is based on {-IK} in most
northern and central varieties, as, for instance, in säläš-ir-ig ‘we speak’ (Iraq), or
alıšırıx (Bayâdestân) ‘we learn’; in the South {-AK} also appears, as, for example,
in iss-your-ax ‘we want’ (Sonqor), or bız gäl-ır-äk ‘we come’ (Bayat of Fars).
In the northern varieties, the 2nd persons have the forms {-sAn} (or rarely also:
{-sAy}) in the singular and {-sXŋXz}, {-sX:z}20, or {-sXz} in the plural, e. g.
bilirsän ‘you know’, or alısız ‘you.pl take’, with elision of the /-r/ of the present
marker in Iraq Turkic.
18
An exception to this rule are monosyllabic verb stems ending in vowel, such as ye- ‘to
eat’ and de- ‘to say’, which may, among other possible forms (such as deri, deyiri, diyiri
etc.), have a present tense deyir(i) ‘he says’, and an aorist diyär ‘he (usually) says’.
19
Like the Turkish focal present, this tense marker is based on *{-(X)yori}), consisting of
a gerundial form of the lexical verb plus an auxiliary < *yori- ‘to go’.
20
Contraction of {-siŋiz/-sıŋız} or {-seŋiz/-saŋız}.
In the central and southern varieties of the region, the personal suffixes of the 2nd
person singular and plural go back to an older intermediary form {-Aŋ} or {-AŋIz},
which the Safî Khânî-variety of Fars has obviously preserved, e. g. sän gäl-ır-äŋ
‘you.sg come’ or sız gäl-ır-äŋız ‘you.pl come’. Other varieties from the Îl-e Qash-
qâʾî or from central Iran exhibit different stages of the phonological development
of {-Aŋ} > [-Ej] > [-ej] > [-ıj]. The Begdilli and Bayat variants of Fars, and the
central dialects of the Hamadan Province and the region around Esfahan have gäl-
ır-äy, or rarely also gäl-ır-ıy, in the singular, while the corresponding forms of the
2nd plural are gäl-ır-äyz or gäl-ır-eyz (with an additional variant gäl-ır-ıyz in the
Bayat variety of Fars and some central dialects).
Across the region, the 3rd person singular appears without an agent marker, as,
for instance, in gör-ör ‘he sees’ (2/6); yet, it also may exhibit an additional element
{-I} = [ı, e], as in ölıre ‘he/she dies’ (Iraq), or istıre ‘he/she wants’ (Bayâdestân), or
der-e ‘he says’ in sample text 2/4. The final /-r/ of third person plural often assim-
ilates to the initial /l-/ of the plural suffix, as, for instance, in gäl-il-lär (come.
prs .3 pl ) ‘they come’ (2/4), deʾillär (say.prs .3 pl ) ‘they say’ (Esfahan), de-ʾäl-lär
(say. aor .3 pl ) ‘they (usually) say’ (2/5).
The functional distribution of the aorist (habituality, iterativity, ability, inten-
tion) and present (focality) is not clear-cut. In Standard Azerbaijani and in the
northern varieties, it mostly corresponds to that of Turkish of Turkey. As explained
above (see §2.3.3.2 on Focal present and aorist), the central dialects and Iraq
Turkic sometimes exhibit a formal merger of the positive forms, which makes it
impossible to differentiate between aorist and present. The functional distribution
of different paradigms which may exist in parallel in the same variety — such
as the present in {-Ir} and relics of a present in < {*-Iyori} in the Bayat variety
of Fars — is also unclear. In some Qashqâʾî-varieties the present in {-Ir} has
lost focality, in that its function has been extended to include not only the focal
present but also iterativity or habituality, where one would expect the aorist. Con-
sequently, the function of the aorist is restricted to expressions of modality, such
as predictions of future actions or necessity (see Bulut 2016).
The formal and functional vagueness, and the fact that the distribution of both
present types is blurred (one may find present were one expects aorist, and vice
versa), may create the impression that either the two have merged into one, or that
a focal present is still not fully developed. This may be due to the language contact
situation, as the Iranian languages have just one morphologically simple present.
The negative forms of present and aorist, on the other hand, are morphologically
distinct, see, for instance, the negative forms of the first person singular of the
focal present ged-mır-äm ‘I don’t go (now)’, vs. the aorist ged-mäm/ged-män-äm
‘I never go’, or 1st plural ged-mir-ik ‘we don’t go’, vs. gedmäːk ‘we never go’, see
Table 7 below.
Table 7: Negative forms of the present and the aorist

present aorist present aorist
1 sg ged-mir-äm ged-mäm/ged-män-äm 1 pl ged-mir-ik gedmä:k
2 sg ged-mir-äy ged-mäz-äy 2 pl ged-mir-i:z ged-mä-yi:z
3 sg ged-mir(i) ged-mäz 3 pl ged-mir-lär ged-mäz-lär
A non-evidential perfect
Across the region, the perfect displays three morphologically different types.
Perfect paradigms are based either on the old morpheme {-mXš}, on a mixed
paradigm consisting of {-mXš} and a ‘new’ perfect marker which goes back to
the gerund in {-(y)Xb}, or solely on {-(y)Xb}. The personal suffixes are of the
pronominal type, while the 3rd persons may optionally combine with the copula
{+DXr} < durur.21
The Qashqâʾî-varieties of the Safî Khânî and the Bayat (see Bulut 2016), and
also Sonqorî have preserved the old paradigm in {-mXš} in all persons, see the
examples from sample text 1, with the characteristic /-š/ > [-Ø], such as: gɛlmı
‘come.prf .3 sg ’, ılɛmı ‘make.prf .3 sg ’, and vermı ‘give.prf .3 sg ’ (1/4). Other vari-
eties have completely restructured their perfect. Paradigms on the basis of the
new element {-(y)Xb} are found in Altunköprü in Iraq, in Qalʿä Gözäldärä in the
eastern part of Bayâdestân (bilingual Kalhor Kurds), in the Province of Esfahan,
and in the Begdilli variety of Qashqâʾî. More frequent are mixed paradigms with
{-mXš} for 1st persons and {-Xp} for 2nd and 3rd persons (most varieties of Iraq),
or {-mXš} in the 1st and 2nd persons and {-Xp} for 3rd persons, as in Baku in North
Azerbaijan and in Bayâdestân and other central varieties of Iran. The geographical
distribution of the different perfect paradigms is extremely irregular, and often
varies from one village to the next (see also Širaliev 1983: 115 on North Azerbai-
jani dialects). Moreover individual speakers may use different forms in parallel,
producing third persons in {-(y)Xb} and in {-mXš}, which could be interpreted as
reflecting development in progress.
In contrast to the Turkish perfect in {-mIš}, the perfect occurring in the region
we are concerned with (eastern Anatolia, western Iran, Azerbaijan and Iraq) is
strictly resultative and has no marked inferential or evidential qualities. Inferential
or evidential connotations are expressed by the evidential/inferential copula in
{-ImIš}, as in yatmıš-ımıš ‘she had obviously gone to sleep’ (Khalaj of Bayâdestân);
21
According to Bodrogligeti (1968: 30), the complex forms in {-(y)IbDIr} may have been
inspired by the model of the Persian perfect, which—in contrast to the Turkic zero-mor-
pheme—has an explicit morphological element to mark the 3rd person singular on the
surface. Thus, the complex form gälübtur ‘has come’ in the glossary of the Esfahan
anonymous, for instance, would be a structural copy of the Persian âmade ast (come.
pst . ptcp + copula 3sg ).
see also a contracted form in sample text 1: qejr-ımi (< qeyrmıš-ımıš) ‘has founded
(as they say)’ (1/2), and maɣoly͜ ımı (1/3) ‘was a Mongol (as I infer)’.
The imperative of the second person singular in {-ginen}

As in most Turkic languages, the bare verbal stem functions as the imperative of
the 2nd person singular. Combinations with an additional suffix {-ginen}, which
appear across the dialects of the area under consideration, express a somewhat
milder or more polite form of request or demand, see example no. (4) below.
Potential and impossibility: {-abil-}, {-abilme-}

As is characteristic of the area, the potential verb form {-abil-} has the negation
{-abilme-} (in contrast to {-(y)AmA in Turkish of Turkey), e. g. gäl-äbil-mäm ‘I
will not be able to come.’
The necessitative in {-AsI}

The necessitative in {-AsI} combines with suffixes of the copula (or: the so-called
pronominal conjugation) to form predicates. Its predecessor is the so-called future
in {-ası/-äsi}, which appears in a wider range of syntactic functions in Old Anato-
lian Turkic, see Mansuroğlu (1959: 173–174), and Bulut (2016). The modal char-
acter of this element (‘to have to do something’) may be re-enforced by copied
modal adverbs, such as the Persian hatman ‘really, absolutely, by all means’, as
in sän hatman gid-äsi-yäŋ ‘you absolutely have to go’ (Safî Khânî/Il-e Qashqâʾî).
The entity is present in the Qashqâʾî varieties of the Safî Khânî, Begdili, Bayat and
Amaleh in the extreme Southwest of the area; while it seems to be missing (or has
been replaced by the necessitative in {-mAlI})22 in the varieties of central Iran and
Iraq, it re-surfaces in the northwest, in Azerbaijani proper (see Širaliev 1983: 135).
Table 8: Necessitative mood

(1) Safî Khânî Amaleh according North Azerbaijanian
to Jarring (1943–1944), dialects according to
contracted forms with * Širaliyev
(1983: 135)
män hatman gıd-äsi-yäm gedäsim* danıš-asi-yäm
sän hatman gıd-äsi-yäŋ gedäsiŋ* danıš-asi-san
o hatman gıd-äsı ged-äsi danıš-asi-di
bız hatman gıd-äsi-yäg ged-äsi-yäg danıš-asi-yuɣ
sæn hatman gıd-äsi-yäɲız ged-äsi-yäŋiz danıš-asi-suz
oUnlar hatman gıd-äsi-lär ged-äsi-lär danıš-asi-di (lär)
22
Whether {-AsI} appears in complementary distribution with the necessitative in
{-mAlI} is a question for further research.
2.4. Morpho-syntactic elements
2.4.1. The question particle {mI}

In the entire area the question particle {mI} does not appear. Questions are marked
by a copied intonation pattern, the so-called drawl, which consists of a rising into-
nation and lengthening of the clause-final syllable; see also §2.5.2 below.
2.4.2. Copied morpho-syntactic units
2.4.2.1. Comparative in {+tar} and superlative {+tarin}

See under §2.3.1.4.
2.4.2.2. The specifiers in {+AkA} or {+i}

{+AkA} is a global copy of the Kurdish or, originally, Gorani marker of specificity
(“definite article”). It appears across Turkic varieties in contact with these Iranian
languages, such as, for instance, Iraq-Turkic, Sonqorî, and varieties of Qashqâʾî.
Attached directly to nominal stems, it precedes possessive and case suffixes and
adopts palatal harmony, see examples such as dev-äkä ‘the (aforesaid) demon’
(Iraq), and šär-äkä-sı-nı (poem-spec - poss .3 sg - acc ) ‘this certain/aforesaid poem
of his’ (Sonqorî). The unit also marks the head of restrictive relative clauses fol-
lowing Indo-European patterns. In this function, {+AkA} is competing with the
Persian yâ-ye ešâret in {+i}, copies of which exist in most Iran-Turkic varieties
(see below, §2.5.2.1).
2.4.2.3. Copies of Iranian enclitic personal pronouns

Global copies of Iranian enclitic personal pronouns
In southern varieties of Iraq-Turkic global copies of Iranian clitics—attached to
the verb after the personal ending—represent the dative/benefactory case or the
direct object; see Bulut, this volume, chapter 3.5, §2.4.4.2.
Similarly, Sonqorî uses the Iranian clitic of the 3rd person singular {-iš} to
denote the direct object, as in: apar-a-ši (take-opt .3 sg -clit . pron .3sg ) ‘to take
her away.’
Functional copies of Iranian enclitic personal pronouns

With regard to structure, Turkic possessive markers are similar to Iranian per-
sonal clitics. The main difference is functional: Turkic possessive suffixes are
restricted to expressions of possession, while Iranian personal clitics denote the
oblique case of the personal pronoun, e. g. genitive, directive/dative, and accu-
sative. Paraphrases for ‘to need’ (both in Iraq-Turkic and Iran-Turkic) and for ‘to
own’ (occurring solely in Iran-Turkic) imply that in the given language contact
situation, Turkic possessive suffixes have copied additional functional properties
of Iranian clitics and may thus also represent the dative or benefactory case (see
below).
Paraphrasing ‘to need’

To paraphrase ‘to need’, many Turkic languages use a combination of the copied
Arabic noun lâzım or the Turkic gerek ‘necessary’ and the personal pronoun in the
dative/benefactory case, compare Standard Turkish bana lâzım (me.dat necessary
(is)) ‘I need’.
In a different construction used across Iraq-Turkic and the central varieties of
Iran-Turkic, the Turkic possessive suffix may function as a morphological marker
indicating the benefactory case of the personal pronoun, see examples such as
lâzım-ım (dır) (necessary-poss .1 sg (is)) ‘I need’, literally ‘necessary for me (is)’
(Iraq), or lazım-mız (necessary-poss .1 pl (is)) ‘we need’ (Bayâdestân). The under-
lying structure implies that Turkic possessive suffixes have copied the functional
property of Iranian clitics to express oblique/benefactory case (see also Bulut, this
volume, chapter 3.5, §2.4.4.2).
Paraphrases for ‘to have’

Turkic languages do not possess a full verb to express ‘to possess, have’. Instead
they use a construction based on the existence adjectives var ‘existent’ or yox ‘not
existent’, while marking the possessor by a possessive suffix on the possessum:
{noun . poss + var/yoq}; see sample text (2/2) sâhıb-xâne-miz var-ıdı (landlord.
poss .1 pl existent-cop . pst .3 sg ), literally: ‘Our landlord existed’, that is: ‘we had
a landlord’.
Iran-Turkic also displays an alternative construction where reference to the
possessor is encoded in a possessive suffix on the existence adjectives var or yox;
see sample text no. (2/3) ertebâṭ var-ı (relation existent- poss .3 sg ), literally ‘his
contact (is) existent’, that is: ‘he has contact’. As a rule, the latter construction
seems to be limited to Turkic varieties spoken in Iran; sporadically it also occurs
in the border region of Iraq around Khanaqin and Mandali. One may assume that
again Iranian languages provided the structural model; so, for instance, Mackenzie
(1981: 102) describes a similar paraphrase of ‘to have’ in the Kurdish dialects of
Suleimaniye and Arbil. Kurmanji, on the other hand, displays the first type where
the possessor is marked by a possessive suffix on the possessum, i. e. a structure
corresponding to that of Standard Turkish. Across Iran-Turkic varieties speakers
generally use both paraphrases for ‘to have’ in parallel.
Given the extended functional properties which Turkic possessive suffixes
copied from Iranian clitics (see above), constructions such as var-ım (existent.
poss .1 sg ) ‘I have’ or yox-ey (non-existent.poss .2 sg ) ‘You don’t have’ may be
interpreted as a variety of the mihi est pattern: ‘X is existent for me’ ≈ ‘X belongs
to me; I have/possess X’—with the Turkic possessive suffix representing the
dative/benefactory case.
2.4.2.4. The dative/directive in postverbal position: A copied word order property

The Turkic dative case in {-(y)A} may function as benefactory case, or as a direc-
tive case designating the aim to which an action expressed in the verb is directed.
Across the region (Azerbaijan, Anatolia, Iraq, Iran), the dative often occurs in
postverbal position.23 This is probably due to the influence of Iranian languages
such as spoken Persian or Kurdish, where postverbal position of the directive after
verbs expressing motion is the rule (compare also Bulut, this volume, chapter 3.5,
§2.4.5.1); see also the following example from sample text (1/1): vä bu gälmi
Sonqore ‘and this (guy) came to Sonqor.dat .’
2.4.2.5. Turkic postpositions vs. copied Iranian prepositional phrases

Although Iran-Turkic has preserved most of the Turkic inventory of postposi-
tions, copied prepositional phrases after Iranian models are extremely frequent in
varieties with strong Iranian influences, such as, for instance, Sonqorî. In sample
text (1/1) both types of adposition appear in the same sentence, see râjeʿ be târîh
‘about the history’, with the complex Persian preposition râjeʿ be ‘concerning’,
vs. täpä üstündä ‘on a hill’, with the Turkic secondary postposition üst-ün-dä (<
surface/top-poss .3 sg - loc ), literally: ‘hill-on its top’; more examples of copied
prepositional complexes in the same text are: mærbud be ‘pertaining to’, az ‘from,
of’, both in (1/3), or hattâ ‘until, up to’ in (1/5).
2.4.2.6. Copies of ezâfe constructions

Across Iran-Turkic varieties, copies of the Iranian ezâfe, a unit which originally
connects substantives, adjectives or pronouns to the head of a noun phrase, are
used as a productive pattern. The copied ezâfe constructions, though, are restricted
to combinations of (substantive-EZ + adjective) or (substantive-EZ + substantive),
mostly connecting Iranian lexical elements. Again the frequency of ezâfe-con-
structions indicates the degree of Iranicization, as the following copies of Persian
noun phrases from Sonqorî demonstrate: târîh-e Sonqor (history-ez Sonqor) ‘the
history of Sonqor’ in sample text (1/1), maḥall-e dafn-e mâlik-i (place-ez burial-ez
king-spec ) ‘the place of burial of the king’ (1/2), or u taraf-e Sonqor (this side-ez
23
Note that Turkic languages may make use of similar deviations from the unmarked
SOV word order to put a certain element in focus, which is not the case in constructions
with the postverbal dative in Turkic of Iran and Iraq.
Sonqor) ‘this side of Sonqor’ (1/5), where the Persian ezâfe connects substantives;
and afsar-ân-e Maġol (soldiers-pl - ez Mongol) ‘the Mongol soldiers’ (1/3, with a
Persian plural marker on the first element), or zabân-e Torkî (language-ez Turkic)
‘the Turkic language’ (1/4), where the ezâfe links an adjective to a substantive. For
Iraq Turkic, see Bulut, this volume, chapter 3.5, §2.4.5.2, where the usage of ezâfe
constructions is rather limited.
2.5. Syntax
Turkic languages mainly rely on non-finite verb forms to express the syntactic
equivalents of Indo-European dependent clauses. In principle adverbial action
clauses are based on gerunds, while nominalized verb forms such as verbal nouns
and participles denote agent clauses (relative clauses) or nominalized action
clauses (such as, for instance, complement clauses).24 In Iran-Turkic, and, to a
lesser extent, also in Iraq-Turkic and in the Turkic varieties spoken in Azerbai-
jan proper, these principles of Turkic syntax are largely abandoned. In irregular
distribution one comes across a very limited inventory of surviving gerunds and
nominalized subordinators, examples of which are given under §8.1 below.
2.5.1. The Turkic principle of subordination: Non-finite subordinators

The following paragraphs 2.5.1.1 to 2.5.1.2 present examples of surviving Turkic
non-finite subordinators which are rather unusual in Iran-Turkic.25
2.5.1.1. Gerunds
Temporal clauses based on the complex gerunds in {-(y)AndA} and {-(y)AndAn}
Characteristic of the region (including Iran, Iraq, eastern Anatolia and Azerbai-
jan) are complex gerunds in {-(y)AndA} and {-(y)AndAn}, combinations of the
present participle in {-(y)An} with the case suffixes of the locative or ablative
(+ postposition sonra). The complex gerund in {-(y)AndA} encodes a notion of
simultaneousness (see ex. 1) or anteriority (ex. 2) of the action expressed in the
temporal clause:
24
See Johanson (1990: 199–200).
25
With shorter speech samples, one may get the impression that non-finite subjunctors are
completely extinct across Iran-Turkic. Most of the examples quoted here originate from
Bayâdestân in central Iran, where the present author conducted a large-scale field study
(see Bulut: forthcoming).
(1) Kerkuk
asmane attanda obürı dašı qaldırı yerdän…
sky.dat throw.ger other.acc stone.acc takesup.prs .3 sg ground. abl
‘While throwing (the first stone) up to the sky, he lifts up the other one
from the ground, …’
(2) Bayâdestân
čöldän gäländä bilälärä nahar âmad’ elärdik
outskirts.abl come.ger them lunch ready make.aorii .1 pl
‘When they came back from (the fields) outside the village, we would
prepare lunch for them.’
The complex gerund in {-(y)AndAn} stresses anteriority of the action expressed in
the gerundial phrase, as in example (3) below:
(3) Bayâdestân
xarman eliyännän sora o qururdu; qurıyännän
threshing floor come.ger posp that dry. aor . pst .3 sg dry. ger
sora o ökiz aparardılar …
posp that oxen bring. aor . pst .3 pl
‘After they had piled it (=the wheat) up on the threshing floor, it would
dry; after it had dried, they/one would bring the oxen …’
Iraq-Turkic uses complex gerunds based on the verbal nouns in {-DIK} or {-mEK}
in a similar function (see Bulut, this volume, chapter 3.5, §2.5.1.1). In the Safî
Khânî variety of Qashqâʾî, the verbal noun in {-DIK} combines with the loca-
tive in {+DA} and the ablative in {+DAn} to form complex temporal gerunds, as
in köč-dig-dä (migrate.vn . loc ) ‘while migrating’, and är-dig-dän (reach.vn . abl )
‘after having reached’ (see Bulut: 2016).
Other relics of gerundial syntax

While Iraq-Turkic still has preserved a rich inventory of gerunds26, in most Turkic
varieties of Iran gerunds are nearly completely extinct. Among the few exceptions
to this rule are Khalaj proper (see Doerfer 1988: 130–145), and the varieties of the
Bayat and Khalaj of Bayâdestân in central Iran. In example (4) below, the gerund
in {-(y)Ip} functions as a non-modifying terminal gerund (see Johanson 1990),
reflecting a temporal sequence of the semantic contents of gerundial phrase and
main clause, with different agents of main clause and gerundial phrase in the first
sequence:
26
Such as, for instance, the gerunds in {-yE}, {-(y)Ip}, {-EnI (< -ElI)}, {-(y)IncE},
{aorist+ -kEn}, and {-dIKčE}; see also Bulut, this volume, chapter 3.5, §2.1.3.1.
(4) Bayâdestân
Ušaɣiy olar, män gälip sänä
child.poss .2 sg be. aor .3 sg I come. ger you. dat
yetišäbilmäm. Sän gälip burda qalginän!
reach. poss . aor .1 sg You come. ger here stay.imp
‘When your child is born, I won‘t be able to come and wait on you. So come
and stay here!’
In double verb constructions the gerund modifies the action expressed in the finite
verb, as, for instance, in tutup gätir- (< hold.ger bring) ‘to bring along’, or götürüp
ged- (< take_away.ger go) ‘take away/take to another place’; see also durub gäl-
(< get_up.ger come) in example (5) below:
(5) Khalaj of Bayâdestân
Sän Tährannan durub gälmišäy.
you Tehran.abl get_up. ger come. prf .2S g
‘You have come all the way from Tehran.’
2.5.1.2. Verbal nouns and participles

The verbal noun in {-mAK}
The verbal noun in {-mAK} combines with possessive and case suffixes (similar
to Turkish {-mA}) to form complements of verbs indicating motion/direction/
beginning; cf. the verbal noun in the dative after the aspect verb šoruʿ elä- ‘to
begin’27 in example no. (6), and ({-mAK} + P oss + D at ) in example (7) below:
(6) Bayâdestân
Oɣlan šoru‘ elärdi gedmägä o qızın
boy begin.aor . pst .3 sg go.vn . dat that girl. gen
ävinä.
house. poss .3 sg . dat
‘So the boy would begin to visit the house of the girl.’
(7) Bayâdestân
Gidäk qızıŋ görmäginä.
go.opt .1 pl girl. gen see.vn . poss .3 sg . dat
‘Let’s go to see the girl!”
27
See also Bulut, this volume, chapter 3.5, §2.4.5.1, example (7).
The verbal noun in {-DIK}

In some rare instances, the verbal noun in {-DIK} combined with possessive
suffixes as agent markers and additional case suffixes forms absolutive relative
clauses, as in example (8) below:
(8) Bayâdestân
apardoɣumüzüy da ordä yârsı
bring.vn . poss .1 pl . gen too there half.poss .3 sg
yiyärdig gälärdig.
eat.aor . pst .1 pl come. aor . pst .1 pl
‘Normally we would eat half of what we brought along over there before
we returned home.’
The so-called present participle in {-(y)En}

While Turkish employs the so-called present participle in {-(y)An} primarily as a
non-finite subordinator in constructions corresponding to Indo-European relative
clauses (agent clauses), the unit functions as a nomen actoris in the Turkic varieties
across the region, as in išläyän ‘worker’.
The participle is normally not attributed to a head of a relative clause. Exam-
ples such as gälin gälän gecä (bride come.ptcp night) ‘the night the bride arrived’
or hava xoš olan günnär (weather fine be.ptcp days) ‘days when the weather was
fine’ (both examples from Bayâdestân) are exceptional; note that the head in both
cases semantically refers to time.
Thirdly, the participle occurs in a special type of headless/absolutive relative
clause. In such constructions both participle and finite verb form derive from the
same stem, the agents of subordinator and verb being identical, e. g. qačan qačar
ımıš (flee.ptcp flee.aor .3 sg + ev . cop ) ‘and (they relate that) whoever was able
to flee run away’, or aɣlayan aɣlasun (weep.ptcp weep.imp .3 sg ) ‘who(ever) is
weeping shall go on/who is able to weep shall do so.’.
Fourthly, the participle in absolutive position may display its verbal poten-
tial to take objects, as in säni veräniy övi âbâdan ossun! (you.acc give.ptcp . gen
house.poss .3 sg prospering be.imp .3 sg ) ‘May the house of the one who gave you
(into marriage) prosper!’
2.5.2. Syntactic patterns imitating Indo-European strategies of clause

combining
Most Turkic varieties of the region have almost completely replaced the Turkic
syntactic patterns of infinitization/nominalization. Like in the correspond-
ing Indo-European or Iranian models, the dependent clause is based on a finite
verb form. The finite verb may be connected to the main clause by a conjuction,
which, at the same time, expresses the semantic relation between main clause
and dependent clause. Consequently, the Turkic varieties of the region exhibit
a reduced inventory of non-finite subordinators, such as gerunds, verbal nouns
and participles. The far-reaching Iranicization of syntax is strongest in the vari-
eties spoken in Iran proper, and nowadays less prominent in the varieties of the
peripheries.
Structural copying in syntax is often accompanied by phonological copying.
Copied syntactic structures may also imitate supra-segmental intonation patterns
of the model. The copied sentence melody or the respective patterns of pitch are
part of the process of syntactic copying. In questions and co-ordinated clauses
formed according to Iranian models, the tone often rises towards the end; addition-
ally, the vowel of the last syllable may be lengthened, with a markedly rising and
afterwards shortly falling tone. Like, for instance, in spoken Persian this so-called
drawl—the rising tone at the end of a clause—signals that the sentence will go on,
or that a semantically connected (main-)clause will follow. See sample text no.
(1/2,3), where a drawl is indicated by .
2.5.2.1. Indo-European relative constructions

The structure of Indo-European relative constructions is fundamentally differ-
ent from that of Turkic syntactic patterns. While Turkic (attributive) relative
clauses are based on a non-finite subordinator preceding the head, Indo-European
post-positive relative clauses exhibit a finite verb form. A conjunction indicates
the semantic relation between main clause and dependent clause.
The underlying pattern of copied restrictive relative clauses in the Turkic vari-
eties of the area consists of {(dem ) + head (+ spec ) + conj }, see examples (9) to
(11) below; the head of the relative clause may be introduced by the cataphoric
pronoun o or ol ‘that’, as in example (9) below, or anaphoric bu, as in example no.
(10); see also sample text (2/6).
(9) Bayâdestân
O qizi ki istillär alallar
that girl.acc conj want. prs .3 pl take. aor .3 pl
oġlannarına.
son. poss .3 pl . dat
‘They will chose the girl they like best for their sons.’
Andan bu kurelär ki qazıblär her
moreover these shaft.pl conj dig.prf .3 pl every
biri älli metirdır.
one.poss .3 sg fifty metre.cop .3 sg
‘Moreover, every single one of these (aforesaid) side-canals they have dug
is 50 metres long.’
(11) Bayâdestân
ušaxlar ki taze ʿamele28 gälillär Fârsî
child.pl conj new world.dat come.prs .3pl Persian
danıširlär.
speak.prs .3pl
‘The children who are born now speak Persian.’
Eventually, copies of Iranian markers of specificity may be attached to the head,
such as, for instance, the Persian yâ-ye ešâret in examples (12) from Sonqor and
(13) from the Khalaj of Bayâdestân, or the Kurdish clitic {+Aka}, as in example
(14) from the Abivardi variety of Shiraz (after Tuna 1984: 227):
(12) Bu maḥall-e dafn-e malik-î ri
this place-ez burial-ez king-spec cop .3 sg
ke Sonqor qeyrımi.
conj sonqor found.inf . prf .3 sg
‘This is the place of burial of the king who (they say) has founded the town
of Sonqor.’
(13) Be Xoda, o Xoda-yi kê mäni yaratub,
with God that God.spec conj me create.prf .3 sg
bu yuxardan bu üč tane
this further_up.abl this three quant
qurt hämčin mäni či kâr êlärdilär?
Wolf also me what do.aor . pst .3 pl
‘By God, by the Lord who has created me, what would these three wolves
up there do to me?’
(14) Abivardi29
kitab-eku ke aldem
book. spec conj I bought
‘The book I bought’
2.5.2.2. Temporal clauses

A pattern with identical surface structure {(dem ) + noun (+ spec ) + conj } forms
temporal clauses. It also owes its existence to copies of a model that is wide-
spread across spoken Persian and Kurdish (for Iraq Turkic see Bulut, this volume,
chapter 3.5, §2.3.2.2); see example (15) from sample text (2/1) and example (16)
below, which exhibit the same surface structure as the restrictive relative clauses
discussed in §2.5.2.1:
28
Metathesis of ‘âlem ‘world’.
29
The example is presented in Tuna’s transcription.
(15) Män özüm ke därs oxıjardım dabirestândan,

I myself conj study.aor . pst .1 sg high_school.abl
Noubarânda bir yerdä oturardıq.
noubarân.loc some place live.aor . pst .1 pl
‘When I was studying at high school, we were living in a place in
Noubarân.’
(16) Bayâdestân
o šoxm ke elämišdik äkärdik.
that plough conj do. plup .1 pl sow. imp .1 pl
‘After we had ploughed (the fields), we used to sow.’
2.5.2.3. Complement clauses

Turkic complement clauses may rely on non-finite subordinators (verbal nouns) as
the direct object of verba dicendi et sentiendi. Across the region this construction
is replaced by copies of Iranian patterns (for Turkic in Iraq, see Bulut, this volume,
chapter 3.5, §2.3.2.3): The postpositive complement clause is based on a finite
verb form and is linked to the main clause by the copied conjunction ki/ke, see
sample text (2/6), and examples (17)–(19) below. The same construction is applied
to render indirect speech, as in example (19).
(17) Kalhor of Bayâdestân
Görällä’ ke dädäm huša gälär
see.aor .3 pl conj father.poss .1 sg consciousness come.aor .3 sg
‘They saw that my father recovered consciousness.’
(18) Bilsinler ki mänim naharım yanıbdı da
know.imp .3 pl conj my lunch.poss .1 sg burn.prf .3S g also
‘They shall know that my lunch has burned.’
Dedim ki ilan čox daha varde.
say.pst .1 sg conj snake many more existent.cop . prs .3 sg
‘I said that there were many more snakes.’
2.5.2.4. Periphrastic expressions of modality

Turkic languages possess a diverse inventory of finite verb forms to express various
moods, such as imperative, voluntative, necessitative or conditional. Moreover,
nominalized verb forms/non-finite subordinators, such as the so-called future par-
ticiple, the necessitative {-mAlI}, or combinations of an impersonal modal expres-
sion (such as gerek or the copied lâzım ‘necessary’) and the verbal nouns in {-mA}
or {-mAk} etc. can express distinctive shades of modality. In Iranian languages, on
the other hand, the subjunctive is the sole category that indicates mood or marks
dependent verbs. In situations of intensive contact with Iranian languages (or other
Indo-European languages, such as, for instance, Greek), many Turkic varieties have
restructured their means of expressing modality, with the Turkic optative-subjunc-
tive or the imperative spreading into positions of equivalence held by the multi-func-
tional Indo-European subjunctive (Soper 1996: 284–285; Bulut 2000, and 2009:
67–68) — a development which is already reflected in Old Anatolian Turkic texts.
The intentional ‘to want to’

The Turkic expression for ‘to want to do something’ consists of a lexical verb
in the form of a verbal noun that functions syntactically as a complement of the
modal verb ‘to want’. Turkish, for instance, would render ‘I want to go’ as gitmek
istiyorum {go.vn want.prs .1 sg }. Turkic varieties across the region, and, espe-
cially Iran-Turkic, use a periphrastic construction consisting of iste- ‘to want’ and
a dependent lexical verb in the optative-subjunctive or the imperative, such as,
for instance, istiräm gidäm {want.prs .1 sg go.opt .1 sg }, and istiräm gitsin {want.
prs .1 sg go.imp .3 sg } ‘I want him/her to go’. As the three translations of the sen-
tence ‘I want to go’ below demonstrate, the underlying model is the Iranian peri-
phrastic modal construction, the order of constituents being modal verb—lexical
verb, while the optative in Iran- or Iraq-Turkic corresponds to the subjunctive of
the dependent verb in the Iranian model.
Turkish Turkic varieties Iran/Iraq Modern Persian
Gitmek istiyorum İstiräm gidäm Mi-xwâham beravam
go.vn want. want. go.opt .1 sg dur -want. go. subj .1 sg
prs .1sg prs .1sg prs .1 sg
Possibility
In the Turkic varieties of the area the so-called (im)possibility form (‘to be (not)
able to’) exhibits a strongly grammaticalized combination of lexical verb and the
former modal verb (bil- ‘to know, to be able to’)30, e. g.
Gördüm daha gidäbilmir.
see.pst .1 sg more go.imposs . prs .3 sg
‘I realized that it couldn’t walk anymore.’
30
To express (im-)possibility or (in-)ability, the Turkic varieties of the region (Azerbai-
jan, Iraq, Iran and some regions in eastern Anatolia) may employ the deverbal verb suf-
fix {-Ebil-} attached to the stem of the lexical verb. As for negated forms, {-Ebilmi-},
as in (2: 30) yıɣ-abilmæz {collect-imposs . aor .3 sg } ‘he will not be able to collect’, or
{-EmE}, as in Turkish of Turkey, appear in irregular distribution, or, in some regions,
even in parallel.
The notion of (im-)possibility or (in-)ability may also be periphrased by using

the Turkic full verb bašar- ‘to succeed, can, be able to’ as an auxiliary, which
combines with the lexical verb in the subjunctive/optative, see example (21) from
Sonqorî:
(21) Sonqor
bašarmazmı(š) qaza qeyrä
succeed.aor . neg . inf .3 sg food prepare. opt .3 sg
‘She is (as they say) not able to/cannot prepare food.’
The model for this periphrastic construction is in all probability the Iranian struc-
ture relying on the modal auxiliary ‘can’ and the lexical verb in the subjunctive,
as, for instance in the Persian mî-tavân-am be-gu-y-am or the Kurdish dı-kar-ım
be-ǰ-im {dur -can. prs -1 sg + subj -say.1 sg } ‘I can say’, literally: ‘I can (that) I
say’.
‘to tolerate, let’

Structurally similar is the expression for ‘let someone do something’, which is
based on the Turkic verbs bıraq- ‘let’- or qo(y)- ‘put’ as auxiliaries, combining
with the lexical verb in the optative or imperative. This periphrastic modal con-
struction clearly goes back to a calque of the Iranian/Persian compound expression
for ‘to let someone do something’. As in the case quoted above, in Persian a former
full verb (gozâštan ~ goḏârdan ~ gozârdan, present stem: goḏâr ~ gozâr ‘to put,
let, leave, permit, tolerate, suffer’) with reduced lexical content takes up the func-
tion of an auxiliary, meaning ‘let, permit’. It combines with the lexical verb in the
subjunctive, as in bezâr31 bexâbam {let.imp .2 sg sleep.subj .1 sg } ‘let me sleep!’
Both in the Persian model and in the Turkic copy, the agents of the auxiliary and
lexical verbs may differ, see examples (22) and (23) below:
(22) Kerkuk
Qoy zad sänä deyim.
put.imp .2 sg thing you.dat say. opt .1 sg
‘Let me tell you something.’
(23) Bayâdestân
Qoymadı qačsın.
put.neg . pst .3 sg get_away.imp .3 sg
‘He didn’t let him get away.’
31
Note that in colloquial Persian, the stem-initial /g-/ is deleted; bezâr thus is a short/
contracted form of the imperative of the 2nd person singular begozâr.
Impersonal modal expressions

Impersonal modal expressions such as gäräk or lâzım ‘necessary’ also combine
with the lexical verb in the optative or imperative, which hold a position of equiv-
alence to the Iranian subjunctive (see Bulut, this volume, chapter 3.5, §2.5.2.4, and
Bulut 2000). The Iran-Turkic varieties under consideration quite regularly display
the optative in corresponding constructions, see, for instance, e. g.
(24) Esfahan
härkäs šärbätinän gäräk dâmâd ičä
everyone sorbat.poss .3 sg . abl necessary bridegroom drink.opt .3 sg
‘the bridegroom has to drink from everyone’s lemonade …’
Instead of genuine Turkic expressions such as gäräk, global copies of impersonal
Persian modal expressions may also appear, as, for instance, na-bâdâ ‘it shall not
be; may God forbid that …’ in example (25) from Sonqor:
(25) Sonqor
Na-bâdâ âl gälä bunu apara.
it shall not be al come. opt .3 sg her. acc take away. opt .3 sg
‘It shall not be that the Al comes to take her away.’
Expressions of purpose and intention
The notion of finality (intention or purpose/aim to which the action is directed)
can be expressed the Turkic way by a non-finite subordinator (the verbal noun in
{-mAK}) combined with the postposition için, see Bulut, this volume, chapter 3.5,
§2.5.2.4. More frequently used, though, is a construction imitating Iranian models,
where the non-finite verb form is replaced by a finite verb in the subjunctive/opta-
tive as in example (26), or in the imperative.
yarı gecä Al gälär munuŋ ušaɣın apara
midnight Al come. aor .3 sg her child.poss .3sg .acc take_away.opt .3 sg
‘At midnight the Al comes to take her child away.’
2.5.2.5. Conditional clauses32

Turkic conditionals establish a separate category of mood explicitly marked by the
morpheme {-sA}, which may combine with additional tense/aspect markers. The
reduction of Turkic modal categories according to Iranian models based solely on
the subjunctive (see above) has also affected Iran-Turkic expressions of condition-
ality. Thus, the optative (which has widely been identified with the Iranian sub-
32
For a more thorough discussion and more examples on Iran-Turkic conditional con-
structions see Bulut (2009).
junctive) now also appears in Turkic conditional clauses. As in the Iranian model,
conditionality is no longer expressed by morphological means, but by the copied
Iranian conjunction ägär ‘if’; see the conditional construction expressing poten-
tial in example (27) and irrealis (signalized by the optative past) in example (28)
below:
ägär čörägi tökäy toyuɣä, (…) bäräkätı
if bread.acc pour. opt .2 sg chicken.dat blessings.poss .3 sg
gidär.
go. aor .3 sg
‘If you give the bread to the chickens, […] its barakat, its blessing will go.’
(28) Bayâdestân
Äyän onnan vurmiyaydım, hälnuz qanı durmamıšdı.
if it.abl pour. opt II.1 sg still its blood stop.neg . plu .3 sg
‘If I had not made use of it (medicine), the blood would still be flowing.’
2.6. Lexicon
The Turkic core vocabulary is based mainly on Turkî, that is the South Oghuz or
Central Oghuz (Azeri) lexicon. Characteristic of Iran-Turkic is a high proportion
of Persian vocabulary, the ratio of which varies across the different varieties. At
present most speakers of Turkî have at least a bi-lingual lexicon of Turkic-Persian
synonyms at their disposal. Thus, it is difficult to say to what degree copies of
Persian words are conventionalized in any particular variety, or which copies have
outnumbered their Turkic counterparts. For a good survey of Iran-Turkic lexicon,
see Hacaloğlu (1992).
Copies of the lexicon of smaller regional languages obviously pertain to older
strata of the language. The Turkic tribes of Bayâdestân, for instance, have copied
special terminology pertaining to agriculture and irrigation techniques from the
neighbouring Tati populations. This implies that the nomadic tribes, who came
into the area roughly two centuries ago, adopted techniques of cultivation and the
related lexicon of their close neighbours.
No research exists on idiomatic expressions, many of which use the same
semantic means across the different languages of the greater area.
Text 1 (Sonqor)
On the history of Sonqor

(1/1) ʔæmmɑː unʧı‿ kheː rɑʤı bı tɑːrıx-eː Soŋɢor
but so conj about history Sonqor
dɑnɨʃıllɛː, bıʤɛ thɛphø‿ʏssınɛn, bıʤɛ bʊ
speak.aor .3 pl one hill.posp one this
gonbæd-mɒnɛnd sɑxtəmɑn wɑːr k eː
h
bo ʊ
gunbæd
dome-like building exist conj this dome
Sælʤuɣılær zæmɒniː ˈbænnɒsınnɛ‿dej.
Saljuq.pl time.poss .3 sg building. poss .3 sg . abl . cop .3 sg
‘Yet, concerning how people talk about the history of Sonqor, there is a
dome-like building on the top of a hill, and this dome is from the
Saljuqian period.’
(1/2) vɛ dıˈjɛllɛː↗, mæħɛll-ɛ dæfn-e mɒːlıkh-i‿ri
and say.aor .3 pl place.ez burial. ez king. spec . cop .3 sg
kheː Soŋɢore qejrımi.
conj Sonqor. acc found.inf . prf .3 sg
‘And they say that it is the place of burial of the king who founded
Sonqor.’
(1/3) vɛ bıʤeː dɛ mothæɣıd‿dəlɛː33 k hı bʊ qabər-ı k hı
and some also convinced.cop .3 pl conj this grave. spec conj
wɒr‿diː↗ mærbud diː be Soŋɢor bækh, k he
exist. cop .3 sg pertaining. cop .3 sg prep Sonqor Beg conj
æz sær-bɒzɒn‿ o æfsærɒn-e maɣol‿ımı.
prep soldier. pl and officer. pl . ez Mongol. inf - cop .3 sg
‘And some are also convinced that this grave over there pertains to
Sonqor Beg, who is said to have been of the soldiers or officers of the
Mongols’.
(1/4) vɛ bʊ gɛlmı Soŋɢorɛ, ʊrdæ wæthæn ılɛmı
and this come.prf 3. sg Sonqor. dat there home make.prf .3 sg
zɛndegı‿ ılɛmı; vɛ bo zæbɒːn-e thørkhıniː
life make.prf .3 sg and this language.ez Turkî.acc
33
From Persian (< Arabic) motaqâ‘ed ‘convinced’.
ræwɒʤi vermı,34 khe ælʔɒn tɑ‿ ’neʤın35 bɒːɣı ɣɑlmi36.

promote.prf .3 sg conj now till now remain.prf .3 sg
‘So he came to Sonqor and settled down and spent his life there; and he
promoted the Turkic language that has survived to this day’.
(1/5) vɛ lɒː thæmɒː37 ħætthɒ jɛkh færsæx, ʃıʃ khılʊmɛtr ʊ
and not just till one parasang six kilometres that
thæræf-ə Soŋɢor thæmɒm Kurdı soßɛth ılɛllɛː.
side.ez Sonqor all Kurdish speak.aor .3 pl
‘And not even one parasang, six kilometres to the other side of Sonqor,
they only speak Kurdish.’
Text 2 (Bayâdestân)
The wedding of the jinn

(2/1) mɛn øzʏm k he dærs oxıjardɨm dævırısdɑːn,
I myself conj lesson read.aor . pst .1 sg high_school.abl
noʊwærɑndɞ bır jerd’‿ otʊrardeq.
Noubarân.loc one place.loc live.aor . pst .1 pl
‘When I was studying at high school, at the dabîrestân, we were living in a
place in (the town of) Noubarân.’‘
(2/2) mostæʤır‿ədıg, sʕɒːʔəb-xɒnamɞz vɑr‿ədə, o ʧox
tenant.cop . pst .1 pl landlord.poss .1 pl exist.pst .3 sg he very
ɞ ʤınnə’-o ɒːldan tæʔrɞflɛrdə38.
that jinn.pl - conj Al.abl relate.aor . pst .3 sg
‘We had rented the place, and our landlord used to talk a lot about these
demons and the [birth-demon] Al.’
34
From Persian ravâğ dâdan ‘to spread, promote’.
35
Contracted form of indicin ‘now.instr ’
36
From Persian bâqî mândan ‘to remain’.
37
For Arabic ka-mâ ‘just, only’.
38
For ta‘rîf elä- to describe, praise, relate etc.
(2/3) mæsælæn deɛrdı: “bır dʕænɛ noʊbærændɛj

for_instance say. aor . pst .3 sg one quant Noubarân.loc
khıʃe vɑr, ʤınnɛr‿ənan ertəwat39 vɑrə.”;
man exist jinn.pl . instr relation exist.poss .3 sg
‘So, for instance, he would say: “There is a man here in Noubarân who is
in contact with the demons”;
(2/4) ın‿kheː bır khæsən tojındaː bʊ ʤınnɛr,
conj one person.gen wedding.poss .3 sg . loc this jinn.pl
derə, mæsælæŋ gɛlıllɛr, bu bızım
say. prs .3 sg for instance come.prs .3 pl this our
ɒdæmɨj zɒdlærɨn, lıbɒːsɨn, zɒd‿
man.gen thing.pl . poss .3 sg . acc clothes.poss .3 sg . acc thing
ɒparɞllɛj geʔıllɛr.
take_along.prs .3 pl wear. prs .3 pl
‘so that, as he said, at somebody’s wedding these demons would come
and take this person’s things, his clothes along to wear them.’40
(2/5) annan o khiʃıne ʤinnɛr, deʔɛllɛr, tojlarɨnə,
conj that guy.acc jinn.pl say. aor .3 pl wedding.poss .3 pl . dat
deʔɛllɛr o ʤınnɛr‿ənæ̃n ræfd-ɒmæd vɑr,
say.prs .3 pl that jinn.pl .with contact exists
dæwɛt elɛllɛr bʏlæsın.
invitation make.aor .3 pl him.acc
‘Now people say that the jinn invite this man who keeps contact with
them to their weddings.’
39
From Persian (< Arabic) ertebâﬂṭ ‘relation, contact’.
40
This story is somehow related to the belief of the ta‘bia, a twin everyone (or especially
every woman?) has among the people of the jinn. If a girl gets married, her alter ego
of the demons will hold a wedding on the same day. Therefore, she borrows her human
counterpart’s wedding dress.
(2/6) annan khiʃı gørør khıː, bʊ qoɲʃʊsɨnən

then guy see.prs .3 sg conj this neighbor.poss .3 sg . gen
qızenıñ lıbɑːsʊj k hı thojde
daughter.poss .3 sg . gen dress.poss .3 sg conj wedding.loc
gjeːjmIʃde, o gæːlıŋ khı, ʤiːn k hı gæːlın
wear. plup .3 sg that bride conj jinn conj bride
olʊb, onən nɛ ejnẽndɛ‿ də.
become. prf .3 sg her also body. poss .3 sg . loc cop .3 sg
‘And there this man sees that the bride, the bride of the jinn, is wearing
the bridal dress that his neighbor’s daughter had been wearing at her
wedding.’
Abbreviations
abl Ablative instr Instrumental

acc Accusative loc Locative
aor Aorist neg Negation
conj Conjunction opt Optative
cop Copula pl Plural
dat Dative plup Pluperfective
dem Demonstrative posp Postposition
ez Ezafe poss Possessive
gen Genitive prf Perfective
ger Gerund pst Past
imp Imperative sg Singular
imposs Impossible spec Specificity
inf Inferential
References
Boeschoten, Hendrik. 1998. The speakers of the Turkic languages. In Lars Johanson & Éva
Á. Csató (eds.), The Turkic Languages, 1–15. London & New York: Routledge.
Bulut, Christiane. 1999. Klassifikatorische Merkmale des Iraktürkischen. Orientalia Sue-
cana, vol. 48. 5–27.
Bulut, Christiane. 2000. Optative constructions in Iraqi-Turkman. In Aslı Göksel & Celia
Kerslake (eds.), Studies on Turkish and Turkic Languages. Proceedings of the IXth In-
ternational Conference on Turkish Linguistics, Lincoln College, Oxford, 12–14 August,
1998, 161–169. (Turcologica 46). Wiesbaden: Harrassowitz.
Bulut, Christiane. 2003. Pronominal systems in the transitional varieties of the Turkic
dialects in East Anatolia, Iraq and Western Iran. In Sumru Özsoy (ed.), Studies in
Turkish Linguistics: Proceedings of the Tenth International Conference in Turkish Lin-
guistics, Bogaziçi University, 16–18 August, 2000, 321–335. Istanbul: Bogaziçi Yayın-
ları.
Bulut, Christiane. 2009. Conditional clauses in Iran-Turkic. In Éva Á. Csató, Gunvald Ims, Joakim
Parslow, Finn Thiesen & Emel Türker (eds.), Turcological letters to Bernt Brendemoen,
35–75. Oslo: The Institute for Comparative Research in Human Culture. Novus Press.
Bulut, Christiane. 2016. Convergence and variation in the Turkic varieties of Iran: Ex-
amples from Qashqâ’î. In Éva Á. Csató, Lars Johanson, András Róna-Tas & Bo Utas
(eds.), Turks and Iranians: Interactions in Language and History, 235–282. (Turcolog-
ica 105). Wiesbaden: Harrassowitz.
Bulut, Christiane. Forthcoming. On language and culture in the Turkic region of Bayâdestân/
Iran. Habilitation thesis, Mainz 2006.
Islamov, M. I., Aghajev, Ä. G., Behbudov, S. M., Ahmadiv, T. M., Mammadov, N. Ij.,
Tagıyev, B. M. & Z. A. Hasıyev (eds.). 1990. Azärbaycan Dilinin Dialektoloji Atlası
[Dialectological Atlas of the Azerbaijani language]. Baku: Älm.
Doerfer, Gerhard. 1988. Grammatik des Chaladsch. Wiesbaden: Harrassowitz.
Doerfer, Gerhard. 2006. Irano-Turkic. In Lars Johanson & Christiane Bulut (eds.), Tur-
kic-Iranian Contact Areas, 93–113. (Turcologica 62). Wiesbaden: Harrassowitz.
Doerfer, Gerhard & Wolfram Hesche. 1989. Südoghusische Materialien aus Afghanistan
und Iran. Wiesbaden: Harrassowitz. Hacaloğlu, Recep Albayrak. 1992. Azeri Türkçesi
Dil Kılavuzu [Azeri Turkish Language Guide]. Ankara: Türk Tarih Kurumu.
Gabain, Annemarie von. 1941. Alttürkische Grammatik. Wiesbaden: Harrassowitz.
Hacaloğlu, Recep Albayrak. 1992. Azeri Türkçesi Dil Kılavuzu [Azeri Turkic language
guide]. Ankara: Türk Tarih Kurumu.
Johanson, Lars. 1985. Die mittelaserbaidschanischen Transkriptionstexte der Universitäts-
bibliothek Uppsala. In Beşinci Milletler Arası Türkoloji Kongresi. İstanbul 1985. Te-
bligler I. Türk Dili, Cilt 1. 143–147.
Johanson, Lars. 1990. Subjektlose Sätze im Türkischen. In Bernt Brendemoen (ed.), Altaica
Osloensia. Proceedings from the 32nd Meeting of the Permanent International Altaistic
Conference, 193–218. Oslo: Universitetsforlaget.
Johanson, Lars. 1993. Code-copying in immigrant Turkish. In Guus Extra & Ludo Ver-
hoeven (eds.), Immigrant languages in Europe, 197–221. Clevedon, Philadelphia &
Adelaide: Multilingual Matters.
Mann, Oskar. 1930. Die Mundarten der Gûrân, besonders der Kändûläî, Auramânî und
Bâdschälânî, (ed. by Karl Hadank). Berlin.
Mansuroğlu, Mecdut. 1959. Das Altosmanische. In Philologiae Turcicae Fundamenta I,
161–181. Wiesbaden: Steiner.
Širaliev, M. S. 1983. Dialekty i govory azerbajd¢anskogo jazyka [Dialect and speech of the
Azerbaijani language]. Baku.
Soper, John. 1996. Loan Syntax in Turkic and Iranian. (Rev. and ed. by Andras J.E. Bodro-
gligeti). Bloomington, Indiana: Eurolingua.
Tuna, Osman Nedim. 1984. Ebi Verdi: Iran’da bir Türk diyalekti [Ebi Verdi: A Turkic dia-
lect in Iran]. Türk Dili Arastırmaları Yıllıgı, Belleten, 215–245.
Windfuhr, Gernot. 2006. Linguistic Sketch of Azeri. In Encyclopædia Iranica 8.4, 396–401.
New York: Center for Iranian Studies, Columbia University. http://www.iranicaonline.
org/articles/iran-vii7-turkic-languages (accessed 15 February 2018).
4.3. Bakhtiari
Erik Anonby and Mortaza Taheri-Ardali
1. Introduction
The Iranic (Indo-European) language Bakhtiari is spoken by about one million

people in the Zagros Mountains of south-western Iran. The main part of the Bakh-
tiari language area is divided among four provinces: Khuzestan, Chahar Mahal va
Bakhtiari (=C&B), Lorestan and Esfahan. The largest cities with significant Bakh-
tiari-speaking populations are Masjed Soleymān and Izeh (Khuzestan), Shahr-e
Kord (C&B), Dorud (Lorestan) and Aligudarz (Lorestan) (Anonby and Asadi
2014, hereafter AA 2014). The approximate location of the Bakhtiari language
area is shown in Fig. 1.
The Bakhtiari ethnic confederation (B. ayl) has played a prominent role in the
history of Iran (Digard 1988), and their nomadic livelihood is one of the iconic
images of cultural diversity in the country. Aspects of Bakhtiari culture, such as
the weaving of carpets and traditional clothing, including the distinctive indigo-
and-white striped woolen vests (B. čuqā), are well-known inside and outside of
Iran (Ittig 1988; Digard 1992).
Within the Iranic family, Bakhtiari has been classified as Southwestern, and
more specifically as part of the Lori (or “Perside”) language group (Windfuhr
2009; MacKinnon 2011; Anonby 2017). The Lori group forms a continuum
between Persian and the Kurdish language bloc to the north-west (Anonby 2003,
2016). Bakhtiari is geographically and structurally intermediate between North-
ern Lori and Southern Lori, although it is more similar to the latter (Amanollahi
1991; MacKinnon 2011; Anonby 2003, 2017). Within the Bakhtiari language, no
clear dialect groups have been identified; speakers tend to refer to subvarieties
of Bakhtiari in reference to urban centres, or major ethnic divisions (e. g., the
Haflang vs. Chārlang ethnic moieties) (AA 2014: 17), but no linguistic evidence
has been advanced to support these assessments, and an overall picture of the
internal dialect situation remains unclear.
In the first published study relating to Bakhtiari, Mann (1910) made general
remarks on the language and collected some texts as part of a larger work on the
Lori language group. This was supplemented by a more substantial description of
Bakhtiari by Zhukovskij, which was published posthumously in 1923 (Abdullaeva
2009), and recently re-edited and translated into Persian by M. Shafaghi and M.
Dadres (Zhukovskij 1923/2017). Lorimer, whose (1922) work is the best-known
study of the language, treated Bakhtiari phonology in a way that is both extensive
and detailed, but not phonemically rigorous. He also prepared and published many
Bakhtiari texts (Lorimer 1930, 1954, 1955, 1963, 1964; Lorimer and Lorimer
446 Erik Anonby and Mortaza Taheri-Ardali
Figure 1: The Bakhtiari language area (based on AA 2014:17)
1919). While Kerimova’s (1982) overview of Bakhtiari brings together and organ-
izes a wide cross-section of data from these prior sources, Windfuhr’s (1988) article
concentrates on historical phonology and key morphological features of the lan-
guage. Sociolinguistic background on the Bakhtiari language community and its
place within the Lori language continuum are explored in Anonby (2003, 2012). A
detailed phonological description, along with a long glossed text, verb paradigms,
and a 1500-word lexicon, was published by Anonby and Asadi (AA 2014). Madadi
(1996) and others have also produced lexicons of Bakhtiari, and Zolfaghari (2017)
has written a multi-faceted study of the language, with a focus on lexicon, from
an anthropological-linguistic perspective. An in-depth study of the geographic
distribution of Bakhtiari in C&B Province has appeared in ALI (the online Atlas
of the Languages of Iran, eds. Anonby and Taheri-Ardali 2015–18, hereafter AT
2015–18), and a dialectology of Bakhtiari dialects spoken there is underway in the
same forum (Taheri-Ardali and Anonby 2017; Taheri-Ardali 2017). Schreiber et
al. (2017) have looked at areal grammatical convergence between Bakhtiari and
Turkic varieties in C&B. Finally, Khosravi (1996) has made extensive notes on the
grammar of Bakhtiari, and E. Tāheri has provided the most complete grammatical
description to date in his (2010) book.
Bakhtiari 447
Bakhtiari has a long and rich tradition of oral literature, including genres such
as folktales, proverbs and sung poetry. There is no standardized orthography,
but in the last century speakers have adapted Persian orthography to Bakhtiari
and have published many volumes of oral literature (some well-known examples
include: Dāvari 1964; Afsar Bakhtiyāri 1965; Forutan 1996; Ra’isi 1996; Kiyāni
Haft Lang 2000; and Shirin Bayān 2007). Other recent publications in Bakhtiari
include a collection of prose writings on language and culture (Be’vār 2000), a
Bakhtiari adaptation of the Shahnameh epic (Khosraviniyā 2010) and a novel enti-
tled Brotherhood (B. gaʋgari) (Khosraviniyā 2011).
As is the case for all languages in Iran, Persian – in its various historical
incarnations and dialectal forms – has exerted a long-standing influence on Bakh-
tiari. Following many centuries where tendencies of divergence and convergence
between Bakhtiari and its south-western sister Persian were more balanced, over
the last century Bakhtiari structures at all levels of the language are increasingly
hybridizing with their Persian counterparts under the influence of universal school-
ing and media: radio, television, and now internet. Perhaps even more signifi-
cantly, parents in many parts of the language area, and especially urban locations,
are teaching their children Persian as a first language in the home (Taheri-Ardali
2015; AT 2015–18). Consequently, despite a large language community, a pattern
of language shift has been set in motion and the future of the language is un-
certain.
The present study provides a linguistic overview of Bakhtiari, with an account
of key elements of phonology §2, morphology §3, and syntax §4. This descrip-
tion is followed by a glossed text §5. The analysis here is based on the Masjed
Soleymān dialect as described in Anonby and Asadi (AA 2014), unless otherwise
noted, but salient points of dialectal variation in the language are addressed, and
the glossed text is from the Ardal dialect of C&B Province.
2. Phonology
2.1. Segmental phonology

There is significant variation in the phonological inventories of dialects across the
Bakhtiari language area. Urban varieties and Bakhtiari dialects in the east, which
are transitional to the Persian varieties of the Iranian Plateau, tend to pattern more
closely with Persian, whereas dialects in the west are more distinctive.
Some urban Bakhtiari varieties and dialects in the east exhibit a Persian-type
vowel system with six vowels (Taheri-Ardali and Anonby 2017). In most dialects,
however, the Middle Iranic majhul (long mid) vowels ē and (sometimes) ō (see
Perry 1996) live on in various ways, whether as unitary mid-high vowels ē ō,
diphthongs ie uo, or vowel-consonant sequences ey oʋ (contrasting with ay aʋ)
(e. g., zēne / zeyne / ziene ‘woman’) (AA 2014: 56–58; Lorimer 1922: 23, 32). The
historically long vowels i, ā and u – and, where they occur, ē and ō – still tend to
be pronounced with greater phonetic length than the vowels e, a and o. This dif-
ference is not consistent in all contexts, though, and consequently vowel quality
rather than length is the primary criterion for distinguishing among vowels (AA
2014: 60–61).
Table 1: Bakhtiari vowel inventory

i [iˑ] u [uˑ]
(ē [eˑ]/etc.) (ō [oˑ]/etc.)
e [ɛ̝] o [ɔ̝]
a [a] ā [ɒ̜ˑ]
In the Bakhtiari dialect of Sar Āqā Seyyed in northern C&B Province, contrast
between u and its fronted counterpart ü has also been observed, as evidenced by
the minimal pair du ‘yogurt drink’ / dü ‘smoke’ (Taheri-Ardali 2017).
The Bakhtiari consonant inventory contains members from six general places
of articulation, and six manners of articulation. It is more uniform across dialects
than the vowel inventory, with the only significant case of variation being the
absence of a voiceless uvular obstruent q in some eastern dialects; there, it is
replaced by its voiced counterpart ġ (e. g., qalb/ġalb ‘heart’).
Table 2: Bakhtiari consonant inventory

labial alveolar postalveolar palatal/velar uvular glottal
vl. stop p t č [t͡ ʃ] k [k~kj~c] q
vd. stop b d j [d͡ ʒ] g [ɡ~ɡj~ɟ] ġ [ɢ~ʁ]
vl. fricative s š [ʃ] x [χ]
vd. fricative z ž [ʒ]
nasal m n
lateral l
trill r [ɾ]
glide ʋ [w~ʋ~v] y [j] h [h~ɦ]
Phonetically, vowel-initial words have a glottal stop [ʔ] onset, but this segment is
not contrastive ([ʔ]aʋ ‘water’, [ʔ]eyleʋār ‘jaw’, [ʔ]isā ‘you (pl.)’).
While ʋ and h both have fricative realizations in some contexts, Anonby and
Asadi (2014: 25) categorize them as glides because of their distribution and behav-
iour. The glide ʋ in particular has three allophones [v], [ʋ], [w] ([v]aro ‘high’,
ko[ʋ]ār ‘skeleton’, ša[w] ‘night’).
Bakhtiari 449
When h appears after vowels, it is realized phonetically as length on the

preceding vowel: peh  p[ɛ̝ː], māh  m[ɒ̜ː] ‘month, koh  k[ɔ̝ː] ‘mountain’. The
consonantal value of postvocalic h reappears weakly, as a voiced (i. e., breathy) [ɦ]
when a vowel follows, and the vowel is shortened to its original duration (ya kohey
 ya k[ɔ̝ɦ]ey ‘a mountain’).
Voiceless stops are non-contrastively aspirated in most positions ([ph]ar ‘leaf’,
ma[kh]al ‘barren (land)’, hayʋa[th] ‘appearance’), but are not aspirated when
they follow another obstruent (poš[q]āʋ ‘plate’) or precede another consonant
(qā[p]niden ‘to grab’).
Velar stops k and g are palatalized [kj], [ɡj] – and in some dialects, fully palatal
[c], [ɟ] – before front vowels ([kj]ar/[c]ar ‘deaf’, [ɡj]ap/[ɟ]ap ‘big’) and in codas
(hā[kj]/hā[c] ‘dirt’, len[gj]/len[ɟ] ‘leg’).
The voiced alveolar stop d, which is pronounced as [d] word-initially and after
a consonant ([d]ā ‘mother’, gar[d]en ‘neck’), is conspicuously realized as an inter-
dental approximant [ð̞ ] after a word-internal vowel or glide, whether in the middle
of a word (ga[ð̞ ]e, meh[ð̞ ]e – both ‘stomach’) and or word-finally (ka[ð̞ ] ‘waist’,
zey[ð̞ ] ‘s/he hit’). In the Sar Āqā Seyyed dialect of north-western C&B, a com-
plete phonemic shift to r has taken place in the same phonological position (mehre
‘stomach’, zeyr ‘s/he hit’; Taheri-Ardali 2017, AT 2017).
The voiced postalveolar fricative ž [ʒ] is contrastive in some dialects (B. of
Masjed Soleymān žerde ‘torn’, luže ‘evading’, ġož ‘anger’; AA 2014: 29–31, 46),
but in many other dialects it only appears as an allophone of other postalveo-
lar obstruents before another non-continuant segment (e. g., hi[ʒ]dah ‘eighteen’,
ga[ʒ]din ‘scorpion’ ma[ʒ]me/ma[ʒ]mah ‘platter’; Taheri-Ardali 2017; AT 2017).
Geminated (i. e., doubled or strengthened) consonants are uncommon (garr
‘person with ringworm’, takabbor ‘arrogance’), and geminated consonants in
Persian borrowings or cognates are usually simplified (P. darre, B. dare ‘valley’;
P. ammā, B. amā ‘but’).
When n follows a vowel-glide sequence within a coda, it is realized as nasali-
zation on this sequence (zoʋn  z[õw] ‘tongue’, dendoʋn  dend[õw] ‘tongue’,
paynš=tā  p[ãj]š=tā ‘five (count)’). In eastern dialects where cognate sequences
do not contain a glide (e. g., zun ‘tongue’, dendun ‘tooth’) n is retained as a full
phonetic consonant.
2.2. Syllable and word structure

Syllables are composed of an optional simple onset, a nucleus, and an optional coda
with up to three consonants. There are seven attested syllable shapes in Bakhtiari:
V i ‘this’ (as in i dast=o ‘this hand’)

VC āš ‘stew’
VCC asp ‘horse’
CV jo ‘barley’
CVC bur ‘brown’
CVCC leng ‘leg’
CVCCC be.peyžn‘twist, wrap (impv.)’
In inherited vocabulary, there are clear constraints on the voicing and sonority
sequencing of coda consonants. In words borrowed from Arabic (via Persian),
atypical coda clusters are in some cases retained (P.=B. toxm ‘egg’), but in other
cases the clusters are reconfigured (P. hokm > B. hoym ‘order (n.)’, P. fekr > B. ferg
‘thought (n.)’). Codas may contain up to three consonants, but in this case the first
consonant is always a glide y ʋ or h (šoʋnz.dah ‘sixteen’, be.seyʋn ‘stir! (sg.)’).
Vowel sequences are uncommon, with only ai (baig ‘bride’) and āi (māi ‘fish’)
attested within a few inherited morphemes. Additionally, in some dialects, vowel
sequences have emerged in borrowed words where the source language’s glottal
stop, which is not phonemic in Bakhtiari, is lost (sāʔat > B. sāat ‘hour’, P. doʔā
> B. doā ‘spontaneous prayer’; but note that some other dialects ʔ is replaced by
h: B. sāhat, dohā). At certain morpheme boundaries where vowels come together,
vowel sequences are permitted (be- ‘(impv.)’ + ašn ‘hear’  beašn ‘hear! (2sg.)’; ya
… =ey ‘(indef.)’ + doro ‘lie (n.)’  ya doroey ‘a lie (n.)). In most cases, however, the
vowel sequence is eliminated through deletion of one of the vowels (e- ‘(impfv.)’ +
ašn ‘hear’ + -om ‘1sg.’  ašnom ‘I hear’; pā ‘foot’ + =om ‘1sg.’  pām ‘my foot’)
or the insertion of a glide between them (taynā ‘alone’ + =om ‘1sg.’  taynāyom ‘I
am alone’; hālu ‘paternal uncle’ + =ay ‘(voc.)’  hāluʋay ‘hey, paternal uncle!’).
There are no absolute restrictions on minimal lexical words, although words
comprised of a single syllable ending in the historically “short” vowels e, a and
o (see the vowel inventory in §2.1 above) are rare (AA 2014: 60–61); examples
include se ‘three’ and na ‘no’ (see §3.2.3), and a few verb forms such as go ‘s/he
said’ and xa ‘s/he ate’.
Monomorphemic words are almost always one or two syllables long, although
semantically opaque three- and four-syllable nouns originating as compounds or
borrowed words can be found (ey.le.ʋār ‘jaw’, za.lā.la.ti ‘distress’). In morpholog-
ically composite words, there can be more than four syllables (i hoʋ.ne.hā.ho.ne
‘these houses (obj.)’, na.ez.mā.yi.do.me.soʋn ‘I didn’t test them’).
2.3. Stress and intonation

In Bakhtiari, stress (or, more technically, phrasal accent) is associated with content
words: verbs, nouns, adjectives and adverbs. It is normally hosted by the final
syllable of the stem (ˈpā ‘foot’, boˈtol ‘beetle’, kabuˈde ‘poplar tree’). Affixes
Bakhtiari 451
and clitics attached to these words can be inherently stressed or unstressed (an
inventory of affixes and clitics, along with their designation for stress, is given in
Anonby and Asadi (AA 2014: 69–71)). Stressed morphemes override the underly-
ing stress of the stem (ˈpā ‘foot’+=ˈke ‘(def.)’  pāˈke ‘foot (def.)’; ˈna- ‘(neg.)’
+ ˈxa ‘s/he ate’  ˈnaxa ‘s/he didn’t eat’), whereas unstressed morphemes have no
effect, even if they are word-final (ˈpā ‘foot’+=ne ‘(obj.)’  ˈpāne ‘foot (obj.)’).
Intonation is marked in all sentence types through the raising of pitch on the
stressed syllables of pragmatically salient words. Declarative sentences, com-
mands, and content questions are marked by a general decline in pitch. In yes/no
questions, the pitch is raised on the first pragmatically salient element of the sen-
tence, and stays high until the end of the sentence, where it is raised even further.
Further discussion of intonation in Bakhtiari, along with pitch traces for example
sentences, is provided in Anonby and Asadi (AA 2014: 66–69).
3. Morphology
Nominals (§3.1) and verbs (§3.3) are open word classes in Bakhtiari. Closed
classes (§3.2) include numerals, prepositions, pronouns, and a heterogeneous set
of particles.
3.1. Nominals
Nominals, which include nouns (§3.1.1), adjectives (§3.1.2) and adverbs (§3.1.3),
share many structural properties but can be distinguished through their combina-
torial possibilities in morphology and their syntactic distribution.
3.1.1. Nouns
Nouns are inflected for number (plurality) and definiteness. Possession and other
types of grammatical association are expressed through use of an ezāfe construc-
tion (§4.1) rather than case marking, but there is an object marker used with nouns
(see in this section below). There is no grammatical gender.
Nominal plurality is marked with addition of the suffix -(h)ā, but its use is
correlated to contexts where the plural noun is definite, or when plurality is being
emphasized. The allomorph -ā is found with consonant-final stems (čel-ā ‘arms’,
merzeng-ā ‘eyelashes’), and -hā is found with most vowel-final stems (kolo-
hā ‘locusts’, hoʋne-hā ‘houses’). When the stem ends with a high vowel, how-
ever, the corresponding glide is inserted before -ā (ti-y-ā ‘eyes’, soru-ʋ-ā ‘songs’).
The plural suffix -oʋn is used in place of -(h)ā with a small set of nouns that
includes handicapped persons (kar-oʋn ‘deaf people’, šal-oʋn ‘lame people’) as
well as certain animals (gusend-oʋn ‘sheep, goats’, pāzen-oʋn ‘ibexes’).
A collective/plural suffix -gal (after alveolar consonants) / -yal (elsewhere) is

used with a closed set of animate nouns (kor-gal ‘boys’, boz-gal ‘goats’, ādom-
yal ‘people’, hāšo-yal ‘eagles’), as well as with nominalized adjectives referring
to groups or people or animals (tešne-yal ‘the thirsty’, zende-yal ‘the living’). In
southern dialects of C&B Province, which border on the Southern Lori language
area, this suffix takes the form -al and is generalized to inanimate nouns as well
(deraxt-al ‘trees’, mel-al ‘hairs’).
Definiteness and indefiniteness are both explicitly marked in the morphol-
ogy. The definite suffix/clitic (which carries stress, but appears to be syntactically
mobile; see AA 2014: 93) takes the form =ke with vowel-final nouns (pā=ke ‘the
foot’, hoʋne=ke ‘the house’), and =eke with most nouns ending in two consonants
(hast=eke ‘the bone’, ahd=eke ‘the treaty’). Stems ending in a single consonant
can take either form, with the longer form being more emphatic (sar=ke~sar=eke
‘the head’, taʋ=ke~taʋ=eke ‘the fever’). This so-called ‘definiteness marker’ is
not, however, simply the functional equivalent of the definite article in the lan-
guages of western Europe (for example). Investigation of Bakhtiari texts shows
that this marker is often absent on noun phrases even when they are conceptually
‘definite’ (see, among others, Abbott 2004) and would require a definite article
in other languages that mark definiteness. The precise nature of the definiteness
marking in Bakhtiari is a topic for further research.
Indefiniteness is marked using the discontinuous morpheme ya …=ey (ya
doʋdar=ey ‘a girl’, ya pā=ey ‘a foot’; ya …=i in some dialects), with the initial ya
component (cf. yak ‘one’) sometimes omitted. When this morpheme is attached to
a stem ending in a high vowel, the corresponding glide is inserted (ya geli-y=ey ‘a
throat’, ya tu-ʋ=ey ‘a room’).
A demonstrative function is similarly marked on nouns using a discontinu-
ous morpheme i …=(h)o ‘this’ / u …=(h)o ‘that’ (cf. the 3sg. distal/neutral free
pronoun ho; see §3.2.4), with the allomorph =o used for consonant-final stems (i/u
dast=o ‘this/that hand’, i/u del=o ‘this/that heart’) and =ho used for vowel-final
stems (i/u pā=ho ‘this/that foot’, i/u torne=ho ‘this/that tuft of hair’).
Direct objects are usually marked with the clitic =ne (ri=ne ‘face (obj.),
say=ne ‘dog (obj.)), which is reduced to =e with consonants other than glides
(xayār=e ‘cucumber (obj.)’, gom=e ‘footstep (obj.)’, hars=e ‘teardrop (obj.)’).
When an object is also marked as indefinite, the indefinite enclitic precedes the
object marker (xayār=ey=ne ‘a cucumber (obj.)’, and when marked as definite, the
definite enclitic similarly precedes the object marker (xayār=eke=ne ‘the cucum-
ber (obj.)’. However, in certain situations direct objects can be unmarked, e. g.,
berenj ‘rice’ in (58) below, and gā ‘cow’ in (84). The distribution and function of
=ne are evidently distinct from =rā, as are the historical origins of the two object
markers (the etymology of =ne is currently unclear; see the discussion in AA 2014:
89), but the contextual factors that govern the use of =ne have yet to be estab-
lished.
Bakhtiari 453
Pronominal possession is marked by attaching an enclitic pronoun (see also

§3.3.2 below) to the noun.
(1) kor=om ‘my son’
boy=1sg
(2) ri=s ‘his/her face’
face=3sg
(3) tu=toʋn ‘your (pl.) room’
room=2pl
Nominal possession is discussed along with noun phrase morphosyntax in §4.1
below.
3.1.2. Adjectives
There are many unitary adjectives in Bakhtiari, and these are structurally similar
to nouns (pati ‘empty’, taynā ‘alone’, zel ‘curious’). Adjectives are productively
derived from nouns using the suffix -i (aʋ ‘water’, aʋ-i ‘aquatic’; taš ‘fire’, taš-i
‘fiery, angry’). Nouns are also productively derived from adjectives using the
same suffix form -i (zende ‘living’, zende-i ‘life’; nā-xoš ‘ill’, nā-xoš-i ‘illness’).
Adjectives are joined to the noun they qualify using an ezāfe construction (see §4.1
below).
3.1.3. Adverbs
Adverbs are also structurally similar to nouns and adjectives. Whereas nouns and
corresponding adjectives are distinct words that can be derived from one another,
there is no formal difference between adjectives and corresponding adverbs (xoʋ
‘good’, ‘well’; tond ‘fast’, ‘quickly’; bahd ‘subsequent’, ‘later’).
3.2. Closed word classes
3.2.1. Numbers
As in most languages, numbers (yak ‘one’, do ‘two’, se ‘three’, …) form a clear
closed morphological class in Bakhtiari. When a noun is explicitly counted, it
is accompanied by a numeral classifier, most commonly =tā (do=tā gaʋu ‘two
brothers’, se=tā dadu ‘three sisters’). There is no plural marking (cf. §3.1.1) on
nouns accompanied by a number.
3.2.2. Prepositions
There is a large but closed set of prepositions in Bakhtiari. Many prepositions with
a local sense are derived from body parts (kel ‘side’>‘beside’, ri ‘face’>‘on’, pošt
‘back’>‘behind’), and still function as nouns when used in the appropriate con-
texts. Some prepositions are composed of a simple preposition and another word
(men-jā ‘between (lit. in-place)’, ʋā-ri ‘on top of (lit. with-face/on)’, ʋā-pošt ‘at
the back of (lit. with-back/behind)’. These compound prepositions in particular are
often used with an adverbial function as well:
(4) rah ʋā-pošt
go.pst .3sg behind
‘s/he went backwards’
The ezāfe construction (§4.1) is used to join some prepositions to their comple-
ments:
(5) bahd=e ho ‘after that’
after=ez that
(6) men=e sok ‘in the corner’
in=ez corner
Pronominal complements of prepositions are marked using enclitic pronouns
(§3.2.4).
(7) b=om/be=m ‘to me’
to=1sg
(8) kel=et ‘beside you (2sg.)’
beside=2sg
(9) si=s ‘for him/her’
for=3sg
(10) ze=soʋn ‘from them’
from=3pl
3.2.3. Particles
There is a small set of particles with diverse functions. These items, which are
comprised of single syllable with the historically short vowels e, a and o, can be
pronounced as words on their own, for example as single-word answers. However,
when they are used in a string of free speech they attach phonologically to the
adjacent (usually following) word. Particles include the numbers do ‘two’ and se
‘three’, prepositions be ‘to’ and ze ‘from’, the first part of the discontinuous mor-
phemes used for indefinites and demonstratives (§3.1.1), free pronouns (§3.2.4),
Bakhtiari 455
the interjection na (cf. §3.5), pa ‘then’, the coordinating conjunction (ʋ)o (§4.3.1)
and the subordinating conjunction ke ‘that’ (§4.3.2).
3.2.4. Pronouns and other person marking

Six person categories are marked in Bakhtiari morphology: first, second, and
third person, each of which can be singular or plural. Third person free pronouns,
both singular and plural, exhibit an obligatory proximal/distal distinction (thereby
functioning as demonstrative pronouns at the same time; for the demonstrative
function with nouns, see §3.1.1), although the distal form is used in neutral con-
texts as well. Table 3 below summarizes Bakhtiari person marking paradigms,
and examples of each category are given in the relevant sections throughout this
study.
Table 3: Person marking in Bakhtiari

free verb ending verb ending verb enclitic pronoun
pronoun (perfective) (non-past) ending (verbal object, pos-
and non-past (impera- session, prepositional
indicative copula tive) complement, etc.)
1sg mo -(o)m -(o)m =(o)m
2sg to -i -i -Ø =(e)t
3sg ho (dist.), -Ø -e =(e)s
yo (prox.)
1pl imā -im -im -im =(e)moʋn
2pl isā -in -in -in =(e)toʋn
3pl ono (dist.), -(e)n -(e)n =(e)soʋn
ino (prox.)
In the table above, the vowels in parentheses are generally dropped when the mor-
phemes accompany a vowel-final stem (see AA 2014: 69–90 for examples).
Reflexive pronouns are constructed using the root xo ‘self’ with enclitic pro-
nouns (xom ‘myself’’, xosoʋn ‘themselves).
3.3. Verbs
Bakhtiari verbs are of several lexical types. Some are simple (e. g., aʋorden ‘to
bring’), but many also contain additional elements such as:
– a pre-verbal prefix dar-/der- (lit. ‘in’), ʋā- (‘with’), or ʋor- (‘on’): der-
aʋorden ‘to remove’, ʋā-biden ‘to become’, ʋor-čarden ‘to crawl’;
– a pre-verbal nominal: gir aʋorden ‘to obtain’, cf. gir ‘hold (n.)’; bālā aʋorden
‘to vomit’, cf. bālā ‘above, up’; or
– a post-verbal prepositional phrase or adverb: aʋorden belam ‘to lower’, cf.

belam ‘below, down’; laġa zayden be gel ‘to stomp’, cf. laġah ‘kick (n.)’,
zayden ‘to hit’, be ‘to’, gel ‘mud’.
A number of verbs exhibit combinations of these structures (e. g., dast ʋor-aʋorden

‘to caress’, cf. dast ‘hand’; šalāq koften men=e guš ‘to slap’, cf. šalāq ‘whip (n.),
koften ‘to pound’, men=e ‘inside of’, guš ‘ear’).
Each Bakhtiari verb has two stems: a non-past stem, which is used for non-
past indicative, subjunctive, and imperative verb forms; and a past stem. Many
past stems are the same or similar to their non-past counterparts, but with an addi-
tional -id, -d or -est ending (daʋn-/doʋnid- ‘run (non-past/past)’, ʋorčar -/ʋorčard-
‘crawl (non-past/past)’, doʋn-/doʋnest- ‘know (non-past/past)’). For many other
stems, the correspondence is partially irregular (heyl-/hešt- ‘leave, let (non-past/
past)’, estoʋn-/estayd- ‘get, obtain (non-past/past)’) and, in a few cases, supple-
tive (-yā-/(o)ʋeyd- ‘come (non-past/past)’, bin-/did- ‘see (non-past/past)’). Several
very common non-past stems comprise a single consonant (-d-/dād- ‘give (non-
past/past)’, -n-/nehād- ‘put (non-past/past)’, -r-/rahd- ‘go (non-past/past)’; AA
2014: 125).
3.3.1. Non-finite verb forms

Non-finite forms include infinitives and perfect participles. Infinitives are formed
by adding the suffix -en (-an in some eastern dialects) to the past stem (xaʋsid-en
‘to sleep’, tāšid-en ‘to shave’). Infinitives are used as verbal nouns, as the follow-
ing example shows:
(11) gaʋu kočir-e ze zendei kerd-en sar tu ājez
brother young-def from life do-inf on room tired
ābid
become.pst .3sg
‘The young[er] brother became tired of living on the roof.’
Perfect participles, formed by adding the suffix -e to the past stem (see the discus-
sion of perfect verb forms under finite verb forms immediately below), are used as
adjectives (resid-e ‘ripe’, cf. resid- ‘arrive, ripen (past)’; gap ābid-e ‘grown up’,
cf. gap ābid- ‘grow, grow up (past)’ ).
3.3.2. Finite verb forms

Finite forms are inflected for person and TAM (tense/aspect/mood). There are
many irregularities in verb conjugations (see AA 2014: 121–153), as well as many
regular and irregular morphophonemic alternations (AA 2014: 69–90), and only
the most important patterns are presented here.
Bakhtiari 457
Finite verb forms built on the non-past stem include the non-past (or “present”)
indicative, subjunctive, and imperative.
The non-past indicative, which is used for basic present as well as habitual
functions, varies significantly among Bakhtiari dialects. In the dialect of Masjed
Soleyman (AA 2014), which is in focus here, this verb form is usually expressed
with an obligatory imperfective prefix e- on the non-past stem, along with per-
son-marking suffixes.
Table 4: Non-past indicative conjugation of ‘sleep’

1sg e-xaʋs-om
2sg e-xaʋs-i
3sg e-xaʋs-e
1pl e-xaʋs-im
2pl e-xaʋs-in
3pl e-xaʋs-en
For a few verbs in the Masjed Soleyman dialect, however, the prefix e- is absent
or uncommon (e. g., daʋn-om ‘I know’, tar-om ‘I am able’, torok-om ‘I walk,
wander’); and for another subset of verbs in the lexicon, the presence of this prefix
appears to be correlated with an unmarked vs. continuous non-past distinction
(e. g., ašn-om/e-y-ašn-om ‘I hear/I am hearing’, daʋn-om~e-daʋn-om ‘I run/I am
running’, heyl-om~e-heyl-om ‘I leave (tr.)/I am leaving (tr.)’), although the seman-
tics of this distinction are neither clear nor consistent.
In some Bakhtiari dialects of C&B Province, the basic non-past indicative
form is expressed using an unprefixed non-past stem, e. g., xor-om ‘I eat/I am
eating’, xor-i ‘you (sg.) eat/you (sg.) are eating’, xor-e ‘s/he eats/s/he is eating’,
etc. In the C&B dialect of Kuhrang, the prefix e- can also be used for the non-past
indicative (Tāheri 2010: 163–164), although its possible semantic contribution –
for example, to mark continuous or progressive aspect – is likewise unclear, as in
Masjed Soleyman. In Ardal, C&B, where no cognate prefix is available for the
non-past indicative, the unmarked vs. continuous/progressive distinction is sig-
nalled through the addition of the particle hay: hay xor-om ‘I am eating (cont./
prog.)’ (Taheri-Ardali, field notes 2017).
As pointed out by van der Wal Anonby (pers. comm. 2017), the glossed text
from Ardal in §5 below shows that the non-past can also be used with a mirative
extension, that is, to mark “new, unexpected, and surprising information” (van
der Wal Anonby 2011), somewhat analogous to the “dramatic present” in English
(Huddleston & Pullum 2002: 129–131). Examples of this usage to mark an excit-
ing episode in a personal account, repeated here (12–14) from the glossed text,
occur in the middle of a string of past verb forms:
(12) hālā ke bin-e hay čerā takun xor-e!

now compl see.npst -3sg prog stove shaking eat.npst -3sg
‘now she sees that the stove is shaking!’
(13) tā ʋirist-e dā en-e doʋ
until stand.npst -3sg mother put.npst -3sg running
‘so mom stands up, she starts running’
(14) pa harči ogo b-iyaʋ hune
so whatever say.pst .3sg impv -come.impv .2sg house
polof-e!
collapse.npst -3sg
she was saying over and over (lit. so she said whatever), “Come, the house
is collapsing!”
Among Iranic languages, this mirative use of the non-past differs from Persian
and Tajik, where mirativity is an extended function of the perfect (Lazard 1985:
28–29, 2001: 361), and from Kumzari, where a dedicated mirative verb form has
been identified (van der Wal Anonby 2011).
The subjunctive is marked, in Bakhtiari dialects generally, with a prefix be-
and the same person-marking suffixes as are used for the non-past indicative (the
data here is from AA 2014):
Table 5: Subjunctive conjugation of ‘sleep’

1sg be-xaʋs-om
2sg be-xaʋs-i
3sg be-xaʋs-e
1pl be-xaʋs-im
2pl be-xaʋs-in
3pl be-xaʋs-en
Some verbs are incompatible with the basic subjunctive conjugation and instead
use a periphrastic perfect participle + subjunctive ‘be’ construction.
(15) tarest-e bu-m
be_able-pf .part be.sbjnc -1sg
‘that I be able…’
The imperative is formally equivalent to the subjunctive, except that its 2sg. form
is marked with a null affix rather than the 2sg. suffix -i.
Bakhtiari 459
Table 6: Imperative conjugation of ‘sleep’

2sg be-xaʋs
1pl be-xaʋs-im
2pl be-xaʋs-in
A few common verbs have irregular 2sg. imperative forms (biyaʋ/biyā ‘come!’,
raʋ ‘go!’).
Finite verb forms built on the past stem include the simple past, imperfective past,
and the perfect. Person-marking suffixes on these forms are the same as those
used with the non-past indicative (see above), except that the 3sg. suffix is a null
morpheme.
For the simple past, the person-marking suffixes are simply attached to the
past stem.
Table 7: Simple past conjugation of ‘sleep’

1sg xaʋsid-om
2sg xaʋsid-i
3sg xaʋsi(d)
1pl xaʋsid-im
2pl xaʋsid-in
3pl xaʋsid-en
When simple past stems end in a vowel+d or glide+d sequence, this final d is
systematically omitted in the third person singular, depending on positional and
discourse-related factors (discussed further in AA 2014:93–94, 127): dā(d) ‘s/he
gave’, di(d) ‘s/he saw’, rah(d) ‘s/he went’, zey(d) ‘s/he hit’.
The imperfective past is formed by adding the prefix e-, which is also used for
the non-past indicative (see above), to the past stem.
Table 8: Imperfective past conjugation of ‘sleep’

1sg e-xaʋsid-om
2sg e-xaʋsid-i
3sg e-xaʋsi(d)
1pl e-xaʋsid-im
2pl e-xaʋsid-in
3pl e-xaʋsid-en
The perfect is formed by adding the suffix -e after the person marking on the
past stem (rahd-om-e ‘I have gone’, did-im-e ‘we have seen’). This contrasts with
ketābi (literary) Persian, where the perfect suffix -e is attached directly to the stem
(ketābi P. raft-e-am ‘I have gone’, did-e-im ‘we have seen’), but is matched by
many varieties of Southern Kurdish, which have a similar ordering of morphemes,
as well as in some NENA dialects, which likewise have a person ending followed
by what is presumably a third person form of the copula (Geoffrey Khan, Geoffrey
Haig, p.c.).
Table 9: Perfect conjugation of ‘sleep’

1sg xaʋsid-om-e
2sg xaʋsid-i-y-e
3sg xaʋsid-e
1pl xaʋsid-im-e
2pl xaʋsid-in-e
3pl xaʋsid-en-e
In some Bakhtiari varieties, the perfect is formed differently. In Ardal and some
other areas of C&B Province, for example, it is marked in the third person singu-
lar with the suffix -ak (resid-ak ‘s/he has arrived, it has ripened’, cf. resid ‘s/he
arrived, it ripened’), but for other persons there is no dedicated verb form and the
simple past is used (resid-om ‘I have arrived’/‘I arrived’). Interestingly, in these
dialects the perfect participle is formed with -e (not -ak), as is the case elsewhere
in the language area (e. g., resid-e ‘ripe’; see further examples in the discussion of
perfect participles in §3.3.1 above).
3.3.3. Complex verb forms

Important complex verb forms include the pluperfect and the progressive. In most
dialects, the pluperfect is built by attaching the person-marking suffix directly
to the past stem, followed by an invariable auxiliary bi (cf. bi ‘he/she/it was’).
Parallel to what happens with the perfect, this structure differs from other Iranic
languages such as Persian: there, person is marked on the ‘be’ auxiliary rather than
the stem. The patterning of the Bakhtiari pluperfect is illustrated in the following
example by the form jam ābid-im bi ‘we had gathered’:
Bakhtiari 461
(16) gaʋ-yal dafe niyā=i

brother-pl time previously=adjzr
ke daʋr=e yak jam ābid-im bi
compl around=ez one gathered become.pst -1pl be.aux
manesti ādomizād ičo bi o
as.if human here be.pst .3sg and
harf-ā=moʋn=e ašni o
word-pl =1sg =obj hear.pst .3sg and
‘Brothers! The previous time, when we had gathered together, it seems a
human was here and heard our conversation…’ (AA 2014: 94–95)
The formation of the progressive varies greatly among Bakhtiari varieties, with
various combinations of auxiliary, verbal repetition, and a dedicated progressive
adverbs. Progressive constructions in western dialects of Bakhtiari have not been
studied, but in varieties of C&B Province in the east, some examples of the past
progressive construction in particular, used for the construction ‘I was going’, are
as follows (Schreiber et al. 2017; Taheri-Ardali, field notes 2016):
Table 10: Attested past progressive constructions in several dialects of C&B Province:
‘I was going’
rahd-om be-r-om (Ardal)
go.pst -1sg sbjnc -go.npst -1sg
hey rahd-om be-r-om (Ardal)
prog go.pst -1sg sbjnc -go.npst -1sg
hey rah-m be-r-am (Lordegān)
prog go.pst -1sg sbjnc -go.npst -1sg
i-rahd-om be-r-om (Ardal, Lordegān)
impfv -go.pst -1sg sbjnc -go.npst -1sg
i-rah-m be-r-om (Dastenā)
impfv -go. pst -1sg sbjnc -go.npst -1sg
dāšt-om i-rah-m (Dastenā)
have.pst -1sg impfv -go.pst -1sg
dāšt-om i-rah-m be-r-om (Dastenā)
have.pst -1sg impfv -go.pst -1sg sbjnc -go.npst -1sg
3.4. The copula

In its non-past indicative form, the copula is expressed with the same person-mark-
ing morphemes as those used for non-past indicative verb forms.
Table 11: Non-past indicative copula with zarākāl ‘farmer’

1sg zarākāl=om ‘I am a farmer’
2sg zarākāl=i ‘you (sg.) are a farmer’
3sg zarākāl=e ‘s/he is a farmer’
1pl zarākāl=im ‘we are farmers’
2pl zarākāl=in ‘you (pl.) are farmers’
3pl zarākāl=en ‘they are farmers’
When the non-past indicative copula is used with a free pronoun (including
demonstrative pronouns) or an interrogative pronoun, n is inserted between the
two parts: mo-n=om ‘it’s me, yo-n=e ‘it’s him/her/it/this one’, isā-n=in ‘it’s you
(pl.)’, hami=ho-n=e ‘it’s that very one’, koye-n=im ‘where are we?’, key-n=in?
‘who are you (pl.), kay-n=e ‘when is it?’. Alternative explanations for this struc-
ture, including a possible historical relation to the differential object marker =ne
(see §3.1.1 above), are discussed in AA 2014: 88–89.
For other verb forms (subjunctive, imperative and past), the verb stem bu-/
bid- (‘be’, non-past/past) is used (zarākāl bid-i ‘you (sg.) were a farmer’, zarākāl
bu-y ‘that you (sg.), be a farmer’, zarākāl bu ‘(you sg.) be a farmer!’). A full con-
jugation of the verbal copula, which shows significant variation depending on the
speaker as well as the discourse context, is as follows:
Table 12: Conjugation of the verbal copula bu-/bid- ‘be’

subjunctive imperative past
1sg bu-m/bu-om/bu-ʋ-om bid-om
2sg bu-y/bu-i/bu-ʋ-i bu bid-i
3sg bu/bu-e/bu-ʋ-e bi/bid
1pl bu-ym/bu-im/bu-ʋ-im bu-ym/bu-im/bu-ʋ-im bid-im
2pl bu-yn/bu-in/bu-ʋ-in bu-yn/bu-in/bu-ʋ-in bid-in
3pl bu-n/bu-en/bu-ʋ-en bid-en
3.5. Verbal negation

For all verb forms, verbal negation is marked on the inflected stem with the nega-
tive prefix na-. Its patterning is straightforward in non-prefixed indicative forms,
such as the simple past, where na- is simply attached directly to the stem:
Bakhtiari 463
Table 13: Basic verbal negation

verb negative form
neyašt-om ‘I looked’ na-neyašt-om ‘I didn’t look’
torokest-om ‘I walked’ na-torokest-om ‘I didn’t walk’
xaʋsid-om ‘I slept’ na-xaʋsid-om ‘I didn’t sleep’
There are a few common verbs whose initial stem consonant is deleted, however,
sometimes along with other irregular alternations, when na- is attached.
Table 14: Contracted negative verb forms

doʋn-om ‘I knew’ na-ʋn-om ‘I didn’t know’
did-om ‘I saw’ na-yd-om ‘I didn’t see’
hed-om ‘I am (there)’ n-eyd-om ‘I am not (there)’
With subjunctive as well as imperative verb forms, na- displaces the subjunctive/
imperative prefix be- (see above).
Table 15: Negative subjunctive verb forms

be-neyar-e ‘that s/he look’ na-neyar-e ‘that s/he not look’
be-torok-e ‘that s/he walk’ na-torok-e ‘that s/he not walk’
be-xaʋs-e ‘that s/he sleep’ na-xaʋs-e ‘that s/he not sleep’
Table 16: Negative imperative verb forms

be-neyar ‘look! (sg.)’ na-neyar/mah-neyar ‘don’t look! (sg.)’
be-torok ‘walk! (sg.)’ na-torok/mah-torok ‘don’t walk! (sg.)’
be-xaʋs ‘sleep! (sg.)’ na-xaʋs/mah-xaʋs ‘don’t sleep! (sg.)’
As shown in Table (16) above, the prefix mah- is used alongside na- in prohibitive
(negative imperative) constructions.
When na- is attached to stems that begin with the non-past indicative prefix e-,
the two morphemes fuse and are realized as ney-.
Table 17: Negative non-past indicative verb forms

e-xaʋs-om ‘I sleep’ ney-xaʋs-om ‘I don’t sleep’
e-d-om ‘I give’ ney-d-om ‘I don’t give’
e-ger-om ‘I take’ ney-ger-om ‘I don’t take’
The negative prefix is in some cases found before a preverb (na-dar-garnid

‘s/he didn’t turn over (tr.)’, na-der-aʋord ‘s/he didn’t bring’), and in other cases it
intervenes between the preverb and the stem (dar-na-borid ‘s/he didn’t protest’,
der-na-xard ‘s/he didn’t turn around’). The placement of the prefix is therefore
lexically determined, and depends on the internal morphological structure of the
verb: it may even pattern in both ways when a single stem has two distinct mean-
ings (na-da-rahd ‘it didn’t set (sun)’, dar-na-rahd ‘s/he didn’t turn around’; AA
forthcoming).
3.6. Pronominal verbal objects

Pronominal verbal objects are normally marked with enclitic pronouns, which are
hosted by the inflected verb. Table 18 gives the full paradigm of object clitics, fol-
lowing a third-person singular simple past verb. Table 19 illustrates clitic objects
in combination with different values for subject marking. In light verb construc-
tions, the clitic pronoun is hosted by the nominal element (see (20) below):
Table 18: Pronominal object marking on 3sg. past verb forms

did=om ‘s/he saw me’
did=et ‘s/he saw you (sg.)’
did=es ‘s/he saw him/her/it’
did=emoʋn ‘s/he saw us’
did=etoʋn ‘s/he saw you (pl.)’
did=esoʋn ‘s/he saw them’
Table 19: Pronominal object marking with different person values for subject
did-om=et ‘I saw you (sg.)’
did-i=m ‘you (sg.) saw me’
did-im=es ‘we saw him/her/it’
did-im=etoʋn ‘we saw you (pl.)
did-in=esoʋn ‘you (pl.) saw them’
did-en=emoʋn ‘they saw us’
We are not aware of any significant differences between the tenses with regard
to the deployment of object clitic pronouns; that is, the paradigms for clitic pro-
nouns given in Tables 18 and 19 are essentially the same with a non-past verb form
as host. Bakhtiari thus seems to lack tense sensitivity in alignment, resembling
Persian in this regard (Haig 2008), though more research on Bakhtiari syntax is
needed.
When both direct object and indirect object are found as pronouns, there is
a constraint against marking both with enclitic pronouns: typically, one of the
Bakhtiari 465
objects is dropped (17, 18) or the direct object is retained as a full pronoun in
pre-verbal position (19).
(17) dād-om=es
give. pst -1 sg =3 sg
‘I gave [the recipient] it’ or ‘I gave him/her [it]’
(18) dād-om=et
give. pst -1 sg =2 sg
‘I gave you [it]’
(19) ho=ne dād-om=et
3sg . dist = obj give. pst -1 sg =2 sg
‘I gave you it’
In light verb constructions, the enclitic pronouns are attached to the nominal com-
ponent of the construction.
(20) raʋne=s kerd
sending_away=3sg do.pst .3sg
‘s/he sent him/her away’ (AA 2014: 99)
4. Syntax
The present study focuses on three topics central to Bakhtiari syntax: the ezāfe con-
struction (§4.1), clause constituent order (§4.2), and clause linkage (§4.3), which
includes coordination and subordination. Other constructions with importance for
both syntax and morphology have been integrated into the preceding discussion;
for example, noun affixes marking definiteness, demonstrative functions, and dif-
ferential objects are discussed under noun morphology (§3.1.1).
4.1. The ezāfe construction

The ezāfe (associative) construction is used in Bakhtiari to form complex phrases
of various types, including noun+noun, noun+adjective, and preposition+adjective
phrases. The phrasal head is always on the left periphery of the phrase. When the
noun phrase head is consonant-final, the associative morpheme =e is attached to it.
(21) doʋdar=e pādešā ‘king’s daughter’
daughter=ez king
When the noun phrase head is vowel-final, however, there is no overt associative
morpheme.
(22) loʋne peleštuk ‘swallow (bird) nest’

nest swallow
Noun+adjective (23, 24) as well as preposition+noun (25, 26) ezāfe constructions
are formed in the same way.
(23) xayāl=e āside ‘peace of mind (lit. unworried thoughts)’
thought=ez unworried
(24) gā gap ‘big cow’
cow big
(25) neheng=e aftaʋdaroʋ ‘close to sunset’
near=ez sunset
(26) ze zemin ‘from [the] ground’
from ground
When the non-head element in such constructions is a pronoun rather than a noun,
enclitic pronouns, which attach directly to the phrasal head, are used (§3.2.2,
§3.2.4).
4.2. Clause constituent order

For clauses with an overt subject (whether a noun or a free pronoun) and a verb,
the default order is SV (subject-verb).
(27) piyā rahd
man go.pst .3sg
‘[the] man went’
(28) mo rahd-om
I go.pst -1sg
‘I went’ (with “I” as a new topic)
When a noun or pronoun has already appeared as the subject of a previous sen-
tence, it is typically dropped in the subsequent sentence. The identity of the subject
is still clear, however, from person-marking on the verb.
(29) rahd
go.pst .3sg
‘s/he went’
(30) rahd-om
go.pst -1sg
‘I went’ (with subsequent mention of “I”)
Bakhtiari 467
In clauses with a nominal object, the default order is (S)OV (31–33); the direct
object marker =ne is introduced in §3.1.1 above.
(31) piyā hoʋne=ne xeri
man house=obj buy.pst .3sg
‘[the] man bought the house’ (cf. AA 2014: 113)
(32) doʋar=eke siʋe=ka=ne xa
girl=def apple=def =obj eat.pst .3sg
‘the girl ate the apple’ (Ardal dialect)
(33) hoʋne=ne xerid-om
house=obj buy.pst -1sg
‘I bought [the] house’
Nominal subjects and objects that are emphasized, however – for example, in
contrast with another possible noun phrase – may move to post-predicate (i. e.,
post-verbal) position.
(34) siʋe=ka=ne xa doʋar=eke
apple=def =obj eat.pst .3sg girl=def
(35) doʋar=eke xa siʋe=ka=ne
girl=def eat.pst .3sg apple=def =obj
(36) xa siʋe=ka=ne doʋar=eke
eat.pst .3sg apple=def =obj girl=def
Pronominal objects, which are expressed using enclitic pronouns attached to the
predicate, are discussed separately in §3.6.
The indirect object of a verb of transfer usually follows the predicate. The
word das (lit., ‘hand’) optionally introduces the recipient.
(37) setāre čaġu=ne dā (das) Borjali
Setare knife=obj give.pst .3 sg (hand) Borjali
‘Setare gave Borjali the knife’ (Ardal dialect)
As is the case for recipients, goals of motion verbs likewise follow the predicate,
as in the following (see (43) for additional post-predicate goals).
(38) rahd sar=e tu
go.pst .3sg head/on=ez room/house
‘s/he went onto the roof (lit. head of [the] room/on [the] room)’
(AA 2014: 107)
Addressees, however, precede the verb and are introduced with the preposition ze
‘from’ (ez in some dialects). (Further examples of citations are found in the discus-
sion on subordination in §4.3.2 below.).
(39) setāre ez Borjali ogo unun duš ičo
Setare from Borjali say.pst .3sg 3 pl . dist yesterday here
bi-y-en
be.pst -y-3 pl
‘Setare said to Borjali, “They came here yesterday”.’ (Ardal dialect)
With regard to post-predicate elements, Bakhtiari thus follows the pattern of Vafsi
(Stilo 2010) and the northwesterly dialects of Northern Kurdish (Haig 2017), in
that recipients and goals of verbs of motion follow the predicate, while addressees
precede the predicate.
4.3. Linkage
4.3.1. Coordination
A range of constructions are coordinated with the particle (ʋ)o ‘and’. This particle
(cf. §3.2.3 above), which can attach phonologically to the preceding or the follow-
ing word, is transcribed here separately. When (ʋ)o attaches to a vowel-final word,
it is realized as ʋo (dā ʋo baʋu ‘parents (lit. mother and father)’), and it appears
as o elsewhere.
In the following example (40), (ʋ)o is used to coordinate noun phrases (here
represented by the nouns bamr o pelang ‘tiger and leopard’), verb phrases (gā
gapeyne šekāl kerden o aʋorden ‘hunted and brought a big cow’), and clauses
(bamr o pelang rahden o gā gapeyne šekāl kerden o aʋorden ‘tiger and leopard
went and hunted and brought a big cow’).
(40) bamr o pelang rahd-en o gā gap=ey=ne
tiger and leopard go.pst -3pl and cow big=indef =obj
šekāl kerd-en o aʋord-en
hunting do.pst -3pl and bring.pst -3pl
‘[The] tiger and [the] leopard went and hunted a brought [back] a big cow.’
(AA 2014: 101)
The particle (ʋ)o can also be used to coordinate independent sentences, as the fol-
lowing example (41) shows:
Bakhtiari 469
(41) hakāat=e xers ke tamom ābi

story=ez bear compl finished become.pst .3sg
šeyr go
lion say.pst .3sg
hame=moʋn ājez=im.
all=1pl tired=be.1pl
‘When [the] bear’s story came to end, the lion said: “We are all tired”.’
si amšaʋ diye bas=e
for tonight other enough=be.3sg
‘“That’s enough for tonight”.’
o hame ʋoreystā-n xaʋsid-en.
and all get_up.pst -3pl sleep.pst -3pl
‘And they all got up and went to sleep.’
The end of the last sentence of (41) (ʋoreystān xaʋsiden ‘they got up, they went
to sleep’) also shows that verb phrases or clauses, which are in this case indistin-
guishable, can be coordinated through juxtaposition.
The conjunction amā ‘but’ is used for adversative coordination of clauses (42)
as well as sentences (43).
(42) yak=i ze i do gaʋu
one=indef from this two brother
ādom=e sāf o sādoq=ey bi,
person=ez smooth and honest=indef be.pst .3sg
amā u yak=i rendal bi.
but that one=indef crafty be.pst .3sg
‘One of these brothers was a simple person, but the other one was crafty.’
(AA 2014: 96)
(43) aʋal=es gemoʋn kerd
first=3sg imagination do.pst .3sg
doz-oʋn oʋayd-en-e men=e āsiyaʋ
robber-pl come.pst -3pl -prf in=ez mill
tā čiy-ā dozi=ne bayn=e xo=soʋn
until thing-pl stolen(adj.)=obj between=ez self=3pl
bahr bo-kon-en.
portion sbjnc -do.npst -3pl
‘First, he thought the robbers had come inside the mill to divide the stolen
things among themselves.’
amā xoʋ ke niyašt,

but well compl look.pst .3sg
di ya ruʋā=yey oʋayd ʋāibā
see.pst .3sg a fox=indef come.pst .3sg within
o ʋā din=es āsiyaʋ=ne jāru kerd o
and with tail=3sg mill=obj broom do.pst .3sg and
nešast ya guše=ey.
sit_down.pst .3sg a corner=indef
‘But when he looked carefully, he saw a fox that came inside and swept the
mill with its tail and sat down in a corner.’ (AA 2014: 100–101)
4.3.2. Adverbial clauses

In narrative texts, clauses that depict a prior event or condition commonly precede
the main clause. They often contain the particle ke, which then immediately
precedes the verb of the adverbial clause. This may be seen in the preceding
example (43) (amā xoʋ ke niyašt ‘but when he looked closely’), as well as the
following sentence (44):
(44) ya modat=ey ke godašt
a while=indef compl pass.pst .3sg
gaʋu kočir=e […] ājez ābid
brother young=def tired become.pst .3sg
‘a while having passed, the younger brother […] became tired’
(AA 2014: 98–99)
Other adverbial complementizers include simple conjunctions such as ayar ‘if’
(45) or more complex conjunctive constructions – often containing ke: ʋaxtey ke
‘when (lit. a time that)’ (46) or bahde yo ke ‘after (lit. after this that)’ (47), which
occur at the beginning of the adverbial clause.
(45) ayar ādomizād doʋnest
if human know.pst .3sg
e-rahd o u hoʋne=ne e-xeri o
impfv -go.pst .3sg and that house=obj impfv -buy.pst .3sg and
‘If a human knew, s/he would go and would buy that house …’ (AA
2014: 105–106)
Bakhtiari 471
(46) ʋaxt=ey ke hame xoʋ seyr ābid-en,

time=indef compl all good full become.pst -3pl
šeyr go
lion say.pst .3sg
xāʋ amšaʋ key hakāat=e aʋal=e e-go?
well tonight who story=ez first=obj impfv -say.npst .3sg
‘When all of them became completely satisfied, the lion said: “Well, who is
going to tell the first story tonight?”’ (AA 2014: 101–102)
(47) bahd=e yo ke hakāat=e gorg be āxer rasi,
after=ez 3sg .prox compl story=ez wolf to end arrive.pst .3sg
xers sine xo=ne sāf kerd
bear chest self=obj smooth do.pst .3sg
‘After the wolf’s story came to end, the bear cleared its throat…’ (AA
2014: 105)
While many adverbial clauses precede the matrix clause, as shown in the preced-
ing examples, clauses expressing purpose or subsequent action are found after the
matrix clause.
(48) piyā rah
man go.pst .3sg
tā rasi be jā=yey
until arrive.pst .3sg to place=indef
ke xers god-e bi.
compl bear say.pst .3sg -prf be.aux
‘The man went until he got to the place that the bear had said.’
Direct speech (49, 50) and perceptual complement clauses (51) also follow the
matrix clause. Both are introduced without a complementizer.
(49) pah go biy-aʋ
so say.pst .3sg impv -come.impv .2sg
‘So he said, “Come…”’ (AA 2014: 96–97)
(50) ho ogo amšaw iyā-m
3sg .dist say.pst .3sg tonight come.npst -1sg
‘S/he said, “I am coming tonight”.’ (Ardal dialect)
(51) di ze ro tu tā āsemoʋn gaf-tar=e.
see.pst .3sg from top room until sky big-cmpv =be.3sg
‘He saw that [the part] from the roof to the sky was larger’. (AA 2014: 98)
Semi-direct (52) and indirect (53) citations, on the other hand, use the particle ke.
(52) ho go ke amšaw iyā-m

3sg .dist say.pst .3sg that tonight come.npst -1sg
‘S/he said that s/he is coming tonight.’ (semi-direct speech) (Ardal dialect)
(53) ho go ke amšaw iyā
3sg .dist say.pst .3sg that tonight come.npst .3sg
‘S/he said that s/he is coming tonight.’ (indirect speech) (Ardal dialect)
5. Glossed text: zelzeleā ‘The earthquake’
The following text is from the dialect of Ardal, in C&B Province. Ardal is at the
eastern end of the Bakhtiari language area and, as shown in §2.1 and elsewhere
above, the dialects of this region display some additional features in common
with Persian and other Southwestern dialects of the Iranian Central Plateau. The
text is a personal account told by Homa Asadi-Ardali, a 60 year-old woman, who
describes her experience during an earthquake forty years earlier, in 1977. Here,
she is speaking to her nephew in the context of a larger audience of relatives.
(54) hebdah ruz bahd ez ayd bi baʋ=m,
seventeen day after from festival be.pst .3sg father=1sg
sāat čār bi.
hour four be.pst .3sg
‘It was seventeen days after [the Now Ruz] festival, my dear (lit. my
father), it was four o’clock.
(55) čār bi, di šom=emun=am nād-im –
four be.pst .3sg again dinner=1pl =add put.pst -1pl
It was four, and we had put dinner on [to cook] –
(56) uso doʋre čerā bid=o gāz.kapsuli bi –
then period stove be.pst .3sg =and gas.cylinder be.pst .3sg
back then it was the time of stoves (i. e., burners) and gas cylinders –
(57) nād-im=eš sar čerā.
put.pst -1pl =3sg on stove
we put it on the stove.
(58) gušt=en=am sohr kerd māmā kemutar,
meat=obj =add red make.pst .3sg grandmother Kemutar
xāst berenj bo-kon-e pā=š.
want.pst .3sg rice sbjnc -make.npst -3sg foot=3sg
Grandma Kemutar fried the meat and was going to make rice to go with it.
Bakhtiari 473
(59) di ya takun=i …
again indef shaking=indef
Then a shaking …
(60) hālā ke bin-e hay čerā takun xor-e!
now compl see.npst -3sg prog stove shaking eat.npst -3sg
now she sees that the stove is shaking!
(61) ogod-om dā, hay čerā takun xor-e!
say.pst -1sg mother prog stove shaking eat.npst -3sg
I said, “Mom, the stove is shaking!”,
(62) ogo na! zelzele-ā ġedim=e,
say.pst .3sg no earthquake-pl ancient=3sg
she said, “No! Earthquakes are [something that used to happen] long ago”,
(63) hay ogo barkat.
prog say.pst .3sg blessing
she was saying, “Bless me (lit. blessing)”.
(64) ogod-om hay dā barkat di xelās –
say.pst -1sg prog mother blessing again finished
I said, “Mom, right now that blessing belongs to the past (lit. blessing
finished again) –
(65) yo hay takun ed-e!
3. sg . prox prog shaking give.npst -3sg
it’s shaking!”
(66) tā ʋirist-e dā en-e doʋ
until stand.npst -3sg mother put.npst -3sg running
er-e duʋen,
go.npst -3sg down/outside
So mom stands up, she starts running, she goes down outside,
(67) ogod-om dā!,
say.pst -1sg mother
I said, “Mom!”,
(68) pa harči ogo b-iyaʋ hune
so whatever say.pst .3sg impv -come.impv .2sg house
polof-e!
collapse.npst -3sg
she was saying over and over (lit. so she said whatever), “Come, the house
is collapsing!”
(69) ogod-om na! tā mahtāb=e na-yār-om,

say.pst -1sg no until Mahtab=obj neg -bring.npst -1sg
n-iyā-m.
neg -come.npst -1 sg
I said, “No! If I don’t take Mahtab out, I won’t come.
(70) mahtāb hamučo ʋas mi ārd-ā.
Mahtab right_there fall.pst .3sg in flour-pl
Mahtab fell right there into the flour.
(71) tāpu ārd-i=mun katāyun mi ya ini
clay.pot flour-adjzr =1pl Katayun in one what’s_it_called
bi jā … komod-i,
be.pst .3sg place dresser-indef
Our clay pot for flour, Katayun [an addressee in the audience], was in a
what’s-it-called, clothes … alcove (lit. place … of dresser),
(72) oʋay ʋas pišġaʋi.
come.pst .3sg fall.pst .3sg spread_all_over.pst .3sg
it happened that it had fallen and spilled out all over.
(73) mahtāb faġir ʋast min=eš,
Mahtab poor fall.pst .3 sg in=3sg
Mahtab, poor thing, had fallen into it,
(74) mahtāb=e bord-om,
Mahtab=obj carry.pst -1sg
I took Mahtab,
(75) ġand-ā=ne bord-om,
rock.sugar-pl =obj carry.pst -1sg
I took rock sugar chunks,
(76) legan.nuni=ne bord-om …
bread.box=obj carry.pst -1sg
I took the bread box …
(77) šaʋ ogo ʋā na-ʋāst-in.
night say.pst .3sg must neg -stand.npst -2pl
[When it was] night, they (lit. s/he) said, “You (pl.) mustn’t stay”.
(78) hami inā ke bid-en
this_same these compl be.pst -3pl
oʋayd-en si masan komak bo-kon-en,
come.pst -3pl for for_example help sbjnc -do.npst -3pl
All these [people] who were there came to, you know, help,
Bakhtiari 475
(79) go na-ʋāst-in ičo ʋā be-r-in

say.pst .3sg neg -stand.npst -2pl here must impv -go.npst -2pl
sar šāzeaziz.
head/on Shaze.Aziz
they said, “You mustn’t stay here, you have to go over to Shaze-Aziz
[shrine]”.
(80) šaʋ rahd-im sar šāzeaziz anul faġir=am bi.
night go.pst -1pl head/on Shaze.Aziz Anul poor=add be.pst .3sg
In the night we went over to Shaze-Aziz, poor Anul was [with us] as well.
(81) ez hami šomi hay zemin takun xa
from this_same dinnertime prog earth shaking eat.pst .3sg
xodā=am hay bārun zay.
God=add prog rain hit.pst .3sg
Right from dinnertime the earth was shaking, rain was pouring down (lit.
God was raining).
(82) hay zayd zayd zayd tā dam=e
prog hit.pst .3sg hit.pst .3sg hit.pst .3sg until beginning=ez
sohʋ …
morning
It was raining, raining, raining till the break of dawn …
(83) gusiyal=mun=am mi madrese.kār.dāneš i.duʋen bid-en.
sheep.pl =1pl =add in technical.school down be.pst -3pl
The sheep were in the technical school down below.
(84) dah-bis=tā miš dāšt-im, do-se=tā gā
ten-twenty=num ewe have.pst -1pl two-three=num cow
dāšt-im, do … ya jof ʋarzā dāšt-im …
have.pst -1pl two one pair bulls have.pst -1pl
We had ten or twenty ewes, we had two or three cows, two … we had one
pair of oxen …
(85) bāʋā hamziy=am šixun bi.
grandfather Hamzi=add Shikhun be.pst .3sg
Grandpa Hamzi was in Shikhun.
(86) šixun bi, hā hā … be.bahd …
Shikhun be.pst .3sg yeah yeah afterward
He was in Shikhun, mhmm … after …
(87) gusiyal=i ke ʋel kerd-im xo=šun rahd-en.

sheep.pl = spec compl release do.pst -1pl self=3pl go.pst -3pl
When we freed the sheep, they themselves went out.
(88) ra[hd-en] harki bord=ešun min amu
go.pst -3pl whoever carry.pst .3sg =3pl in that_same
madrese.kār.dāneš bači-yal bord=ešun.
technical.school child-pl carry.pst .3sg =3pl
[they went,] someone (lit. whoever) took them into that same technical
school, the young people took them.
(89) hālā harki bord=ešun bord=ešun učo …
now whoever carry.pst .3sg =3pl carry.pst .3sg =3pl there
Well, whoever took them, took them there …
(90) tā haf ruz=am xeʋar ne-dāšt-im ez=ešun.
until seven day=add news neg -have.pst -1pl from=3pl
For seven days we still had no news of them.
(91) hič na dunest-im gā koja-n=e na či …
any not know.pst -1pl cow where-n=3sg not what
We didn’t know where the cows were, or anything (lit. any of us neither
knew where the cows are, nor anything) …
(92) hālā ke bāʋā hamzi oʋay
now compl grandfather come.pst .3sg come.pst .3sg
māmā kemutar rah jeloʋ=š ogo
grandmother Kemutar go.pst .3sg in_front=3sg say.pst .3sg
pa hamzi mand-i hamičo, bači-yal=om xu bi
so Hamzi stay.pst -2sg right_here child-pl =1sg good be.pst .3sg
na-mord-en!
neg -die.pst -3pl
As Grandpa Hamzi arrived Grandma Kemutar went up to him and said to
him, “So Hamzi, you stayed right here?! Our (lit. my) children were fine –
they didn’t die!”
(93) rahd-en zir ogo na, hiči=šun
go.pst -3pl down say.pst .3sg no something=3pl
na-ʋaʋi!
neg -become.pst .3sg
They went down, he said, “No! Nothing happened!”
Bakhtiari 477
(94) bāʋā hamzi ez pele-ā er-e

grandfather Hamzi from stair-pl go.npst -3sg
er-e hamučo
go.npst -3sg right_there
ogo na, b-iyo hamičo be-nešin-im.
say.pst .3sg no impv -come.impv .2sg right_here impv -sit.npst -1pl
Grandfather Hamzi goes to the steps, he goes right over here, he said, “No!
Come and sit right here”.
(95) nešast ba=am hamučo xaʋsi mi tuʋ-ā
sit.pst .3sg father=1sg right_there sleep.pst .3sg in room-pl
ogo hiči=šun ni.
say.pst .3sg anything=3pl be.pst .neg .3sg
My father sat right there, he lay down in the house, he said, “Everything is
okay (lit. their anything is not)”.’
Abbreviations
1 first person neg negative

2 second person npst non-past
3 third person num numeral classifier
AA Anonby & Asadi (see O/obj object
References) P. Persian
add additive marker part participle
adj. adjective pf perfect
adjzr adjectivizer pl plural
AT Anonby & Taheri-Ardali prox proximal
(see References) pst past
aux auxiliary S subject
B. Bakhtiari sbjnc subjunctive
C consonant sg singular
C&B Chahar Mahal va Bakhtiari spec specifier
cmpv comparative tr. transitive
compl complementizer V vowel; verb
cont. continous V̄ long vowel
def definite voc vocative
dist distal ~ varies between
ez ezāfe (associative morpheme) - morpheme boundary
impfv imperfective = clitic attachment
impv imperative ‘ stress/accent
indef indefinite . syllable boundary
n. noun
References
Abbott, Barbara. 2004. Definiteness and indefiniteness. In Laurence R. Horn & Gregory
Ward (eds.), Handbook of pragmatics, 122–149. Oxford: Blackwell.
Abdullaeva, Firuza. 2009. Zhukovskiĭ, Valentin Alekseevich. Encyclopædia Iranica. New
York: Center for Iranian Studies, Columbia University. http://www.iranicaonline.org/
articles/zhukovskii-valentin-alekseevich (accessed 25 September 2017).
Afsar Bakhtiyāri, Dārāb. 1965/1344. Montakhabāti az aš’ār-e šā’er-e farzāneh Dārāb Af-
sar Bakhtiāri [A selection from the poems of the eminent poet Dārāb Afsar Bakhtiyāri],
ed. by Abd-ol-Hossein Sepantā. Tehran: Mash’al.
Amanollahi, Sekandar. 1991/1370. Qowm-e Lor [The Luri people]. Tehran: Āgāh.
Anonby, Erik. 2003. Update on Luri: How many languages? Journal of the Royal Asiatic
Society 13(2). 171–97.
Anonby, Erik. 2012. Sociolinguistic status of Lori. Encyclopædia Iranica. New York:
Center for Iranian Studies, Columbia University.
http://www.iranicaonline.org/articles/lori-language-ii (accessed 25 September 2017).
Anonby, Erik. 2016. Mapping the languages of western Iran’s Zagros Mountains. Network
Meeting of the Alexander von Humboldt Foundation, University of Freiburg, 5–7 Oc-
tober, 2016.
Anonby, Erik. 2017. The languages of Iran: A working classification. In AT 2015–18.
(AA 2014=) Anonby, Erik & Ashraf Asadi. 2014. Bakhtiari studies: Phonology, text, lexi-
con. Studia Iranica Upsaliensia 24. Uppsala: Acta Universitatis Upsaliensis.
(AA forthcoming=) Anonby, Erik & Ashraf Asadi. (forthcoming). Bakhtiari studies: Bakh-
tiari orthography. Studia Iranica Upsaliensia. Uppsala: Acta Universitatis Upsaliensis.
(AT 2015–18=) Anonby, Erik & Mortaza Taheri-Ardali (eds.). 2015–18. Atlas of the lan-
guages of Iran. Ottawa: GCRC (Geomatics and Cartographic Research Centre), Carle-
ton University. http://iranatlas.net (accessed 25 September 2017).
Anonby, Erik & Mortaza Taheri-Ardali. 2017. Chahar Mahal va Bakhtiari Province in the
Atlas of the Languages of Iran. Paper presented at the 1st North American Conference in
Iranian Linguistics (NACIL1), Stony Brook University, New York, 28–30 April, 2017.
Dāvari, Bahrām. 1964/1343. Zarb-ol-masalhā-ye Bakhtiāri [A collection of Bakhtiari prov-
erbs]. Tehran: Tahuri.
Digard, Jean-Pierre. 1988. Baḵtīārī tribe: Ethnography. Encyclopædia Iranica 3(5). 553–
560. New York: Center for Iranian Studies, Columbia University. http://www.iranicaon
line.org/articles/baktiari-tribe#pt1 (accessed 25 September 2017).
Digard, Jean-Pierre. 1992. Clothing of the Baḵtīārīs and other Lori speaking tribes. En-
cyclopædia Iranica 5.8, 852–854. New York: Center for Iranian Studies, Columbia
University. http://www.iranicaonline.org/articles/clothing-xxv (accessed 25 September
2017).
Forutan, Fariborz. 1996/1375. Ganjine-ye masalhā-ye Bakhtiyāri [A treasury of Bakhtiari
proverbs]. Tehran: Enteshārāt-e Khuzestān.
Haig, Geoffrey. 2017. Post-predicate arguments in Iranian languages. Paper presented at the
7th International Conference on Iranian Linguistics (ICIL7), Moscow State University,
28–30 August, 2017.
Bakhtiari 479
Huddleston, Rodney & Geoffrey K. Pullum (eds.). 2002. The Cambridge grammar of the
English language. Cambridge: Cambridge University Press.
Ittig, Annette. 1988. Baḵtīārī tribe: Baḵtīārī carpets. Encyclopædia Iranica 3(5). 553–560.
New York: Center for Iranian Studies, Columbia University. http://www.iranicaonline.
org/articles/baktiari-tribe#pt3 (accessed 25 September 2017).
Kerimova, Aza A. 1982. Lurskie i Bakhtiyarskie dialekty [Luri and Bakhtiari dialects].
In Vera S. Rastorgueva (ed.), Osnovy iranskogo iazykoznaniia iazyki III. Novoiran-
skie iazyki: zapadnaia gruppa, prikaspiiskie iazyki [Fundamentals of Iranian languages
III: New Iranian languages – Western group, Caspian languages], 287–315. Moscow:
Nauka.
Khosravi, Abdol’ali. 1996/1375. Guyesh-e Bakhtiāri: Ketāb-e chahārom [Bakhtiari dialect:
Fourth book]. Esfahan: Nashr-e Ghazal.
Khosraviniyā, Javād. 2010/1389. Shāhnāme Bakhtiyāri [Bakhtiari Shahnameh]. Esfahan:
Shahsavāri.
Khosraviniyā, Javād. 2011/1390. Gowgari [Brotherhood]. Shahr-e Kord: Niyosheh.
Kiyāni Haft Lang, Kiyānush. 2000/1378. Zarb-ol-masalhā-ye Bakhtiāri [Bakhtiari prov-
erbs], 2nd edn. Ahvaz: Enteshārat-e Khuzestān.
Lazard, Gilbert. 1985. L’inférentiel ou passé distancié en persan. Studia Iranica 14(1).
27–42.
Lazard, Gilbert. 2001. On the grammaticalization of evidentiality. Journal of Pragmatics
33(3), 359–367.
Lorimer, David L. R. 1922. The phonology of the Bakhtiari, Badakhshani, and Madaglashti
dialects of modern Persian, with vocabularies. London: Royal Asiatic Society.
Lorimer, David L. R. 1930. A Bakhtiari prose text. Journal of the Royal Asiatic Society,
April 1930, 347–364.
Lorimer, David L. R. 1954/1955/1963. The popular verse of the Bakhtiari of southwestern
Persia I/II/III. Bulletin of the School of Oriental and African Studies 16/17/26. 542–
555/92–110/55–68.
Lorimer, David L. R. 1964. A Bakhtiari Persian text. In Georges Redard (ed.), Indo-Iranica:
Mélanges présentés à Georg Morgenstierne, 129–133. Wiesbaden: Harrassowitz.
Lorimer, David L. R. & E. O. Lorimer. 1919. Persian tales, written down for the first time
in the original Kermani and Bakhtiari. London: Macmillan.
MacKinnon, Colin. 2011. Lori dialects. Encyclopædia Iranica 18. New York: Center for
Iranian Studies, Columbia University.
http://www.iranicaonline.org/articles/lori-dialects (accessed 25 September 2017).
Madadi, Zohrāb. 1996/1375. Vāzhehnāmeh-ye zabān-e Bakhtiāri [Lexicon of the Bakhtiari
language]. Tehran & Esfahan: Enteshārāt-e Āyāt.
Mann, Oskar. 1910. Die Mundarten der Lur Stämme im südwestlichen Persien, Berlin:
Georg Reimer.
Perry, John R. 1996. Persian in the Safavid period: Sketch for an Etat de langue. In Charles
Melville (ed.), Safavid Persia: The history and politics of an Islamic society, 269–283.
London: I.B. Tauris.
Ra’isi, Dārāb. 1996/1375. Golbāng-e Bakhtiyāri [Cry of the Bakhtiari], 3rd edn. Shahr-e
Kord: Enteshārat-e Il.
Schreiber, Laurentia, Mortaza Taheri-Ardali, Geoffrey Haig & Erik Anonby. 2017. Con-
tact-induced change in Irano-Turkic morphosyntax. Paper presented at the Conference
on Language Contact and Language Change in Western Asia, Goethe University Frank-
furt, 10–12 March, 2017.
Shirin Bayān, Sa’id. 2007/1386. Yek eyl, yek sedā [One tribe, one voice]. Tehran: Serāj-e
Nur.
Stilo, Donald. 2010. Ditransitive constructions in Vafsi: A corpus-based study. In Andrej
Malchukov, Martin Haspelmath & Bernard Comrie (eds.), Studies in ditransitive con-
structions: A comparative handbook, 243–276. Berlin & Boston: De Gruyter Mouton.
Tāheri, Esfandyār. 2010/1389. Guyesh-e Bakhtiāri-ye Kuhrang [The Bakhtiari dialect of
Kuhrang]. Tehran: IHCS Press.
Taheri-Ardali, Mortaza. 2017. Salient linguistic features and isoglosses in Chahar Mahal
va Bakhtiari Province, Iran. Paper presented at the 1st North American Conference in
Iranian Linguistics (NACIL1), Stony Brook University, New York, 28–30 April, 2017.
Taheri-Ardali, Mortaza. 2015. Guyesh-e Bakhtiāri: zavāl yā baghā? Negāhi digar be kār-
bord-e guyesh-e Bakhtiāri, guneh-ye Ardal [Bakhtiari dialect: decline or survival? A
re-examination of the usage of the Bakhtiari dialect, Ardal variety]. Paper presented
at the International Conference on Linguistic Heritage, Tehran, 20–21 October, 2015.
Taheri-Ardali, Mortaza & Erik Anonby. 2017. Language distribution in Chahar Mahal va
Bakhtiari Province of Iran. Paper presented at the 1st International Conference on Lan-
guages, Dialects and Linguistics (ICCILDL1), Pazhoheshgaran Andishmand Institute,
Ahvaz, Iran, 2–3 February, 2017.
Wal Anonby, Christina van der. 2011. Remarks on the remarkable: Mirativity in Kumzari.
Paper presented at the 4th International Conference on Iranian Linguistics (ICIL4),
Uppsala University, Uppsala, 17–19 June, 2011.
Windfuhr, Gernot. 1988. Baḵtīārī tribe: The Baḵtīārī dialect. Encyclopædia Iran-
ica 3(5). 553–560. New York: Center for Iranian Studies, Columbia University.
http://www.iranicaonline.org/articles/baktiari-tribe#pt2 (accessed 25 September 2017).
Windfuhr, Gernot. 2009. Dialectology and topics. In Gernot Windfuhr (ed.), The Iranian
languages, 5–42. London & New York: Routledge.
Zhukovskij, Valentin A. 1923. Materialy dlya izucheniya persidskikh narechij, pt. 3: Nar-
echie bakhtiyarov chekharleng i kheftleng [Materials for the study of Persian dialects,
part 3: The dialect of the Chahārlang and Haftlang Bakhtiari], ed. by S. F. Ol’denburg.
Petrograd.
Zhukovskij, Valentin A. 1923/2017. Mavāddi barā-ye motāle’eh-ye guyesh-e Bakhtiāri
[Materials for the study of the Bakhtiari dialect], transl. and ed. by Maryam Shafaghi
and Seyyed Mehdi Dadras. Tehran: Allameh Tabataba’i University Press.
Zolfaghari, Sima. 2017. The Bakhtiaris: An anthropological-linguistic lexical study of Haft
Lang nomads of southwest Iran. Leiden: Leiden University dissertation.
4.4. The Neo-Aramaic dialects of western Iran
Geoffrey Khan
The Neo-Aramaic dialects that are the subject of this chapter belong to two sub-
groups. They include:
(i) dialects belonging to the North-Eastern Neo-Aramaic (NENA) subgroup,
which were spoken in the Kordestan and Kermanshah provinces of western
Iran until the middle of the twentieth century.1
(ii) The neo-Mandaic subgroup of dialects, which were spoken further south in
the towns of Ahvāz and Khorramshahr in Khuzestan province of western Iran.
The NENA dialect area includes also the region of northern Iraq (chapter 3.4)
and the region of southeastern Turkey and northwestern Iran (chapter 2.5). The
NENA dialects of these latter two regions exhibit considerable diversity, as has
been shown in the chapters devoted to them. The NENA dialects of the region
described in this chapter, by contrast, exhibit very little diversity. The main split
is between Jewish and Christian varieties of NENA. The Jewish variety consists
of a cluster of dialects spoken by Jewish communities in various localities in an
area that includes Sainqala, Bokan, Săqəz on its northern border, Sanandaj in the
centre, Bijar on the eastern border, and in the south Kerend and Qasr-e Širin. The
Christians variety consists of a single dialect spoken by Christians in the town of
Sanandaj.
The Jewish cluster of dialects is remarkably uniform and only minor differ-
ences are found among the dialects of the aforementioned places where the dialects
were spoken. Most of the data on this variety of NENA for this chapter are taken
from the Jewish dialect of Sanandaj (henceforth referred to as J. Sanandaj),2 which
has been described in detail in Khan (2009). Studies on other Jewish dialects of
this cluster include those by Israeli (1998) on the dialect of Săqəz and by Hopkins
(2002) on Kerend. The Jewish dialects of the region belong to the so-called trans-
Zab subgroup of Jewish NENA. Within trans-Zab they are most closely related
to the Jewish dialect of Sulemaniyya in northeastern Iraq (chapter 3.4 and Khan
2004). Their relationship to the Jewish trans-Zab cluster of dialects of the West
Azerbaijan province of Iran in the Urmi region is more distant to the extent that
1
The term North-Eastern Neo-Aramaic was coined by Hoberman (1988: 557).
2
As in chapters 2.5 and 3.4, Jewish dialects are distinguished from Christian dialects in
this chapter by the prefixing J. and C. respectively to the names of the locations where
they were spoken.
482 Geoffrey Khan
speakers of dialects from the western Iran cluster had difficulties communicating
with Jews from Urmi.
The Christian NENA dialect of Sanandaj (henceforth referred to C. Sanandaj)
is, likewise, very similar to the Christian dialect of Sulemaniyya (chapter 3.4) but
substantially different from the Christian dialects spoken in the Urmi region of
Iran. Grammatical and lexical studies on the C. Sanandaj include Panoussi (1990,
1991), Heinrichs (2002) and Kalin (2014). Short extracts of texts in the dialect
can be found in Panoussi (1990: 120–128) and Macuch and Panoussi (1974: 39).3
These authors refer to the dialect as the Senaya dialect, from the Kurdish name of
the town ‘Sena’. A brief overview of the NENA dialect situation in Iran in general
can be found in Hopkins (1999).
Neo-Mandaic is spoken by Mandaeans in southwestern Iran. This exists in
two known varieties originally spoken in the towns of Ahvāz and Khorramshahr
respectively (Macuch 1965, 1989, 1993; Häberl 2009, 2011; Mutzafi 2014). The
native name of this spoken language is raṭnā, derived from a verbal root meaning
‘to whisper or mutter’.
2. History of the speech communities
Very little is documented concerning the history of the NENA-speaking commu-

nities before the twentieth century. The lack of diversity in the Jewish cluster of
dialects suggests that the communities who spoke them migrated in a single wave
into the region in relatively recent centuries. The ancient heartland of NENA must
have been in what is now northern Iraq and southeastern Turkey, where there is
great dialectal diversity. The close relationship of the Jewish dialects of western
Iran with the J. Sulemaniyya dialect points to northeastern Iraq as the origin of
the migration. The isolated Christian dialect spoken in Sanandaj must, likewise,
have been the result of migration. It is closely related to the Christian dialect of
Sulemaniyya and there were family relationships between these two communities
in living memory of speakers. It is likely, therefore, that the Christians of Sanandaj
were originally migrants from the region of Sulemaniyya in Iraq.
We know that some of the Jewish communities who settled in the towns of the
region originally lived in surrounding villages. The Jews of Sanandaj, for example,
moved into the town after its foundation in the 17th century from a village known
as Qalʿat Ḥasanʾābād (Khan 2009: 1).
The Christians of Sanandaj belong to the Chaldean Church. In the 19th century
several Christian families moved to Qazvīn, where their speech developed the
3
Data on C. Sanandaj cited in this chapter are mainly taken from the publications of
Panoussi or from personal communications from him, which I acknowledge with grati-
tude.

The Neo-Aramaic dialects of western Iran
483
Figure 1: Locations of NENA and Neo-Mandaic varieties covered in this chapter

484 Geoffrey Khan
distinctive trait of the realization of /w/ as /v/ under the influence of Persian (Hein-
richs 2002: 238). In the middle of the twentieth century the Chaldean diocese of
Sanandaj moved to Tehran and the Christian Neo-Aramaic speakers moved with it.
Neo-Mandaic is spoken by the Mandaeans, who follow a religion that is a
descendant of a pre-Islamic Gnostic sect. The traditional homeland of the Man-
daean community is the south of Iraq and the adjacent Khuzestān province of south-
west Iran. They are known in Iraq and Iran as ‘Sabians’ (Arabic ṣābiʾūn, colloquial
ṣubba), who are one of the ‘peoples of the book’ (ʾahl al-kitāb) recognized in Islam.
Neo-Mandaic appears to have ceased to be the spoken language of the Man-
daeans of Iraq by the beginning of the 19th century. There are references to a few
speakers in Iraq in the 20th century, but these seem to be of Iranian origin (Häberl
2009: 36–37). After the first Gulf War in 1991 the Iraqi Mandaean community was
displaced from their homes in southern Iraq by the army of Saddam Hussein.
Up until the 19th century Neo-Mandaic was spoken in a variety of localities in
the Khuzestān region. The Mandaeans subsequently came to be concentrated in
Khorramshahr and Ahvāz, where two distinct varieties of the language survived
until modern times. During the Iranian revolution in 1979 and in subsequent con-
flicts with Iraq, Khorramshahr was largely destroyed and abandoned by its inhab-
itants, including the Mandaean community. Within Iran the language seems now
to be spoken only in Ahvāz (Mutzafi 2014: 1–5).
3. Current status of the speech communities
In 1952 many NENA-speaking Jews from the region emigrated to the newly
founded State of Israel. Over the subsequent two decades there was a gradual emi-
gration of the Jews either to Tehran or abroad, mostly to Israel. After the Iranian
Revolution in 1979 most of the remaining Jews left the region, the majority set-
tling in Los Angeles in the USA and the remainder in Israel or Europe. Today only
about half a dozen elderly Jews are reported to be still living in the town.
After the Christian NENA-speaking community moved en masse from
Sanandaj to Tehran, they gradually left Iran and settled abroad. The majority of
the migrants have settled in the Los Angeles area of the USA.
As a result of these migrations and the disintegration of the NENA speech
communities, the NENA dialects that were spoken in western Iran are now highly
endangered.
There are numerous Mandaeans living in the urban centres of Iraq and in com-
munities that have settled outside of the Middle East, especially in Sweden, Australia
and the USA. The vast majority, however, do not speak neo-Mandaic. The number
of competent speakers of the language is rapidly dwindling. Häberl (2009: 8) esti-
mated there to be around 100–200 elderly speakers, most of whom are living in Iran.
Neo-Mandaic, therefore, will inevitably become extinct within the next few years.
The Neo-Aramaic dialects of western Iran 485
The Muslims in the region speak varieties of Sorani Kurdish. Persian is the official
language in schools and government administration. Most Jews and Christians of
the region were able to communicate fluently in Kurdish and Persian (Farsi) as
well as Neo-Aramaic. This resulted in extensive influence of these contact lan-
guages on the Neo-Aramaic dialects in all levels of the grammar and in the lexicon.
The Mandaeans in the Iranian province of Khuzestān are trilingual. In addi-
tion to neo-Mandaic they speak the local dialects of Arabic, which constitute the
vernacular of much of the Muslim population of the area, and also Persian, which
is the official language and language of education. Some of the speakers use an
adapted form of the Classical Mandaic script to write down the vernacular lan-
guage.
5. Phonology
5.1. bgdkpt consonants and q

vowels, which can be represented *[ḇ], *[ḡ], *[ḏ], *[ḵ], *[p̄], *[ṯ].
5.1.1. NENA dialects

In the NENA dialects of the region the fricatives *[ḇ], *[ḡ], *[ḵ], and *[p̄] have
reflexes that are common to the NENA subgroup as a whole (see chapter 2.5, §5.1
and chapter 3.4, §5.1):
*[ḇ] > /w/ J. Sanandaj: dənwe C. Sanandaj: gănawa
(< *dənḇē) ‘flies’ (< *gannāḇā) ‘thief’
In the variety of C. Sanandaj spoken in Qazvīn this reflex is pronounced /v/.
*[p̄ ] > /p/ J. Sanandaj: kepa (< *kēp̄ ā) C. Sanandaj: napel (< *nāp̄ el)
‘stone’ ‘he falls’
*[ḵ] > /x/ J. Sanandaj băxe (< *bāḵē) C. Sanandaj: baxe
‘he weeps’
The velar fricative *[ḡ] has in most cases been weakened to zero, e. g.
*[ḡ] J. Sanandaj: šrata (< *šrāḡtā) C. Sanandaj: čraya (< *šrāġā)
‘lamp’ ‘lamp’
This process of weakening of the velar fricative *ḡ involved an intermediate stage
in which the velar shifted to the pharyngeal *ʿ (Tsereteli 1990). In J. Sanandaj the
486 Geoffrey Khan
pharyngeal is preserved in the word loʿá ‘inside’, the development of which can be
reconstructed as loʿa < *l-ʿoya (by metathesis, this being the form of the word in
J. Amedia) < *l-ḡoya. The preservation of the pharyngeal would have been condi-
tioned by a pharyngealized pronunciation of the word, no doubt facilitated by the
/l/, though this has now been lost. The word is still pronounced with suprasegmen-
tal pharyngealization in J. Urmi (+lwa [lˤwˤɑˤ]).
Jewish and Christian NENA of the region differ with regard to the reflexes of
the interdental fricatives *[ḏ] and *[ṯ]. In the Jewish cluster of dialects the reflex
of both is generally the lateral /l/, as in other trans-Zab Jewish dialects:
J. Sanandaj bela (< *bayṯā) ‘house’
qlila (< *qḏila) ‘key’
The lateral /l/ occurs in a number of words where the cluster of Jewish trans-Zab
dialects in northwestern Iran have /d/ rather than the normal reflex /l/:
J. Urmi J. Sanandaj
ida ʾila < *ʾīḏā ‘hand’
od ʾol < *ʿāḇəḏ ‘he does’
In a few words in the Jewish dialects the reflex of the unvoiced interdental *ṯ is the
unvoiced pharyngeal fricative /ḥ/, e. g.
J. Sanandaj ʾaḥra (< *ʾaṯrā) ‘town’
təlḥa (< *tlāṯā) ‘three’
láḥmal (< *lā-ṯimmal) ‘the day before yesterday’
naḥale (< *nāṯāṯā) ‘ears’
In such words the *ṯ was originally weakened to *h and this subsequently shifted
to a pharyngeal /ḥ/. Such words must have been originally pronounced with
suprasegmental pharyngealization, originating no doubt from the consonants r,
l or m. The pharyngealization was subsequently lost as a suprasegmental feature
but left a vestige in the pharyngeal segment /ḥ/: ʾaḥra < *ʾˤaˤhˤrˤaˤ < *ʾaṯrā. The
intermediate stage of development /h/ and pharyngealization is found in J. Urmi:
+
ahra, +nahale.
In some plural forms of nouns in the Jewish dialects the reflex of *ṯ is zero,
e. g.
J. Sanandaj malăwáe (< *māṯawāṯā) ‘villages’
An original /d/ in post-vocalic position in the Jewish dialects sometimes shifts to
the fricative /z/:
J. Sanandaj koza ‘liver’ cf. J. Urmi koda
guza ‘wall’ cf. J. Urmi guda
The articulation of the consonant has been further weakened in a few cases to zero,
e. g.
J. Sanandaj xar ‘He becomes’ cf. J. Urmi xadər
šar ‘He sends cf. J. Urmi šadər
In C. Sanandaj the reflexes of *[ḏ] and *[ṯ] are /d/ and /s/ respectively, as in
C. Sulemaniyya in Iraq, e. g.
C. Sanandaj ʾida (< *ʾīḏā) ‘hand’
besa (< *bayṯā) ‘house’
The /s/ may become pharyngealized in environments that are liable to pharyngeal-
ization, such as words containing the sonorant /r/:
C. Sanandaj baṣra (< *baṯrā) ‘behind’
In the Jewish dialects /q/ is normally realized as an unvoiced uvular stop, e. g.
baqá [bɑːˈqɑ] ‘to’. After a vowel or /w/, it is occasionally realized as an unvoiced
uvular fricative (Khan 2009: 21), e. g.
J. Sanandaj
qoqé [qoːˈχeː] ‘pots’
šəwqá-y [ʃɪfˈχaj] ‘He has left’.
5.1.2. Neo-Mandaic
Neo-Mandaic has preserved all of the fricative bgdkpt consonants except fricative
*ḏ, which has shifted to a stop /d/ (Macuch 1965: 32–40; Häberl 2009: 48–65).
The examples below are taken from Häberl’s description of the Khorramshahr
dialect:
*ḇ [v] gaḇrā [ˈgævrɔ]4 ‘man’, [w] in the environment of back
rounded vowels: əḇod [əˈwod] ‘do! (ms.)’
*ḡ [ɣ] palḡā [ˈpalɣɒ] ‘split’, loḡrā [ˈloɣərɔ] ‘leg’
*ḏ [d] idā [ˈiːdɔ] ‘hand’
*ḵ [χ] əḵal [aˈχɑl] ‘he ate’
*p̄ [f] nəp̄ aq [nəˈfɑq] ‘he went out’
*ṯ [θ] bieṯā [ˈbiɛ̆θɔ] ‘house’, hāṯā [hɔːθɔ] ‘sister’
4
Following the practice of Häberl in his publications, in the section on phonology exam-
ples from Neo-Mandaic are given in a phonemic transcription in italics and in a pho-
netic transcription in IPA characters in roman enclosed in square brackets.
488 Geoffrey Khan
The shift of the original *ḏ to the stop d in Neo-Mandaic seems to be a relatively

recent development. A glossary of the spoken language written in Mandaic script
in the 17th century indicates the existence of the fricative pronunciation of the letter
in many native Aramaic words by a diacritical dot (Borghero 2004: 70–71).
Before a syllable beginning with the sonorant [l], the unvoiced interdental [θ]
loses its oral articulation and shifts to [h], e. g.
ehle [ˈɛhle] ‘I have’ < *eθ-le
In addition to the inherited fricative bgdkpt consonants, Neo-Mandaic also exhib-
its the fricativization of the inherited voiced stops /b/, /g/, /d/ to the voiceless
fricatives /f/, /χ/ and /θ/ respectively before voiceless stops (Häberl 2009: 89), e. g.
rabbā [ˈræbbɔ] ‘great’ rap̄ tar [ˈræftæɹ] ‘greater’
In Neo-Mandaic the voiceless uvular stop /q/ is often realized as voiced uvular
fricative [ʁ] after a vowel in a stressed syllable and in word-initial position it is
occasionally realized as a voiced uvular stop (Häberl 2009: 57), e. g.
láqben [ˈlɑʁ.bɛn] ‘we don‘t want’
qĕhazelli [ga.ˈzɛl.li] ‘they see him’
5.2. Laryngeals and pharyngeals

In the Jewish NENA cluster of dialects of the region, an initial laryngeal stop *ʾ in
some words shifts to a laryngeal fricative /h/, e. g.
J. Sanandaj hămər ‘(that) he says’ < ʾămər
hezəl ‘(that) he goes’ < ʾezəl
As generally in NENA, the unvoiced pharyngeal fricative *ḥ has shifted to the
velar fricative /x/ in both Jewish and Christian NENA of the region, e. g.
J. Sanandaj xmara (< *ḥmārā) ‘ass’ qamxa (< *qamḥā) ‘flour’
C. Sanandaj xmara qamxa
The original pharyngeal has been preserved in some words and verbal roots of
Aramaic stock that contain /q/ or a historical pharyngealized consonant, e. g.
J. Sanandaj ḥănəq ‘he drowns’
C. Sanandaj raḥúqa ‘distant’
In C. Sanandaj *ḥ is preserved in some words relating to religion which are taken

from Classical Syriac, e. g. mšiḥa ‘Messiah’.
In word initial position the reflex of an historical voiced pharyngeal fricative
*ʿ is normally the laryngeal stop /ʾ/, e. g.
J. Sanandaj ʾela (< *ʿēḏā) ‘festival’
C. Sanandaj ʾeda
In word-internal or word-final position the voiced pharyngeal *ʿ has been weak-
ened to zero in most cases in both Jewish and Christian NENA, e. g.
J. Sanandaj beta (< *bēʿtā) ‘egg’, šăme (< *šāmaʿ) ‘he hears’
C. Sanandaj ṭena (< *ṭaʿnā) ‘load’; šame (< *šāmaʿ)
The pharyngeal *ʿ has been retained in some words of Aramaic stock that contain
/q/ or a consonant that was historically pharyngealized, e. g.
J. Sanandaj ʿaqəwra (< *ʿaqəbrā) ‘scorpion’; ṭaʿna (< *ṭaʿnā) ‘load’
C. Sanandaj ʿarəq (< *ʿārəq) ‘he flees’; ʿarəṭ (< *ʿārəṭ) ‘he farts’
In some cases its reflex is a laryngeal /ʾ/ in such contexts:
C. Sanandaj ṣaʾər (< *ṣāʿər) ‘he upbraids’
Pharyngeals are also preserved in the Jewish dialects in environments where a
consonant adjacent to the pharyngeal became pharyngealized at some stage of
development. This is found especially in the environment of the sonorant conso-
nants /m/, /b/, /l/ and /r/, e. g.
J. Sanandaj dəmʿe (< *dəṃʿe, < *dəmʿe) ‘tears’
gulʿa (< *guḷʿa < *gulʿā) ‘kernel of fruit’
zaʿra (< *zaʿṛa, < *sʿārā) ‘barley’
5.2.2. Neo-Mandaic
The original Aramaic pharyngeals have been lost in Neo-Mandaic. The normal
reflex of *ḥ is /h/ and that of *ʿ is zero.
hamšā [ˈhæmʃɔ] ‘five’ < *ḥamšā
ālmā [ˈɒlmɔ] ‘world’ < *ʿālmā
ārbin [ɔɹˈbin] ‘forty’ < *ʾarbʿīn
When /h/ closes a syllable in word-medial position, it is often strengthened to [x]
in order to prevent its elision, e. g.
Khorramshahr (Häberl 2009: 59)
qəhazenḵon [qax.ˈzin.χon] ‘I see you’
490 Geoffrey Khan
The speakers of Neo-Mandaic from Khuzestān are today all bilingual in Arabic
and their Neo-Mandaic contains loanwords from Arabic containing the Arabic
pharyngeal consonants. In the pronunciation of these loanwords the pharyngeals
are either retained as they are articulated in Arabic or they are weakened to laryn-
geals, most likely under the influence of Persian (Häberl 2009: 58–59), e. g.
Khorramshahr
ḥākem [ˈħɒːkɛm] ‘governor’ < Arabic ḥākim ‘judge’
ḥašiš [hæːˈʃiʃ] ‘hashish’ < Arabic ḥašīš ‘grass’
ṣunʿa [ˈsˤʌnʕɑ] ‘good’ < Arabic ṣunʿ ‘benefit’
jamiʿa [ʤɛˈmiːʔa] ‘all’ < Arabic jamīʿ ‘all’

C. Sanandaj preserves the pharyngealized consonants /ṣ/ and /ṭ/, e. g. ṣale ‘he
prays’, ṭaše ‘he hides’.
In the Jewish dialects the original pharyngealization of these consonants is
generally weakened. There is, therefore, minimal spreading of pharyngealization
in the word (Khan 2009: 17–18, 24), e. g.
J. Sanandaj ṭura [tʰuːˈra] ‘mountain’, xaṣa [xaːˈsa] ‘back’.
If, however, there was an original pharyngeal fricative *ʿ in the word, this is
retained in words containing the originally pharyngealized consontants /ṣ/ and /ṭ/,
e. g.
J. Sanandaj ṭaʿna [tʰaʕˈna] ‘load’ (< *ṭaʿnā)
This process of development seems to be due to convergence with the behaviour
of pharyngealization in the Kurdish dialects described by Margaret Kahn (1976:
49–52). In Kahn’s analysis a pharyngealized consonant in Kurdish is the result of
the presence of an underlying pharyngeal segment in the word. Such words can
also be realized with the pharyngeal on the phonetic surface in place of the phar-
yngealized consonant.
The Jewish dialects of the region do, however, have a pharyngealized sonorant
/ḷ/. It occurs in loanwords from Kurdish, e. g.
J. Sanandaj puḷe ‘money’, qaḷi ‘carpet’, zuxāḷ ‘coals’
It is found also in a number of words of NENA origin. In such cases it corresponds
to original /r/, e. g.
J. Sanandaj pšiḷa (< *pšira) ‘melted’; joḷe (< *jore) ‘urine’
5.3.2. Neo-Mandaic
As in the Jewish NENA dialects of the region, in Neo-Mandaic the pharyngeal-
ization of historical *ṭ and *ṣ is often weakened. This weakening is particularly
common when these consonants are in contact with a following non-pharyngeal-
ized consonant (Häberl 2009: 53–54), e. g.
Khorramshahr
ṭāb [tˤɔʊ] ‘good’
baṭluḵtā [bɑtˈlʊχtɔ] ‘misfortune’
ṣəḇyi [ˈsˤɛvjɪ] ‘he baptized him’
miṣrā [ˈmɪsrɑ] ‘boundary’
5.4. Diphthongs

The original diphthongs *ay and *aw have contracted to /e/ and /o/ respectively in
both Jewish and Christian NENA:
J. Sanandaj bela (< *bayṯā) ‘house’; mola (< *mawṯā) ‘death’
C. Sanandaj besa, mosa
5.4.2. Neo-Mandaic
Also in Neo-Mandaic the original diphthongs *ay and *aw have generally con-
tracted (Häberl 2009: 61, 62,74, 88–89). In unstressed open syllables the reflex of
*ay is /e/, e. g.
Khorramshahr
hemanuṯan [hemaˈnuːθæn] (< *haymānūṯan) ‘our faith’
In stressed open syllables *ay sometimes contracts to /i/, e. g.
Khorramshahr
inā [ˈiːnɔ] (< *ʿaynā) ‘eye’
In some words the process of contraction of this diphthong results in the hiatus
sequence /ie/, which has developed by a breaking of the vowel *e (i. e. *ay > *e
> /ie/), e. g.
Khorramshahr
bieṯā [ˈbɪɛ̆θɔ] (< *bayṯā) ‘house’
492 Geoffrey Khan
This breaking of the diphthong, which has been documented in recent years in both
the Khorramshahr and the Ahvāz dialects, is not found in records of Neo-Mandaic
from earlier in the twentieth century. This suggests that it may be a recent devel-
opment under the influence of languages in contact, such as the Arabic dialect of
Khuzestan, where the phenomenon is also found (Häberl 2009: 38–39, 74; Macuch
1993: 38). The diphthong *aw generally contracts to /o/ or /u/ in open stressed
syllables, e. g.
Khorramshahr
nodā [ˈnoːdɔ] (< nawdā) ‘quake’
yumā [ˈjuːmɔ] (< yawmā) ‘day’
Diphthongs other than *ay and *aw and those resulting from inflectional processes
or in loanwords are generally preserved in the Khorramshahr dialect, but they have
a tendency to contract in the Ahvāz dialect (Häberl 2009: 51, 258, 2011: 728), e. g.
Khorramshahr gāw [gɔʊ] ‘in’
Ahvāz gu [guː]
5.5. Stress

The basic stress position in the Jewish cluster of dialects of the region is word-fi-
nal, as in Jewish trans-Zab dialects elsewhere, whereas C. Sanandaj has basic
penultimate stress:
J. Sanandaj C. Sanandaj
belá bésa ‘house’
qăṭə́l qáṭəl ‘he kills’
In the Jewish dialects, however, retraction of the word-final stress frequently
occurs in various non-pausal contexts within an intonation group, e. g.
xá ʾăxóna xet-àf|
a brother other-3fs
‘Another brother of hers’ (Khan 2009: 53)5
In J. Sanandaj a short /ə/ or /u/ in an unstressed syllable may undergo devoicing.
This renders the vowel /ə/ inaudible, e. g.
*šəmá > [ʃɪ̥ ˈma] ‘heaven’ (Khan 2009: 40)
5
The intonation group boundary is indicated by the symbol |. The nuclear stress is repre-
sented by a grave accent and non-nuclear stresses by acute accents.
When short /u/ is devoiced in this way, the lip-rounding gesture of the vowel
remains, resulting in labialized pronunciation of the preceding consonant, e. g.
*duká > [tʰwˈkʰa] ‘place’ (Khan 2009: 43)
5.5.2. Neo-Mandaic
According to Häberl’s description of the Khorramshahr dialect (Häberl 2009:
77–81), in Neo-Mandaic the placement of stress depends on vowel quality and
syllable structure. The stress generally falls on a vowel with the qualities [i], [u],
or [ɔ], which Häberl (2009: 59) calls “tense vowels”, and preferably upon a closed
word-final syllable. If the word-final syllable is not suitable, the stress moves
towards the front of the word until it rests on a suitable syllable. Virtually all words
have the stress on the final, penultimate or antepenultimate syllable.
The stress falls on the final syllable when it is closed and contains a tense
vowel, e. g.
qəmahrəḇā́ t [qə.mæh.rɛ.ˈwɔːt] ‘you destroy’
or when the word-final syllable does not contain a tense vowel but there are no
suitable preceding syllables, e. g.
əḵál [a.ˈχɑl] ‘he ate’
In words containing two syllables, if the final syllable is open or contains a lax
vowel, the stress falls upon the penultimate syllable, if it contains a tense vowel
or is closed, e. g.
bā́ ḇā [ˈbɔː.wɔ] ‘father’
gáḇrā [ˈgæv.rɔ] ‘man’
In words of three or more syllables, if neither the final nor the penultmate syllable
is suitable for the stress placement, it recedes to the antepenultimate syllable, if
this is closed or contains a tense vowel, e. g.
gaṭélnāḵon [ga.ˈtˤɛl.nɒ.χon] ‘I will kill you’

In C. Sanandaj consonant gemination has been lost, including after short unrounded
/ə/, where it has been preserved in most other NENA dialects. When the gemina-
tion is lost after this vowel, the vowel remains short:
494 Geoffrey Khan
C. Sanandaj
də́ma (< *dəmma) ‘blood’ cf. C. Urmi dəmma
lə́ba (< *ləbba) ‘heart’ cf. C. Urmi ləbba
A consonant that was geminated secondarily due to the assimilation of an adjacent
consonant also loses its gemination in some cases. Such geminaton is, likewise,
preserved in most NENA dialects:
C. Sanandaj
šadrə́te (< *šadrətte) ‘you send him’
ʾə́tan (< *ʾəttan < *ʾītlan) ‘we have’
Within the Jewish cluster of dialects of the region consonant gemination has been
totally lost in J. Sanandaj, as in C. Sanandaj, but it has been retained after /ə/ in
other Jewish dialects of the cluster in conformity with the majority of NENA dia-
lects:
J. Sanandaj J. Săqəz
dəmá dəmmá ‘blood’
ləbá ləbbá ‘heart’
In some contexts a short /ə/ in an open syllable arising from loss of gemination is
eliminated by resyllabification, e. g.
J. Sanandaj
šəqlí ‘I bought’ < *šqəli < *šqəlli
zábna ‘I sell’ < *zăbəna < *zăbə́nna
In J. Sanandaj, but not other Jewish dialects of the region, consonants that are
geminated secondarily due to assimilation of another consonant also lose their
gemination. In this case the vowel in the opened syllable is lengthened:
J. Sanandaj J. Săqəz
garšéte garšə́tte ‘you pull him’ < *garšet–le
Some other features of syllable structure that are distinctive specifically of the
western Iranian Jewish cluster of dialects include the following.
Within a word two vowels may follow one another without an intervening
glottal stop /ʾ/. Such sequences should be analysed as diphthongs rather than two
separate syllable nuclei. In word-final sequences of vowels the existence of the
diphthong is reflected by the fact that stress that would normally be expected to
be put on a final vowel nucleus (§5.5.1.) is retracted to the vowel preceding it,
indicating that the final vowel is treated as non-syllabic, e. g.
J. Sanandaj
huláe [CV.CVV̯ ] ‘Jews’
ʾóa [CVV̯ ] ‘that one’
In cases where stress is put on the second vowel of the sequence in word-final posi-
tion, the diphthong should be interpreted as rising V̯ V. This is found in sequences
where the first of the two vowels is /o/. The non-syllabic status of /o/ is reflected
by the fact that it is often realized as a semi-vowel [w], e. g.
J. Sanandaj
šoá [ʃwa] ‘seven’
ntoá [ntwa] ‘high’
Sequences of three vowels with a medial /o/ occur in some forms. These should
be interpreted as VV̯ V, the medial vowel being the on-glide of a diphthong, the
phonetic realization of which is often the semi-vowel [w], e. g.
J. Sanandaj xăoé [xaˈwe] ‘to see’
Short /ə/ is often devoiced in unstressed syllables. This results in phonetic realiza-
tions such as the following in which the /ə/ vowel is inaudible:
J. Sanandaj qəṭmá [qhə̥thˈma] ‘ash’
šəmšá [ʃə̥mˈša] ‘sun’
5.6.2. Neo-Mandaic
The gemination of consonants is widely attested in Neo-Mandaic. A distinction
should be made between lexical gemination and phonological gemination, accord-
ing to the terminology proposed by Häberl (2009: 75–77).
Lexical gemination includes geminated consonants inherited from earlier
stages of the language, those formed through the total assimilation of one conso-
nant to another, and gemination in loanwords, e. g.
lébbā [ˈlɛbbɔ] ‘heart’ < *libbā
moqaddás [moqɑdˈdɑs] ‘holy’ < Arabic muqaddas
Phonological gemination depends on vowel quality and stress. It occurs only after
a stressed lax vowel, i. e. [e], [o] or [a]. The most common context where it is
found is where an inflectional morpheme beginning with a vowel (typically a pro-
nominal suffix) is added to a closed stressed syllable containing a lax vowel, e. g.
həzon [hə.ˈzon] + a [a] > həzónna [hə.ˈzon.na]
see.pst .3 pl + obj .3 fs see.pst .3 pl . obj .3 fs
‘They saw her’
496 Geoffrey Khan
All consonants can undergo phonological gemination in this way, with the excep-
tion of the interdental fricative /θ/. In a context where the phonological gemination
of /θ/ is expected, the cluster [χt] occurs instead of [θθ], e. g.
eṯ [ˈɛθ] + aḵ [aχ] > éḵtaḵ [ˈɛχ.taχ]
cop + 2 ms cop .2 ms
‘You are’
6.1. Pronouns

Both the Jewish cluster of dialects and C. Sanandaj have two series of demon-
strative pronouns, which express near and far deixis respectively. Unlike many
NENA dialects in other regions, they have not developed three series in which the
original far deixis form is used specifically for anaphoric reference. The near and
far deixis may be used with either deictic or with anaphoric function. The forms
are provided in Table 1.
Table 1: Demonstratives in the Jewish and Christian NENA dialects

J. Sanandaj J. Bokan J. Kerend C. Sanandaj
Near deixis
3ms. ʾay, ʾéa, ʾéxa ʾay, ʾáya ʾay, ʾáya ʾay, ʾáya, ʾíya,
ʾayánān, ʾiyánān
3fs. ʾay, ʾéa, ʾéxa ʾay, ʾáya ʾay, ʾáya ʾay, ʾáya, ʾíya,
ʾayánān, ʾiyánān
3pl. ʾayní, ʾənyé, ʾənye ʾanyé ʾān, ʾáni,
ʾənyexáe ʾanínān
Far deixis
3ms. ʾo, ʾóa, ʾóxa ʾo, ʾóya ʾo, ʾáwa ʾo, ʾáwa, ʾawánān
3fs. ʾo, ʾóa, ʾóxa ʾo, ʾóya ʾo, ʾáwa ʾo, ʾóya, ʾoyánān
3pl. ʾóni, ʾonyé, ʾóni, ʾonyé ʾóni, ʾonyé ʾōn, ʾóni, ʾonínān
ʾonyexáe
As can be seen, most of the demonstrative pronouns in the dialects have short and
long forms. The long forms tend to be used only when the pronoun is independent
and not combined with a noun. Only singular forms can be used in combination
with a noun. These are the shortest forms in the list presented above. These singu-
lar adnominal forms are used also with plural nouns, e. g.
J. Sanandaj
ʾay gora ‘this man’
ʾay naše ‘these people’
ʾo gora ‘that man’

ʾo naše ‘these people’
A few minor differences are found across the Jewish dialects of the region with
regard to the contraction of diphthongs in the pronouns (J. Sananadaj ʾéa < *ʾayha,
ʾóa < *ʾawha vs. J. Săqəz, J. Kerend ʾáya < *ʾayha, ʾáwa < *ʾawha). In both the
Jewish dialects and C. Sanandaj there is a levelling of gender in the singular. In
C. Sanandaj, however, differences in contraction of a diphthong are exploited to
mark a distinction in gender in the far deixis form (ʾáwa ms. < *ʾawha, ʾóya fs. <
*ʾawha). The 1st and 2nd person pronouns are given in Table 2.
Table 2: First and second person pronouns in Jewish and Christian Sanandaj
2ms. ʾāt ʾayət
2fs. ʾāt ʾayat
2pl. ʾaxtu ʾaxtoxən
1s ʾana ʾana
1pl. ʾaxni ʾaxni
C. Sanandaj expresses gender distinction in the 2nd singular with innovative forms,
found widely elsewhere in NENA, formed by the attachment of suffixes taken
from the paradigm of D-suffixes that are used in verbal inflection.
6.1.2. Neo-Mandaic
Neo-Mandaic has two series of demonstrative pronouns in the singular, expressing
near and far deixis (Häberl 2009: 161–163). There is only one plural demonstra-
tive pronoun, which is historically a far deixis form, but now is neutral as to dis-
tance. The singular pronouns are neutral as to gender and occur in an independent
and an adnominal dependent form, which are shown in Table 3.
498 Geoffrey Khan
Table 3: Neo-Mandaic demonstratives

Independent Adnominal
Near deixis
3s. āhā ā ‘this’
3pl. ahni — ‘these’
Far deixis
3s aḵu aḵ ‘that’
tā
3pl. ahni — ‘those’
The use of the singular adnominal demonstratives is extended to plural nouns,

since the plural demonstrative does not have an adnominal form, e. g.
ā bieṯā ‘this house’
ā šeršānā ‘these religions’
In addition to the near and far deixis demonstratives, there is another series of 3rd
person pronouns that functions as anaphoric personal pronouns. This differs from
the NENA dialects of the region, in which the deictic pronouns have taken over
the function of the anaphoric pronoun. The full paradigm of independent personal
pronouns is as follows. Some pronouns have two forms, which Häberl (2009: 156)
terms “lexical” and “contextual” respectively (Table 4).
Table 4: Lexical and contextual pronouns in Neo-Mandaic

Lexical Contextual
3ms. huwi huw
3fs. hidā hid
3pl. honni —
2s. āt —
2pl. atton —
1s. anā an
1pl. ani an
The contextual forms, the final vowel of which has been elided, are subject pro-
nouns before a verb that do not carry the nuclear stress of the utterance (a). The
lexical form of the pronoun is typically used when it takes the nuclear stress of the
utterance and has a contrastive function (b):
(a) an yəhm-it pahli-d-aḵ

1sg sit.pst -1 sg beside-gen -2 sg . m
‘I sat beside you.’
(b) anā yəhm-it pahli-d-aḵ
1sg sit.pst -1 sg beside-gen -2 sg . m
‘I (was the one who) sat beside you.’
6.2. Pronominal suffixes

The pronominal suffixes attached to nouns and prepositions are shown in Table 5.
Table 5: Pronominal suffixes in Jewish and Christian Sanandaj

3ms. -ef -e
3fs. -af -e
3pl. -u, -un -u
2ms. -ox -ox
2fs. -ax -ax
2pl. -ăxun -oxən
1s -i -i
1pl. -an, -ăni -an
The 3rd person singular suffixes in the Jewish dialects have an augment –f, which
derives historically from –u; cf. the forms –eu 3ms. and –aw, which occur in
several NENA dialects in other regions (chapter 3.4, §6.1). In C. Sanandaj the
orginal 3ms form –e has been extended to refer also the 3fs.
The variant form of the 3pl. suffix in J. Sanandaj –un, which preserves the
original final /n/ of the pronoun (< *-hun), is used only before the co-ordinative
enclitic –u, e. g. xa-dana mənun-u ‘one of them and …’ (Khan 2009: 61).
500 Geoffrey Khan
6.2.2. Neo-Mandaic
The pronominal suffixes attached to nouns and prepositions and also to verbs,
when expressing the object, are given in Table 6.
Table 6: Pronominal suffixes in Neo-Mandaic

3ms. -i
3fs. -a
3pl. -u
2ms. -aḵ
2fs. -eḵ
2pl. -ḵon
1s. -e
1pl. -an
When attached to nouns these function as genitive suffixes, e. g.

emm-e
mother-1 sg
‘my mother’
The genitive pronominal suffixes are attached to loanwords with an intervening
genitive particle d- (Häberl 2009: 158), e. g.
ʿumər-d-e
age-gen -1 sg
‘my age’

The inflection of nouns and adjectives in the NENA dialects of the region follow
the same basic patterns as other NENA dialects (chapter 3.4, §6.3).
In the Jewish dialects the feminine singular ending has the form –da or –la
in words in which the original ending was *-ṯa with an interdental fricative. The
ending –da occurs in a few nouns whose stem ends in one of the sonorant conso-
nants /l/, /n/ or /r/:
J. Sanandaj qarda (cf. C. Barwar qarθa) ‘cold’
šənda (cf. C. Barwar šənθa) ‘sleep’
The ending –la occurs in nouns the stem of which ends in a vowel or the sonorant
consonants /w/ and /m/:
J. Sanandaj šala (cf. C. Barwar šaθa) ‘fever’

šwila (cf. C. Barwar šwiθa) ‘bed’
xmala (cf. C. Barwar xmaθa) ‘mother-in-law’
In C. Sanandaj the feminine ending has the form –sa in these contexts: šənsa
‘sleep’, šasa ‘fever’, šwisa ‘bed’, xmasa ‘mother-in-law’.
In both the Jewish dialects and C. Sanandaj the common nominal plural end-
ing–e has been analogically extended to the feminine plural ending *-āṯā. In the
Jewish dialects this results in the form –ale or –ae and in C. Sanandaj the resultant
form is–ase:
ʾilale ʾidase ‘hands’ < *ʾīḏāṯā
šukyale šəkyase ‘testicles’ < *šekyāṯā
belăwáe baswase ‘houses’ < *bayṯāwāṯā
malăwáe maswase ‘villages’ < *māṯāwāṯā
Both the Jewish dialects and C. Sanandaj have borrowed a definite article suffix
from Sorani Kurdish with the form –ăke, e. g. J. Sanandaj bela ‘house’, belăke ‘the
house’; C. Sanandaj besa ‘house’, besăke ‘the house’. The form –ăke corresponds
to the oblique form of the Kurdish definite article (-ăkay) rather than the nomina-
tive form (–ăka).
In all NENA dialects indefiniteness is expressed by the cardinal numeral ‘one’
(xa), e. g. J. Sanandaj xa bela ‘a house’.
6.3.2. Neo-Mandaic
In Neo-Mandaic the original Aramaic nominal inflection -ā, is retained, but the
various Aramaic plural morphemes have been mostly replaced by the morpheme
-ān-, which may be identified with the Persian plural morpheme of the same form
(Häberl 2011: 730). This plural morpheme is placed before the nominal inflection
-ā. It is used with nouns of both masculine and feminine gender, e. g.
Gender Singular Plural
m. kədāḇ-ā ‘book’ kədāḇ-ān-ā ‘books’
f. id-ā ‘hand’ id-ān-ā ‘hands’
The Aramaic plural morpheme -āṯ- (sometimes augmented by the glides /w/ or /y/)
has been preserved in some nouns. It regularly occurs in feminine nouns that have
retained the Aramaic feminine singular morphme -t- in the singular, e. g.
f. tur-t-ā ‘cow’ tur-āṯ-ā ‘cows’
502 Geoffrey Khan
It is occasionally found in other nouns of feminine gender without the feminine

morpheme -t- in the singular, e. g.
f. bieṯ-ā ‘house’ bieṯ-wāṯ-ā ‘houses’
f. hewān-ā ‘animal’ hewān-yāṯ-ā ‘animals’
Most loanwords are inflected with the Persian plural suffix -(h)ā́ , e. g.
m. jihel ‘child’ jihel-ā́ ‘children’
Some frequent nouns have irregular plurals that have a different stem from the sin-
gular and in some cases do not contain a plural morpheme (Häberl 2009: 130), e. g.
m. gaḇr-ā ‘man’ goḇr-ā ~ goḇr-ān-ā ‘men’
f. eṯṯ-ā ‘woman’ enš-ā ~ enš-ān-ā ‘women’
f. hāṯ-ā ‘sister’ ah-wāṯ-ā ‘sisters’
Neo-Mandaic has no definite article but it uses various strategies borrowed from
Persian to mark indefiniteness. A referentially indefinite noun, which has a spe-
cific or non-specific referent, may be marked by the suffix -i, by the preposed par-
ticle ya (< Persian yak ‘one’) or by both (Häberl 2009: 144). The suffix -i replaces
the nominal inflection -ā, e. g.
barnāš-ā ‘the person’ barnāš-i ‘a person’
barnāš-ān-ā ‘the people’ barnāš-ān-i ‘some people’
mend-ā ‘the thing’ mend-i ‘a thing, something’
ya kədāḇ mandāyí
a book Mandaic
‘a Mandaic book’
ya tājer-i
a merchant-indf
‘a merchant’
This indefinite marker is not used productively in NENA, but the word məndi
‘thing, something’ is a common word in all NENA dialects. It is possible that this
spread into NENA by areal diffusion from Mandaic at an earlier period.
Plural markers are used in principle only with referential nouns. They do not
occur on nouns denoting a generic class, e. g.
barnāš-ā ‘the person’ (singular referential)/‘people’ (generic)

Nominal annexation involves the formation of a complex nominal phrase consist-
ing of a head noun and a dependent noun. In C. Sanandaj the Aramaic genitive
particle d is suffixed to the head noun, which generally results in the ending -əd:
C. Sanandaj ʾoṭaqəd qaša ‘the room of the priest’
saʿatəd mose ‘the hour of his death’
našəd ʾasra ‘the people of the town’
When a demonstrative pronoun is in the dependent phrase, both the demonstrative
and the head noun have d. The form of the demonstrative beginning with d can be
regarded as an oblique allomorph of the pronoun:
šəməd do marḥūm ‘the name of the deceased’
In the Jewish dialects the most common way of annexing one nominal to another
in a genitive relationship is simply to juxtapose the two. The original Aramaic
genitive particle d-, which regularly occurs in such constructions in other NENA
dialects, is rarely used in the Jewish dialects of this region:
J. Sanandaj bela băruxi ‘the house of my friend’
The d- particle has survived only in demonstrative pronouns. The form of the
pronoun beginning with d- can be regarded as an oblique allomorph of the pronoun:
J. Sanandaj bela do naša ‘the house of that man’
In the Jewish dialects in the south of the region, such as J. Kerend, the d- is gener-
ally omitted even on demonstrative pronouns:
J. Kerend bela ʾo naša ‘the house of that man’
Occasionally the Iranian ezafe particle –e is suffixed to the head noun in the Jewish
dialects:
J. Sanandaj belá-e băruxi ‘the house of my friend’
belá-e do naša ‘the house of that man’
In a few closely-knit phrases in both Jewish and Christian NENA the head noun of
an annexation phrase has no inflectional ending, e. g.
J. Sanandaj reš-šata ‘beginning of the year (= New Year)’
C. Sanandaj mār-kalba ‘the master of the dog’
504 Geoffrey Khan
6.4.2. Neo-Mandaic
In Neo-Mandaic nominal annexation is expressed by the elision of the nominal
ending -ā. This occurs when a noun is immediately followed by another noun or
adjective expressing a genitive or attributive relationship (Häberl 2009: 91, 132,
2011: 730), e. g.
bieṯā ‘house’ bieṯ bābe ‘the house of my father’
bieṯā ‘house’ bieṯ baʿid ‘the distant house’
If a phrase containing an adjective is indefinite, the indefinite suffix -i may be
attached to the adjective at the end of the phrase, e. g.
qazġān honin-i ‘a small cooking pot’ (Häberl 2009: 147)
The distribution of such shortened noun forms replicates the pattern of distribu-
tion of Persian ezafe constructions. The Persian ezafe particle itself may be used,
e. g.
kol dukkān=e Amrik-ān
all shop=ez American-pl
‘all the American shops’
This is sporadically found in the Khorramshahr dialect of Neo-Mandaic (Häberl
2009: 133–134) but is more frequent in the Ahvāz dialect (Macuch 1993: 60–61).
The shortened ‘contextual’ forms, to use Häberl’s terminology, are analogous to
the shortened ‘contextual’ forms of pronouns (§6.1.2.).
A genitive relationship between nouns may also be expressed by the particle
d-. In such cases the head noun is in its lexical form with the ending -ā, e. g.
qanāyā d=kaspā
smith of=silver
‘silver smith’ (Häberl 2009, 231)
7.1. NENA dialects

The basic principles of verbal derivational morphology and inflection in the NENA
dialects of the region are the same as those that have been described for the NENA
dialects of Iraq (chapter 3.4, §7). Here we shall concentrate on some selected fea-
tures.
In C. Sanandaj the original NENA system of three derivative stems has under-
gone some degree of levelling, with the result that stem II is partially assimilated
to the other stems. The present base of stem II is identical in form to that of stem
I, whereas the past base has the form of that of stem III. The only base that is dis-
tinctive to stem II is the imperative.
As in other NENA dialects (chapter 3.4, §7), the present and past inflectional
bases are inflected for person and number by two sets of suffixes, termed D-suf-
fixes (i. e. direct suffixes) and L-suffixes (i. e. oblique suffixes deriving historically
from prepositional phrases containing the dative/agentive preposition l-). These
suffixes indicate the grammatical relations of verbal arguments in the clause. In
C. Sanandaj D-suffixes mark the subject of the present base and L-suffixes the
subject of the past base, as shown in Table 7.
Table 7: Verb stems with D- and L-suffixes in Christian Sanandaj

Stem I
g-r-š I ‘to pull’
Present base: garəš-D
Past base: grəš-L
Imperative: groš
Stem II
š-d-r II ‘to send’
Present base: šadər-D
Past base: məšdər-L
Imperative: šadər
Stem III
x-l-p III ‘to exchange’
Present base: maxləp-D
Past base: məxləp-L
Imperative: maxləp
The morphology of the verbal derivative stems of the Jewish cluster of dialects
has undergone considerable development. The main surviving stems of the orig-
inal three-stem system of NENA are Stem-I (derived from the peʿal of earlier
Aramaic) and Stem-III (derived from the causative ʾap̄ʿel of earlier Aramaic). As
in C. Sanandaj, stem II has virtually completely merged in form with Stem-I, but a
distinction in the imperative form justifies identifying it as a separate stem. Another
development has taken place that is not found elsewhere in NENA, with the excep-
tion of the closely related dialect of J. Sulemaniyya. In each stem the vocalic
pattern of the bases differs according to whether the verb is used transitively or
intransitively. Broadly speaking the original vocalic patterns of the bases of the
causative Stem-III have been extended to transitive Stem-I verbs. Conversely the
intransitive Stem-I verbs preserves the original vocalic patterns of Stem-I and this
has been extended to intransitive verbs in Stem-III. Similar vocalic patterns are
found in the transitive and intransitive verbs of Stem-II. The subject of all present
bases is expressed by D-suffixes. There is, however, a split in the inflection of past
506 Geoffrey Khan
bases. Transitive past bases are inflected by L-suffixes but intransitive past bases
are inflected by D-suffixes, as shown in Table 8, while Table 9 shows the forms of
the suffixes themselves.
Table 8: Verb stems with D- and L-suffixes in Jewish Sanandaj

Stem I
g-r-š I ‘to pull’ (transitive) s-m-x I ‘to stand’ (intransitive)
Present base: gărəš-D săməx-D
Past base: grəš-L smix-D
Resultative participle: gərša smixa
Imperative: gắrəš, gruš smux
Infinitive: găroše sămoxe
Stem II
z-b-n II ‘to sell’
Transitive (active) Intransitive (anticausative)
Present base: zăbən-D –
Past base: zbən-L zbin-D
Resultative participle: zəbna zbina
Imperative: zắbən –
Infinitive: zăbone –
Stem III
m-rš-x ‘to cause to walk’ m-sk-r ‘to become lost’
(transitive) (intransitive)
Present base: marxəš-D maskər-D
Past base: mərxəš-L məskir-D
Resultative participle: məršxa məskira
Imperative: márxəš mə́skur —
Infinitive: marxoše maskore —
Table 9: D- and L-suffixes in Jewish Sanandaj

D-suffixes L-suffixes
3ms -Ø -le
3fs -a -la
3pl -i -lu, -lun
2ms -et -lox
2fs -at -lax
2pl -etun -lăxun
1ms -na -li
1fs -an -li
1pl -ex -lan
It is a common feature of the NENA dialects that the subject of present base verbs
is expressed by D-suffixes and the pronominal object by oblique L-suffixes. The
Jewish NENA dialects of western Iran, however, deviate from this pattern in one
detail, in that the pronominal object of 1st person singular present base verbs are
expressed by suffixes from the series that are attached to nouns to express posses-
sors (see §6.2.1), e. g.
J. Sanandaj (Khan 2009: 154–155)
gărəš-Ø-le
pull.prs - d .3 ms - l .3 ms
‘he pulls him’
garš-a-le
pul.prs - d .3 fs - l .3 ms
‘she pulls him’
garš-i-le
pull. prs - d .3 pl - l .3 ms
‘they pull him’
gărəš-n-ef
pull. prs - d .1 ms - poss .3 ms
‘I (ms) pull him’
garš-ăn-ef
pull. prs - d .1 fs - poss .3 ms
‘I (fs) pull him’
In the Jewish dialects of the region the D-suffixes are used also to mark the object
of transitive verbs, at least when the object is 3rd person. The alignment of clauses
with verbs constructed on past bases in these dialects, therefore, is ergative. If the
subject or object are specified by noun phrases in the clause, these are cross-refer-
enced by the verbal suffixes. This is illustrated in the following examples, which
show how the subject of an intransitive verb (1a) aligns with the object of a tran-
sitive verb (1b). In each case the common marker is the 3fs D-suffix on the verb:
(1) J. Sanandaj
a. brat-i smix-a.
daughter-my stand.pst -d .3fs
‘My daughter (S) stood up.’
b. baruxăwal-i brat-i gərš-a-lu.
friends-my daughter-my pull.pst - d .3 fs - l .3 pl
‘My friends (A) pulled my daughter (O).’
Unergative intransitive verbs, with agentive subjects, take L-suffixes on the past
base. The dialects, therefore, should be characterized as Split-S, according to the
508 Geoffrey Khan
terminology of Dixon (1994: 71), rather than canonically ergative. This applies,
for example, to verbs expressing the emission of sounds:
(2) J. Sanandaj
kalba nwəx-le.
dog bark.pst - l .3 ms
‘The dog barked.’
Intransitive verbs with D-suffixes may be termed unaccusative since they undergo
a change of state and are typically not agentive. They include verbs of movement,
which express a change in locational state rather than change in configurational
state, e. g.
(3) J. Sanandaj
brat-i zil-a.
daughter-my go.pst -d .3fs
‘My daughter went.’
The division between unergative and unaccusative in verbs of emission of sound
is lexicalized slightly differently across the various dialects of Jewish NENA of
the region, e. g.
Unergative (L-suffix) Unaccusative (D-suffix)
‘She coughed J. Sanandaj (šəh-la) J. Qar Hasan (šhil-a)
J. Tikab (šhəl-la) J. Bokan (šhil-a)
J. Kerend (šhəl-la)
‘He sneezed’ J. Sanandaj (təp-la) J. Bokan (tpil-a)
J. Tikab (tpəl-la) J. Qar Hasan (tpil-a)
J. Kerend (tpəl-la)
In the Jewish dialects the D-suffixes on the past base of transitive verbs can express
only the 3rd person object. A 1st or 2nd person object of a past base verb must be
expressed by an independent object phrase outside of the verbal complex:
(4) J. Sanandaj
a. grəš-le ʾəl-i.
pull. pst -l .3 ms obl -1 s
‘He pulled me.’
b. grəš-le ʾəl-ox.
pull. pst -l .3 ms obl -2 ms
‘He pulled you.’
In C. Sanandaj past base verb forms cannot express any pronominal objects by
D-suffixes. Another construction must be used consisting of a past converter prefix
təm- and the present base, which expresses the pronominal object by L-suffixes:
(5) C. Sanandaj
a. təm-garš-i-le.
pst -pull.prs - d .3 pl - l .3 ms
‘They pulled him.’
b. təm-garš-a-le.
pst -pull.prs - d .3 fs - l .3 ms
‘She pulled him.’
In both the Jewish dialects and C. Sanandaj ditransitive verbs cannot take two pro-
nominal object suffixes. In cases where there is a direct and an indirect pronom-
inal object, the indirect one must be expressed by an independent prepositional
phrase:
(6) J. Sanandaj
kwi-le ʾəl-ef.
give.prs .3 pl - l .3 ms to-3 ms
‘They give it (m.) to him.’
(7) C. Sanandaj
kewi-le tlas-e.
give.prs .3 pl - l .3 ms to-3 ms
‘They give it (m.) to him.’
Calques of phrasal verbs in the languages in contact containing a light verb and
noun are common in both the Jewish and Christian dialects, e. g.
ḥāz ʾol ‘he desires’ (literally: he makes desire; cf. Kurdish haz kirdin)
komak ʾol ‘he helps’ (literally: he does help; cf. Persian komak kardan).
The inflection of the enclitic copula in the Jewish dialects of the region corre-
sponds to that of the irrealis verb h-w-y ‘to be’ throughout the paradigm. This
represents a more advanced state of assimilation of the enclitic copula to verbal
inflection than is found in other NENA dialects. C. Sanandaj exhibits a par-
adigm of the enclitic copula that is more usual in NENA whereby the 1st and
2nd person forms of the copula correspond to verbal endings, but the 3rd person
forms have not undergone levelling and they remain distinct in form from verbal
inflection:
510 Geoffrey Khan
Table 10: Enclitic copulas in Jewish and Christian Sanandaj

Enclitic Copula Verb h-w-y Enclitic Copula Verb h-w-y
3ms. =ye hăwe =ile hawe
3fs. =ya hawya =ila hoya
3pl. =yen hăwen =ilu hawi
2ms. =yet hăwet =yet hawet
2fs. =yat hawyat =yat hoyat
2pl. =yetun hăwetun =iton hawiton
1ms. =yena hăwena =yen hawen
1fs. =yan hawyan =yan hoyan
1pl. =yex hăwéx =yox hawox
Most of the Split-S Jewish dialects of the region express perfects by combining
the resultative participle with the copula. The morphological form of the participle
differs according to whether the verb is transitive or intransitive unaccusative.
This distinction in the form of participle is found in the J. Sulemaniyya dialect of
eastern Iraq but not in the vast majority of NENA dialects that form resultative
perfects with a participle and copula construction (chapters 2.5, §7. and 3.4, §7).
The cluster of Jewish dialects in western Iran differ from all other NENA dialects,
including J. Sulemaniyya, in that the participle and the copula in transitive clauses
agree with the object and not with the subject (Khan 2009: 90–92, 323–326), e. g.
(9) J. Sanandaj
a. baxt-ăke qimta=ya
woman-the rise.ptcp . fs - cop .3 fs
‘The woman has risen.’
b. ʾo-gora baxt-ăke grəšta=ya
that-man woman-the pull.ptcp . fs - cop .3 fs
‘The man has pulled the woman.’
It can be seen that the transitive subject argument in the transitive construction
in (9b) has no explicit agreement marking on the verb in the form of an ergative
L-suffix. When there is no transitive subject argument nominal in the clause, the
subject can be interpreted as 3rd person of any gender or number:
(10) J. Sanandaj
baxt-ăke grəšta=ya
woman-the pull. ptcp . fs - cop .3 fs
‘He/she/they has/have pulled the woman.’
Such transitive perfect constructions with zero subject can be used only when the
subject is 3rd person. By contrast, the perfect construction with a copula is availa-
ble for subjects of all persons in intransitive unaccusative clauses:
J. Sanandaj
3ms smixá=ye ‘He has stood up’
3fs smixtá=ya ‘She has stood up’
2ms smixá=yet ‘You (ms.) have stood up’
2fs smixtá=yat ‘You (fs.) have stood up’
1ms smixá=yena ‘I (ms.) have stood up’
1fs smixtá=yan ‘I (fs.) have stood up’
An irrealis perfect may be formed by replacing the copula by the paradigm of the
irrealis form of the verb h-w-y ‘to be’. In the irrealis perfect construction in tran-
sitive clauses the participle and copula agree with the object, as in the indicative
perfect, but, unlike in the indicative perfect, the subject is marked by an ergative
L-suffix. Since there is explicit subject marking, the construction can be used with
subjects of all persons (Khan 2009: 92–94):
(11) J. Sanandaj
a. ʾo-gora baxt-ăke grəšta-hawya-le
that-man woman-the pull. ptcp . fs -be.irrealis .3 fs - l .3 ms
‘That man may have pulled the woman.’
b. ʾāt baxt-ăke grəšta-hawya-lox
you (ms.) woman-the pull. ptcp . fs -be.irrealis .3 fs - l .2 ms
‘You (ms.) may have pulled the woman’
c. ʾana baxt-ăke grəšta-hawya-li
I woman-the pull. ptcp . fs -be.irrealis .3 fs - l .1 s
‘I may have pulled the woman’
In C. Sanandaj the perfect is not expressed by a resultative participle but rather by
a construction in which the particle gi- is prefixed to a past base verb, e. g.
C. Sanandaj
grəš-le gi-grəš-le
‘he pulled’ ‘he has pulled’
This particle gi- is likely to be a reduced form of the copula with an indicative
prefix < *k-i.
Perfects denote the resultative and also can have an evidential function used
to express reported events or events in the remote past (Khan 2009: 296, 2012).
Neither Jewish nor Christian NENA have prefixed particles to mark future
tense. In Jewish NENA the basic present base serves to cover general present,
progressive present and future:
512 Geoffrey Khan
J. Sanandaj gărəš ‘he pulls/he is pulling/he will pull’

Both Jewish and Christian NENA have vestiges of an indicative prefix k- (or g-)
which is attached to the present base of a few verbs with weak initial radicals when
it expresses realis:
Present realis Present irrealis
C. Sanandaj k-azəl ʾazəl ‘he goes’
J. Sanandaj g-ezəl hezəl ‘he goes’
In J. Sanandaj the present progressive may be expressed by combining an inflected
present base form of the verb with a preceding infinitive form (Khan 2009: 332),
e. g.
J. Sanandaj
găroše gărəš
pull.inf pull.prs .3 ms
‘he is pulling’
In constructions with verbs that take a k-/g- prefix in the present indicative, the
k- is reduplicated on the infinitive, e. g.
J. Sanandaj
g-zala g-ezəl
ind -go.inf ind -go.prs .3 ms
‘he is going’
In C. Sanandaj the progressive is expressed by combining an inflected form of the
copula clitic with a present base verb (Panoussi 1990: 118), e. g.
gareš=ile
pull.prs .3 ms = cop .3 ms
‘he is pulling’
garš-en=yen
pull.prs - d .1 ms = cop .1 ms
‘I am pulling’
Some Jewish NENA dialects express the progressive by combining the present
indicative with a possessive expression (hit-e ‘he has’, hit-a ‘she has’ etc.), e. g.
J. Kerend (Hopkins 1999: 325)
hit-e k-e
poss -3 ms ind -come.prs .3 ms
‘he is coming’
7.2. Neo-Mandaic
Neo-Mandaic has preserved the suffix conjugation of earlier Aramaic to express
the perfective past. The historical active participle expresses both the imperfective
realis and irrealis. The realis is distinguished from the irrealis by a prefixed indica-
tive particle, which has the form qə- in Khorramshahr (Häberl 2009: 180) and qa-
in Ahvāz (Macuch 1993: 69). This is analogous, both in form and function, to the
indicative particles k- and g- of the NENA dialects, though it may be of a different
etymology. The general profile of the Neo-Mandaic and NENA verbal systems in
relation to Middle Aramaic, the term generally used to refer to the historical stage
of Aramaic before that of the modern dialects, is as follows (shading indicates
continuity of Middle Aramaic forms in the Neo-Aramaic dialects):
Middle Aramaic Neo-Mandaic NENA
Perfective past qṭal gəṭal qṭil-le
Imperfective realis qāṭel qə-gāṭel k-qaṭəl
Imperfective irrealis yiqṭul gāṭel qaṭəl
Imperative qṭul gəṭol qṭol
Neo-Mandaic has lost the Middle Aramaic imperfective irrealis form (yiqṭul), but
its verbal system is still more archaic than that of the NENA dialects, which have
lost both the Middle Aramaic imperfective irrealis (yiqṭul) and the perfective past
(qṭal).
The verbal stems include the historical peʿal (also know as the G-stem), paʿʿel
(agentive or denominal D-stem), ʾap̄ ʿel (causative C-stem) forms as well as the
T-stems (expressing middle and passive voices) ʾeṯpeʿel (tG stem), ʾeṯpaʿʿal (tD
stem) and ʾettap̄ ʿal (tC stem). This is a further aspect of the verbal system that is
more conservative than NENA, in which the T-stems have been lost. The prefixed
t morpheme of the historical T-stems has, however, been assimilated in all verbal
roots except those that have a sibiliant as initial radical, with which it has been
metathesized, e. g. eṣtəḇā ‘to be baptized’ (< ṣ-ḇ-w/y). An overview of the basic
forms (3ms perfective and imperfective, ms. imperative) across the various deriv-
ative stems is provided in Table 11 (based on Häberl 2011: 733).
Table 11: Verbal stems in Neo-Mandaic

Stem Perfective Imperative Imperfective
G-stem gəṭal gəṭol gāṭel ‘to kill’
tG-stem epseq epseq mepseq ‘to be cut off’
D-stem kammer kammer əmkammer ‘to return’
tD-stem kammar kammar mekammar ‘to turn back’
C-stem ahreḇ ahreḇ mahreḇ ‘to destroy’
tC-stem ettar ettar mettar ‘to wake up’
514 Geoffrey Khan
Compound verbal constructions consisting of light verbs expressing person, tense,

mood and aspect combined with a preceding noun or adjective expressing lexical
content, are very frequent. These are mostly calques on Persian light verb con-
structions. The most common light verbs are əḇad ‘to do’, əhaḇ ‘to give’, məhā ‘to
hit’ and tammā ‘to become’ (Häberl 2009: 227), e. g.
hiṯrā əḇad ‘to celebrate’
tamām tammā ‘to be finished’
The Neo-Mandaic perfective and imperfective forms are inflected with suffixes as
follows in the Khorramshahr dialect, as shown in Table 12.
Table 12: Verbal inflection in Neo-Mandaic

Perfective Imperfective
3ms gəṭal-Ø qə-ġaṭel
3fs gəṭl-at qə-ġaṭl-ā
3pl gəṭal-yon qə-ġaṭl-en
2s gəṭal-t qə-ġaṭl-et
2pl gəṭal-ton qə-ġaṭl-etton
1s gəṭl-it qə-ġaṭel-nā
1pl gəṭal-ni qə-ġaṭl-enni
Macuch (1993: 68) reports the occurrence of distinct suffixes of the prefective for
the 2fs, 3fpl and 2fpl in the Ahvāz dialect, but these do not occur in the material
gathered by Häberl (2009: 180), who gives the paradigm above.
Pronominal objects on perfective and imperative verbs are expressed by the
pronominal suffixes listed in §6.2.2. The addition of these suffixes in many forms
results in resyllabification of the base of the verb. The paradigm of the perfective
form of the verb ‘to kill’ (gəṭal) with the 3ms. (-i) and the 2pl. (-ḵon) object suf-
fixes is given in Table 13 (after Häberl 2009: 181). In this paradigm there is pho-
nological gemination (§5.6.2.) of the /n/ before the 3ms. object suffixes attached
to 3pl. and 2pl. verbs.
Table 13: Pronominal objects with perfective and imperatives in Neo-Mandaic

3ms. gəṭl-i gəṭal-ḵon
3fs. gəṭal-t-i gəṭl-at-ḵon
3pl. gəṭl-onn-i gəṭl-on-ḵon
2s. gəṭal-t-i
2pl. gəṭal-tonn-i
1s. gəṭal-t-i gəṭl-it-ḵon
1pl. gəṭal-n-i gəṭal-ni-ḵon
The 1st and 2nd person imperfective forms of the verb express pronominal objects
by object suffixes, but 3rd person imperfective forms express pronominal objects
by oblique suffixes consisting of an originally dative preposition l- and pronomi-
nal suffixes, analogous to the L-suffixes of NENA. The paradigm of the imperfec-
tive realis form of the verb ‘to kill’ (qə-gāṭel) ‘he kills’ with the 3ms. (-i, -l-i) and
the 2pl. (-ḵon, lə-ḵon) object suffixes is given in Table 14 (after Häberl 2009: 182).
Table 14: Pronominal objects with imperfective verbs in Neo-Mandaic

3ms. qə-gaṭel-l-i qə-gaṭel-lə-ḵon
3fs. qə-gaṭl-ā-l-i qə-gaṭl-ā-l-ḵon
3pl. qə-gaṭl-əl-l-i qə-gaṭl-əl-ḵon
2s. qə-gaṭl-āt-i —
2pl. qə-gaṭel-tonn-i —
1s. qə-gaṭel-n-i qə-gaṭel-nā-ḵon
1pl. qə-gaṭel-nann-i qə-gaṭel-nan-ḵon
In this paradigm there is phonological gemination (§5.6.2.) of the /n/ before the
3ms. object suffixes attached to 1pl. and 2pl. verbs. The /n/ of the 3pl. plural
subject inflection -en assimilates totally the /l/ of the oblique suffix.
Unlike most NENA dialects, Neo-Mandaic has not developed innovative verbal
forms to express the present perfect, but rather uses the suffix conjugation form
with this function in addition to the function of expressing the past perfective. The
present perfect function includes resultative and stative (Häberl 2009: 240), e. g.
(12) Khorramshahr
meḵt-at əšta
die.pst -3 fs . now
‘Now she is dead.’
The imperfective form expresses a variety of subtypes of imperfectivity, including
progressive, habitual and narrative present. It also expresses the future (Häberl
2009: 241–243).
The Neo-Mandaic indicative copula has a short and a long form. The short
form consists of clitics which are in origin light clitic forms of personal pronouns.
Häberl (2009: 230) found the paradigm of this to be defective in the Khorramshahr
dialect, with no short forms being in use for the 1st and 2nd plural forms, and no
distinctions in gender made in any person:
3s. =ye 2pl. —
3pl. =non 1s. =nā
2s. =yāt 1pl —
516 Geoffrey Khan
Macuch (1993: 53) presents a slightly fuller paradigm for the Ahvāz dialect:
3ms. =ye ~ =ya 2s. =yāt ~ =yet
3fs. =i 2pl. —
3mpl. =nɔn, =non 1s. =nɔn
3fpl. =nen 1pl. —
The long copula, which bears its own stress, is formed by combining the pro-
nominal suffixes listed in §6.2.2 with the base of the existential particle eṯ-. This
results in phonological gemination of the interdental /ṯ/, which, by the process
discussed above in §5.6.2., has the outcome /ḵt/. The paradigm of the long copula
is complete, with 1pl. and 2pl. forms and with gender distinctions in the 2nd and 3rd
persons singular (Häberl 2009: 231):
3ms. eḵt-i 2fs. eḵt-eḵ
3fs. eḵt-a 2pl. eḵt-əḵon
3pl. eḵt-u 1s. eḵt-e
2ms. eḵt-aḵ 1pl. eḵt-an
In addition to the preverbal particle qə-, which is used productively to express the
realis of the imperfective, the non-productive preverbal particle d- can be identi-
fied in the imperative form of the verb aṯā ‘to come’, viz. doṯi! ‘come!’ (Häberl
2009: 191; Macuch 1965: 306). This appears to be the same particle that is used
before imperatives in various NENA dialects to express intensity.
8.1. Copula clauses

The syntax of copula clauses in NENA has been described in chapter 3.4, §8.1.
Here we shall only make the observation that both in the Jewish dialects of the
region and in C. Sanandaj the copula is fixed on the predicate and cannot be moved
onto other constituents of the clause when they are in focus. This feature of the
syntax of the copula is found also in the trans-Zab Jewish dialects of Iraq and
northwestern Iran. The lack of movement is illustrated in the following exam-
ples:6
6
Intonation group boundaries are expressed by the symbol | and nuclear stress by a grave
accent.
(13) J. Sanandaj (Khan 2009: 336)

tat-óx tajə̀r=ye?| mam-ì tajə́r=ye| lá
father-your merchant=cop .3 ms uncle-my merchant=cop .3 ms neg
tat-ì.|
father-my
(14) C. Sanandaj
báb-ox tàjər=ile?| ʾaxòn-i tájər=ile| lá
father-your merchant=cop .3 ms brother-my merchant=cop .3 ms neg
bàb-i.|
father-my
The copula is negated by combining it with the negative particle la, e. g. J. Sanan
daj =y(e) ‘he is’, la=y ‘he is not’. In C. Sanandaj the /l/ of the 3rd person copula
forms is elided in the negative, e. g. =ile ‘he is’, la=y ‘he is not’.
8.1.2. Neo-Mandaic
The enclitic short copula of Neo-Mandaic is most commonly used in clauses in
which the predicate is an adjective (15), a resultative participle (16), a locative
expression (17) or interrogative expression (18) (Häberl 2009: 230, 253):
(15) Khorramshahr
šəbir=nā
good=cop .1 s
‘I am well’
(16) Khorramshahr
yehəm=ye
sit.ptcp = cop .3 s
‘He is seated’
(17) Khorramshahr
ahni barra əm-belāṯ=non
they outside from-city=cop .3 pl
‘They are outside the city’
(18) Khorramshahr
man=ye?
who=cop .3 s
‘Who is he?’
518 Geoffrey Khan
A clause with a short enclitic copula is negated by the part lu placed before the
predicate:
(19) Khorramshahr (Häberl 2009: 291, IX.10)
bāb-e lu-hnā=ye
father-my neg -here=cop .3 s
‘My father is not here’
The long copula is regularly placed after the predicate. It may be used with the
types of predicate exemplified in (15)–(18) and also with predicates containing a
nominal, where it is the norm (Häberl 2009: 231), e. g.
(20) Khorramshahr
an qanāyā d-kaspā eḵt-e
1sg smith of-silver cop -1 sg
‘I am a silver smith’
The long copula is negated by prefixing the negative particle la to the base of the
copula, the vowel of which is elide, e. g.
(21) Khorramshahr
šəbir l-eḵt-e
good neg - cop -1 s
‘I am not well’
Copula clauses that are not indicative present are constructed with a form of the
copula verb həwā, i. e. həwā past, hāwi irrealis, qə-hawi imperfective progressive/
future.
8.2. Verbal clauses

In the Jewish dialects of the region the direct object constituent of a clause is
normally placed before the verb, as in the Jewish trans-Zab dialects of Iraq and
northwestern Iran. This is also the normal word order in C. Sanandaj, which is
noteworthy since in other regions the Christian dialects are generally distinguished
from the Jewish trans-Zab dialects in having a basic verb–object word order.
In all dialects of the region a noun phrase that expresses a goal is normally
placed after the verb that expresses movement, e. g.
(22) J. Sanandaj (Khan 2009: 358)
zil-ex bela
go.pst - d .1 pl house
‘We went home.’
(23) C. Sanandaj (Panoussi 1990: 120)

ʾaz-ən besa
go.prs - d .1 s house
‘Let me go home.’
All verbs forms, including imperatives, are negated by the particle la, e. g.
la gărəš-Ø
neg pull.prs - d .3 ms
‘He does not pull.’
la gruš!
neg pull.imper
‘Do not pull!’
In the Jewish dialects when la negates a perfect formed by the combination of a
resultative participle and a copula auxiliary, the negator is placed before the parti-
ciple rather than the auxiliary, e. g.
(25) J. Sanandaj (Khan 2009: 97–98)
la smiqa=y
neg redden.ptcp = cop .3 ms
‘It has not become red.’
If, however, the predicate is an adjective, the negator is placed before the copula:
(26) J. Sanandaj (Khan 2009: 97–98)
smoqa la=y
red neg = cop .3 ms
‘It is not red.’
8.2.2. Neo-Mandaic
In Neo-Mandaic the position of the direct object constituent depends to a large
extent on its specificity (Häberl 2009: 135–137). Objects that refer to generic
classes or indefinite objects, marked with the indefinite suffix -i, with non-specific
referents are generally placed before the verb:
(27) Khorramshahr (Häberl 2009: 135)
šamʿ qə-masriḵ-en
candle ind -kindle. ipfv -3 pl
‘They light candles.’
520 Geoffrey Khan

jisr-i tum əḇad-yon
bridge-indf again do.pst -3pl
‘They built another bridge.’
Compound verb constructions with light verbs (§7.2.), which replicate the pattern
of Persian models, regularly place the generic nominal element before the verb, e. g.
(29) Khorramshahr
hiṯra əḇad
celebration make.pst .3 ms
‘he celebrated’
A definite object with an identifiable referent is generally placed after the verb and
is introduced by the preposition əl- (‘to/for’). There is also object agreement on
the verb in the form of an anticipatory object suffix, though this is elided before
the object marker and its original presence is often only discernible in the syllabic
structure of the verb (e. g. the phonological gemination of the /n/ of the 3pl. subject
suffix in 30):
aḇd-onn-Ø əl-ā konfərens
do.pst -3 pl - obj .3 ms obj -this conference
‘They held this conference.’
The specific direct object of a compound verb is generally marked by the prepos-
iton qam ‘to/for’. In such constructions there is no object agreement on the verb:
čāre aḇd-etton … qam tā jisər
remedy do.ipfv -2 pl to that bridge
‘You will fix that bridge.’
The constituent expressing the goal of an action is regularly placed after the verb,
e. g.
(32) Khorramshahr (Häberl 2009: 278, III, 6)
aḵ waxt ezg-ini qam Haft Tappé
that time go.pst -1 pl to Haft Tappe
‘At that time we went to Haft Tappe.’
Perfective and imperfective forms of verbs are negated by the particle la, which
bears the main word-stress:
lá-qə-mḥaṯṯen
‘they do not talk’
Unlike the NENA dialects of the region, imperative forms cannot be negated in
Neo-Mandaic. Negative commands and prohibitions are expressed by combining
the negative particle la with the irrealis imperfective, e. g.
l-all-etton ġer welāt
neg -go.irr -2 pl outside city
‘Do not go outside the city.’

The means of coordinating linkage of main clauses in the NENA dialects of the
region corresponds to the typology of the dialects of Iraq (chapter 3.4, §9.1). It may
be asyndetic or marked by a particle. Asyndetic linkage is particularly common in
the dialects of this region and appears to be the norm in C. Sanandaj judging by the
texts published by Panoussi (1990).
In the Jewish dialect main clauses are occasionally linked by the co-ordinat-
ing particle u. This is generally attached as an enclitic to the last item of a clause
before an intonation group boundary.
(35) J. Sanandaj (Khan 2009: 368)
xa-yoma zil-Ø lag-ef=u mir-e baq-ef
one-day go.pst - d .3 ms to-pro .3 ms =and say.pst - l .3 ms to-pro .3 ms
‘One day he went to him and said to him.’
Another strategy of coordinating linkage is to place a particle marking topical
prominence on the subject of the linked clause:
(36) J. Sanandaj (Khan 2009: 372)
ʿuč-lu ʾe-jəl-ăke ʾana-č ʾasr-ăna-nu
trample.imp - l .3 pl dem -clothes-def I-top wring.prs - d .1 fs - l .3 pl
‘Trample the clothes and I shall wring them out.’
522 Geoffrey Khan
9.1.2. Neo-Mandaic
Coordinating linkage in Neo-Mandaic may be marked by the coordinating particle
u (37) or it may be asyndetic (38):
(37) Khorramshahr (Häberl 2009: 280, IV.19)
inšānā aṯon u qāren əlāw mienā
women come.pst .3 pl and recite.ipfv .3 pl to water
‘Women came and enchanted the water.’
(38) Khorramshahr (Häberl 2009: 290, VIII.6)
Hafté horettā aṯā daš
week other come.pst .3 ms enter.pst .3 ms
‘He came another week and entered.’

The general subordinating particle d of earlier Aramaic has been lost before subor-
dinate clauses in the dialects of the region.
Relative clauses are normally asyndetic in C. Sanandaj, without explicit
linkage between the clauses and their heads:
(39) C. Sanandaj
darmane doktor tm-ewe-lu
medicines doctor pst -give.prs . d .3 ms - l .3 pl
the medicines that the doctor has given
Occasionally the Iranian particle ke is used to introduce the relative clause:
(40) C. Sanandaj
ʾay-xamra ke t-mand-ox-le
this-wine which pst -throw. prs - d .1pl .- l .3 ms
this wine which we have pressed
In J. Sanandaj asyndetic relative clauses are mainly restricted to cases where the
head noun is indefinite:
(41) J. Sanandaj (Khan 2009: 381)
xa-twka qărira hăw-e
a-place cool be.prs -3 ms
‘a place that is cool’
When the head noun is definite the relative clause is generally introduced by the
Iranian particles ya or ke:
(42) J. Sanandaj (Khan 2009: 377–378)
ʾo-naše ya-daʿwat kol-i-wa-lu
the-people rel -invitation make.prs - d .3 pl - pst - l .3 pl
‘the people whom they invited’
(43) J. Sanandaj (Khan 2009: 377–378)
xa-məndix=ye ke pərče koma ko-lu.
a-thing=cop .3 ms rel hair.pl black make.prs - l .3 pl
‘It is a thing that makes hair black.’
Factive and irrealis content clauses that are direct complements of verbs are some-
times introduced by the Iranian particle ke:
(44) J. Sanandaj (Khan 2009: 386–388)
hămər-Ø ke ʾay-brona həl-day brata gb-e.
say.irr - d .3 ms comp that-boy obj - dem . obl girl love.prs - d .3 ms
‘[in order that] he say that the boy loves the girl’
(45) J. Sanandaj
ʾijaza hulmu ke-ʾaxni xlula hol-ex
permission give.imper . pl comp -we wedding make.irr - d .2 pl
‘Give permission for us to hold the wedding.’
(46) C. Sanandaj
sfareš kewəl-Ø ke ʾana gi-se-li
message give.prs - d .3ms comp I perf -come.pst - l .1 s
‘He gives a message that “I have come.”’
Nouns functioning as adverbials at the head of a subordinate clause are often
linked to the clause by the Iranian ezafe particle:
(47) J. Sanandaj (Khan 2009: 394)
waxt-e xlula wi-li
time-ezafe wedding do.pst - l .1 s
‘when I married’
(48) C. Sanandaj
waxt-e ḥak-i
time-ezafe speak.prs - d .3 pl
‘when they speak’
The Aramaic conditional particle has been completely replaced by the Iranian par-
ticle ʾăgar:
524 Geoffrey Khan
(49) J. Sanandaj (Khan 2009: 396)

ʾăgar xast=et ʾat gbe gănet.
if tired=cop .2 ms you must sleep.prs - d .2 ms
‘If you are tired, you must sleep.’
(50) C. Sanandaj
ʾăgar ʾawa ʾas-e ʾana k-az-ən
if he come.prs . irr - d .3 ms I ind -go-d .1 ms
‘If he comes, I shall go.’
9.2.2. Neo-Mandaic
In Neo-Mandaic the Aramaic subordinating particle d is still used in noun phrases
to introduce genitive attributes, but it is no longer used to introduce subordinate
clauses.
Relative clauses use relative particles that have been borrowed from contact
languages, viz. ke ‘that’ from Persian and illi ‘which’ from Arabic. The particle ke
is used to introduce restrictive relative clauses, e. g.
aṯṯ-on barnaš-ānā ke šiḇih-ānā kāḇš-el-l-u
bring.imp - pl person-pl rel demon-pl subdue.ipfv -3 pl - obj -3 pl
‘Bring the people who will subdue the demons.’
In some cases the head noun has the Persian ‘specifying’ -i suffix, which is bor-
rowed from Persian relative clause constructions. This is seen in (52), which also
exemplifies the use of an object resumptive pronoun to express the object function
of the referent of the head noun in the relative clause:
ezg-it dukk-ān-i ke həz-it-u awwál
go.pst -1 sg place-pl -i rel see.pst -1 sg -3 pl before
‘I went to the places which I saw before.’
In the Khorramshahr dialect the particle illi introduces non-restrictive relative
clauses with a definite head noun, e. g.
(53) Khorromshahr (Häberl 2009: 165)
q-abi-n amər-Ø genz farwāh Profesər Buckley
ind -want.ipfv -1 sg say.irr -1 sg many thanks Professor Buckley
illi genz əḇād aḇd-at qam Manday-ānā
rel much work do.pst -3 fs for mandaean-pl
‘I want to give many thanks to Professor Buckley, who has done so much
work for the Mandaeans.’
In the Ahvāz dialect the Persian relative particle ke is used in such non-restrictive
relative constructions (Häberl 2009: 260).
When the head noun of a non-restrictive relative clause is indefinite with a
specific referent, there is no relative particle, e. g.
gāw Aḥwāz həwā tarmid-i ešm-i šiex ʿAbdalla
in Ahvāz be.pst .3 ms tarmida name-3ms sheikh Abdallah
‘In Ahvāz there was a tarmida,7 whose name was Sheikh Abdallah.’
Adverbial clauses are introduced by a preposition or are asyndetic. The verb is in
the irrealis or realis form according to whether the event has been realized or not,
e. g.
(55) Temporal clause, Khorramshahr (Häberl 2009: 248)
qamāy malp̄ -āt-i ana gaṭel-nā-ḵon geš
before teach.irr -2 sg -3 ms 1sg kill.irr -1 sg - obj .2 pl all
‘Before you teach him, I will kill you all!’
(56) Causal clause, Khorramshahr (Häberl 2009: 249)
aṯ-on gāw Irān qə-beyy-en komak
come.pst -3 pl in Iran ind -want.ipfv -3 pl help
aḇd-əl-l-Ø əl-Rusya
do.irr -3 pl - obj -3 fs obj -Russia
‘They came to Iran (because) they wanted to help Russia.’
In conditional constructions the Persian conditional particle agar is used (Macuch
1965: 432; Häberl 2009: 250), e. g.
(57) agar pərāhā eh-l-e turt-i qə-zaḇen-nā
if money cop -to-1 sg cow-indf ind -buy. ipfv -1 sg
‘If I have money, I shall buy a cow.’
10. Concluding Remarks
The NENA dialects of western Iran represent the far eastern periphery of the
NENA dialect area. They exhibit a number of features that are not characteristic
of the vast majority of NENA dialects in other regions. This applies in particular
to the cluster of Jewish dialects. One of the most conspicuous differences between
the Jewish dialects and the rest of NENA is their split-S alignment in clauses with
verbs formed from past bases. In Khan (2017) I have argued that this develop-
ment is likely to be an innovation within NENA, triggered by a greater degree of
7
The first grade of the Mandaean priesthood.
526 Geoffrey Khan
convergence to the ergative alignment of Kurdish. The use of D-suffixes with the
past base is likely to have begun as a means of expressing the stative-type resul-
tative perfects in maximally stative unaccusative verbs, as is found in some other
peripheral NENA dialects, such as J. Urmi and C. Hertevin (chapter 2.5, §7). In
the Jewish dialects of western Iran this would have developed further and spread
to the expression of the perfective. The development of intransitive verbal forms
in Ṭuroyo and Mlaḥso with the inflection of D-suffixes on past bases should be
considered to be an independent development.
The lexicon of the Jewish dialects and C. Sanandaj contains many loanwords
from Kurdish and Persian. The available material on C. Sanandaj dialect also
exhibits a number of loanwords from Arabic. The core vocabulary remains pre-
dominantly Aramaic in origin, but even this exhibits some loanwords (Khan 2009:
401–409).
Neo-Mandaic, which is a separate subgroup of Neo-Aramaic, exhibits numer-
ous differences from the NENA dialects of Western Iran in phonology, morphol-
ogy, syntax and lexicon. In a variety of features Neo-Mandaic is more archaic than
NENA.
The morphology of the pronouns in Neo-Mandaic, for example, is more con-
servative in some respects than the NENA dialects of the region, such as its pres-
ervation of distinct forms for anaphoric third person personal pronouns (§6.2.2.).
The morphology of the verb in Neo-Mandaic (§7.2.) is more archaic in its
profile than the verbal systems of NENA dialects, preserving, for example, the
original perfective suffix conjugation and a wider range of derivative verbal forms
and lacking innovations such as distinct forms for the present perfect. Verbal
clauses in Neo-Mandaic, moreover, exhibit uniformly accusative alignment.
The form of the Neo-Mandaic copula appears to be more archaic than NENA.
Parts of the paradigm of the short copula still have the resemblance of a clitic
pronoun, the historical origin of the copula. The long copula, formed by the com-
bination of the existential particle and pronominal suffixes, is a common feature
of earlier Aramaic dialects, but does not exist in NENA.
The expression of prohibitions by negation of the irrealis imperfective rather
than the imperative (§8.2.2.) is also more in line with earlier Aramaic dialects than
with the NENA dialects of the region, which negate the imperative.
The core vocabulary of Neo-Mandaic is predominantly of Aramaic origin.
Neo-Mandaic and NENA share a number of lexical items that are not found in
other Neo-Aramaic subgroups (Mutzafi 2014: 117–143). Although the Neo-Man-
daic communities in south-western Iran are now not in contact with NENA speak-
ers further north, some lexical isoglosses between the two dialect groups may
have spread at an earlier period by areal diffusion. This applies in particular to
the common NENA word məndi, which appears to be of Mandaic origin (§6.3.2.).
11. Glossed texts
11.1. North-Eastern Neo-Aramaic

J. Sanandaj dialect
Winter (Khan 2009: 430)

An acute accent (v́ ) marks non-nuclear stress. A grave accent (v̀ ) marks the nuclear
stress of an intonation group. A vertical line (|) marks an intonation group bound-
ary.
(58) ʾăwăḷán twkáne là ʾasr-í-wa.| našé
first shops neg close.prs - d .3 pl - pst people
g-ez-í-wa wárya ba-talgà-č.| xor-twkané là
ind -go.prs - d .3 pl - pst outside in-snow-also indeed-shops neg
ʾasr-í-wa-lu.| wa-ba-ʿălawà-č| rásm
close.prs - d .3 pl - pst - l .3 pl and-in-addition-also custom
ga-dokà=ye-le| măsălán ga-ʾaxə́r-e
in-there=cop . pst - l .3 ms for_example at-end-ez
páyiz qárwa sətwà| hár xanăwadé ta-nòš-ef,|
autumn close winter each family for-self-poss .3 ms
hár məšpaḥá ta-nòš-ef, |
g-ezə̀l-Ø-wa| xa-dána
each family for-self-poss .3 ms ind -go.prs - d .3 ms - pst one-single
tórta šăqə̀l-Ø-wa-la.| k-m-e-wa-l-ó
cow buy.prs .- d .3 ms - pst - l .3 fs ind -bring-d .3 ms - pst - l .3 fs - pvp
ga-béla nòš-ef.|
in-house self-poss .3 ms
First of all, they did not close shops. People would go outside even in the
snow. Indeed they did not close the shops. In addition, there was a custom
there, for example at the end of autumn just before winter8 each family,
each family went in their turn and bought a cow. They would bring it back
to their home.
(59) ráb-an k-é-wa dăbəḥ-Ø-wa-l-ó
rabbi-poss .1 pl ind -come.prs .3 ms - pst slaughter-d .3 ms - pst - l .3 fs - pvp
baq-èf.| tămíz k-òl-Ø-wa-la.| xăé-wa
for-poss .3 ms clean ind -do.prs - d .3 ms - pst - l .3 fs see.prs .3 ms - pst
ṭărefà lá-hawy-a,| xărā̀ b lá-hawy-a.| ʾo-waxtắra
unclean neg -be.irr - d .3 fs polluted neg -be.irr - d .3 fs that-time
qăšàng| ʾo-năqolé-u măqolè-u| ʾonyexáe yá
beautiful dem -unclean-and unclean-and those rel
8
Literally: close to winter.
528 Geoffrey Khan
gdidìm9=yen| xărā̀ b=yen,| lá k-əxl-i-lù|

sinews=cop .3 pl polluted=cop .3 pl neg ind -eat.prs - d .3 pl - l .3 pl
doq-Ø-wa-lú-u šoḷət-Ø-wa-lù.|
seize.prs - d .3 ms - pst - l .3 pl -and throw-d .3 ms - pst - l .3 pl
Our rabbi would come and slaughter it for them. He cleaned it. He checked
lest it be ritually unclean, lest it be polluted. Then he carefully took out
what had to be removed, those parts that were sinews and were polluted,
which they did not eat, and threw them away.
(60) pəsr-ăké k-wə́l-Ø-wa-le ba-ʾilá máre bel-ăkè.|
meat-def ind -give. prs - d .3 ms - pst - l .3 ms in-hand master house-def
máre bel-ăké mən-dáy pəsrà| mătú-Ø-wa-le
master house-def from-that.obl meat put. prs .3 ms -Ø- pst - l .3 ms
ga-tănurá băšə̀l-Ø-wa-le. |
k-əmr-í-wa-le
in-oven cook.prs -d .3ms -pst - l .3 ms ind -say-d .3 pl - pst - l .3 ms
qawurmà.| năt-èn-wa-le| ga-twká qărirá
preserved_meat take. prs - d .3 pl - pst - l .3 ms in-place cool
măt-ì-wa-le.| yaxčā́ l lìt-wa-lan
place-d .3 pl - pst - l .3 ms fridge there_is_not-pst - l .1 pl
xor-ʾo-waxtărá.| măt-í-wa-le ga-xá-twka qărirà
still-that-time place-d .3 pl - pst - l .3 ms in-one-place cool
hăwé-Ø| péš-Ø ta-sətwà.|
be.irr - d .3 ms stay.irr - d .3 ms for-winter
He then gave the meat to the head of the household. The head of the
household put out some of this meat and cooked it in an oven. They called
this qawurma. They took it and placed it in a cool place. We still did not
have a fridge at that time. They put it in a place that was cool for it to
remain until winter.
(61) ʾáy qawurmá ʾăxà=ye-le| galgl-í-wa-le
this preserved_meat thus=be.pst - l .3 ms stir. prs - d .3 pl - pst - l .3 ms
ga-pliyáw məšxá nòš-ef,| məšxá tort-ăkè,| lá məšxà,|
in-middle oil self-poss .3 ms oil cow-def neg oil
denulà.| ga-denulá nóš-af bašl-í-wa-la
fat in-fat self-poss .3 fs cook.prs - d .3 pl - pst - l .3 fs
qăšàng.| măt-í-wa-le ga-xá … ʾó waxtắra
well put.prs - d .3 pl - pst - l .3 ms in-one that time
qoq-è- hit-wa.| qoq-é xa-zárf,
pot-pl there_is-pst pot-pl one-container
9
Hebrew.
zrúf-ek=ye-lu mə́n ʾápr-e smoqà.| ʾənyexáe

containers-indf =be.pst - l .3 pl from clay-ez red they
măt-í-wa-lu ga-sətwà.| mən-dóa rába
place.prs - d .3 pl - pst - l .3 pl in-winter. from-that.obl much
ʾəstəfáda k-ol-ì-wa.|
advantage ind -do.prs - d .3 pl - pst
This qawurma was like this, they would stir it round in its own oil, the oil
of the cow. Not oil, fat, they cooked it well in its own fat. They put it in a—
at that time there were pots. Pots were a container, containers made of red
clay. They would serve this in winter. They made good use of this.’
11.2. Neo-Mandaic
Khorramshahr dialect
The cooking pot (Häberl 2009: 292)

(62) ya yum-i ezgā-Ø Joḥa ṭəlab-Ø
one day-indf go.pst -3 ms Joḥa ask.pst -3 ms
əm-šəḇāḇ-ān-i qazġān-i tāp̄ i-Ø
from-neighbour-pl - poss .3 ms cooking_pot-indf cook.irr -3 ms
gāww-i
in-poss .3 ms
‘One day, Joha went to ask his neighbours for a cooking pot in order to
cook in it.
(63) baʿad so ruz kammar-Ø, bədaq-Ø kars-i
after three day return.pst -3 ms put.pst -3 ms belly-poss .3 ms
qazġān honin-i
cooking_pot small-indf
After three days he returned, having put a small pot inside it.
(64) šəḇāḇ-ān-i həz-onn-(i)10 əl-qazġān u
neighbour-pl - poss .3 ms see.pst -3 pl -3 ms obj -cooking_pot and
əḵad-Ø-i minjo-d-i
take.pst -3 ms -3 ms from_within-gen -3 ms
His neighbours saw the pot, and he took it out of it.
10
Brackets enclose suffixes that have been elided in sandhi.
530 Geoffrey Khan
(65) mal-lonn-(i) əl-Joḥa mu=yye āhā? Joḥa əmall-u

say.pst -3 pl -3 ms to-Joḥa what=cop .3 ms this Joḥa say.pst -3 pl
āhā qazġān-ḵon
this cooking_pot-2pl
He said to Joha “What is this?” Joha said them “This is your cooking pot.”
(66) həwā-Ø qār-e tam-Ø qār-i jihel.
be.pst -3 ms by-1ms become.pst -3 ms by-3 ms child
aṯi-t-i qam-də-ḵon
bring.pst -1 s -3 ms for-gen -2 pl
While it was at my place, it gave birth to a child. I brought it for you.
(67) ṣāḥeb bieṯā əmall-Ø-i muḥāl=ye dəri-Ø
owner house say.pst -3 ms -3 ms absurb=cop .3 ms take.pst -3 ms
u aḥbi-Ø
and keep-3 ms
The owner of the house said to him “That is absurd”. He took it and kept it.
(68) hafté horettā aṯā-Ø Joḥa u ṭəlab-Ø minn-i
week other come.pst -3 ms Joḥa and ask.pst -3 ms from-3ms
ya qazġān-i
one cooking_pot-indf
The next week Joha came and asked him for a cooking pot.
(69) haḇ-lonn-i ya tafšā rabb-i. dər(i)-Ø-(i)
give.pst -3 pl -3 ms one stew_pot big-indf take.pst -3 ms -3 ms
əl-rabbā. lá-kamr-i qam-didu
obj -big neg -return-3 ms to-gen .3 pl
They gave him a large stew pot. He took the large one. He did not return it
to them.
(70) baʿad esbuʿa ezgā-Ø šəḇāḇā qam Joḥa.
after week go.pst -3 ms neighbour to Joḥa
əmall-Ø-i qazġān-d-e elli=ye?
say.pst -3 ms -3 ms cooking_pot-gen -1 s where=cop .3 ms
A week later, the neighbour went to Joha. He said to him “Where is my
cooking pot?”
(71) əmall-Ø-i qazġān meṯ-Ø. əmall-Ø-(i) əl-Joḥa

say.pst -3 ms -3 ms cooking_pot die.pst -3 ms say.pst -3 ms -3 ms to-Joḥa
man qabul q-āḇed-Ø? qazġān mujur
who acceptance ind -do.prs -3 ms cooking_pot how
qə-māyeṯ-Ø?
ind -die.prs -3 ms
He said “The cooking pot has died.” He said to Joha “Who believes that a
cooking pot could die?”
(72) Joḥa gəheḵ-Ø əmall-Ø-i man qabul q-āḇed-Ø
Joḥa laugh-3 ms say.pst -3 ms -3 ms who acceptance ind -do.prs -3 ms
qazġān tam-Ø qār-i ṣṭāna ʿād qabul
cooking_pot become.pst -Ø by-3 ms boy then acceptance
lá-q-āḇed-Ø mujur meṯ-Ø?
neg - ind -do.prs -3 ms how die.pst -3 ms
Joha laughed, [and] said to him “Who would believe that a cauldron has
given birth to a boy, but not believe that it has died?”’
Abbreviations

d = D-suffix (i. e. direct, nominative pronominal suffix)
l = L-suffix (i. e. oblique pronominal suffix)
pvp = post-verbal particle
ez = ezafe
References
Borghero, Roberta. 2004. Some linguistic features of a Mandaean manuscript from the sev-
enteenth century’. Aram 16. 61–83.
Dixon, Robert M. W. 1994. Ergativity. Cambridge Studies in Linguistics 70. Cambridge:
Häberl, Charles. 2009. The Neo-Mandaic Dialect of Khorramshahr. Semitica Viva 45.
Häberl, Charles. 2011. Neo-Mandaic. In Stefan Weninger, Geoffrey Khan, Michael Streck
& Janet Watson (eds.), The Semitic Languages: An International Handbook, 725–737.
Berlin & Boston: De Gruyter Mouton.
Heinrichs, Wolfhart. 2002. Peculiarities of the verbal system of Senaya within the frame-
work of North Eastern Neo-Aramaic (NENA). In Werner Arnold & Hartmut Bobzin
(eds.), “Sprich doch mit deinen Knechten Aramäisch, wir verstehen es!” 60 Beiträge
uur Semitistik. Festschrift für Otto Jastrow zum 60. Geburtstag, 238–268. Wiesbaden:
Harrassowitz.
532 Geoffrey Khan
Hopkins, Simon. 1999. The Neo-Aramaic Dialects of Iran. In Shaul Shaked & Amnon Net-
zer (eds.), Irano-Judaica IV, 311–327. Jerusalem: Magnes.
Hopkins, Simon. 2002. Preterite and perfect in the Jewish Neo-Aramaic of Kerend. In Wer-
ner Arnold & Hartmut Bobzin“Sprich doch mit deinen Knechten Aramäisch, wir ver-
stehen es!” 60 Beiträge zur Semitistik. Festschrift für Otto Jastrow zum 60. Geburtstag,
281–298. Wiesbaden: Harrassowitz.
Israeli, Yaffa. 1998. Ha-Aramit ha-ḥadashah shebe-fi Yehude Saḳiz (derom Kurdisṭan). Je-
rusalem: Hebrew University dissertation.
Kahn, Margaret. 1976. Borrowing and variation in a phonological description of Kurdish.
Michigan: University of Michigan dissertation.
Kalin, Laura M. 2014. Aspect and argument licensing in Neo-Aramaic’. Los Angeles: Uni-
versity of California dissertation.
ies in Semitic Languages and Linguistics 44. Leiden & Boston: Brill.
gias Press.
Khan, Geoffrey. 2012. The evidential function of the perfect in North-Eastern Neo-Aramaic
dialects. In Rebecca Hasselbach & Na’ama Pat-El (eds.), Language and Nature. Papers
presented to John Huehnergard on the occasion of his 60th Birthday, 219–228. Studies
in Ancient Oriental Civilization 67. Chicago: University of Chicago.
Press.
Macuch, Rudolf. 1965. Handbook of Classical and Modern Mandaic. Berlin & Boston: De
Gruyter Mouton.
Macuch, Rudolf. 1989. Neumandäische Chrestomathie: Mit grammatischer Skizze, kom-
mentierter Übersetzung und Glossar. Porta Linguarum Orientalium. Neue Serie 18.
Macuch, Rudolf. 1993. Neumandäische Texte im Dialekt von Ahwāz. Semitica Viva 12.
Macuch, Rudolf & Estiphan Panoussi. 1974. Neusyrische Chrestomathie. Porta Linguarum
Orientalium. Neue Serie 13. Wiesbaden: Harrassowitz.
Mutzafi, Hezy. 2014. Comparative lexical studies in Neo-Mandaic. Studies in Semitic Lan-
guages and Linguistics 73. Leiden & Boston: Brill.
Panoussi, Estiphan. 1990. On the Senaya Dialect. In Wolfhart Heinrichs (ed.), Studies in
Neo-Aramaic, 107–129. Atlanta: Scholars Press.
Panoussi, Estiphan. 1991. Ein vorläufiges Verbglossar zum aussterbenden Neuaramäischen
Senaya-Dialekt. Rivista Degli Studi Orientali 65(3/4). 165–183.
Tsereteli, Konstantin. 1990. The velar spirant ġ in Modern East Aramaic dialects. In Wolf-
Scholars Press.
4.5. Hawrāmī of western Iran
Parvin Mahmoudveysi and Denise Bailey
1. Introduction
1.1. Background
Hawrāmī is spoken in the Hawrāmān (Persian: Ourāmān) region located in the
northern Zagros mountains in northwestern Iran and northeastern Iraq. Hawrāmī
is a variety of the language that is identified by the historical name of Gorani (or
Gūrānī), traditionally classified as Northwestern Iranian. Gorani, in turn, can be
described as a continuum of closely related varieties, which nonetheless forms
what is essentially a single language. In Iran, Hawrāmī shares this continuum with
the endangered varieties of Kandūlayī, Zardayāna, and Gawraǰūyī, as well as with
others no longer in active use, such as Riǰābī, Gahwāraī, and Bēwanīǰī.1 In Iraq,
Hawrāmī shares this continuum with Bāǰaɫānī, which is historically spoken near
Sar Pol-e Zohāb but also spoken northwest of Mosul, as well as with the varie-
ties identified as Šabakī, Kākayī, Šēxānī, Zangana, and Rožbayānī (see Haig, this
volume, p. 297, for a map).2 In comparison to the other Gorani varieties, Hawrāmī
is considered to be a “more archaic variant in many ways” (MacKenzie 2002: n.p.)
and “probably […] the best preserved of the group” (MacKenzie 1966: 4). Mac-
Kenzie’s characterization is supported by the relatively complex morphology of
Hawrāmī, which includes extensive use of grammatical gender, regular oblique
case marking, the Imperfect, and two forms of the ezāfe particle.3
Much of known Gorani history has been in close association with Kurdish.
Gorani had a highly significant role as the literary and court language of the
Kurdish Ardalān dynasty from the fourteenth to nineteenth centuries (MacKenzie
1

See Mahmoudveysi (2016: 3).
2

Actual names for Gorani and varieties can differ, depending on speaker and location
(see Leezenberg 1993: 7). In this study, we use the name ‘Gorani’ to refer to the gen-
eral northwestern language and ‘Hawrāmī’ to refer to the specific cluster of varieties
spoken in the Hawrāmān area (for more discussion of the identification of Gorani, see
Mahmoudveysi, Bailey, Paul, and Haig 2012: 2–4).
3
Gorani has been the focus of scholarly research since at least the early 1800s, with
Hawrāmī figuring in many studies, including Benedictsen and Christensen (1921),
Mann and Hadank (1930), Christensen (1936), and MacKenzie (1961, 1966, 1987b,
2002), Blau (1989), Naqšbandī (1996), Paul (2007), Holmberg and Odden (2008),
Mahmudweyssi and Haig (2009), Zolfaqari (2010), and Bistoon, Gheitury, and Kazzazi
(2013).
534 Parvin Mahmoudveysi and Denise Bailey
2002: n.p.). They share many linguistic features that serve as evidence of contact
and convergence from centuries of proximity.
Hawrāmī is mainly found within the national boundaries of Iran. It is used
in the Hawrāmān region, which is located in Kermanšāh and Kurdistan prov-
inces and situated between the cities of Marīvān, Sanandaǰ, and Pāwa (Persian:
Pāveh). There are more speakers of Hawrāmī than of the other Gorani varieties
in Iran, such as those in Zarda, Gawraǰū, and Kandūla.4 Hawrāmī is also spoken
in Iraq, where the western part of Hawrāmān includes the settlements of Tawēɫa
and Bīyāra. Hawrāmī speakers live in Xurmāl and in larger Kurdish cities as well,
including Halabǰa.5
The Hawrāmān region is constituted of four main parts: 1) Hawrāmān-i Luhōn;
2) Hawrāmān-i Taxt; 3) Šāmyān and Dizlī; and 4) Hawrāmān Řazāw-u Kamara.6
There are also two other parts: 5) Hawrāmān Gāwaro; and 6) Hawrāmān Žāwaro
(these might be a part of Řazāw-u Kamara).7 The first part, Hawrāmān-i Luhōn,
has about thirty-four settlements in Iran, nine of which are populated by speak-
ers of Jāfī, a dialect of Central Kurdish. The settlements in Hawrāmān-i Luhōn
include the historical center, Nawsūd, and also the city of Pāwa and villages
such as Hāna Garmaɫa, Kaymna, Dizāwar, Nodša (Notša), Haǰīǰ, Narwē, Šošmē,
Qaɫāgā, Darmūr, Xānagā, Girāɫ, Šēxān, and Naysāna. Settlements in Iraq include
Darē Mař, Sargat, Guɫp, Baxa Kon, Dagā Šēxān, Sosakān, Bīyāra, and Tawēɫa.
The second part, Hawrāmān-i Taxt, consists of the center, Šār-i Hawrāmān, and the
villages of Waysīā, Kamāɫa, Sar-u Pīrī, Silēn, Biɫbar, Žīwār, Kalǰī, Nāwa, Nwēn,
Daɫ, Daɫamarz, Zom, and Asparēz, as well as Bānī Binok in Iraq. The third part,
Dizlī (Dezlī) (and Šāmyān and Dizlī), includes the center, Dezlī, and the villages
in Iran of Bārāmāwā, Zalka, Qaɫaǰē, Qaɫāgā, Tifɫīya, Tāzāwā, Gořgayī, Tēžtēž,
Tāɫawān, Xošāw, Xirosa, and Dara, and in Iraq, as Hāna, Quɫē, Zaɫm, and Aħmađ
Āwā. The fourth part, Hawrāmān Řazāw-u Kamara, consists of the center, Řazāw,
with the villages of Kařāwa, Dēwaznāw, Dagāgā, Xānagā, and Nāwa. The fifth and
sixth parts, included by Hawrāmānī (2001), are Hawrāmān Gāwaro, with villages
of Darwēšān, Galēn, Dē Moɫē, Hašamēz, Tawrēbar, Wasē, and Tāw Doɫaw; and
the area of Hawrāmān Žāwaro, with the villages of Māzībin, Borīdar, Čamšīdar,
4
For Gorani of Gawraǰū, see Mahmoudveysi, Bailey, Paul, and Haig (2012); for Gorani
of Zarda, see Mahmoudveysi and Bailey (2013); and for Gorani of Kandūla, see Mann
and Hadank (1930).
5
For the locations of these varieties, see Haig (this volume, chapter 3.3, Fig. 4). Fur-
ther information on locations is found in Mahmoudveysi (2016: 6–7) and MacKenzie
(1966: 5). In Iraq, Gorani and its varieties are sometimes referred to collectively as
‘Māčo’ (or also ‘Hawrāmī’, as noted by Leezenberg 1993: 7).
6
See also MacKenzie (1987a), MacKenzie (1966: 5), Paul (2007: 287), and Zolfaqari
(2010: 324) for parts.
7
See Hawrāmānī (2001: 617–619) for these areas and names of villages.

Hawrāmī of western Iran
535
Figure 1: Locations of Hawrāmi mentioned in this chapter

Bēsārān, Žanīn, Tangībar, Pālingān, Žirēža, Hāna-w Ħisayan Bagī, Sūra Tifē, Dēr
Moɫē, Kēɫāna, and Takya.8
1.2. Names
The name Hawrāmī is used by people of the region to designate in general the
speech of the settlements in Hawrāmān. More specific names include Pāwayī,
used by people from Pāwa to refer to their own speech, in which case Hawrāmī is
then used as a reference to the speech of Nawsūd.9
1.3. Literature
Gorani is used for most of the sacred literature of the Yārsān (Ahl-e Haqq) com-
munity. Gorani is especially known for its literature produced from the fourteenth
to the nineteenth centuries, with its golden age from the seventeenth to nineteenth
centuries. It was the literary language for all Kurdish and Gorani speakers in the
entire area from Ilam to Kirkuk. The most prestigious of the Gorani poets, Sayyed
Abdul Karim Tawgozī, also known as Mawlawī Kurd (1806–1882), was in fact
primarily a Kurdish speaker who composed all of his poetry in Gorani.
Gorani literature developed and flourished during the Ardalan dynasty (1169–
1867). After the downfall of the Ardalan rulers, Gorani declined in its position
as the language of poetry, and Sorani Kurdish poetry gradually took on this role
and gained prestige through the support of the rulers of Sulaymaniya. Gorani
poets include Mullā Mustāfā Besārānī (1642–1701), Xānāy Qubādī (1700–1759),
Saydī Hawrāmī (1784–1852), Ranǰūrī (1750–1809), Ghulam Razā Arkawāzī
(1775–1840), Mirzā Šafīʕ Jāmarēzī (1776–1836), Aħmad Bag Komāsī (1798–
1878), Malā-y Jabbārī (1806–1876), and Sayyed Abdul Karīm Tawgozī, known as
Mawlawī Kurd (1806–1882). There are many other Gorani poets; these names are
only a selection of them.
8
A new administrative division of cities and villages in the Kurdistan province in Iran
includes Sarvābād, consisting of Sarvābād and Hawrāmān Taxt. This also means that
the Hawrāmān region is divided into two parts: 1) Hawrāmān Taxt (villages: Hawrāmān
Taxt with Sar-u Pīrī, as a unit, Kamāɫa, Řēwar, and Wīsīan); and 2) Šālyār (villages:
Silēn, Biɫbar, Žīwār, Wargawīr, Nwēn, Kalǰī, Nāw, ʕabāsābād, and Asparēz) (according
to Sadeqi, n.d.).
9
Other local names for Hawrāmī spoken near Pāwa may be derived from the village
name, for example, Xānagāyī, spoken in Xānagā, or Dišayī, spoken in Diša.
Hawrāmī of western Iran 537
2. Background to the Hawrāmī speech community (Pāwa and Nawsūd)
The city of Pāwa (Persian: Pāveh) belongs to the county (šahrestān) of Pāveh in
Kermānšāh province. It is located approximately 120 kilometers northwest of Ker-
mānšāh. Pāwa is the capital; other towns include Nawsūd, Nodša, Bānawřē, and
Bayangan. Hawrāmī is spoken in Pāwa and Nawsūd, while Jāfī Kurdish is spoken
in Bānawřē and Bayangān. Jāfī is also spoken in most of the villages between
Bānawřē and Pāwa, as well as in about four villages between Pāwa and Nawsūd.
Hawrāmī is used in the remainder of the villages.
Recent estimates of Gorani speakers range from 200,000–300,000 (Paul
2007: 285) up to perhaps more than 500,000 (Mahmoudveysi 2016: 8). The ‘whole
Avromani ethnic continuum’ is estimated at 80–90,000, with about 50,000 in Iran
and about 30–40,000 in Iraq (Zolfaqari 2010: 238). The population of Pāwa and
villages is estimated to be around 56,000; that of Nawsūd is estimated at 7,984;
and that of Nodša as 3,547 (as of the 2006 census). The population of Sarvābād is
about 70,000 people.10
Traditional daily life involved animal husbandry of sheep, goats, and cattle, with
production of milk, yogurt, butter, and cheese. This work, however, has become
less common in modern life. Most people tend gardens and orchards, growing
walnuts, figs, apples, pomegranates, and grapes, as well as tomatoes and okra,
which are ingredients in traditional Hawrāmī dishes such as Doyna and Šalamīna.
Most people in Pāwa are well educated, with an estimated literacy rate of 99
percent of the population between 10 and 49 years of age.11 Men and women are
trained in many types of professions and work in schools, clinics, and offices.
People in the Hawrāmān area primarily follow traditions of Sunni Islam. Haw-
rāmān is an important location for two Sufi (mystical Islamic) orders. Pāwa is one
of the main locations for both orders, while the village of Neǰar, close to Pāwa, is an
important location for the Qāderīya order. The region also exhibits historical influ-
ence of other religions. One of the largest ancient fire temples of Zoroastrianism is
found in Pāwa (Shahbazi, n.d.).12 It is on a high mountain of the same name, Ātašgā.

As mentioned above, the primary languages of this area are Hawrāmī and Jāfī, a
variety of Central Kurdish. Hawrāmī is spoken in the cities and in most villages.
10
See Mahmoudveysi (2016: 7–8).
11
See Paveh Press (2013), with this percentage estimated by Eqbal Zomorodi, Director of
the Office for the Abatement of Illiteracy.
12
In Gawraǰū and Zarda villages, located south of Hawrāmān, Gorani speakers are of
Yārsān (Yārī, Ahl-e Haqq) background. In Kandūla, the community is Twelver Shiʿite
(as noted by Paul 2007: 285).
It is used by an estimated 70 percent of the population of the Hawrāmān region

in Pāwa, Nawsūd, and the villages between Pāwa, Nawsūd, and Marīwān. Jāfī is
spoken in villages between Řawānsār and Pāwa, including Šimšēr, Darabāyān,
Nosma, Čoržī, Bindara, and Dūrīsān, and also between Pāwa and Nawsūd, in the
villages of Niǰār, Noryāw, Wiřā, and Komara.
In terms of language use, both Hawrāmī and Jāfī are spoken within the family
domain. Hawrāmī is normally spoken at home and learned by all children. Hawrāmī
is used by speakers in work-related and marketplace domains as well. Hawrāmī
has high prestige, and people have very positive attitudes towards it. In terms of
multilingualism, it is observed that, generally, Jāfī speakers do not learn Hawrāmī,
with rare exceptions. Most Hawrāmī speakers, however, are able to speak Jāfī.
When Hawrāmī and Jāfī speakers converse, they generally choose to do so in Jāfī.
There are still a number of monolingual speakers of Hawrāmī, mostly women
of the older generations. These monolingual speakers can nonetheless communi-
cate to a degree with Jāfī speakers, with each participant using their own language.
There is no official education in Hawrāmī, though some young educated
people have begun teaching Hawrāmī courses in Marīwān, Sanandaǰ, Pāwa, and
the vicinity. A Hawrāmī festival takes place annually in Halabǰa or Sulaymaniya.
Media include radio programs, and in recent years, an increasing number of local
writers are producing literature in Hawrāmī. A magazine, ‘Hawrāmān’, is regu-
larly published, and there are children’s books, short stories, and poetry, including
a translation of Gulestan by the great Iranian poet Saadī.
Persian, as the official language of the Islamic Republic of Iran, is also spoken
in the area. Modern Standard Persian is used in all official institutions, such as
schools, hospitals, and the military, and also in the media. It is the language of wider
communication for people who have come from other Iranian cities. It is learned by
children in school and through exposure to television, but it is not typically used in
the Hawrāmī family domain. Colloquial Persian is common in tourism and trade.
2.2. Present study

The rest of this study summarizes some of the linguistic features of Hawrāmī from
Nawsūd and from Pāwa. The description of Hawrāmī of Nawsūd is based on the
work of MacKenzie (1966). The description of Hawrāmī of Pāwa is based on
material from two native speakers: Parvin Mahmoudveysi, one of the authors of
this study, from Pāwa, forty-six years of age, who also speaks Kurdish, Persian,
German, and English; and Jahangir Mahmoudveysi, also from Pāwa, fifty-six years
of age, who also speaks Kurdish, Persian, and English. Jahangir Mahmoudveysi is
a writer and author of poems and short stories, including the story presented at the
end of this study, a contribution that is gratefully acknowledged.13
13
The authors would like to offer sincere appreciation to Jahangir Mahmoudveysi for his
In this description, language material can be identified as from Pāwa or from

Nawsūd according to the sources. Pāwa material comes from the text by Jahangir
Mahmoudveysi at the end of this study (bracketed numbers refer to these exam-
ples), from Mahmoudveysi and Bailey 2004, abbreviated as [M&B 2004], or from
Mahmoudveysi 2017, abbreviated as [P.M. 2017]. Unspecified Pāwa material is
also from Parvin Mahmoudveysi. Nawsūd material is from MacKenzie (1966),
abbreviated as [D.M. 1966].
3. Phonology
The Hawrāmī material in this study is written with a mostly phonemic orthog-
raphy.14 Material in this section on phonology is also transcribed in IPA (Inter-
national Phonetic Alphabet; International Phonetics Association 2015/2005). The
table below shows the correspondence of the orthographic symbols to the IPA
symbols.15
Table 1: Orthographic symbols with IPA correspondences

Vowels Consonants
ī [iː] p [pʰ] n [n] č [tʃʰ]
i [ɪ-ɨ] b [b] l [l] ǰ [dʒ]
ē [eː] t [tʰ] ɫ [ɫ] x [x]
e [ɛ] d [d] ř [r] gh [ʁ] (loan)
a [æ-ɛ-ə] đ [ð] r [ɾ] ħ [ħ]
ā [aː] k [kʰ] f [f] h [h]
ɔ [ɔ] g [g] s [s] y [j]
ō [oː] q [q] z [z] w [w]
u [ʊ] ʕ [ʕ] š [ʃ]
ū [uː] m [m] ž [ʒ]
contribution to this study and to Nicholas Bailey for his technical assistance and review
of the grammatical description. Special thanks is also due to Laurentia Schreiber for her
helpful editorial work.
14
Where relevant, Hawrāmī material from other sources (including MacKenzie 1966) is
adapted to this orthography to facilitate comparison. The orthography also includes the
symbol <đ>, which, as an allophone, does not have full phonemic status, but it is still
regarded here as important to signify.
15
Phonetic material is enclosed in square brackets, […], phoneme symbols are enclosed
by slash marks, /…/, and orthographic symbols are enclosed by brackets, <…>. Primary
stress is marked with [ˈ] before the syllable. Language material in the prose sections is
in italic font.
3.1. Consonants
Table 2: Consonant phonemes (IPA)
Bi- Labio- Alveolar Post- Palatal Velar Uvular Pharyn- Glottal
labial dental alveolar geal
Plosive pʰ b tʰ d kʰ g q
Affricate tʃʰ dʒ
Fricative f sz ʃʒ x (Pāwa: ħʕ h
loan ʁ)
Nasal m n
Trill r
Tap ɾ
Lateral l
Lat. ɫ
velarized
Semivowel w y
The voiceless plosives /p, t, k, q/ [pʰ, tʰ, kʰ, qʰ] have non-contrastive aspiration. A
non-phonemic glottal plosive [ʔ] is slightly audible before a word-initial vowel.
The voiced plosives /b/ and /g/ maintain obstruent articulation in various environ-
ments.
The voiced plosive /d/ is of particular interest on account of the ways it is real-
ized within the varieties in the Hawrāmān region and in the wider area. In Pāwa,
/d/ maintains obstruent articulation in word-initial and post-consonantal posi-
tions, such as following /ɾ/: wirdī [ˈwɪɾdi] ‘small’ [M&B 2004: P85]. In post-vo-
calic environments, however, /d/ undergoes lenition. It is realized as what might
be described as “a half-close central continuant caused by the tip of the tongue
approaching the upper teeth without making contact” (MacKenzie 1966: 7–8).
This sound in the Pāwa material seems to involve a slight quality of tongue-back-
ing and with the lips slightly spread. It is symbolized here with đ [ð]. Examples
include: ađā [ʔəˈða] ‘mother’ [M&B 2004: P31]; bađan [bəˈðən] ‘body’ [M&B
2004: P135]; diđān [dɪˈðan] ‘tooth’ [M&B 2004: P19] (also [dɪˈɹa]) [P.M. 2017];
and ēđ [ʔeð] ‘prox .3 sg ’ [M&B 2004: P3]. It seems to be close to that which is
described as the alveolar approximant [ɹ] by Naqšbandī (1996: 125), who also
notes devoicing at the end of a word, thus [ɹ̥ ], as in [maħˈmʷuɹ̥ ] ‘Mahmoud’ (our
stress transcription).
This phoneme maintains an obstruent articulation following /ɾ/ not only in
Pāwa, but also in the speech of the villages between Kāmyārān and Sanandaǰ.
Moreover, it is found further south in the Gorani variety of Zardayāna, as illus-
trated by bardan [bæɾˈdæn] ‘has taken’ (Mahmoudveysi and Bailey 2013: 150)
and in the variety of Gawraǰūyī, as illustrated by wardē [wæɾˈde] ‘has eaten’
(Mahmoudveysi, Bailey, Paul, and Haig 2012: 94; with possible reduction or dele-
tion word-finally or preceding a pause). In the village of Nawsūd, however, as well
as in the villages between Pāwa and Nawsūd, extending to Marīvān, the sequence
/ɾd/ is found as an approximant. For example, /ɾd/ in Pāwa karday ‘to do’ and
marday ‘to die’ is found as /ɹ/ in the corresponding words in the Nawsūd area.
The phoneme /d/ also undergoes lenition in the post-vocalic environment in
Zardayāna, where it can be realized as [ð] or [j], for example, mido [mɪˈðo] ‘gives’
and ād [ʔaj] ‘dist .3 sg ’, or in vowel length, as in ādam [ʔaːm] ‘human being’
(Mahmoudveysi and Bailey 2013: 13). The realization as [j] is also found in the
speech of Hawrāmī speakers outside of the Hawrāmān region, as illustrated by ayā
[ʔæˈja] ‘mother’, by speakers living in the city of Kermānšāh and in other cities
outside of Hawrāmān, such as Karaj and Tehran.
The sound can also be found as a lateral approximant. Thus, the word pro-
nounced by speakers in Pāwa as [χwoˈɹɑ] ‘God’ is pronounced by speakers outside
of Hawrāmān in the city of Kermānšāh as [χwoˈɾɑ]/[χwoˈlɑ], as described by
Naqšbandī (1996: 286, his transcription, our stress placement).
MacKenzie (1966: 8) mentions that /t/ in the second singular clitic pronoun
=it can be realized as a ‘continuant’ in word-final position. This realization is also
possible in Pāwa. The sound is provisionally symbolized a t̪ . (See (14), (20), (37)
in our glossed text at the end of this study.)
The fricative series exhibits some complexities. The sound /x/ only has a
voiced counterpart in loanwords, /ʁ/. The phoneme /f/ also has irregular instances
of a voiced counterpart. In Nawsūd and Nodša, /v/ is an alternative for /w/ in
word-initial position. In Pāwa, there is a sound that might be interpreted as [vʷ], as
in warwa [ˈvʷəɾwə] ‘snow’ [M&B 2004: P68]), or as this sound in free variation
with [w]: waratāw [wəɾəˈtʰaw] or [vʷəɾəˈtʰaw] ‘sun at noon’ [M&B 2004: P55];
winī [ˈwɪni] or [ˈvʷɪni] ‘blood’ [M&B 2004: P21]; and wirdī [wɪɾˈdi] or [vʷɪɾˈdi]
‘small’ [M&B 2004: P85]. Some words beginning with /w/, however, are more
consistently realized as such: wārān [waˈɾan] ‘rain’ [M&B 2004: P67].
There are distinctions between a glottal fricative /h/ hašt [həʃtʰ] ‘eight’ [M&B
2004: P107] and a voiceless pharyngeal fricative: ħaft [ħəftʰ] ‘seven’ [M&B
2004: P106]; between a velarized lateral approximant /ɫ/ and simple /l/ (Pāwa):
guɫ ‘flower’ and gul ‘dirty’; kaɫ ‘mountain goat’ and kal ‘partially broken’; and
between a trill /r/ and flap /ɾ/: hařa [həˈrə] ‘mud’ and hara [ˈhəɾə] ‘donkey’ [P.M.
2017].16 Neither /ɫ/ nor /ɾ/ occur in word-initial position. The voiced pharyngeal
fricative /ʕ/ is found mainly in loanwords from Arabic.
Preceding a pause (word-finally), /n/ and /g/ occur as a velar nasal [ŋ]: māng
16
The general term for ‘flower’ in Hawrāmī is wilī, while guɫ is also found in literary
Hawrāmī.
[maŋ] ‘moon’ [M&B 2004: P57]. A velar nasal can precede a velar plosive in
word-medial position: řangat [rəŋgət] ‘your color’ (14).
There are only a few instances of gemination. In our Pāwa text, an example
appears in enna ‘so much’ (22). Paul (2007: 289–290) also describes a particular
type of nasal gemination developing from a combination of /n/ and /d/ in intervo-
calic environments.
3.2. Vowels
Table 3: Vowel phonemes (IPA)
Front Central Back
Close i u
ɨ (P: ɪ-ɨ) ʊ
Close-mid e o
ɛ
Open-mid N: æ (æ-ɛ), P: ə (ɛ-ə) N: ɔ
Open a
As noted for Nawsūd by MacKenzie (1966: 8–9) and also found in Pāwa, the
phoneme /ɨ/ (MacKenzie’s transcription: ɪ, our orthography: i) is often elided
between consonants in syllables that do not bear the primary stress of the word:
wit ‘he slept’ and ˈnawt ‘he did not sleep’ [D.M. 1966: 9, his gloss corrected here].
MacKenzie also notes that the phonemes /ɨ/ and /ʊ/ tend to be reduced in initial
syllables in some contexts.
The vowel /ɔ/ in Nawsūd, but not Pāwa, has limited distribution, only occur-
ring in the postposition =(a)wa after a consonant or vowels /i/ and /u/ (MacKenzie
1966: 10). This vowel /ɔ/ appears to correspond to the sequence /aw/.
In Nawsūd, /æ/ (MacKenzie’s transcription: a, also our orthography) is described
as ranging from “an open front [a], normally” (that is, assumed here as IPA: [æ])
“to near half-open [ɛ]” (MacKenzie 1966: 9). In the Pāwa material, this vowel
can also be described as having a range of phonetic realizations from [ɛ] to [ə].17
3.3. Stress
Stress is not entirely predictable, though it tends to occur on the final syllable of
stems.18 It occurs regularly on morphemes marking plural, the demonstrative par-
ticles =ˈa and =ˈē, subjunctive ˈbi-, and negation markers. It occurs on the final
17
Our analysis of vowel phonemes and the ranges of realization is provisional.
18
For discussion of stress in Hawrāmī, see Mahmoudveysi (2016: 76–80).
syllable of the definiteness markers. The position of word stress can mark a change
in meaning, which can be illustrated by the shift to the first syllable of multisyl-
labic proper nouns to indicate vocative case (see also MacKenzie 1966: 21). For
example: Paˈrī is realized as ˈParī in vocative case, similarly Rēˈbīn as ˈRēbīn
[P.M. 2017].
3.4. Common phonological processes

There is a tendency towards devoicing /ɾ/, /z/, and /ʒ/ preceding a pause (word-fi-
nal position): ser [sɛɾ / sɛɾ̥] ‘head’ [M&B 2004: P11]; āyir [ʔaˈjɪɾ / ʔaˈjɪɾ̥] ‘fire’
[M&B 2004: P51]; sawz [səwz / səws] ‘green’ [M&B 2004: P78]; and dirēž [dɪˈɾeʒ
/ dɪˈɾeʃ] ‘long’ [M&B 2004: P91]. Voiced plosives tend to be devoiced and also
unreleased preceding a pause: zerd [zɛɾd̚ / zɛɾt̚] ‘yellow’ [M&B 2004: P79]; serd
[sɛrd̚ / sɛrt̚] ‘cold’ [M&B 2004: P88].
Devoicing is important in the formation of certain contracted forms whereby
the initial consonant in the simple prepositions ba ‘to, with’ and ǰa ‘from, to, in’ is
devoiced when combined with a postposition or other element. For example, the
absolute preposition pana ‘to, by’ is actually a contraction of ba ‘to’ + =ana (post-
position) ‘over, in’; while the absolute preposition čana ‘in’ is a contraction of ǰa
+ =ana (see MacKenzie 1966: 55). Another example in the Pāwa material is pā ‘to
that’ (3), a contraction of ba ‘to’ + ā ‘distal demonstrative adjective’.
Another process is assimilation of certain vowels. For example, the final
short vowel /a/ in a stem undergoes assimilation to an initial vowel ē in a follow-
ing suffix (see MacKenzie 1966: 11), as illustrated in Pāwa: hāna ‘spring’, but
hān-ēwa ‘spring-indf ’ [P.M. 2017]. A stem-final vowel ī can be assimilated to
a following a: twānī ‘stone’, but twān=a ‘stone=compd ’ (22). The initial vowel a
of the definiteness marker -aka ‘def . m ’ (-akē ‘def . f ’, -akē ‘def . pl ’) may also be
assimilated: zāwřo-ka=t ‘child-def . m =2 sg ’ (‘your child’) (34).
4. Morphosyntax
4.1. Nominal morphosyntax

The main categories associated with Hawrāmī nominals are grammatical gender,
case, number, definiteness, and indefiniteness.19
19
The categories of gender, case, and number are also marked on adjectives in agreement
with nouns they modify (MacKenzie 1966: 13). Due to space limits, this topic is not
discussed in our study.
Grammatical gender as masculine or feminine is evident in the form of the noun

stem. A masculine noun stem can end in a consonant, a stressed vowel -a, -ī, -o, or
ū, or often in the stressed vowel -ā (MacKenzie 1966: 13). For example (Pāwa):
hēˈɫa ‘egg’, hanˈnī ‘watermelon’, dˈro ‘lie’, žūˈžū ‘hedgehog’, and pīˈā ‘man’
[P.M. 2017]. A feminine noun stem can end in the stressed vowel -ē, unstressed -a
or ī, or sometimes stressed -ā. For example: samāˈwara ‘samovar’, ˈtwanī ‘stone’,
wāˈɫē ‘sister’, ˈāwī ‘water’, and dāˈđā ‘grandmother’ [P.M. 2017].
Case is distinguished as direct and oblique (see MacKenzie 1966: 14–15).
Direct case is not overtly expressed on singular nouns. It is fused with number on
plural nouns. Oblique case for singular nouns distinguishes masculine and femi-
nine. It is also fused with number on plural nouns (see below). Oblique case on
masculine singular nouns is marked with -ī (-y following vowel). Oblique case on
feminine singular nouns is -ē (allomorphs: stem-final unstressed -a assimilates to
-ē; stem-final -ā is raised to -e [ɛ] in Nawsūd, but to -āy in Pāwa). For example:
warg ‘wolf’ (direct), warg-ī ‘wolf-obl .sg . m ’, mangāw ‘cow’ (direct), mangāw-ē
‘cow-obl .sg .f ’, karga ‘hen’ (direct); karg-ē ‘hen-obl .sg .f ’, ađā ‘mother’ (direct)
[examples from P.M. 2017], ađ-e ‘mother-obl .sg .f ’ [D.M. 1966: 15] (Nawsūd,
oblique), but ađ-āy ‘mother-obl .sg .f ’ [P.M. 2017] (Pāwa, oblique).
Oblique case marks nouns functioning as: 1) the complement of an adposition;
2) the dependent element in a genitive (ezāfe) construction; and 3) the argument
expressing object in a present tense clause. It can also mark 4) the agent of a past
transitive clause (see MacKenzie 1966:13), though there are no instances of this
in our Pāwa text. Examples include (Pāwa): 1) ǰa warg-ī matarsū ‘from wolf-
obl .sg . m I fear’ (‘I fear the wolf’)[P.M. 2017]; 2) dāstāna=w dāđā-y ‘story=ez _2
grandmother-obl .sg .f ’ (‘Grandmother’s story’) (1); 3) mansūr-ī nimažnāsū
‘Mansūr-obl .sg . m I do not know’ (‘I don’t know Mansūr’) [P.M. 2017]; and in
Nawsūd, 4) wārān-ī nāst ‘rain-obl .sg . m did not permit’ (‘the rain did not permit’)
[D.M. 1966: 51].
Number is marked and also fused with case. Plural number and direct case is
expressed by -ē (stem-final vowel ā to e [ɛ], or stem-final vowel a to ē). Plural
number and oblique case is expressed by -ā (stem-final ā is followed by -yā;
stem-final ē is assimilated to ā) (see MacKenzie 1966: 14–15). In Pāwa, the form
-ā occurs in written texts, whereas -ān is found in spoken speech. Examples
include: warg-ē ‘wolf-dir .pl ’ [P.M. 2017]; yān-ē ‘house-dir .pl ’ [D.M. 1966: 14]
(stem is yāna); ađ-e ‘mother-dir .pl ’ [D.M. 1966: 15] (stem is ađā); warg-ˈā ‘wolf-
obl .pl ’ [P.M. 2017] (Pāwa, written form); warg-ˈān ‘wolf-obl .pl ’ [P.M. 2017]
(Pāwa, spoken form); yān-ā ‘house-obl .pl ’ [D.M. 1966: 14] (stem is yāna); and
ađā-yā ‘mother-obl .pl ’ [D.M. 1966: 15] (stem is ađā).
Indefiniteness and singularity is marked in Nawsūd on a masculine singular
noun with -ēw and a feminine singular noun with -ēwa. In Pāwa, there is no distinc-
tion of gender in this form, and only the form -ēwa occurs. Stress remains on the
stem of the noun. Examples include: kuř-ēw ‘son-indf . m ’ [D.M. 1966: 66]; warg-
ēwa ‘wolf-indf ’ [P.M. 2017] (Pāwa); and yān-ēw ‘house-indf . m ’ [D.M. 1966: 16]
(stem is yāna); yān-ēwa ‘house-indf ’ [P.M. 2017] (Pāwa). There is another indef-
inite marker, -ya, for example: dēq-ya ‘sorrow-indf _2’ (22).
Nouns marked for plural (with case), but which show no additional marking of
definiteness/indefiniteness, can express a sense of indefinite plural, as ‘some’ (see
MacKenzie 1966: 15). For example: yān-ē ‘house-dir .pl ’ (i. e., ‘some houses’)
[P.M. 2017].
Nouns are marked for definiteness, or rather identifiability, by -aˈka (masculine
nouns) and -aˈkē (feminine nouns), identical to -aˈkē (plural) (following certain
stem-final vowels, the initial vowel of the morpheme is assimilated). For example:
warg-aka ‘wolf-def . m ’ [P.M. 2017]; yānaka ‘house.def . m ’ [D.M. 1966: 16];
ađā-kē ‘mother-def .f ’ [D.M. 1966: 16]; palawar-akē ‘bird-def .pl ’ [P.M. 2017];
and ađe-kē ‘mother-def .pl ’ [D.M. 1966: 16].20
4.1.2. Demonstrative particle and the demonstrative construction

The demonstrative particle distinguishes gender: =a ‘dem . m ’ and =ē ‘dem .f ’ (see
MacKenzie 1966: 16–17). The particle is stressed and attaches as an enclitic nor-
mally to the final element of a noun phrase construction consisting of a noun as
head, preceded by the proximal demonstrative adjective ī or the distal form ā.
The gender of the particle agrees with that of the head noun. The construction is
exemplified here (Pāwa): ī hanār=a ‘prox .dem .adj pomegranate=dem . m ’ (‘this
pomegranate’); and ā žan=ē ‘dist .dem .adj woman=dem .f ’ (‘that woman’) (stem
is žanī) [P.M. 2017]. This particle can also simply attach to a noun phrase in voca-
tive use, as in Pāwa: gīyān=a ‘life/soul=dem . ptcl . m ’ (‘O beloved one’) (21) (see
also MacKenzie 1966: 20).
4.1.3. Ezāfe and compound particles in noun phrase constructions

There are two main forms of ezāfe particles (both unstressed). The first is =ī (=y
following some vowels; absent following -ī, stressed -ī, or -ē), here glossed as
‘ez _1’. The second is =ū (=w following vowel), glossed as ‘ez _2’. The ezāfe
particle attaches to a head noun phrase and links it with a dependent element in
what is termed the “ezāfe construction” (see MacKenzie 1966: 17–19). The ezāfe
particle =ī links a head noun phrase with an attributive adjective (or a noun in
apposition), while the ezāfe particle =ū links a head noun phrase with another
noun phrase, pronoun, preposition, or adverb.
20
The segmentation between the stem-final vowel of yāna and the initial vowel of -aka is
not clear.
Another linking particle is the “compound marker” =a. The compound marker
has a particular use in definite noun phrases (discussed in relationship to =ī ‘ez _1’
below).
The ezāfe construction with =ī is exemplified here: z(i)msān=ī čapaɫ ‘win-
ter=ez _1 dirty’ (‘dirty winter’) (2); kitēb=ī sīāw ‘book=ez _1 black’ (‘black book’)
[D.M. 1966: 17]; and yān-ēw=ī gawra ‘house-indf . m =ez _1 big’ (‘a big house’)
[D.M. 1966: 17] (stem is yāna).
The ezāfe construction with =ī can be found on a head noun that is unmarked
or marked for indefiniteness (the ezāfe then follows that marker; see preced-
ing example, yān-ēw=ī gawra). When a head noun is specified for definiteness
(marked with either a definiteness marker, or modified by a demonstrative), the
ezāfe =ī is not used. Instead, the compound marker =a is employed in what is
termed the “open” compound construction (see MacKenzie 1966: 18). Examples
include: kitēb=a sīāw-aka ‘book=compd black-def . m (‘the black book’) [D.M.
1966: 18]; and ā kināč=a zarīf=ē ‘dist .dem .adj girl=compd beautiful=dem .f ’
(‘that beautiful girl’) [D.M. 1966: 18] (stem is kināčē).
The ezāfe construction with =ū is illustrated here: yāna=w xuđā-y ‘house=ez _2
God-obl .sg . m ’ (‘the house of God’) (46). It can also link several elements: sar=ū
bān=ū mizgī=wa ‘on=ez _2 roof=ez _2 mosque=postp #_1’ (‘on the roof of the
mosque’) (57).
4.2. Pronouns and the reflexive
4.2.1. Independent pronouns

Hawrāmī has independent as well as clitic forms of personal pronouns. The inde-
pendent forms consist of a set of first and second person pronouns as well as two
sets of demonstrative pronouns with third person reference (for tables, see Mac-
Kenzie 1966: 24–25, with minor modifications).
The first and second person pronouns distinguish person and number but not
case:
Table 4: First and second person pronouns

Person Singular Plural
1 min ēˈma
(Pāwa: amin)
2 to šiˈma
There are two sets of demonstrative pronouns. Those of the first set can be used
pronominally in third person reference and also as demonstratives distinguished
for distance, as proximal and distal. Forms of the second set are noted as “purely
demonstrative” (MacKenzie 1966: 24). The sets are shown here (some forms are
not segmented further):
Table 5: Demonstrative pronouns, Set 1 and Set 2

Set 1 Set 2
Proximal Distal Proximal Distal
Singular Direct M. ēđ āđ īˈna āˈna
F. ˈēđa ˈāđa īˈnē āˈnē
Singular Oblique M. ˈēđī ˈāđī īnaˈya ānaˈya
F. ˈēđē ˈāđē īˈnē āˈnē
Plural Direct ˈēđē ˈāđē īˈnē āˈnē
Plural Oblique ēđīˈšā āđīˈšā īnīˈšā ānīˈšā
4.2.2. Clitic pronouns

Clitic pronouns distinguish person (first, second, third) and number (singular and
plural). (Forms in parentheses are allomorphs following vowels.)
Table 6: Clitic pronouns

1 =im (=m) =mā
2 =it (=t), variant: =it̪ =tā
3 =iš (=š) =šā
Clitic pronouns are used in functions that typically require oblique case. Most of
the functions occur in our glossed text at the end of this study: 1) possessor in a
noun phrase (14), (25), (34); 2) complement of adpositional phrase (26), (48); 3)
direct object (O) argument of present tense clause (2), (13); 4) agent (A) argument
of past tense clause (11), (24), (36), (41), (54); 5) specifying referent on reflexive
wē (4), (62); 6) experiencer or possessor in a copula clause (5), (34), (51); and 7)
indirect object argument (32).
4.2.3. Reflexive
The form of the reflexive is wē. It is always followed by a clitic pronoun that
specifies the person and number of the referent. It is used to express a referent that
is identical to that of an antecedent within the clause as agent or subject (or in an
imperative, the addressee, see example in MacKenzie 1966: 27). Uses of the reflex-
ive are (with examples from our glossed text): 1) to express possessor in possessive
constructions (53); 2) as complement of an adpositional phrase (43); 3) to express
a type of emphasis involving the referent (4); and 4) to express direct object (O) in
transitive clauses when the referent is the same as the agent (A), for example, wē=š
fārā ‘refl =3 sg disguise.pst ’ (‘she/he disguised her/himself’) [D.M. 1966: 27].
4.3. Verbal morphosyntax
4.3.1. Stems
A typical verb has two stems: past and present. These form the basis for finite verb
constructions. The past stem forms the basis for the non-finite verb constructions
of the participle and the “infinitive verbal noun”. The participle is formed from
the past stem and the ending -ˈa (unmarked or masculine), -ˈē (feminine), or -ˈē
(plural) (MacKenzie 1966: 36). The infinitive verbal noun is built with the ending
-ˈ(a)y (MacKenzie 1966: 28).
The form of a verb stem can be simple or complex. Examples of simple verb
stems include: kar- ‘do.prs ’ and kard- ‘do.pst ’; zān- ‘know.prs ’ and zānā- ‘know.
pst ’; (w)āč- ‘say.prs ’ and wāt- ‘say.pst ’; and ār ‘bring.prs ’ and āwrd ‘bring.pst ’
(see glossed text in Section 5).
Complex forms can be of three types: 1) a “light verb” construction; 2) a “post-
verbal” construction; and 3) a “preverbal” construction. The light verb construction
is formed with a nominal, an adjective, or an adverb as a first element and a “light
verb” as the second element. For example: tamāša kar- ‘looking (noun) do.prs’ (‘to
look at’) [D.M. 1966: 110]; and bar šī- ‘out (adverb) go.pst _2’ (‘go out’) (9).
The postverbal construction involves a simple stem combined with a particle
anˈa, arˈa, or awˈa (Pāwa) (Nawsūd: ɔ following ī, ū, or a consonant). These parti-
cles also occur in other contexts as postpositions (MacKenzie 1966: 31). They can
occur on a verb and give it a different lexical sense. For example: nīšt- ‘sit.pst ’ >
nīštara- ‘sit.down.pst ’; kard- ‘do.pst ’ > karđana- ‘put.on.pst ’; and ward- ‘eat.pst ’
> warđēwa- ‘drink.pst ’ [D.M. 1966: 31].
The preverbal construction is built with a simple verb stem and a preverb, hur
‘up’ (Pāwa hor), which also gives the stem a new sense. For example: gēr- ‘keep.
prs ’ > hor gēr ‘lift up.prs ’ (20).
As noted by MacKenzie (1966: 48–49), the present causative stem is formed
by adding -n added to a present stem, while the past causative stem is formed
by adding -ˈnā to a present stem: giraw- ‘weep > girawn- ‘make.weep.prs ’ and
girawnā- ‘make.weep.pst ’ [D.M. 1966: 49]. MacKenzie also notes that the present
passive stem is built with the addition of -īa, while the past passive stem is formed
by adding -īa to a present transitive stem, while the past passive stem is formed by
adding -īˈā- to a present transitive stem. For example: wān- ‘read’ > wānīa- ‘be.
read.prs ’ and wānīā- ‘be.read.pst ’ [D.M. 1966: 48].
4.3.2. Prefixes
The following prefixes indicate mood, aspect, and negation (see MacKenzie
1966: 32):
Table 7: Verbal prefixes

Use Form
Indicative mi- (Nawsūd) or ma- (Pāwa) (m- preceding
vowel or absent on some stems)
Subjunctive, Imperative bi- (b- preceding vowel; or absence of
marking and instead a stress shift to first
syllable of stem)
Negation (Present Indicative and ma- (nim- preceding ā and a), ni- (Pāwa)
Prohibitive constructions)
Negation (constructions other than na-
Present Indicative)
4.3.3. Verb suffixes: Present

The following verb suffixes are used in finite constructions formed from the present
stem and also in those formed from past stems in transitive clauses. Most language
material in the following tables is from MacKenzie (1966: 33), with our glosses
and notes. Forms in parentheses are normally the allomorphs following vowels.
The present suffixes are shown here. The form -ūna ‘-1 sg . prs ’ is found in
Pāwa (as well as in Nawsūd and Nodša) as a variant for first person singular. In
our glossed text, the form mainly occurs in subjunctive constructions, as in (13).
There also appears to be a second singular form -īna, as in (24) (not included in
the table below; more study is needed). The present suffixes function to mark the
person-number of the subject (S) argument of intransitive verbs and agent (A) of
transitive verbs in present tense clauses.
Table 8: Verb suffixes: Present

1 -ū (-w) -mē (-yˈmē)
-ūna
2 -ī (-y) -dē (-yˈdē)
3 -o -ā (Pāwa: ān)
The Imperfect is unusual in that even though it is formed from a present stem, it
is used to denote situations occurring in past time, with imperfective aspect. The
suffixes mark the person-number of the subject (S) and agent (A) argument in
Imperfect constructions.
Table 9: Verb suffixes: Imperfect

1 -ˈēnē -ˈēnmē
2 -ˈēnī -ˈēndē
(Pāwa: -ēšī)
3 -ˈē -ˈēnē
The suffix in an Imperative construction indicates the addressee:
Table 10: Verb suffixes: Imperative

Person-number Form
2 sg -a (unmarked following vowel)
2 pl -dē (-ydē)
4.3.4. Verb suffixes: Past

A different set of verb suffixes is found in constructions formed from past stems of
intransitive verbs. These suffixes mark the person-number of the subject (S) argu-
ment in past intransitive clauses. They also sporadically mark the person-number
of the object (O) argument in past transitive clauses. For third person singular, the
suffix also distinguishes gender (with masculine as unmarked).
Table 11: Verb suffixes: Past

1 -ā(nē) -īmē (-ymē), (-aymē)
2 -ī (-y), (-ay following ī) -īdē (-ydē) (-aydē)
3 (Unmarked for masc.) -ē (-y)
-a (fem.)
4.3.5. Copular forms

The next table shows the copular forms (as found in MacKenzie 1966: 34, 37, 38).
Present forms attach as enclitics; and the third singular masculine allomorph =n
follows a stressed vowel, while the allomorph =na follows an unstressed vowel:
Table 12: Copula verb: Present, Past, and Imperfect forms

Person-number Present Past Imperfect
1 sg =anā (=nā) ˈbīā(nē) ˈbēnē
2 sg =anī (=nī) ˈbīay ˈbēnī (Pāwa: bēšī)
3 sg masc =ā (=n, =na) bī bē
3 sg fem =ana (=na) ˈbīa bē
1 pl =anmē (=nmē) ˈbīaymē ˈbēnmē
2 pl =andē (=ndē) ˈbīaydē ˈbēndē
3 pl =anē (=nē) ˈbīē ˈbēnē
The copula forms the basis for constructions with a noun phrase complement (16),
(čarma is a feminine noun) probably an adpositional complement (no example
available), and an adjectival complement (12), (61). The copula also is used in
possessive constructions (34) and experiencer constructions (5), (15).
There is another third person singular form of the copula, mawo, only attested
once in our text (22), and also a further unanalyzed copula, nā (5), (22).
4.3.6. Verb constructions

The finite verb constructions are summarized here:
Table 13: Finite verb constructions

Construction Description Example
Present mi- (Pāwa: ma-), Present ma-wīn-ū ‘I see’ (15)
Indicative stem, Present verb suffix
Present Subjunctive bi-, Present stem, Present bi-sān-o ‘he/she may buy’ (37)
verb suffix
Present Continuous ma-, reduplicated stem řamāy maram-ūna ‘I am now
adverb with -āy, Present running’ [M&B 2004]
stem or Imperfect, Present
verb suffix21
Present bi-, Present stem, Imperative bi-đa ‘Give!’ [P.M. 2017]
Imperative suffix
Present Prohibitive ma- negation, Present stem, ma-đa ‘Don’t give!’
Imperative suffix [D.M. 1966: 38]
21
This construction is noted by MacKenzie (1966: 50) as a “Continuous” tense formed
from the stem-based adverb and either the Present Indicative or Imperfect.

Imperfect Present stem, Imperfect verb kar-ēnē ‘I was doing’ (44)
suffix
Past Perfective (Simple Past stem, Past verb suffixes lūwā ‘he went’ (54)
Past) (clitic pronoun with transi-
tive verbs)
Past Past stem, Past Conditional lū-enē ‘I would have gone’
Conditional verb suffixes [D.M. 1966: 46]
The Present and Past Perfect constructions, formed from a combination of finite
and non-finite verb forms, are summarized here:22
Table 14: Perfect constructions

Present Perfect Past participle, Present witē=nmē ‘we have slept’
copula [D.M. 1966: 39]
Past Perfect (Pluperfect) Past participle, Imperfect witē=bē ‘she had slept’
copula [D.M. 1966: 40]
Perfect Subjunctive Past participle, Present wita=bo ‘he may have slept’
Subjunctive copula [D.M. 1966: 39]
Perfect Conditional Past participle, Past wita=bīe ‘he would have slept’
Conditional copula [D.M. 1966: 40]
4.3.7. Clause argument indexing and alignment

The patterning of clause argument indexing in finite verb constructions is condi-
tioned by the tense and transitivity of the verb. In a present tense clause, subject
(S) of an intransitive verb or agent (A) of a transitive verb will be indexed on the
verb by means of a present suffix.
In past tense clauses, S of an intransitive verb is indexed on the verb with a past
suffix. O of a transitive verb is indexed with a past suffix. (This analysis is provi-
sional, since we do not have examples of all person-number markers in our text.)
22
Note that the participle in the Present and Past Perfect constructions can be marked
for masculine or feminine gender, or for plural number. It is also noteworthy that the
copula in the Past Perfect construction is a special form of the Imperfect copula in
which gender is distinguished in all singular person forms. In contrast, the independent
form of the Imperfect copula does not have gender distinctions (see next table with the
Imperfect copula).
In past tense transitive clauses, A is indexed by a clitic pronoun. If A is expressed

by a full noun phrase, the clitic pronoun still occurs as resumptive reference.23
In terms of case and alignment, a noun phrase is marked with oblique case
when expressing O in a present tense transitive clause, or A in a past tense transi-
tive clause. (We do not find an oblique-marked noun phrase expressing A in our
text, but an instance is found in MacKenzie 1966: 51, wārān-ī n-āst ‘rain-obl .sg . m
neg _2-let. pst ’ ‘the rain did not permit.’)
Types of patterning are exemplified by sentences from our glossed text:
Present tense clauses:

S in present tense clause with present suffix: (7 (magnū)), (64).
A in present tense clause with present suffix: (4), (8), (21), (40).
Past tense clauses:

S in past tense clause with past verb suffix: (52).
O in past tense clause with past suffix: (36), (41 (kard-a)), (49 (kard-a,
hormižā-y)). It is also observed in (39) that the past suffix -ānē ‘1sg . pst ’ indexes
an indirect object, not O.
A in past tense clause with clitic pronoun: (20), (41 (=mān, =m)), (43).
4.4. Other observations

There are a other word classes not discussed here, including the class of absolute
prepositions (such as čanī ‘with’, pora ‘on, to’, pana ‘to’). Further areas of mor-
phosyntax that deserve more study include the use of the preposition pay ‘to’ and
the directional particle =a; the use of the additive particle =īč (variant: =īš) in dis-
course; and relativization, including the irregular use of ka as a complementizer.
These and many other topics remain to be described elsewhere.
5. Sample text in Hawrāmī (Pāwa)
This text is a true story, written in August 2011 by Jahangir Mahmoudveysi. He

is from Pāwa and has authored other short stories and poetry in Hawrāmī. He is
fifty-six years old. He also read this story aloud as an oral version, which was
recorded and transcribed by Parvin Mahmoudveysi in the summer of 2017.24
23
See MacKenzie (1966: 51).
24
For helpful comments on several aspects of our translation and morphemic glossing
of the text, we would like to thank the participants of the Hawrāmī course taught by
Parvin Mahmoudveysi at the 2017 Kurdish Language Autumn School at the University
of Bamberg, Germany.
Grandmother’s Story, by Jahangir Mahmoudveysi252627

(1) dāstāna=w dāđā-y
story=ez _2 grandmother-obl .sg .f
Grandmother’s story.25
(2) dāđā: xuđā n-ār-o=t
grandmother: God neg _2-bring.prs -3 sg .prs =2 sg
ay z(i)msān=ī čapaɫ=u bē kaɫk
O winter=ez _1 dirty=and without benefit
Grandmother: “May God not bring you (here) (i. e., you are not welcome),
O dirty and useless winter!”26
(3) dāđā čēš m-āč-ī pā
grandmother what ind -say.prs -2 sg .prs with.dist .dem .adj
giz(ī)=ya=w das-ī=t=a ā
broom=dem .ptcl . m =ez _2 hand-obl .sg . m =2 sg =dem .ptcl . m dist .dem .adj
warw-ē p(ay)čī ma-pižgn-ī
snow-obl .sg .f / dem . ptcl . f why ind -scatter.prs -2 sg .prs
(Āzād speaks) “Grandmother, What are you saying? Why are you scattering
that snow with that broom in (literally, of) your hand (literally, that hand of
yours)?”27
(4) řoɫa āzād gīyān wāz=im čana b-ār-a
dear_child Āzād life/soul open=1 sg in sbjv -bring.prs -2 sg .imp
wē=m ma-zān-ū čēš ma-kar-ū
refl =1 sg ind -know.prs -1 sg .prs what ind -do.prs -1 sg .prs
(Grandmother speaks) “My dear beloved child Āzād, leave me in peace.
I myself know what I’m doing.
25
The form of the feminine oblique -y is an allomorph of -āy, as found in Pāwa Hawrāmī;
Nawsūd is -e.
26
The word xuđā follows Pāwa pronunciation. The speaker originally pronounced it as
xwā here and elsewhere.
27
The gloss as ‘obl .sg . f /dem .ptcl . f ’ is provisional. Both endings would be expected in
this context, but only one form appears. The oblique is a suffix and the demonstrative
particle is a clitic, but otherwise these two endings are phonologically identical. A sim-
ilar issue is found below in (7).
(5) amin qīnī=m=ana z(i)msān-ī

1 sg hate=1 sg =cop .prs .3 sg .f winter-obl .sg . m
z(i)msān pēsa tīr-ya nā gīyān=m=ana
winter like arrow-indf _2 is.na life/soul=1 sg =postp #_2
I hate winter. Winter is like an arrow in my soul.
(6) ǰa řanga=ū čarm-ē waš=im
from color=ez _2 white-obl .sg .f pleasure=1 sg
ni-m-ay
neg _1-ind -come.prs .3 sg
I don’t like the white color (literally, color of white).28
(7) řanga=ū čarm-ē ma-wīn-ū ma-gn-ū
color=ez _2 white-obl .sg .f ind -see.prs -1 sg .prs ind -fall.prs -1 sg .prs
yāđ=ū ā hatīm-ya
memory=ez _2 dist .dem .adj irresponsible_guy-obl .sg . m /dem .ptcl . m
(When) I see the white color, I remember that irresponsible guy.
(8) ba hāsān-ī lūwā=u na-tāwā=š
with ease-obl .sg . m go.pst _1=and neg _2-can.pst =3 sg
hīč=īš kar-o
nothing=add _2 do.prs -3 sg .prs
He went (around) with ease, and he could not do anything either.
(9) pēsa ǰarda-k-ān bar-šī=u
like thief-def . m -pl out-go.pst _2=and
ā k(i)nāč=a xās-ē=m=a
dist .dem .adj daughter=compd good-dem . ptcl . f =1 sg =dem .ptcl . m
ǰa das dā
from hand give.pst
Like the thieves, he fled, and I lost that good daughter of mine.29
28
The word čarma ‘white (color)’ is a feminine noun.
29
The form of the demonstrative particle as =a (normally masculine) can agree with a
head noun of either gender when other morphemes occur between the head noun and
the particle (see MacKenzie 1966: 16).
(10) ay řoɫa gīyān āzīz-akē=m duwagīyān-akē

O dear_child life/soul dear_one-def .f =1 sg pregnant-def .f
kē haq=it bi-sān-o ay xuđā
who right=2 sg sbjv -take.prs -3 sg .prs O God
īna=t=a čanī qabūɫ kard
prox =2 sg =dem .ptcl . m how accept do.pst
O dear beloved child, my dear one, you were pregnant (literally, the
pregnant one), who may take your right? O God! How did you accept this?
(11) pī hatīm-ān=a pēsa=t=a
to.prox .dem .adj orphan-pl =dem .ptcl . m such=2 sg =dem .ptcl . m
p(ay)čī kard
why do.pst
Why did you do such (as this) to these orphans?”
(12) dāđā gīyān p(ay)čī dēwyāy=nī
grandmother life/soul why gone_mad=cop .prs .2 sg
(Āzād speaks) “Dear Grandmother, why have you gone mad (i. e., are
distraught)?”
(13) āzāđ gīyān b-āz-a=m bā hāwār
Āzād life/soul sbjv -let.prs -2 sg .imp =1 sg let_it_be shout_for_help
kar-ūna bā ī warw-ān=a
do.prs -1 sg .prs .sbjv let_it_be prox .dem .adj snow-pl =dem .ptcl . m
šēwn-ūna
shovel.cause .prs -1 sg .prs .sbjv
(Grandmother speaks) “Dear Azad, let me, let me shout, let me shovel
(away) this snow (literally, these snows)!”
(14) dāđā gīyān wasē=n lič-akē=t̪
grandmother life/soul enough=cop .prs .3 sg . m lip-def .pl =2 sg
zard-ē bīyē=nē=u řanga=t
yellow-pl .agr cop .ptcp . pl = cop . prs .3 pl =and color=2 sg
wāřyā=yna
change.ptcp = cop . prs .3 sg .f
(Āzād speaks) “Dear Grandmother, it is enough, your lips have become
pale (literally, yellow) and your color (i. e., of your face) has changed.”30
30
This realization of the second person singular clitic pronoun as = t̪ is a variant of =t,
allomorph of =it (see also 20, 37). The different realization as =it in the subsequent
instance could be due to the influence of other languages.
(15) řoɫa gīyān amin qīna=m=ana řang(a)=ū

dear_child life/soul 1 sg hate=1 sg =cop .prs .3 sg .f color=ez _2
čarm-ē waxt-ēwa řang(a)=ū čarm-ē
white-obl .sg .f time-indf _1 color=ez _2 white-obl .sg .f
ma-wīn-ū ma-dēwy-aw
ind -see.prs -1 sg .prs ind -go_mad.prs -1 sg .prs
(Grandmother speaks) “Dear beloved child, I hate white color. When I see
white color, I go mad.
(16) ā šaw=a g(i)rd=ū dagā-y čarma
dist .dem .adj night=dem .ptcl . m all=ez _2 village-obl .sg .m white
bīya=bē kip=u
cop .ptcp = cop . ipf .3 sg closed_up=and
bē dang sirtēwa n-ē bar
without sound/voice whisper.indf .f neg _2-come.ipf .3 sg out
That night, all of the village had become white; completely silent and
without sound, (even) a whisper was not coming out (of it).
(17) warwa pēsa lēf-ī kēšyā=bē
snow like blanket-obl .sg . m pulled_over.ptcp =cop .ipf .3 sg
sar=ū dagā-y=ra
head_2=ez _2 village-obl .sg .m =postp #_3
Snow like a blanket was pulled over the village.31
(18) kas zāt=iš nawē
someone/no-one daring_courage=3 sg neg .cop .pst .3 sg
ǰa yāna=na b-ay bar
from house=postp #_2 sbjv -come.prs .3 sg out
No-one had (the) courage to come out of the house.
31
The participle in the Past Perfect in Pāwa (here: kēšyā=bē) may be marked in certain
instances for gender (masculine/unmarked or feminine) or for plural number. In most
of the text occurrences here and elsewhere, the marking is assumed to be assimilated
to the final -ā vowel or it may be identical as -a to an unmarked form, and it is thus left
unglossed.
(19) ā basazwān=ē ba lama=y

dist .dem .adj poor_creature=dem .ptcl .f with belly=ez _1
pař-a pā k(i)řēw=ana čanī=w
full-f . agr at.dist .dem .adj snowstorm=postp #_2 with=ez _2
šū-aka=yš mašyāya bi-lūwā-ya
husband-def . m =3 sg should.pst sbjv -go.pst _1-na
That poor one (who was) pregnant, (that) she should go (out) in that
snowstorm with her husband!”32
(20) dāđā das hor-gēr-a warwa-kē=t̪
grandmother hand up-keep/lift.prs -2 sg .imp snow-def .f =2 sg
tāwnā=wa ītr wasē=n
melt_cause.pst =postp #_1 anymore enough=cop .prs .3 sg . m
(Āzād speaks) “O Grandmother, let it be, you melted the snow. It is enough
now.”
(21) gīyān=a to čēš ma-zān-ī
life/soul=dem .ptcl . m 2 sg what ind -know.prs -2 sg .prs
(Grandmother speaks) “O beloved one, what do you know?
(22) dēq-ya nā sar=ū diɫ-ī=m=ana
sorrow-indf _2 is.na head_2=ez _2 heart-obl .sg . m =1 sg =postp #_2
enna=w twan=a gawra-kē=w arab-ak-ān mawo
so_much=ez _2 stone=compd big-def .f =ez _2 Arab-def . m -pl be.3 sg
There is a feeling of sorrow over my heart, it is (as big as) the big stone of
the Arabs.33
32
The idiom basazwān, literally ‘bound/closed tongue’, is normally used to refer to an
animal, which does not speak, but it can also refer to a person who is quiet and without
fault.
33
The ‘big stone of the Arabs’ refers to the stone in a ritual practiced by pilgrims to
Mecca. In the ritual, pilgrims throw pebbles at the stone, which is believed to represent
the devil.
(23) ā bābaħīz=a ghazab-bag-ī

dist .dem .adj lecher/[expletive]=dem .ptcl . m rage-Bag-obl .sg . m
m-āč-ūna zor=iš pay šū-waka=yš
ind -say.prs -1 sg .prs .sbjv pressure=3 sg to husband-def . m =3 sg
āwrda=bē
bring.ptcp =cop .ipf .3 sg
That lecher (i. e., [expletive]), I mean Honorless Bag, had put (literally,
brought) pressure on her husband.34
(24) wāta=bē=š
say.ptcp =cop .ipf .3 sg =3 sg
mašyo t(i)fang-ēwa=m pay b-ār-īna
should.prs gun-indf _1=1 sg for sbjv -bring.prs -2 sg .prs .sbjv
He had said: “You should bring me a gun.
(25) wa(r)na sawāy pīyā-k-ān=im ma-kyān-ū
if_not tomorrow man-def . m -pl =1 sg ind -send.prs -1 sg .prs
gīyān=(i)t
life/soul=2 sg
If not, tomorrow I will send my men for your life (i. e., to take your life).”
(26) āđ=īč šon-a=š gim-a bo
dist . m _1=add _1 trace-f . agr =3 sg lost-f . agr sbjv .cop .prs .3 sg
zor=iš na-mařyā
power=3 sg neg _2-break.pst
mil=š=ara řā=w bar-šīya=yš girt=a
on=3 sg =postp #_3 way=ez _2 out-go.inf _2=3 sg take.pst =dir .ptcl
war
ahead
As for him, may (every) trace (literally, the trace) of him be lost, he had
no power, going out, he fled.
(27) bafrāw=iš dā war
Bafrāw=3 sg give.pst ahead
He had Bafrāw go ahead (literally, he put Bafrāw ahead).
34
The word ghazab, literally, ‘rage’, is used here with bag as a proper name to express the
referent’s lack of honor and fairness. A figure from earlier times, the bag of a village
had a role similar to that of a mayor. A bag was often unfair in dealings with the village
people.
(28) pā nīmašaw=a dilē=w ā

at.dist .dem .adj midnight=dem .ptcl . m in_middle=ez _2 dist .dem .adj
k(i)řēwa-y=na ǰa dagā=na
snowstorm-obl .sg . m =postp #_2 from village=postp #_2
bar-a-šī-ya
outside-f . agr -go.pst _2-3 sg .f
At (literally, at that) midnight in the middle of that snowstorm, she went
out of the village.
(29) am(i)n=īš kor-a bīyē =bēnē
1 sg =add _2 blind-f . agr cop .ptcp . f = cop . ipf .1 sg .f
hīč=im pay na-k(i)ry-ē
nothing=1 sg for neg _2-be_done.prs -3 sg .ipf
As for me (i. e., Grandmother), I had become blind, it was not possible for
me to do anything.
(30) zāt=īč=im nawē hīč
daring_courage=add _1=1 sg neg .cop .pst .3 sg nothing
b-āč-ūna ā
sbjv -say.prs -1 sg .prs .sbjv dist .dem .adj
gamāɫbāba=ya ažnās-ēnē
dog_father[expletive]=dem .ptcl . m know.prs -1 sg .ipf
I did not have courage to say anything, I knew that rotten guy (literally,
[expletive]).
(31) zoɫ=u ǰargsaxt=u bē-řaħim bē
bastard=and stubborn=and merciless cop .ipf .3 sg
He was a bastard and stubborn and merciless.
(32) ħama gēɫnā=š=o čēš=šān
Ħama report.pst =3 sg =postp #_1 what=3 pl
sara-āma=n
head_1-come.ptcp . m = cop . prs .3 sg . m
Ħama reported what had (literally, has) happened to them.35
(33) ay řoɫa gīyān řoɫa=y
O dear_child life/soul dear_child=ez _1
kizoɫ-a=m hatīm=a ǰwān=a-marg-akē
innocent_poor_child-f . agr =1 sg orphan=compd young=compd -dead-def .f
O dear beloved child, my poor, innocent dear child, dead-too-young orphan!
35
The Present Perfect is used in reported speech introduced with a verb in Past Indicative.
(34) zāwřo-ka=t či gunāħ-ēwa=š bē

child-def . m =2 sg what sin-indf _1=3 sg cop .ipf .3 sg
Your child, what sin did she have?
(35) hatā kay ǰazrawa=w to kēš-ūna
until when suffering=ez _2 2 sg pull/endure.pst -1 sg .prs .sbjv
How long should I endure your suffering?
(36) čwār zāwřo=ym ba hatīm-ī
four child=1 sg with orphan-obl .sg . m
čī řo-xrāwy=a=na way
in.prox .dem .adj day-bad=dem .ptcl . m =postp #_2 care_for
kard-ē
do.pst -3 pl .pst
I parented four children without their father in these bad days.
(37) kē tāwān=it̪ bi-sān-o
who penalty/compensation=2 sg sbjv -buy.prs -3 sg .prs
Who can compensate you (for this loss)?
(38) ay xuđā xās=tān pay na-kar-o
O God goodness=2 pl to neg _2-do.prs -3 sg .prs
O may God not have mercy on you!
(39) āzāđ gīyān ħamađ pana=š wāt-ānē
Āzād life/soul Ħamad to=3 sg say.pst -1 sg .pst
Dear Azad, Ħama told me:36
(40) kiřēwa n-āz-ē čam-ē kar-mē=wa
snowstorm neg _2-let.prs -3 sg .ipf eye-pl .agr do.prs -1 pl .prs =ptcl #
“The snowstorm was not allowing us to open (our) eyes.
(41) řā-kē=mān gim-a kard-a hatā lā=m
way-def .f =1 pl lost-f . agr do.pst -3 sg .pst .f until side=1 sg
kard=o dī=m bafrāw-a čarm-ē
do.pst =postp #_1 see.pst =1 sg Bafrāw-f . agr white-obl .sg .f
bīyē=na=wa
cop .ptcp . f = cop . prs .3 sg .f =ptcl #
We lost the way, as soon as I turned back, I saw (that) Bafrāw had (literally,
has) become pale (literally, white).
36
The Kurdish form of this name is Ħama. The speaker sometimes uses this name but
here he uses Ħamađ.
(42) ba ħāɫ dang=iš bar ē

with situation voice=3 sg out come.ipf .3 sg
She spoke very poorly (literally, with situation).
(43) časpinā=m wē=m=ara hatā garm=iš
hold_close.pst =1 sg refl =1 sg =postp #_3 until warm=3 sg
bo=wa
sbjv .cop .prs .3 sg =ptcl #
I held her close to myself in order to warm her.
(44) bē dasaɫāt hāwār kar-ēnē kas
without power shout_for_help do.prs -1 sg .ipf no_one
diyār=iš nawē
apparent=3 sg neg .cop .pst .3 sg
I was shouting for help, powerless, no-one was to be found.
(45) baɫām āđa hatā ē har sard-a
but dist .f _1 until come.ipf .3 sg gradually cold-f . agr
bē=wa
cop .ipf .3 sg =ptcl #
But gradually she became cold.
(46) w(i)nī=š hatā yāna=w xuđā-y
blood=3 sg until house=ez _2 God-obl .sg . m
miǰyā=bē
pour_out.ptcp =cop .ipf .3 sg
Her blood had poured out endlessly (literally, to the house of God).
(47) warwa pēsa xāmēwa čarm-ē
snow like cream.indf white-obl .sg .f
kēšyā=bē gīyān=(i)š=ara
pulled_over.ptcp =cop .ipf .3 sg body=3 sg =postp #_3
Snow, like white cream (literally, a white cream), was spread over her body.
(48) tanyā=u bē dasaɫāt dinyā miǰyā=bē
alone=and without power world pour_out.ptcp =cop .ipf .3 sg
mil=m=ara
on=1 sg =postp #_3
Alone and powerless, the world had fallen apart (literally, poured out) on
me.
(49) hatā tāwā=m bāwaš-a=m pora kard-a

until can.pst =1 sg embrace-f . agr =1 sg to do.pst -3 sg .pst .f
dimāyīn hanāsa-kē=š=im hor-mižā-y
last breath-def .pl =3 sg =1 sg up-breathe.pst -3 pl .pst
As much as I could, I embraced her, I breathed her last breaths.
(50) gīyān=im larz-ē na-zānā=m
body=1 sg shake.prs -3 sg .ipf neg _2-know.pst =1 sg
čanī āwrdē=na=m wārī
how bring.ptcp . f = cop . prs .3 sg . f =1 sg downwards
My body was shaking. I did not know how I (literally, have) brought her
down.
(51) ni-ma-zā-ū xatā=m nawyēna
neg _1-ind -know.prs -1 sg .prs guilt=1 sg neg .cop .prs .prf .3 sg .f
I do not know, I did not have any guilt.37
(52) tarsa=ū ā pīyākuš(ī)=ya bar-šī-yānē
fear=ez _2 dist .dem .adj murderer=dem .ptcl . m out-go.pst _2-1 sg .pst
I fled because of fear of that murderer.
(53) žan-kē=w zāwřo-ka=m ǰa das dā
wife-def .f =and child-def . m =1 sg from hand give.pst
šarmanda=w wē=m=u
shame/mortification=ez _2 refl =1 sg =and
zāwřo-ka-y=m=anān
child-def . m -obl .sg . m =1 sg =cop .prs .1 sg
I lost my wife and child. I am full of shame for myself and my child.”
(54) xwāħafīzī=š kard hatā āro
farewell=3 sg do.pst until today
na-đīya=n=(i)m lūwā hatā yāwā ērāq
neg _2-see.ptcp . m = cop . prs .3 sg . m =1 sg go.pst _1 until reach.pst Iraq
He said (literally, did) farewell, and (since then) until today, I have not seen
him; he went until he reached Iraq.
(55) řoɫa gīyān laš=ū bafrāw=ū am(i)n=īš
dear_child life/soul body=ez _2 Bafrāw=ez _2 1 sg =add _2
gēɫā=wa pay dagā-y
return.pst =postp #_1 to village-obl .sg .m
Dear beloved child, as for the body of my Bafrāw, it was returned
(literally, it came back) to the village.
37
The form zā ‘know.prs ’ here is a variant of the verb zān.
(56) dagā žiɫžyā=bē

village crowd.ptcp =cop .ipf .3 sg
The village was crowded.
(57) sar=ū bān=ū mizgī=wa kābrā=y dang-zil
head_2=ez _2 roof=ez _2 mosque=postp #_1 guy=ez _1 loud_voiced
qēřnā=š b-a-ydē dam=mān=o
bellow.pst =3 sg sbjv -come.prs -2 pl .imp mouth=1 pl =postp #_1
On the roof of the mosque, a loud-voiced guy bellowed: “Come help us!
(58) kafan=īš wīr=tān na-š-o
white_burial_shroud=add _2 memory=2 pl neg _2-go.prs _2-3 sg .prs
Don’t forget the white burial shroud, too!”
(59) duwa kuř-akē=m ī kināč-akē=m
two.compd son-def .pl =1 sg prox .dem .adj daughter-def .f =1 sg
qīža=w girawa=šān yāwā āsmān
wailing=and crying=3 pl reach.pst heaven
Both my sons, this (other) daughter of mine, their wailing and crying
reached heaven.
(60) qēřnā=m kafan=iš garak niyan
bellow.pst =1 sg white_burial_shroud=3 sg need neg .cop .3 sg
I bellowed: “She doesn’t need a burial shroud!”
(61) řasūɫ=īč gēɫnā=š=o ā řo
Řasūɫ=add _1 report.pst =3 sg =postp #_1 dist .dem .adj day
yāna=w bag-ī pař bīya=bē ǰa
house=ez _2 Bag-obl .sg . m full cop .ptcp =cop . ipf .3 sg from
mēmān-ī hatā wēraga wašī=šān karda=bē
guest-obl .sg . m until evening enjoyment=3 pl do.ptcp . m =cop .ipf .3 sg
Rasūl also reported (that), (on) that day, Bag’s house was full of guests, and
until evening, they were enjoying their time.
(62) tozqāɫ-ya=yč wē=š nařanǰnā=bē
tiny_amount-indf _2=add _1 refl =3 sg not_regret.ptcp =cop .ipf .3 sg
He (i. e., Bag) himself was not even a tiny bit sorry.
(63) n-āsa=bē=š wašy-aka=šān
neg _2-let.ptcp =cop .ipf .3 sg =3 sg enjoyment-def . m =3 pl
bi-šēwy-o
sbjv -destroy.prs -3 sg .prs
He did not let it ruin their enjoyable (i. e., pleasant) time.”
(64) dāđā wasē=n ā

grandmother enough=cop .prs .3 sg . m dist .dem .adj
bēɫēm=a pana ba warw-ē čarm-ē
shovel=dem .ptcl . m to to/by snow-pl .agr white-pl .agr
čī yān=ana bi-man-o
in.prox .dem .adj house=postp #_2 sbjv -remain.prs -3 sg .prs
(Āzād speaks) “Grandmother, it is enough. Give me the shovel. I should
not let any white snow remain in this house.”
[Speaker says: “Jahangir Mahmoudveysi, 20. 05. 1390” (i. e., 11 August
2011).]
Abbreviations for interlinear morphemic glosses38
Gloss Definition Gloss Definition

_1 Variant 1 (2, 3, etc.) ind Indicative
1 pl First person plural indf Indefinite
1 sg First person singular inf Infinitive
2 pl Second person plural ipf Imperfect
2 sg Second person singular m Masculine
3 pl Third person plural na Not analyzed
3 sg Third person singular neg Negation
add Additive particle obl Oblique
adj Adjective pl Plural
agr Agreement postp Postposition
compd Compound marker prf Perfect
cop Copula prox Proximal
def Definite prs Present
dem Demonstrative pst Past
dir Directional ptcl Particle
dist Distal ptcp Participle
ez Ezāfe refl Reflexive
f Feminine sbjv Subjunctive
imp Imperative sg Singular
38
Most of these abbreviations and symbols follow the Leipzig Glossing Rules (revised
version 2008, updated May 2015).
Symbols
= (Equals sign) indicates an enclitic boundary in examples and glosses.
- (Hyphen) separates the segmentable morphemes in examples and glosses.
. (Period) separates multiple metalanguage elements that correspond to a single
object language element.
_ (Underscore) separates multiple object language elements that correspond to
a single metalanguage element or to a unity of multiple metalanguage ele-
ments.
# (Number sign) refers to a form glossed by word class category.
References
Benedictsen, Åge Meyer & Arthur Christensen. 1921. Les dialectes d‘Awromān et de Pāwä,
textes recueillis par Åge Meyer Benedictsen, revus et publiés avec des notes et une
esquisse de grammaire par Arthur Christensen. København: Det Kgl. Danske Vidensk-
abernes Selskab. Historisk-filologiske Meddelelser. VI, 2.
Bistoon, Abbasi, Amer Gheitury & Kerstin Kazzazi. 2013. Semantics of Hawrami kinship
terms. The International Journal of Humanities of the Islamic Republic of Iran 20(4).
1–21.
Blau, Joyce. 1989. Gurânî et zâzâ. In Rüdiger Schmitt (ed.), Compendium Linguarum Iran-
icarum, 336–340. Wiesbaden: Reichert.
Christensen, Arthur. 1936. Some new Awromānī material prepared from the collections of
Åge Meyer Benedictsen. Bulletin of the School of Oriental Studies 8(2/3). 467–476.
Hawrāmānī, Muħamad Amīn. 2001. Mēzhūy Hawrāmān, (617–619). Tehran: Balx. http://
www.hawraman.org/sites/default/files/article/%D8%AA%D9%82%D8%B3%D9%8A
%D9%85%D8%A7%D8%AA%20%D8%B3%D9%8A%D8%A7%D8%B3%D9%8A
%20%D9%87%D9%88%D8%B1%D8 %A7%D9%85 %D8%A7%D9%86.pdf
(accessed 04 October 2017).
Holmberg, Anders & David Odden. 2008. The noun phrase in Hawrami. In Simin Karimi,
Vida Samiian & Donald Stilo (eds.), Aspects of Iranian linguistics, 129–151. Newcas-
tle: Cambridge Scholars Publishing.
http://www.ling.ohio-state.edu/~odden/The%20Noun%20Phrase%20in%20Hawrami.
pdf (accessed 04 October 2017).
International Phonetics Association. 2015/2005. Reproduction of the International Phonetic
Alphabet.
https://www.internationalphoneticassociation.org/content/full-ipa-chart (accessed 07
October 2017).
Leezenberg, Michiel. 1993. Gorani influence on Central Kurdish: Substratum or prestige
borrowing? ILLC Prepublication Series, X-93-03. Amsterdam: Institute for Logic, Lan-
guage & Computation (ILLC), University of Amsterdam.
Leipzig Glossing Rules. (Revised version of February 2008, updated May 2015). https://
www.eva.mpg.de/lingua/pdf/Glossing-Rules.pdf (accessed 04 October 2017).
MacKenzie, David Neil. 1961. The origins of Kurdish. Transactions of the Philological
Society 1961. 68–86.
(D.M. 1966=) MacKenzie, David Neil. 1966. The dialect of Awroman (Hawrāmān-ī Lu-
hōn). Grammatical sketch, texts, and vocabulary. København: Det Kongelige Danske
Videnskabernes Selskab.
MacKenzie, David Neil. 1987a. Avroman. Encyclopaedia Iranica 3.1, 110–111. New York:
http://www.iranicaonline.org/articles/avroman-hawraman-persian-owraman-a-moun
tainous-region-on-the-western-frontier-of-persian-kurdistan (accessed 03 January 2015).
MacKenzie, David Neil. 1987b. Avromani. Encyclopaedia Iranica 3.1, 111–112. New York:
http://www.iranicaonline.org/articles/avromani-the-dialect-of-avroman-q (accessed 05
December 2014).
MacKenzie, David Neil. 2002. Gurāni. Encyclopaedia Iranica 6.4, 401–403. New York:
http://www.iranicaonline.org/articles/gurani (accessed 14 March 2012).
Mahmoudveysi, Parvin. 2016. The meter and the literary language of Gūrānī poetry. Ham-
burg: Universität Hamburg dissertation. http://ediss.sub.uni-hamburg.de/volltexte/2016/
7935/pdf/Dissertation.pdf (accessed 22 June 2017).
(P.M. 2017=) Mahmoudveysi, Parvin. 2017. Unpublished notes for Hawrāmī (Pāwa).
(M&B 2004=) Mahmoudveysi, Parvin & Denise Bailey. 2004. Unpublished field notes:
Word list and grammar notes for Hawrāmī (Pāwa).
Mahmoudveysi, Parvin & Denise Bailey. 2013. The Gorani language of Zarda, a village of
West Iran. Wiesbaden: Reichert.
Mahmoudveysi, Parvin, Denise Bailey, Ludwig Paul & Geoffrey Haig. 2012. The Gorani
language of Gawraǰū, a village of West Iran. Wiesbaden: Reichert.
Mahmudweyssi, Parwin [=Mahmoudveysi, Parvin] & Geoffrey Haig. 2009. The typology
of modality in some West Iranian languages. In Christine Allison, Anke Joisten-Prusch-
ke & Antje Wendtland (eds.), From Daēnā to Dîn: Religion, Kultur und Sprache in der
iranischen Welt. Festschrift für Philip Kreyenbroek zum 60. Geburtstag, 41–52. Wies-
baden: Harrossowitz.
Mann, Oskar & Karl Hadank. 1930. Mundarten der Gûrân, besonders das Kändûläî, Au-
ramânî und Bâdschälânî, bearbeitet von Karl Hadank. Berlin: Verlag der Preussischen
Akademie der Wissenschaften in Kommission bei Walter de Gruyter & Co.
Naqšbandī, Šahrām. 1996. Nezām-e āvāyī guyeš-e Hawrāmī (guneye Pāveh) az dīdgāh-e
vāǰ-šenāsī-ye zāyešī va vāj-šenāsī-ye ǰozʔ-e mostaqel [The phonology of Hawrāmī
(Pāveh variety), a Generative and Autosegmental perspective]. Tehran: Allameh Tabat-
aba’l University, unpublished MA thesis.
Paul, Ludwig. 2007. Zur Lage der Gōrānī-Dialekte im Iran und ihrer Erforschung. In Ma-
ria Macuch, Mauro Maggi & Werner Sundermann (eds.), Iranian languages and texts
from Iran and Turan. Ronald E. Emmerick Memorial Volume, 285–296. Wiesbaden:
Harrassowitz.
Pāveh Press. 2013. Bištarin mizān-e bāsawādi dar satħ-e ostan, marbut be šahrestān-e Pāveh
[The highest rate of literacy in the province belongs to the county of Pāveh]. Online
article, 29 December, 2013. http://www.pavehpress.ir/ (accessed 10 August 2017).
Sadeqi, Amir. No date. Fasl-e avval: moqeʕiat va taqsimāt-e siyasi va moʕarefi-ye baxš-e
Horāmān [First capital: The situation and political division and introduction to part of
Horaman]. Website of Sāzmān-e Mirās-e Farhangi, sanāyeʕ-e dasti u gardešgari [Or-
ganization of Cultural Heritage, Handiworks, and Tourism].
http://www.hawraman.org/sites/default/files/article/%D8%AA%D9%82%D8%B3%D
9%8A%D9%85%D8%A7%D8%AA%20%D8%B3%D9%8A%D8%A7%D8%B3%D9
%8A%20%D9%87%D9%88%D8%B1%D8%A7%D9%85%D8%A7%D9%86.pdf
(accessed 10 October 2017).
Shahbazi, Shapur. No date. Iranian geography: Pâveh, the city of a thousand mausoleums.
The Circle of Ancient Iranian Studies (CAIS). http://www.cais-soas.com/CAIS/Geog
raphy/paveh.htm (accessed 28 June 2017).
Zolfaqari, Ardashir. 2010. The Avromani-speaking area in Iran. Iran and the Caucasus 14.
323–330.
4.6. Persian
Ludwig Paul1
1. General remarks
1.1. Current size and status

Persian enjoys the status of an official or state language in the three Iranophone
countries of Iran, Afghanistan, and Tajikistan, where it is called Fārsī, Darī, and
Tojiki respectively. It is spoken as a native language by the majority of the popula-
tion in Iran and Tajikistan, and by a large minority in Afghanistan (whose members
are also called ethnic “Tajiks”). In all three countries, it is widely used as a lan-
guage of communication, education and commerce also by non-native speakers.
In Uzbekistan, a vibrant minority of indigenous Tajik speakers still live in the
cities of Samarkand and Bukhara. Besides, sizeable expatriate or exile communi-
ties of Persian speakers live in adjacent countries, such as the Arab Emirates, Iraq,
Turkey, Pakistan, and in other countries such as Russia, Australia, Japan, Israel,
USA, Canada, and in various countries of western Europe. Most Iranians, Afghans
and Tajiks have migrated there since the second half of the 20th century.
Reliable statistics for native speakers of Persian are not available for any of
the three Iranophone countries. Conservative estimates would be around 50 % for
Iran, 30–35 % for Afghanistan, and 80 % for Tajikistan. Based on recent, partly
estimated population figures (Iran 76 million, Afghanistan 30 million, Tajikistan
5 million), and taking into account also approximately 5 million expatriates (1–1.5
million in Turkey, 1 million in Russia [mainly Tajiks], 0.5 million in Pakistan
[mainly Afghan Hazara in Quetta], 0.5–0.6 million in the USA, 0.4 million in
Western Europe, etc.), this would give approximately 57 million native speakers
of Persian.
In the three countries, however, many non-Persians are also fluent, or indeed
native in Persian (esp. in Iran, see §1.4 on the sociolinguistic situation). A more
realistic estimate of native Persian-speakers would probably be in the region of
75–80 million, with another 15–20 million that are fluent or near-native speakers
of one of the three official variants of Persian. The estimates of Persian speakers
living outside Iran are included in these figures but have their difficulties: statis-
tics usually count holders of Iranian/Afghan/Tajik passports, whose second- or
third-generation descendants may no longer be proficient in Persian. On the other
1
I would like to thank Ramin Shaghaghi, Pejman Firoozbakhsh and Maximilian Kinzler
for the inspiring discussions I had with them, and for the many helpful remarks on a
draft of this article.
570 Ludwig Paul
hand, naturalized immigrants no longer show up in the statistics but may still be
native speakers. Besides, the Iranian community living in Turkey consists largely
of Iranian (Azeri) Turks, some of whom would not qualify as native speakers of
Persian.
The official status of Persian has been laid down for the Islamic Republic of
Iran in the Constitution of 1979 (§15: “Fārsī is the … official language of Iran”)
and for Afghanistan in the Constitution of 2004 (§16: “Pashto and Darī shall be
the official languages of the State”). While the status of Fārsī as the official and
national language of Iran has not been seriously challenged in the recent history
of the country, the status of Darī has been more disputed in modern Afghanistan.
Pashto, the native language of the Afghan majority population (of 50–55 %), was
given political preference over Darī since the 1930s, and was called the “national
language” of Afghanistan in the Constitution of 1964 (§3). It lost this privilege
after the 1978 communist coup d’état, and has since been only one of the two
official languages alongside Darī. Darī has consequently consolidated its de facto
dominant position in administration and inter-ethnic communication, but the Pash-
to-Tajik ethnic and cultural antagonism continues to permeate Afghan politics
until today (see Nawid 2012).
The Tajik Constitution of 1994 specified Tajik as the state language and Russian
as the language of inter-ethnic communication, but in 2009 a law was passed in
parliament that confirmed Tajik as the sole official language, thus restricting the
use of Russian (Landau and Kellner-Heinkele 2012: 177–179). The politics of
linguistic Tajikization had been pursued systematically after independence in
1991, similar to “Uzbekization” in neighbouring Uzbekistan, etc. (Landau and
Kellner-Heinkele 2012: 179–182). The Tajiks’ attitude towards Russian seems to
be pragmatic nowadays, in that it remains an important language of education and
academia, and is likewise important for those Tajiks who go to Russia as guest
workers. The real challenge is for the Uzbek minority of about 15 % that is increas-
ingly being marginalized, and perceived as the domestic agents of Uzbekization in
Tajikistan (Landau and Kellner-Heinkele 2012: 184).
While the unity of Persian as one language in three Iranophone countries can
be asserted on linguistic grounds and would be upheld by most speakers in the
three countries (especially in Iran and Afghanistan), a tendency to develop a dis-
tinct linguistic identity may be postulated for Tajik. This is mainly based on the use
of the Cyrillic script (as against the Arabic script used for Fārsī and Darī), on the
strong lexical influence of Russian during the 20th century, and on the long-stand-
ing influence of Uzbek on the grammar of Tajik (especially on its northern dia-
lects). While a counter-tendency to bring Tajik again closer to Fārsī has also been
at work after the demise of the Soviet Union in the 1990s (see Landau and Kell-
ner-Heinkele 2012: 182–184), it remains uncertain where Tajik-speakers will look
for their linguistic affiliation and identity in the future.
Persian 571
1.2. Genetic affiliation

Persian is commonly grouped among the southern branch of the West Iranian lan-
guages, together with adjacent languages spoken in southwestern Iran like Lorī,
Lārī, Kumzārī etc. The linguistic features that distinguish the Southwest Iranian
languages from the Northwestern branch can be traced back to Middle Persian
(3rd–7th century C.E.) and Old Persian (6th–4th century B.C.E.), e. g., the develop-
ment of IE. *ǵ to Persian d in OP dan-, MP/NP dān- ‘know’, as against the north-
western (or better: non-southwestern) development to z in Avestan zan-, Parthian
zān-, Kurdish zān-. Both Old and Middle Persian had their home in the historical
province of Persis (Fārs) in southern Iran, from where the Achaemenid and Sasa-
nian dynasties originated.
The SW-/NW-distinction was introduced by Tedesco in 1921, on the basis
of features that distinguish Middle Persian as a typical SW, as against Parthian
as a NW-Iranian language (spoken in the 3rd century C.E. in northern Mesopo-
tamia); according to these features, Tedesco grouped also the modern W-Ira-
nian languages as either “NW” (Tālešī, Semnānī etc.) or “SW” (Persian, etc.).
The NW-/SW-distinction has been modified since the 1960s, and more recently
been replaced by scalar models that group the W-Iranian languages on a “scale
of Northwestern-ness”, in which Kurdish takes an intermediate position (see
Paul 1998; Gippert 2007/08), but the SW-/NW-distinction, as two poles on a scale,
can be maintained in this modified sense. The wanderings of the Iranian tribes
throughout history, and the mutual influences their languages and dialects exerted
on each other, were so intensive and complex that it is unlikely that any simple or
reductionist model can do justice to the historical development of the W-Iranian
languages.
From the outset (i. e., from Old Persian), Persian underwent stronger phonetic
and morphological changes than the other, more conservative Iranian languages.
For example, Middle Persian (between the 3rd and 7th centuries C.E.) had already
lost almost all morphological distinctions of case and gender that had character-
ized Old Persian as an ancient Indo-European language, while these distinctions
have still been preserved (albeit in reduced form) in many modern NW-Iranian
languages until today. Persian has also undergone strong lexical influence from
NW-Iranian, probably from Parthian or “Parthoid” dialects in which Iranian epics
were transmitted.
1.3. History of the speech community

The evolution of Persian (or more precisely of “New Persian”) after the Arabic-Is-
lamic conquest of Sasanian Iran (636–651 C.E.), and of its speech community,
is long and complex. It is commonly held that the conquest dealt a fatal blow to
Sasanian culture and its Middle Persian language. Subsequently, Arabic apparently
572 Ludwig Paul
took over all official functions in the Caliphate and Persian only started regaining
its old strength and literary, administrative etc. functions more than 200 years later,
from the late 9th century onwards, in the course of a cultural reawakening under the
Samanid dynasty that ruled NE-Iran.
A closer look, however, shows a different, and less simple, picture. Although
Middle Persian ceased to be used as the official language of an empire around
650 C.E., it continued to be used on coins and in local administration well into the
8th century; for the 8th century MP documents from Tabaristan, see recently, e. g.,
Gyselen (2012). In the countryside of the vast area, many local strongholds of
Zoroastrianism remained more or less intact for some time, where Middle Persian
continued to be used as a Zoroastrian “church language”. The 8th/9th centuries saw
a period of thriving Zoroastrian Middle Persian literature. This is likely to have
been due also to the interaction of parts of the Zoroastrian clergy with other reli-
gions in the relatively “open-minded” atmosphere of the Caliphate during this
period, when Muʿtazilism had a strong influence. The rise of Sunni orthodoxy in
the late 9th century may have ended this.
A Jewish Iranian religious minority had lived in Iran since Achaemenid times,
and wrote Persian already before the Muslim Persians did. The oldest Judaeo-Per-
sian texts, two letters written in Hebrew script, were found in Dandan Uiliq
(W-China) and are assumed to be from the 8th century C.E., which would make
them the oldest extant New Persian texts. There is a flow of further Judaeo-Persian
texts and documents from the late 10th century onwards. Since most of these texts
are from SW-Iran, close to the still thriving centres of Zoroastrianism, they share
certain dialectal features with Zoroastrian Middle Persian of S(W)-Iran (e. g., the
use of the ezafe to introduce relative clauses), as against the new form of (Muslim)
New Persian that was emerging in NE Iran at about the same time. The Zoroastri-
ans and Jews referred to their languages with two historical versions of the same
glottonym, MP Pārsīg / JP Pārsī, while the new Muslim form of New Persian
in the NE was originally called Darī by its speakers (later the two glottonyms
merged into Pārsī-ye Darī, or Pārsī). The use of these and other glottonyms (e. g.,
Pahlavī, which was the name of dwindling Parthian at the time) in early Arabic
sources contributes to our understanding of the linguistic situation of the time, and
the development of Persian (see the pioneering article by Lazard 1971, and a more
recent account in Paul 2013b).
1.3.1. First steps in Persian literature

During the 10th century C.E., “Dari” New Persian written in Arabic script was grad-
ually extended in its use in the Samanid Empire that ruled over NE-Iran, from lyric
court poetry into further literary and scientific domains. Many of the first prose
works written in Persian were translations from Arabic, e. g., the translation of the
Tārīx-e Ṭabarī (“History of Ṭabarī”), or of the Tafsīr-e Ṭabarī (“[Qurʾān] com-
Persian 573
mentary of Ṭabarī”, both translated probably in the 960s C.E.); for an overview of

the earliest NP prose texts, see Lazard (1963: 36–116). Although the genres, forms
and contents of NP Muslim literature were novel to Iranian culture, elements of
continuity from pre-Islamic Persian literature can be found in them.
The Iranian national epic Šāhnāme was composed by the poet Ferdousī (ca.
940–1019) probably on the basis of the “Abūmanṣūrī” prose version, which was
derived from the (lost) Middle Persian Xwadāy-nāmag. The genre of New Persian
wisdom/andarz literature, e. g. works like the Siyar al-Molūk (Siyāsat-nāme),
written (in 1092) by the Seljuq vizier Neẓām ol-Molk, goes also back to pre-Is-
lamic times, but the transmission from Middle to New Persian went partly through
Arabic, or was at least substantially influenced by the way Arab writers like
al-Jāhiẓ developed the adab genre in Arabic. Taken together with the preponder-
ance of translations from Arabic among the earliest New Persian prose works, this
shows how much NP literature owes to the classical Arabic tradition, how thor-
oughly it was influenced by it, yet how it managed to transport its own distinct,
Iranian character into the new Islamic period. NP lyric poetry adopted the Arabic
metric system, adapting it to the Persian syllable structure.
In the late 10th century C.E., Persian written in Arabic script was firmly estab-
lished as a language of literature, science and administration at the Samanid court
(NE Iran), and in the following (11th) century, Persian consolidated this status
as the main language of the Ghaznavid and Seljuq Empires. Through the inten-
sive mutual influence with Arabic language and culture since the 9th century,
Persian had acquired the character of a deeply-rooted Islamic language, capable
of expressing the most sophisticated contents of Islamic Civilisation. In science,
especially in theology and philosophy, Arabic continued to be dominant for some
time, but through epic works like the Šāhnāme, and the lyric of poets like Rūdakī
(10th century), Farrokhī Sīstānī or Manūčehrī Dāmγānī (both 11th century), Persian
had won a literary prestige that would pave its way as the dominant language,
the upholder and disseminator of Islamic Civilisation, of the Eastern Caliphate.
Through the military expansion of dynasties like the Ghaznavids, Seljuqs, later
Ghurids and many more to come, Persian saw a massive extension of its geo-
graphical spread, into large areas of Central Asia, India, and Asia Minor, in the
following centuries.
1.3.2. Persian as a lingua franca

Through the 11th–19th centuries, Persian was used as a language of literature,
administration and diplomacy in these areas and beyond, as the lingua franca of
the “Eastern Caliphate”, an area for which Fragner (1999) coined the term “Per-
sophonie”, as an equivalent for the Persian term Qalamrow-e zabān-e Fārsī (i. e.,
“realm of the Persian language”). The linguistic dominance of Persian ended only
in the modern period, when the colonial powers of Great Britain and Tsarist Russia
574 Ludwig Paul
advanced into these territories, and vernacular (especially Indian and Turkic) lan-
guages rose to literary and national languages, though not without being deeply
influenced by Persian.
The manner, intensity, and functions in which Persian was used over such a
long period, varied greatly from region to region and from time to time. It was
most prominent in India, where Persian had already gained a foothold as a court
and literary language under the Delhi Sultanate (1206–1555). It saw its heyday
under the Mughal dynasty (1526–1858), whose ruler Akbar the Great made
Persian the official language of the Empire in 1582. Through the 17th–19th centu-
ries, India is likely to have hosted far more (literate) readers of Persian than Iran
itself (Cole 2002: 18). Persian was extensively studied and used also by Hindus
who aspired to a career in state administration, and who produced excellent phil-
ological works on Persian, e. g. dictionaries (Alam 1998: 328–330). India was at
that time actually not only hosting a Persophone diaspora, it was a centre of Per-
sophony, on equal terms with Iran proper. Even under British rule, India retained
its exceptional role for publications in Persian well into the 20th century; e. g., out
of twenty-four known Šāhnāme lithographs from between 1846 and 1900, twen-
ty-one are from India, and only three from Iran (Marzolph 2011: 70–71).
In the Ottoman Empire, Persian was highly esteemed as a language of learn-
ing, culture and poetry, and was part of the didactical “canon” of languages to be
taught to Islamic students in madrasas, besides Arabic and Ottoman Turkic, up
to the early 20th century (Kreiser 1993). That Persian was used as a language of
trade and inter-ethnic communication also in regions north of the Ottoman Empire,
on the Black Sea coast east of the Crimea, is shown by the Codex Cumanicus, a
manuscript from about the 13th century that was probably composed by Christian
missionaries and contains a Low Latin-Persian-Cuman Turkish dictionary (Mac-
Kenzie 1992).
In China, Persian never had the same prestige, or number of speakers or users,
as it did in India or the Ottoman Empire, but in the administration of the (Mongol)
Yuan Empire (1271–1368), it was the third official language (after Mongolian and
Chinese), and remained an important language of study for the Chinese Muslims
well into the Ming period (1368–1644) (Liu 2010: 89). The Mongols considered
it as the language of the peoples living beyond China’s western borders, it thus
became an important language of international diplomacy, see, e. g., already the
letter of the Mongol Khan Göyük to Pope Innocence IV in 1246, written in Latin
and Persian (Spuler 1985: 377). It was in 17th century China that the first Persian
grammar was written in Persian (Kauz 2009).
The use of Persian over such vast areas and long periods was not uniform;
although it declined in certain areas where vernacular languages rose to languages
of administration and literature (e. g., in the Ottoman Empire after the 16th, or in
India after the 18th century), it kept its high prestige as a language of classical
Islamic culture and literature, in all areas, up to the early 20th century and beyond.
Persian 575
In the late 19th century, the Albanian poet Naim Frashëri (1846–1900) still wrote
poetry in Persian, as did the Indian Muslim poet and philosopher Muhammad Iqbal
(1873–1938), who by so doing aimed at a greater international dissemination of
his ideas in the Muslim World than would have been possible through the medium
of Hindustani/Urdu (Rypka 1968: 371).
Although the expansion and development of Persian as a lingua franca was
always closely aligned to developments in Iran proper, it must be noted that it
was generally independent of any political, financial, or other institutional support
from Iran’s ruling dynasties. A concrete measure like the introduction of Persian
as a language of administration in the Mughal Empire (in 1582) was, of course, a
deliberate political decision, but it was not an ideological measure to promote a
language in the modern sense. Rather, it was a practical decision: Persian was con-
sidered the language available and most suitable for meeting the requirements of
administering a large empire, and its high prestige as a language of court, admin-
istration, literature, learning and Islamic culture further recommended it for this
role.
Interestingly, almost none of the many dynasties that used Persian as an official
language throughout the 11th–19th centuries, including the dynasties ruling Iran
proper, were of Iranian origin. Most (like the Mughals) were of Turkic extraction,
and assimilated to the Persian that they regarded as more suitable for adminis-
tration, or more prestigious, than their own vernacular language. Paradoxically,
while the rulers of dynasties outside Iran like the Mughals, often spoke Persian,
the Safawid rulers of Iran proper often talked Turkish at court. The Shia-Sunna
antagonism that emerged in the region through the rise to power of the Safawid
dynasty in 1500 and the following Shiization of Iran, had no effect on the cultural
and literary exchange between Iran and the Persianate world: Persian was already
too well-established to be affected adversely by this.
1.3.3. The modern period

The demise of the “Persian cultural Commonwealth” of the Eastern Islamic World
was brought about not only by regional vernaculars like Hindustani that became
established as literary languages from the 18th century onwards, but mainly by
the two colonial powers of Russia and Great Britain. Both empires extended their
territories considerably during the 19th century, at the cost of Persophone regions
of Central Asia and India, and introduced new forms of government and adminis-
tration and new means of communication there, replacing Persian as a language of
administration and communication by Russian, English, and Hindustani. Persian
was abolished as the official language of administration in India by the British
rulers in 1835 (Bonakdarian 2006: 26).
Iran proper did not remain unaffected by the advances of the two colonial
empires. In the early 19th century, Iran lost most of its Caucasian territories to
576 Ludwig Paul
Russia, and in 1857, it finally had to waive its claims to the province of Herat
that became integrated into the new Afghan “buffer” state between Russia and
British India. Some members of the Iranian ruling elite realized that in order to
survive, Iran was in dire need of reform and modernisation. In the cultural sphere,
certain Iranian literati started criticizing the bombastic and verbose “Indian” style
of poetry that had held sway at the Safawid and Mughal courts through the 16th-
18th centuries, and advocated the simpler “Khorāsānī” and “Iraqī” styles of clas-
sical authors. This movement was called bāzgašt-e adabī “literary return”. Its
most important protagonist, in poetry and prose, was the writer, poet, vizier, and
reformer Qāʾem Maqām Farāhānī (1779–1835; see Rypka 1968: 335–336). One
of the best-known samples of the new, simpler Qajar prose works composed in
the late 19th century are Šāh Nāṣeroddīn’s diaries on his three journeys to Europe
between 1873 and 1889.
In the 20th century, Iranian prose writers started including passages of collo-
quial Persian in written form for the first time, e. g. Moḥammad ʿAlī Ǧamālzāde
in his collection of short stories Yekī būd, yekī nabūd (1921). With the adoption
of European genres like short story or romantic novel, “realistic” writing became
the dominant paradigm in Iranian prose literature. The rapid changes in style and
grammar of 20th century written Persian, however, were due rather to a sudden
breakup of literary conservatisms, than to an actual change of the language struc-
ture; a closer look at the grammar of Persian will show that many innovations
and colloquialisms could be found in written Persian already well before the
19th century, and that the structure of Persian changed steadily over time rather
than rapidly in response to new genres.
There is no universally accepted periodization of the history of Persian.
National Iranian philology prefers a subdivision into styles (sabk) that are identi-
fied with certain regions where these styles developed, or flourished most prom-
inently. Although one can identify “classical” periods of these styles (Khorasani
9th–11th centuries, Iraqi 12th/13th centuries, Indian 16th–18th centuries, followed by
the Return [bāzgašt] movement), they are in principle independent of time, and
any poet could, and can, write poetry in any of these styles at any time (on this and
alternative subdivisions, Rypka 1968: 112–119). Western linguists have proposed
various subdivisions. The one advocated here follows Lazard (1963: 24, shared
also by Khanlari), into Early (8th–12th centuries), Standard (13th–19thcenturies), and
Modern High New Persian (19th century—). It should be noted that “classical” is
a literary term covering classical authors of various times (Ferdousī, Saʿdī, Ḥāfeẓ,
Jāmī etc.), and is, therefore, of limited linguistic value, since the language of these
authors is by no means uniform (see Paul 2002). In terms like: “Early New Persian
and Classical Persian”, the latter may, however, be used to denote the earliest stage
of Standard Persian, i.e. the language of the classical authors of the 13th–15th
centuries.
Persian 577

As the official or state language of Iran, Afghanistan and Tajikistan, Persian enjoys
a privileged position in the state institutions of these countries, and in Iran, it
receives further support from a national language academy (Farhangestān). In all
three countries, sizeable non-Persian ethnic minorities (in Afghanistan, a majority)
have been resident for a long time, among whom bilingualism with Persian as a
second language is more or less wide-spread. Persian naturally advances among
the non-Persian minorities through administration, the educational system, and
the mass media. At the same time, in all three countries, Persian is under pressure
from other languages, in Afghanistan (expectedly) from Pashto, in Iran from Azeri
Turkic, and in Tajikistan from Uzbek.
1.4.1. Bilingualism
The ethnic and linguistic communities’ command of their own language(s),
dialects, and of Persian, and thus the grades of bilingualism, vary greatly from
case to case. While the predominantly Sunni minorities of Iran, the Kurds and
Baloch, adhere to their ethnic languages quite firmly, reinforced by their distinct
religious identity, the large Shia linguistic minorities of northern Iran (Gilan and
Mazandaran), who number approximately 3–4 million, have already been largely
assimilated to Persian, rendering the future demise of the languages very likely
(Shahidi 2008: 295). The same is true of many other smaller language or dialect
communities of NE and Central Iran like Semnānī, Vafsī, etc. Among Shii Kurds
of SW Iran, e. g. in the city of Kermanshah, Kurdish is still strong but has been
more mixed with Persian, and with local idioms like Lakī or Kalhorī, than the
“Sunni” Kurdish of Sanandaj.
Idioms like Gīlakī, Māzandarānī or Semnānī that are clearly languages in their
own right linguistically, are not granted the status of a “language” in the Iranian
world-view. They do not possess a written literature, or literary tradition, com-
parable to that of Persian, are mainly used orally and in informal situations, and
are, therefore, called “dialects” (gūyeš). This does not do justice to Māzandarānī,
which can boast of a rich and relatively old written literature, going back at least to
the early 19th century, and possibly as far back as the 11th century, if Māzandarānī
is to be identified with “New Ṭabarī” (Borjian 2009). But the knowledge about
this remains restricted to specialists and folklorists. It also attests to the unique
status and extraordinary prestige of Persian, which is perceived as the national
and literary language par excellence, and in comparison to which no other lan-
guage is even granted the name of “language”. The Kurds and Baloch, whose
distinct (Sunni) ethnic-religious identities are accepted by Shii Iranians, are,
however, exempted from this practice and their idioms are usually referred to as
“languages”.
578 Ludwig Paul
The case of Azeri Turkic is exceptional and contradictory. Although the Azeri
Turks of Iran are Shiites and firmly grounded in the Iranian-Shii cultural and national
identity, and although they still seem to take for granted the “superiority” of Persian
over Turkic as a language of literature and culture, the position of Azeri Turkic is
very strong in the official Turkophone provinces (W- and E-Azerbaijan, Ardabil),
and Persian is less well-known and used there than in all other Shii regions of Iran.
Azeri Turkic even seems to be advancing geographically and demographically in
various provinces like Zanjan, Qazvin, Hamadan or Gilan, at the cost of minority
languages like Tālešī (most Tāleš, especially the majority who are Shii, seem now
to be Turkicized), Tātī and Gīlakī, but also of Persian. For the large Azeri commu-
nity of Tehran, a study says that although there is an ongoing shift from Azeri-Per-
sian bilingualism towards Persian, the vitality of the Azeri language is not in danger
(Bani-Shoraka 2005: 197–202). Dialects of Azeri Turkic are the undisputed “L1”
in all Turkophone provinces and beyond, while in Gilan and Mazandaran, but also
in Kermanshah and other regions of Iran, part of the local (ethnically non-Persian)
population would probably call Persian their first mother tongue.
From the “horizontal” (or geographical) bilingualism, a “vertical” one may be
distinguished, indicating the coexistence of various historical forms and styles like
classical literary Persian (of various [sub-]forms), modern literary high Persian,
and modern colloquial Persian. Each native speaker can, depending upon his edu-
cation, switch more or less freely between these styles, according to situation,
addressee, place, time, and other factors; mixed or intermediary forms, like high
Persian with elements of colloquial influence, are also possible (see Paul, forth-
coming). The mixture of styles and registers, especially the continuous normative
influence of Classical Persian on the modern written language, makes it difficult to
define a fixed or homogeneous standard of contemporary Persian. SNP may rather
be defined as a flexible system of rules that fall more or less within the limits of
a standard.
While the social status of many Iranians may be guessed from their accent, or
from the way and extent they use colloquial style, another, totally different system
of rules determines and permeates Iranian social interaction, and thus also linguis-
tic behaviour, namely the system of politeness called taʿārof. It determines the
way that any Iranian talks to or about a person of equal or superior social position,
requiring the use of certain forms, words or expressions which are different from
those he would use if talking to a child, or a person of (visibly) inferior social
position. Taʿārof is variously described as—neutrally—the “system of politeness
specific to Iran”, or—negatively—as the “most hypocritical exaggeration and dis-
tortion of a system of politeness”. It affects mostly the lexicon (e. g., using verbal
constructions like tašrīf āvardan “to come”, literally “to bring honouring”, or the
noun bande “slave”, for the 1 sg pronoun “I”), but also the grammar (3 pl , instead
of 3 sg , for talking about a “superior” person, e. g. īšān hastand “he is present”,
literally “he/they are present” (see Beeman 1986: 151–162 for an overview).
Persian 579
1.4.2. Education, administration

In the educational system of Iran, Persian is the only language of instruction, but
local minority languages seem to be tolerated in schools to a certain extent. In
administration, Persian is again the only official language, but other languages
may be used pragmatically, e. g., if a person at court does not speak Persian well
enough. The printing of books and journals in minority languages is not discour-
aged or prohibited as such, on linguistic grounds, and there are no limitations,
either, for the use of minority languages in public. Under the pressure of Persian as
a language of education and the media, an unforced assimilation of many minority
languages can be witnessed, often with the (parents’) motivation not to hamper
the children’s prospects for their later professional career through a “provincial”
accent.
1.4.3. National academies

Of the three Iranophone countries, only Iran has a fully-fledged national language
academy (Farhangestān), which is engaged in various philological (editorial, lex-
icographical etc.) activities, and in an ambitious terminology programme, coining
thousands of Persian terms in over sixty fields of knowledge every year, with the
aim of replacing scientific terms of foreign (English, Arabic) origin. For an over-
view of the Farhangestān’s history since 1935 and its main activities, see Jazay-
ery (2000).
In Afghanistan, a Pashto Language Academy (Paš ̣tō Ṭōləna) was founded in
1974, with the aim of promoting the “under-developed” national language. Since
Persian had always been the dominant language of administration and culture in
recent Afghan history, obviously no need had been felt to support Persian through
an institution of its own until today. In Tajikistan, there is a national Academy of
Science going back to the Soviet period, whose philological section is also active
in linguistic matters from time to time, but there is no institution comparable to
the Iranian Farhangestān. In its cooperation with Afghan and Tajik scholars, the
Iranian Farhangestān clearly tries to cover commitments that go beyond Iran and
Fārsī alone, and to take responsibility for “Persian” as a cover term for Fārsī, Darī,
and Tojiki.
2. Remarks on grammars and reference works
No comprehensive grammatical description of the Persian of Iran (Fārsī) exists

that takes its historical development into consideration, and complies with modern
linguistic standards. The historical grammar of Khānlarī 1986, parts of which
have been translated into English (1979), remains within the framework of tradi-
580 Ludwig Paul
tional philology. Lazard 1957 (English translation 1992) is the standard reference
grammar up to the present. It describes modern standard Persian, but takes into
account also colloquial registers, and important rules of Classical literary Persian.
In his 1989 sketch grammar, Lazard provides an updated, condensed version of his
views, shifting the emphasis in certain details.
All other grammars of Persian lag behind Lazard in comprehensiveness, pre-
cision, and method. Phillott (1919) remains a rich collection of linguistic mate-
rial, but lacks linguistic method. Jensen (1931) offers a remarkable coverage of
syntax, uncommon for its time. Windfuhr (1979) is still a valuable research report,
with a comprehensive bibliography. A reliable and detailed recent Persian ref-
erence work on Persian grammar is the dictionary, alphabetically arranged, by
Ṭabāṭabāyī (2016).
The first comprehensive grammar of Tajik in a “Western” language is
Perry 2005, published almost 50 years after Lazard’s very useful study of 1956.
The first systematic comparative grammar of Fārsī and Tajik was given by Wind-
fuhr and Perry (2009). Methodologically, this study includes many ad hoc exam-
ples from both idioms to illustrate the grammatical rules, rather than authentic
examples from actual linguistic corpora. Besides the titles mentioned, only works
on specific periods or variants of Persian, or on specific parts of its grammar, exist,
three important ones of these being Lazard’s seminal study on Early New Persian
(1963), Paul’s grammar of Early Judaeo-Persian (2013a), and Pisowicz’s compre-
hensive historical phonology of Persian (1985).
3. Phonology
3.1. The phonological system

The phonemic system of modern Standard New Persian (SNP) of Iran includes the
following system of 26 consonants (including two semi-vowels) (Table 1) and 6
vowels (Table 2) (the symbols in the tables represent those of standard transcrip-
tion, rather than the IPA).
Persian 581

labial labio- alveolar post- palatal velar uvular glottal
dental alveolar
plosive voiceless p t k q (ʾ)
voiced b d g
fricative voiceless f s š x h
voiced v z ž (γ)
affricates voiceless č
voiced j
liquid nasal m n (ŋ)
lateral l
trill r
semi- (glides) (w) y
vowels
Table 2: Vowel inventory

front back
closed ī ū
middle e o
a
open ā
Diphthongs: ey, ow, ay (short; ay only in Arabic loanwords); āy, ūy (long). If ow

is regarded as a biphoneme, a “limited” phoneme [w] must be postulated for SNP
(Pisowicz 1985: 24–27).
3.2. Remarks
The phonemic status of some of these sounds, particularly those in brackets, is
controversial. The glottal stop [ʾ], whose phonemicity is disregarded by some
scholars, is considered phonemic here, following Pisowicz (1985: 47–51). The
voiced fricative [γ] is understood as a positional allophone of uvular [q] (both
sounds constitute one phoneme, but there is a discussion about which is its main
representative, see Pisowicz 1985: 42–44). The glide w occurs only in limited pho-
netic contexts, like in the diphthong [ow] (Pisowicz 1985: 24–26). The nasal [n]
has an allophone [ŋ] before the velar or uvular stops [g, k, q] (Pisowicz 1985: 18).
It should be noted that the above system represents modern standard Persian
(SNP) in current Tehran “standard” pronounciation. Different styles, or regional
varieties of Persian outside Tehran, may affect the above system in details. For
582 Ludwig Paul
example in certain registers the glottal stop may be dropped in some phonetic
contexts, without, however, giving up its phonemicity (see below). On the other
hand, in careful and slow Tehrani pronounciation, and in certain regions of Iran
(e. g., Fārs) generally, the [γ] retains its phonemic character, being pronounced
differently from [q] in all positions.
3.3. Historical changes

The changes of the phonemic system in the history of Persian have been scarce
in consonantism, but more substantial in vocalism. The most important consonant
changes include:
a) The [w] of Early and Classical Persian changed to modern [v], which did not
exist in Classical Persian; In Tajik, [w] still exists today, besides [v].
b) In Early and Classical Persian, the distinction of q/γ was still phonemic, as it
is still today in many regions of Iran, and in all Afghanistan and Tajikistan.
c) In Early New Persian, [β] and [δ] existed as allophones of the voiced stops [b]
and [d] (Pisowicz 1985: 107–111).
d) In Early and Classical Persian, [xw] was a (single) phoneme, which was de-la-
bialized to [x] in later Persian (Pisowicz 1985: 121–123)
3.4. Vowel system

The Early New Persian (ENP) vowel system, where quantity was phonemic, devel-
oped into the modern system, in which the (historical) long vowels are still longer
in articulation, but in which the main load of the distinction is on vowel quality. In
Tajik, vowel quantity has been given up, and the development of the vowels differs
considerably from the Fārsī one. The following scheme gives an overview of the
main changes:
Tajik today i e a o ů u
ENP (10–12 cent.) i ī ē a ā ō ū u
Fārsī today e ī a å ū o
Figure 1: The historical development of the Early New Persian vowel system
This again describes only the two main vowel systems of Tehrani SNP and Literary
Tajik. There are regional variants with divergent developments. In Afghanistan,
and in most variants of Khorasan Fārsī, ENP [ē] is preserved as such (instead of
SNP [ī]); in certain N-Tajik dialects, [ū/u] has developed into front closed rounded
[ü] (instead of u), and [ō] into a stable, short [u] (Lazard 1956: 128).
Persian 583
3.5. Other phenomena

Besides the phonemic features described so far, there are many processes of co-ar-
ticulation and allophonic variation in the pronounciation of modern colloquial
Persian, that do not (or only marginally) touch upon the phonemic system of SNP.
A selection of these includes:
a) SNP ā before nasal (m, n) > CollP ū (e. g., -ān > -ūn, Ṣādeqī 1984).
b) The short vowels [a, e, o] are unstable and may undergo changes in unstressed
position, e. g. kelīd ‘key’ > kilīd.
c) The glottal stop [ʾ] and the ʿayin in Arabic loanwords, if followed by a con-
sonant, are realized as a lengthening of a preceding vowel, e. g. maʿlūm >
maalūm ‘known’.
d) [q/γ] is fricativized before a voiceless fricative, e. g. raqṣīdan [raxsīdan] ‘to
dance’, exceptionally also before [t], e. g. in waqt [vaxt] ‘time’ or eγtešāš
[extešāš] ‘riot’.
e) [k] and [g] are palatalized before the vowels [a, e, i] e. g. kār kjardam ‘I
worked’.
3.6. Influence of Arabic

The massive lexical impact of Arabic on Persian during the first centuries of Islam
notwithstanding, the influence of the Arabic sound system on the Persian one
has remained limited, the only phonemes clearly borrowed into Persian being [q]
and [ʾ] (on the borrowing of [q], see Ṣādeqī 2006–07). Although pronounciations
of Arabic emphatic and pharyngeal sounds like [ḍ] or [ḥ] may have existed in
“learned” registers of Persian at certain times, they did not exert a lasting influence
on the Persian sound system.
3.7. Geminates
In New Persian, geminate consonants are phonemic, but most distinctions are
imported from Arabic through the many Arabic loanwords with geminate conso-
nants, e. g. NP māde ‘female’: mādde (< Ar.) ‘matter; material’; or with a geminate
in the Persian word, see NP kore (< Ar.) ‘globe’: korre ‘foal’.
3.8. Syllable structure, consonant clusters

The following six types of syllables occur in modern Standard Persian: (C)V, (C)
VC, (C)VCC, with V being a short or long vowel. If one considers (as some schol-
ars do) the syllable types with initial vowel as starting with [ʾ], the number of types
would decrease to three (CV, CVC, CVCC). Following Pisowicz (1985: 50–51),
584 Ludwig Paul
in initial position, the glottal stop may be understood not as a distinctive, but only
as a delimitative feature.
NP syllable structure does not allow initial consonant clusters. Zoroastrian
Middle Persian had initial clusters (ZMP stārag ‘star’, fradāg ‘tomorrow’), but in
Manichaean MP, a prothetic [i-] preceded clusters with s-/š- (MMP istārag ‘star’;
but ‘tomorrow’, MMP fradāg). Early New Persian exhibits variation between pro-
thesis and epenthesis to break the cluster (e. g., ENP sitāra/istāra ‘star’, Lazard
1963: §105; EJP istāra ‘star’, Paul 2013: §47.d). Clusters with consonants other
than s-/š- mostly took an epenthetic vowel (EJP firdā ‘tomorrow’, Paul 2013a: §40).
In later Persian, epenthesis prevailed in all clusters (NP setāre, fardā). In modern
NP, loanwords with s-/š-clusters again receive a prothetic vowel (e. g., estekān
‘glass’ < Russ. stakan ‘glass’, eslāyd ‘slide’), while those without s-/š- show epen-
thesis (kelās ‘class’, terāmvāy ‘tramway’); see also Lenepveu-Hotz 2011.
3.9. Hiatus
If a word ending in vowel takes an ending or clitic that also starts with a vowel,
a hiatus is normally avoided through a glide, mostly -y-, e. g. xāne-y-e man “my
house” (-e ezafe), pā-y-am “my foot” (-am possessive suffix), more seldom -v-,
e. g. karre v-o nān “butter and bread” (o “and”). The clitic copula, however, allows
either hiatus or -y-: īnjā-(y-)am “I am here” (the form with -y- being rather collo-
quial).
A hiatus after -a or -ā may alternatively be avoided by contraction, e. g. pā-m
“my foot”, īnjā-m “I am here”. After other vowels, only the possessive suffix
allows contraction, hence the distinction between pahlū-m “beside me” vs. pahlū-
y-am, meaning either “beside me” or “I am beside”.
3.10. Stress
Stress generally falls on the final syllable of each stem, e. g. ketā́ b ‘book’, ketābī́
‘bookish’, but ráft-am ‘I went’. In the last word, -am is a verbal suffix that remains
unstressed. The extent to which grammatical suffixes and clitics attract stress varies
considerably (see Kahnemuyipour 2003 for a comprehensive theoretical analysis).
In general, clearly inflectional items do not bear stress, for example person and
number agreement suffixes on verbs, or clitics such as the indefiniteness suffix -ī,
ezafe, clitic pronouns, postposition =rā, the short copula, etc. Derivational affixes,
as -ī in ketābī ‘bookish’, on the other hand, are stress-bearing items. The nominal
plural suffixes -ān and –hā, the comparative suffix -tar and the superlative suffix
-tarīn are also stress-bearing items (e. g., ketābhā́ ‘books’, bozorgtár ‘bigger’).
The verbal prefixes mī- (present indicative, durative), be- and na-/ne- (negation)
are also always stressed, e. g. mī́ -raft-am ‘I used to go’, ná-raft-am ‘I didn’t go’
Persian 585
(Pisowicz 1985: 55–56). If negation and mī- occur together, the former takes the
stress: né-mī-raft-am ‘I did not go’.
Stress can be distinctive, see ketābī́ ‘bookish’: ketā́ b-ī ‘a book’ (with the indef-
initeness suffix -ī, §4.5.); see also the distinction between simple past and present
perfect (§6.8.).
4.1. Nouns
New Persian nouns distinguish number and definiteness, but there is no morpho-
logical case system, nor a distinction of animacy or gender; in Arabic loanwords,
however, the Arabic gender distinction (m /f ) has been imported into New Persian
to some extent. The NP bare noun is generally neutral with respect to definiteness
and number:
ketāb ‘book, a book, the book, books’
4.2. Distinction between nouns and adjectives

The distinction between nouns and adjectives is not always clear-cut in NP, e. g.
many adjectives can be substantivized (bozorg ‘big’ > bozorg ‘the big (one)’, pl.
bozorgān ‘the big (ones), the elders’); certain nouns, in turn, may be adjectivized
ad hoc through an adverb of measure (xeilī ‘very’), or the comparative suffix -tar
(see the noun mard “man” in: xeilī mard ast ‘he is very man(ly)’; mard-tar ast ‘he
is more manly, more (of a) man’).
4.3. Number
A NP simple noun may be understood generically, and be used instead of an
English plural:
(1) īnjā sīb ziyād ast
here apple.sg much cop .3 sg
‘Here there are many apples’
(2) man ketāb mī-xaram
I book.sg dur -buy.prs .1 sg
‘I buy books’
(3) az peste xoš-am mī-y-āyad
from pistachio.sg good-my dur - glide -come.prs .3 sg
‘I like pistachios’
586 Ludwig Paul
The simple form of the noun is also used for plural nominal predicates:
(4) ānhā kārgar-and
they worker. sg - cop .3 pl
‘They are workers’
In most other cases, the plural is marked by one of the stressed suffixes -ān or -hā,
by an imported Arabic suffix (-āt, -īn, etc., §4.4.), or by an Arabic ‘broken’ plural
form. The distribution and functions of these distinct plural formatives may be
summarized in the following way.
The stressed plural suffix -ān goes back to a MP plural oblique ending -ān. In
Classical and modern Standard Persian, it is used for animate human nouns (e. g.,
zanān ‘women’), and for a small number of other nouns (e. g., deraxtān ‘trees’,
dastān ‘hands’). Stressed -hā goes back to the MP adverbial ending -īhā (the EJP
plural suffix is still -ihā in most texts) and denotes the plural of inanimate, and of
animate non-human nouns (e. g., saghā ‘dogs’).
In modern colloquial Persian, -hā has almost completely replaced -ān, also for
human nouns, e. g. mardhā ‘men’ (-hā with human nouns appears in prose litera-
ture already in the 16th century, see Paul 2002: 31). Certain nouns have developed
different meanings in SNP, depending on the use of -ān and -hā, e. g. sarān ‘heads,
leaders’ vs. sarhā ‘heads (i. e., the body part)’. In some cases, the use of -ān and
-hā is idiomatic, e. g. xānomhā va āqāyān ‘ladies and gentlemen’ (*āqāhā is not
possible).
Unlike English, Persian plural forms can be used with mass nouns and infinitives,
to denote large quantities or repeated actions:
(5) īn pūl-hā
dem money-pl
‘these lots of money’
(6) īn raftan-hā
dem going-pl
‘this going and going’
In certain contexts, -hā may make a noun phrase definite:
(7) čahār (tā) doxtar
four num daughter (for numerals, see §4.6.)
‘four daughters’ (ordinal numbers occur regularly with the singular, §4.7.)
(8) čahār tā doxtar-hā
four num daughter-pl
‘the four daughters’
Persian 587
The SNP verb obligatorily agrees in number with a plural subject of a sentence
only if this denotes an animate being. With inanimate subjects, the verb often
remains in the singular:
(9) čerāγ-hā xāmūš šod
lamp-pl extinguished become.pst .3 sg
‘The lamps went out’ (Lazard 1992: §170, p. 179)
4.4. Arabic plural forms in NP

Together with the many Arabic loanwords, the following Arabic plural forms were
also introduced to Persian:
i. -īn (< Ar. gen .- acc . pl . m ): in SNP, after a limited number of Ar. nouns denot-
ing mostly masculine humans, e. g. moʿallemīn ‘(male) teachers’, but also
humans of both sexes, e. g. mosāferīn ‘travellers’.
ii. -ūn (< Ar. nom . pl . m ): after a very small number of Ar. nouns ending in -ī, e. g.
rūḥānīyūn ‘clergymen’ (neither -īn nor -ūn have preserved the original Ar.
case function).
iii. -āt (< Ar. f . pl ): regularly after Ar. (f ) nouns in -e, e. g. kalamāt ‘words’, moqa-
ddamāt ‘preliminaries’ (sg. kalame, moqaddame), but also after certain other
Ar. nouns, e. g. entexābāt ‘elections’, touẓīḥāt ‘explanations’ (sg. entexāb,
touẓīḥ).
iv. ‘broken’ Ar. plural forms, e. g. afkār ‘thoughts’ (sg. fekr).
The use of the plural ending -āt, and of the broken plurals, has been extended
from Arabic loanwords to originally Persian words, and become productive to
some extent in SNP, e. g. in: deh-āt ‘villages’, kārxāne-jāt ‘factories’, sabzī-jāt
‘vegetables’; the additional -j- goes back to a MP g that developed into j in MP
loanwords in Arabic. The broken plural is found in (originally Persian) ethnonyms
like Akrād ‘Kurds’ (sg. Kord), or in darāvīš ‘dervishes’ (sg. darvīš). Arabic plurals
sometimes receive an additional, “pleonastic” Persian or Arabic pl. ending, e. g.:
taẓāhorāt-hā “demonstrations”, javāher-āt “jewels” (taẓāhor-āt pl , javāher broken
pl )
In modern colloquial Persian, all Arabic plural forms have (like Persian -ān)
been largely replaced by -hā, e. g. mosāferhā ‘travellers’, touẓīḥhā ‘explanations’,
kārxānehā ‘factories’. In those registers where both plural options (Arabic and
Persian) are possible, this may cause a semantic distinction, with the Arabic plural
form often taking a more collective, abstract, or specialized meaning, e. g., fekrhā
‘thoughts’, but afkār-e ʿomūmī ‘public opinion’; or moqaddamehā ‘(printed) fore-
words’ against moqaddamāt ‘(abstract) preliminaries, first steps’ (from sg. fekr
‘thought’, moqaddame ‘foreword’). For a comprehensive study of Arabic loan-
words in Persian that end in –a(t), see Perry (1991).
588 Ludwig Paul
The mixture of Arabic nouns with Persian endings, and vice versa, shows that
although most Arabic loanwords can still be easily distinguished from originally
Persian words in the Persian lexicon today, they have been firmly integrated into
the latter; together with them, the Arabic plural forms and rules have also found
their place, with adjustments and simplifications, within Persian nominal mor-
phology.
The Arabic Dual (gen - acc ) ending -eyn has been borrowed into Persian
together with a limited number of nouns denoting pairs, e. g. vāledeyn ‘parents’.
This ending has not become productive in Persian.
4.5. (In)definiteness
An unstressed -ī (< CP/ENP -ē) may be suffixed to a NP noun, giving it the
meaning of indefiniteness and singularity: ketāb-ī ‘a book, some book, any book;
one book’. Although both aspects of this morpheme are given separate names by
Persian grammar (yā-ye vaḥdat ‘ī of one-ness’ / yā-ye nakere ‘ī of indefiniteness’),
they cannot really be separated in modern Persian.
The singular component of the meaning of this suffix is evidently overwritten
when the -ī follows a plural noun to yield the meaning of indefinite quantity, e. g.:
zanhā-(y)ī ‘some/certain women’. The -ī of indefiniteness/unit can be replaced by
yek ‘one’ (zan-ī = yek zan ‘a woman, one woman’); in modern colloquial Persian,
both occur often together: yek mard-ī ‘some man, a man; one man’. Yek may also
occur together with the plural and -ī, e. g.: yek mardhā-(y)ī ‘some/certain men’.
A phonetically identical suffix -ī also indicates that a noun is indefinite, but it

specifies it by “announcing” a following attribute that defines it (e. g., a relative
clause, §8.2.; this -ī is called yā-ye ešārat ‘ī of reference’). It occurs regularly
before relative clauses in modern Persian, but less often in the Classical language.
It is probably of a different etymological origin from the indefinite -ī, with which
it seems increasingly (but not yet completely) to merge, to form one morpheme
(Lazard 1966, 1989: 275–276).
The marker of definite direct objects, =rā, and its occurrence together with
indefinite -ī, is discussed in §5.5.
Definite é
Neither Classical nor Standard Persian have a definite article. In modern colloquial
Persian, there is a stressed suffix -é that makes a noun definite, e. g.: pesar-é ‘the
boy (in question)’. It occurs only with singular nouns, and together with demon-
stratives (e. g., īn pesaré ‘this boy’, Lazard 1992: 73–74). It may be followed by
=rā. It does not occur together with indefinite -ī, but with specifying -ī, e. g. ān
pesaré’ī ke dīdam ‘that boy whom I saw’.
Persian 589
4.6. Adjectives
SNP adjectives, if they qualify a noun, follow it with the ezafe (but see §7.1.3.)
and are always in their basic (sg.) form, even when the noun is in the plural, e. g.:
(10) xānehā-ye bozorg
house.pl - ez big.sg
‘(the) big houses’
A comparison is expressed with the comparative suffix -tar and the preposition az
‘from’:
(11) az man bozorg-tar ast
from me big-comp cop . prs .3 sg
‘(he) is bigger than me’
4.7. Numerals and classifiers

In all variants of NP, cardinal numbers are used with the singular of nouns, e. g.
do mard ‘two men’, dah xāne ‘10 houses’. In all variants of NP, there is a class of
‘numerative’ nouns (or ‘classifiers’) that may be inserted between a number and
the noun that is counted, e. g. SNP yek dast lebās ‘one suit’ (lit., ‘one hand (of)
suit’), se nafar sarbāz ‘three soldiers’ (‘three person soldier’). In CP and high reg-
isters of SNP, there were (or still are) dozens of these nouns whose usage depends
on the semantic class of the counted noun, e. g. dast ‘hand’ for clothes, or raʾs
‘head’ (< Ar.) for cattle.
In modern spoken Fārsī and colloquial registers, the number of numerative
nouns has been reduced to three (nafar ‘person’ for persons; tā ‘piece’ and dāne
‘grain’ for things). Their use is not obligatory, but they are often used. In colloquial
registers, tā tends to be generalized also for persons (dah tā mard ‘10 men’), as
does dūne (< SNP dāne), but the latter only for one-ness (ye dūne mard ‘a man’;
see Lazard 1992: §75, Moinfar 1980, and Nešāṭ 1990).
4.8. Pronouns
The full paradigm of pronouns of Standard Persian are given in Table 3. Pronouns
do not inflect for case.
Table 3: Standard Persian pronoun paradigm

Personal pronouns
man ‘I’ mā ‘we’
tō ‘you’ šomā ‘you’
ū ‘he, she, it’ (ENP/CP also: vey) ānhā ‘they’
590 Ludwig Paul
When followed by the direct object marker =rā, man ‘I’ may lose the final nasal
(ma=rā ‘me’); alternatively, in colloquial Persian, where =rā is shortened to -o,
the -n is preserved (man-o ‘me’). The 2 sg polite address is 2 pl šomā. In modern
polite language, a 3 sg polite form īšān has developed, which goes back to the
former ENP 3 pl pronoun ēšān ‘they’ (that was replaced by ānhā, which was orig-
inally the pl. of the demonstrative pronoun ān ‘this’). Polite 3 sg īšān takes a pl.
verbal form (e. g., īšān hastand [he.pl cop .3 pl ] ‘he is here’). There is also a form
of “halfway/reduced” politeness, by using īšān with a sg. verb, e. g. īšān goft ‘he
said’. The noun bande ‘slave’ is used as a polite form for the 1 sg pronoun (as it
is considered polite to belittle oneself). This occurs with 1 sg verbal forms (e. g.
bande raft-am [slave go.pst -1 sg ] ‘I went’). In CP, bande in the sense of “I” was
still used with a 3 sg verb in the 13th century, while polite ēšān occurs at least since
the 16th century (Paul 2002: 31–32).
In colloquial Persian, the pl. pronouns mā and šomā may take the pl. ending
-hā (māhā ‘we’, šomāhā ‘you [pl ]’), probably by analogy to 3 pl ānhā, or in the
case of šomā, to distinguish it from the polite form šomā, which is used for sg.
address.
4.9. Pro-drop
New Persian is a pro-drop language, in which not only a subject may be dropped,
e. g. raftam [go.pst .1 sg ] ‘I went’, but—in appropriate contexts—also a direct
object pronoun:
(12) mašq=at=rā neveštī? bale, neveštam
exercise=your=obj write.pst .2 sg yes write.pst .1 sg
‘Did you write your exercise? Yes, I wrote (it)’
(13) CollP
vaqtīke āmad, be=h=eš mī-gam
when come.pst .3 sg to=him dur -say.prs .1 sg
‘as soon as he comes, I will tell (it) him’
(14) CollP
dīdī goftam?
see.pst .2 sg say.pst .1 sg
‘Have you seen (it) I told (you)’ (= What did I tell you!)
Persian 591
4.10. Pronoun clitics

Besides the full pronouns, there is a series of pronominal clitics in SNP, as follows:
Table 4: SNP pronominal clitics

1 sg =am 1 pl =emān (coll. =emūn)
2 sg =at (coll. =et) 2 pl =etān (coll. =etūn)
3 sg =aš (coll./ENP(!) =eš) 3 pl =ešān (coll. =ešūn)
When they follow a word that ends in vowel, the pronominal clitics may take a
glide, or lose their initial vowel, to avoid a hiatus, see §3.9. Throughout the history
of Persian, the pronoun clitics may be suffixed to verbs, prepositions and nouns,
and serve the same grammatical functions as the full pronouns, with the exception
of sentence subject (but see below), e. g.:
direct object
(15) mībīnam=et
see. prs .1 sg =2 sg
‘I see you’
indirect object
(16) nešān=et dādam (or, less often: nešān dādam=et)
sign=2 sg give.pst .1 sg
‘I showed (it) you’
prepositional complement
(17) be=h=et goftam
to=2 sg say.pst .1 sg
‘I told (it) to you’
possessive complement
(18) barādar=at
brother=2 sg
‘your brother’
A pronoun clitic may even be infixed between a preverb and a verbal stem, e. g.
bar-aš dār! ‘remove it!’ (< bar-dāštan ‘remove’). It is also worth noting the condi-
tions where clitic pronoun and full pronoun are not interchangeable, e. g.:
(19) ketāb=am=rā na-dīd-am
book=1 sg = obj neg -see.pst -1 sg
‘I have not seen my book’
but not:
592 Ludwig Paul
(20) *ketāb-e man=rā na-dīd-am

book-ez I=obj neg -see.pst -1 sg
(21) az īn čīz xoš=am mi-y-āyad

from this thing good=1 sg dur - glide -come.prs .3 sg
‘I like this thing’
but not:
(22) *az īn čīz xoš-e man mi-y-āyad.
from this thing good-ez I dur - glide -come.prs .3 sg
In colloquial Persian, a third person singular sentence subject may be indicated by

a personal clitic, e. g. goft=eš [say.pst =3 sg ] ‘he said’, āmad=eš [come.pst =3 sg ]
‘he came’. The same construction occurs already in ENP (Lazard 1963: §319–321,
e. g. guft=aš ‘he said’, bud=aš ‘he was’). It is a survival from regular MP erga-
tive marking, although the MP personal clitic was usually attached not to a verb,
but to the sentence-initial word, e. g. u=š guft ‘and he said’ (Durkin-Meisterernst
2014: 292). This construction may always have existed, throughout the history of
Persian, as a possible, albeit (until today) non-preferred, sub-standard construc-
tion.
Combinations of more than one personal clitic are not possible in NP, i. e. in dit-
ransitive constructions like *dādam=ešān=at ‘I gave them to you’ (/ ‘I gave you to
them’). Instead, one of them must be expressed separately (be-h=at dādam=ešān).
In modern SNP, the preference for a clitic direct or indirect object depends largely
on the verb. E. g., with dādan “give”, both clitics are possible (i. e., dādam=eš
can mean both: “I gave it” and, although less familiar, “I gave (it) to him”). As for
goftan “say”, however, forms like goftam=et “I told (it) you” are theoretically pos-
sible but hardly used; be=h=et goftam is clearly preferred. Forms like goftam=at
are, however, frequent in Classical Persian literature.
It is interesting to note that a 1 sg pronoun clitic can be attached to a 3 sg past

verbal form to yield an expression that is identical with a inflected 1 sg verbal
form, e. g. šenāxt=am [recognize.pst .3sg =me] ‘he recognized me’ vs. šenāxtam
[recognize.pst .1 sg ] ‘I recognized (it)’.
4.11. Demonstratives
The SNP demonstrative pronouns are īn ‘this’ and ān ‘that’, pl. īnhā, ānhā (in CP,
there was also īnān, ānān). When they qualify nouns as demonstrative adjectives,
they precede the noun and remain always in the sg., e. g.: īn mard ‘this man’, ān
mardhā ‘those men’.
Persian 593
4.12. Reflexivity
4.12.1. Modern Persian

In modern Persian, reflexivity is expressed through an uninflecting reflexive
marker SNP xod (Tajik xud), to which clitic pronouns and, if necessary, the object
clitic =rā may attach:
(23) man xod=am=rā mī-bīnam
I refl =1 sg = obj dur -see. prs .1 sg
‘I see myself’
(24) Aḥmad bā xod=aš ḥarf mī-zanad
Ahmad with refl =3 sg word dur -strike.3 sg
‘Ahmad talks to himself’
The use of the reflexive marker is not normal, however, when a possessor is
expressed, unless there is a strong sense of emphasis (see below):
(25) mā dūst-hā-y=emān=rā (no xod!) mī-bīnīm
we friend-pl - glide =1 pl = obj dur -see.prs .1 pl
‘We see our friends’
Xod may also be used as an identity marker together with the enclitic pronoun in
order to emphasize the identity of the person concerned or to contrast with other
posssible interpretations, e. g.:
(26) mā dūst-hā-y-e xod=emān=rā mī-bīnīm
we friend-pl - glide - ez emphat =1 pl = obj dur -see. prs .1 pl
‘We see our own friends’
(27) Aḥmad xod=aš mī-y-āyad
Ahmad emphat =3 sg dur - glide -come.prs .3 sg
‘Ahmad himself comes’
4.12.2. Diachronic developments

In the history of NP, there was both a morphological and a systemic change in the
expression of reflexivity. ENP had the two reflexive markers xwēš and xwēštan
(the former used mainly for reflexive possession, the latter for all other reflexive
relations). Xwad (later xod) was originally an emphatic adverb ‘even, just’, but
started replacing xwēš and xwēštan in their reflexive meanings from late MP /
ENP onwards, in a gradual development that took centuries to be completed (Paul
2002: 29–30).
In contrast to modern SNP xod, the ENP and CP reflexive markers xwēš,
xwēštan and (increasingly) xwad were still used without additional enclitic pro-
594 Ludwig Paul
nouns, hence no marking of person and number; and they were still necessary to
mark reflexive possession, e. g.:
(28) ENP/CP
Aḥmad xwēštan(=rā) mī-bīnad
Ahmad refl (= obj ) dur -see.prs .3 sg
‘Ahmad sees himself’
(29) Aḥmad bā xwad ḥarf mī-zanad
Ahmad with refl word dur -strike.prs .3 sg
‘Ahmad talks to himself’
(30) mā dūstān-e xwēš(=rā) mī-bīnīm
we friends-ez refl (= obj ) dur -see.prs .1 pl
‘We see our friends’
Constructions like X. xod=rā mī-bīnad ‘X. sees himself’, without personal clitic,
are possible also in modern SNP, but they would represent a formal register of
SNP, one that is actually emulating CP.
5. Adpositions
5.1. Prepositions
SNP Adpositions fall into four categories: simple prepositions (used without an
ezafe); a large set of prepositions that are linked to their complement through an
ezafe; prepositional expressions, i. e. combinations of adverbs or adjectives with
simple prepositions; the postposition, or clitic, =rā (on which, see §5.5. below);
and circumpositions (§5.6.).
5.1.1. Simple prepositions

The main SNP ‘simple’ prepositions, that are used without the ezafe, are the fol-
lowing: az ‘from’ (Abl.), bā ‘with’ (Instr., Comit.), bar ‘upon’, be ‘to, towards’
(indir. Obj./Dat.), bī ‘without’, dar ‘in’ (Loc.), ellā ‘except’, joz ‘except’, tā ‘until,
to’. Of these, az, bā, be and dar cover core case functions that are here abbreviated
with the traditional terms of ablative, instrumental, etc.
5.1.2. Prepositions with ezafe

These may be divided into two groups: a smaller group of prepositions with a non-lo-
cal meaning, e. g. barā-ye ‘for (the sake of)’ (Benefactive), bedūn-e ‘without’,
darbāre-ye ‘about’, mes̱l-e ‘like’, and a larger (in principle open) class with a local
Persian 595
meaning, most of which go back to nouns or adverbs, e. g.: bālā-ye ‘over, above’,
beyn-e ‘between’, dāxel-e ‘inside’, pīš-e ‘to(wards)’ (someone, +animate), pošt-e
‘behind’, zīr-e ‘under’. Most of these prepositions were formerly combined, as
nouns, with a simple preposition, and can still be used in such a way in formal
language; the simple preposition can then serve to distinguish various meanings,
etc.: dar dāxel-e ‘inside’ (position), be dāxel-e ‘inside’ (direction), from the noun
dāxel ‘interior, inside’. In less formal language, the simple prepositions be and dar
can be dropped in such expressions (see also §5.4.).
5.1.3 Prepositional expressions

Prepositional expressions include examples like baʿd az ‘after’, qabl az ‘before’,
rājeʿ be ‘concerning’, or of nouns with preceding simple prepositions (containing
also the “formal” varieties of the prepositions) and an ezafe, e. g.: bā vojūd-e ‘in
spite of’ (lit., ‘with the existence of’), be vasīle-ye ‘by means of’, az dast-e ‘by
(means of)’, rū-be-rū-ye ‘opposite’.
5.2. Historical development

The form and function of some of these prepositions changed considerably in the
history of Persian. In ENP, there were historical forms like abar, abā, bā(z), ba(d),
etc. (later > bar, bā, be(d)). In EJP, there was still pa(d) (~ ENP ba(d), MP pad), and
a continuation of the MP preposition ō ‘to(wards)’ (pronounced [u], or [a], in EJP).
The semantic shifts that some prepositions underwent are too complex to be dealt
with here in detail, but it is worth mentioning that bā was also used for addressees
of a verb of speech (e. g., bā ō guft ‘he said to him’), and in EJP, the preposition
u (later a) was also used as a direct object marker (see Paul 2013a: §180). See the
following overview, which includes also =rā (§5.5.):
Table 5: Historical development of prepositions

SNP ENP/EJP
recipient of a verb of transfer be (a)bā, ba(d), be, u, =rā
addressee of a verb of speech be (a)bā, be, =rā
instrument bā, be abā, bā(z), ba(d)/pa(d)
comitative bā abā, bā(z)
goal/direction of a motion verb be bā(z), ba(d)/pa(d), be, u
direct object of a transitive verb =rā =rā, u
596 Ludwig Paul
5.3. Modern colloquial Persian

In modern colloquial Persian, there is a tendency to replace some of the simple
prepositions by synonymous ones that are used with the ezafe, e. g. SNP dar ‘in’ >
CollP tū-ye, bar ‘upon’ > rū-ye (lit., ‘[on] the surface of’), bī ‘without’ > bedūn-e
(< Ar. bi-dūni ‘without’).
In fast or substandard colloquial speech, the ezafe may be dropped, e. g.: SNP
dar xāne ‘in the house’ > CollP tū-ye xāne > tū-xūne. The preposition bar already
seems to have been completely replaced by rū-ye in CollP; bī has largely fallen
out of use as a (free) preposition and is mostly used as a bound prefix in adjectives
like bī=kār ‘jobless’ nowadays. Similarly, be in its instrumental use has undergone
lexicalisation and is used mostly in adverbial expressions like be=zūdī ‘soon’,
be=rāḥatī ‘simply’.
For other prepositions (simple and with ezafe), there is a similar but weaker
tendency in colloquial speech to be replaced by nouns with ezafe, e. g. be ‘towards’
> ṭaraf-e (‘[to] the side of’) or > dast-e ([to] the hand [= side] of’); barā-ye ‘for’ >
vāse-ye (in fast speech, without ezafe: > vāse). All these prepositions belong then
to the second group of §5.1.
5.4. Ommission of prepositions

Another non-standard option of colloquial Persian worth of mention is the expres-
sion of location, direction or indirect objects without the prepositions dar or be,
e. g.:
(31) man Tehrān-am
I Tehran-cop .1 sg
‘I am (in) Tehran’
(32) raftam Tehrān (less often, but also possible: Tehrān raftam)
go. pst .1 sg Tehran
‘I went to Tehran’
(33) be-deh ʿAlī!
sbjv -give.imper ʿAlī
‘give (it) to Ali!’
The same constructions are possible in Tajik dialects (šahr raft ‘he went to the
city’, Lazard 1956: 141), and already in ENP (Lazard 1963, §713–714). This sug-
gests that the construction has been possible in colloquial registers of New Persian
throughout its history.
Persian 597
5.5. The particle =rā
5.5.1. Historical developments

This postposition is the most intensively discussed issue of Persian grammar. It
evolved from the MP postposition rāy with benefactive/recipient (indirect object)
sense, which was preserved in ENP =rā, e. g.:
(34) xalq=rā rāh-e rāst be-namāyam
people=rā way right sbjv -show.prs .1 sg
‘(that) I show the people the right way’ (Siyāsatnāme, 11th cent. C. E.)
Already in ENP, the semantic range of =rā was extended to mark also direct
objects (mostly animate ones), e. g.:
(35) ū=rā be man sepord
he=rā to I entrust.pst .3 sg
‘he entrusted him to me’ (Čahār maqāle, 12th cent.; references: see Paul
2008: 334)
During the history of NP, the functions of =rā shifted gradually away from indirect,
to mark predominantly direct objects; unlike ENP, not animate but definite ones.
The historical evolution of =rā, its grammaticalization from a benefactive/indirect
object marker to a definite direct object marker, has been discussed in Lazard (1970).
Paul (2008) has drawn attention to the importance of animacy for this process in
ENP, which still plays a minor role also in SNP (Lazard 1992, §173: 185).
Although marking a definite direct object is the commonly accepted SNP core
function of =rā (or coll. =ro, =o, Tajik =ro, =a), the exact nature of the condi-
tioning factors determining the presence of =rā remains a matter of ongoing con-
troversy. The most comprehensive and reliable study of SNP =rā continues to be
Lazard (1982; see also Lazard 1992: §§173–177). The following sketch aims to
present some basic constructions, and to discuss a selection of special uses.
5.5.2. Basic constructions

A SNP direct object is clearly definite (in the sense of identifiable) and takes
always =rā, if it is a name or pronoun, or a noun that is complemented by a name
or a (full or enclitic) pronoun, e. g.:
(36) Ạḥmad=rā mī-bīnam
Aḥmad=rā dur -see.prs .1 sg
‘I see Ahmad’
598 Ludwig Paul
(37) māšīn-am=rā mī-forūšam

car-my=rā dur -sell.prs .1 sg
‘I sell my car’
In all other cases, the presence of =rā depends on the context, and on the inter-
play of a number of complex factors. See the following basic examples from SNP,
where =rā marks a direct object that is definite from the context, but not a generic
one, or one that is marked as an indefinite object by -ī:
(38) ketāb=rā mī-xaram
book=rā dur -buy.prs .1 sg
‘I buy the book (definite)’
(39) ketāb mī-xaram
book dur -buy. prs .1 sg
‘I buy a book/books (generic)’
(40) ketāb-ī mī-xaram
book-indef dur -buy. prs .1 sg
‘I buy a book, some book (one, indefinite)’
=rā may, however, be attached to a noun that is marked by the indefinite-specific
-ī, e. g.:
(41) ketāb-ī=rā mī-xaram
book-indef =rā dur -buy. prs .1 sg
‘I buy a (certain) book (specific, explained afterwards)’
The expressions ketāb=rā and ketāb-ī=rā are complementary with respect to time:
while ketāb=rā points to the past (the book has been known beforehand), ketāb-
ī=rā points to the future (the book will be explained soon, e. g. through a relative
clause). Definition is obviously not a semantic feature of a noun, but a value that
is attributed to it by the speaker, his knowledge of the context, and the situation.
The numeral yek ‘one’ may emphasize or replace the indefinite -ī in all the
cases above (e. g., yek ketābī-rā, etc., see §4.5.1.).
5.5.3. =rā with generic objects

Just like nouns with -ī, nouns used in a generic sense are, by definition, not definite.
Nevertheless, a generic object may also take =rā under certain conditions, e. g.
when it is set up in contrastive opposition to another sentence constituent, and/or
when it is separated from the predicate by other constituents, cf. (Lazard 1989: 280):
(42) xargūš dūst dāram
hare liked have. prs .1 sg
‘I like (to eat) hare’
Persian 599
(43) xargūš=rā hame gūštxwārān ne-mī-xorand

hare=rā all carnivores neg - dur -east. prs .3 pl
‘Not all carnivores eat hare’
Sporadically a generic noun that is not emphasized/thematicized may take the =rā:
(44) serke šīr=rā mī-borrad
vinegar milk=rā dur -cut.prs .3 sg
‘Vinegar makes milk curdle’
5.5.4. Topicalization
Lazard (1982) had already shown that besides marking a definite direct object, =rā
can be used also to “topicalize” an important “pole” of a sentence. Recent studies
like Dabir Moghaddam (1992) or Meunier and Samvelian (1997) also focused on
pragmatic factors like topicalization, thematization etc. See the following example
of topicalisation by =rā:
(45) otāq=o, dar-eš=o bastam (colloquial)
room=rā door-his=rā close.pst .1 sg
‘as for the room, I closed its door’ (Dabir-Moqaddam 1992: 553)
5.5.5. Special verbs with =rā

The question as to whether semantic features of the verb or noun have any influ-
ence on the presence of =rā has been rather neglected so far. Lazard (1982: 186)
noticed that verbs with low semantic content are less likely to occur with =rā,
but he did not develop this argument. Indeed, there is a group of transitive verbs,
semantically defined as “verbs that exert a clear and lasting influence on the direct
object”. These verbs practically cannot be used without =rā in natural language
use, even if a direct object is marked as indefinite (and not specific) by -ī. This
includes verbs like: koštan ‘kill’, taγyīr dādan ‘change’, xarāb kardan ‘destroy’,
bombārān kardan ‘bombard’. See the example:2
(46) man (yek) mard-ī=rā koštam
I (one) man-indef =rā kill.pst .1 sg
‘I killed a man’ (not specific)
A use of koštan without =rā would be feasible with a generic understanding of the
object, but this sounds odd and unnatural:
2
I owe this important point to a discussion with Maximilian Kinzler and Ramin Shagh-
aghi.
600 Ludwig Paul
(47) *emrūz man mī-ravam mard mī-košam (?)

today I dur -go.prs .1 sg man dur -kill.prs .1 sg
‘Today, I go killing men (generically, i. e., like a profession)’
5.5.6. Special nouns and =rā: abstract nouns

Most studies so far have concentrated on countable, concrete nouns like: ketāb
‘book’, mard ‘man’, xāne ‘house’, xargūš ‘hare’ as direct objects, sometimes
also on non-countables like āb ‘water’ or gūšt ‘meat’. Abstract nouns like āzādī
‘freedom’ have hardly been considered. It is difficult to say if, e. g., ‘freedom’
as such, is definite or not. In simple sentences, abstract nouns seem to be used
without =rā, e. g.:
(48) rāḥatī / āzādī be dast āvardam
rest / freedom to hand bring.pst .1 sg
‘I attained rest / freedom (etc.)’
But like generic nouns, they may do so when they are in contrastive focus (=rā is
optional here):
(49) āzādī=rā dūst dāram, na bandegī=rā
freedom=rā friend have.prs .1 sg not slavery=rā
‘I love freedom, not slavery’
Abstract nouns that tend to take =rā when used as direct objects include zendegī
‘life’ or marg ‘death’, which usually refer to the speaker and is therefore “inher-
ently definite” in sentences like:
(50) zendegī=rā dūst dāram
life=rā friend have.prs .1 sg
‘I love life’
5.5.7. Sciences, techniques, etc. with =rā

Another special group of nouns are sciences, techniques, arts and languages. They
occur usually without =rā:
(51) Fārsī balad-am
Persian informed-cop .1 sg
‘I know Persian’
(52) riyāẓiyāt / tārīx (etc.) tadrīs mī-konam
mathematics / history teaching dur -do.prs .1 sg
‘I teach mathematics / history (etc.)’,
Persian 601
But they may take =rā when they are in contrastive opposition:
(53) emrūz Fārsī=rā tadrīs mī-konam, fardā
today Persian=rā teaching dur -do.prs .1 sg tomorrow
adabiyāt=rā
literature=rā
‘Today I teach Persian, tomorrow literature’
A science that takes a nominal complement with the ezafe, becomes definite and
takes =rā:
(54) tārīx-e Čīn=rā dūst dāram
history-ez China=rā friend have.prs .1 sg
‘I love Chinese history (lit., the history of China)’,
unless the whole noun phrase corresponds to the name of a subject that is regularly
taught at academic institutions:
(55) tārīx-e Čīn tadrīs mī-konam
history-ez China teaching dur -do.prs .1 sg
‘I teach (the subject:) “Chinese history”’
In this example, tārīx-e Čīn represents the conventional name of an academic
subject. The exceptional absence of =rā in such cases may be due to the verb
tadrīs kardan, which occurs typically with names of academic subjects.
5.6. Circumpositions
Besides the prepositions and the postposition =rā, a small number of circum-
positions are attested in ENP, and in some CP texts, though no longer in use in
SNP. They can be divided into two groups (other West Iranian languages such as
Kurdish continue to make extensive use of circumpositions, see Haig, this volume,
chapters 2.3 and 3.3):
a) Combinations of (mostly simple) prepositions with =rā, see Lazard (1963:
§539–552), e. g. ba …=ra ‘to, towards’, az …=rā ‘because of’, in EJP also
‘concerning’ (Paul 2013a: §184.a); azmar-i …=rā ‘for’ occurs only in EJP
(Paul 2013a); mar …=rā, which has the same meaning as plain =rā, is a
special case, because mar without =rā is not a well-established preposition of
ENP (Lazard 1963: §752–756).
b) Combinations of various simple prepositions with each other. In ENP, it was
quite common for ba to be combined with other simple prepositions in post-
positional use, e. g. ba … (an)dar ‘in’ (Lazard 1963: §619–621), or a “dou-
bling” of prepositions like (an)dar … (an)dar ‘in’ (Lazard 1963, §623).
602 Ludwig Paul
6.1. Generalities
Verbal forms of all varieties of NP are based on two stems, present and past, from
which a system of tenses and moods is built with the help of endings, verbal par-
ticles, and auxiliaries. The tense systems of the modern varieties of Persian, espe-
cially the written standard forms, are structurally similar and share certain parallel
developments, but they are quite different formally. Tajik dialects have developed
modal categories such as “presumptive” or “dubitative” that do not exist in Fārsī.
In ENP the basic verbal forms of later SNP and Tajik were already in use, but
the various verbal particles (see below) became fully grammaticalized only in the
course of later centuries. The tense system of ENP was thus structurally very dif-
ferent from SNP. This was also because it was less standardized and allowed for
many regional varieties. A recent account of the evolution of the NP verbal system
has been given by Lenepveu-Hotz (2014).
6.2. Certain verb classes
6.2.1. Compound verbs

Certain NP simple verbs (e. g., zadan ‘strike’) can combine with nouns or adjec-
tives (e. g., ḥarf ‘letter, speech’) to form “light verb constructions”. These con-
structions, which are very widespread in New Persian and show various levels
of amalgamation, correspond to simple verbs in many other languages (ḥarf
zadan ‘speak’, lit. ‘to strike a word/speech’). A correct understanding of these is
essential for a correct understanding of the syntax of the Persian verb phrase (see
Ahadi 2001 and Samvelian 2012).
There is no clear-cut distinction between ‘light verb constructions’ and ‘phrasal
verbs’ with a more general meaning (e. g., māhī gereftan ‘to fish, angle’, lit. ‘to
catch a fish’), as shown by various syntactic tests in Ahadi (2001: 220–228). Both
groups form a continuum, on the loose edge of which follow verbal constructions
with a generic noun like ketāb xarīdan ‘buy books’.
Other groups of compound verbs include verbs with preverbs, e. g. bar-dāštan
‘remove’; prepositional verbal constructions like be donyā āmadan ‘be born’ (lit.,
‘come to the world’); and verbal constructions with ezafe like: montaẓer-e …
būdan ‘wait for …’, with a participle (montaẓer ‘waiting’) followed by the object
with the ezafe.
Persian 603
6.2.2. Impersonal verbal constructions

A large and increasing number of verbal constructions in CollNP is formed with a
3 sg verb and an experiencer, often a personal clitic, to express physical and psy-
chological processes, e. g.:
(56) xoš-am mī-y-āyad
good-me dur -glide-come.prs .3 sg
‘I like (it)’
(57) xwāb-at bord
sleep-you bring.pst .3 sg
‘You fell asleep’
(58) gorosne-am ast
hungry-me cop .3 sg
‘I am hungry’
The latter is used alongside the SNP personal construction gorosne-am ‘I am
hungry’ (see Rāsex Mahand 2007).
6.2.3. Auxiliary verbs

NP uses the following auxiliary verbs: xwāstan ‘want’ (for the future, §6.7.), the
copula (compound past tenses, §6.8.), and šodan ‘become’ (passive, §6.13.). For
the progressive, SNP uses dāštan ‘want’, Darī raftan ‘go’, and Tajik istodan ‘stay’
(§6.12.).
6.3. Person and number agreement suffixes

These are almost identical in each variety of NP, and across all tenses. The sole
exception is SNP 3 sg , which is -ad in the present, and -Ø in the past tenses. The
enclitic present copula is identical with these endings, with the exception of 3 sg
(§6.5.). The endings in overview, with some of the more important regional/chron-
ological variations are as follows:
Table 6: Person and number agreement in Persian

sg pl
1 -am (ENP/Taj./Darī also -om) -īm (ENP -ēm, Taj. -em)

2 -ī -īd (ENP -ēd, Taj. -e(d); CollP -īn, Darī -ēn)
3 -ad (coll. -e) (present)/- Ø (past) -and (coll. -an)
Politeness is expressed with the 2nd person plural. For impersonal ‘one’, the 3rd
person plural is used, e. g. mī-gūyand [dur -say.prs .3 pl ] ‘they say, one says’.
604 Ludwig Paul
6.4. Verbal affixes

The following affixes (Table 7) may be combined with Persian verbs to express
tense, modality, and negation:
Table 7: Verbal affixes

mī (ENP (ha)mē, Darī mē, Taj. me): indicative present tense, imperfect
(used with all verbs except dāštan ‘have’)
be (allomorphs bo, bī etc. depending on present subjunctive and imperative
verbal stem; ENP bi or bu):
na (SNP ne before mī): negation
ma (= ENP, in modern Persian na): prohibitive
-ē (ENP -ē(δ), CP -ē): irrealis/habitual of the past (no longer used
in modern Persian)
Negative na- can be combined with mī-/me- (always preceding it, e. g. SNP
ne-mī-ravam ‘I don’t go’), but not with be-, which it replaces. In ENP, many par-
ticle combinations were still possible that fell out of use later, e. g. (ha)mē + bi-
(Lazard 1963: §357) or bi- + na- (Lazard 1963, §443), e. g. bi-na-y-āmaδ [sbjv -
neg - glide -come.pst .3 sg ] ‘He did not come’. For verbs with preverbs, in SNP a
prefix is always inserted between the preverb and the verbal stem, while in Tajik
and (partly) Darī (e. g., Kabuli), it regularly precedes the prefix, e. g.:
(59) SNP
bar-mī-āyad
out-dur -come.prs .3 sg
‘he comes out, appears’
(60) Taj.
me-bar-oyad
dur -out-come.prs .3 sg
‘he comes out, appears’
6.5. Present tense, imperative

The SNP/Taj. present tense is formed by mī-/me- + the verbal present stem + the
endings of Table 6, e. g. present indicative SNP mī-rav-am ‘I go’. The present
subjunctive is indicated by SNP be-, which may be dropped in certain cases (e. g.,
with compound verbs). In central and northern Tajik dialects, the subjunctive is
expressed by a bare present stem, e. g. kun-am ‘(that) I may do’ (Lazard 1956: 145).
In some Tajik verbs, a historical be- (or bu- etc.) has been lexicalized and inte-
grated into the verbal stem, e. g. buraftan ‘go’.
Persian 605
The present tense also serves to express the future in SNP and CollNP, e. g.:
(61) fardā man mī-ravam bāzār
tomorrow I dur -go. prs .1 sg bazaar
‘Tomorrow, I shall go to the bazaar’
The imperative is expressed by be-/bo- (etc.) and the bare present stem for the sg.,
and with an additional 2 pl ending for the plural, e. g.: bo-xor ‘eat!’ / bo-xorīd ‘eat
(pl )!’
In CollNP (Fārsī and Tajik), a small number of verbs that are of high frequency
exhibit shortened forms of the present stem, e. g.: mī-r-am ‘I go’, mī-g-am ‘I say’,
mī-d-am ‘I give’ (SNP mī-rav-am, mī-gū-y-am, mī-dah-am; Tajik coll. mē-r-am,
etc.).
6.6. The copula

There are two forms of the SNP present tense copula, a full and an enclitic one:
Table 8: Two forms of the SNP present tense copula

‘to be; to be there’ (full) ‘to be’ (enclitic)
1 sg hastam -am
2 sg hastī -ī
3 sg (h)ast -ast (coll. -e; after long vowel: -st, coll. -s)
1 pl hastīm -īm
2 pl hastīd -īd (coll. –īn, Darī -ēn)
3 pl hastand -and (coll. -an)
The full form of the copula can be used with a stress on the verbal stem in the
emphatic sense of: man hástam ‘I am here’, but also as a normal substantive verb,
like the enclitic form, cf.
(62) man Ālmānī-y-am / man Ālmānī hastam
I German-glide - cop .1 sg / I German cop .1 sg
‘I am German’.
The copula is necessary in sentences stating identity or quality. It is not used in
exclamations and other elliptic expressions, e. g.:
(63) xodā=rā šokr
God=rā thanks
‘Thanks (be) to God’ (also in ENP, Lazard 1963: §779)
606 Ludwig Paul
(64) xoš be ḥāl-at

good for condition-your
‘Good for you!’
and with kū ‘where?’:
(65) Ḥasan kū?
Ḥasan where?
‘Where is Ḥasan?
The copula has the simple past būd- (from the infinitive būdan ‘be’) and the irreg-
ular subjunctive stem bāš-, e. g. the imperative sg.: movāẓeb bāš! ‘be attentive /
take care!’
The negative form of the copula is: nīst-am ‘I am not’, nīst-ī ‘you are not’, etc.
6.7. Future
Besides the present tense, which may express the future, especially in CollNP
(§6.4.), there is a morphological future tense formed with the present stem of the
verb ‘to want’, the appropriate personal endings and the shortened infinitive, e. g.:
xwāham raft ‘I shall go’.
This tense did not yet exist in ENP, in which xwāham raft would still mean ‘I
want to go’ (also expressed by xwāham raftan with the full infinitive). Of course
the development of future meanings from a desiderative ‘want’ is a well-attested
grammaticalization pathway, but it was only with the development of modal con-
structions with the subjunctive in CP (§6.15.) that the future developed as a sepa-
rate tense.
6.8. Past tenses

The basic system of past tenses of SNP and literary Tajik is based on three forms:
simple narrative past (raft ‘he went’), durative-iterative imperfect (mī-raft / me-raft
‘he used to go’), both with the appropriate personal endings of the past (§6.2.); and
present perfect, built with the past participle (rafte / rafta ‘gone’) and the enclitic
copula (§6.6.): rafte-ast / rafta-ast ‘he has gone’.
In all varieties of NP a combination of imperfect and present perfect has also
developed, expressing in SNP a durative past action, which is completed in the
past, e. g.:
(66) bīš az emrūz az xodā mī-tarsīde-and
more than today from God dur -fear.ptcp - cop .3 pl
‘(… a time when the poor) were more in fear of God than today’ (Lazard
1992: 154–155)
Persian 607
In Tajik this verbal form developed into a resultative mode (Lazard 1956: 151). It

is already attested, though sparsely, in ENP (Lazard 1963: §389).
The Persian tense system is completed by the pluperfect that is built from the
PPP and the past copula (rafte būd ‘he had gone’).
6.9. Present perfect

In colloquial registers of both Fārsī and Tajik, the present perfect endings are amal-
gamated with the stem, and can partly be distinguished from the simple past only by
stress (e. g., Fārsī rafté-am ‘I have gone’ > raftá(a)m: simple past ráftam ‘I went’;
Tajik raftá-y-am ‘I have gone’ > raftém: simple past ráftam, Lazard 1956: 147).
In CollNP, the 3 sg present perfect has lost its ending (rafté ‘he has gone’), which
can be either explained through the colloquial form of the copula (e instead of ast,
thus: rafté-e > rafté), or by analogy with the zero form of 3 sg in the simple past
(cf. Table 6).
6.10. Tajik “resultative”

In Tajik dialects, on the basis of the present perfect a new “resultative” verbal mode
has emerged (Lazard 1956: 148–151). On closer inspection, Tajik dialects offer a
lot of constructions that do not exist in Fārsī. For example, derivations from a past
participle in -agī (not found in Fārsī) can be used both as nouns (omadagī-am ‘my
coming’) and verbally, in constructions like: az man omadagī ‘since I have come’
(Lazard 1956: 171). This and other constructions show a tendency in Tajik towards
clause subordination through non-finite verb forms (participles), as opposed to the
predominance of finite verbs used in subordination in Fārsī.
In the Tajik dialect of Varzob, the participle in -agī, together with the copula,
forms a separate verbal mode, the “presumptive”, e. g.:
(67) qošuq kujo? — xona budagī-st
spoon where? — house be.pst . ptcp - cop .3 sg
‘where are the spoons? — they must be (/ are presumably) in the house.’
A presumptive exists also in the Darī dialect of Kabul, but it is built in a different
way, with the adverb xāt (< xwāhad ‘he wants’, Lazard 1956: 160). The presump-
tive can be considered a sub-type of evidential, or indirect source. Turkic (Uzbek)
influence may have played a role in this development.
Forms in -agī are not found in ENP in Arabic script, but they occur in certain
EJP exegetical texts (tafsir) (Paul 2013: §161), e. g. kardagī [correct for: kardagē]
‘he has done’, often with a resultative meaning.
608 Ludwig Paul
6.11. ENP
The verbal system of ENP is difficult to describe, since it is not uniform or stand-
ardized and has a lot of regional/dialectal varieties. The earliest stage of Standard
Persian, the language of the classical authors of the 13th–15th centuries, repre-
sents a transitory stage of the language, in which many ENP grammatical features
are still found, dialectal standardisation has been attained, and the rules of modern
Standard Persian are gradually evolving. Some important features in which the
ENP verbal system diverges from the later SNP one include:
a) In some of the earliest EJP documents, the present perfect with the past part-
ciple in -a (SNP -e) had not yet been fully developed as a tense, participles
like nibišta had still an adjectival-static (passive) meaning, e. g., in expres-
sions like: nibišta hest ‘it is written’ (Paul 2013: §165.b). In most ENP texts,
the present perfect with -a had already been fully developed, but there are
still examples of passive-adjectival past participles (e. g., man kušta am ‘I
am killed’, Lazard 1963: §487; in modern SNP examples of lexicalized past
participle, e. g. baste ‘closed, tied; package’).
b) The EJP present perfect can still be built with the plain past stem, e. g., nibišt
hest ‘he has written’ (Paul 2013a: §164.b). In ENP, this is very rare and is
practically restricted to one manuscript (Lazard 1963: §485).
c) In ENP, there is a variety of auxiliaries used for compound past tenses that no
longer exist in SNP, like hast-, buv-, b-, etc. (e. g., dīda hastē ‘have you (pl )
seen …?’, Lazard 1963: §482–483).
d) In some ENP texts from northeastern Iran (e. g., in the national epic Šāhnāme)
there is the “present perfect II” of the kardastam (1 sg ), kardastī (2 sg ) etc.
type, probably of dialectal origin, which is used besides the “normal” present
perfect, in the same sense. It occurs occasionally also in later Classical poetry,
but has vanished from SNP.
e) (ha)mē and bi- were still prosodically independent particles in ENP and would
be grammaticalized only in the course of Classical literature. In early CP, bi-
was often used also with the simple past (e. g., bi-raft ‘he went’).
f) In ENP and (early) CP (but no longer in SNP) there was a suffix -ē(δ) (EJP
-ē(h)), deriving historically from a MP optative particle, which had the mean-
ings of irrealis (conditional, potential, etc.) and habitual, e. g. (conditional):
(68) agar man ānjā būdam-ē
if I there be.pst .1 sg - irr
‘if I had been there’ (Lazard 1963: §456)
Persian 609
6.12. Progressive
Both Fārsī, Kabuli Darī and Tajik have developed a progressive verbal mode,
present and past, with a similar function but a completely different structure, see
Table 9 (Lazard 1956: 160, 165–166):
Table 9: Progressive verbal mode

Present Past
SNP dāram mī-ravam ‘I am going’ dāštam mī-raftam
Kabuli rafta mē-ravom rafta mē-raftom
Tajik rafta istodam rafta istoda budam
(in dialects shortened to: rafsodam) (to: rafsodudam)
It is worth noting that the progressive mode does not allow negativization, i. e.
there is no *nadāram mīravam or *dāram nemīravam. Typologically, this may
reflect a universal tendency: it does not make much sense to point to an ongoing
action or process in progress, if it does not take place. Besides indicating that an
action is still in progress, there is an “intentional” use of the progressive mode
in CollP., referring to intended actions in the future, e. g.: dāram fardā mī-ravam
Ālmān ‘I am about to go to Germany tomorrow’.
6.13. Passive
A SNP passive is built with the past participle and the auxiliary verb šodan
‘become’, e. g. xarīde šod ‘(it) was bought’. The passive is mainly employed when
the agent is unknown; in formal language an agent may be added with compound
prepositions like az ṭaraf-e … ‘by, from … side’. In ENP, there was also a passive
with āmadan ‘come’, e. g. gufta āmad ‘it was said’.
6.14. Specific usages

There are specific, noteworthy usages of the SNP tenses that differ from that of
English (Lazard 1992: §131):
A present verb may be used to express a persistent situation where English uses
the present perfect, e. g.:
(69) čand sāl ast īnjā zendegī mī-konī?
how.many year cop .3 sg here living dur -do. prs .2 sg
‘For how many years have you been living here?
610 Ludwig Paul
(70) az zamān-e … sī sāl mī-goẕarad

from time-ez … thirty year dur -pass. prs .3 sg
‘From the time of…, thirty years have passed.’
A past verb may indicate an action that is accomplished (and witnessed) at the very
moment of speaking, e. g.:
(71) āmadand
come.pst .3 pl
‘They have (just now) come’
or going to be completed in the immediate future (Lazard 1992: §138):
(72) āmadam!
come.pst .1 sg
‘I am coming (just now)!’
A past verb may also be used in a future context in conditional clauses (§8.4.2.3.).
6.15. Modal verbal constructions

In the history of NP, constructions involving modal verbs changed their structure
substantially, the best example being the verb tavānistan ‘to be able’. While in
ENP and CP, tavānistan was followed by the full or short infinitive, e. g.:
(73) tavānad raft(an) (or, inverted: raft(an) tavānad)
can. prs .3 sg go.inf
‘He can go’
in modern SNP it is complemented by a subjunctive verb (Paul 2002: 26–28):
(74) mī-tavānad be-ravad
dur -can. prs .3 sg sbjv -go. prs .3 sg
‘He can go’
An intermediary stage was a dependent clausal construction with subordinating
particle ke ‘that’ (mī-tavānad ke be-ravad ‘He can go’). SNP impersonal modal
constructions with the short infinitive like bāyad raft ‘one has to go’ can be
regarded as remnants of the older constructions.
In Tajik dialects, and in Kaboli Darī, a construction with the passive participle
has developed (rafta mē-tavānam ‘I can go’), but what looks like a past participle
in these constructions has been explained as being derived historically form an
old infinitive (raftan) whose -n has been dropped (Lazard 1956: 176). The inverse
order of modal and full verb here, in contrast to modern Fārsī, may reflect Turkic
influence.
Persian 611
Table 10: Overview of the most important developments of the NP tense system
SNP  ENP  Tajik
Present: mīravam ([ha]mē) rawam meravam
Subjunctive: beravam (bi) rawam ravam
(be-: lexicalized)
Future: xwāham raft — xoham raft
Simple past: raftam (bi) raftam raftam
Imperfect past: mīraftam ([ha]mē) raftam meraftam
Present perfect (I): rafte-am rafta-am, rafta buvam etc. rafta-am
(EJP) raft hem
Present perfect (II): — raftastam —
Durative present mīrafte-am ([ha]mē) rafta-am (rare) merafta-am
perfect: (resultative)
Irrealis: — ([ha]mē/bi) raftam-ē —
Resultative: — (EJP) raftagī raftagī
Modal construction: mītavānad tavānad raft(an) rafta metavonad
beravad
This section on syntax presents only the basic constructions, and highlights, very
selectively, a small number of noteworthy phenomena. This applies especially to
the syntax of complex sentences. An excellent comprehensive description of SNP
syntax can be found in Lazard (1992: 177–257).
7.1. Noun phrases

In SNP, the modifier of a noun phrase, be it a substantive or descriptive adjective,
always follows the head noun with an unstressed connecting particle -(y)e, which
is called “ezafe”, e. g.:
(75) xāne-ye pedar
house-ez father
‘the father’s house’ (N-N)
(76) mard-e bozorg
man-ez big
‘the big man’ (N-Adj)
612 Ludwig Paul
Demonstrative, quantifying and indefinite adjectives always precede the noun and
remain in the singular, e. g., īn xāne ‘this house’, čand xāne ‘some (/ how many)
houses’.
In Tajik dialects, and in the Darī of Kabul, a construction with an inverted
ordering of elements has developed, e. g.:
(77) muallim-a kitob-aš
teacher-to book-his
‘the teacher’s book’ (Lazard 1956: 182, 184; Windfuhr and Perry 2009:
443)
This is most conspicuous in the northern Tajik dialects that have been under heavy
Uzbek influence. In these Tajik dialects, adjectives also exhibit a tendency to
precede the modifier without ezafe (Lazard 1956: 182). A similar construction
without -a is widespread also in colloquial registers of modern Fārsī, e. g.:
(78) Aḥmad zan-aš kojā-st? (for SNP: zan-e Aḥmad kojā-st?)
Aḥmad wife-his where-cop . prs .3 sg
‘Where is Aḥmad’s wife?’
In ENP, the word order of noun phrases was still less strict than in SNP, and
certain high-frequency adjectives like ‘big, small, good, bad’ could precede the
head without the ezafe (Lazard 1963, §165–172, e. g. baδ mardān ‘the bad men’,
Lazard 1963, §167; similarly in EJP, Paul 2013a: §189.b). Less often, modifying
nouns could precede the head, e. g.:
(79) jihān šahriyār
world ruler
‘the ruler of the world’ (Lazard 1963: §163).
In ENP and CP, an -ē that marked the head noun of a noun phrase as indefinite
(§4.5.) replaced the ezafe before the descriptive adjective, e. g.:
(80) mard-ē bozorg
man-indef big
‘a great man’
In SNP, and in high registers of spoken Fārsī, this construction still exists as an
emulation of classical style. In “normal” spoken Fārsī, the indefinite noun phrase
retains the ezafe, and the -ī (< ENP/CP -ē) is attached at the end of the noun phrase
(mard-e bozorg-ī).
Persian 613
7.2. Word order
7.2.1. SNP
For the word order of a simple SNP sentence, the following rules apply:
a. The subject (S), if overtly expressed at all (see §4.8. on pro-drop), precedes
the predicate (Pred). The predicate may be a full verb or a copula.
b. Direct or indirect objects (Od, Oi), goals of verbs of motion (Dir), and predi-
cate complements (CoPr) are inserted between S and Pred. With regard to the
relative ordering of Od and Oi in SNP, see Faghiri et al (2014) for a recent
corpus-based study.
c. In ditransitive sentences, a definite Od precedes the Oi, an indefinite Od
follows it.
Examples for these rules:

(81) a. Ḥasan mī-ravad(S–Pred)
Ḥasan dur -go. prs .3 sg
‘Hasan goes’
b. Ḥasan ʿAlī=rā mī-bīnad (S–Od–Pred)
Ḥasan ʿAlī=rā dur -see.prs .3 sg
‘Hasan sees Ali’
Ḥasan be Tehrān mī-y-āyad (S–Dir–Pred)
Ḥasan to Tehran dur - glide -come. prs .3 sg
‘Hasan comes to Tehran’
Ḥasan tambal ast (S–CoPr–Pred)
Ḥasan lazy cop .3 sg
‘Hasan is lazy’
c. Ḥasan ketāb=rā be ʿAlī dād (S–Od.def–Oi–Pred)
Ḥasan book=rā to ʿAlī give.pst .3 sg
‘Hasan gave the book to Ali’
Ḥasan be ʿAlī ketāb dād (S–Oi–Od.indef–Pred)
Ḥasan to ʿAlī book give.pst .3 sg
Hasan gave Ali a book/books’
In SNP, exceptions to these rules may occur, especially if a constituent is empha-
sized. In colloquial registers, almost any exception is possible, or even usual. For
example an object may follow the predicate if emphasized (Lazard 1992: §193):
(82) ʿomr-ešān=rā (Od) dādand (Pred) be šomā (Oi)
life-their=rā give.pst .3 pl to you
‘They gave you their life’ (i. e., idiomatically, ‘they passed away’)
614 Ludwig Paul
(83) šomā (S) dīdīd (Pred) ān yārū=rā (Od)?

you see.pst .2 pl that fellow=rā
‘Did you see the fellow in question?’
A direct object may also be topicalized and be placed before the subject. Less
often, a predicate precedes the subject when it is strongly emphasized, e. g.:
(84) tamām na-šod (Pred) kār-et (S)?
finished neg -become.pst .3 sg work-your
‘Isn’t your work finished?’ (Lazard 1992: §193)
Besides emphasis and topicalisation, there may be a “rhythmic” reason to break
the word order rules, in that very long noun phrases in object or copula comple-
ment function tend to follow the predicate (this applies also to SNP), e. g. (Lazard
1992: §193):
(85) īn (S) būde-ast (Pred) natīje-ye ʿadam-e tavajjoh (CoPr)
this be.ptcp - cop .3 sg result-ez lack-ez attention
‘This was the result of lack of attention’
Following Windfuhr and Perry (2009: 480), literary Tajik seems to comply with
the word order rules, and exceptions, of SNP. The most frequent deviation of word
order in Tajik is object fronting (OSV, OVS). For the dialects of Tajik, no study of
word order is currently available.
7.2.2. ENP
In ENP, word order is said to be much freer than it is in SNP (Lazard 1963: §788,
who however does not distinguish between Od and Oi). The study of EJP, which
may be taken to represent ENP, has identified word order rules that are not as strict
as in SNP, but already quite similar to them.
ENP exhibits a preference for S–O–Pred, but where constituents are given
emphasis, many exceptions are possible, e. g. Pred–S (Lazard 1963: §796, §799),
or even O–Pred–S, possibly reflecting colloquial style (Lazard 1963: §801). In
EJP, Oi usually follows the Od, but can also follow the Pred, as in modern collo-
quial Persian. The two preferred possibilities for the main constituents’ position
would be S–Od–Oi/Dir–Pred and S–Od–Pred–Oi/Dir (Paul 2013a: §193). Excep-
tions occur, e. g. Od follows Oi (S–Oi–Od–Pred,Paul 2013a: §199), or it follows
even the Pred (Paul 2013a: §195.c):
(86) pa čē bi-šināsī (Pred) durustīh i-ēn tis (Od)
by what sbjv -know. prs .2 sg truth ez -this matter
‘How do you know the correctness of this matter?’
Persian 615
The case of sentence-initial Pred in legal document should be considered separately,

since this seems to be a pecularity of legal formularies (Paul 2013a: §194.b), e. g.:
(87) u-guft (Pred) ēn Danyāl (S)
and-say.pst .3 sg this Daniel
‘and this Daniel said: …’
To summarize, the main differences between ENP and SNP seem to be:
in ENP, the Oi/Dir could more often follow the Pred, whose final position is
less strict.
in ENP, the Od was more strictly positioned before the Oi than in SNP.
8. Syntax of complex clauses
8.1. Noun clauses
8.1.1. SNP
SNP clauses that are dependent on verbs of belief, speaking, wishing, etc., regularly
follow the main clause, are introduced by the particle ke ‘that’, and always have a
finite verb form. Direct speech is preferred. The tense of the dependent clause is
independent of that of the main clause, e. g. future (dependent) following past (main
clause). Indirect speech is also possible, in which case only the personal suffix
changes, as seen in the following examples (taken from Lazard 1992: §203–204):
(88) Aḥmad goft ke pedar-am xwāhad āmad
Ahmad say.pst .3 sg that father-my will come. inf
‘Ahmad said that his father would come’ (dir. speech)
(89) Aḥmad goft ke pedar-aš xwāhad āmad
Ahmad say.pst .3 sg that father-his will come.inf
‘Ahmad said that his father would come’ (indir. speech)
After verbs meaning ‘want, wish, tell (to do), hope’ etc., the verb in the dependent
clause is in the present subjunctive:
(90) meyl dāštam ke be Tehrān be-ravam
inclination have.pst .1 sg that to Tehran sbjv -go. prs .1 sg
‘I wanted to go to Tehran’
(91) moʾallem be šāgerdhā goft ke ketābhā-y-etān=rā
teacher to pupils said that books-glide -your=rā
be-gīrīd
sbjv -take.2pl
‘The teacher told the pupils to take their exercise-books’
616 Ludwig Paul
(92) moʾallem be šāgerdhā goft ke ketābhā-y-ešān=rā

teacher to pupils said that books-glide -their=rā
be-gīrand
sbjv -take.3pl
‘The teacher told the pupils to take their exercise-books’
In colloquial Fārsī the ke is often dropped:
(93) Aḥmad goft pedar-am
Ahmad say.pst .3 sg father-my
mī-y-ād
dur - glide -come. prs .3 sg (present for future)
‘Ahmad says his father would come’
(94) Aḥmad goft mī-y-ād-aš
Ahmad say.pst .3 sg dur - glide -come-3 sg (cf. §4.9.2.)
‘Ahmad said he would come’
The dependent clause may be topicalized, in which case it is introduced by a cat-
aphoric demonstrative expression, and then precedes the main clause (Windfuhr
and Perry 2009: 512):
(95) īn (xabar) ke ʿAlī xāne=rā sāxt dorūγ ast
this news that Ali house=rā build.pst .3 sg lie cop .3 sg
‘(The news) that Ali built a house is a lie’
8.1.2. ENP
The few examples in EJP (Paul 2013a: §204) and the remarks in Lazard (1963: §810)
show that also in ENP direct speech was possible in these types of sentence.
8.1.3. Tajik
In Tajik (Windfuhr and Perry 2009: 513), direct speech seems to be impossible,
or unusual, in these types of sentence. Sentences of the indirect speech type (v.s.)
are possible, but in literary Tajik and Tajik dialects, Uzbek-type nominal clauses
that precede the main clause have become widespread, e. g. (Windfuhr and Perry
2009: 514):
(96) xud-i ū ki-st? gufta man az Rahim Qand
refl -3 sg he who-cop .3 sg say.ptcp I from Rahim Qand
pursidam
ask.pst .1 sg
‘Who is he, actually? I asked Rahim Qand’ (literally: ‘Who is he? – [that]
having said, I asked from Rahim Qand’)
Persian 617

SNP dependent relative clauses follow their head noun, which often has the “spe-
cific” -ī (§4.5.2.), and contain finite verb forms. They are introduced by the invar-
iable relative particle ke, e. g.:
(97) mard-ī ke mī-bīnī dozd ast
man-ī rel dur -see.prs .2 sg thief cop .3 sg
‘The man you see is a thief’
The SNP relative clause has been dealt with in detail by Lehmann (1978; see also
Lazard 1992: §207–213). In ENP the specifying -ē (> SNP -ī) occurred more rarely
than in SNP (Lazard 1966), in EJP it occurs very rarely (Paul 2013a: §207.e). EJP
has even preserved, from MPī, the ezafe i introducing relative clauses (besides ki;
Paul 2013a: §206). Ṣādeqī (2016) has recently shown that it exists, very rarely,
also in ENP.
The data in Windfuhr and Perry (2009) suggest that the SNP strategies for con-
structing relative clauses are possible in literary Tajik. In Tajik, however, there
is also a broad range of nominalized and pre-nominal relative clauses that corre-
spond partly to the Uzbek model, e. g. (Windfuhr and Perry 2009: 509):
(98) ob-e ki mo az hawz ovardem
water-e rel we from pool bring.pst .1 pl
‘the water that we brought from the pool’
(99) ob-i mo az hawz ovardagi
water-i we from pool bring. ptcp
(100) az hawz ovardagi-mon ob
from pool bring.ptcp -1 pl water
8.3. Temporal clauses

Persian distinguishes anterior, concomitant and future temporal clauses. One inter-
esting feature is that anterior temporal clauses always employ the present subjunc-
tive, even if the main clause is in the past:
(101) qabl az ānke ū be-ravad, man vāred
before from comp he sbjv -go.3 sg I entering
šodam
become.pst .1 sg
‘before he went (away), I came in’
618 Ludwig Paul
Future temporal clauses, on the other hand, usually require a past verb (but see
§8.4.), even if the action lies in the future, e. g.:
(102) baʿd az ānke raftī Ālmān, mī-bīnī
after from comp go.ps t.2 sg Germany dur .see.2 sg
ke …
that
‘After you (will) go to Germany, you (will) see that …’
Concomitant temporal clauses are often introduced by the conjunction vaqtīke
‘when’:
(103) vaqtīke Aḥmad ānjā raft, dīd kasī
when Aḥmad there go.pst .3 sg see.pst .3 sg nobody
nīst
neg . cop .3 sg
‘When Aḥmad went there, he saw (that) there was nobody’
Vaqtīke may be replaced by ke moving forward through the sentence:
Aḥmad ke ānjā raft, dīd kasī nīst
Aḥmad ānjā ke raft, dīd kasī nīst
Aḥmad ānjā raft ke dīd kasī nīst
The first two versions express practically the same meaning as the sentence above
with vaqtīke. In the 3rd version, with ke following the verb, the general meaning
of the whole sentence remains similar, but a “subordinator switch” (Windfuhr and
Perry 2009: 522) takes place and the second part of the sentence is syntactically
focused: ‘Aḥmad went there and saw: there is nobody’.
8.4. Conditional clauses
8.4.1. Impossible conditionals

There is a basic distinction between impossible and possible conditions in SNP.
Impossible conditions in the present, or ones that have not been realized in the
past, are expressed with the imperfect in the protasis and apodosis:
(104) agar mī-y-āmadam, mī-dīdam-aš
if dur -glide -come.pst .1 sg dur -see.pst .1 sg -him
‘If I had come, I would have seen him’, or: ‘If I came (but I won’t!), I
would see him’
Past unrealized conditions can also be expressed with the pluperfect in the prota-
sis:
Persian 619
(105) agar āmade būdam, mī-dīdam-aš

if come.ptcp cop . pst .1 sg dur -see.pst .1 sg -him
‘If I had come, I would have seen him’
8.4.2. Possible conditionals

Possible conditionals fall into the following three categories:
a. Conditions whose fulfilment is certain, or has already happened, employing
the present indicative in the protasis:
(106) agar mī-ravī bīrūn, ū=rā mī-bīnī
if dur -go.prs .2 sg outside he=rā dur -see.prs .2 sg
‘If you go outside (I know you will!), you (will) see him’
b. Conditions whose fulfilment is not certain, with the present subjunctive:
(107) agar be-ravī bīrūn, ū=rā mī-bīnī
if sbjv -go.prs .2 sg outside he=rā dur -see.prs .2 sg
‘if / in case you go outside, you (would) see him’
c. Conditions whose fulfilment is not certain, with the simple past; this empha-
sizes the immediate realization of the apodosis if the condition is fulfilled:
(108) agar raftī bīrūn, ū=rā mī-bīnī
if go.pst .2 sg outside he=rā dur -see.prs .2 sg
‘If you go outside, (right after doing that) you (will) see him’
8.4.3. Conditions without conjunction

In colloquial NP, any kind of condition may lack a conjunction, e. g.:
(109) mī-rī bīrūn, ū=rā mī-bīnī
dur -go.prs .2 sg outside he=rā dur -see.prs .2 sg
‘(If) you go outside (I know you will!), you (will) see him’
(110) raftī bīrūn, ū=rā mī-bīnī
go.pst .2 sg outside he=rā dur -see.prs .2 sg
‘(If) you go outside, (right after doing that) you (will) see him’
These sentences may alternatively be understood as concomitant temporal clauses
(‘when you go outside’), with which they share an identical structure and sequence
of tenses. The latter is actually the syntactically determining feature. Both types of
subordinate clauses can be distinguished only by using the conjunction agar “if”
or (vaqtī)ke “when”.
The close affinity of conditional and temporal clauses is also shown by the
fact that in future temporal clauses with an indicative present in the main clause,
620 Ludwig Paul
simple past and present subjunctive are equally employed in the protasis, the sub-
junctive introducing a doubt/condition into the temporal clause, e. g.:
(111) baʿd az ānke raftī / be-ravī Ālmān,
after from comp go.pst .2 sg / sbjv -go.prs .2 sg Germany
mī-bīnī ke…
dur .see.2 sg that
‘After you (will / were to) go to Germany, you (will) see that …’
8.5. Subordination vs. coordination

In modern colloquial Fārsī, many types of subordinate clauses can be constructed
without the SNP multi-functional conjunction ke “that, which (etc.)”. Besides tem-
poral, conditional (§8.3., §8.4.) and noun clauses (§8.1.), this concerns also rela-
tive clauses and clauses of purpose, e. g.:
(112) īn čī būd, xordī?
this what be.pst .3 sg eat.pst .2 sg
‘What was it (that) you ate?’
(113) ketāb-o xūndī, xoš-et ūmad?
book=rā read.pst .2 sg well-your come.pst .3 sg
‘The book (that) you read, did you like (it)?’
(114) Īrān rafte būdī čī kār konī?
Iran gone be.pst .2 sg what work do.sbjv .2 sg
‘You had gone to Iran, (so as) to do what?’
and other constructions, like the following, which may be understood concessively:
(115) xeilī zūr zad, na-šod
very power strike.pst .3 sg neg -become.pst .3 sg
‘(Although) he tried very hard, it did not work’, or:
‘He tried very hard, (yet) it did not work’
The missing ke in these clauses turns a construction that is subordinate in SNP, into
a paratactic sequence of clauses whose functional interdependency can only be
understood from the context, and from the sequence of tenses. What seems to be a
process of loss of ke as a clause conjunction in modern colloquial Fārsī, goes hand
in hand with a de-grammaticalization of ke where it is still used. A widespread
function of ke in colloquial NP is to emphasize the preceding word, without having
any grammatical function, e. g.:
(116) man ke ne-mī-dānestam!
I emphat neg - dur -know.pst .1 sg
‘I really/actually didn’t know!’
Persian 621
This emphatic meaning of ke can be understood to replace its (former) grammati-

cal meaning as conjunction in certain cases (see §8.3.2.), e. g.:
(117) man ke ānjā raftam, dīdam …
I [that] there go.pst .1 sg see. pst .1 sg
‘when I went there, I saw …’ > (preferred understanding:) ‘I actually went
there, (and) saw …’
See Maḥmūdī Baxtiyārī et al (2013) for recent discussion of these additional func-
tions of ke in contemporary spoken Persian. It remains to be seen if this “para-
taxization” of colloquial Fārsī syntax will affect also the grammar of SNP in the
future.
Abbreviations
1, 2, 3 First, second, third person irr Irrealis

comp Complementizer neg Negation
cop Copula num Numeral classifier
dem Demonstrative obj Object
dur Durative pl Plural
emphat Emphatic prs Present tense
ez Ezāfe pst Past tense
glide Glide ptcp Participle
imper Imperative refl Reflexive
indef Indefinite sbjv Subjunctive
inf Infinitive sg Singular
References
Ahadi, Shahram. 2001. Verbergänzungen und zusammengesetzte Verben im Persischen:

Eine valenztheoretische Analyse. Wiesbaden: Reichert.
Alam, Muzaffar. 1998. The pursuit of Persian: Language in Mughal politics. Modern Asian
Studies 32(2). 317–349.
Bani-Shoraka, Helena. 2005. Language choice and code-switching in the Azerbaijani com-
munity in Tehran: A conversation analytic approach to bilingual practices. Uppsala:
Uppsala Universitet dissertation.
Beeman, William O. 1986. Language, status, and power in Iran. Bloomington: Indiana
University Press.
Bonakdarian, Mansour. 2006. India. VIII. Political and cultural relations: Qajar period, the
19th century. Encyclopædia Iranica 8.1, 26–32. New York: Center for Iranian Studies,
Columbia University.
Borjian, Habib. 1388 [2009]. Motūn-e Ṭabarī [Ṭabarī texts]. Tehrān: Mīrās̱ -e Maktūb.
(Āyene-ye Mīrās̱ , supplement nr. 15).
622 Ludwig Paul
Cole, Juan R. 2002. Iranian culture and South Asia, 1500–1900. In Nikki R. Keddie & Rudi
Matthee (eds.), Iran and the surrounding world: Interactions in culture and cultural
politics, 15–35. Seattle & London: University of Washington Press.
Dabir Moghaddam, Mohammad. 1992. On the (in)dependence of syntax and pragmatics:
Evidence from the postposition -rā in Persian. In Dieter Stein (ed.), Cooperating with
written texts: The pragmatics and comprehension of written texts, 549–573. Berlin &
Boston: De Gruyter Mouton.
Durkin-Meisterernst, Desmond. 2014. Grammatik des Westmitteliranischen (Parthisch und
Mittelpersisch). Wien: Österreichische Akademie der Wissenschaften.
Faghiri, Pegah, Pollet Samvelian & Barbara Hemforth. 2014. Accessibility and word order:
The case of ditransitive constructions in Persian. In Stefan Müller (ed.), Proceedings of
the 21st International Conference on Head-Driven Phrase Structure Grammar, Univer-
sity at Buffalo, 27–29 August 2014, 217–237. Stanford, CA: CSLI Publications.
Fragner, Bert. 1999. Die “Persophonie”: Regionalität, Identität und Sprachkontakt in der
Geschichte Asiens. Berlin: Das Arabische Buch.
Gippert, Jost. 2007/08. Zur dialektalen Stellung des Zazaki. Die Sprache 47. 77–107.
Gyselen, Rika. 2012. Scellements dans lʾarchive post-sassanide du Tabaristan (1). In Rika
Gyselen (ed.), Objets et documents inscrits en pârsîg, 97–108. Bures-sur-Yvette:
Groupe pour l’Étude de la Civilisation du Moyen-Orient.
Jazayery, Mohammad Ali. 2000. Farhangestān. In Ehsan Yarshater (ed.) Encyclopædia
Iranica 9, 273–279. New York: Center for Iranian Studies, Columbia University.
Jensen, Hans. 1931. Neupersische Grammatik. Heidelberg: C. Winter.
Kahnemuyipour, Arsalan. 2003. Syntactic categories and Persian stress. Natural Language
& Linguistic Theory 21. 333–379.
Kauz, Ralph. 2009. Chang Zimei und der Islam in Shandong im 17. Jahrhundert: Akkultur-
ation oder Abgrenzung? Zur Geschichte einer persischen Grammatik. Saeculum 60(1).
91–113.
Khānlarī, Parvīz Nātel. 1979. A history of the Persian language, vols. 1, 2. New Delhi:
Sterling Publishers et al.
Khānlarī, Parvīz Nātel. 1365 [1986]. Tārīx-e zabān-e Fārsī [A history of Persian], 3 vols,
(revised ed.). Tehrān: Našr-e nou.
Kreiser, Klaus. 1993. Persisch als Schulsprache bei den osmanischen Türken: Von der
Tanzîmât-Zeit zur frühen Republik. In Jens Peter Laut & Klaus Röhrborn (eds.),
Sprach- und Kulturkontakte der türkischen Völker: Materialien der zweiten Deutschen
Turkologen-Konferenz, Rauischholzhausen, 13.-16. Juli 1990, 123–131. Wiesbaden:
Harrassowitz.
Landau, Jacob M. & Barbara Kellner-Heinkele. 2012. Language politics in contemporary
Central Asia: National and ethnic identity and the Soviet legacy. London: Tauris.
Lazard, Gilbert. 1956. Caractères distinctifs de la langue tadjik. Bulletin de la Societé Lin-
guistique 52(1). 117–186.
Lazard, Gilbert. 1957. Grammaire du persan contemporain. Paris: Librairie C. Klinck-
sieck. [Nouvelle édition: Téhéran: Institut Français de Recherche en Iran, 2006]
Lazard, Gilbert. 1963. La langue des plus anciens monuments de la prose persane. Paris:
Librairie C. Klincksieck.
Lazard, Gilbert. 1966. L’enclitique nominal -i en persan: un ou deux morphèmes? Bulletin
de la Societé Linguistique 61(1). 249–264.
Persian 623
Lazard, Gilbert. 1970. Étude quantitative de l’évolution d’un morphème: la postposition rā

en persan. In David Cohen (ed.), Mélanges Marcel Cohen, 381–388. Berlin & Boston:
De Gruyter Mouton.
Lazard, Gilbert. 1971. Pahlavi, Parsi, Dari: Les langues de l’Iran d’après Ibn al-Muqaffaʿ.
In Clifford Edmund Bosworth (ed.), Iran and Islam, in memory of the late Vladimir
Minorsky, 361–391. Edinburgh: Edinburgh University Press.
Lazard, Gilbert. 1982. Le morphème râ en Persan et les relations actancielles. Bulletin de la
Societé Linguistique 77(1). 177–208.
Lazard, Gilbert. 1989. Le persan. In Rüdiger Schmitt (ed.), Compendium Linguarum Iran-
icarum, 263–293. Wiesbaden: Reichert.
Lazard, Gilbert. 1992. A Grammar of Contemporary Persian. Costa Mesa: Mazda Publish-
ers.
Lehmann, Christian. 1978. Yā-ye ešārat. Zur Grammatik des persischen Relativsatzes. In-
dogermanische Forschungen 82 (1977). 97–106.
Lenepveu-Hotz, Agnès. 2011. Sipāh-i Ispahān: devenir des groupes consonantiques initiaux
moyen-perses en person. Faits de Langues 38. 79–95.
Lenepveu-Hotz, Agnès. 2014. Lʾévolution du système verbal persan (Xe – XVIe siècles).
Leuven & Paris: Peeters.
Liu, Yingsheng. 2010. A Lingua Franca along the Silk Road: Persian Language in China
between the 14th and the 16th centuries. In Ralph Kauz (ed.), Aspects of the Maritime
Silk Road: From the Persian Gulf to the East China Sea, 87–95. Wiesbaden: Harrasso-
witz.
MacKenzie, David Neil. 1992. Codex Cumanicus. Encyclopaedia Iranica 5.8, 885–886.
New York: Center for Iranian Studies, Columbia University.
Maḥmūdī Baxtiyārī, Behrūz & Farzāne Tāǧ Ābādī. 2013. Čand kārbord-e kalāmī-ye dīgar
az ‘ke’ dar Fārsīye goftār: Moṭāleʿāt-e zabān va gūyešhā-ye γarb-e Īrān [Some other
pragmatic functions of ‘ke’ in spoken Persian]. Journal of Language and Western Ira-
nian Dialects 1(2), 1392 [2013]. 103–116. Kermanshah: Razi University.
Marzolph, Ulrich. 2011. Das Schahname im Druck: Zu den lithographischen Ausgaben
des persischen Nationalepos. In Julia Gonnella & Christoph Rauch (eds.), Heroische
Zeiten: Tausend Jahre persisches Buch der Könige, 64–71. Berlin: SMB, Museum für
Islamische Kunst, Staatliche Museen zu Berlin.
Meunier, Annie & Pollet Samvelian. 1997. La postposition râ en persan: ses liens avec la
détermination et sa fonction discursive. Cahiers de Grammaire 22. 187–232. Toulouse:
Université de Toulouse – Le Mirail.
Moinfar, Mohammad Djafar. 1980. Les classificateurs en persan. In G. Brettschneider (ed.),
Wege zur Universalienforschung, 317–320. Tübingen: Narr.
Nešāṭ, Maḥmūd. 1368 [1990]. Šomār va meqdār dar Fārsī [Number and quantity in Per-
sian]. Tehrān: Amīr Kabīr.
Nawid, Senzil. 2012. Language policy in Afghanistan: Linguistic diversity and national
unity. In Harold F. Schiffman (ed.), Language policy and language conflict in Afghan-
istan and its neighbors: The changing politics of language choice, 31–52. Leiden:
Brill.
Paul, Ludwig. 1998. The position of Zazaki among West Iranian languages. In Nicholas
Sims-Williams (ed.), Proceedings of the 3rd European Conference on Iranian Studies.
Part 1, Old and Middle Iranian Studies, 163–176. Wiesbaden: Reichert.
624 Ludwig Paul
Paul, Ludwig. 2002. A Linguist’s Fresh View on “Classical Persian”. In Maria Szuppe (ed.),
Iran. Questions et connaissances. Actes du IVe Congrès Européen …, vol. 2, Périodes
médiévale et modern, 21–34. Paris: Peeters.
Paul, Ludwig. 2008. Some remarks on the Persian suffix =rā as a general and historical lin-
guistic issue. In Simin Karimi, Vida Samiian & Donald Stilo (eds.), Aspects of Iranian
Linguistics, 329–337. Newcastle: Cambridge Scholars Publishing.
Paul, Ludwig. 2013a. A Grammar of Early Judaeo-Persian. Wiesbaden: Reichert.
Paul, Ludwig. 2013b. Persian Language i. Early New Persian. Encyclopædia Iranica. New
York: Center for Iranian Studies, Columbia University. http://www.iranicaonline.org/
articles/persian-language-1-early-new-persian (accessed on 22September 2017).
Paul, Ludwig. (forthcoming). “Be nice to each other!” – Moral precepts at Tehran Metro
walls as examples of written colloquial Persian. In Pavel Lurije (ed.), Proceedings of
the SIE conference held in St. Petersburg, 15–19 September 2015.
Perry, John. 1991. Form and meaning in Persian vocabulary: The Arabic feminine ending.
Costa Mesa: Mazda Publishers.
Perry, John. 2005. A Tajik Persian Reference Grammar. Leiden: Brill.
Phillott, Douglas C. 1919. Higher Persian Grammar. Calcutta: University.
Pisowicz, Andrzej. 1985. Origins of the New and Middle Persian Phonological Systems.
Kraków: Nakł. Uniwersytetu Jagiellońskiego.
Rāsex Mahand, Moḥammad. 1386 [2007]. Touṣīf-e afʿāl-e morakkab-e peybastī dar zabān-e
Fārsī va šīve-ye ẓabṭ-e ānhā dar farhanghā [The study of Persian enclitic compound
verbs and their presentation in general dictionaries]. Farhangnevīsī 1(1). 236–252.
Rypka, Jan. 1968. History of Iranian Literature. Dordrecht: D. Reidel Publisher.
Ṣādeqī, ʿAlī Ašraf. 1363 [1984]. Tabdīl-e ‘ān’ va ‘ām’ be ‘ūn’ va ‘ūm’ dar Fārsī-ye goftārī
[The replacement of ‘ān’ and ‘ām’ by ‘ūn’ and ‘ūm’ in spoken Persian]. Majalle-ye
Zabānšenāsī 1(1). 52–71.
Ṣādeqī, ʿAlī Ašraf. 1385 [2006–07]. Taḥavvol-e ṣāmet-e ‘q’-ye ʿArabī dar zabān-e Fārsī
[The development of the Arabic consonant ‘q’ in Persian]. Majalle-ye Zabānšenāsī
21(1–2). 3–32.
Ṣādeqī, ʿAlī Ašraf. 1395 [2016]. Kasre-ye eẓāfe be jā-ye ‘ke’-ye mouṣūl [A short i instead
of relative ‘ke’]. Farhangnevīsī 11. 3–7.
Samvelian, Pollet. 2012. Grammaire des prédicats complexes: Les constructions nom-
verbe. Paris: Hermes Science Publications.
Shahidi, Minoo. 2008. A Sociolinguistic Study of language shift in Mazandarani. Uppsala:
Uppsala Universitet dissertation.
Spuler, Bertold. 1985. Die Mongolen in Iran. Politik, Verwaltung und Kultur der Ilchanzeit
1220–1350, 4. edn. Berlin: Akademie-Verlag.
Ṭabāṭabāyī, ʿAlāʾeddīn. 1395 [2016]. Farhang-e touṣīfī-ye dastūr-e zabān-e Fārsī [De-
scriptive dictionary of Persian grammar]. Tehrān: Farhang-e moʿāṣer.
Tedesco, Paul. 1921. Dialektologie der westiranischen Turfantexte. Monde Oriental 15.
184–258.
Windfuhr, Gernot Ludwig. 1979. Persian Grammar: History and State of its Study. Den
Haag: Mouton Publishers.
Windfuhr, Gernot & John R. Perry. 2009. Persian and Tajik. In Gernot Windfuhr (ed.), The
Iranian Languages, 416–544. London: Routledge.
4.7. Kumzari
Christina van der Wal Anonby
1. Introduction
Kumzari is a mixed language with Indo-Iranian and Arabian ancestors. The

language is named after the village of Kumzar in Oman, which faces the Gulf
from the Musandam Peninsula. Speakers of Kumzari live mainly in Kumzar and
Khasab, with a few in scattered locations in Oman and the United Arab Emirates
(see Figure 1). There is also a small population across the Gulf on Larak Island in
Iran. In total, Kumzari speakers number approximately five thousand.
Kumzari’s mixed nature places it within the Indo-Iranian, western Iranian
family on the one side, and in the south Semitic family on the other side. An early
brief description of the language was written by a British officer, Bertram Thomas,
in 1929–1930. References to the people of Musandam speaking a different lan-
guage appeared in earlier travelers’ notes in the 19th century (Palgrave 1866; Ross
1874: 195; Jayakar 1902: 246; Zwemer 1902: 57; Miles 1994: 379, 436–448).
From these accounts, and historical records of the region, as well as oral history
of the people themselves (Lancaster and Lancaster 2011), a picture emerges of the
origins of the Kumzari people.
The Musandam peninsula, in the ancient era called Maka or Mazun, was occu-
pied intermittently by Persians of the Achaemenid, Parthian, and Sassanian dynas-
ties, from the sixth century BC until the seventh century CE. During the Achae-
menid rule of Maka, Aramaic was the lingua franca, and was used along with
local languages to transmit messages from the Persian government (McWhorter
2007: 155; Potts 2012: 114). Several waves of Azd people fled Yemen after the
breach of the Ma’rib dam and subsequent flooding from 145 BC and until the
5th century CE. Some Azd eventually settled in Musandam, with its capital at
Dibba. The Azd migrants spoke a South Semitic language and settled among the
Persian colonists in Oman.
There were wars and governing power went back and forth between the Azd
and the Persians over seven centuries (Ross 1874: 116; al-Rawas 2000: 29; Ulrich
2008: 64). Dibba was a thriving centre of maritime trade in the third to seventh
centuries CE, even called the capital of Oman, and Persian bishops were desig-
nated to represent Musandam to the Church of the East (de Perceval 1853: 14;
Neale 1873: 132; Ross 1874: 75–79; Hawley 1970: 63; King 2001: 59–61). Within
the Sassanian Empire, the Persians appointed Azd chiefs as their deputies to
govern Oman. Northern Arabian tribes from Medina began raiding Oman in the
7th century. Some of the Azd chiefs joined them and war erupted. The Dibba wars
eventually resulted in the expulsion of the ethnically mixed population in 633;
626
Christina van der Wal Anonby
Figure 1: Locations of Kumzari speech communities

Kumzari 627
Persian and Azd rebels battling the North Arabians were defeated, killed, or chased
from Dibba. The refugees settled in Musandam’s mountains and remote coasts.
Over the following centuries, they gained a reputation both as aloof bedouins and
fierce guardians of the Strait between Arabia and Persia.
Linguistically, Kumzari resembles both the descendants of Middle Persian
isolated from its modern counterpart in Iran, and the indigenous languages of
southern Arabia. Core vocabulary comprises items of Semitic and Indo-Iranian
sources, with both represented as well in the lexicon as a whole. Kumzari pho-
nology maintains the complexity of Persian and Arabian inventories, including
a full set of Semitic emphatic phonemes, even in lexical items of Iranian origin.
The South Arabian retroflex approximant [ɻ], but also Iranian-specific sounds like
the voiceless bilabial stop [p], the voiceless palato-alveolar affricate [tʃ], and the
voiced uvular stop [g], are attested in the language. The verb system in Kumzari
is fundamentally mixed, with both Arabian triliteral roots and Persian-origin finite
verbs. Like South Arabian languages but unlike Iranian, Kumzari has post-constit-
uent and multiply-marked negation. It has a split system of nouns, adjectives, and
adverbs derived from Semitic verbs.
Kumzari has SOV constituent order similar to Iranian languages, but verb
phrase syntax more closely aligned with South Arabian varieties. Within the
Kumzari three-term system of evidentials, the firsthand sensory information source
marker resembles particles with similar functions in Iranian Balochi dialects and
South Arabian Mehri and Shihhi Arabic. Kumzari discourse incorporates features
found in languages on east and west sides of the Gulf. Kumzari’s mixed ancestry
is reflected in the language as it is currently spoken.
Today the Kumzari consider themselves to be Arabs of the Shihhuh con-
federacy, with the identification of their unique language. Most Kumzaris lead
a nomadic lifestyle. In the village of Kumzar, people subsist on fishing. They
live in stone houses with central courtyards, with dwellings tightly wedged in the
wadi, adjoined only by narrow paths. In summer months they travel by boat to
the Kumzari quarter of the city of Khasab, where many have date palm planta-
tions. They are there to harvest dates and trade goods and hold weddings. Kumzari
are generally endogamous, but some intermarry with the Arabic-speaking Shihuh
whose people make up the majority of the population on Musandam. The Kumzari
are distinguished in lineage as part of a clan: Aqelī, Ghōšbanī, and Bō’īnī. By reli-
gion, the Kumzari follow the Sunni branch of Islam, and each of the two main clans
has a mosque in Kumzar. Some religiously oriented oral traditions are adopted
from Arabic, while Kumzari’s own oral traditions include poetry, folktales, fishing
and other work songs, lullabies, and qawals.
Kumzari is an unwritten language, but there is improvised use of the Arabic
alphabet, such as to convey Kumzari text messages on mobile telephones. Kumzari
local authorities approved an orthography developed by linguist Erik Anonby
for use by Kumzari speakers in 2010, and there is some movement toward its
628 Christina van der Wal Anonby
implementation in primary education. All but a few families teach their children
Kumzari exclusively until school age. However, all media is in Arabic, and for
schooling past primary grades children must move to Khasab for the full year. This
presents a mixed prognosis for the future vitality of Kumzari: people are proud
of speaking their own language, but it is in danger of being engulfed by Arabic
in many domains of use. Most Kumzari speakers have some degree of fluency in
Arabic, and those who remain in Khasab throughout the year are surrounded by
speakers of the Shihhi dialect of Arabic.
Studies of Kumzari based on fieldwork are Jayakar (1902), Lorimer (1915),
Thomas (1930), Dostal (1972), Najmabadi (1988), and Skjærvø (1989). The
present author has written a Kumzari reference grammar, including discourse,
texts, and lexicon (van der Wal Anonby 2015). Along with the author’s collection,
collaboration with a Kumzari speaker who has gathered several thousand words
will form the basis of a dictionary.
2. Phonology
Kumzari has three short vowels /i/ /u/ /a/, which have a mid-centralised quality in
comparison to its five long vowels /ī/ /ū/ /ē/ /ō/ /ā/. Vowels are always separated
by a glide or glottal stop, even with clitics, and there are no vowel sequences. All
vowel-initial words begin with a glottal stop.
Table 1: Vowel phonemes in Kumzari

front central back
high ī ū
ɪ u
mid ē ō
low a
ā
Kumzari distinguishes twenty-eight consonants. Its inventory draws both from its
Persian heritage, in the phonemes /p/, /č/, and /g/, and from its Arabian heritage, as
in its emphatic consonants: pharyngeal /ḥ/, uvular /q/, and velarised alveolars /ṭ/, /ḍ/,
/ṣ/, /ẓ/, and /ḷ/. Emphatics occur even in words not of Arabian origin, e. g. ṣirx ‘red’,
pānḍa ‘fifteen’, čāẓ ‘lunch’. All emphatic consonants in Kumzari have non-em-
phatic counterparts, with the notable exception of /ẓ/, which is always velarised.
Like Shihhi Arabic (Bernabela 2011: 26) and some South Arabian lan-
guages (Simeone-Senelle 1997: 381–382), Kumzari lacks certain phonemes from
north-central peninsular Arabic varieties: the voiced pharyngeal fricative [ʕ] and
Kumzari 629
interdental fricatives [θ], [ð], and [đ]. In common with the western Iranian lan-
guages Bakhtiari and southern Luri, Kumzari has a bilabial approximant /w/, in
place of the Persian labiodental fricative /v/. Unlike Gulf Arabic and Persian but
similar to the South Arabian languages Mehri and Hobyot and Shihhi Arabic, the
Kumzari /r/ is a retroflex approximant; in some cases it can be realised as an alve-
olar or retroflex flap or trill (Bernabela 2011: 23–25; Simeone-Senelle 1997: 383).
Table 2: Consonant phonemes in Kumzari

labial alveolar velarised (alveo-) velar uvular pharyn- glottal
(-dental) alveolar palatal geal
stop/affricate pb td ṭḍ čj kg q ʾ
fricative f s ṣẓ š xğ ḥ h
nasal m n
approximant w lr ḷ y
Kumzari lacks sonority restrictions in its consonant clusters. Thus there are many
words that violate the sonority sequencing principle (i. e. nasals, liquids, and
approximants are found at the edges of a syllable, even outside a less sonorous con-
sonant): mṣaww ‘barnacle’, faql ‘porcupine fish’, rkāḥ ‘sandal’, ntōr ‘treats’. This
may be due to vowel elision in a reduced syllable, similar to the process in other
Bedouin Arabic dialects that allow initial consonant clusters (Watson 2012: 901).
All non-marginal consonants1 are found in both first and second positions in
initial and coda clusters. There are also no restrictions on word-internal consonant
sequences across syllable boundaries. Most consonants in Kumzari form gemi-
nates, and consonant clusters separate at syllable boundaries.
The minimal syllable structure in Kumzari is CV. The following syllable shapes
occur in monomorphemic words:
CV pi ‘from’
CVV čō ‘well’
CVC kaf ‘palm, sole’
CVVC xōx ‘peach’
CVCC bukr ‘firstborn’
CVVCC qāpṭ ‘white fish sp.’
CCVV xwā ‘salt’
CCVVC qbīb ‘narrow’
CCVCC mṣarr ‘men’s headdress (turban)’
CCVVCC stārg ‘star’
1
I. e. all consonants except glottals, semi-vowels, and /ḷ/.
Words of a single syllable and a short vowel must be closed; with a long vowel
they may be open or closed syllables. Kumzari has penultimate syllable stress.
Words with more than three syllables are rare.
Phonological processes that facilitate conformity to syllable limitations in
Kumzari include assimilation, insertion, deletion, and resyllabification.
Word-final vowels ū and ī assimilate to w and y with the addition of a suffix:
gēdū ‘water-pipe’, gēdw-ē ‘a water-pipe’; qrādī ‘bull shark’, qrādy-ē ‘a bull shark’.
When the the definite suffix –ō is added, i may be lowered to ē or backed to u and a
raised and u lowered to ō: battil ‘dhow’, battēl-ō ‘the dhow’; rikd ‘corner’, rukd-ō
‘the corner’; langal ‘anchor’, langōl-ō ‘the anchor’; ğambur ‘niche’, ğambōr-ō
‘the niche’.
Verb suffixes Realis –d and Perfect–s prompt the insertion of an epenthetic
vowel u for verb roots ending in a single consonant and ī for verb roots with a final
consonant cluster: gnūn ‘believe’, real gnūnud, perf gnūnus; gird ‘go around’,
real girdīd, perf girdīs. The same verb suffixes may affect the deletion of r in
verb roots ending in that phoneme: ēnar ‘hide’, real ēnid, perf ēnis, or the r may
be retained as a flap (lenition) in the place of Realis –d: ōdur ‘hold on’, real ōdur,
perf ōdus.
Other cases of deletion in Kumzari are Imperfect prefix t- before alveolar-ini-
tial verb roots and the replacement of initial w with the Imperfect prefix: jōr ‘ask’,
imperf jōr; wāt ‘want’, imperf tāt, as well as elision of initial y in an unstressed
word following a short vowel-final word: inda yē ‘inside it’ > indē.
When a morpheme is added to a word ending with a consonant cluster, the final
consonant joins the affixed syllable: bukr ‘firstborn’ CVCC, buk.r-ō ‘the firstborn’
CVC.CVV. Resyllabification also occurs with the Semitic prefix mu- on a syllable
with a long vowel or a geminated coda; the u is deleted and the m becomes part of
an initial consonant cluster in Kumzari: mḥāfiḍ ‘governor’, mrād ‘reason’.
Multiple phonological processes may occur to adhere to phonotactic constraints
when suffixes are added, particularly in maintaining penultimate syllable stress.
In the word tarqit ‘wedding poem’ with the definite suffix tēruqt-ō ‘the wedding
poem’, a is raised and fronted to ē, i is deleted, and an epenthetic u is inserted.
3. Word Classes
Word classes include noun, pronoun, verb, deverb, adjective, demonstrative, quan-
tifier, numeral, adverb, evidential, preposition, and clitics or particles functioning
in negation, subordination, interrogative, existential, and discourse roles.
Kumzari 631
4. Nouns and Pronouns
Nouns in Kumzari have no case or gender, but are inflected with definite (–ō),
indefinite (-ē), or plural (–an) suffixes, being generic by default: qiṣr ‘palace’,
qiṣr-ō ‘the palace’, qiṣr-ē ‘a palace’, qiṣr-an ‘palaces’. Counted items take no
plural suffix; instead the count suffix -ta is on the number: gōsin ‘goat’, di-ta
gōsin ‘two goats’; its equivalent for counting humans is kas. The exception is inal-
ienable nouns, which take both the count suffix -ta on the number and the plural
suffix -an: kōrk- ‘son’, af-ta kōrk-an ‘seven sons’. Certain nouns can be abstract
plurals, implying a state of affairs or a period of time, and similar semantics:
šartağ ‘storm’, šartağ-an ‘season of stormy weather’; aqil ‘mountain’, aqil-an
‘mountainous region’.
Kumzari has a derivational suffix -īn- that turns a noun into an agent of that
nominal property: ‘the one who (uses n.)’: jāmal ‘camel’, jāmalīnō ‘the cam-
el-rider’. The resulting noun is obligatorily inflected with a definite, indefinite, or
plural suffix. Another derivational suffix is –ī. It makes a noun into an adjective or
abstraction of that nominal property: indur ‘inside’, indurī ‘inner’. A derivational
suffix –ītī forms an adverb when appended to a noun, deverb, or adjective: wuxr-ītī
‘instantly’.
Pronouns follow a noun phrase to modify it, or can stand as independent pro-
nouns. There are three personal pronouns, each with a singular and a plural form,
indicated in the table below. The same forms are used as possessive pronouns
following the noun: xistar yē ‘her fiancé’. Singular personal pronouns also have
emphatic forms as shown in the following table.
Table 3: Personal pronouns

person singular pronoun plural pronoun emphatic (S) pronoun
1 mē mā ummē
2 tō šmā uttō
3 yē šan iyyē
The reflexive pronoun xō is obligatory for an argument that agrees with the subject
of the verb. Kumzari has a proximal demonstrative pronoun yā, a distal demon-
strative pronoun yē, and a relative pronoun ar, as well as seven interrogative pro-
nouns: kē ‘who’, čē ‘what’, giya ‘where’, kāy ‘when’, čāb ‘how’, kārim ‘which’,
and čanta ‘how many’. In the absence of an interrogative pronoun, the clause-final
interrogative enclitic =ā indicates a yes-no question.
Nouns can function as subjects, objects, or adverbial complements of move-
ment verbs such as ‘go’, ‘come’, and ‘arrive’:
(1) gab twāra wābur ğuzr-ō.

immediately shelter become:3s . real deep-the
‘He immediately took shelter in the deep [part of the well].’ (R1331)
5. Verbs and Deverbs
Kumzari has both verbs and deverbs, deriving from its dual heritage languages.
Verbs are of Indo-European origin. Deverbs are of Semitic origin, usually in the
form CaCaCa2. Properties of deverbs are listed in the table below. Deverbs can
form predicates with the existential enclitic or in a compound with a light verb.
Table 4: Word class properties of deverbs

Verb-like properties of deverbs Non-verb-like properties of deverbs
– originate in Semitic verbs – do not take verbal inflections
– require a subject and can take an object – are not etymologically related to Kumzari
– underived forms cannot be subjects or Indo-European verbs
objects – do not have a complete conjugation into
– can be transitive or intransitive six TAMM forms like Indo-European
– when deverbs function as preverbal ele- verbs
ments in compounds, deverbs take the – require an existential enclitic or light verb
object, thus following the same syntac- to be a predicate
tic rule as a simple verb in Kumzari. – take the comparative suffix -tar
– carry the semantic load of a verb in – some deverbs express concepts that are
compounds typically associated with adjectives not
– pronominal (existential) enclitics on verbs, e. g. lakaša ‘fat’, rakka ‘super’,
deverbs might be understood to be lawata ‘weak’, šaqšaqa ‘funny’.
equivalent to pronominal suffixes on – derive to form nouns, adjectives, adverbs
verbs
Compound verb constructions consist of a preverbal element followed by a light

verb. The semantics of the compound rests in its preverbal element, which is most
often a deverb drawn from an open class. In contrast, the light verb is one of two
verbs distinguishing voice: tka ‘do’ for active and tōʾa ‘become’ for passive, and
these are inflected for aspect, mood, mirativity, person, and number.
5.1. Derived forms

Kumzari deverbs can be derived to form nouns, adjectives, and adverbs. Derived
nouns and adjectives follow Semitic derivational morphology in which CV pattern
2
Detailed information on Kumzari deverbs can be found in van der Wal Anonby 2015.
Kumzari 633
through stem alternation determines word class. Adverbs are derived from deverbs
through the addition of a Kumzari suffix. Derived forms can be inflected accord-
ing to their new word class.
Deverbs derive to nouns with deletion of the second vowel (or raising in quad-
riliteral roots) and addition of the suffix –it to the form CaCCit.
baraẓa (dv.) ‘appeared, appearing’
barẓit (n.) ‘appearance’
Deverbs derive to adjectives in the form CaCC.
qayama (dv.) ‘stood, standing’
qaym (adj.) ‘upright’
Deverbs derive to adverbs by adding the suffix –iti (dropping the final a is mor-
phophonemic).
čaraxa (dv.) ‘straddled, straddling’
čaraxītī (adv.) [e. g. sitting] ‘astride’
5.2. Verb forms

Kumzari has six verb forms: Realis, Perfect, Imperfect, Imperative, Irrealis, and
Mirative. Aspect, modality, and mirativity are grammaticalised, while tense is
coded lexically. Singular and plural of first, second, and third persons have sepa-
rate morphemes.
Realis takes the form of a suffix –d. It is not a label of any temporal notion;
rather, it is used for epistemic certainty, both for past completed events and certain
future events.
The Perfect verb form, using the suffix –s, has a meaning of current relevance,
especially resultative, of prior or complete events.
The Imperfect is a verbal prefix t- and labels ongoing or incomplete events. It
is also used for progressive aspect, general statements, current statives, intended
but unrealised future plans, the habitual, rhetorical questions, and purposive.
Imperatives are commands or requests. They are zero-marked, taking the pro-
nominal suffixes directly on the verb stem, and the second person singular is zero-
marked. Kumzari has a complete paradigm of Imperative forms for all persons and
numbers.
The Irrealis also does not have distinct verbal inflection, but differs from other
verb forms in its second person singular suffix –ī. The Irrealis verb form is used for
the uncertain, such as future events which may not happen, and it covers semantic
hypotheticality, potentiality, jussivity, conditionality, obligativity, and desiderativ-
ity.
The Mirative is a zero-marked morpheme (bare verb stem) for all persons. It
marks unexpected happenings or surprising information, including events with

violence or magic.
Kumzari verb forms function on a scale of prominence in discourse, with
foregrounding and backgrounding roles in structuring texts (see Hopper 1979). In
narratives, the inciting incident and coda use Realis verb forms, the peak is punc-
tuated with the Mirative, and the dénouement has both Realis and Mirative verb
forms. Other discourse structures, including the introduction, nodus, and epilogue,
generally consist of verbless clauses.
5.3. Verb inflection

Inflection affixes for verb forms, number, and person are shown the table below.
Table 5: Verb affixes

person Realis Perfect Imperfect Imperative Irrealis Mirative
1sg -dum -sum t- -um -um -um
2sg -dī -sī t- -ī Ø -ī
3sg -diš/-d -sē t- -a -a -a Ø
1pl -dim -sim t- -im -im -im
2 pl -dē -sē t- -ē -ē -ē
3pl -din -sin t- -in -in -in
Verbs with simple morphology have one stem serving all verb forms. Verbs of
more complex morphological type, namely -ft and –št verbs and b- and w- verbs,
have one root to build the Realis and Perfect forms and a second simpler root as
the basis of the Imperfect, Imperative, Irrealis, and Mirative forms.
Examples of simple and complex verb paradigms are given in the tables below.
Table 6: Example paradigm for the verb fān ‘send’

1sg fāndum fānsum tfānum fānum fānum
2sg fāndī fānsī tfānī fān fānī
3sg fāndiš fānsē tfāna fāna fāna fān
1pl fāndim fānsim tfānim fānim fānim
2pl fāndē fānsē tfānē fānē fānē
3pl fāndin fānsin tfānin fānin fānin
Kumzari 635
Table 7: Example paradigm for the verb xwaft, xwā’ ‘sleep’

1sg xwaftum xwaftum txwā’um xwāʾum xwāʾum
2sg xwaftī xwaftī txwā’ī xwaw xwāʾī
3sg xwaft xwaftē txwā’a xwāʾa xwāʾa xwaw
1pl xwaftim xwaftim txwā’im xwāʾim xwāʾim
2pl xwaftē xwaftē txwā’ē xwāʾē xwāʾē
3pl xwaftin xwaftin txwā’in xwāʾin xwāʾin
In addition to these, Kumzari has a large number of irregular finite verbs with idi-
osyncratic inflection paradigms. The most common irregular verbs are ām ‘come’;
dār, dōʾ ‘give’; gid, ka ‘do’; gid, gir ‘take’; jīr, mēš ‘see’; raft, čō, rō ‘go’; wābur,
tōʾ ‘become’.
5.4. Verb Phrase

In the verb phrase, verbs and deverbs follow the same syntactic rule: object com-
plements in the form of a full noun precede the de/verb and those in the form of
a pronoun follow the de/verb. In compounds, only a direct object in the form of a
pronoun or an inalienable noun may come between the deverb and light verb.
(2) Semitic deverb with noun object (object precedes deverb)
mī’ī waẓaʾa tka ba diryīʾīn-an.
fish distributing do:3s . impf to fisher-pl
‘He distributes fish to the fishers.’
(3) Indo-European verb with noun object (object precedes verb)
mī’ī dōʾa ba diryīʾīn-an.
fish give:3s . impf to fisher-pl
‘He gives fish to the fishers.’
(4) Semitic deverb with pronoun object (object follows deverb)
waẓaʾa šan tka ba diryīʾīn-an.
distributing 3 p do:3s . impf to fisher-pl
‘He distributes them to the fishers.’
(5) Indo-European verb with pronoun object (object follows verb)
dōʾa šan ba diryīʾīn-an.
give:3s. impf 3 p to fisher-pl
‘He gives them to the fishers.’
Some Arabic varieties of Oman and the UAE have a similar distinction; that is, an
object as a noun or a pronoun determines morphological marking on their verbal
participles (Holes 1990: 48).
Each verbal complement is negated and subordinated separately in addition to
the verb. Elements in a compound verb share negation and subordination.
Kumzari has several auxiliary verbs. Generally these occur with another verb
to indicate modality. They precede the verb or compound, and the verb phrase
syntax functions the same as a verb without an auxiliary.
Table 8: Auxiliary verbs

auxiliary verb gloss
wayda ‘hold’ keep doing (one time)
mād ‘stay’ keep doing (habitual)
čō ‘go’ going to do
tāta ‘want’ want to or would do
wēl ‘let’ allow to do
rāya, čwana ‘be able’ can do
Verb goal arguments are clause-final and take no preposition. They encompass
locative, benefactive, and instrumental complements. Factive verb phrases have
the same syntax, with the entity that is “coming into existence” at the end of the
clause. Factive syntax is used in the narrative exposition formula to introduce a
character.
(6) raft mardk-ē wa ẓank-ē
go:3s.real man-a and woman-a
‘There was a man and a woman’ (U9)
(7) sīd-in šan madrēsit-ō.
put:real -3 p 3 p school–the
‘They put them in the school.’ (B175)
(8) gardīd-iš xō ṭēr–ē.
turn_into:real -3s refl bird–a
‘He turned himself into a bird.’ (B231)
Kumzari has truncated verbal marking in subordinated clauses, serial verb con-
structions, and clause chains, where the full form marked with aspect, modality,
mirativity, person, and number is in the main clause or final verb. Verbs in such
multi-verb constructions generally share a subject.
Kumzari 637
6. Existentials
The existential enclitic is uninflected for tense, aspect, mood, voice, and mirativ-
ity properties, distinguishing only the person and number. It is clause-final and
requires a complement in its function as a predicate in a clause. It is not compatible
with another verb, suggesting that it may have emerged historically by dropping
the *h-verb stem, then broadening the pronominal endings’ scope to accommodate
all non-verbal predicates, at some point before Middle Persian adopted the istad
forms. Existential enclitics are formally different from both pronominal verbal
suffixes and possessive pronouns.
Table 9: Existential enclitics

person singular plural
1 =um =im
2 =ī =ē
3 =Ø =in
Existentials cliticise to the end of any word class or phrase other than verbs; in
doing so they undergo morphophonemic alternation, e. g. with an epenthetic glottal
stop following a vowel.
Existentials can take existential, ascriptive, and identificational predicates
(Pustet 2003). The three semantic types of predicates with an existential are given
below.
(9) [existential predicate]
ka pi yē si-ta=in ā, ka pi yē bātar!
if from 3s three-count =ex :3 p sub if from 3s better
‘If there were three of them, it would have been even better!’ (K597)
(10) [ascriptive predicate]
ammū šan ẓank-an=in yaʾnī.
all 3p woman-pl =ex :3 p that_is_to_say
‘That is to say, all of them were women.’ (S419)
(11) [identificational predicate]
tō aḥmad=ī? mē aḥmad tka=um.
2 s ahmad=ex :2 s 1s ahmad “does-it”=ex :1s
‘You are Ahmad?’ ‘I am Ahmad-Does-It.’ (A110)
7. Negation
The particle of negation in Kumzari is na. The negative follows the constituent
being negated. Post-constituent negation is not found in Iranian languages or
Arabic. However, it is a “remarkable” distinguishing feature of the South Arabian
languages of Oman (Waltisberg 2011: 319), and is possibly an innovation in these
languages (Watson and Eades 2012: 3).
The negative is marked on the verb and on each post-verbal complement.
(12) mēy-ō fōšnīs-um ba šmā, jōʾar-ō fōšnīs-um na ba
fish–the sell:perf -1s to 2 p pearl–the sell:perf -1s neg to
šmā na.
2p neg
‘I sold the fish to you; I did not sell the pearl to you.’ (K117)
Direct objects and deverbs in a compound are not negated separately.
(13) skafya k–ē na pi mē na.
concealing do:imper -2 p neg from 1s neg
‘Don’t conceal from me.’ (S339)
An object in the form of a full noun precedes the verb, and thus does not take the
negative. When the object is a pronoun, it follows the verb, and the negative parti-
cle then follows the object pronoun. The contrasting syntax between full noun and
pronoun objects is shown in the following pair of examples:
(14) dar-ō twākš-um na.
door–the open:impf -1s neg
‘I will not open the door.’ (S771)
(15) twākš-um yē na.
open:impf -1s 3s neg
‘I will not open it.’ (S775)
The South Arabian language Mehri also varies negation syntax depending on
whether a noun or pronoun is used (Rubin 2010: 265).
Prohibitive and negation of existential enclitics follow the same syntax: na
follows the verb and each complement thereafter. In rejection, however, the nega-
tive particle precedes its referent.
(16) na baẓẓā wa na bīdar!
neg beggar and neg peasant
‘Neither beggar nor peasant!’ (B851)
Kumzari 639
8. Evidentials
Evidentials mark the source of information for a statement with grammatical,

rather than lexical, meaning. They occur in clause-initial position, and are fol-
lowed by the subordinating enclitic =ā, and then their complement in the form of
a clause or noun phrase.
The three-term system of evidentials in Kumzari classifies the information
source as firsthand sensory tamna, non-firsthand reportive awa, or non-firsthand
inferred ēka. While non-firsthand sources are obligatorily marked, unmarked
information is assumed to be firsthand. The extended function of tamna empha-
sises immediate observation or discourse peak.
The firsthand evidential emphasises information as being sensorily acquired,
whether through visual, auditory, or tactile means, or even by emotions or premo-
nition. In narrative texts, it can have presentative semantics or signal the discourse
peak, and is often used to denote violent or magical happenings.
(17) tamna ā asp-ē insī inda yē wa ḥawṭ-ē ṣirx.
sens subhorse-a humanlike in 3s and pool-a gold
‘He saw that a talking horse was in it and a pool of gold.’ (B351)
Reported information sources use the non-firsthand evidential awa, for both quo-
tative and indirect speech.
(18) awa ā ẓīn-ō kišt-in.
rep sub thief-the kill:real -3 p
‘It is said they killed the thief!’ (R550)
Inferred sources of information are labelled with the non-firsthand evidential ēka.
They include information obtained from deduction or general knowledge, and can
convey pragmatic extensions of irony, deception, or disbelief.
(19) ar ğēla-an mā gis-ē ā, ēka ā yā
that/who wheat-pl 1 p take:perf -3 s sub inf sub this
ğrāb-ō.
crow-the
‘The one who has taken our wheat, it must have been this crow.’ (G213)
Grammaticalised evidentials have not been reported for other languages of West
Asia (as an extension of other categories see Johanson and Utas 2000), but are
well-known in the Indo-Aryan family (Bashir 2006, 2010). The occurrence of
potentially grammaticalised evidential particles has been suggested for languages
of both Western Iranian and South Arabian families (van der Wal Anonby 2015).
Evidentiality often diffuses in cases of intensive language contact (Aikhen-
vald 2004: 288).
9. Discourse particles
Discourse particles are clause-initial constituents that function grammatically to

establish coherence in the structure of a text. The nine most commonly occurring
plot structuring particles in Kumzari discourse are laid out in the following table.
They are set on a scale of prominence that indicates their role in foregrounding or
backgrounding information in a text.
Table 10: Discourse particles

more foregrounded
(=high prominence)
ka, amū, sā sā
byō
tamna
sā
čāb kin?
lumrād, filḥāl
more backgrounded
(=low prominence)
At the discourse peak of a narrative, clusters of foregrounding particles occur.

(20) bārē gid-iš, dikkara sātē. sā sā sikkara ba yē,
once do:real -3s twice now right.away thrice to 3s
wa
and
‘He did it once, twice now. Right away, as he was going around the third
time, and
(21) bast yaʾnī, wa ka dakka pā kin bā
finishing that_is_to_say and quickly digging foot do:mir on
yē
3s
he is just finishing, I mean, and quickly he [the boy] stands firmly
(22) ẓamy-ō. byō rāstağ yē ka byō dist
ground-the come:mir straight 3s suddenly come:mir hand
xō
refl
on the ground. Immediately he [the sorcerer] comes in line with him,
suddenly he [the boy] comes and puts his hands
Kumzari 641
(23) sō ẓekon yē. yē wa ka kēsafī kin

put:mir backside 3s 3s and suddenly plunging do:mir
inda yē,
inside 3s
on his [the sorcerer’s] backside. Suddenly he comes to him, and suddenly
he plunges him into it,
(24) inda qiẓ’an–ō.
inside cauldron-the
into the cauldron.’ (B560)
10. Prepositions
Kumzari prepositions precede their complements. Prepositions are separate words

in Kumzari, except wā which can cliticise to its complement.
Table 11: Prepositions

preposition gloss
ba to, for, of, on, with
bağa without
bar belonging to
ğay except
ḥata until
inda in, inside
mayya between, among
naxa aboard (a vehicle)
pana beside, near
pi from, than, since
sīna toward
tē before
wā with, at, -ward
10.1. Proclitic directional and possessive preposition wā

The preposition wā is a directional, with a locational noun as its complement. In a
directional function, wā cliticises to its complement.
(25) č-in wā=ēbar wa tāʾ–in
go:impf -3 p at/-ward=over_here and come:impf -3 p
wā=ēbar.
at/-ward=over_here
‘They would go here and there.’ (R309)
Kumzari also uses the preposition wā with a complement as a possessive construc-

tion, in place of a ‘to have’ verb. The complement is the possessor in the form of
a noun or pronoun, and follows the preposition. A similar possessive construction
occurs in the South Arabian languages, where it is đ-/d- with a pronoun comple-
ment (Simeone-Senelle 1997: 386). In the Kumzari possessive construction, wā
does not lose stress to its complement as it does when it functions as a directional.
The possessive wā does not have verbal inflection for categories such as aspect,
mood, or mirativity. The possessive construction falls at the end of a clause, unless
it is a factive (Payne 1997: 59).
(26) sā šēx–ō ā, jāmal wā yē.
now sheikh–the top camel with 3s
‘Now, the sheikh, he had a camel.’ (A397)
Like factive verb syntax, when it denotes the coming into existence of an entity,
the wā construction occurs clause-initially.
(27) raft šēx wālēyit–ō, wā yē si-ta ditk-an.
go:3s.impf sheikh country–the with 3s three-count daughter-pl
‘There was a sheikh of the country; he had three daughters.’ (S12)
11. Adjectives
Adjectives in Kumzari can function as attributes or predicates. However, attrib-

utes are more often expressed by nouns, deverbs, abstract plurals, and verbs in
Kumzari.
In a noun phrase, adjectives occur after the noun they modify and before its
possessive noun or pronoun. In this role, they agree in definiteness and number
with the noun they modify.
(28) qēṣarit-ō jwān-ō ğaraqa gid-iš.
date-the good-the swallowing do:real -3s
‘He swallowed the good date.’ (N98)
As predicates, adjectives occur with a following existential enclitic or verb tōʾa
‘become.’
(29) čēnağ=in.
thirsty=ex :3 p
‘They are thirsty.’ (R1220)
Adjectives can take nominal suffixes such as definite -ō, indefinite -ē, and plural
–an; so inflected they can function syntactically as a noun in a clause argument:
Kumzari 643
(30) gap-an raft-in xāna ba rōr-an āmō xō.

big-pl go:real -3 p marriage with child-pl uncle(paternal) refl
‘The older ones [literally: ‘bigs’] married their cousins.’ (S27)
Adjectives may be derived from other constituents, such as nouns, with the suffix
–ī.
awwēlī ‘first’ awwal ‘first, firstly’
wardī ‘pink’ ward ‘flower’
ẓēranī ‘lower’ ẓēran ‘bottom’
Other adjectives are derived from deverbs with the Semitic triliteral root following
the pattern CaCC:
ǧayb ‘absent’ ğayaba ‘finishing with’
ḥaml ‘cautious’ ḥamala ‘heeding’
ṭafṣ ‘rotten’ ṭafaṣa ‘rotting’
The triliteral stem of the adjective is CaCaCa when it takes the comparative suffix
–tar (lengthening of the final vowel in the stem is morphophonemic).
balağa dv. ‘ripening’
balğ adj. ‘ripe’
balağātar adj. ‘riper’
12. Demonstratives
Kumzari has two demonstratives: proximal yā and distal yē. Unlike possessive
pronouns, they precede the noun they modify, and must accompany it; they cannot
stand alone. The noun they modify takes the definite suffix –ō.
(31) yē čāb kin, yē ṣāḥar-ō ā? gardīdiš
3s how do:mir dem sorcerer-the interr turn_into:3s.real
xō ṭēr–ē.
refl bird-a
‘How did he do it, that sorcerer? He turned himself into a bird.’ (B228)
13. Quantifiers
Quantifiers precede the noun they modify, expressing an amount.

(32) nikta arma dīʾ-in ba mā.
a.bit date give:impf -3 p to 1 p
‘They would give a bit of dates to us.’ (K48)
Common quantifiers are listed in the table below.
Table 12: Quantifiers

ay ‘any’
iš ‘none, any’
dasta ‘a number’
nikta ‘a bit’
kam ‘a little’
ḥabba ‘a few’
qadar ‘some’
ammū ‘all’
14. Adverbs
Kumzari has two types of adverbs with differing morphosyntax: clause-initial

adverbs modify the entire clause, and post-verbal adverbs relate only to the verb.
(33) [clause-initial adverb]
quṭ gōsin mē jīs-ī ā?
ever goat 1s see:perf -2 s interr
‘Have you ever seen my goat?’ (N28)
(34) [post-verbal adverb]
ṭiyar tīʾ-im šarg.
finished become:impf -1 p quickly
‘We’re finishing quickly.’ (U559)
Post-verbal adverbs can only be negated when the verb they modify is negated,
but clause-initial adverbs can be separately negated. Like other adverbs, derived
adverbs with the suffix –ītī follow the verb they modify.
Kumzari 645
15. Clauses
A minimal clause in Kumzari is a subject and a predicate. However, the subject

may be implicit, and the predicate may be a verb phrase, an existential enclitic,
an evidential, a possessive construction, or an abstract plural noun. Subject-Ob-
ject-Verb is the unmarked constituent order, but topicalisation or factivity may
reposition an argument.
(35) āmō mē ṭālbū tō gis-ē ba ēnar-ō.
father-in-law 1s request 2 s do:perf -3s for henna-the
‘My father-in-law has requested you for [doing] the henna.’ (S436)
In clause-initial position there may be discourse particles, evidentials, or adverbs
pertaining to the entire clause. In clause-final position there may be prepositional
phrases and verb goal arguments.
(36) sā sōr wa pīma tk-um ba šmā čāẓ.
now salt_fish and green_onion do:impf -1s for 2 p lunch
‘Now I will make salt fish and green onions for you for lunch.’ (G865)
Nominal, adjectival, and locational predicates require an existential enclitic.
(37) šēx=in na ā, ādī=ʾin.
sheikh=ex :3 p neg sub normal=ex :3 p
‘They were not royalty; they were common.’ (U12)
(38) ḥalla ba ṭaʾr=in.
landing on mountain-ledges=ex :3 p
‘They are landing on mountain-ledges.’ (B599)
Existential enclitics attach to predicates consisting of perfect participles, or deverbs
not in a compound.
(39) āwwa bār maxlōq-an ğafala=in.
first time people–pl unsuspecting=ex :3 p
‘The first time the people were unsuspecting.’ (A166)
(40) bukrit-an ẓās-in=in.
kid-pl give_birth:perf -3 p =ex :3 p
‘The kids are born.’ (N24)
15.1. Complex clauses
15.1.1. Coordination
Coordinated clauses in Kumzari are linked by conjunction, disjunction, adversa-
tive coordination, or causal coordination. Each has its own set of linking forms.
Conjunction uses wa ‘and’ or ka ‘also’ between clauses, with their subjects
having coreferential or disjoint reference.
(41) ḥubbō-ō xaṭṭ-ē kataba gid-iš.
grandmother-the message-a writing do:real -3s
‘The grandmother wrote a message.’
(42) wāqā gid-in wa fānd-in yē.
signature do:real -3 p and send:real -3 p 3s
They signed and they sent it.’ (P348)
The consecutive conjunction ka preceding each phrase marks listing parallelism
with multiple coordinands (Haspelmath 2007: 15).
(43) šaw ā, ka maylat gid-in, wa ka ammū
night sub list wedding.poetry do:real -3 p and list all
‘At night, also they did the Maylad [poetry], and also
čī gid-in wa ka srō kēšid-in
thing do:real -3 p and list sung_poetry pull:real -3 p
they did everything, and also they sang the Sro [poetry],
wa ṭiya būr-in.
and finished become:real -3 p
and they finished.’ (S632)
Causal coordination uses ka alone.
(44) xalaqa ba yē ka tāt-um dig-um yē
(good)_looks to 3s so want:impf -1s take:impf -1s 3s
šū-ī.
husband-advr
‘He is handsome, so I want to take him as a husband.’ (B773)
Disjunction uses waḷa ‘or’ between the two elements, or the emphatic form wana
‘either, or’ preceding each clause for multiple coordinands (Haspelmath 2007: 15).
(45) kam ğāẓ dō-um ba yē waḷa ṣirx tāt-a
how_much money give:impf -1s to 3s or gold want:irr -3s
ā,
sub
‘I will give her however much money, or if she wants gold,
Kumzari 647
(46) ṣirx dō-um ba yē.

gold give:impf -1s to 3s
I will give her gold.’ (P468)
Two other counterfactual conditionals used in Kumzari are kana and midam, both
meaning ‘otherwise’.
Kumzari has the adversative conjunction lakin ‘but’, and the more rarely heard
balkē ‘however’.
(47) ka byō ba yē lakin adliy-ē ba yē.
right_away come:mir to 3s but gown-a to 3s
‘Right away he came to her but she was wearing a gown!’ (S426)
Clauses can also be linked asyndetically, with only juxtaposition and rising-falling
intonation to indicate coordination.
15.2. Subordination
Subordination in Kumzari uses four types of multiple verb constructions: rela-
tive clauses, clause chains, adverbial clauses, and complement clauses. All signal
clause relationships through the subordinating enclitic =ā, which occurs clause-fi-
nally and has allomorphs =ō, =wā, and =yā.
(48) bard–in yā jitt-ō ō, dakka yē gid-in
carry:real -3 p this corps -the sub burying 3s do:real -3 p
inda maqbart-ō.
in grave-the
‘Carrying this corpse, they buried it in the grave.’ (A290)
In a subordinated clause, the subordinator is obligatorily marked on every comple-
ment of the verb. The enclitic ā follows both the verb and the verbal complement.
(49) wa āmad-in ā ba xwā ā, bang-an.
if/when come:real -3 p sub for salt sub dusk-pl
‘When they came for the salt, it was nightfall.’ (B649)
Adverbial clauses and complement clauses take the subordinating enclitic ā clause-
finally.
(50) tē ba rēs-a ba y’=ā, grab-ō pōrid.
before to arrive:irr -3s to 3s =sub crow-the fly:3s.real
‘Before he reached it, the crow flew away.’ (G221)
(51) mār, aqrab inda yē ā, dām na.
snake scorpion in 3s sub know:1 s . impf neg
‘I don’t know [whether] there was snake or scorpion in it.’ (P110)
The subordinator follows all evidentials: sensory, reported, and inferred.

(52) sā wa barẓa wāb ba bāğ almowẓ
now if/when appearing become:mir to garden ar .:(the-banana)
ā,
sub
‘Now when he showed up at the banana garden,
(53) tamna ā yē ḥaps=in.
sens sub 3s bound=ex :3 p
he saw that his brothers were bound.’ (B1138)
The subordinating enclitic ā also joins semi-subordinate constructions such as
medial clauses in clause chains and repeated clauses in anadiplosis, and it can
follow a noun phrase as a topicaliser. Consecutive linked clauses in logical or tem-
poral sequence use the rare subordinator bēw in place of ā.
Other clause subordination is achieved through the irrealis verb form, perfect
participles, and deverbs, taking advantage of their atypical properties to form rel-
ative, complement, or adverbial clauses.
Subordinated clauses may shorten the verb stem by reducing inflection for
aspect, mood, and mirativity. This strategy is used for adverbial and complement
clauses, and for serial verb constructions such as chains, where only the final verb
contains aspect, mood, and mirativity information.
15.2.1. Relative Clauses

For forming relative clauses, Kumzari has a clause-initial relative pronoun ar
(post-nominal allomorph /a/) and a clause-final relative particle na (the latter can
be fused to the subordinating enclitic as in the example below).
(54) ar jamma tk-a ẓīn-ō, jamma tk-a,
that/which/who bending do:impf -3s thief–the bending do:impf -3s
‘The one who bends down is the thief, [the one who] bends down,
sayya ba xō n=ā, ẓīn-ō.
picking_up for refl of_which=sub thief–the
who picks up, is the thief.’ (A641)
Relative clauses in Kumzari may also use a gap strategy, with a nominalised verb
in the form of a deverb or perfect participle.
(55) xaṭṭ-ē āma ba mē, law yē ḥaraqa.
message-a come:3s to 1s edge 3s sealed_with_wax
‘A message came to me, [of which] its edge was sealed with wax.’
(P1099)
Kumzari 649
Prepositional phrases with pi ‘from’ are commonly employed as a relativisation

strategy, similarly to the endemic use of a relativiser bū (in place of illi) in Arabic
dialects of northern Oman (see Eades 2009: 91).
(56) fān yē ba rōk-ō, pi raft-ē ḥijj.
send:2 s . imper 3s to boy–the from go:perf -3s hajj_pilgrimage
‘Send it to the boy, who has gone on the Hajj pilgrimage.’ (P530)
15.2.2. Clause Chains

Kumzari has clause chains that link several clauses of co-subordinated, truncated
medial verbs each followed by the subordinating enclitic =ā; an independent final
clause with a fully inflected verb takes the aspect-mood-mirativity marking for the
entire chain. The clauses in a chain share the same subject and are linked semanti-
cally, such as in an action sequence. Serial verb constructions are known to occur
in many languages of western Iran (Stilo 2004: 294, 296).
Anadiplosis in Kumzari repeats a preceding clause in a subordinated clause
with the enclitic =ā, as presupposed information in a discourse.
(57) ka jaḥḥa kin ba rōk–ō awēlī, rōk-ō jwān-ō.
peak swooping do:mir for boy-the first boy-the fine-the
‘Immediately he swooped for the first boy, the fine one.
(58) jaḥḥa kin ba yē ā wa sayy yē pi
swooping do:mir for 3s sub and lift_up:mir 3s from
mayya rōk-an.
midst boy-pl
He swooped on him and lifted him up from among the boys!’ (B238)
Kumzari narratives have embedded poems with frame and list structure (van der
Wal Anonby 2009). The poems often begin with anadiplosis in the frame, with the
list being verbless or using Irrealis or truncated verbs with the enclitic =ā, similar
to clause chains.
15.2.3. Adverbial Clauses

Adverbials in Kumzari encompass time, location, purpose, reason, circumstan-
tial, simultaneous, additive, and conditional clauses. Conditional clauses use the
clause-initial subordinating conjunctions wa (reality), ka (unreality), wana (pre-
dictive), or kana (counterfactual), with varied meaning depending on the verb
forms in the joint clauses and whether the subordinate clause contains the enclitic
=ā.
(59) ēka ā y’=ā kas tāt –a yē na, kaft -ē

inf sub dem =sub pers want:impf -3s 3s neg fall:perf -3s
ba čāf -ō bē.
on beach-the only
‘Obviously no one wanted this, [since] it was just left on the beach.’
(S244)
(60) ḥata tay wā=bāla ā, gōsin-an txōr-in.
until come:3 s . impf -ward=up sub goat-pl eat:impf -3 p
‘The goats were eating until he came up.’ (G142)
(61) āxur bang šmā bẓēn-um ā, arẓamē u yēkē
after calling 2 p hit:impf -1s sub one_at_a_time
byāt.
come:3 s . irr
‘After I call out to you, [you] should come one at a time.’ (R455)
15.2.4. Complement Clauses

Kumzari allows for subject, object, and oblique complements. Complement
clauses may take truncated verbs with reduced aspect-mood-mirativity marking,
or a diminished form such as irrealis, deverb, or perfect participle. Each comple-
ment of the verb is marked with the clause-final subordinating enclitic =ā, and
negation on the matrix verb and the entire complement clause. A clause-initial
complementiser used specifically for oaths is inna.
(62) ṣwāl mām xō gid-iš ka jilbē yē
question mother refl do:real -3s if/when scarf 3s
čišt-ē.
wash:perf -3s
‘She asked her mother whether she had washed her scarf.’ (N20)
(63) ātiš-ē jīr-iš, ātiš-ē āntē čō-ō ā, ātiš-ē labaqa.
fire-a see:real -3s fire-a there well-the sub fire-a burning
‘He saw a fire, a fire that was there by the well, a burning fire.’ (R360)
(64) dit mē ṭal bur tō gis-ī
daughter 1s decided become:3s.real 2 s take:perf -2 s
xustār-ī.
fiancé-advr
‘My daughter has decided that you be taken as a fiancé.’ (B931)
Kumzari 651
16. Discourse
Kumzari discourse uses particles, formulae, verb forms, and other means to struc-
ture a text and achieve coherence.
16.1. Formulae
There are two kinds of formula in Kumzari: narrative and thematic. Narrative for-
mulae have conventional wording and occur obligatorily once in every narrative
text. Thematic formulae recur throughout the text but are idiosyncratic for each
narrative.
There are only four narrative formulae: two in the exposition (qiṣṣitē wa ḥakāy-
itē ‘a story and a telling’, raft yēkē ‘there went someone’), and two in the conclu-
sion (tō raftī wa mi āmadum ‘you went and I came’, xalaṣ ‘the end’). Thematic
formulae are closely linked to content in a narrative (Kossmann 2000: 75), and
may resemble proverbs. A single thematic formula specific to a particular narrative
is repeated at several points in the text.
16.2. Speech Type

Each section of the plot in narrative features quoted speech, drama speech (without
quote margin e. g. ‘she said’), or no speech. Like verb forms, speech types function
on a scale of prominence aligned with their grounding role in discourse. The expo-
sition and conclusion of a tale do not have speech. In the body of the tale, informa-
tion is foregrounded with drama speech, or backgrounded through quoted speech.
16.3. Plot structure

The narrative plot in Kumzari is divided into eleven narremes, each within one
of three sections: the exposition, body, and conclusion. A summary of Kumzari
narremes and their content is in the table below. Each narreme in plot structure
is characterised by the use of certain discourse particles, verb forms, formulae,
speech types, and other features.
Table 13: Summary of plot structure narremes

exposition:
aperture “a tale. there went someone.”
introduction characters and setting
nodus presentation of a problem
body:
inciting incident the first happening that turns the course of
the story
intentus development of tension, several pivots
accalmie calm before the storm
peak highest point of conflict
dénouement unravelling of tension
conclusion:
coda resolution of central conflict, homecoming
finis “you came, I went. the end.”
epilogue “was it good? that’s how it happened.”
16.3.1. Exposition
The aperture contains one or both of the narrative formulae opening the tale. The
introduction gives background information in the form of characters and setting. It
is generally verbless, with much lexical repetition and paraphrase.
The nodus extends the setting before any pivot in the tale, by presenting the the-
matic problem, complication, or conflict; it usually focuses on wealth or lineage.
The nodus is generally speechless and verbless and contains much repetition and
parallelism.
16.3.2. Body
The inciting incident is the initial event that directs the course of the narrative. It
begins immediately with Realis verb forms and the text’s first instance of direct
speech and culminates in the main character leaving home (see “absentation”
Propp 1968: 26). The intentus is a series of ascending pivots, with straightfor-
ward description of action and little repetition. Grounding develops tension in this
narreme.
Pivots are initiated with the discourse particles tamna and bīyō and conclude
with lumrād. The accalmie provides a lull in the action, containing backgrounded
information that draws out tension to highlight the peak. There is much repetition,
lists, embedded poems, and the low-prominence discourse particles sā and ‘čāb
kin?.
The peak represents the high point of the plot, with the opening of conflict pre-
sented in the nodus. Speech is replaced by action and there is the “crowded stage”
Kumzari 653
effect with all characters present. The evidential tamna and Mirative verb forms
are used along with foregrounding discourse particles ka, amū, and sā sā. After
the peak, the dénouement cools the conflict. It still has Mirative and Realis verb
forms, but features quoted speech prominently and uses backgrounding discourse
particles lumrād and filhal to summarise information.
16.3.3. Conclusion
The coda resolves the nodus, with the central character returning home in success.
It uses Realis verbs, no speech, and often includes an explanation directed to the
audience. The coda is followed immediately by the finis formulae, and then an
interactive epilogue that summarises the narrative theme (cf. Utas 2006: 227).
17. A text from the Kumzari folktale Sōntyō
(65) rōẓ-an dgur nwāšam y’=ā, ra ba yē dgō

day-pl next evening 3s =sub go:3s.real to 3s say:3 s . impf
ba yē
to 3s
‘In the evening of the next day, he went to her, he said to her,
(66) maš, hã! wana dar-ō wākiš, wana
see:2 s . imper well either/or door-the open:2 s . imper or_else
nwāz
tomorrow
“See here. Either you open the door, or else tomorrow
(67) nijjar-an tār-um, lōḥ-ō šaraxa tk-in.
boatbuilder-pl bring:impf -1s wood-the chopping do:impf -3 p
taftafa yē tk-im
smashing 3s do:impf -1 p
I will bring the boat builders, [and] they will chop the wood. We will smash
it
(68) ba ēmağ. šan wā maqṭa-an šan wa mīšar-an šan,
for firewood 3 p with chisel-pl 3 p and saw-pl 3p
wa ādamī
and person
[the boat] into firewood! They, they have their chisels and their saws. And
people
(69) txēn-in bā tō. dar-ō wākid-iš ba yē.

laugh:impf -3 p against 2 s door-the open:real -3s for 3s
will laugh at you.” She opened the door to him.
(70) dgō ba yē maš, hã! mē ẓan tō=um.
say:3 s . impf to 3s see:2 s . imper well 1s wife 2 s =ex :1s
gid-ī mē,
take:real -2 s 1s
She said to him, “See here. As for me, I am your wife. You married me,
(71) wa lakin tāt-ī tēbur-ī mē wā= xā- šmā
and except want:impf -2 s carry:impf -2 s 1s -ward house 2p
and unless you want to take me to your house
(72) ahla tō mēš-in mē ā, wa raft-ī ba mē
relatives 2 s see:irr -3 p 1s sub and go:perf -2 s with 1s
xāna
marriage
so that your relatives can look at me… since you have been married to me
(73) rōẓ-ē, di-rōẓ, bar mē xā šmā. ādī
day-a two-day carry:2 s . imper 1s house 2 p normal
č-um
go:impf -1s
a day or two, you must take me to your house. It’s tradition that I go
(74) wā tō, bāla ya’nī. tō tāt-ī xwā’-ī
with 2 s up that_is_to_say 2 s want:impf -2 s sleep:irr -2 s
pana mē
beside 1s
up with you, that is to say. You want to sleep beside me
(75) naxa sōnty-ō, wa sā tu kardīd-ī mē inda
aboard raft–the and reproach drop:real -2 s 1s inside
muṣībit-ē!
calamity-a
on the raft, and now you! You have thrown me into calamity!
(76) mē nwāẓ č-um bāla qāniṣ wa gō ahla
1s tomorrow go:impf -1s up hunting and say:irr relatives
tō
2s
[saying:] ‘Tomorrow I am going up hunting…’ and telling your relatives
Kumzari 655
(77) āmad-in ba mē: wōwōwō pis šēx-ō adafa

come:real -3 p to 1s woe! son sheikh-the injuring
būs-ē,
become:perf -3s
to come to me: [saying] ‘Woe! The sheikh’s son has been hurt!’
(78) mē fajja būr-um, sā ba rū mē jīr-in!
1s shocked become:real -1s now of face 1s see:real -3 p
I was shocked! Now they have seen my face!
(79) sīna mē jīr-in! mū-an mē jīr-in! ātiš-ō labaqa
chest 1s see:real -3 p hair–pl 1s see:real -3 p fire-the igniting
gid-ī
do:real -2 s
They have seen my breasts! They have seen my hair! You have shamed me
[literally: set my body on fire]!
(80) lāḥm mē! damb gid-ī bā mē! čābē inča
body 1s sin do:real -2 s against 1s how? like_this
gid-ī
do:real -2 s
You have sinned against me! How could you have done this
(81) ba mē ā? mē škašt-um xō pi yā
to 1s interr 1s break:real -1s refl from dem
kār-ō.
work-the
to me?!! I want nothing more to do with this matter [literally: I have broken
myself over this matter].”
(82) dgō ba yē jwān. sā bā yē na, samaḥa mē
say:3 s . impf to 3s good now all_right forgiving 1s
kin.
do:2s .imper
He said to her, “OK. Now, all right, forgive me.
(83) br-im bāla xān mā. dgō ba yē br-im.
come:imper -1 p up house 1 p say:3sIMPF to 3s go:imper -1 p
Let’s go up to our house.” She said to him, “Let’s go.”’ (S787)
Abbreviations
advr adverbializer n noun

ar Arabic modern loanword gloss neg negative
c consonant p plural
count count marker (inanimate) perf perfect verb form
dem demonstrative pl plural noun suffix
dv deverb real realis verb form
evid evidential refl reflexive pronoun
ex existential rep reportive evidential
imper imperative verb form s singular
impf imperfect verb form sens sensory evidential
inf inferred evidential sub subordinating enclitic
interr question morpheme top topicaliser
irr irrealis verb form v short vowel
mir mirative verb form vv long vowel
References
Aikhenvald, Alexandra Y. 2004. Evidentiality. Oxford: Oxford University Press.

Anonby, Christina van der Wal. 2009. Words within words: The embedded poems of
Kumzari fairy tales. Paper presented at the Conference Documenting oral traditions in
the non-Western world, Leiden University, 27–29 August, 2009.
Anonby, Christina van der Wal. 2014. Traces of Arabian in Kumzari. In Orhan Elmaz & Ja-
net C.E. Watson (eds.), Languages of Southern Arabia: Special session of the Seminar
for Arabian Studies, 137–146. Oxford: Archaeopress.
Anonby, Christina van der Wal. 2015. A Grammar of Kumzari: A mixed Perso-Arabian
language of Oman. Leiden: Leiden University dissertation.
Anonby, Erik J. 2010. Kumẓarītī [Kumzari alphabet chart]. Unpublished manuscript.
Anonby, Erik J. 2011. Illustrations of the IPA: Kumzari. Journal of the International Pho-
netic Association 41(3). 375–380.
Bashir, Elena. 2006. Evidentiality in South Asian languages. In Miriam Butt & Tracy Hol-
loway King (eds.), Proceedings of the Lexical-Functional Grammar 2006 Conference,
30–50. Palo Alto, U.S. A.: CSLI Publications.
Bashir, Elena. 2010. Traces of mirativity in Shina. Himalayan Linguistics 9(2). 1–55.
Bernabela, Roy S. 2011. A phonology and morphology sketch of the Šiħħi Arabic dialect of
əlĞēdih (Oman). Leiden: Leiden University, unpublished MA thesis.
Caussin de Perceval, Armand P. 1853. Essai sur l’histoire des Arabes avant l’islamisme,
pendant l’époque de Mahomet, et jusqu’à la reduction de toutes les tribus sous la loi
musulmane. The Calcutta Review 21. 1–76. Calcutta: Sanders, Cones & co.
Dixon, R. M. W. & Alexandra Y. Aikhenvald. 2002. Word: A typological framework. In
R. M. W. Dixon & Alexandra Y. Aikhenvald (eds.), Word: A cross-linguistic typology,
1–41. Cambridge: Cambridge University Press.
Dostal, Walter. 1972. The Shihuh of northern Oman: A contribution to cultural ecology. The
Geographical Journal 138. 1–7.
Kumzari 657
Eades, Domenyk. 2009. The Arabic dialect of a Šawāwī community of northern Oman. In
Enam Al-Wer & Rudolf de Jong (eds.), Arabic Dialectology: In honour of Clive Holes
on the occasion of his sixtieth birthday, 77–98. Leiden: Brill.
Haspelmath, Martin. 2007. Coordination. In Timothy Shopen (ed.), Complex Constructions
(Language Typology and Syntactic Description, vol. 2), 1–51. New York: Cambridge
University.
Hawley, Donald. 1970. The Trucial States. London: Allen & Unwin.
Holes, Clive D. 1990. Gulf Arabic (Croom Helm Descriptive Grammars). London: Rout-
ledge.
Hopper, Paul J. 1979. Aspect and foregrounding in discourse. In Talmy Givón (ed.), Dis-
course and syntax (Syntax and Semantics, vol. 12), 213–241. London: Academic Press.
Jayakar, A. S. G. 1902. The Shahee dialect of Arabic. Journal of the Bombay Branch of the
Royal Asiatic Society 4. 246–277.
Johanson, Lars & Bo Utas (eds.). 2000. Evidentials: Turkic, Iranian and neighbouring
languages (Empirical Approaches to Language Typology 24). Berlin & Boston: De
Gruyter Mouton.
King, Geoffrey R. 2001. The coming of Islam and the Islamic period in the UAE. In Ibra-
him al Abed & Peter Hellyer (eds.), United Arab Emirates: A new perspective, 70–97.
London: Trident Press.
Kossmann, Maarten. 2000. A Study of eastern Moroccan fairy tales (Folklore Fellows’
Communications 274). Helsinki: Suomalainen Tiedeakatemia Academia Scientiarum
Fennica.
Lancaster, William O. & Fidelity C. Lancaster. 2011. Honour is in contentment: Life before
oil in Ras al-Khaimah (UAE) and some neighbouring regions. Berlin & Boston: De
Gruyter.
Lorimer, John G. 1915. Gazetteer of the Persian Gulf, Oman, and Central Arabia. Calcutta:
Superintendent Government Printing.
McWhorter, John H. 2007. Language interrupted: Signs of non-native acquisition in stand-
ard language grammars. Oxford: Oxford University Press.
Miles, Samuel B. 1994. The countries and tribes of the Persian Gulf. Reading, UK: Garnet
Publishing.
Nadjmabadi, Shahnaz. 1988. Identité ethnique contre nationalité: Le cas de l’île Larak
(Golfe Persique). In Jean-Pierre Digard (ed.), Le fait ethnique en Iran et en Afghanistan,
65–74. Paris: Editions du CNRS.
Neale, John Mason. 1873. A History of the Holy Eastern Church. London: Rivingtons.
Palgrave, William Gifford. 1866. Narrative of a Year’s Journey through Central and East-
ern Arabia (1862–63), vol. 2, 3rd edn. London: MacMillan.
Payne, Thomas Edward. 1997. Describing morphosyntax: A guide for field linguists. Cam-
Potts, Daniel. 2012. In the land of the Emirates: The archaeology and history of the UAE.
Abu Dhabi: Trident Press and Sultan bin Zayed’s Culture and Media Centre.
Propp, Vladimir. 1968. Morfologiia skazki [The morphology of the folktale]. Austin,
U.S. A.: American Folklore Society.
Pustet, Regina. 2003. Copulas: Universals in the categorization of the lexicon. Oxford:
Oxford University Press.
al-Rawas, Isam. 2000. Oman in early Islamic history. Reading, UK: Garnet & Ithaca Press.
Ross, Edward C. 1874. Annals of Oman. Journal of the Asiatic Society of Bengal 43. 111–
196.
Rubin, Aaron D. 2010. The Mehri language of Oman. Leiden: Brill.
Simeone-Senelle, Marie-Claude. 1997. Modern South Arabian languages. In Robert Hetz-
ron (ed.), The Semitic languages, 378–423. London: Routledge.
Skjærvø, Prods Oktor. 1989. Languages of southeast Iran: Lārestānī, Kumzārī, Baškardī.
In Rüdiger Schmitt (ed.), Compendium Linguarum Iranicarum, 363–369. Wiesbaden:
Reichart.
Stilo, Donald. 2004. Coordination in three Western Iranian languages: Vafsi, Persian and
Gilaki. In Martin Haspelmath (ed.), Coordinating constructions, 269–330. Amsterdam
& Philadelphia: Benjamins.
Thomas, Bertram. 1930. The Kumzari dialect of the Shihuh tribe (Musandam), Arabia, and
a vocabulary. Journal of the Royal Asiatic Society 62(4). 785–854.
Ulrich, Brian John. 2008. Constructing Al-Azd: Tribal identity and society in the Early
Islamic centuries. Ann Arbor, U.S. A.: ProQuest.
Utas, Bo. 2006. ‘Genres’ in Persian literature 900–1900. In Gunilla Lindberg-Wada (ed.),
Literary history: Towards a global perspective, vol. 2, 199–241. Berlin & Boston: De
Gruyter.
Waltisberg, Michael. 2011. Syntactic typology of Semitic. In Stefan Weninger (ed.), The
Semitic languages: An international handbook, 303–329. Berlin & Boston: De Gruyter
Mouton.
Watson, Janet C.E. & Domenyk Eades. 2012. Documentation and ethnolinguistic analy-
sis of the Modern South Arabian languages. Leverhulme Trust Project Grant outline.
http://www.salford.ac.uk/__data/assets/pdf_file/0012/173010/MSALProjectInforma
tion.pdf (accessed 16 February 2013).
Watson, Janet C.E. 2012. Dialects of the Arabian Peninsula. In Stefan Weninger (ed.), The
Semitic Languages: An international handbook, 897–908. Berlin & Boston: De Gruyter
Mouton.
Zwemer, Samuel Marinus. 1902. Three journeys in northern Oman. The Geographical
Journal 19(1). 54–64.
5. The Caspian region and south Azerbaijan:
Caspian and Tatic
Donald L. Stilo
1. Introduction
This chapter focuses on two Northwest Iranian (NWI) families to the south and
west of the Caspian Sea, Caspian and Tatic. The internal relationships of these
languages remains a topic of controversy; the presentation adopted here follows
Stilo (1981). Much of the data stems from fieldwork undertaken in the region in the
1970’s, but supplemented by more recent published data. The Caspian group rep-
resents one long, unbroken dialect chain extending some 500 km from the Mazan-
derani (Māzanderāni) dialects of the Gorgān area in the east to the Gilaki dialects
of Anzali in the west. Within Caspian, a major division occurs between Eastern
Gilaki (EG) and Western Gilaki (WG), whose boundary is formed by the Sefidrud
river. WG includes the dialects of Rasht, Fuman, Anzali, and EG includes Lāhijāni,
Langerudi, Māchiāni, among others. In addition, I consider the transitional dialects
roughly of the area from Chālus to Rāmsar to be a third “Central Caspian” language.
The Tatic family consists of: A) highly diverse Tati (Tāti) dialects that are
sparsely spread over a discontinuous area extending from Vafsi in the south near
Saveh to Kilit (now extinct) of Nakhichevan/Naxjavān province of the Republic
of Azerbaijan north of the Araxes; B) Talyshi (Ṭāleshi/Taleshi in Iranian sources),
ranging from the northernmost dialect of the Masally area of Azerbaijan to the
Rudbār valley in Iran; and C) “Tatoid”, two Tati-like offshoots, Rudbāri and Tāle-
qāni/Alamuti, as discussed below. Also note the usage of “Talysh people”/”the
Talysh” vs. “the Talyshi language”.
These sub-groups and varieties are summarized in the following list. The
numbers in parentheses refer to the numbering in Fig. 1 below:
Caspian
Mazanderani (1)
Central Caspian (2)
Ramsar, Tonekabon, Kelardasht
Eastern Gilaki (3)
Lahijan, Langerud, Machian
Western Gilaki (4)
Rasht, Fuman, Anzali
660 Donald L. Stilo
Tatic
Tati
S. Tati (7)
C. Tati (8)
N. Tati (9)
Talyshi
S. Talyshi (10)
C. Talyshi (11)
C.-N. transitional (12)
N. Talyshi (13)
Tatoid
Taleqāni-Alamut Tatoid (5)
Rudbār Tatoid (6)
In this chapter, I have singled out four varieties to represent the four main groups
discussed here for more detailed treatment: Sāravi (Mazanderani), Lāhijāni (Gila-
ki), Leriki (N. Talyshi), and Vafsi (Tati), but other, especially undocumented,
varieties will be introduced when space permits.
1.1. Geographical Situation

The Caspian littoral, due to the presence of several of Iran’s major rivers and their
tributaries as well as to its relatively low elevation blocked off from the plateau
by the high physical barriers provided by the Alborz and Talesh mountain ranges,
has as a result a quite humid, mild, subtropical climate and is known for its dense
forests, rice paddies, tea plantations, citrus groves, fisheries, and diverse fauna.
Due to the climate and fertile land, Caspian and Talyshi groups are spoken in
densely populated areas.
The Caspian languages are located along the southern Caspian littoral extend-
ing eastward almost to Gorgān. Talyshi is located along the southwest and lower
areas of the western Caspian littoral, especially in the cities of Māsule, Māsāl,
Hashtpar/ Asālem in Iran, extending northwards into the Republic of Azerbaijan
through the major cities of Astara, Lenkoran, and Masally. Varieties of Caspian
(e. g., Kelārdashti, Elāshti, Velātrui) and Talyshi (esp. Māsulei, Anbarāni, Leriki)
are spoken higher up in the adjacent verdant mountain slopes and valleys facing
the Caspian. Mazanderani has spread as far south as Ure between Demāvand and
Āb-e Ali.
There is, however, a complete linguistic break with no mutual intelligibility
between Caspian and Talyshi, specifically neighboring Western Gilaki and South-
ern Talyshi. They belong to two different subgroups of NWI that arrived in the area
from different directions and have no transitional forms as a bridge between them.
By contrast to Talyshi, Tati languages are located on the Iranian plateau from
The Caspian region and south Azerbaijan: Caspian and Tatic 661
Figure 1: Locations of Caspian and Tatic varieties

662 Donald L. Stilo
the Araxes in the north and southwards to Saveh and Arak, and are sparsely dis-
tributed among villages that generally only support small populations. Notable
exceptions are the Tākestān area, Kolur, and Vafs. At least half of the populations
speaking the latter two are located in Tehran, although they retain regular contact
with their villages.
It should also be noted that in the last 40–50 years, the major cities along the
Caspian coast in the Talyshi language zone, especially in Iran, have become pre-
dominately Azerbaijani Turkish speaking (due to in-migration). Today we find the
majority of Talyshi speakers in villages outside the major cities.
2. The internal relationships of Caspian and Tatic

2.1. Caspian
Claims are often made for Galeshi as a separate Caspian dialect. While they may
be considered by some (and themselves) to be a distinct ethnic group, certainly
based on a distinct life-style, the linguistic data do not support any putative claims
for a single unified dialect. The Galesh are cowherds, whence the word Gāleš <
*gawa-raxšaka- ‘the protector of cows’ (Asatrian and Borjian 2005: 50), in moun-
tainous areas extending along the Alborz from at least the Sefidrud through to
eastern Mazanderan. Varieties of Galeshi differ from point to point just as the dia-
lects of the lowlands do. In fact, any given Galeshi dialect is probably much closer
to the corresponding lowland dialect to the north than it is to another Galeshi
dialect at a similar east-west distance.
This connection makes perfect sense given that until recently the pastoral
population probably led a summer camp/winter camp (yeylaq/qishlaq) transhu-
mant life-style, migrating down to the closest lowland point in the winter season.
This caused their “outside” focus to turn to the lowland area, rather than to other
Galesh groups to the east or west. While Galesh dialects across most of Gilan may
be mutually intelligible, they clearly exhibit some differences from one another.
The Galeshi of Gilan is totally different from the Galeshi of Mazanderan (see
Borjian and Borjian 2007a, 2007b, 2008). Further discussion of Galeshi will have
to remain for a separate investigation.
2.2. The Tatic family

Tatic consists of two closely related main groups, Tati and Talyshi, both of which
can be further grouped into Northern, Central, and Southern clusters. While the
Talysh population is quite dense and forms a classic dialect chain extending
southwards from (Azerbaijan) Masally, Lenkoran, Lerik, Āstārā, (Iran) Anbarān,
Qal’eh, Sayyādlar, Levandavil; Vizne, Jowkandān; Kargānrud, Asālem, Nāvrud;
664 Donald L. Stilo
Paresar, Tāleš-Dolab; Xošābar; Shāndermen, Māsāl, Māsule, culminating in the

easternmost (as far as I know) Oskulak in the Rudbār valley and southernmost
Chenār Rudxān (Bazin 1981: 113).
Caucasian Tat, a Southwest Iranian (SWI) language resettled from southern
Iran in late Sassanian times, is not associated in any way with the Tati family or
the term Tatic. Its closest relative is Persian and the similarity in names is purely
coincidental.
The Tati languages by contrast are disjointed and dispersed, as follows:
• Southern Tati with nine dialects in the Tākestān area of Qazvin (Yarshater 1969),
Chāli, Tākestāni, Sagzābādi, etc. and to the east, Eshtehārdi. Razajerdi merits
special status as a transition from Southern Tati to Tāleqāni (Tatoid).
• After a break from Southern Tati, Central Tati picks up to the northwest: (1) the
Tāromi group (Yarshater 1970) —Hezārrudi, Kalāsari, Bākoluri, Charazei, etc.
of Upper Tārom in Zanjān province—and including Kelāsi and Kafteji in
Gilan province; (2) Koluri, Shāli, etc. in Shāhrud district (bæxš) of Khalkhāl,
Ardabil province (Yarshater 1959); (3) Gandomābi, Siāvarudi, Nowkiāni,
etc.; (4) Kahali, Geylavāni, Tahāromi, etc.; (5) Karani, Lerdi, Dizi, Karnaqi;
and (6) slightly to the west, Kajali (Yarshater 1960) of Kāqazkonān district,
Miāneh county, showing some transitional lexical features to Northern Tati.
The first five groups above form a spectrum-like dialect chain but seasonal
work and trade bring them into regular contact with the Southern Talysh of
Māsule, Māsāl, and Shāndermen, creating a conduit for lexical and grammat-
ical borrowing, probably in both directions. The Kajalis’ Sunni denomination
also brings them into similar contact with the Sunnis of the Central Talyshi
areas.
• To the north of Central Tati, after a substantial break, we find disparate
clusters of Northern Tati dialects along the Aras (Araxes) river: Kalāsuri-
Khoynarudi, Keringāni, Harzani, and (extinct) Kilit (on the north side of the
Araxes, near Ordubad, Nakhjavan province, Republic of Azerbaijan).
2.2.1. Notes on specific enclave speech varieties and their communities

Certain languages are the results of population movements that brought speakers
of a few Tati or Talyshi varieties to other areas within Tatic, to the periphery of this
zone, or outside it. In this chapter, I will use the term “enclaves” for these isolates.
Talyshi-speaking groups
Charozh is a type of Central Talyshi (< čarāj ‘a Talyshi tribe of Iran’ (‘Abdoli
Koluri 1992: 43)), spoken by some 100–200 people in Sarak and Degadi in the
Astara Talyshi-speaking area in the Republic of Azerbaijan. As Sunnis who felt
some persecution in Shiite Iran, they were invited by Tsarist Russia to resettle
north of the border soon after the 1828 Treaty of Turkmanchay. While Charozh has
retained many emblematic features of C. Talyshi, it also displays heavy influence

from the surrounding Northern Talyshi.
Oskulak, a village in the Rudbār valley with some 1000 inhabitants at the
time of my 1976 visit, is evenly divided between two linguistic groups: Oskulaki-
Talyshi and Oskulaki-Rudbāri (Tatoid, §2.3). The former group is quadrilingual
(Talyshi, Rudbāri, Gilaki, Persian) and the latter trilingual (Rudbāri, Gilaki,
Persian).
Tati-speaking groups
Dikin, a village of 850 inhabitants at the time of my visit in 1976, located to
the northeast of Qazvin city in the Alamut valley (Stilo 2018b), has three dif-
ferent groups reflecting a divide between: A) adherents of a secret sect found in
the Alamut area who call themselves “Marāqei” or “Kalle-bozi (Goathead)”, who
speak Dikin-Marāqei (ca. 500 people), and the regular Shiite villagers (called
“Pashei (Mosquito)” by the Marāqeis), who speak either B) Dikin-Pashei, a typical
Alamuti (Tatoid) dialect of the area, or C) standard Tehrani Persian as a first lan-
guage. Marāqei groups are found in some 16 villages in the area where they only
coexist with non-Marāqei Shiites and have no villages exclusively of their own.
Marāqei dialects are generally of a highly conservative Tati type (§4.2), very dif-
ferent and unintelligible to the Tatoid “Pashei”.
Vafsi (Stilo 2004a), spoken in four villages (the Vafs-Chehreqān-Fark cluster
and Gurchān, somewhat removed), has a seemingly very complex history. It is often
classified with either Tati (Stilo 1981), the Central Dialect group (Lecoq 1989),
or as transitional between them (Redard 1970). In my opinion, it reflects a Tatic
admixture from farther north overlaying a Central Dialect substratum, as seen in
certain shared innovations typical of N. Tati, not found in Southern Tati. There is
an additional stratum of a Kurdish-speaking group, yet to be determined. This area
has attracted many different groups, as it is a highly fertile, high mountainous plain
endowed with a wide variety of orchards that extend for kilometers around the
villages and is especially attractive for its cooler weather in the summer months.
My views on this complex layering in the history of Vafsi is a topic for future
publication.
Kafteji and Kelāsi, spoken in Kafti and Kelās (Pers. Kabate and Kalās), are
located just within Rudbār district of Gilan province and are the easternmost of
the Tāromi cluster. In the early 1970s the villages, located some 12 km apart with
no intervening villages, consisted of about 350 inhabitants each. With the socio-
economic upheavals of the 1970s, however, almost all the villagers left for larger
cities and industrial centers.
Table 2A: Some common lexemes across the Caspian continuum
666
FEATURE Rashti Lāhijāni Galeshi Langerudi Rāmsari Tonkāboni Kelārdašti1 Kandelusi2 Sāravi Velātrui Urei Elāšti Xatirabadi
crow kəlač, kəlaj kəlaj, kəlač kelač kelač kelač, kælač kelaj, kælač kelaq
kəlaq qæla kelaq
throw, i-/ to-da gent, də- (r) tæ-væ-d ingæt enget ængen d-i(n)gu engess ængæn* d-engo
drop-pt də-gad, əgəne, gəne, (pres.)
ta-vəda tua-da tua-da
Donald L. Stilo
boy re(y), rikə pəsər rikə væčæ, væče rika rika rika ri:kɔ rika rika væče
re(e)k reka
sister xaxur xaxor xaxor xaxor xaxor xaxor xaxor xaxer xaxer xɔxɛr xaxer xaxɛr
buy-pr hin hen hin hi(n) hin xær xærin xær xrin xær
burn-pr suj suj suj suj suj suj suj suz suj
cook-pr pəz, pəč pəč puč puč pač, poč poč pæč pæjon pæj, pej, pæj
pej peč
louse səbəj subuj subuj subuj səbəj, espej esbij esbij esbij
osboj
nose vini dəmaγ vini vini vini fini veni feni vɛni veni feni dɛmaγ
*Forms unavailable for Elāšti (Pahlavān 2003) were supplied by Yuši data (Tāhbāz 1963), another mountainous dialect
1
Kelārdašti material is from Kalbāsi 1376/1997
2
Kandelusi material is from Jahāngiri, ’AliAsghar 1367/1988
2.3. Comparative lexicon of Caspian and Tatic
2.3.1. Caspian lexical features

Tables 2A–C present various aspects of the lexical composition and distribu-
tion across the Caspian continuum. Examples are restricted here to only a few
in each category. As Table 2A shows, many lexical items distributed across the
Caspian family still have their expected NWI reflexes and/or some special Caspian
feature.
The Caspian family has had intimate contact with Persian over a rather long period
as is evident in both lexical and grammatical domains. It is my opinion that a
certain number of Persian lexical borrowings are from a much earlier stage of
Caspian prehistory since they are very consistently distributed throughout, and
are completely integrated into, the Caspian family as a whole, e. g., Lāhijāni: don/
donəst ‘know’, zən/ ze ‘hit’, gu/gut ‘say’, zənæy ‘woman’, səg ‘dog’, mayi ‘fish’,
atəš ‘fire’, gul ‘flower’, asiyow ‘mill’, fərde ‘tomorrow’, diruz ‘yesterday’, bəhar
‘spring’, su ‘three’.
By contrast, Table 2B shows that numerous later Persian borrowings are neither
universal throughout the family nor integrated into the more conservative dialects.
The original NWI-type word is still extant in peripheral and even urban areas.
Table 2C presents a small sampling of the distribution of lexical items serving as
areal “links” along the Caspian dialect chain. Such a study must perforce allow for
gaps in our knowledge of these languages for undocumented synonyms that would
be included if the data were present.
2.3.2. Tatic lexical features

Although Tati and Talyshi are very closely related, there is wide variety within
each group—much wider than that of the Caspian group. As I have shown else-
where (Stilo 2015), Northern and Southern Talyshi should be considered two inde-
pendent languages. The diversity within Tati is even more striking, with Northern
Tati being an extremely unusual case within West Iranian overall.
As a counterpoint to this statement about diversity within Tati and Talyshi,
Table 2D briefly shows that Northern Talyshi is clearly more closely related to
Northern Tati (rows 1–6) than it is to Southern Talyshi. Central Talyshi has close
ties to the more northerly Central Tati dialects (rows 7–9), and Southern Talyshi is
quite closely related to other forms of Central Tati (rows 10–12).
2.4. Tatoid groups: a brief statement

The Rudbāri group, with many mutually intelligible dialects, is located in the
Rudbār valley on both sides of the Sefidrud river extending some 35 km from
Table 2B: Some Persian loanwords from a later period
668
Rashti Lāhijan Galeši Langrudi Māč3 Rāmsar Tonkab Kelrdšt Sāravi Urei Elašti Kordxl Xatirb
down jir jir jir jir jir jir jir jir zir jer zɛr
(jir)
hungry višta gušnə vəšnə gursənəy vešna vešna væšna vešna væšna vešna vešna gašnI
kidney fək væk γolve qolve qolbe qolve
Donald L. Stilo
pull out, birun avər kən kən kən kæn kæn boj kæn
take off (clothes)
seed tuxm tuxm tuxm, tum tuxm toxm, tum toxm, tum tim tim tim tim, toxm
sew-pr duz duz duj duj duj, duz duj duj duj duj, duz
snow berf, vərf berf, vərf vərf bərf, vərf værf værf, bærf bærf værf, bærf bærf værf varf værf
son-in-law damad damad zomə damad zoma zoma zama zoma domad damad domad damad
tear (drop) ašk ašk osïru osure æseri æsli æsli
up (jor) jor jor jəwr jor jor (bu)jor jar, jor jor bala yor bala
weave, knit baf baf vaj aj vaf b~vof
wolf gurg gurg gurg vərg verg verg verg verg verg verg vɛrg
3
Māčiāni material is from Farzpur-Māčiāni 1964, 1965

Table 2C: Intra-Caspian lexical isoglosses
Item Rashti Lāhijāni Galeshi Langrud Māčiān Rāmsari Tonkbn Kelārdš Kandlus Sāravi Velātrui Ure Elašti Xatirbd
1 3
1 sparrow čičini čušnək məljə čiš(ə)nək meljæ milje melje mička mička, mička~ mi:ška mička jika
(1, 5) məlijə miška mijka
2 child zay, zaak zak, zaak zæk/zak væčæ væčə væčče væč(č)e væčæ væče væčæ væče væče væče
kočtay kulə/-tə
3 large, pile, pila, pillə pile, pillə pilla, gætə gæt gæt, kæl gætæ, gæt gæt gæt gæt gæt
big (4, pil(l)ə pildanI pilæči gæt
12)
4 throw, ta-vəda, to-da gent, tua-da, ægent1 tæ-væ-d ingæt enget ængen d-i(n)gu engess d-ingu d-engo
drop-pt i-/də-gad tua-da, də- (r) ‘put’
əgəne gəne
5 sparrow čičini čušnək məljə čiš(ə)nək meljæ1 milje melje mička mička, mička~ mi:ška mička1 jika
(1, 5) məlijə miška mijka
6 ant (6, mur(čə), pitar pitar pitar potor putar mælije mejilæ melije mɛjile melije melijI
14) putur
7 cat (7, piča pičə pičə pičə püča puča/ə piča piča bameši bameši bæmši bamši bamši
17)
8 hole surax, sulax luk, luk, sulax luka, luka veher, le:x, sulax li selax
xulə sulaγ surəx loxar, ve:he:r
lex
9 spider labdon labdon labdon kartælæm karton kartæn vænnali bænd kartænæk2
10 lip ləb ləb, məč məčí tok tok tek, loše tek, lo: tek, lo, lu:šæ tek, læb
məččə luče luš(ɛ)
The Caspian region and south Azerbaijan: Caspian and Tatic
669
Item Rashti Lāhijāni Galeshi Langrud Māčiān Rāmsari Tonkbn Kelārdš Kandlus Sāravi Velātrui Ure Elašti Xatirbd
670
11 girl kor(ta) laku laku laku kija kija kija kija kija kija kija kija kija dɛtɛr
12 large, pile, pila, pillə pile/ə, pilla, gætə gæt gæt, kæl gætæ, gæt gæt gæt gæt, gæt
big (4, pil(l)ə pildanI pilowtI pilæči gæt gæti
12)
13 broom ja:ru jaru jaru jaru taru toru saje saje sajæ saze sɔ:jɛ sazæ2
Donald L. Stilo
14 ant (6, mur(čə), pitar pitar pitar potor putar mælije mejilæ melije mɛjile mejile melije melijI
14) putur
15 come! bía be biæ bíæ boro (y)a beru bɛru~bɔro beru bía
1
16 self xu~xo, xu xu xo, xu, xoštə(rə) xoštɛrɛ, šer še: še, xod xæ, xɛ, še, še xaš,
xud- xud xod- xod- xɔd- xed- xad-
17 cat (7, piča pičə pičə pičə püča puča/ə piča piča bameši bameši bæmši bamši bamši
17)
1) Forms unavailable for Māčiāni were supplied by Vājārgāhi data (Amirzādeh-Vājārgāh 1375/1996), some 10 km to the east of Māčiān.
2) Forms unavailable for Elāšti were supplied by Yuši data, another mountainous dialect.
3) Forms unavailable for Velātrui were filled in by Shahpoli in the same region.

Table 2D
Gloss Northern Tati Northern Talyshi C. Talyshi Southern Talyshi Central Tati
4 5
Harzani Keringān Kalāsuri N.Talyshi Viznei Jowkand Asālemi Māsāli Māsulei Kajali Koluri Karani Nowkiāni Kafteji
1 lip lev löbüt löbüd lïpüt, lïv lev lev liv ləb, lev læv bik
læfe
2 stomach gæd qædæ qædæ gædæ, lævæ ternæ lævær ternæ

lævæ
3 neck giri geri gïy məɫ1 gærdæn gærdæn mællæ gærdæn
4 moon öšmæ, ušma, ušmæ ošum mang mæng ma mung monga ma ma
5 break- sisd sist sist si, pe-si e-(š)ši iši (a-)čækəst čækæ čækæ iškist čækis škes škes
pt/intr
6 laugh- sər, sir ser ser sïr sər xur xændæ xænnæ k xor xur
pres k
7 neck giri geri gïy məɫ1 gærdæn mællæ gærdæn
8 small, vede(k) güdæ güdæ gædæ, gəč, geč ruk ruk villæ velle, velleg xordæ
little rük geč
9 laugh- sər, sir ser ser sïr ser xur xændæ xænnæ k xænnæ k xor xor xur
pres k
10 frog qərbaqa bæzæzuq væzæx værzæk væzzæk værzæk guzga guzuka guzga guza bïzγaz
11 elbow nïven- nærənjunæ əškub uškuf eškof škif

jænæ
12 stomach gæd qædæ qædæ lævæ lævæ lævæ ternæ lævær ternæ škæm
1) A velarized /l/, sometimes also called ‘dark’ or ‘back’ /l/.

671
4
Harzani material is from Zoka 1336/1957
5
Keringāni materal is from Zoka 1332/1954
Pan-Tatic lexicon
672
The following is a small sampling of lexical items that all (or almost all) Tatic languages, not including Tatoid, share (also
found outside Tatic):
Table 2E
Item N.Talyši Asālemi Māsulei Harzani Kalāsur Kajali Koluri Karani Gandomāb Hezārrud Kafteji Dik.-Marāq Chāli Alviri Vafsi
Donald L. Stilo
daughter kinæ keɫæ kellæ kinæ kinæ détæ det détæ kina détæ détæ kina tétæ tetæ kelje
girl kinæ keɫæ kellæ kinæ kinæ killig kille kéllæ kelleg kilégæ kelékæ killig teti kenækǽ kelle
dog sïpæ esbæ esbæ əsba esbæ esbe espæ sebǽ sebæ sbæ esba æsbæ æsbæ æsbæ
flower vïl vəɫ gəl vel vel vel vel vélæ vəl* vélæ velæ vélæ
hole hïl xəɫ xəl henæ helæ xel xol líæ xöliyæ, holle
kidney vek vek verek qolba vek væk væk vék(k)æ vékæ vǽk(æ) vékkæ
shovel hi(y)æ xeyæ, xiæ xeyæ hiyæ xeyæ xeyǽ, xiǽ xuyǽ xuyǽ xeǽ xoyyæ xöyæ fíæ
burn-pr væš væš væš væš væš væš væš væši de-rgesen væš veš
jump-pr vašt væz væz næv dæ-væz væz de-væz* vaz k. væz væz, (j)jæ ö-væz væz
jump-pt vašt væšt væšt nævest dæ-væšt væšt de-væšt* væšt vašt væšt ö-væšt væsd
stand-pr ïndïn mon vənden* vendæ vendær vendær išt endar is a-vendær har-
vender
*ɫ = velar l *Māsāli *‘run’ *‘bud’

Rudbār city in the south to the northernmost Oskulak. The fertile Rudbār valley is
known for its olive groves and rice paddies. The dialects that I was able to identify
and record during multiple trips to the area in 1976 consist of: (west bank, Sefid-
rud) Rudbāri; Lakei; the Juban cluster (Jubani, Sarāmarzi, Bāboni, and others);
Rostamābādi (formerly Kalurazi); Jamshidābādi; Oskulaki; (east bank) Shahrāni,
Tutkāboni. To this list we can add the dialect of Dogowharân as presented in
Lazard (1990).
The other Tatoid group, Tāleqāni-Alamuti, consists of a dialect continuum
spoken in an extensive area more densely populated than most plateau village
areas. Its North-South extension extends from the highest Alborz villages to
the Karaj-Abyek areas south of the local highway. Its East-West breadth ranges
between the two major routes leading from the plateau through the Chālus and
Rudbār passes to the Caspian. The western dialects, along the Shāhrud river, to my
knowledge, have never been documented and it is not clear exactly how far they
extend and where Rudbāri and Gilaki take over. The area is also interspersed with
numerous Azeri and Kurmanji-speaking villages.
The only groups identified with labels until now are Tāleqāni and Alamuti,
spoken along the parallel and adjacent Tāleqān and Alamut rivers northeast of
Qazvin. The confluence of these two rivers forms the larger Shahrud river which in
turn helps form the Sefidrud. For convenience sake, I shall refer to this linguistic
continuum henceforth in this chapter as Tāleqāni with the understanding that it
also includes Alamuti.
Tāleqāni and Rudbāri merit a special status that I have called “Tatoid”. Both
groups, in my opinion, were originally Tati languages that lost all the structural
hallmarks of Tati under heavy Caspian and Persian influence. The only traces,
albeit not many, that hint at Tati origins are in the lexical domain, as seen in the
brief sample in Table 2F. Table 2F also hints at a larger study of lexical items
connecting Tāleqāni and Rudbāri to the Tatic and Caspian languages that are geo-
graphically closest to each of them — Kelārdashti and Chāli for Tāleqāni-Alamuti;
Kafteji/Kelāsi and Rashti for Rudbāri.
Table 2F: Lexical similarities between Tatoid groups and the Caspian families
674
G ilaki C. T ati R udbāri (T atoid ) S. T ati T āleqāni (T atoid ) C.C asp .

6
Alamut area Tāleqān
Rashti Kafteji-(Kelāsi) Tutkāboni Jubani Chāli Kelārdašti
L exeme Dikin-Pašei Gāzarxān Gurāni
ant putur murčanæ putir moronj mörčönæ murčane murčanæ
Donald L. Stilo
cook-pr pəč pəč pæč pæč pej, peč pæč pæj pæč
crow kəlač qælač kelač kelač qolač qolač kelač
cry-pr gəryə (N) bəræm* gerye (N) (be)rmæm goræ (N) borme (N) berme (N)
down jir jir, jer jir jer jær, jæra jir jir jir jir
fall-pr dæ-kəf gen dæ-k(ah) gæn gen k- ki kow (dæ-)kæf
fall-pt dæ-kəft kæt dæ-kæt kæt de-kæt ke(t) kkæt kæt (dæ-)kæt
fox ruba los rubah luas luas luas luos ruba
girl kor kelékæ kilka(y) kelka teti dutær dotær kija
kidney fək* vǽkæ væk væk* væk,vurdu væk qolve
large pile pillæ pila pile pil(l)æ gæt* pil(o) gæt gæt
louse səbəj sbəj isbij šepeš espéjæ, f. espej osbəj esbij
mince-pr de-rjen ar-injin ænjin ænjæn enji (pt )* (æ)njæn dær-enj
sell-pr foruš xruš fruš ruš ruš (h)ruš ruš ruš
sell-pt foruxt xrut fruxt rut rut -yrut rut rut
sickle daz dáræ, f. dære dære
6
Gāzarxāni material is from Ivanow 1931.

G ilaki C. T ati R udbāri (T atoid ) S. T ati T āleqāni (T atoid ) C.C asp .
Alamut area6 Tāleqān
Rashti Kafteji-(Kelāsi) Tutkāboni Jubani Chāli Kelārdašti
L exeme Dikin-Pašei Gāzarxān Gurāni
son-in-law damad zama zama zama zo:ma zama zæma zeyma zoma
sparrow čičini mærgijə mælije mælije veškénjæ čuček melujæ čičok miška
wolf gurg værg gorg gorg vǽrgæ, f. verg verk verg
Alternates *Gāleši *Māsāli *Tākestāni *Owrāzāni
675
676 Donald L. Stilo
3. Phonology
3.1. Consonants
The general consonant inventory for most of the languages presented here is:
Table 3A
Dental/
Labial Alveolar Palatal Velar Uvular Laryngeal
voiceless p t č k (ʔ)
voiced b d j g q [g ]
voiceless f s š x [ʁ̥]
voiced v z h [ɦ]
m n
r
l
y
General observations:
• In all the languages of these groups, voiceless stops, except for /ʔ/, are aspi-
rated.
• In most languages /k/and /g/ are palatalized before front vowels and word-fi-
nally. In the Vafsi of Vafs, velars are not heavily palatalized but they are so in
Gurchān Vafsi. In Gilaki, most of Talyshi, and a few neighboring Tati languages,
these consonants are always unpalatalized. Palatalization has gone so far in
Sagzābādi that most velars have merged with the palatals before front as well
as back vowels, but not consistently: Front (voiceless) čærd ‘did’, mæmlečæt
‘country’, čöštæ ‘killed’, mömčen ‘possible’; (voiced) u-jir ‘pick up!’, vǽljæ
‘leaf’; Back (voiceless) čar ‘work’, ču(h) ‘mountain’, šečar ‘hunt(ing)’, čuæst
‘pounded’, (voiced) jow ‘want’ (< *gæv-); Unpalatalized veškenj ‘sparrow’,
mekæšest ‘pulled’, væk ‘kidney’; čærgæ ‘hen’, a-gærdæst ‘he returned’, gow
‘cow’ (see also Stilo 1994).
In Harzani, as in the local Azeri dialect, the palatalization of /k/ in final position
yields a subphonemic mid-palatal voiceless fricative [ç]; /čök/ [čöç] ‘good’.
Sokolova (1953: 113–114) lists non-palatalized vs. palatalized velars (/k/ vs.
/ḱ/, /g/ vs. /ǵ/) as separate phonemes in N. Talyshi. Neither Miller nor Pirejko
list them separately and there is no accommodation for them in the Latin or
Cyrillic alphabets used for Talyshi. In my own observations —although I never
singled this issue out for investigation— I have found that palatalized velars
exist independently in place names and in Azeri loans that are only partially
integrated into the Talyshi lexicon. Hence these phones should probably be
classified as marginal phonemes with a very low functional yield of the pala-
talized variants.
• In Mazanderani and the languages of the plateau, the uvular /q/ (always voiced)
has 1) a stop phone [g ] in initial position or when geminated, 2) a fricative
phone [ʁ] (henceforth γ) in intervocalic position, and 3) either variant in final
position. I know of no contrasting examples of [g ] vs. [γ] in the languages of
this area.
In Gilaki and Talyshi, except in cases of heavy bilingualism with Persian, there
is no uvular stop variant and the phoneme only has the variant /γ/ throughout:
Rashti: γurbán [γurbǽ>n]7 ‘sacrifice’; Lāhijāni: γələ́m [γεlέm] ‘pen’; Māsulei:
γævi ‘strong’. This feature is a hallmark of a Gilaki or Talyshi accent in Persian.
• My observation is that /h/ is voiced [ɦ] throughout the area.
• The glottal stop is highly marginal and rare. It occurs in words of Arabic origin
and is a feature of erudite (and bilingual) speech and is generally elided.
• When an original syllable-final /h/ or /ʔ/ is elided, compensatory lengthening
in the neighboring vowel occurs: Sāravi šæ:r (~ šæhr) ‘city’, mæ:n (~ mænʔ)
‘obstacle’; Lāhijāni ma:kəmə (~ mahkəmə) ‘tribunal’; ta:rif (~ taʔrif) ‘defini-
tion’, Vafsi qæ:r (~ qæhr) ‘anger’, mæ:ni (~ mæʔni) ‘meaning’.
Additions and permutations to the phoneme inventory above:

• Some languages have a /ž/, contrasting with both /š/ and /j/, but it generally
has a low functional load. Exx: Leriki žæ ‘hit (ppl.)’ – jæv ‘barley’ – šæ ‘gone’,
šæž ‘you have gone’ – bevæj ‘bad’ – šæš ‘six’; Asālemi že ‘to hit’ – jego
‘place’ – še ‘to go’; nə́mæž ‘I don’t hit’ – nə́mæš ‘I don’t go’ – ávæj ‘pull off!’;
Koluri žen ‘lost’ – jel ‘cheek’ – šet ‘milk’. There is no strong evidence for /ž/
in Mazanderani, Gilaki, or S. Tati. The four examples that Jahāngiri (2003: 28)
adduces mostly occur in the context VžC and probably do not contrast with
/j/ (further research is needed on its status). The Tonekāboni /ž/ phoneme
(Amini 2003: 43–44) has an extremely low load functional load. Vafsi has no
/ž/ phoneme although it does occur as an allophone of /j/.
• Vafsi has a voiceless fricative trill phoneme /ř/ with low functional yield. It
consists of a simultaneous voiceless alveolar trill and voiceless alveolar-api-
cal fricative, somewhat similar to the Czech phone /ř/ but voiceless. It occurs
only medially, either VřV (ařé ‘mill’ guřækenlé ‘badger’) or CřV (bínřiæ ‘I
bought’), although it perhaps also occurs initially.
• Vafsi /v/ has a frequent allophone [w] which (along with the glides ew and
əw) is emblematic for Vafsi speakers. It occurs intervocalically next to a back
vowel, most prominently /a/. The other vowel may be a non-high front or back
vowel. Its distribution is particularly obvious in morphological contexts, e. g.,
ærvendom ‘I find’ vs. háwend ‘find!’, báve [báwe] ‘s/he came’ (with historical
(m > v/w). When the other vowel is /æ/ (æva, avæ), the whole sequence changes
7
The superscript symbols > and < denote retracted and advanced vowel qualities respec-
tively.
678 Donald L. Stilo
to awa: ærvazom > báwazom ‘I say (present > Subjunctive)’ < *bǽvazom; yæv
‘barley’ vs. yawá ‘threshed barley’, but note the feminine morpheme in áwæ
‘water’. It seems, however, that the word vávæ ‘almond’ (*váwæ) contrasts
with áwæ, thus conferring phonemic status to both /v/ and /w/, but a fuller
statement must remain for future investigation. When only front vowels are
present, the [w] phone is not used: bǽvæ ‘take!’, dív/e ‘face/s’. The [v] allo-
phone is also retained in sequences of -iva-, e.g., ivan ‘threshing floor’, but not
-avi- > -ewi-.
3.2. Vowels and Glides

There is a general split in these languages between a 6-vowel and 7-vowel inven-
tory, plus various diphthongs. In the plateau area, essentially most of Tati, there is
often a strict division of front vs. back, with no central vowels. Some languages,
however, have a central, a high back unrounded, or a front rounded vowel as a
seventh vowel. An eighth phoneme /ü/ is quite rare, but occurs as a common allo-
phone of /u/.
Table 3B
Vafsi, Hezārrudi Razajerdi, Alviri, Chāli Koluri, Māsulei
Gandomābi, Nowkiāni Dikin-Marāqei, Kajali Kafteji-Kelāsi
F B F B F C B
i u i u i u
e o e ö o e ə o
æ a [ɒ] æ a [ɒ] æ a [ɒ]
Rashti, Lahijāni Mazanderani Leriki

F C B F B F B
i u i u i ï u
e ə o e o e o
←a→ ɛ ←æ→ ←a→
æ a [ɒ]
One unclear issue for Mazanderani in general is the distinction of two front vowels
/e/ and /ε/. The two main grammars of Sāravi, Shokri (1995) and Yoshie (1996),
make no mention of this distinction, but it is made for other dialects in Rastorgueva
and Edel’man (1982), Borjian and Borjian (2007b), Lambton (1938), and in my
own field notes for Bābolsari.
Minimal (or near-minimal) pairs in various dialects: Kordkheyli (Borjian) tε
‘you’ vs. te ‘your’; me ‘my’ vs. mε ‘I (subject)’; vénε ‘must, should’ vs. vέne ‘his,
her, its’; ære ‘yes’ vs. kærε ‘butter’; Bābolsari (Stilo field notes) te ‘your(s)’ vs.
tε ‘you’; se ‘apple’ vs. sε ‘three’; me ‘my’ vs. mεn ‘I’; šεmé ‘your (pl.)’ vs. jεmέ
‘shirt’; vésse ‘for’ vs. vέssene ‘it tears’; Velātrui (Lambton) véni ‘must’ vs. vεní
‘nose’; (Rastorgueva & Edel’man) dε ‘two’ vs. de(h) ‘village’ vs. dæ ‘ten’; ser
‘sated’ vs. sεrx ‘red’. Shokri, however, does occasionally show a type of contrast:
se ‘three’ vs. se: ‘apple’; ser ‘secret’/serx ‘red’ vs. se:r ‘sated’. Contrasts are rare
but a token count of a portion of Borjian’s corpus shows the low functional load of
/e/: ε 2228, a 971, e 730, æ 494.
In Leriki /æ/ can be as far front as in Persian or Azeri but both /æ/ and /a/ are
mostly articulated slightly more centrally while still retaining their distinction: da
‘ten’ vs. dæ ‘valley’; ha ‘each’ vs. hæ ‘donkey’; hani ‘sleep (oblq.)’ vs. hæni ‘still,
yet’.
General observations on Gilaki vowels:
• /e/ and /o/ have a rather low functional load in Gilaki, slightly higher in Lāhi-
jāni.
• An expected /ə/ changes to /a/ in contact with /h/ (which may then be elided
in postvocalic position yielding compensatory lengthening: Lāhijāni hasən
‘Hasan’, ma:kəmə ‘tribunal’ (NJ:35).
• As Figure 2 shows, the /a/ and /ə/ phonemes have a wide range of free-varying
allophones; the phone [æ] is much more common in Lāhijāni than in Rashti.
The allophones of /a/ and /ə/ are often difficult to distinguish, e. g., hæsə́n=a
(Objective) and hæsə́n=ə (Possessive) often sounded the same.
• For interest’s sake, the vowels of the Lāhijāni text in §10.1 are left in allo-
phonic transcription.
Figure 2: Phonetics and phonology of Lāhijāni vowels

680 Donald L. Stilo
Examples for Fig. 2:

/ə/: [ə] bəyn ‘look!’, bə́kəti ‘he fell’, də́vədənəm ‘I close’, sər ~ sεr ‘head’
[i ] óri ‘there’, šə́ni ‘goes’, búdin ‘they did’, xəndí ‘laughter’; də́ri ‘is in’
[ε] γεlέm ‘pen’, bútε ~ -ti ‘he said’, bə́- ~ bέhεm ‘I bought’, miέn ‘(with)
in’, γɛzá ‘food’, irəj [irɛ>j] ‘man’s name’
[ï] rikḯ ‘boy’, bḯ~bέ~bə́mæ ‘he came’, bḯšun ‘(that) he go’,
rḯng~rεng~rəng ‘color’
/a/: [a] bέma~bə́mæ ‘he came’, zakón ‘children’, dubaré ‘again’, zæmán
‘time’
[æ] tifǽng ‘gun’, æmó ‘we’, nǽkun ‘don’t do!’, ǽgə ~ ǽgε ‘if’, ǽni ‘you
come’, də́væssyem ‘I closed’, hæsə́n ‘P.N.’, mæ ‘me (oblq.)’
[a<] isæm~isa<m ‘I am at (Be3)’
/u/: [u ]: búšo ‘s/he went’, búgom ‘(that) I say’, turš ‘sour’, púrsi ‘you used
to ask’
[u]: bútε ‘s/he said’, mu ‘I’, γurí ‘teapot’, ruz ‘day’, pul ‘money’, bum ‘I
was’
Glides and other additions and permutations in the above vocalic systems:
Vafsi: The Vafsi mid, front vowel /e/ is realized phonetically as a higher [I] in all
positions, except with off-glides. Vafsi glides are: /ey ~ æy/, the marginal /uy/,
/oy/, /ay/, and the rather distinctive (and emblematic for the Vafsi speakers) /əw/
and /εw/. The frequencies of each Vafsi vowel (in tokens) from a small corpus
containing 6579 vowel nuclei are as follows: /æ/ 1830; /e/ 1595; /a/ 1244; /i/ 1138;
/o/ 580; /u/ 192.
Vafsi vowels and glides:

æ: ærvæzóm ‘I jump’, ayrǽ ‘hives’ ey: eyd ‘holiday’, leyléy ‘boy (oblq.)’
e: esdařé ‘star’, qeréq ‘frost’ uy: šuy ‘husband (oblq.)’ (rare and
avoided)
a: ána ‘that direction’, ban ‘roof’ oy: (rare) roy ‘place name’
i: in ‘this’, vilí ‘sleet’ ay: ayr ‘fire’, vaym ‘I said’, kay
‘where’
o: orián ‘mumps’, goló ‘throat’ əw: əwr ‘cloud’, vəwsi ‘Vafsi’, təw
‘taste’
u: uí ‘hope’, durú ‘lie’ ew: ewí ‘open field’, hewlí ‘donkey
foal’, hæréw ‘hæram, ritually
unclean’
Vafsi consonants:
p: pilæ ‘pod’, hápærs ‘ask!’, s: seza ‘voice’, ræsænǽ ‘rope’, luas
čærp ‘fatty’ ‘fox’
b: bæbǽkæ ‘pupil (eye)’, z: zellé ‘woman’, azan ‘this way’,
qærb ‘grave’ æz ‘I’
t: tútæ ‘mulberry (f.)’, š: šišæk ‘2-year-old lamb’, qeyš
kirfit ‘match(es)’ ‘belt’
d: dade ‘older sister’, gæsd ‘bad’ x: xerxelle ‘esophagus’, feráx
‘wide’
č: či ‘fodder’, gerǽč ‘chalk’ h: hólæ ‘ash’, ahén ‘iron’, dæh ‘ten’
j: jujíæ ‘chick’, qolenj ‘cramp m: mama ‘midwife’, čæmm ‘eye’
k: kokk ‘fat’, tik ‘thorn’ n: nošoáni ‘evening’, ætén ‘now’
g: gælǽgæ ‘basket’, værg ‘wolf’ r: rékkæ ‘line’, širíšæ ‘leek’, ayr
‘fire’
q: qereq ‘frost’, aqáte [aγáte] ř: ařé ‘mill’
‘speech’
f: fǽrfæ ‘snow’, saf ‘smooth’ l: leylǽ ‘boy’, bol ‘tarantula’
v: vávæ ‘almond’, gændev ‘wheat’ y: yúæ ‘heifer’, xeyar ‘cucumber’
Māsulei vowels and glides:
æ: æmæ ‘we’, šæm ‘we go’ u: uškám ‘I split open’, pučú ‘cat’
e: emé ‘this (obl.)’, xelék ‘spade’ ey: veyv ‘bride’, šey ‘shirt’
ə: əštə́ ‘your’, bəz ‘goat’ æy: æy ‘him (oblq.)’, pæydo
‘apparent’
a: adoa ‘had given’, xam ‘I want’ uy: uy ‘he would come’, xuyš ‘you
wanted’
i: íllæ ‘one, a’, vieri ‘he would oy: paltóy ~ paltói ‘a coat’
take’
o: ošan ‘tonight’, zonó ‘knee’ ay: aynæ ‘mirror’, xay ‘you want’
Māsulei consonants:
p: pierə́m ‘I pick up’, veprazəm z: zonə́m ‘I know’, væzə́m ‘I jump’,
‘I lean’ æz ‘I’
b: bərá ‘brother’, lablab ‘barking’ š: šen ‘to go’, xorošə́m ‘I sell’,
bə́nəš ‘sit!’
t: taxt ‘galloping’, šə́təmæ ž: žænə́ ‘hits’, bə́žænəm ‘(that) I hit’
‘I was able’
d: darə́m ‘I have’, šædid ‘intense’ x: xər ‘fog’, dǽxon ‘call!’, bə́čærx
‘turn!’
č: čem ‘eye’, əčə́ra ‘why?’, hič γ: γaz ‘goose’, dæγóštəmæ ‘I drew
‘none’ water’,
682 Donald L. Stilo
j: jevæjə́m ‘I pull down’, ávæj ‘pull ræfeγ ‘friend’

up’
k: katek ‘cottage’, čækə́ ‘it breaks’ h: hærə́m ‘I eat’, gahen ‘cows’
g: gusǽnd ‘sheep’, agærdə́m ‘I m: mǽmon ‘don’t stay’, am ‘I come’
return’,
xesə́rg ‘mother-in-law’ n: næm ‘I put’, venen ‘they see’
f: færš ‘carpet’, kæfá ‘up’, sef r: ruk ‘small’, bəroær ‘brother
‘apple’ (obl)’
v: vervoærəm ‘I unload’, lev ‘leaf’ l: lækə́m ‘I fall’, kilil ‘key’
s: siə́ ‘it burns’, æsə́rg ‘tear’, y: yen ‘wife’, dayen ‘aunt
péles ‘lick!’ (MoBroWi)’
Lāhijāni vowels and glides:
ə: də́vəd ‘close!’, rikə́ ‘boy’ əy: bəys ‘stay!’, alnə́y ‘right now’
e: ére ‘here’, čére ‘why?’ ay: háytəm ‘I got’, kočtay ‘child’
a: ánəm ‘I come’, háda ‘s/he gave’ oy: lakóy ‘girl’
i: ísi ‘you are’, tínəm ‘I can’ uy: kuyta ~ koyta ‘which one?’
o: óre ‘there’, xonə́ ‘house’, æmo aw: aw ‘water’
‘we’
u: úsan ‘pick up!’, murjú ‘lentil’ ow: ištóws ‘he heard’, gow ‘cow’
Lāhijāni consonants:
p: pildanə́ ‘big’, bə́purs ‘ask!’ z: zak ‘child’, bə́zən ‘hit!’, aγúz
‘walnut’
b: bə́bin ‘cut!’, gəb ‘talk’ š: šəmə ‘you’, ušón ‘they’, piš ‘ago’
t: tódi ‘throw!’, bútəm ‘I said’, x: xaxor ‘sister’, surx ‘red’
kot ‘coat’
d: də́vəd ‘close!’, hádam ‘I gave’ γ: γelém ‘pen’, baγón ‘gardens’,
duróγ ‘lie’
č: čére ‘why?’, bə́čin ‘pick!’, h: hádi ‘give!’, bə́hem ‘I bought’
hič ‘none’
j: jujə ‘chick’, xoj ‘(wild) pear’ m: mi ‘my’, bə́mam ‘I came
k: kərk ‘chicken’, laku ‘girl’ n: nánəm ‘I don’t come’, livón
‘glass’
g: gúnəm ‘I say’, ágə ‘if’, sang r: rikə́ ‘boy’, óre ‘there’, pir ‘old’
‘stone’
f: fúxus ‘hide!’, tifǽng ‘gun’, l: lakú ‘girl’, pələ́ ‘pilaff’, jəngə́l
tif ‘thorn’ ‘forest’
v: víris ‘get up!’, də́vəd ‘tie!’, y: yad ‘memory’, diyát ‘village(s)’
div ‘demon’
s: sib ‘apple’, kəsif ‘dirty’, durus
‘right’
Sāravi vowels and glides (Shokri 1995):

æ: æger ‘if’, ræsen ‘rope’, dæ ‘ten’ u: u ‘water’, lulu ‘bogey man’
e: esbe ‘white’, kerk ‘hen’ æy: æyd ‘holiday’, bæyr ‘take!’
a: azad ‘free’, esa ‘now’ ay: ayne ‘mirror’
i: izi ‘type of aphid’, tim ‘seed’ ow: dowr ‘around’
o: oresi ‘shoe(s)’, kol ‘dull’
Sāravi consonants (Shokri 1995):
p: pærčim ‘fence’, hápers ‘ask!’, s: saze ‘broom’, lesek ‘snail’,
čæp ‘left’ kindes ‘medlar’
b: bǽkub ‘pound!’, zebún ‘tongue’ z: zezem ‘wasp’, kuze ‘jug’, maz
‘bee’
t: tæli ‘thorn’, deter ‘girl’, gæt š: šal ‘jackal’, vešna ‘hungry’,
‘big’ héniš ‘sit!’
d: dǽgerd ‘return!’, badem x: xalik ‘saliva’, tælxun ‘tarragon’,
‘almond’ sulax ‘hole’
č: čečem ‘type of weed’, nuč h: hæli ‘plum’, ahén ‘iron’
‘sticky’
j: jeme ‘shirt’, keje ‘where’, kelaj m: mi ‘hair’, nemašun ‘evening’,
‘crow’ dim ‘face’
k: kæk ‘flea’, esseka ‘bone’, n: nemek ‘salt’, kenes ‘stingy’,
fenizek ‘mucus’ kelen ‘ash’
g: gæt ‘large’, æger ‘if’, verg r: rika ‘boy’, pe:ra ‘day after
‘wolf’ tomorrow’, mar ‘mother’
q: qotele ‘teeny’, mæqez [γ] ‘fly’, l: læmbik ‘termite’, geles ‘drool’,
daq ‘hot’ gæl ‘mouse’
f: fiye ‘shovel’, æšnofe ‘sneeze’, y: yæx ‘ice’, saye ‘shade’
jif ‘pocket’
v: væk ‘frog’, kævez ‘turtle’
Leriki vowels and glides:
æ: ævæsór ‘spring (season)’, æy: æyšte ‘to get up’, pæynǽ ‘day
æmǽ ‘we’ before yesterday’, šæy ‘shirt’
e: éfešïn ‘sneeze!’, hæyé ‘to itch’ uy: (rare > u) ruy ‘river (oblq.)’
a: asp ‘horse’, bala ‘cub’ oy: boylu ‘little brother’, moy ‘fish’
i: ižnæn ‘again’, ævindí ‘he would ay: ayb ‘shame’, ovaynǽ ‘mirror’
see’
ï: ïštǽn ‘self’, angïštḯ ‘ring’ ïy: gïy ‘neck’
o: oγó ‘awake’, astovǽ ‘star’ iy: žiy ‘down’
u: umi ‘hope’, tul ‘mud’, lu [lü]
‘peel’
684 Donald L. Stilo
Leriki consonants
p: piyo ‘over’, bǽpe ‘must’, z: zínæ ‘yesterday’, væzǽx ‘frog’,
bip ‘quince’ az ‘I’
b: barz ‘high’, sïbḯž ‘louse’, lïmb š: šïndḯ ‘ladder’, éfešïn ‘sneeze!’,
‘snot’ meš ‘bee’
t: tæγærs ‘hail’, kütlæ ‘puppy’, ž: žiǽ ‘rope’, bæží ‘alive’, nimǽž
lïpǘt ‘lip’ ‘noon’
d: di ‘late’, pédïr ‘rip!’, péšand x: xun ‘blood’, oxo ‘end’, væzæx
‘pour!’ ‘frog’
č: čóknæ ‘how?’, nečï ‘wolf’, γ: γandé ‘to send’, æγḯl ‘child’, sïγ
hič ‘none ‘stone’
j: jæv ‘barley’, penjo ‘fifty’, h:; havz ‘green’, éhašt ‘hang!’
bevæj ‘bad’
k: kæ ‘house’, ókæ ‘open!’, ak m: mæsé ‘to hear’, æmæ ‘we’, yem
‘sweat’ ‘fodder’
g: gæv ‘mouth’, égïn ‘fall!’, kag n: neči ‘wolf’, zonḯ ‘knee’, bḯndïn
‘hen’ ‘stand!’
f: fik ‘thought’, gofe ‘cradle’, r: rïvós ‘fox’, morǽ ‘mouse’, bḯsïr
lef ‘quilt’ ‘laugh!’
v: voš ‘rain’, ævon ‘they’, ov l: kælæ ‘bull’, bḯlïv ‘move!’, hïl
‘water’ ‘hole’
s: sük ‘rooster’, osḯn ‘iron’, ars y: yol ‘big’, væyǽ ‘wedding’
‘tear’
3.3. Stress
When no inflectional morphology is involved, including feminine forms, lexical
stress is word-final in most word classes: nouns, pronouns, adjectives, numerals,
and non-finite verb forms, e. g., infinitives and participles. Conjunctions (e. g.,
ǽge ‘if’) and sometimes adverbs (e. g., bǽlke ‘perhaps’) allow for final or non-fi-
nal stress.
Table 3C: Stress, main categories

Noun Numeral Pronoun Adjectives Adverb
Sāravi riká yazzé emá vešná ferdá
Lāhijāni rikə́ amó gursənə́ pildanə́ fərdé
Rashti pəsə́r yazdə́ amá vištá pilé, pil(l)ə́ fərdá
Māsulei zoǽ yunzǽ (Māsāli) ænəštə́/višír sæbá
Asālemi zuǽ æmǽ veší sævsǽr
Leriki zoǽ yonzǽ æmǽ væší mášti
Kalāsuri zurǽ æmén væší yækǽ sæbá
Koluri kulǽ yanzǽ æmǽ višír pillǽ ærá
Kelāsi kotǽ yazzǽ amá xará pillǽ ará
Chāli purǽ yazzǽ æmá göšnǽ pil(l)ǽ æhrá
Vafsi leylǽ yazdǽ awán gosnǽ mæzǽn soáy
‘boy’ ‘eleven’ ‘we’ ‘hungry’ ‘large’ ‘tomorrow’
Languages with feminine gender in nouns (and in some languages, adjectives mod-
ifying them) place the stress routinely on the penultimate. This does not include
the Vafsi Feminine II type (§5.2.1), which takes final-stress in the Direct singular.
Table 3D: Stress and nominal morphology

Vafsi ‘wheat stalk (f . dir . sg .)’ suálæ sualǽ ‘unglazed pot shard’
(m . obl . sg .)
‘hens (dir . pl .)’ kǽrge kærgé ‘hen (f . obl . sg .)’
‘big (f . dir . sg .)’ mæzǽnæ mæzænǽ ‘the big one (m . dir . sg .)’
‘starlings (dir . pl .)’ čílle čillé ‘scarlet fever
(fem II. dir . sg .)’
Kafteji ‘hens (dir . pl .)’ kǽrgə kærgə́ ‘hen (f . obl . sg .)’
Since the punctual marker bV- often occurs in both the Preterit and the Past Parti-
ciple, the former (a finite form) is stressed differently from the latter (a non-finite
form). In Vafsi the stress pattern in the Present Perfect (see §6.5.9) generally con-
trasts with the feminine form of the Past Participle. In addition, the stress patterns
for the Imperfect and the Past Subjunctive are different in Leriki Northern Talyshi.
686 Donald L. Stilo
Table 3E: Stress and verbal morphology

Vafsi ‘sewn (ppl )’ bæduttǽ bǽduttæ ‘X sewed’
‘sewn (ppl , f )’ bædutté bædútte ‘X has sewn’
Sāravi ‘broken (ppl )’ beškessé béškesse ‘X broke’ (SH: 40)
Asālemi ‘risen (ppl )’ æišté æíštæ ‘X has risen’
Leriki ‘I picked up (imf )’ peægætím peǽgætim ‘I picked up (pt .sbj )’
Table 3F: Other random stress contrasts

Lāhijāni ‘lady’ xanə́m xánəm ‘I want’
Kordkheyli ‘for’ vessé vésse ‘had to’
Leriki ‘corn, maize’ mækǽ mǽkæ ‘don’t do!’
Bābolsari ‘type of insect’ bæú bǽu ‘become!’
Asālemi ‘don’t get up!’ pe-mǽ-b pé-m-æ-b ‘I get up’

The common syllable structure throughout the area is (C)V(C)(C). Initial conso-
nant clusters, as a general rule, are not allowed. Examples:
V: Sāravi u ‘water’, ema ‘we’; Lāhijāni u ‘he’, óre ‘here’; Leriki e
‘strawberry’, ïsæ ‘now’; Vafsi awan ‘we’, éna ‘this way, in this
direction’
CV: Sāravi še ‘self’, saze ‘broom’, lue ‘bark (dog)’; Lāhijāni su ‘three’, rikə
‘boy’; Leriki lo ‘crow’, væyu ‘bride’, zoæ ‘boy’; Vafsi bi ‘again’, vávæ
‘almond’, luas ‘fox’
VC: Sāravi un ‘that’; Lāhijāni alnə́y ‘right now’; Leriki ak ‘sweat’; Vafsi æz
‘I’
CVC: Sāravi; Lāhijāni jir ‘down’, væsni ‘co-wife’; Leriki gæv ‘mouth’, soγnæ
‘shade’; Vafsi bær ‘door’, kelje ‘girl’
CVCC: Sāravi šeft ‘crazy’; Lāhijāni kərk ‘hen’; Leriki merd ‘man’; Vafsi gæsd
‘bad’
3.5. Gemination
While gemination is in principle possible across the whole area, it is often seen as
a feature of erudite speech, esp. when occurring in Arabic loanwords. And indeed
the gemination is usually retained in highly erudite words. Original geminated
consonants in most words are for the most part degeminated. There are significant
exceptions.
Gemination in Sāravi tends to be less frequent than in other varieties: hæsseka
‘bone’. Geminated nasals created in the Present tense via morphology tend to
undergo dissimilation in Sāravi but in other dialects are retained, e. g., Bābolsari
dun-n-i = Sāravi dundi ‘you know’, *kæfenme > Bābolsari kæfemme = Sāravi
kæfembe ‘I fall’.
Gemination seems to be more frequent in some forms of Tati. It is fairly fre-
quent in Vafsi: koppel ‘tiny’, hottiǽni ‘bedtime’, rékkæ ‘line’, méssæ ‘fly’, esbezze
‘spleen’, ušša ‘pimple’, æjjene ‘s/he hits’, ænnie ‘s/he puts’, holle ‘hole’, zarru
‘child’ and final gemination is usually only retained when followed by another
vowel in a suffix, enclitic, or even another word: rott ‘place’, čæmm/čǽmme
‘eye/s’, verr ‘crazy’. Koluri has mæččæ ‘lip’, killə́ ‘girl’, assen ‘anvil’, fett ‘full’.
Chāli (Yarshater 1969: 55) has béttæj ‘run!’, xæssǽ ‘bone’, xöyyǽ ‘wooden
shovel’. Dikin Marāqei has killégæ ‘girl’, ænnán ‘I put’, néggion ‘I am not’, zollég
‘boy’. Kelāsi has bə́mmæ ‘s/he died’, bə́zzæ ‘hit’, pillæ ‘big’ but very few other
examples appear in my field notes. Gemination is not as common in Leriki as in
other languages but seems to be retained in certain words where it occurs: bællædo
‘oak’. Note also mækǽ ‘corn, maize’ vs. mækkǽ ‘Mecca’.
3.6. Intonation
Due to space limitations, I refer the reader to the claim made in Stilo (2004b: 274–
277) that languages of western Iran have five basic intonational contours requiring
the recognition of five levels of pitch. Four of these pitch levels are contrastive on
the clause level only and come only following the obligatory highest Level 5 pitch
(usually the only one per clause), which is borne by the word in focus according
to the information structure of the clause. In §8.2, points 4 and 5, I discuss two
highly distinctive contours used for 1) sentence-initial adverbial clauses and rela-
tive clauses, which I mark with (,,) in my transcription system, and 2) right-shifted
adverbial clauses. The latter forms a (less common) sixth contrastive intonational
contour and a sixth (extra-low) pitch level.
4. Grammatical isoglosses
4.1. Grammatical isoglosses in the Caspian family

The Caspian family as a whole shows a systemic grammatical restructuring
through convergence with Persian: loss of case (§5.1.1) and gender, the =ra Objec-
tive marker (§5.1.3), the merger of intransitive and transitive verb types (§6.1),
Passivization (§6.10.2) among others. There are some areas, however, that have
688 Donald L. Stilo
resisted Persian influence, including important isoglosses that help define the
Caspian group. Certain of these features are shared by some Talyshi-Tati-Tatoid
dialects (#3–5) or most forms of Tatic (#1, 2):
1. Caspian languages generally retain left-branching word order: Adjective-
HEAD (§5.5.1), Possessor-HEAD (§5.4.1), and Postpositions (§5.7). Due to
recent heavy influence from Tehrani Persian, these word order domains are
now in flux as well.
2. Dedicated preposed Possessive forms of pronouns (§5.1.2)
3. Set2 possessive enclitics on nouns are lacking (see next Point)
4. No Set2 (encoding A, P) in verb paradigms; as opposed to Tatic (for the most
part), the Patient is never encoded in the verb.
5. Restructuring of verbal paradigms did not include the Persian TAM me-/
mi-. By contrast, Caspian and Tatic languages have a wide range of TAMs
(see §6.5.1).
Other isoglosses reflect divisions within Caspian. The base for comparison here is
EG (Lāhijāni). Some of these isoglosses are shared areally between EG and other
Caspian, Tati and/or the Semnan area group (1–9) or between WG and Tati (8–10):
Table 4: Isoglosses of Eastern Gilaki

Eastern Gilaki As opposed to: EG shares with:
1. -(V)n- Present tense marker Ø marker, WG (§6.5.1) rest of Caspian, Semnan
2. -an/-on plural marker -ešan, Central Caspian rest of Caspian
3. *-ft, -xt > -t(t) retention of -ft, -xt, WG most of Caspian, NWI
4. six ‘to be’ verbs (§6.8) three ‘to be’ verbs (§6.8.1) WG, (5 in S. Talyshi)
5. X-ka-Be4-, Progressive II missing, WG (CC?) missing throughout?
6. preverb ha- ~ hæ- fa- ~ fæ-, WG most of Caspian, NWI
7. Be4 (‘be in’), Progressive I all Caspian
(§6.5.8.1)
8. Three main preverbs Multiple Preverbs, WG most of Caspian (§6.3)
9. Ø Imperfect marker (§6.5.7) -i-, WG most of Caspian
10. Stress on Preterit marker stress on root, Preterit, WG most of Caspian
11. -ena- conditional marker missing, WG Rāmsari, Tonekāboni
12. Distinct 2/3pl PAMs Syncretism in Rashti most NWI
13. Future: invariable ‘want’ Future: conjugated ‘want’ missing in most NWI
+ Subjunctive (§6.5.4) ‘want’+ Infinitive (exc. some CPD)
14. mæn-ni- negative of ‘can’ missing elsewhere
4.2. Grammatical isoglosses in the Tatic family (not including Tatoid)

The Tatic family is highly conservative, as demonstrated by the list of features
below, shared by most varieties. Many of these features are also found elsewhere
in NWI.
Conservative Features in Tatic:

1. Grammatical gender (Stilo: forthcoming d), still found in half of the Tati
group.
2. Masculine and Feminine singular forms (and occasionally a Feminine II) and
the plural forms each have distinct Direct vs. Oblique cases (Stilo: 2008b).
3. A special Oblique formant -ær is used for kinship terms of either gender.
4. Oblique case is used to mark Object in the Present system fairly universally,
except for Alviri (no case), Razajerdi, and Harzani (both with=ra marking).
5. Canonical Ergativity is quite robust in some languages: Oblique A, Direct P,
and both coreferenced in verb. Others have vestigial or marginal Ergativity.
6. Distinct Intransitive and Transitive paradigms in tenses built on Past stem.
7. Set2 (Oblique) PAMs co-index the Agent and Set1 (Direct) PAMs often co-in-
dex the Patient, reflecting canonical nominal Ergativity in the Past (§7.3).
8. Regular retention of a synthetic passive formation (§6.10.2).
9. Most Tatic languages retain an Optative paradigm (§6.5.3).
10. Certain Tatic languages retain the 2nd singular Set1 PAM -iš (cf. -i elsewhere).
Tatic Innovations:
11. The Ezafe may exist sporadically, seemingly a borrowing from Persian.
12. Postpositions are usual, but some languages have a mixed typology (§5.7.1).
13. Preposed Genitive case of pronouns and the interrogative ki ‘who’ formed
with a prefix čə- ~ əš- (< *hača ‘from’), quite regular across Tatic, (§5.2.2).
14. Set2 possessive enclitics (§7.3) are highly restricted or are lacking in most of
Talyshi and some neighboring Tati varieties (Taromi group, Nowkiani).
15. Present TAMs have wide variation, areally distributed (see §6.5.1)
16. Imperfect TAMs have wide variation, areally distributed (see §6.5.7)
17. Alignments in the Past range from Ergative to Nominative-Accusative, to
complete loss of case (Neutral alignment).
18. Some languages have Double Oblique marking, in which both A and P are
Oblique-marked, either as the sole alignment or as an alternate (§7.1.3.2).
19. Differential Object Marking (DOM): Oblique case-marking only for salient
Ps.
690 Donald L. Stilo
4.2.1. Talyshi isoglosses

Features shared by all Talyshi dialects (most repeated from above list):
1. Two-term case system at least in the singular, also in plural in many dialects
2. Lack of gender
3. Special genitival cases in pronouns, formed with the prefix čï- ~ ïš-
4. 2nd singular -iš in at least some tense-aspect-modal paradigms
5. leftbranching NP adjuncts (GN, AN) as the dominant pattern
6. Imperfect suffix formant -i- (see also point 12 below)
7. Formal contrast between past system intransitive and transitive paradigms
8. Set2 PAMs generally leftwardly mobile in the past
Features that distinguish some Talyshi dialects from others:

Diachronic Phonology
9. N. Talyshi loss of intervocalic and postvocalic -r (with exceptions): (Leriki)
šin ‘sweet’, kag ‘hen’, do ‘tree’, but pónæ ‘last year’ vs. pornǽ ‘calf’; (Leriki,
Viznei) vit- ‘run’; divo ‘wall’, but (Leriki) bæm- vs. (Viznei) bəræm- ‘cry’.
10. Shift of low vowels: *a (æ) > ā (a): (Leriki, Viznei) kag ‘hen’, mard- ‘die’,
asp ‘horse’, az ‘I’, gandəm ‘wheat’; *ā (a) > o: (Leriki, Viznei) vot ‘say’, boγ
‘garden, osïn ‘iron’, pónæ ‘last year’, sor ‘year’, ov ‘water’, do ‘tree’, divo
‘wall’, čok ‘good’, ro ‘road’, bozor ‘market’.
Synchronic Morphology
11. Plural Direct/Oblique cases merged in N. Talyshi but distinguished in others.
12. Augment formant æ- for the Imperfect tense, common in all of Northern and
Central Talyshi dialects but lacking in Southern Talyshi (and in Tati).
13. Leftward mobility of Set1 PAMs in analytically-formed tenses (§7.4.3).
14. Many Present and Past verb stems merge in N. Talyshi, esp. those ending in d/t
(Leriki: bḯhand ‘read!’), but there is a small subset with very distinct forms,
e. g., žæn/žæ ‘hit’, and a subset with an i formant (čini) ‘pick’ that ‘… includes
all verbs bearing the causative suffixes -n- and -ovn-…’ (Kaye 2013: 201).
Kaye also makes an insightful point on past a stems, never before directly
addressed in the literature: the Preterit of mand-é (inf ) is mándim ‘I stayed’,
and nïšt-é (inf) is nḯštim ‘I sat’, but the ‘1 sg preterit of rasé is not *rásim, but
rasáym [‘I arrived’].’ (Kaye 2013: 205).
5. The noun and the noun phrase
5.1. Nominal morphology in Caspian
5.1.1. Case
The status of case in Caspian languages is a thorny issue. There are two opposing
views, one supporting, and one disclaiming, the existence of formal cases. My own
opinion favors the non-case analysis, especially based on points 5, 6, and 8.
The case for case

1. An Oblique (Windfuhr 1992: 29) or Genitive (Lecoq 1989: 304) case encodes
possessor and adjectival adjuncts. In favor of this analysis is the interpretation
of the unstressed -ə ~ -e as derived from an old Oblique case. As other support,
Vafsi may also mark adjectival modifiers with an Oblique case (ex. 27).
2. Singular SAP pronouns in Lāhijāni have special Objective case forms, derived
from the original =(r)a grammaticalized into a suffix a (see Table 5B).
3. SAP pronouns in Caspian, as well as 3sg for Mazanderani, have special pos-
sessive forms (see Table 5B) potentially classified as the Genitive case, indi-
cating the possibility of a case system in Caspian languages.
The case for non-case

4. As the “Reverse Ezafe” (REZ) (Stilo 2001) is used with possessors, adjectives,
and most postpositions, it looks more like a general linker than a case form. The
Ezafe is head-marking and right-branching, the opposite of the Reverse Ezafe.
5. The Oblique plural case is usually fusional, i. e., Oblique and Plural in one,
but in Caspian languages these categories are agglutinative: plural + REZ >
-on-ə.
6. The most likely candidate for the origin of -ə is the -æ linker (§5.5.2), an areal
feature: Asālemi sə̀r-æ ǽsb ‘red horse’, Koluri šàl-æ dærzen ‘shawl needle’,
Cauc. Tat (SWI): túrš-æ nu ‘sour bread’ (Grjunberg 1963: 156), etc. WG
mostly realizes an original æ as -ə: mədrəsə ‘school’. The Maz. REZ -ɛ may
also derive from æ as ɛ often corresponds to the æ-vowel of other languages
and the ə of Gilaki.
7. All forms of Gilaki also have the Ezafe, a linker of postposed adjuncts to a
head noun (mostly in borrowings from Persian, but not exclusively). Likewise,
the Ezafe clearly does not derive from a diachronic case form.
8. Since the =ræ enclitic attaches to the right-hand margin of the NP, in the right-
branching Ezafe pattern it can be completely removed from the head noun.
9. Pronominal object marking is more similar to the enclitic =ra after nouns:
WG: mə́=ra >EG m-a, WG tə́=ra >EG t-a, Lāhij. xonə́ ‘house’ > xon-á (+ ra ).
10. The Genitive of SAP pronouns may simply be seen as a possessive particle.
692 Donald L. Stilo
5.1.2. Caspian noun and pronoun morphology

Nouns in Caspian lack grammatical gender. Except for occasional Arabic plurals,
there is one plural formant, derived from Caspian *an. Yoshie (1996: 14) gives a
regular plural formant -(h)a, probably recently borrowed from Persian, while the
expected plural is “hardly employed” in Mazanderani. Shokri (1995) gives both
types.
Table 5A: Gilaki and Mazanderani nominal morphology

Singular Lāhijāni Sāravi
Base gul zak xonə́ berar damad ketab
Object. gúl=a zák=a xon-á berár=(r)e damad=(r)e ketab=(r)e
Possess. gúl-ə zák-ə xon-é berár-e damad-e ketab-e
Plural
Base gul-ón zak-ón xon-án berar-ún damad-a ketab-a
Object. gul-ón=a zak-ón=a xon-án=a berar-ún=(r)e damad-a=re ketab-a=re
Possess. gul-ón-ə zak-ón-ə xon-án-ə berar-ún-e damad-a(-e) ketab-a(-e)
‘flower’ ‘child’ ‘house’ ‘brother’ ‘son-in-law’ ‘book’
In Central Caspian the plural formant is an unstressed -ešan/ešon: Rāmsari dar/

dárešon ‘tree/s’, Tonekāboni kija/kijášan ‘girl/s’, Kelārdashti rika/rikašon ‘boy/s’.
This is one of the major isoglosses establishing Central Caspian as a separate
group.
Table 5B: Gilaki and Mazanderani pronominal morphology

Lāhijāni pronouns Sāravi pronouns
Nominative Objective Possessive Nominative Objective Possessive
1sg mu, mo m-a mi men me=re me
2sg tu t-a ti tu te=re te
3sg .prox i(n) ín=a ín=ə ve ve=re ve
3sg .dist u(n) ún=a ún=ə
1pl amə, amo amə=ra ame ema ema=re æme
2pl šəmə, šəmu šəmə=ra šime šema šema=re šeme
3pl .prox išón išón=a išón=ə vešún vešún=re vešún-e
3pl .dist ušón ušón=a ušón=ə
The Lāhijāni long forms (in, un) appear when a following suffix or enclitic begins
with a vowel. The short forms (i, u) also function as demonstrative adjectives:
(01) tin-i i kar=a me=bə bú-kon-i? i ti pəsér=ə?
can-2 s this work-ra I=for pu -do-2 s this your son=cop :3 s
‘Can you do this thing for me?’ ‘Is this/he your son?’
In Mazanderani and most of Central Caspian we find the only languages in a vast
area of Iran to have 3rd person pronouns that are not synchronic demonstratives
(ve/vešún, etc.) These forms are used for both animates and inanimates. Kelārdašti
and Tonekāboni have both types, while Rāmsari is outside the isogloss and seems
to have only synchronic demonstratives as 3rd person pronouns.
(02) Sāravi
ve še zen-á=re kúš-en-e. ve gusfén-e gušt=e.
he own wife-pl = ra kill-pr -3 s it sheep-rez meat=cop :3 s
‘He kills his wives.’ (SY: 54) ‘It is mutton.’ (SY: 68)
5.1.3. The uses of the objective marker =ra

The enclitic =ra flags (specific) Direct Objects (see Alignment, and DOM, §7.1),
Indirect Objects (§7.5), Experiencers (§6.7), and other Dative-like usages (§7.6).
Nouns/Third person pronouns

(03) Lāhijāni: Definite direct object
mu diruz hæsə́n=a bə́-de-m.
I yesterday P.N.=ra pu -see:pt -1 s
‘I saw Hassan yesterday.’
Lāhijāni: Indirect Object
mu hæsə́n=a pul há-da-m.
I P.N.=ra money pvb -give:pt -1 s
‘I gave Hassan some money.’
(04) Sāravi: Definite Direct Object (SY: 68)
šeter gušt=re koje=jem bǽ-it-i?
camel meat-ra where=from pu -take:pt -2 s
‘Where did you get the camel meat from?’
Sāravi: Experiencer (SH: 149)
æslaš me=re særd næ-vu-n-e.
at.all I=ra cold neg -become-3 s
‘I absolutely never get cold.’
694 Donald L. Stilo
5.2. Nouns and pronouns in Tatic: case, gender, number
5.2.1. Noun morphology

As mentioned (§4.2), almost all Tati languages and dialects have retained a two-
term (Direct/Oblique) nominal case system. In half of the Tati group, gender has
also been retained in the singular, while it is lost in the rest of Tatic. The only
languages to have lost the case system but still retain gender distinctions are Raza-
jerdi and Alviri. Both Tatoid groups have lost all original traces of both case and
gender (but see §5.4.4).
Table 5C: Tatic languages distinguishing gender

V afsi D ikin M arāqei K afteji C hāli
Masc. Fem. I Fem. II Masc. Fem. Masc. Fem. Masc. Fem.
dir . sg . æsb-Ø kǽrg-æ kelj-é xordǽ kellég(-an) mærdǽk-Ø kelék-æ bær-Ø
xwák-æ
ob . sg . ǽsb-i kærg-é kelj-í xordég-i kelleg-ǽ mærdǽk-ə kelek-ə́ bǽr-e
dir . pl . ǽsb-e kǽrg-e keljí-e xordeg-ín kelleg-ín mærdǽk-ə kelék-ə bǽr-e xwák-e
ob . pl . æsb-án kærg-án kelji-án xordeg-ón kelleg-ón mærdæk-ón kelek-ón bær-ó(n) xwák-ó(n)
‘horse’ ‘hen’ ‘girl’ ‘child’ ‘girl’ ‘man’ ‘girl’ ‘door’ ‘spade’
7 distinct forms 6 distinct forms 5 distinct forms 4 forms
Table 5D: Tatic languages not distinguishing gender

T ati T alyshi
Khoini Kelāsi Kalāsuri Leriki Viznei Asālemi Māsulei
dir .S g . pǽs mærdǽk-Ø kelék-Ø pæs kag-Ø kag-Ø kærg-Ø kærg-Ø
ob .S g . pæs-é mærdǽk-e kelék-e pæs-í kag-í kag-í kærg-í kærg-í
dir .P l . pæs-é pæs-é kag-é kærg-é kærg-én
mærdæk-ón kelek-ón kag-ón
ob .P l . pæs-án pæs-ún kag-ón kærg-ún kærg-ón
‘sheep’ ‘man’ ‘girl’ ‘sheep’ ‘hen’ ‘hen’ ‘hen’ ‘hen’
3 distinct forms each 4 forms 3 forms 4 forms 4 forms 4 forms
5.2.2. Pronoun paradigms

SAP pronouns and the interrogative ki ‘who’ have special pronominal strategies
for Genitive formation (§4.2, Pt. 13). The Genitive forms of pronouns in Vafsi and
some other Tati types have totally subsumed the original Oblique forms and their
functions.
Table 5E: Vafsi SAP pronouns and ‘who’; Set1 and Set2
Set1a Set1b Set2A Set2B
Direct Oblique (Direct) (Copula) (Oblique) (Prefixes)
1 sg æz tæ-men -om(e) =im(e) =om im-
2 sg tæ es-dæ -i =i =i i-
3 sg . prox in t-in-í* -e ~ Ø =e =es is-
3 sg . dist an t-an-í*
1 pl awan ta-wan -am(e) =am(e) =oan oan-
2 pl soan Ø-soan -a =a =ian ian-
3 pl . prox ín-e t-in-án* -énd(e) =énd(e) =esan isan-
3 pl . dist án-e t-an-án*
‘who’ ki t-e-g-é
*(3sg /pl Oblique alternates: in-t-in-í, an-t-an-í, an-t-an-án, in-t-in-án)
Table 5F: Leriki SAP pronouns and ‘who’; Set1 and Set2
Set1A Set1B Set2
Direct Obliq. Accus. Genitive (Direct) (Copula) (Oblique)
1 sg az mï(n) mï-ní čï-mïn -ïm =im =ïm
2 sg tï tï tï-ní ïš-tï -ïš~ïž =iš~iž =e
3 sg æv æy (æy-í?) č-æy -ï =e =ïš~ïž
1 pl æmæ æmæ-ní č-æmæ -ïmon =imon =ïmon
2 pl šïmæ šïmæ-ní šïmæ -ïšon~ïžon =išon~ižon =on
3 pl ævon ævon-í č-ævon -ïn =in =ïšon~ïžon
‘who’ ki ki čï-ki
Leriki/Northern Talyshi (maximum: 4-term, 1sg )
Table 5G: Hezārrudi pronouns (maximum: 5-term, 1sg , 3pl )

Subject Pres/Obj Past/Obj Poss Past/Ag. Nº of forms
1 sg æz ædem æčem čem(e)(n) men 5
2 sg tæ ætæ ætæ eštæ tæ 3
3m o æjo o jo o 3
3f oæ æjo oæ jo o 3
prox æm æjéme æm jeme æm 3
3f ǽmæ æjemé ǽmæ jeme ǽmæ 3
1 pl æma æčema æčema čema æma 3
2 pl šema æšema æšema šema šema 2
3 pl o æjuan~æjan oe juan~jan oan 5
696 Donald L. Stilo
Table 5H: Kafteji (maximum: 4-term, 1sg , 3sg )

Subject: Pres/Obj Past/Obj Poss Past Agent
1 sg æz čəmən čəmən~čem čəmən~čem mən
2 sg tæ štæ štæ štæ tæ
3m a æja jan ja ja
1 pl ama čəba čəba ? ama
2 pl šoma/šəba šəba šəba ? šoma
3 pl a jon æjon, jon jon jon
‘who’ ki ? čiki čiki ki, čiki
Table 5I: Chāli (maximum: 3-term, 1sg , 3sg .m / f , 3pl )

Subject Pres/Obj Past/Obj Poss Past Agent
1 sg æz čemen čemen čemen men
2 sg tæ eštæ eštæ eštæ tæ
3m æ jæy jæy jæy æy ~ ay ~ yi
3f ǽyæ jǽyæ jǽyæ jǽyæ ǽyæ ~ yæ
3 pl æy jæyǽ jæyǽ jæyǽ æyo(n) ~ yo(n)
‘who’ ki, ke eške, eški eške, eški
5.3. The uses of the cases and Set1/Set2 PAMs in Tatic languages
Two-term case systems typically exhibit rather high polyfunctionality (Arka-
diev 2008). Tati dialects are a typical example of this phenomenon. Table 5J
demonstrates the major functions of the Direct and Oblique cases. Various minor
usages, e. g., Oblique marking of temporals (see ex. 109), have been omitted.
Table 5J: Uses of the cases and Set1 Set2 in Tatic

Direct Case Set1 Oblique Case Set2
Subject, all Intransitives + + - -
Agent, Present system + + - -
Agent, Past system (+) (§7.1.3.1)1 - + (§7.1.3.1)1 + (§7.1.3.1)2
Non-Salient Patient + (§7.1) -? - -
Salient Patient, Present - - + (§7.1.3) + (§7.3)1
Salient Patient, Past + (§7.1.3)1 + (§7.3)3 ± (§7.3) ± (§7.3)
Recipient/Indirect Object - - + (§7.5) + (§7.5.1)4
Experiencer - - + (§6.7) + (§6.7)
Possessor/Genitive - - + (§§) + (§7.3)
Adjunct of Adposition ± - ± +
Goal (no Adposition) + (§9.1) - (+)1 -
Locative (no Adposition) (+)2 - (+)1 -
1. occasional pattern; 2. for most lgs; 3. for some lgs; 4. minor pattern
5.4. Possessor constructions: reverse Ezafe, Ezafe, oblique case

Possessors generally precede the head in Caspian and most of Tatic using the REZ
and the Oblique case, respectively. Ongoing Persian right-branching influence on
Gilaki, however, has now created a split typology based on animacy: preposed pos-
sessors marked with the REZ are predominately animate, while many inanimate
possessors are postposed and marked with the head-marking Ezafe construction.
5.4.1. Caspian: REZ and Ezafe marking of nominal possessors

Of particular interest are the results of a frequency count of the REZ vs. Ezafe
with Animate vs. Inanimate nouns in three Caspian varieties: texts in Rashti (Ras-
torgueva 1971), Sāravi (Yoshie 1996), and Kordkheyli Mazanderani (Borjian and
Borjian 2007a, 2007b, 2008). Rashti shows a preference for the REZ with ani-
mates (REZ: 90.1 %) vs. the EZ with inanimates (REZ: 87.5 %). Mazanderani,
however, presents quite a different picture. While there is a clear preference for
the REZ with both animate nouns (REZ: Sāravi: 97.7 %/Kordkheyli 100 %) and
inanimates (REZ: Sāravi: 65.2 %/Kordkheyli: 87.5 %), the use of the EZ with inan-
imates seems to be only incipient here.
Rashti
(05) Animate Possessors: Reverse Ezafe (Possessor-HEAD)
məryə́m-ə per iskəndə́r-ə harf-án=a gúš kun-è.
P.N.-rez father P.N.-rez speech-pl = ra ear lv -3 s
‘Maryam’s father listens to Iskandar’s words.’ (RR: 273, 118)
(06) Inanimate Possessors: Ezafe (HEAD-possessor)
abul púl-ə nuγrə az xu refeγ akbər fa-gíft-ə.
P.N. money-ez silver from self friend P.N. pvb -take:pt -3 s
‘Abul took silver coins from his friend Akbar.’ (RR: 264, 21)
(07) Inanimate Possessors: Reverse Ezafe (Possessor-HEAD)
nə́-tanəst-i-Ø xu díl-ə gə́b=a iskəndə́r=a bə́-g-ə.
neg -can:pt - imf -3 s self heart-rez speech=ra P.N.=ra pu -say-3 s
‘She couldn’t tell Iskandar her heart’s desires (lit: words)’ (RR: 277, 219)
(08) Lāhijāni
Animate Adjunct (REZ) Inanimate Adjunct (REZ)
i asp-ə sum tu istidad-ə zabon=æ da-n-i.
this horse-rez hoof you talent-rez language=ra have-pr -2 s
‘this horse’s hooves’ (NJ,47) ‘You have language talent.’
698 Donald L. Stilo
(09) Sāravi
ša-e xæzun-e bar=e Ø-værd-e.
shah-rez treasure-rez load= ra imf -take:pt -3 s
‘He would take the load of the Shah’s treasure (to the bazaar).’ (SY: 68)
dæ-ket-e gol-e báγ-e dele.
pvb -fell-3 s flower-rez garden-rez in
‘She fell into a bed of roses (lit: a flower garden).’ (SY: 58)
5.4.2. Tatic: case marking of nominal possessors

Tatic varieties also have an adjunct-marking pattern in which an Oblique case-
marked Possessor precedes the head. While Tatic may have some Ezafe tokens
with postposed Possessors, there is no clear animacy-based split here (but see
Vafsi below).
(10) Leriki
hakïm-i söhbæt bæ-pe čok bḯ-bu-Ø.
P.N.-os talk pr -must good sbj -be-3 s 1
‘Hakim’s talk should go well.’
sef-í donæ, zəməston-í donæ
apple-os seed watermelon-os seed
‘apple seeds, watermelon seeds’
(11) Asālemi Koluri
čin-i ša ešte pi-ær nom če=yye?
China-os shah you:poss father-os name what-cop :3 s 1
‘the king of China’ ‘What is your father’s name?’
(ASA: 88)
(12) Khoini (Central Tati enclave) Dikin Marāqei
pæs-é pæšmæ æ killeg-ǽ nom či=æ?
sheep-os wool that girl-of name what-cop :3 s 1
‘the sheep’s wool’ (YX: 169) ‘What is that girl’s name?’
The Harzani Oblique case is restricted to Possessor-marking (cf. Align-
ment, §7.1.1):
(13) hæsæn-i kætæb badam-i raz
P.N.-os book almond-os orchard
‘Hassan’s book.’ (AAK:108) ‘almond orchard.’ (AAK:108)
5.4.3. The Vafsi possessive constructions

Vafsi has three main strategies with one subtype available for adnominal posses-
sion:
1) In the Preposed Genitive Construction I, used mostly for animate possessors,
the adjunct noun/pronoun is in the Oblique case and precedes the head noun:
(14) Animate nouns (predominant)
yey tajere-y bær dokán-i v-e.
one merchant-om door shop-om cop : pt -3 s 1
‘It was a door of a merchant’s shop.’
Inanimate nouns (minor)
bǽr-i taxtæ or-éysdæ=v-e.
door-om plank pvb -arose=aux : pt -3 s 1
‘The door plank had come up.’
Preposed Genitive Construction II: The above construction is occasionally rein-

forced by an enclitic possessive after the head noun, yielding a circumposed pos-
sessive:
(15) zell-í bera=s b-áw-e.
woman-of 2 brother=3 s 2 pu -come:pt -3 s 1
‘The woman’s brother came over.’
in kelj-í dél=es bǽ-šg-i-a-Ø.
this girl-of 2 heart=3 s 2 pu -break- pass - pt -3 s 1
‘This girl’s heart broke.’
2) The Right-branching “Vafsi Ezafe” and postposed nominal adjuncts

Reminiscent of Kurmanji, the Vafsi right-branching Ezafe Construction is both
head- and adjunct-marked: the head has an Ezafe suffix followed by an Oblique-
marked adjunct, but the ezafe itself is invariable and lacks the gender distinctions
of Kurdish:
(16) bædbæxti-e və́ws-i róbb-e kudu-é
misfortune-ez Vafs-om paste-ez pumpkin-of
‘the misfortune of Vafs’ ‘pumpkin butter’
3) The Postposed Genitive Construction, generally found with non-human
(mostly inanimate) adjuncts, has a postposed possessor in the Oblique case
but lacks an Ezafe:
700 Donald L. Stilo
(17)lášæ ešgár-i čera mašín-i bùæ kæbáb-i

carcass deer-om light car-om smell kebab-om
‘deer carcass’ ‘car headlights’ ‘the smell of kebab’
5.4.4. Other Tatic strategies

In Alviri, to the northeast of Vafsi, possession is exclusively right-branching with
an invariable Ezafe suffixed to the head noun linking it to the postposed Possessor:
(18) Alviri (YA: 179)
pur-e hæsæn e-væz-æ. tete-y hæsæn e-væz-iæ.
son-ez P.N. dur -run-3 sm 1 daughter-ez P.N. dur -run-3 sf 1
‘Hassan’s son is running.’ ‘Hassan’s daughter is running.’
Juxtaposition: left-branching, Ø-marking of both Head and Adjunct
(19) Razajerdi
dar šax qæmbær ærösi
tree branch P.N. wedding
‘tree branch, branch of the tree’ ‘Qambar’s wedding’
The Possessor precedes the head in Rudbāri and in some Tāleqāni varieties. The
Rudbāri possessor is marked with the REZ while Tāleqāni has a marker that resem-
bles the Oblique case form of other Tati varieties, but it is used agglutinatively in
the plural and with pronouns, which indicates it is more likely a Reverse Ezafe:
(20) Tāleqāni: Owrāzāni (AA1: 87)
boz-an-i šir pa-i bon
goat-pl - rez milk foot-rez bottom
‘the goats’ milk’ ‘sole of the foot’
Gurāni
mə́n-i xanæ gusfǽnd-i sær
I-rez house sheep-rez head
‘my house’ ‘the sheep’s head’
(21) Rudbāri: Tutkāboni
i dár-e bul-an aγoz-e pust
this tree-rez branch-pl walnut-rez skin
‘this tree’s branches’ ‘walnut shell’
Jubani
gorg-e zay šiær-e nom
wolf-rez child husband-rez name
‘wolf(’s) cub’ ‘husband’s name’
5.5. Adjectival modification
5.5.1. The reverse Ezafe, the Ezafe, and Ø marking (juxtaposition)

Jahāngiri (2003: 52) states that no linker morpheme is used in Lāhijāni with adjec-
tives, but many of his examples are exceptions to this rule in that they use the REZ.
(22) Lāhijāni, left-branching, no REZ
pəsəronə́ juraf-on xorə́m zak kujə́ pəsə́r
boy:adj sock-pl good boy small son
‘boys’ socks’ (NJ: 109) ‘good child’ (NJ: 62) ‘younger son’ (NJ: 143)
but with REZ:
(23) sə̀rd-ə áw nə̀r-ə gúrg kàl-ə mivə́
cold-rez water male-rez wolf unripe-rez fruit
‘cold water’ ‘male wolf’ ‘unripe fruit’
(NJ: 162) (NJ: 104) (NJ: 145)
The right-branching Ezafe construction also occurs with adjectives in Lāhijāni. (In
actuality most of Jahāngiri’s example sentences show Head-ez Adjective). Sāravi
is more consistently Adjective-HEAD, with a Reverse Ezafe on the adjunct:
(24) Sāravi, left-branching (SH: 148)
inje čæn=ta gæt-e dikun va-bæ-yy-e.
here a.few=unc big-rez shop pvb -become- pt -3 s 1
‘A few big shops opened up here.’
Lāhijāni, right-branching (NJ: 103)
i dirəxt-ə jəngəl=e. (< jəngəli=ə)
this tree-rez forest=cop :3 s 1
‘This is a forest tree’
Vafsi
The adjective in Vafsi is simply juxtaposed after the head noun. In the Direct sin-
gular, the adjective agrees in gender with the noun, but plurality and other cases
are only marked once, on the adjective, while the head noun remains invariable:
(25) Masculine: ‘new broom’
Direct Sg. Oblique Sg. Direct Pl. Oblique Pl.
sizdæ nu sizdæ nú-i sizdæ nú-e sizdæ nu-án
broom new broom new-om broom new-dp broom new-op
702 Donald L. Stilo
(26) Feminine: ‘new shovel’

Direct Sg. Oblique Sg. Direct Pl.
fí-æ nú-æ fí-æ nu-é fí-æ nú-e
shovel-df new-df shovel-df new-of shovel-df new-dp
Oblique Pl.
fí-æ nu-án
shovel-df new-op
Adjectival modification occasionally uses the Vafsi Ezafe, including Oblique
marking of the adjunct. This pattern clearly must be a recent copy from Persian
since Vafsi has not innovated a way for the Ezafe to be combined with feminine
head nouns.
(27) tæbæqe-ye zæíf-i in dúsd-e mehræbán-i=m=e.
class-ez weak-om this friend-ez kind-om -1 s 2=cop :3 s 1
‘the weak(er) class (of society)’ ‘He is my kind friend.’ (VF: B5,3)
5.5.2. General linker -æ: adjectival modification and noun-noun compounds

The unstressed General Linker -æ, suffixed to the preposed adjective or noun
adjunct is a feature common to most of Tatic but is an areal feature that also
encompasses Caspian languages. (See also §5.1.1, point 6 for the Caspian forms).
(28) Leriki
Adjectival modificaton
šàt-æ dó bìž-æ nečí
crooked-link tree sly-link wolf
‘crooked tree’ ‘crafty wolf’
Possessive or Noun Compounding
kàg-æ pór bï̀ z-æ mú
hen-link feather goat-link hair
‘chicken feathers’ ‘goat hair’
(29) Asālemi (ASA: 90)
sə̀r-æ ǽsb čàk-æ zuǽ mə̀s-æ patíl
red-link horse good-link son copper-link pot
‘red horse’ ‘good son’ ‘copper pot’
sìf-æ dár
apple-link tree
‘apple tree’
(30) Māsulei
rùk-æ berá pìll-æ berá
small-link brother big-link brother
‘younger brother’ ‘older brother’
ræzìn-æ galəš bə̀rz-æ kis-on
rubber-link galosh rice-link sack-pl
‘rubber galoshes’ ‘rice sacks’
(31) Kalāsuri Kelāsi Gurāni
kòy-æ usmún miòn-æ xó jòwz-æ dár
blue-link sky middle-link sister walnut-link tree
‘blue sky’ (KL: 272) ‘middle sister’ ‘walnut tree’
Razajerdi
æsìf-æ derǽxt
apple-link tree
‘apple tree’
Vafsi also has this linker but it is no longer productive and is only found in frozen
forms: dèlæ-dǽrd ‘stomach ache’, nèræ xǽr ‘male donkey’.
5.6. Singular and plural vs. definiteness categories

Generic and Indefinite nouns are generally not pluralized but remain in the base
form. Also see Numeral Classifiers (§5.8) for another use of the singular form of
nouns. (See also sentences 9, 47, 86, 89, 129, 174, 183, 184, and 266 for more
examples.)
(32) Sāravi Lāhijāni
hæmišeg bakele=o mærji xær-im-be. sib bə́-kašt-im.
always bean=and lentil buy-pr -1 s 1 apple pu -plant:pt -1 p
‘I always buy beans and lentils.’ ‘We planted apples.’
(SY: 139) (NJ: 46)
(33) Leriki
kag kižæ bárd-æ-Ø.
chicken chick carry-loc -3 s 1
‘It carries off chickens and
chicks.’
Vafsi
vəws sí-æ dir-e, viz dir-e.
P.N. apple-df have-3 s 1 walnut have-3 s 1
‘Vafs has apples, it has walnuts.’
704 Donald L. Stilo
Generally only definite count nouns are pluralized in these languages:

(34) Lāhijāni
Indefinite/Generic
pərviz xəyli gusfənd da-n-ə.
P.N. much sheep have-pr -3 s
‘Parviz has lots of sheep.’ (NJ: 130)
Definite
gusfənd-on=a dú-kun təvile men.
sheep-pl = ra pvb -do stable:ez inside
‘Put the sheep in the stable.’ (NJ: 153)
(35) Sāravi
zæmin=re kel há-kon, bæ:d tim dǽ-šend.
land=ra plow pvb -do then seed PVB-sow
‘Plow the land first then sow seeds.’ (SH: 148)
gosfen-a=re vær-n-e čera.

sheep-pl = ra take-pr -3 s pasture
‘He takes the sheep to pasture.’ (SH: 149)
(36) Leriki
muzikant=æn hïst b-e?
musician=also exist cop : pt -3 s 1
‘Were there musicians, too?’
æ:la b-in čæy muzikant-on.

great cop :3 p 1 he-poss musician-pl
‘They were great, his musicians’
5.7. Adpositions
Caspian and most Tatic languages basically have left-branching NPs, including
a predominance of postpositions. Given the heavy bilingualism with Persian and
the socio-economic impact from Tehran throughout the Caspian, however, many
right-branching patterns, such as prepositions, are entering these languages and
are clearly becoming integrated into their grammars.
(37) Postpositions in Lāhijāni
hasən baγ-ə mien ísa-Ø. ali deraxt-ə sər ísa-Ø.
P.N. garden-ez inside be 3-3 s P.N. tree-EZ on be 3-3 s
‘Hassan is in the garden.’ ‘Ali is up in (lit: on) the tree.’
(38) Prepositions commonly used in Lāhijāni

jujə lá-ye pələ hánna. xa bə́-dar-i ta paiz.
chick within-ez pilaff be 5 must pu -hold-2 s until autumn
‘There is chicken in the pilaff.’ ‘You must keep (it) until autumn.’
(NJ: 47)
(39) Sāravi Postpositions (SY: 62)
e-m-be šeme nal-e sær.
come-pr -1 s you.pl : poss porch-ez on
‘I’ll come onto your veranda.’
b-úr-em æt-ta kar-e dembal.
pu -go-1 s one-unc work-ez behind
‘Let me go after some job.’
(40) Kafteji Postpositions
čél-æ ke dəlæ neh-áya.
spindle-df house inside be 5-3 sf 1
‘The spindle is in the house.’
æm til kovrá=ræ b-om-é?
this mud where=from pu -come:pt -3 s 1
‘Where did this mud come from?’
(41) Dikin Marāqei Postpositions
sǽng=em-an dæ-fú-an čah-ǽ mion.
stone-1 s 2- fem pvb -threw-3 sf 1 well-of inside
‘I threw a stone (f.) into the well.’
æsíf=em-an a-gáti-an zemin=de.
apple-1 s 2- fem pvb -took-3 sf 1 land=from
‘I got an apple (f.) off the ground.’
5.7.1. Mixed adpositional typology

Vafsi is located in an intersection zone between a postpositional isogloss to the
north and a prepositional isogloss in the south (see Stilo 2009, 2012, see also Haig,
this volume, chapter 2.3, §3, on similar phenomena across Northern Kurdish). It
has a very mixed adpositional typology: postpositions, prepositions, circumposi-
tions, as well as doublets (ex. 42), that is, one adposition that doubles as postposi-
tion or preposition, e. g., Locative vs. Allative, respectively:
706 Donald L. Stilo
(42) Locative (Postposition)

kúg-æ otaq=dæ æd-oss-e.
cat-df room=at dur -sleep-3 s 1
‘The cat sleeps in the room.’
Allative (Preposition)
kúg-æ bǽ-sse-Ø dæ benin otaq.
cat-df pu -go:pt -3 s 1 to inner room
‘The cat went into the back room.’
(43) (Circumposition)
kúg-æ sær míz-i=dæ hott-e
cat-df on table-om =at sleep:stat -3 s 1
‘The cat is asleep on the table.’
(Compound Preposition)
kúg-æ bǽ-væst-e o sær míz-i.
cat-df pu -go:pt -3 s 1 to on table-om
‘The cat jumped onto the table.’
The examples below show Leriki Prepositions, Postpositions, and Circumposi-
tions:
(44) sabir væy b-e dï štæ žen-í.
P.N. there cop : pt -3 s 1 with self wife-om
‘Sabir was there with his wife.’
čoštæ=m pat=e bo aγl-on.
breakfast=1 s 2 cook:pt = aux for child-pl
‘I cooked breakfast for ‘the guys’.’
(45) tï máštæ mæktæb=ædæ=ž bæ-b-e?
you tomorrow school=at=2 s 1 all -be-inf
‘Will you be at school tomorrow?’
kæ-y kum=o e-vašt-im.
house-ob roof=from pvb -jump:pt -1 s 1
‘I jumped off the roof.’
(46) æv čï kæ-y=ku be-š-e-Ø.
he from house-ob =from pvb -go:pt -3 s 1
‘He went out of the house.’
æv čï šæhær =o omæ-Ø.
he from city=from come:pt -3 s 1
‘He came from the city.’
5.7.2. Adpositional polysemy

Note that in many Tatic and Caspian languages certain basic adpositions are poly-
semous, occasionally even with opposite meanings, e. g., Ablative-Allative:
(47) Sāravi
=je Allative
hæme jur kar=je Ø-resi-ne.
all kind work=je imf -arrive:pt -3 p
‘They would get to all kinds of work/
They would see to all …’ (SY: 58)
=je Ablative
sere=je dær n-uri.
house=je out neg -come:imper
‘Don’t come out of the house.’ (SY: 66)
(48) =je Comitative
hæsen hosæyn=je košti ger-n-e.
P.N. P.N.=je wrestling take-pr -3 s
‘Hassan wrestles with Hosseyn.’ (SH: 138)
(49) Karani (YSFN)
=ku Locative
xav-e=m=ku do glæ=m æsp b-ind.
sleep-om =1 s 2= ku two unc =1 s 2 horse pu -see: pt
‘I saw two horses in my dream.’
=ku Ablative
pe=m Māsāl-e=ku a-gerdes.
father=1 s 2 P.N.-om = ku pvb -wander:pt
‘My father returned from Māsāl.’
(50) =ku Allative
dé-ček-amen=eš kæ bar-e=ku.
pvb -stick-caus =3 s 2 house door-om = ku
‘Stick it on the door of the house.’
Ø-marking of Allative
i ruj be-še-Ø ku.
one day pu -go:pt -3 s 1 mountain
‘Once he went to the mountain.’
On the other hand, the same sense may be expressed by various synonymous adpo-
sitions, often with the same adpositions that are polysemous.
708 Donald L. Stilo
(51) Leriki
ïm kæ kærpič=ædæ=y.
this house brick-ædæ =cop :3 s 2
‘This house is (made) of bricks.’
ïm kæ čï kærpič=o=y.
this house from brick=from=cop :3 s 2
‘This house is (made) of bricks.’
5.8. Numeral classifiers

West Iranian languages have simple Numeral Classifiers systems, with only two
or three common classifiers after numerals and the quantifier čænd, ‘a few; how
many?’ (see also Stilo 2018a). These classifiers also involve a separate issue: the
use of singular forms of nouns after numerals. Classifiers in this area consist of
two basic types:
1) Universal Numeral Classifier (UNC) used for most nouns (including human):
a) Most Tati languages have the form dane, danæ ‘seed, grain/kernel’.
b) All Talyshi dialects and some neighboring Tati dialects use g(ə)læ, a
UNC, also meaning ‘seed, grain’.
c) Caspian languages use only =ta ~ =tæ throughout, irrespective of the
animacy of the noun or the head numeral/quantifier.
2) S. Talyshi and much of Tati also have the Human Numeral Classifier (HNC)
næfær ‘person’, but the preferred choice for human nouns is still the UNC
listed above. The HNC is rare in Caspian languages.
Numeral Classifiers are generally optional in most of the languages under dis-
cussion. In Caspian languages and probably most of Talyshi, however, the use of
the unc seems to be obligatory, even with human nouns. The numeral ‘one’ also
functions as the indefinite article and the UNC is also commonly used with ‘one’
in this function.
(52) Lāhijāni: UNC ta
do=ta čere-ye jædid íyn-əm. yek=ta lako bə́-ma-Ø.
two=unc face-ez new see-1 s one-unc girl pu -come:pt -3 s
‘I see two new faces.’ ‘A girl came (here).’ (NJ: 52)
(53) Sāravi: UNC ta
in se-ta se*. æt-ta mærdi bi-ye bez dašt-e.
this 3-unc apple one-unc man was-3 s goat had-3 s
‘These three apples’ ‘There was a man (who) had a goat.’
(SH: 72) *(See §3.2 for (SH: 72)
phonology of vowels)
(54) Vafsi: UNC dánæ ~ dané

Inanimates Animates
pež dánæ tofæng do dane keljé dir-om.
five unc rifle two unc girl have-1 s
‘five rifles’ ‘I have two daughters.’
(55) Leriki: UNC g(ə)læ
bæ zo-on=ku se gïlæ sef híst-e.
to boy-pl = ku 3 unc apple exist -3 s
‘Those boys have three apples.’
čïmï čo gïlæ zoæ hḯst-e.
my 4 unc son exist -2 s 1
‘I have four sons.’
In Leriki the UNC with the indefinite marker -y (derived from the numeral i ‘one’)
may function without the numeral as an article:
(56) ïštæ jif=o gïlæ-y dæftær be-várd-æ-Ø.
self pocket=from unc - indf notebook pvb -bring:pt - loc -3 s 1
‘He takes a notebook out of his pocket.’
5.9. Reflexives
In Gilaki and nearby Tati, reflexive pronouns are only used in the 3rd person.
SAP persons use the corresponding case forms of these pronouns (cf. Romance,
German).
(57) Lāhijāni
mu m-a bə i kar adə́t bə̀-da-m.
I I-ra to this work habit pu -give:pt -1 s
‘I’ve accustomed myself to this.’ (NJ: 98)
(58) tu t-a niga nú-kun! xú=š=a bú-kušt-ə.
you you-ra look neg -do self-3 s 2= ra pu -kill:pt -3 s
‘Don’t look at yourself!’ (NJ: 118) ‘He killed himself.’ (NJ: 117)
Mazanderani (še), C. Caspian (xoštə), and most Tatic (VštVn) exhibit one reflex-
ive pronoun for all persons. Tatic varieties that have Set2 possessive enclitics,
e. g., Vafsi, optionally attach them to the reflexive pronoun (cf. Eng. my-self, Pers.
xod-æm).
(59) Sāravi Rāmsari (Central Caspian)
še=re bæzek há-kan! čeræ xoštə=rə zæn-e?
self= ra adorn pvb -do why self-ra hit-2 s
‘Make yourself up!’ (SY: 56) ‘Why are you hitting yourself?’
710 Donald L. Stilo
(60) Leriki
ïštæn-í čï tók=o ayïrmiš bḯ-kæ-m,
self-ob abl current=abl separated sbj -do-1 s 1
‘…in order to separate myself from the (electric) current, …’
Keringāni and Harzani
K: mun hišton yæ=me.
H: mæn ištæn yær=mæ.
I self hit:pt =1 s 2
‘I hit myself.’ (AAK: 85, 107, resp.)
(61) Vafsi
(with Set2) (without Set2)
n-ím-æsdæ ešdén-i=s bǽ-zæn-e. æz ešdén-i r-vin-om.
neg -1 s 2-let:pt self-om =3 s 2 pu -hit-3 s 1 I self-om dur -see-1 s 1
‘I didn’t let him hit himself.’ ‘I see myself.’
Māsulei has mixed typology using both of the above patterns for different functions:
• Direct Objects, all persons: dedicated Reflexive (no Set2 enclitics)
• Adpositions, Possessive, 3rd sg /pl : dedicated Reflexive (no Set2 enclitics)
• Adpositions, Possessive, 1st/2nd sg / pl : Genitive case of SAP Pronouns
(62) æz be əštæn=ni bə́-kəš-əm. tə əštæn mæ-kəš.

I must self=also sbj -kill-1 s 1 you self neg : imper -kill
‘I must kill myself, too.’ Don’t you kill yourself.’
(LM: 38; I: 16) (LM: 38; I: 16)
(63) əčəra čəmən kellæ dæ-benn-əm? a-n əštæn kæ.
why I:poss daughter pvb -tie-1 s 1 come-3 p 1 self house
‘Why should I tie my daughter up?’ ‘They come to their house.’
(LM: 46, II, 33) (LM: 52, IV, 1)
6. Verbs: initial generalities
This section introduces Present vs. Past verb stems, lexical preverbs, negation, and
the formation, diachrony, and usages of the various tenses.
6.1. Present vs. past verbal systems

The set of paradigms based on the present stem (Present, Subjunctive, Imperative)
are called the Present system below and those based on the past stem (Preterit,
Perfect tenses, and usually Imperfect) are called the Past system. The number and
members in each vary by language, e. g., the Talyshi Imperfect belongs to the
Present system.
This division into two systems of tenses is purely morphological and is irrele-
vant for Caspian and Tatoid, but for Tatic it establishes a tense-based split in Intran-
sitive and Transitive conjugations. This split affects the choice of PAMs (§7.3)
and ultimately the alignment of core arguments (§7.1.3.1): Leriki æv vít-e (he:dir
run:pt -3s1) ‘s/he ran’ vs. æy vít=ïž=e (he:obl sift:pt =3s2=aux ) ‘s/he sifted’. The
Present alignment in Tatic is Nominative-Accusative and the Past system may
have Ergative alignment or some vestige of original Ergativity.
6.2. Present and past stem formation

West Iranian languages use multiple strategies for forming the past stems of verbs
and no fixed rules can be set up to derive the past stem from the present. Some lan-
guages have a high preponderance of -d/-t suffixes in the past stem, often accom-
panied by other, perhaps substantial, changes in the present root. Table 6A presents
a small sampling of present and past stems in various Caspian and Tatic languages:
Table 6A: Present and past verb stems in Caspian and Tatic
Lexeme
Harzan
Kafteji
Kering
Koluri
Asālm
Sāravi
Tonkb
Māsāl
Kajal
Chāli
Alvir
Rašti
Vafsi
bring avər hær (y)ar var æ(r) er vær ar or ar or ar
past avərd (y)ærd ard vard ord vard aværd værd ard ord ard ord ard
fall kəf kæ kæf gen gen gin gən læk gen gen gen gen
past kəft kæt kæt/ket geness genesd ginest kkæt gənəst læki kæt kæt kæt kætt
sleep xus xas xas xesa hes fes, het hös xəs xəs xos u-xos xös hoss
past xuft xat xat xet het fesi,het höt xət xət xott u-xott xöt hott
hit zən zæn zæn zæn yæn yæn žæn žæn žæn zæn zæn zæn zen
past ze zæ zu zi yær yi, žiæ že že žænd zzə zæ(n)d zind zæ
say g gu ga vaj ös vuž,vož vaj vaj va vaj (v)aj vaj vaz
past guft gut gut/gat vat öt vut/vot h/vat vat vat vat (v)at vat vatt
take gir gir (g)ir gir ge-n gi(r) gi(r) ger ger gir u-ger gir gir
past gift git (g)it gæt got get(i) gæt gæt get gæ(rt) geræt giræt girætt
In N. Talyshi and some N. Tati there is a merger of many Present and Past stems.
Progressing away from the “epicentre” of this isogloss (N. Talyshi-Kalāsuri/Khoy-
narudi) westward through Keringani to Harzani and southward along the Talyshi
dialect chain, one gradually finds more distinct present vs. past verb roots until we
reach C. and S. Talyshi which fully retain these distinctions. Examples:
712 Donald L. Stilo
Table 6B: One-stem and two-stem verbs in N. Talyshi, N. Tati

1 stem 2 stems
‘cook’ Lr, Ab: pat Ks: pat; Xn: pæt Jk: pej~ž/pæt; Kr: paj/pat; Hz: pæy/pot
‘run’ Lr, Vz: vit; Sy: vit; Xn: vrit Jk: (y)vij/vit; Kr: ver/veri; Hz: veriž/
veret
‘say’ Lr, Ab, Vz: vot Xn: vut; Ks: ut Jk: (v)aj/(v)at; Kr: vož/vot; Hz: ös/öt
‘sell’ Lr: havat; Vz: hæt; Ks: herat Jk: hæš/hæt; Kr: feraš/ferat
‘sit’ Lr: ništ; Ab: nešt; Xn: nešt Jk: nəš/nəšt; Ks: neš/nešt; Kr/Hz: nešin/
nešd
(Lr: Leriki, Ab: Anbarani, Sy: Sayyadlar, Vz: Viznei, Jk: Jowkandāni, Ks: Kalāsuri, Xn.
Khoynarudi, Kr: Keringāni, Hz: Harzani)
Even the dialects at the center of this isogloss keep the distinction in a few verbs:
(Leriki/Anbarani/Viznei) žæn/žæ ‘hit’; (Leriki/Sayyādlar/Viznei) hæ/hard ‘eat’;
stæn/sæ ‘buy, take’; (Jowkandāni) stun/stæ ‘buy, take’; (Kalāsuri) estæ/estær
‘buy, take’, hær/hard ‘eat’; (Keringāni) ta/ti ‘buy, take’, fa(r)/hard ‘eat’, yæn/yi
‘hit’; (Harzani) asda(n)/astar ‘buy, take’, hæn/hord ‘eat’, yæn/yær ‘hit’. (See also
§4.2.1, point 14)
6.3. Lexical preverbs

While preverbs in Tatic and Caspian languages can have a purely directional sense,
they more often serve to create unpredictable lexical extensions or total semantic
shifts in a verb. Lexical Preverbs in Tatic, depending on the language and the
tense, may be separated from the root by mesoclitic Set2 PAMs encoding the A
(and in some cases the P). Languages with prefixed durative TAMs (me-, etc.) or
Allative TAMs that allow preverbs in durative tenses generally place the preverb
before the TAM:
(64) Leriki
o-b-æ-gard-é=m o-æ-gard-í-m
pvb - fut - dur -go.around-inf =1 s 1 pvb - dur -go.around-imf =1 s 1
‘I will return’ (Future) ‘I would return’ (Imperfect)
(65) Kafteji Alviri
a-m-xšuj-əm. a-y-vendær-emæ.
pvb - dur -knead-1 s 1 pvb - dur -stand-1 s 1
‘I knead.’ ‘I stand, stay.’ (YA: 183)
In the Present in Asālemi and Keringāni the preverb attracts the Set1 PAMs (plus
the AUX in Keringani) leftward and it also suppresses the Present bə- (cf. Leriki
Future):
(66) Asālemi
b-æ-gærdest=im > o=m-æ-gærdest.
pr - dur -wander:pt =1 s 1 pvb =1 s 1- dur -wander:pt
‘I go around’ ‘I return’
Keringāni
be-kard=eš=e > u=š=e-kard.
pr do:pt =2 s 1= aux pvb =2 s 1= aux do:pt
‘you do’ ‘you open’
Preverbs with motion verbs usually yield quite transparent lexical results. In the
case of verbs that have no intrinsic directional sense, however, either the preverb,
the verb root, or both may become somewhat semantically bleached:
Table 6C: Lāhijāni lexical preverbs

Base form Meaning Derived forms with Preverbs
xut- sleep fu-xut- hide (intr.)
du-xut- lie in ambush
də-xut- shrink (of a balloon)
va-xut- wake up suddenly, start
git- catch, grab vi-git [> vi-t] pick up
va-git [> væ-yt] light, kindle
ha-git [> hæ-yt] get, buy
če- pick, pluck fi-če gather up
də-če set up, pile up
kašt- plant, sow fə-kašt- line up (trans.)
Vafsi (three preverbs: dær-, or-, har-); keš-/keša-

(67) b-ím-keša (No preverb) ór=om-keša
pu -1 s 2-pull:pt pvb =1 s 2-pull:pt
‘I pulled, dragged’ ‘I weighed; I drew (water from well)’
(68) hár=om-keša dǽr=om-keša
pvb =1 s 2-pull:pt pvb =1 s 2-pull:pt
‘I ground (grain, etc); ‘I struck (match); I drew (line); I set up (wall);
I flattened out (metal)’ I tied up (bundle); I pulled (from middle of
pile)’
714 Donald L. Stilo
Leriki (6 preverbs: dæ-, pe-, e-, o-, be-, sæ-)
Base Root Infinitive Meaning

nïšt nïšt-e ‘to sit’
+ pe- pe-nïšt-e ‘mount (horse), get in/on (car, train, bicycle), ride’
+ dæ- dæ-nïšt-e ‘to bathe (intrans.), swim (go bathing)
+ sæ- sæ-nïšt-e ‘to pursue, run down, drive out, chase, go after’
Gurāni (Tāleqāni): Base form: keš ~ kəš- (1st singular)

(69) mí-kəš-om há-mi-keš-om vé-mi-kəš-om
dur -pull-1 s 1 pvb -dur -pull-1 s 1 pvb -dur -pull-1 s 1
‘I pull’ ‘I stretch out’ ‘I pull on (shoes, etc.)’
vá-mi-keš-om dé-mi-keš-om
pvb -dur -pull-1 s 1 pvb -dur -pull-1 s 1
‘I spread/unfold’ ‘I serve up, dish out’
Many simplex verb roots occur with no preverbs and others never occur without
one:
Vafsi: nešin/nešd (This verb root does not occur without a preverb).
(70) há-nešesde-ym ó-nešesde-ym dǽ-nešesdæ-Ø
pvb -sit:pt -1 s 1 pvb -sit:pt -1 s 1 pvb -sit:pt -3 s 1
‘I sat down’ ‘I mounted (a horse)’‘it alit (of birds); settled’
6.3.1. Lexical preverbs with aspectual functions

Vafsi and Mazanderani only use lexical preverbs in the affirmative of punctual
tenses.
Vafsi: gir-/girætt-
(71) b-ís-girætt. (Preterit affirmative) ór=es-girætt.
pu =3 s 2-take:pt pvb =3 s 2-take:pt
‘S/he caught, grabbed.’ ‘S/he picked up, lifted.’
(72) hár=es-girætt. dǽr=es-girætt
pvb =3 s 2-take:pt pvb =3 s 2-take:pt
‘S/he bought, got (from someone), ‘It started (to rain), (the fire) took;
peeled (fruit, bark), closed (door)’ (the current) got blocked up’
In the Present and the Imperfect and in all negative forms, the preverbs of these
verbs are deleted and the one resulting verb form will have all the above meanings:
(73) ær-gir-e. n-ís-girætt.

dur -take-3 s 1 neg =3 s 2-take:pt
‘S/he catches, grabs, picks up, buys, ‘S/he didn’t catch, grab, pick up,
gets, peels, closes, (fire) takes, buy, get, peel, close (door), etc.’
etc.
6.4. Verbal negation

Negation in the area is usually a simple prefixing of a negator, nV- (with some
vowel harmony but dissimilation in Mazanderani), which always attracts stress
and also suppresses the punctual TAM bV-. Tati has a special Imperative negator,
mǽ (§6.5.2).
(74) Lāhijāni
xó-n-əm > nó-xo-n-əm bə́-he-m > nə́-he-m
read-pr -1 s neg -read-pr -1 s pu -buy:pt -1 s neg -buy:pt -1 s
‘I read > I don’t read’ ‘I bought > I didn’t buy’
(75) Sāravi
xor-m-e > nǽ-xor-m-e Ø-xass-e > né-Ø-xass-e
eat-pr -3 s neg -eat-pr -3 s imf -want-3 s neg - imf -want-3 s
‘s/he eats> s/he doesn’t eat’ ‘s/he wanted > s/he didn’t want’
(76) Vafsi
ær-vin-óm > n-ǽr-vin-om b-ím-diæ > n-ím-diæ
dur -see-1 s 1 neg - dur -see-1 s 1 pu -1 s 2-see:pt neg =1 s 2-see:pt
‘I see > I don’t see’ ‘I saw > I didn’t see’
The negator usually follows a lexical preverb, except for Māsulei where it precedes.
In Vafsi and Mazanderani the preverb is deleted in negation (§6.3.1):
Lāhijāni Koluri Kelāsi Chāli Māsulei
vi-n-gi-n-əm pe-mǽ-gi a-né-m- a-mǽ-di né-pi-ger-əm
pærs=əm
pvb - neg -take- pvb - neg -take pvb - neg - pr - pvb - neg -give neg - pvb -rise-1 s 1
pr -1 s 1 ask-1 s 1
‘I don’t pick up’ ‘don’t pick up’ ‘I don’t ask’ ‘don’t give’ ‘I don’t pick up’
Special cases in negation
In the Asālemi Present negative, the Allative bə- drops leaving the durative -æ
and the negator attracts the aux forward. The parallel N. Talyshi Future has two
forms: 1) the negator is simply prefixed; 2) æ- + (Pres.) stem + Negative:aux
(Miller 1953: 157).
716 Donald L. Stilo
Asālemi Present
Affirmative Negative
No Preverb b-æ-znost=ím nə́=m-æ-znost ‘I know’
Preverb dǽ=m-æ-pærsəst dæ-nə́=m-æ-pærsəst ‘I ask’
Lerik Future
Affirm. b-æ-dó=m ‘I will give’ bæ-sé=m ‘I will get, buy’
Neg. I ní-bæ-do=m ‘I won’t give’ ní-bæ-se=m ‘I won’t get, buy’
Neg. II æ-dǽ=ni-m ‘I won’t give’ æ-stǽ(n)=ni-m ‘I won’t get, buy’
In Talyshi and neighboring Tati, the analytic tenses usually place the negative aux
after the verb stem, an unusual situation within Iranian (see also Lerik Neg. II
above):
(77) Leriki
ički=æn pe-šom-dæ ni-š?
spirits=also pvb -drink-loc neg : aux -2 s 1
‘And you don’t drink?’
geymæt=ïm sæ ni-Ø.
grade-1 s 2 take:ppl aux : neg -3 s 1
‘I haven’t gotten (my) grades.’
(78) Māsulei
æ mærdek-é vat-ǽ ni-æ.
that man-ob say-ppl aux : neg -3 s 1
‘That man hasn’t said (it).’
Koluri
híči=m hærd-ǽ ni-Ø.
nothing-1 s 2 eat-ppl aux : neg -3 s 1
‘I haven’t eaten anything.’
6.5. Verbs: tense formation, diachrony, areal features, and usages
6.5.1. Present tense

The various Present markers in Caspian and Tatic are: A) Ø marking; B) mV,
em-; C) æt; D) Locative markers; E) Allative markers, and F) the -Vn- suffix.
Types B – E are Durative markers used in any given language for both the Present
and the Imperfect. See Progressives (§6.5.8.2) for another Present tense type in
Charozh.
The Ø Present marker forms a small isogloss with languages of four groups:
Western Gilaki, Southern Talyshi, Central Tati, and Rudbāri (Tatoid). It is one of
the main features distinguishing Western Gilaki from Eastern Gilaki. Tonekāboni
and Khoini also exhibit this typology (and both have two Present tenses) but lie
slightly outside the area of this isogloss.:
Contiguous languages belonging to this isogloss
Khoini Tonekāboni
Rashti Māsulei Koluri Tutkāboni Present I Present I
(Gilaki) (S. Talyshi) (C. Tati) (Rudbāri) (C. Tati) (C. Casp.)
1 sg Ø-dan-ə́m Ø-zon-ə́m Ø-zan-ə́m Ø-dan-óm Ø-tæj-em Ø-xár-om
2 sg Ø-dan-í Ø-zon-é Ø-zan-í Ø-dan-í Ø-tæj-i Ø-xár-i
3 sg Ø-dan-ə́ Ø-zon-ə́ Ø-zan-é Ø-dan-é Ø-tæj-ek Ø-xár-e
‘know’ ‘know’ ‘know’ ‘know’ ‘run ‘eat’
6.5.1.1. Languages with the mV- present durative marker

The mV- prefix may be the same as Persian mi-, derived from a particle hamē-
(< *hama-aiva-, Windfuhr 1989: 256), but is unstressed in the languages of this
area.
Kajali Hezārrudi Kafteji Dikin-Marāqei Eshtehārdi
1sg me-mær-ém mi-gen-ém me-gæn-ə́m me-zon-én mi-væz-ém(æ)
2sg me-mær-í mi-gen-í me-gæn-í me-zon-í mi-væz-íš(æ)
3m me-mær-é mi-gen-é me-gæn-ə́ me-zon-ǽ mi-væz-ǽ
3f me-mær-íæ mi-gen-íæ me-gæn-eǽ me-zon-égan ---
‘die’ ‘fall’ ‘fall’ ‘know’ ‘run’
6.5.1.2. Languages with a prefixed æt- marker

Vafsi, Alviri, and Vidari have the durative marker æt-, forming an isogloss extend-
ing southwards to the Persian Gulf (Stilo 2007a: 108). It varies in Vafsi according
to the following consonant. Alviri and Vidari drop the dental before a consonant.
Vafsi: vowel velar/labial alveolar/ Alviri Vidari
palatal
1sg æt-ar-óm(e) ær-gen-óm(e) æd-dó-m(e) e-væz-émæ étt-a-m(e)
2sg æt-ar-í ær-mir-í æn-nevis-í e-va-y é-ven-i
3sg æt-ar-é ær-vin-é æ(č)-čin-é et-ašin-ǽ é-ven-e
3f –– –– –– et-ašin-íæ ––
‘bring’ ‘fall, die, see’ ‘give, ‘run, say, ‘come, see’
write, pick’ throw’
(See also §6.5.2.1 for the palatalization of roots in s-/z-: saz-, zan- > æčazom,
æjanom)
718 Donald L. Stilo
6.5.1.3. TAMs of adpositional (locative/allative) origin

Locatives: Locative morphemes plus the infinitive of the verb, followed by a
copula encoding tense and person in an analytic structure (‘am/was in doing’),
form general durative paradigms in N. Talyshi. In the Caspian group, Harzani, and
Kalāsuri these forms have a Progressive sense (§6.5.8), separate from the Present/
Imperfect tenses.
Northern Talyshi: Locative Present Tense (full and short forms)

Full form -V Full form -t/d Short Form -t/d
Stem + Stressed infinitive formant + Stressed stem + Loc +
Locative + Copula Copula
1sg š-é-dæ=m vot-é-dæ=m vót-dæ=m
2sg om-é-dæ=ž mand-é-dæ=ž mánd-dæ=ž > mánd-æ=ž
‘go, come’ ‘say, stay’ ‘say, stay’
Allatives: The Allative preposition bV, which is different from the Subjunc-
tive marker, is an unstressed formant in another analytic Present/Past durative
in Asālemi and all varieties of Northern Tati, as well as the Future in Northern
Talyshi (§6.5.4).
Harzani Keringāni Kalāsuri Asālemi
1 sg bə-het=en bö-öt=en be-het=en=e be-het=em b-æ-xït=ím
2 sg bə-het=e bö-öt=e be-het=eš=e be-het=eš b-æ-xït=íš
3 sg bə-het=æn bö-öt=æn be-het=i=e be-het=i=æ b-æ-xït=í
‘sleep’ ‘say’ ‘sleep’ ‘sleep’ ‘sleep’
6.5.1.4. Diachronic present participial formation

The Caspian Present formant (V)n (lacking in W. Gilaki), is traditionally derived
from an old present participle through phonetic reduction: *ant > Vnd > (V)n(n). In
Lāhijāni ən occurs after roots ending in most consonants, but n after a vowel. After
a root-final n-, the Present marker is lost; after r, the latter is lost and -n- retained
(but the -r- of the root reappears in the Subjunctive):
Lāhijāni Kelārdashti Tonekāboni
Consonant -V -n -r (xor-) Present II
1 sg xós-ən-əm á-n-əm dón-əm xó-n-əm bær-eme xár-en-om
2 sg xós-ən-i á-n-i dón-i xó-n-i bær-n-i xár-en-i
3 sg xós-ən-ə á-n-ə dón-ə xó-n-ə bær-n-i xár-en-e
‘sleep’ ‘come’ ‘know’ ‘eat ‘carry off’ ‘eat’
Amini (2003) gives two Present tenses for Tonekāboni, one with the -en- suffix
and one with the Ø marking presented above. It seems, however, that these two
Present tenses are quite blurred in usage and often interchangeable:
(79) in či ke Ø-kár-im Ø-xár-im – ye číz-e
this what sub plant-pr 1-1 p eat-pr 1-1 p one thing-ez
ziádi=æm ruš-en-im.
much=also sell-pr 2-1 p
‘That which we plant, we eat—a lot of it we also sell.’ (Amini: 112)
The Mazanderani variant of -Vn-, simply -n- after roots in a single r or l, is lost in
consonant clusters (*xornme), and is -εn- after roots in other consonants. When the
nasal of the Present formant is followed by another nasal, there may be assimila-
tion of the first nasal and dissimilation of the second: *xa-n-me > xa-m-be ‘I want’.
Sāravi Bābolsari
1sg kæfembe <*kæfenme xorme <*xornme darme <*darnme
2sg kæfeni < *kæfeni xorni < *xorni darni < *darni
3sg kæfene < *kæfenε xorne < *xornε darnε < *darnε
‘eat’ ‘have’
Bābolsari (and others) generalizes resulting geminated nasals, Sāravi changes
them to nasal + homorganic stop, while Āmoli (etc.) reduces the gemination:
underlying Bābolsari Āmoli Sāravi Sāravi
Present stems ending in Vowel Stem in final -n
1sg *dεnmε dεmmε demə dembe dumbe < *dun-n-me
2sg *dεni dεnni deni deni dundi < *dun-n-i
3sg *dεnε dεnnε denə dene dunde < *dun-n-e
‘give’ ‘know’
Examples of Present tenses:
(80) Lāhijāni Sāravi
čan saət xós-ən-i? ve unta=re xa-n-e.
how.many hours sleep-pr -1 s s/he that.one=ra want-pr -3 s
‘How many hours do you sleep?’ ‘S/he wants that one.’ (SH: 129)
(81) Leriki Vafsi
gofe lok-n-é-dæ-Ø. tæ če r-kær-i?
cradle rock-caus - inf - loc - aux :3 s 1 you what dur -do-2s1
‘S/he is rocking the cradle.’ ‘What are you doing?’
720 Donald L. Stilo
6.5.2. Subjunctive and imperative:

The Subjunctive/Imperative marker in West Iranian languages is usually described
as a stressed prefix bV́ -. In examining this form more closely, however, I find that
the bV- and the associated stress must be seen as separate phenomena for various
reasons:
a) Verbs with lexical preverbs suppress bæ- but retain the stress in initial posi-
tion. In languages that have a Ø Present marker, therefore, the stress may be
the only way to distinguish the Subjunctive from the Present when preverbs
are present:
Present Subjunctive
Rashti: fadə́m (fa-d-ə́m) fádəm (fa-´-d-əm) ‘I give’
Māsulei: adǽm (a-dæ-ə́m) ádæm (a-´-dæ-əm) ‘I give’
Koluri: diæsə́m (di-æs-ə́m) díæsəm (di-´-æs-əm) ‘I look’
b) In Rashti and the Rudbāri (Tatoid) group, both the Subjunctive and the Preterit
have a bV- marker (see §6.5.6) but it only takes stress in the Subjunctive here.
(See also §4.1, Pt. 10 on the difference in stress placement of WG vs. EG).
c) Some languages have no bV- but still stress the initial syllable of the
Subjunctive on the present root: Razajerdi me-ruš-ém > (Subj.) rúš-em ‘sell’.
Keringāni and Harzani have no be-, but stress is not indicated in the descrip-
tions.
(82) Razajerdi
öštö=re ye hærf pórs-öm.
you-abl one word ask:sbj -1 s 1
‘Let me ask you something.’
Harzani
en öv=e gæræk riy-en.
this water=ra must pour-1 s 1
‘I must pour this water.’ (MH: 483)
d) Similarly to point (c), some languages delete the bV- morpheme before initial
v- of the root, in which case the stress moves to the first syllable of the root,
e. g., Nowkiāni vénder-em ~ bé-vender-em ‘(that) I stand/stay’ (YSFN).
e) If the verb root begins with a vowel, the vowel of bV- is lost, leaving the
vowel of the root to bear the stress: Vafsi b-ór-om ‘that I eat’, Talyshi b-árïšt-
o-m ‘(that) I break’, Lāhijani b-ár-əm ‘that I bring’.
Additional Comments:
1) Some languages have small differences in Present tense vs. Subjunctive
PAMs, Rashti 3sg . xuré/búxurə ‘eats’; Kelārdashti 2sg . bærné/bǽberi ‘you
carry off’.
2) The labial of bæ- in Vafsi is often dropped or converted to w- or y-, depending

on the preceding vowel: ærgo wǽssom ‘I must go’, tæ ǽssey ‘you should go’.
3) The Negative Imperative (or Prohibitive) takes the dedicated negator mǽ- in
Tatic, while Caspian languages have the usual næ- of the other tenses.
4) In only a few Tatic languages, the negator of the Subjunctive is mǽ- as in the
Prohibitive: Nowkiāni čéra mǽ-beræm-em? ‘Why shouldn’t I cry?’ (YSFN),
Māsulei (alternate forms) mǽ-dæ-færs-əm ~ né-dæ-færs-əm ‘(that) I not ask’.
5) The negators nǽ- and mǽ- suppress the Subjunctive/Imperative bǽ- prefix.
6) The singular Imperative has a Ø PAM; the plural is identical to the Subjunctive.
7) In S. Tati, the original Optative (§6.5.3) replaces the Subjunctive form only in
the 2sg , e. g., Chāli b-ášin-aš < b-ášin-a-š ‘(that) you throw’.
8) Keringāni and Kalāsuri have a special situation for transitive Imperatives: in
addition to the bV- marker, they optionally add the 3sg . Set2 PAM, bleached
of its usual marking of the Patient: Keringāni/Kalāsuri: biždüž/bišdüt ‘sew!’,
bižvin/bišvin ‘see!’, bižgi/bišgin ‘take!’ (all: YSFN).
6.5.2.1. Subjunctive paradigms
Lāhijāni: No Preverbs + Preverbs

1 sg bə́-bər-əm bú-šu-m də́-vəd-əm hǽ-de-m ví-gir-əm
2 sg bə́-bər-i bə́-š-i də́-vəd-i hǽ-d-i ví-gir-i
3 sg bə́-bər-ə bú-šu-n də́-vəd-ə hǽ-de-Ø ví-gir-ə
1 neg nə́-bər-əm nú-šu-m də́-n-vəd-əm hǽ-n-de-m vín-gir-əm
imp . sg bə́-bər bú-šu də́-vəd hǽ-di ví-gir
imp . sg . neg nə́-bər nú-šu də́-n-vəd hǽ-n-di ví-n-gir
imp . pl bə́-bər-in bə́-š-in də́-vəd-in hǽ-d-in ví-gir-in
imp . pl . neg nə́-bər-in nə́-š-in də́-n-vəd-in hǽ-n-d-in ví-n-gir-in
‘take away’ ‘go’ ‘close, tie’ ‘give’ ‘pick up’
Rāmsari Tonekāboni Kelārdashti Sāravi
1 sg bá-bor-om bǽ-ber-om bǽ-ber-æm bǽ-ver-em há-d-em
2 sg bá-bor-i/-e bǽ-ber-i bǽ-ber-i bǽ-ver-i há-d-i
3 sg bá-bor-ə bǽ-ber-e bǽ-ber-e bǽ-ver-e há-d-e
1 neg ná-bor-om nǽ-ber-om nǽ-ber-æm nǽ-ver-em nǽ-d-em
imp . sg / pl bá-bor(-in) bǽ-ber(-in) bǽ-ber(-in) bǽ-ver(-in) há-de/-d-in
imp . neg ná-bor(-in) nǽ-ber(-in) nǽ-ber(-in) nǽ-ver(-in) nǽ-de/-d-in
⟹ ‘take away’ ⟸ ‘give’
722 Donald L. Stilo
Chali Leriki: No Preverbs + Preverbs

1 sg b-ášin-om bḯ-kïšt-ïm bḯ-dæ-m pé-gæt-ïm
2 sg b-ášin-aš (<-a-š) bḯ-kïšt-ïš bḯ-dæ-š pé-gæt-ïš
3 sg b-ášin-i bḯ-kïšt-ï bḯ-dæ-Ø pé-gæt-ï
1 neg nḯ-kïšt-ïm nḯ-dæ-m pe-nḯ-gæt-ïm
imp . sg / pl b-ášin/b-ášin-a bḯ-kïšt(-æn) bḯ-dæ(-n) pé-gi ~
pé-gæt(-æn)
imp . neg m-ášin/m-ášin-a mǽ-kïšt(-æn) mǽ-dæ(-n) pe-mǽ-gi ~
pemǽgæt(-æn)
‘throw’ ‘kill’ ‘give’ ‘pick up’
Vafsi: Cons. -V Consonant -V -r
1 sg bǽ-vær-om bǽ-sso-m dǽ-gen-om há-d-om ó-gir-om
2 sg bǽ-vær-i bǽ-sse-y dǽ-gen-i há-de-y ó-gir-i
3 sg bǽ-vær-e bǽ-sso-(æ) dǽ-gen-e há-d-o(æ) ó-gir-e
1 neg nǽ-vær-om nǽ-sso-m nǽ-gen-om nǽ-d-om nǽ-gir-om
imp . sg bǽ-væ-Ø bǽ-ssæ-Ø dǽ-gen-Ø há-dæ-Ø ó- ~ ú-gi-Ø
imp . sg . neg mǽ-væ-Ø mǽ-ssæ-Ø mǽ-gen-Ø mǽ-dæ-Ø mǽ-gi-Ø
imp . pl bǽ-vær-a bǽ-ss-a dǽ-gen-a há-d-a ó- ~ ú-gir-a
imp . pl . neg mǽ-vær-a mǽ-ss-a mǽ-gen-a mǽ-d-a mǽ-gir-a
‘take away’ ‘go’ ‘fall’ ‘give’ ‘pick up’
Those roots in Vafsi that palatalize their initial consonant after the Present TAM
æt- revert to the original present root in the Subjunctive forms:
Root: -saz- -ssæ- (-šur-) -zan-
Palatalization: æt-s- > æč- æt-s- > æč- æt-š- > æč- æd-z- > æj-
Pres. 1sg æ-čaz-óm æ-č-om æ-čur-óm æ(j)-jan-óm
Subj. 1sg bǽ-saz-om bǽ-sso-m bǽ-šur-om bǽ-zan-óm
Subj. 1sg . neg nǽ-saz-om see nǽ-šur-om nǽ-zan-om
Imp. sg /( pl ) bǽ-saz(-a) previous bǽ-šur(-a) bǽ-zan(-a)
Imp. sg /( pl ). neg mǽ-saz(-a) Table mǽ-šur(-a) mǽ-zan(-a)
‘build’ ‘go’ ‘wash’ ‘know’
Irregular Imperative forms

Root -a- -ar- -(h)or-
Subj. 1sg b-a-ym b-ar-óm b-or-óm
Imp. sg / pl b-úri/b-úri-a b-óran/b-óran-a b-óæ/b-ór-a
Imp. sg . neg m-úri/m-úri-a m-óran/m-óran-a m-óæ/m-ór-a
‘come’ ‘bring’ ‘eat’
6.5.2.2. Uses of the subjunctive

Only a few representative uses of the Subjunctive can be mentioned in this chapter:
1) in subordinate verbs after modals (§6.6) and main verbs with control features;
2) in the protasis of conditional sentences implying doubt or uncertainty (§8.2.2);
3) in Purposive clauses (see §8.3.2);
4) in Optative senses (blessings and curses) if an Optative form is lacking
(§6.5.3);
5) in the verb in an indefinite relative clause, as in the following two examples:
(83) Lāhijāni
hiš kə́s=a šənas-ən-i təmóm-ə i ko-n=a
none person=ra know-pr -2 s all-ez this mountain-pl = ra
bú-don-ə?
pu -know-3 s
‘Do you know anybody (that) knows all these mountains?’ (NJ: 127)
(84) Vafsi
esdæ qælǽm=i hær kay yə-wæ-Ø,, ær-go
your pen=2 s 2 each where pu -be-3 s 1 dur -want
há-wend-uæ-Ø.
pvb -find-pass -3 s 1
‘Your pen, wherever it is, has to be found.’
6.5.3. Optative
The Optative, lacking in Caspian, is quite robust in Tatic. It is formed by inserting
the marker -a- into the Subjunctive after the Present stem. It expresses a wish
or hope, as in blessings, curses, etc. The Optative form (Yarshater: Conditional
Present) in Asālemi, Koluri, Gandomābi, Karani, and Kajali also has many Sub-
junctive uses (although there is an independent Subjunctive), esp. in conditional
clauses (exs. 88–90). In Northern Talyshi, the Optative marker is raised to o. Within
Tatic, the Optative is lacking in Vafsi, Khoini, the Tāromi group, and Māsulei,
among others.
Forms of the Optative

Leriki Asālemi Gandomābi Karani Koluri
(N. Talyshi) (C. Talyshi) (Central Tati) (Central Tati) (Central Tati)
1 sg bḯ-hand-o-m bə́-š-a-m be-dæv-a-ym be-kær-a-ym bé-čæk-a-m
2 sg bḯ-hand-o-ž bə́-š-a-š be-dæv-a-š be-kær-a-y bé-čæk-a-š
3 sg bḯ-hand-o-Ø bə́-š-a-Ø be-dæv-a-Ø be-kær-a-Ø bé-čæk-a-Ø
‘read’ ‘go’ ‘run’ ‘do’ ‘break (tr.)’
724 Donald L. Stilo
Purely Optative sense:

(85) Leriki Eštehārdi (ST: 224)
hæmišæ b-o-ž! xoda ji bé-köš-a-Ø
always be- opt -2 s 1 God he:ob pu -kill-opt -3 s 1
‘May you always be!’ ‘May God kill him!’
Karani (YSFN)
yæman=er æ-ger-a-Ø.
anthrax=2 s 2 pvb -take-opt -3 s 1
‘May anthrax take you.’
In Leriki the Optative usually replaces the Subjunctive in the usual Subjunctive
uses, even though both paradigms exist in the dialect. The Subjunctive and the
Optative generally remain distinct in other Northern Talyshi dialects of Azerbaijan.
(86) Leriki
æy pi-dæ=ž-e hežo bḯ-nïšt-o-Ø kitob
he:ob want:inf - loc -3 s 2aux always sbj -sit-opt -3 s 1 book
bá-hand-o-Ø.
sbj -read-opt -3 s 1
‘S/he always wants to sit and read books.’
(87) ovči nḯ-be-Ø ki dï tifang-í æy bḯ-žæn-o-Ø
hunter neg -be:pt -3 s 1 sub with rifle-ob he: ob sbj -hit-opt -3 s 1
‘Wasn’t there a hunter to shoot it with a gun?’
The original Optative may be used more commonly in the Present Conditional usage
in Present/Future conditional clauses since the latter situations are more frequent:
(88) Asālemi Gandomābi
ægær iya bə́-b-a-š,,… ægæ bé-dæv-a-š,,…
if here sbj -be-opt -2 s 1 if pu -run-opt -2 s 1
‘If you should be here, …’ ‘If you run …’
Many languages use the Optative in both senses:
(89) Koluri
Optative sense
xoda xær-e šax mæ-da-ya-Ø!
God donkey-ob horn proh -give-opt -3s1
‘May God not give horns to a donkey!’ (AF: 428)
Conditional sense
ægæ dir á-kær-a-nd,,…
if late pvb -do-opt -3 p 2
‘If they are late, …’
(90) Karani (YSFN)

xoda ešte pi-ær bí-amorz-a-Ø!
God your father-ob pu -take-opt -3 s 1
‘May God absolve your father!’
ægæ bé-vrij-a-š,, Ø-gen-i.
if pu -run-opt -2 s 1 pr -fall-2 s 1
‘If you run, you will fall.’
The Subjunctive is used in these senses in the languages that do not have an Opta-
tive:
(91) Māsulei Vafsi
xəda tə læ:næt bə-kær-ə! xoda laze-y ná Ø-dir-e!
God you curse sbj -do-3 s 1 God son-2 s 2 hold pu -have-3 s 1
‘May God damn you!’ (LM: 54, ‘May God preserve your son!’
IV, 13)
6.5.4. Future tense

While Formal Future tenses occur in some Caspian varieties, they are not often
used. In Northern Tati and Northern Talyshi, the Future is quite robust. In Nowk-
iāni and Alviri it is somewhat less regularly used and is very rare in Vafsi.
The Future formations consist of: A) an invariable form of ‘want’ plus Sub-
junctive of the verb; B) a finite form of ‘want’ plus an invariable form of the verb;
C) the same Allative formation as in the Allative Present (see §6.5.1.3); or D) a
Future participial form in -æni(n) (see next section also). Keringāni seems to have
a unique Future formation (AAK 1954: 97–98): E) Infinitive (short form) + -i- ~
-y- + Copula (ex. 94).
My Lāhijāni fieldwork shows a Type A Future based on xa-n (Present of ‘want’
with a TAM but no PAMs). While Jahangiri (2003) does not discuss a formal
Future, his examples show a similar Future without the TAM, xa, which also
means ‘must’.
Type A Type B Type C Type D
Lāhijāni Rashti Leriki Kalāsuri
1sg xa(-n) bú-šu-m xa-yəm šo-on b-æ-vot-é=m šer-æni=m
‘go’ ‘go’ ‘say’ ‘go’
(Type C + D?)
Alviri
be-vat-ænin-imæ
‘say’
726 Donald L. Stilo
(92) Rashti
ha rå=ya xa-ídi šo-on.
this.very road=ra want-3 p go-inf
‘They will take (lit: go) this same road’ (RR: 240, 28)
Lāhijāni
guya xa bərf b-a-Ø.
supposedly want snow pu -come-3 s
‘It seems it is going to rain.’ (NJ: 170)
(93) Kalāsuri
em ešte pül-i hard-æni=æ.
this your money-ob eat-fut =3s1
‘He will embezzle (lit: eat) your money.’ (KL: 279)
Alviri
mella dærs-e nu be-vat-ænin-i.
mullah lesson-ez new all -say- fut -3 s 2
‘The teacher will teach (lit: say) a new lesson’ (HA: 173)
(94) Keringāni (Type E) (YSFN)
te sæba če kur kard-e-y-šæ? mun sæba
you tomorrow what work do-inf - fut -2 s 1 I tomorrow
ši-e-y-ne šəkar.
go-inf - fut -1 s 1 hunt
‘What will you do tomorrow?’
‘I’ll go hunting tomorrow.’
For examples of the Leriki Future, see exs. (45), (64), (237), (239), (270), (281),
(317).
6.5.5. Future necessitative

In Leriki, which already has a regular Future tense, the participial morpheme
-ænin forms a Future of Necessity, e. g., hærd-ænín=im ‘I will have to eat’. It
also conveys a sense of “intention, strong desire to perform a specific action”
(Miller 1953: 159).
(95) Leriki
az b-æy i dï kælimæ vot-ænin=im.
I to-he:ob 1 2 word say-nec =1 s 1
‘I will have to tell him one or two words (i. e., about the problem)’
6.5.6. Past tense (Preterit)

a. Most languages of the area use the same punctual bV- prefix as that of the
Subjunctive as a marker of the Preterit (Intransitive and Transitive).
i. N. Tati, the enclaves Khoini and Dikin-Marāqei, and all Talyshi (except
for the Jowkandāni area) lack the bV- punctual marker in the Preterit. For
these languages, this marker will henceforth below be labeled as Sub-
junctive (sbj ), rather than punctual (pu ), since this is its sole function
here.
ii. Preverbs and the negator suppress bV- in the Preterit (as in Subjunctive).
iii. The Preterit bV- is generally stressed, except for WG, Rudbāri, and some
neighboring Tati varieties, e. g., Koluri, Kafteji, Kelāsi, Nowkiāni.
b. Some languages have a Ø PAM for the 3sg . in the Preterit, while others have
a morpheme that is identical or similar to the 3sg . PAM of the Present.
c. Most of Tatic makes a sharp distinction in intransitive and transitive con-
jugations in the Past system: Set2 (“Oblique”) PAMs encode the transitive
Agent while Set1 (“Direct”) PAMs encode the intransitive S. Some languages
additionally show the Absolutive nature of the Patient by cross-indexing it in
the verb with Set1 PAMs, but in general “there has always been a tendency to
abandon Object-agreement in favour of A-agreement” (Haig 2008: 128).
d. Caspian, Tatoid, and Razajerdi only have one (Intransitive type) conjugation.
e. Set2 PAMs encoding Past A are leftwardly mobile in Tatic (see §7.4.1).
6.5.6.1. Languages that take the punctual morpheme bV- in the preterit
Lāhijāni: + Preverbs
1 sg bó-xord-əm bú-šo-m də́-kət-əm há-da-m ví-t-əm < *ví-(g)it-əm
2 sg bó-xord-i bú-šo-i də́-kət-i há-da-y ví-t-i
3 sg bó-xord-ə bú-šo-Ø də́-kət-ə há-da-Ø ví-t-ə
‘eat’ ‘go’ ‘fall’ ‘give’ ‘pick up’
Rashti + Preverbs Tonekāboni + Preverbs Sāravi
1 sg bə-kə́ft-əm vi-ríšt-əm bé-git-om vé-git-om bǽ-merd-eme
2 sg bə-kə́ft-i vi-ríšt-i bé-git-i vé-git-i bǽ-merd-i
3 sg bə-kə́ft-ə vi-ríšt-ə bé-git-e vé-git-e bǽ-merd-e
‘fall’ ‘get up, rise’ ‘get, catch’ ‘pick up’ ‘die’
Sāravi + Preverbs
dǽ-væss-eme
dǽ-væss-i
dǽ-væss-e
‘close, tie’
728 Donald L. Stilo
Vafsi
Intransitive Preverb Transitive Preverb
1 sg bǽ-ræsa-ym dǽ-kætte-ym b=ím-vattæ har=om-pærsa
2 sg bǽ-ræsa-y dǽ-kætte-y b=í-vattæ har=i-pærsa
3 sg bǽ-ræsa-Ø dǽ-kættæ-Ø b=ís-vattæ har=es-pærsa
neg nǽ-ræsa-ym nǽ-kætte-ym n=ím-vattæ n=ím-pærsa
‘arrive’ ‘fall’ ‘say’ ‘ask’
Central Tati N. Talyshi
(transitional)
Nowkiāni transitive Kafteji Koluri Jowkandāni
1 sg be-xót-im be-vǽt=em bə-kǽt-im be-vǽšt-im bə-ræs-ím
2 sg be-xót-iš be-vǽt=er bə-kǽt-iš be-vǽšt-iš bə-ræs-íš
3 sg be-xót-Ø be-vǽt=eš bə-kǽt-Ø be-vǽšt-Ø bə-ræs-í=e
3f be-xót-æ bə-kǽt-æ
‘sleep’ ‘weave’ ‘fall’ ‘jump’ ‘arrive’
Tatoid: Tutkāboni (Rudbāri) Gurāni (Tāleqāni)
Intrans. Past Trans. Past Intrans. Past Trans. Past Past
1s bə-mórd-om bə-pǽt-om bá-xot-om bá-xord-om há-git-om
2s bə-mórd-i bə-pǽt-i bá-xot-i bá-xord-i há-git-i
3s bə-mórd-Ø bə-pǽt-Ø bá-xot-Ø bá-xord-Ø há-git-Ø
6.5.6.2. Languages that do not take the Punctual morpheme bV- in the preterit
Khoini transitive Harzani transitive Kalāsuri transitive

1 sg genést-im vínd=em=(e) šér-in öt=ma genést-im mæs=em=æ
2 sg genést-iš vínd=i=(e) šér-i öt=la genést-iš mæs=er=æ
3 sg genést-e vínd=eš=(e) šér-æ öt=ja genést-æ mæs=eš=æ
‘fall’ ‘see’ ‘go’ ‘say’ ‘fall’ ‘hear’
Leriki
intransitive transitive
1 sg omǽ-ym vít-im gïní-m kárd=ïm=e pe-gǽt=ïm=e
2 sg omǽ-yš~ž vít-iš~ž gïní-š~ž kárd=ï=e* pe-gǽt=ï=e*
3 sg omǽ-y vít-e gïní-Ø kárd=ïš=e ~ pe-gǽt=ïš=e ~
kárd=ïž=e pe-gǽt=ïž=e
‘come’ ‘run’ ‘fall’ ‘do’ ‘pick up’
*These forms are generally avoided for phonological reasons.
Dikin Marāqei, in addition to the regular intransitive and transitive conjugations,

has an intransitive-like past Objective conjugation. In Razajerdi only a few verbs
(‘run, buy, hear, slice, read, break’) take the punctual marker, while others do not.
Dikin Marāqei
intransitive transitive Objective
1sg genést-in vínd=em=æ vínd-in
2sg genést-iš vínd=et=æ vínd-iš
3sg genést-æ vínd=eš=æ vínd-æ
3f genést-ian vínd-ian
‘fall’ ‘see’ ‘X saw me, you, etc.’
Razajerdi
(intransitive/transitive)
kǽt-öm göt-öm bə-xúnd-em
kǽt-i göt-i bə-xúnd-i
kǽt-Ø göt-Ø bə-xúnd-Ø
kǽt-æ göt-æ bə-xúnd-æ
‘fall’ ‘say’ ‘read’
6.5.7. Imperfect tense

The various types of durative and/or Imperfect markers in Caspian and Tatic are:
A) Prefixes or Ø: 1) Ø; 2) æt- durative; 3) me- ~ eN- durative; 4) Allative markers;
5) the so-called Augment æ- found only in Northern and Central Talyshi and
only in combination with the -i- suffix of B3 below. Languages with prefixes
A2–4 share these prefixes with the Present tense.
B) Suffixes (not shared by the Present): 1) -enV-; 2) imi (Kelārdashti only); 3) i ~
æy (along with the Augment in Northern/Central Talyshi, but no Augment in
Southern Talyshi, the neighboring Tati dialects, or W. Gilaki).
Comments:
1) The Ø Imperfect marker based on the Past stem is found across the whole
Caspian family, excluding WG and Kelārdashti. These languages generally do
not coincide with those that have a Ø marker in the Present tense.
2) When a verb has a lexical preverb in languages with a Ø Imperfect marker, the
Preterit and the Imperfect are indistinguishable since the preverb suppresses
the punctual bV: Lāhijāni: há-da-m ‘I gave; used to give’.
3) In Central Caspian (except for Kelārdashti) there are two Imperfect types, Ø
and enV; the latter has both irrealis and habitual or even progressive senses
but in EG it is exclusively a conditional marker (see §6.5.11–§6.5.12).
4) WG is the only Caspian variety that has the -i- Imperfect marker; WG also
lacks the conditional marker -enV-.
730 Donald L. Stilo
5) The -i- marker is also used with the past aux (verb + b-i-Ø, etc.) in irrealis
paradigms (§6.5.11–§6.5.12).
6) The Imperfect in some languages is formed on the Present stem: Talyshi,
Koluri, Kelārdashti (-imi-), and partly -ena- in Rāmsari and Tonekāboni.
7) The æy suffix of some C. Tati (and mobile clitic of Keringāni Past Subjunctive,
§6.5.11) is the same as -i- of Points 4–5: æy > ē > i. It may derive from hait,
the Old Iranian 3rd sg. optative of ‘be’ (Windfuhr 1987: 393).
6.5.7.1. Ø-marked imperfects
Lāhijāni I final -V Preverb Rāmsari I Tonekāboni I

1sg Ø-xónd-əm Ø-bye-m há-da-m Ø-gút-əm Ø-gút-əm
2sg Ø-xónd-i Ø-b-i há-da-y Ø-gút-e Ø-gút-e
3sg Ø-xónd-ə Ø-be-Ø há-da-Ø Ø-gút-ə Ø-gút-ə
‘read’ ‘cut’ ‘give’ ‘say’ ‘say’
Sāravi (Preverb suppressed in durative tenses)
1sg Ø-merd-eme Ø-ši-me Ø-væss-eme Ø-gεt-εmε Ø-viness-εmε
2sg Ø-merd-i Ø-ši-i Ø-væss-i Ø-gεt-i Ø-viness-i
3sg Ø-merd-e Ø-ši-ye Ø-væss-e Ø-gεt-ε Ø-viness-ε
‘die’ ‘go’ ‘tie, close’ ‘take, catch’ ‘see’
6.5.7.2. Durative markers shared with present: æt- ~ ær- and me- ~ eN-
Vafsi transitive Hezārrudi Eshtehārdi Khoini

1sg ær-kætte-ym im-ær-vattæ me-kæt-im mi-væšt-im(æ) w
ut-m-išt-im
2sg ær-kætte-ey i-r-vattæ me-kæt-i mi-væšt-iš(æ) w
ut-m-išt-iš
3sg ær-kætte-Ø is-ær-vattæ me-kæt-Ø mi-væšt-Ø w
ut-m-išt-e
‘fall’ ‘say’ ‘fall’ ‘run’ ‘rise, get up’
6.5.7.3. Imperfect -i- ~ -æy- formant, added to past stem
Caspian Rudbāri Central Tati

Rashti (WG) Tutkāboni Dizi Karnaqi Gandomābi
1sg gúft-i-m gut-i-m xærd-ey=m=e =em xand-ey kætt-æy-m
2sg gúft-i (< -i-i) gut-i (< -i-i) kætt-æy-š
3sg gúft-i-Ø gut-i-Ø kætt-æy-Ø
‘say’ ‘say’ ‘eat’ ‘read’ ‘fall’
This formation in Keringāni is only used as a Past Subjunctive (§6.5.11).
Imperfect formants -i- ~ -æy- and -imi-, added to Present stem

Languages with Augment + -i- -i-, no Augment
Leriki Asālemi Māsulei Koluri
pr žæn- hæ- virij- pe- vaj-
pt žæ- hard- virit- pætt- vat-
1sg æ-žæn-i-m æ-hæ-y-m æ-vrij-i-m p-i-m vaj-i-m
2sg æ-žæn-i-š æ-hæ-y-š æ-vrij-i-š p-i-š vaj-i-š
3sg æ-žæn-i-Ø æ-hæ-y-Ø æ-vrij-i-Ø p-i-Ø vaj-i-Ø
1pl æ-žæn-i-mun æ-hæ-y-mon æ-vrij-i-mun p-i-mon vaj-i-man
2pl æ-žæn-i-n æ-hæ-y-n æ-vrij-i-run p-i-n vaj-i-ran
3pl æ-žæn-i-n æ-hæ-y-n æ-vrij-i-n p-i-n vaj-i-nd
‘hit, strike’ ‘eat’ ‘run’ ‘cook’ ‘say’
-imi- Suffix
Kelārdašti
el-
ešt-
el-imi-yæme
el-imi-Ø
el-imi-yæ
el-imi-yæmi
el-imi-yænne
el-imi-yænne
‘put’ (KK: 81)
6.5.7.4. Allative-marked imperfects

This Imperfect is formed similarly to the Allative-type Present (be-) but with the
past aux . (Also see §6.5.8.1 for these forms used as Past Progressives in some
areas.)
Keringāni: ‘do’ Kalāsuri: ‘sleep’ Harzani: ‘say’
1sg be-kard-e bi-ne be-het-é=u-m be-öt-e ber-in
2sg be-kard-e bi-še be-het-é=u-š be-öt-e ber-i
3sg be-kard-e biæ-Ø be-het-é=u-Ø be-öt-e ber-æ
all - vb : pt - inf all - vb : pt - inf = all - vb : pt - inf
aux : pt - set 1 aux : pt - set 1 aux : pt - set 1
732 Donald L. Stilo
6.5.7.5. Uses of the imperfect

In addition to the usual past habitual sense, the Imperfect may have a progressive
sense since the Progressive forms, where they exist, are not necessarily obligatory.
The Imperfect also has Conditional and Past Subjunctive uses, but there are also
dedicated forms for the latter paradigms.
(96) Lāhijāni
γeblen gušt Ø-xord-əm. čiči Ø-gút-i.
before meat imf -eat:pt -1 s what imf -say:pt -2 s
‘I used to eat meat before.’ ‘What were you saying?’
(97) Sāravi
xæzune bar=e Ø-værd-e, Ø-rut-e.
treasury load=ra imf -take:pt -3 s imf -sell:pt -3 s
‘He would take the load of the treasury (and) would sell (it).’ (SY: 68)
(98) Leriki
texnikum=ædæ a-hand-í-mon.
technikum-at dur -read-imf -1 p 1
‘We studied at a technical school.’
Vafsi
in dokan=dæ kár=es ær-kærd.
this shop=at work=3 s 2 dur -do:pt
‘He used to work in this shop.’
Conversely, Conditionals (§6.5.12) are often interchangeable with the Imperfect
in habitual senses in Rāmsari and Tonekāboni but, so far as is known, not in EG.
(99) Rāmsari
ruz xót-enæ=ba-n, šæb hæmlə́ Ø-kòrd-ən.
day sleep:pt - cond = aux : pt -3 p night attack imf -do:pt -3 p
‘They would sleep in the daytime (and) would attack at night.’
(100) Tonekāboni
berǽnj-e koléš=e vé-git-ene ba-n, súj-æne
rice-rez bran=ra pvb -take: pt - cond aux : pt -3 p vb -burn- cond
ba-n, vé=re ælæk Ø-górd-en.
aux : pt -3 p it-ra sieve imf -do:pt -3 p
‘They used to take the rice bran, burn (it and) sift it.’ (Amini: 115)
6.5.8. Progressive aspect (present and past)

As outlined below, there are two main types of Progressives in the area, with
various subtypes. Both may appear together in some languages, creating a mixed
third type.
6.5.8.1. Type I: Locative Formations

Type IA: The Progressive is formed in N. Tati, Lerdi, Chāli, Khoini, and others
according to the formula: infinitive or past stem + locative + copula .
In Type IA the Locative postposition ændæ of Harzani has three variant forms:
1) after a vowel the first vowel of the suffix is lost and with all three variants the
last vowel is also lost before the vowel of the PAM: ama- > ama-nd-en ‘I am
coming’; 2) after a root ending in -t, the -ændæ- suffix loses the first syllable: öt- >
öt-d-en ‘I am saying’; 3) after fricatives and sonorants the -n- remains but the -d- is
lost: bəs- > bəs-n-en.
In Type IB the locative formant of Kalāsuri-Khoynarudi is deleted leaving
only the infinitival and the copular elements. They also have an unusual copula
formed by a demonstrative (en ‘this’, æv ‘that’) + an enclitic copula. In the Pro-
gressive, the demonstrative attracts the copula leftwards. An overt Object usually
suppresses the demonstrative and hosts the copula (101a), but both may appear
together (101b).
Type IA: Harzani
Pres. Progressive Past Progressive
1sg öt-d=en bəs-n-en öt-de ber-in bəs-ne ber-in
2sg öt-d=e bəs-n-e öt-de ber-i bəs-ne ber-i
vb -loc = aux vb-loc aux : pt - pam
‘say’ ‘close’ ‘say’ ‘close’
Type IB: Kalāsuri
Pres. Prog. Past Prog.
en=em kust-e en=u-m kust-e
en=iš kust-e en=u-š kust-e
dem = pam vb - inf
‘pound’
(101) Kalāsuri (KL: 278)
olæt=em šušt-e. æv=u=š olæt šušt-e.
clothes=1 s 2 wash-inf that=aux : pt =2 s 2 wash-inf
‘I am washing clothes.’ ‘You were washing clothes.’
734 Donald L. Stilo
While Leriki uses IA typology for the Present tense and has no independent Present
Progressive, it does have a Past Progressive of this type (Pirejko: “Past Dura-
tive II”; Miller: “Compound Imperfect II”) that formally contrasts with the Imper-
fect (§6.5.7.3), but is often interchangeable with it. Pirejko (1976: 337) states that
this “…tense form denotes any durative action in the past.”
Full form -V Full form -t/d Short Form -t/d
stem +infinitive + locative + stem (stressed)+ locative +copula : pt
copula : pt
1sg š-é-dæ=bi-m vot-é-dæ=bi-m vót-dæ=bi-m
2sg om-é-dæ=bi-š mand-é-dæ=bi-š mánd-dæ=bi-š
‘go, come’ ‘say, stay’ ‘say, stay’
(102) Leriki
aγïl vít-dæ=b-e bæ di mašin-í.
child run- loc =aux : pt -3 s 1 to after car-ob
‘The child was running after the car.’
penj sor vaxt do-y-dæ=b-in.
five year time give-inf - loc =aux : pt -3 p 1
‘They used to give five years leeway.’
Type IC (only Caspian and Rudbāri): instead of a postposition, the copula is a
Locative Be-verb (BE4, §6.8). The infinitive is: a full form (Rashti); a short form,
identical to the past stem (Tonekāboni); or the -n of the infinitive is lost (Lāhijāni):
Lāhijāni, Progressive I
Present Past
1sg xord-é dər-əm xord-é də-bu-m
eat:pt - inf be 4-1 s eat:pt - inf be 4- aux : pt -1 s
1neg xord-e dǽn-ny-em xord-e də́-nye bu-m
‘eat
Present Past
1sg ha-dá dər-əm ha-dá də-bu-m
pvb -give:pt be 4-1 s pvb -give:pt be 4- aux : pt -1 s
1neg ha-da dǽn-ny-em ha-da də́-nye bu-m
‘give’
Rashti Prog. I Langerudi Rāmsari Tonekāboni Tutkāboni
Prog. I Prog. I (Rudbāri)
guft-ə́n dər-əm gut-é dær-əm za dær-əm xárd dǽr-om xard-en dǽr-om
‘say’ ‘say’ ‘hit’ ‘eat’ ‘eat’
(103) Lāhijāni
Present Progressive I
mu γeza xord-é dər-əm.
I food eat-inf be 4-1 s
‘I am eating.’
Past Progressive I
hava rošen=a bo də-bu-Ø
weather bright=cs become:inf be 4- aux : pt -1 s
‘The weather was clearing up.’
(104) Rāmsari (SR: 146) Langerudi
kemi ra ša dær-e? adæm æmá dær-e.
which road go:inf be 4-3 s person come:inf be 4-3 s
‘Which road are you taking?’ ‘Someone is coming.’
Type ID: In Kelārdashti and Gurāni (Tatoid) fully conjugated Present and Past
forms of the Locative Be-verb (Be4) precede the Present and Imperfect:
Kelārdashti: ‘to carry off, take away’
Present Progressive Past Progressive
1sg dær-eme bær-eme dæ-bi-yæme bær-imi-yæme
be 4- aux :1 s take-1 s be 4- aux : pt -1 s take- imf -1 s
Gurāni: ‘to close; bake’
Present Progressive
der-im dé-m-bænd-im
be 4-1 p pvb - dur -bake-1 p
Type IE: In Mazanderani and Dikin Marāqei, an invariable 3sg of the Present/Past

of Be4 precedes the verb. Mazanderani may also optionally delete the PAM (dær-e
> dær) or conjugate both verbs, probably under Persian influence.
Sāravi: ‘come’ ‘eat’ Dikin-Marāqei:
‘become’
Pres. Progressive Past Progressive Pres. Progressive
1sg dær(-e) e-m-be dæy(-ye) Ø-xord-eme dær-i me-b-in
2sg dær(-e) e-n-i dæy(-ye) Ø-xord-i dær-i me-b-i
3sg .m dær(-e) e-n-e dæy(-ye) Ø-xord-e dær-i me-b-ik
3sg . f dær-i me-b-igan
be 4(-3 s ) come- pr - pam be 4: pt (-3 s ) eat- imf - pam be 4-3 s 1
dur -become- pam
736 Donald L. Stilo
(105) Sāravi
bǽ-di-ye zena dær e-n-e.
pu -see:pt woman be 4:pr come-pr -3 s
‘He saw that (his) wife is coming.’ (SY: 66)
at-ta zena at-ta mærdi dæy ši-ne.
1-unc woman 1-unc man be 4:pt go:imf -3 p
‘A man and a woman were going (along).’ (SH: 155)
(106) vešun dær-ene šu-n-ene sere.
they be 4-3 p go-pr -3 p house
‘They are going home.’ (SH: 149)
dæy-ne æli=re šekænje Ø-kard-ene.

be 4:pt -3 p P.N.=ra torture imf -do:pt -3 p
‘They were torturing Ali.’ (SY: 68)
6.5.8.2. Type II: Independent particles derived from kā̆r ‘work’

In type IIA an invariable particle, kVrV (< kā̆r ‘work, doing’, Windfuhr 1989: 256),
accompanies durative tenses to form a Progressive or convey a sense of Imminence
(‘be about to’). This type occurs in Rashti, Southern and Central Talyshi, and in
adjacent varieties of Tati. Some languages use more than one of these strategies.
Invariable Particle kVrV + Main Verb
Type IIA: kə́rə/kə́ra/káræ/kora precedes the regular Present or Imperfect tenses

Māsulei Kelāsi Gandomābi Karani Prog. I
1sg kərə xəs-Ø-ə́m kæræ me-xos-ə́m kora xos-ém kari xoss-ém
2sg kərə xəs-Ø-é kæræ me-xos-í kora xos-í kari xoss-í
3sg kərə xəs-Ø-ə́ kæræ me-xos-ə́ kora xos-é kari xoss-eæ
3f kari xoss-iæ
Koluri Rashti Prog. II Hezārrudi Asālemi Prog. I

1pl kera xes-Ø-ǽm kə́ra xus-Ø-ími kæræ xos-ám káræ bæ-xït-í-mun
2pl kera xes-Ø-ǽ kə́ra xus-Ø-ídi kæræ xos-á káræ bæ-xït-í-run
3pl kera xes-Ø-énd kə́ra xus-Ø-ídi kæræ xos-énde káræ bæ-xït-í-n
‘sleep’
Asālemi and Karani Progressive I forms above mostly indicate Imminence, while
their Progressive II forms (types IIB, IID, resp.) are used as Progressives. The
Māsulei form indicates both senses. In Type IIB (Kajali) ko(re) follows Main Verb
and in Type IIC (Kafteji) kǽrdæ, a locative form, precedes the verb.
Type IIB Type IIC
Kajali Kafteji
1sg me-vræm-em kore. kǽrdæ me-š-ə́m.
‘I am crying/weeping’ ‘I am going’
Progressive kar particle combined with Be-verbs
IID: In Type IID the Asālemi Present Progressive II is formed with the particle kar
preceding the verb in the infinitive form and a mobile copular element, usually
encliticized to kar or even farther leftward. There is also a shortened form of kar
>ka:
Present Short form Past Short form
Progressive Progressive
1sg kár=im vat-è ká=m vat-è kár=b-im vat-è ká=b-im vat-è ‘say’
Charozh, the Central Talyshi enclave in the N. Talyshi zone (§2.2.1), has shifted
this short-form progressive (with N. Talyshi vowel raising: ka > ko) to the sole
Present tense. The Allative-marked C. Talyshi Present is the Charozh Future, as in
N. Talyshi:
(107) Charozh
čo sahat dærs ko=mun do-y.
four hour lesson prog =1 p 1 give-inf
‘We teach for four hours.’
tikæ æmæ zü=mun ko š-e.
bit we early=1 p 1 prog go-inf
‘We go a little early.’
IIE: Lāhijāni Progressive II inserts the particle ka (< kar) between a short infini-
tive of a simplex verb and the Be4. Type IIE is especially common with compound
verbs formed with the light verb kun/gud- ‘do’, in which case kun/gud- is deleted
and the cognate ka takes its place between the NVE and Be4:
IIC: Lāhijāni II (Simplex Verb)
Pres. Progressive Past Progressive
1sg gut-è=ká=dər-əm gut-è=ká=də=bu-m
‘I am saying’
738 Donald L. Stilo
(Compound Verb with gudən)

Pres. Progressive
bazi gudən > bazi ka=dər-əm
‘to play’ ‘I am playing’
IIF: The kar element in Karani and Karnaqi consists of the full Present or Imperfect
tense of the verb kær- ‘do’, following the invariable past participle of the main verb.
IIG: In Eštehārdi the fully conjugated Present/Imperfect form of the main verb is
followed by kar/kær plus a conjugated form of Be4:
IID: Karani/Karnaqi Prog. II IIE: Eštehārdi
Pres. Progressive Past Progressive Pres. Progressive
1sg mærd-æ kær-em mærd-æ kærd-i-m mi-væz-em kar/kær=dær-imæ
‘I am/was dying’ (YSFN) ‘I am running’ (ST: 225)
6.5.8.3. Type III: Simultaneous use of Be4 and the kə́ra particle
Rashti Prog. III

1sg kə́ra guft-ə́n=dər-əm ‘I am saying’
Some languages, e. g., Vafsi, have no strategies for marking Progressives, which
are subsumed under the regular Present and Imperfect tenses, as in exs. 194, 284,
285B, 336, and the first sentence of the Vafsi text (§10.4).
6.5.9. Present perfect tenses

In my analysis, Caspian varieties all lack a Present Perfect which has merged
with the Preterit. Resultative forms (‘he is sitting/seated in the chair’) in Caspian
languages should not be confused with a true dynamic Perfect (‘I have often sat
in that chair’). Some grammars list Caspian Perfects formed with the auxiliary
dašt- ‘have’, but I have never observed such forms in actual speech or in texts.
Jahāngiri (2003) makes no mention of Present Perfect paradigms and he translates
the Preterits of many Lāhijāni sentences using the Present Perfect in Persian.
The stress patterns in most Tatic varieties differ between the Preterit (stress on
the past stem) and the Present Perfect (stress on the participial formant).
Kelāsi Kajali (YSFN) Gandomābi (YSFN)
Preterit Pres. Perf. Preterit Pres. Perf. Preterit Pres. Perf.
1sg bə-kǽt- bə-kæt- be-kǽtt- kætt-é- be-bræmés- bræmes-
im é-m im ym=e im é-m
‘fall’ ‘fall’ ‘cry’
Chāli (ST:197) Jubani (Rudbāri) Khoini (YX:176)

1sg be-tǽt-im be-tæt- bu-gút- bu-gut-á-m kejerést-im kejerest-
é-m om ǽ-ym
‘run’ ‘say’ ‘laugh’
Razajerdi (YSFN) Harzani (AAK: 116–18) Asālemi
1sg kǽt-ay kæt-áy yǽr= yær-ǽ= hǽrd= hærd-
(2 f.) (2 f.) m=æ m=æ əm=æ ǽ=m=æ
‘fall’ ‘hit’ ‘eat’
Māsulei: Intransitive ‘burn’ Transitive ‘eat’
Preterit Present Perfect Preterit Present Perfect
1sg sí-m=æ siǽ=m=æ hǽrd=əm=æ hærdǽ=m=æ
2sg sí-š=æ sié=š=æ hǽrd=i=æ hærdǽ=y=æ
3sg sí-Ø=æ siǽ=Ø hǽrd=əš=æ hærdǽ=š=æ
Some languages, in addition to stress differences, also have slightly different PAMs
in the Preterit vs. the Present Perfect as in the Leriki Intransitive forms below:
Leriki: Preterit vs. Present Perfect

Stems ending in Vowel
Pret. Pres. Perf.
1sg omǽ-ym omǽ-m
2sg omǽ-yž1 omǽ-ž1
3sg omǽ-y omǽ-Ø
1pl omǽ-ymon omǽ-mon
2pl omǽ-žon1 omǽ-žon1
3pl omǽ-yn omǽ-n
1neg nómæym omæ nim
‘come’
Stems ending in Consonant
Preterit Pres. Perf. Preterit Pres. Perf.
vít-im vit-ǽ-m kárd=ïm=e kard-ǽ=m=e
vít-iž1 vit-ǽ-ž1 kárd=e2 used only fronted2
vít-e vit-ǽ-Ø kárd=ïž=e1 kard-ǽ=ž=e1
vít-imon vit-ǽ-mon kárd=ïmon=e kard-ǽ=mon=e
vít-ižon1 vit-ǽ-žon1 kárd=ïn=e kard-ǽ=n=e
vít-in vit-ǽ-n kárd=ïžon=e1 kard-ǽ=žon=e1
nḯvitim vitǽ nim nḯkárdïme kardǽ nime
‘run’ ‘do, make’
740 Donald L. Stilo
1) the -ž in all PAMs can always alternate with -š; 2) these forms are avoided for
phonological reasons (kárd-e < kárd=ï=e) but this is not a problem since the Set2
PAMs are commonly fronted (§7.4.1);
(108) Leriki
ve saγ be ki tï omǽ=ž.
very healthy be sub you come:pt =2 s 1
‘Thank you very much for having come (lit: that you have come).’
gæst=o=žon=e vot-ǽ.
spite=from=3 p 2= aux say:pt - ppl
‘They have said (it) out of spite.’
In a slightly different pattern, the stress in Vafsi and possibly Sagzābādi falls on the
Punctual marker in the Preterit and on the verb stem in all Perfect forms. In Vafsi
the Past Participial -ǽ is replaced in the Present Perfect by the Perfect formant -e,
which is only apparent in the 3sg / pl of Intransitives and all persons of the Tran-
sitives. The -e formant is not used in other Perfect paradigms (§6.5.10). With past
stems ending in -a in Vafsi, the stress is the only way to distinguish the Preterit
(bǽ-resa-ym) from the Perfect (bæ-resá-ym), as in (109).
Vafsi: Preterit vs. Present Perfect
Intransitive Transitive
Preterit Pres. Perf. Preterit Pres. Perf.
1sg bǽ-mærd-e-ym bæ-mǽrd-e-ym b-ím-rutt-æ-Ø b-im-rútt-e
2sg bǽ-mærd-e-y bæ-mǽrd-e-y b-í-rutt-æ-Ø b-i-rútt-e
3sg bǽ-mærd-æ-Ø bæ-mǽrd-e b-ís-rutt-æ-Ø b-is-rútt-e
1pl bǽ-mærd-e-yam bæ-mǽrd-e-yam b-óan-rutt-æ-Ø b-oan-rútt-e
2pl bǽ-mærd-e-ya bæ-mǽrd-e-ya b-ían-rutt-æ-Ø b-ian-rútt-e
3pl bǽ-mærd-æ-nde bæ-mǽrd-e-nde b-ísan-rutt-æ-Ø b-isan-rútt-e
1neg nǽ-mærde-ym n-ím-rutt-æ-Ø n-ím-rutt-e
3neg nǽ-mærdæ-Ø nǽ-mærd-e n-ís-rutt-æ-Ø n-ís-rutt-e
‘die’ ‘sell’
(109) Vafsi
yawa soay in váxt-i, æz bæ-ræsá-yme.
until tmw this time-om I pu -arrive:pt -1 s 1
‘By tomorrow this time, I will have arrived.’
áw=esan obær n-árd-e?

water=3 p 2 to-out neg -bring:pt - perf
‘Haven’t they extracted any water?’
6.5.10. Past perfect and subjunctive perfect

All languages in the area have Past and Subjunctive Perfects, built on the formula:
(punctual bV-) + Past Participle + Past/Subjunctive, resp., of aux :be1. Lāhijāni,
however, replaces the Subjunctive Perfect with the Past Subjunctive. Most of
Talyshi and nearby Tati varieties do not use bV- (§6.5.6.2) and elsewhere it is sup-
pressed by preverbs or negatives. Mazanderani drops the final -æ of the participle
in the Perfects.
Lāhijāni Rashti
Past Perfect Subjunct. Perf. Past Perfect Subjunctive Perfect
1sg bó-xord-ə bu-m (see Past bu-gúft-ə bu-m bu-gúft-ə b-i-m
2sg bó-xord-ə b-i Subjunctive, bu-gúft-ə b-i bu-gúft-ə b-i
3sg bó-xord-ə b-u §6.5.11) bu-gúft-ə bu-Ø bu-gúft-ə be-Ø
‘eat’ ‘say’
Sāravi:
Past Perfect Preverbs Subjunctive Perfect Preverbs
1sg bæ-merd bi-me dæ-či bi-me bæ-mérd bu-(e)m dæ-či bu-(e)m
2sg bæ-merd bi (<i-i) dæ-či bi bæ-mérd bu-i dæ-či bu-i
3sg bæ-merd bi-ye dæ-či bi-ye bæ-mérd bu-e dæ-či bu-e
‘die’ ‘pick’ ‘die’ ‘pick’
Leriki: Past Perfect Subjunctive Perfect
Intransitive Transitive Intransitive Transitive
1sg vit-ǽ b-im hard-ǽ=m be vit-ǽ bu-m hard-ǽ=m bu
2sg vit-ǽ b-iž used only fronted vit-ǽ bu-ž used only fronted
3sg vit-ǽ be-Ø hard-ǽ=ž be vit-ǽ bu-Ø hard-ǽ=ž bu
1neg vit-ǽ nï-b-im hard-ǽ=m nï-be vit-ǽ nï-bu-m hard-ǽ=m nï-bu
‘run’ ‘eat’ ‘run’ ‘eat’
Māsulei: Past Perfect Subjunctive Perfect

Intransitive Transitive Intransitive Transitive
1sg siǽ-m=a hǽrd=əm=a siǽ bù-bu-m hærdǽ=m bù-bu
2sg sié-š=a hǽrd=i=a siǽ bə̀-b-e hærdǽ=y bù-bu
3sg si=a hǽrd=əš=a siǽ bù-bu-Ø hærdǽ=š bù-bu
1 neg siǽ nim=a hærdǽ nim=a siæ̀ mǽ-bu-m* hærdæ̀ =m mǽ-bu*
‘burn’ ‘eat’ *(for the negator morpheme, see
6.5.2, point 3)
742 Donald L. Stilo
As mentioned above, all Perfect paradigms in Vafsi take the punctual marker bV-
in addition to placing the word stress on the past root of the verb.
Vafsi: Intransitive
Past Perfect Subjunct. Perfect
1sg bæ-mǽrd-æ v-im bæ-mǽrd-æ vue-ym
2sg bæ-mǽrd-æ v-i bæ-mǽrd-æ vue-y
3sg bæ-mǽrd-æ v-e bæ-mǽrd-æ vuæ-Ø
1 neg nǽ-mærd-æ v-im nǽ-mærd-æ vue-ym
‘die’
Transitive
Past Perfect Subjunct. Perfect
b-im-rútt-æ v-e b-im-rútt-æ vuæ-Ø
b-i-rútt-æ v-e b-i-rútt-æ vuæ-Ø
b-is-rútt-æ v-e b-is-rútt-æ vuæ-Ø
n-ís-rutt-æ v-e n-ís-rutt-æ vuæ-Ø
‘sell’
Examples of the Past Perfect
(110) Lāhijāni (LTXT)
mualləm n-áma bu-Ø, bú-šo-im futbol bazí
teacher neg -come:pt aux : pt -3 s pu -go:pt -1 p soccer play
bù-kon-im.
pu -do-1 p
‘The teacher had not come (so) we went (out) to play football.’
(111) Sāravi (SY: 66)
čæn=ta dez ke šeter-a=re bæ-dezi bi-ne,
a.few=unc thief sub camel-pl = ra pu -steal:pt aux : pt -3 p
b-urd bi-ne sere dele
pu -gone aux : pt -3 p house in
‘A few thieves who had stolen the camels had gone into the house.’
The Subjunctive Perfect is triggered when a subjunctivizing factor (a modal,
a conditional clause, ‘perhaps’, ‘I hope’, etc.) acts on a Present Perfect: ‘it has
ended’ > ‘it must have ended’, ‘if it has ended’, ‘perhaps it has ended’, ‘I hope it
has ended’, etc.
(112) Sāravi
vε-n-e ve=re ba-ut bu-e.
must-pr -3 s he-ra pu -say:pt aux : sbj -3 s
‘He must have told him.’
šayed xord bu-em.

perhaps eat:pt aux : sbj -1 s
‘ Perhaps I have eaten (it).’ (SH:123)
(113) Leriki
gásti čæy zo-on omǽ bu-n.
perhaps his son-pl come:pt aux : sbj -3 p
‘ Perhaps his sons have come.’
Vafsi
šayæd b-is-vattæ vuæ-Ø.
perhaps pu -3 s 2-say:pt aux : sbj -3 s 1
‘ Maybe s/he as said (so).’
(114) Lāhijāni (Perfect Subjunctive merges with the Past Subjunctive; NJ: 136)
bə gəmon=əm ta amə bə́-rəs-im,, dəro təmən=a
to guess=1 s 2 until we pu -arrive-1 p harvest ended=cs
bo b-i-Ø.
become: pt aux : pt - irr -3 s
‘I guess that by the time we arrive, the harvest will have ended.’
(115) agə nəhar=a bú-xord-ə b-in,, mu gursənə mon-əm.
if lunch=ra pu -eat:pt - ppl aux : pt -3 p I hungry stay-1 s
‘If they have eaten the lunch, I will remain hungry.’ (NJ: 138)
6.5.11. Past subjunctives, conditionals, and other counterfactuals

Certain Caspian and Tatic varieties have special paradigms for counterfactual uses.
Two basic Past Subjunctive types are formed by: 1) the Subjunctive marker bV́ - +
the Imperfect built on the present stem; 2) the Past Participle + irrealis aux in an
analytic Perfect-like form. Most languages given here have more than one of these
paradigms and they often overlap in usage with the Imperfect (conditional senses)
or Past Perfect (irrealis senses). In some languages the latter two paradigms are
virtually the only possibility for expressing irrealis or conditional notions (see also
exs. 141–144).
One feature shared by types 1 and 2 is the irrealis -i- ~ -æy, also the marker of
the Imperfect in many languages. It is used in both synthetic (Leriki: b-ǽ-š-i-š, 2s)
and analytic Perfect-like forms (Lāhijāni: b-əma b-i-Ø, 3s). In the Past Subjunctive
the Asālemi Imperfect -i- changes to -e-, possibly an alternate form of -i-: -æy- >
-e- > -i-.
In Lāhijāni, Langerudi, Vājārgāhi, Rāmsari, and possibly Tonekāboni, all
person and number in the Past Subjunctive have become neutralized to a single
3sg form. This neutralization seems to be optional in some varieties as my own
744 Donald L. Stilo
elicitation materials on Lāhijāni show only personal forms, e. g., (116) vs. (117)
or (135) vs. (134).
What unites these various paradigms is their use in some or all of the follow-
ing:
1. With past necessity when the Present vs. Past of ‘must’ is only a morphological
but not a semantic distinction, e. g., bayæd/bayésti, used only tense-neutrally;
2. After the Desiderative particle kaš ~ kaške/ki ‘would that, if only, I wish’;
3. In the protasis of counterfactual conditional sentences: ‘if I knew/had known’;
4. In a restricted zone, past modals and main verbs with control features gen-
erally require a Past Subjunctive in the subordinate verb (exs. 127, 128, see
also §6.6);
5. Miscellaneous: ‘it would have been better to’; ‘I almost = it was close that I
would (do)’; ‘I would have preferred/liked to have gone’; ‘as if it were’; etc.
The range of irrealis situations is a rather complex topic that has only been mea-
gerly addressed in the literature. The present analyses must be taken as preliminary
and provisory until our knowledge on these issues is furthered. In the case of mul-
tiple paradigms, for example, the exact use of each needs to be fully investigated.
6.5.11.1. The paradigms

Past Subjunctive
Leriki I Asālemi I Māsulei I
1sg b-ǽ-žæn-i-m b-ǽ-vin-e-m bə́-vu-(y)m
2sg b-ǽ-žæn-i-ž/š b-ǽ-vin-e-š bə́-vu-(y)-š
3sg b-ǽ-žæn-i-Ø b-ǽ-vin-e-Ø bə́-vu-(y-)Ø
‘hit’ ‘see’ ‘say’
Māsulei II Harzani (MH:77) Keringāni Leriki II
(intr., ‘come’) (intr., ‘sleep’) (intr., ‘sleep’, (intrans. ‘sit’)
ex141)
1sg omǽ bì-b-i-m hetæ=m b-i-Ø het-en=æy* nïštæ b-æ-bæ-ym
2sg omǽ bì-b-i-š hetæ=r b-i-Ø het-i=æy nïštæ b-æ-bæ-yž
3sg omǽ bì-b-i-Ø hetæ=y b-i-Ø ? nïštæ b-æ-bæ-y
(trans., ‘send’) (tr., ‘close’, ex139) (trans., ‘eat’) (trans., ‘say’)
3sg vennǽ=š bì-b-i bəsda=y b-i-Ø hærd=š=æy votæ=ž b-æ-bæ-y
*(alt: het=æy=n)
Leriki III Leriki IV Asālemi II Asālemi III

Miller: ‘Preterit ‘Potential
Conditional’ Conditional’
(intrans. ‘sit’) (intrans. ‘sit’) (intrans., ‘die’) (intrans., ‘die’)
1sg nïštæ b-æ-i-m nïštæ b-ǽ-b-i-m mærdæ b-e-m mærdæ b-æ-b-e-m
2sg nïštæ b-æ-i-ž nïštæ b-ǽ-b-i-ž mærdæ b-e-š mærdæ b-æ-b-e-š
3sg nïštæ b-æ-i-Ø nïštæ b-ǽ-b-i-Ø mærdæ b-e-Ø mærdæ b-æ-b-e-Ø
(trans., ‘say’) (trans., ‘say’) (trans., ‘see’) (trans., ‘see’)
3sg votæ=ž b-æ-i votæ=ž b-ǽ-b-i vindæ=š b-e vindæ=š b-æ-b-e
Lāhijāni Langerudi/ Tonekāboni Sāravi
Rāmsari
1sg (same as 3s) (same as 3s) ? bæ-rut b-i bu-em~bu-m
2sg (same as 3s) (same as 3s) ? bæ-rut b-i bu-i
3sg b-ut-i b-i-Ø bo-gut-i b-i-Ø bé-git-e b-i-Ø bæ-rut b-i bu-Ø
‘say’ ‘eat’ ‘take’ ‘sell’
(Amini:114,19)
6.5.11.2. The various irrealis usages: conditional, past subjunctive and others
Now each of the five situations listed above that require the Past Subjunctives and
other irrealis forms are examined individually here and in §6.5.12, Conditionals:
1) Past Necessity
While the modal ‘must’ has Present and Past forms, both are often devoid of any
real sense of tense. Time is encoded by the Present Subjunctive (§6.6) or Past Sub-
junctive (or Imperfect, Conditional, etc.) in the subordinate verb. Some languages,
e. g., Leriki, Māsulei, only have one tense-invariable form of the modal ‘must’.
(116) Lāhijāni (NJ: 175)
parsal xasi qənat=a larubi b-ud-i b-i-Ø.
last.year must:pt qænat-ra dredging pu -do:pt - irr aux : pt - irr -3 s
‘We should have dredged the qanat (underground canal) last year.’
One speaker of Lāhijāni offered five alternate forms in example (117). In the short
time I worked with him, however, he never produced a Past Subjunctive form
as introduced above. I only discovered this tense when going through Jahāngi-
ri’s (2003) sentences, and then working with speakers of Langerudi and Ramsari.
746 Donald L. Stilo
(117) bayésti zutter …

must sooner …
V ariant 1 V ariant 2 V ariant 3 V ariant 4 V ariant 5
b-útt-ə=bu-m Ø-gútt-ə= Ø-gútt-am b-útt-enæ= Ø-gútt-
bu-m bu-m enæ=bu-m.
pu -say: imf -say: imf -say:pt -1 s pu -say: imf -say:
pt - ppl = pt - ppl = pt - cond = pt - cond =
aux : pt -1 s aux : pt -1 s aux : pt -1 s aux : pt -1 s
P ast P erf . I P ast P erf . II I mperfect C onditional C onditional
D ur .
‘You should have said (something) sooner.’
(118) Leriki
bæ-pe ïštæ jif=o b-ǽ-hæ-y-m.
pr -must self pocket=from sbj - dur -eat-imf -1 s 1
‘I had to eat out of my own pocket (i. e., on my own
expenses).’
tï bæ-pe vot-ǽ b-æ̀ -b-i-Ø.
you pr -must say-ppl sbj - dur - aux : pt - irr -3 s
‘You should have said (something).’
(119) Māsulei
be omæ bi-b-i-Ø. be b-u-y-š.
must come:ppl sbj - aux : pt - irr -3 s must sbj -come:imf - irr -3 s
‘He was supposed to come/to have come.’ ‘You were supposed to come.’
(Past Subjunctive II) (Past Subjunctive I)
Some languages distinguish between the Present and Past of the modal ‘must’ in
both form and function and always use the Present Subjunctive for the dependent
verb:
(120) Vafsi
Present
ær-go ǽ-ss-ame dar-an dǽ-pič-am.
dur -want pu -go-1 p 1 tree-op pvb -wrap-1 p 1
‘We have to go wrap the rees.’
Past
ær-goa ke nešanæ=s bǽ-d-a.
dur -want:pt sub address=3 s 2 pu -give-2 p 1
‘You needed to give him an address.’
(121) Razajerdi
m-o gu-m. m-os gu-m.
dur -want say-1 s 1 dur -want:pt say-1 s 1
‘I am supposed to say.’ ‘I was supposed to say.’
(122) Kordkheyli Mazanderani
yεk mεn dunε vé-n-ε há-d-i.
one man rice must-pr -3 s pvb -give-2 s
‘You have to give (us) one man (6 kilos) of rice.’ (Borjian: M2,10e)
véss-e pεla bǽ-xεr-εn.
must:pt -3 s rice pu -eat-3 p
‘They were supposed to eat pilaff.’ (Borjian: M2,14d)
In Sāravi, if the tense of the modal is neutralized (123), the Imperfect is used rather
than the Past Subjunctive (so far as is known) in the subordinate verb:
(123) vé-n-e b-ur-e. véss-e Ø-ši-ye.
must-pr -3 s pu -go-3 s must:pt -3 s imf -go:dur -3 s
‘He has to go.’ (SH: 118) ‘He had to go.’ (SH: 118)
2) The Desiderative particle kaš(ke ~ ki) ‘would that, if only, I wish’
With kaški the Past Subjunctive and the Past Perfect are both common:
(124) Lāhijāni
kaški mu b-út-ə bu-m.
if.only I pu -say:pt - ppl aux : pt -1s1
‘I wish I had said (something).’
Verb: Past Perfect
kaški bə-mərd-ə b-i-Ø.
if.only pu -die:pt - ppl aux : pt - irr -3s1
‘I wish I had died.’ (NJ: 173)
Verb: Past Subjunctive
(125) Māsulei
kaški omæ bi-b-i-Ø.
if.only come:ppl sbj - aux : pt - irr -3s 1
‘I wish he had come.’
Verb: Past Subjunctive II
Rāmsari
kaš tə́=ra bó-gut-i b-i-Ø.
if.only you=ra sbj -say:pt - ppl aux : pt - irr -3s
‘I wish I had told you’
Verb: Past Subjunctive I
748 Donald L. Stilo
(126) Leriki
kaški inglisjæ zïnæ=m b-ǽ-b-i-Ø.
if.only English know:pt =1 s 2 sbj - dur - aux : pt - irr -3s1
‘I wish I knew English.’
3) Protasis of past counterfactual conditional sentences
For the protasis in past counterfactual sentences, see §6.5.12, Conditionals.
4) Use after Past Modals (areally restricted)

In some languages, the Past Subjunctive is automatically used after all past modals
and past main verbs with control features over the dependent verb. This is a larger
areal feature that encompasses all of Talyshi, Koluri, N. Tati, as well as Colloquial
Armenian and dialects, Neo-Aramaic, and Georgian:
(127) Māsulei
Present Modal Past Modal
xa-m bÚ-š-om. xu-m ~ xu-y-m bə́-š-i-m.
want-1 s 1 sbj -go-1 s 1 want:imf -1 s 1 want:imf - imf -1 s 1 sbj -go-imf -1 s 1
‘I want to go’ ‘I wanted to go’
(128) Asālemi
b-æ-pist=im=æ bə́-(h)ær-om*
all - dur -want: pt = aux :1 s 1= aux sbj -eat-1 s 1
‘I want to eat.’ *(> [bǽrom])
æ-pi=m bæ bə́-æ-(h)ær-e-m*
dur -want=1 s 2 aux : pt sbj - dur -eat-irr -1 s 1
‘I wanted to eat.’ *(> [bǽrem])
5) Other miscellaneous situations:
Other miscellaneous situations that require a Past Subjunctive include the follow-
ing (among others): ‘it would have been better had I gone (but I did not)’; ‘you
almost (fell, etc.) = it was close that you would (fall)’; ‘I would have preferred to
go/to have gone’; ‘I would have liked to go/to have gone’; ‘(just) as if (it were)’;
etc.
(129) Rāmsari
eynæn xod-e mə́=re hæme pæšə bǽ-ze b-i-Ø.
just.as self-ez I-ra all mosquito pu -hit:pt aux : pt - irr -3s1
‘It was just as if mosquitoes had bitten me all over.’
(130) Asālemi
kæm bæ-mænd-æ gənəst-æ b-i-m ~ bə́-gən-um.
little ?-stay:pt -3 s 1 fall-ppl aux : pt - irr -1s1 sbj -fall-1 s 1
‘I almost fell (lit: little remained, i. e., it was close, that I might fall)’
6.5.12. Conditional paradigms

In the apodosis of irrealis conditional sentences, EG, Central Caspian, and Harzani
have a formal Conditional, distinct from other paradigms. These languages in fact
allow multiple alternates for both the Conditional and the Past Subjunctive. The
regular imperfect is also allowed in the apodosis as in (131–132).
Conditional Paradigms: ‘say’, ‘eat’

Lāhijāni Lāhijāni
Conditional I Conditional II
1sg Ø-gút-ena=bu-m b-út-ena=bu-m
2sg Ø-gút-ena=b-i b-út-ena=b-i
3sg Ø-gút-ena=bu-Ø b-út-ena=bu-Ø
Langerudi Harzani (MH:79)
Conditional I, (II) Condit. (ex. 139)
gút(-en)-enæ=bu-m bæ-b-e ber-in
gút(-en)-enæ=b-i bæ-b-e ber-i
gút(-en)-enæ=bu-Ø bæ-b-e ber-æ
Tonekāboni* Tonekāboni Tonekāboni
Rāmsari Conditional I Conditional II Conditional III
1s xord-enæ ba-m xár-e ba-m xár-en-e ba-m bó-xard-ene ba-m
Tonekāboni Tonekāboni
Tonekāboni
Conditional IV Conditional V
1s xárd-e ba-m xárd-en-e ba-m (cf. Past Perfect:
bó-xard-e ba-m)
*Note that Tonekāboni I, II are built on the present stem, the others on the past stem. All
Tonekāboni forms from Amini: 80-84.
(131) Tonekāboni (Protasis: Past Subjunctive, Apodosis: Imperfect; Amini: 114)

ǽge vi sǽr=e ab b-om-é b-i-Ø,
if his on=ra water pu -come:pt - ppl aux : pt - irr -3s1
jó=e tekan Ø-dæ-Ø.
barley=ra moving imf -give:pt -3 s
‘If the water had come up over it, it would have moved the rice grains
around.’
(132) Sāravi (Protasis: Past Subjunctive, Apodosis: Imperfect; SH: 152)
æger bæ-xord b-i bu-m, te=re Ø-got-eme.
if pu -eat:pt aux : pt - irr aux : pt -1 s you=ra imf -say:pt -1 s
‘If I had eaten (it), I would have told you.’
750 Donald L. Stilo
Since various languages allow some variation in both the protasis and the apodo-
sis, it is not uncommon to find conditional sentences with the same tense in both
clauses.
(133) Lāhijāni (Protasis: Conditional I/Apodosis: Conditional I)
ægər mašin rošen bo-na bu-Ø,, amo
if car lit become:pt - cond aux : pt -3 s we
šo-na b-im.
go:pt - cond aux : pt -1 p
‘If the car would start, we would go.’
(134) agə Ø-donəst-əm,, t-æ Ø-gutt-am.
if imf -know:pt -1 s you-ra imf -say:pt -1 s
‘If I knew, I would tell you’
Pro: Imperfect/Apo: Imperfect
aga Ø-gutt-i,, či bo-na bu-Ø?
if imf -said-2 s what be:pt - cond aux -3 s
‘If you told me, what would happen?’
Pro: Imperf/Apo: Cond I
(135) agə duz=a nə́-šənaxt-ə b-i-Ø, či Ø-gud-i?
if thief=ra neg -know:pt - ppl aux : pt - irr -3 s what imf -do:pt -2 s
‘If you hadn’t recognized the thief, what would you do/would you have
done?’ (NJ: 138) Protasis: Past Subjunctive/Apo: Imperfect
(136) agə bə́-xast-ə bu-m,, xórd-ena bu-m ~
if pu -want:pt - ppl aux : pt -1 s eat:pt - cond aux : pt -1 s
‘If I had wanted to, I would have eaten (it).’(Pro: Past Perf./Apo: Condit. I ),
alternatively:
bó-xord-ə bu-m ~ Ø-xórd-ə bu-m ~ bó-xord-ena bu-m.

pu -eat:pt - ppl aux : pt -1 s imf -eat:pt - ppl aux : pt -1 s pu -eat:pt - cond aux : pt -1 s
Past Perfect I ~ Past Perfect II ~ Conditional II (in order of preference)
(137) Harzani (MH: 76)
ægær bəsda=y b-i-Ø,, čök bæ-b-e ber-æ.
if close:pt =3 s 2 aux :pt - irr -3 s 1 good all -be-inf aux : pt -3 s 1
‘If he had closed (it), it would have been better.’
In those languages that have more than one possible verb form for one or both
clauses of conditional sentences, e. g., Sāravi (132) and (138), it is most likely the
case that there is a semantic difference, cf. ‘If I knew (now), I would tell you’ vs.
‘If I had known (then), I would have told you’. Not enough information is avail-
able about these issues in any one of these languages and most languages either
have no available information or no analyses have ever been made for them. See,
however, Paul (2011: 149–151) for a discussion of Past Counterfactuals in Talyshi
dialects.
In some languages the Imperfect predominates in the apodosis of irrealis sentences:

(138) Sāravi
æger Ø-ši*, ve=re Ø-di*.
if imf -go:imf :2 s he=ra imf -see:pt :2 s
‘If you went, you would see him.’ (SH: 152) *(< Ø-ši-i, Ø-di-i, 2sg.)
Rashti
ager danést-i-m,, tə́=ra gúft-i-m.
if know:pt - imf -1 s you=ra say:pt - imf -1 s
‘If I knew, I would tell you.’
(139) Leriki
gïlæ-y gàm-æ vïræ b-æ-bæ-y,, æmæ=ní ïštæ
unc - indf warm-link place sbj - dur -be: pt - irr we=also self
væy æ-tosp-ïn-í-mon.
there dur -heat-caus - irr -1 p 1
‘If there were a warm place, we could (lit: would) warm ourselves there.’
(140) Keringāni (YSFN)
hard=i=æy,, be-mard-e b-iš=e.
eat:pt =2 s 2= irr all -die-inf aux : pt -2 s 1= aux
‘If you had eaten (it), you would have died.’
æga het-en=æy,, mard-e b-in=e.
if sleep-1 s =irr die-inf aux : pt -1 s 1= aux
‘If I had slept, I would have died.’
Languages with no formal Past Subjunctive or Conditional generally use the Past
Perfect or Imperfect in the various situations presented in this and the previous
section:
(141) Vafsi
xéyli dél-i=m ær-gòa æ-č-im* šæ:r
very heart-ob =1 s 2 dur -want: pt dur -go:pt -1 s 1 city
‘I would rather have gone to town.’ *(< æt-si-im)
kaš zutær i-r-vattæ.
if.only sooner 2 s 2-dur -say: pt
‘I wish you had said sooner’
752 Donald L. Stilo
(142) ǽgæ dǽsd=i n-æj-jæ*,, nǽ-r-vušuz-i-a.

if hand=2 s 2 neg - dur -hit:pt neg - dur -collapse-pass - pt
*(n-æj-jæ < n-æd-zæ)
‘If you hadn’t touched (it), it wouldn’t have collapsed.’
(143) Alviri (HA: 172)
ægær et-omo-Ø, tæ=š e-di.
if dur -come:pt -3 s 1 you=3 s 2 dur -see: pt
‘If he came, he would see you.’
kaš et-omo-Ø.
if.only dur -come:pt -3 s 1
‘I wish he had come.’
(144) Kelāsi
m-óst-e zudtær me-še-m.
dur -want:pt -3 s 1 sooner dur -go:pt -1 s 1
‘I should have gone sooner.’
kaški zudtær a-m-vat.
if.only sooner pvb - dur -say:pt
‘I wish I had spoken up sooner.’
6.5.12.1. Other conditional uses

Note the Past Subjunctive and the Conditional in the following Lāhijāni example:
(145) hær kæsə amra itow b-ud-i b-i,, ti amra dušmən bo-na bo=Ø.
past subjunctive conditional
‘Whoever you acted this way with would become an enemy with you.’
(NJ: 123)
6.6. Modals, main verbs with control features

1) Modals (‘want, can, must’) and main verbs with control features (± coreferential
subject) usually take a Subjunctive in the subordinate verb. Leriki also optionally
allows an Infinitive (see ex. 160), the western Caspian, S. Talyshi and neighboring
Tati less so, in Vafsi occasionally, and in others only a Subjunctive is used. Past
Modals in Talyshi and N. Tati generally require a Past Subjunctive (§6.5.11.2).
(146) Lāhijāni
ušon xá-n-ən æme=məra b-a-n.
they want-pr -3 p we:poss =with pu -come-3 p
‘They want to come with us.’
xass-am bú-šu-m.
want:pt -1 s pu -go-1 s
‘I wanted to go.’
(147) Sāravi
xa-n-e telafi há-kan-e.
want-pr -3 s retaliation pvb -do-3 s
‘He wants to get even.’ (SY: 62)
Ø-xas-e b-úr-e sæfer.
imf -want:pt -3 s pu -go-3 s trip
‘He wanted to go on a trip.’ (SY: 64)
(148) šema beter bæled=eni bǽ-res-in.
you:pl better capable-2 p pu -spin(thread)-2 p
‘You know how to spin better.’ (SY: 64)
vé-n-e emsal ærusi há-kan-e.
must-pr -3 s this.year marriage pvb -do-3 s
‘He must marry this year.’ (SY: 58)
(149) Leriki
Subjunctive
pi-dæ=m ni tï-ni bḯ-voγand-ïm.
want-pr =1 s 2 aux : neg you-acc sbj -send-1 s 1
‘I don’t want to send you.’
Optative (§6.5.3)
az bæ-p-e mæktub bḯ-nïvïšt-o-m.
I fut -want-3 s 1 letter sbj -write- opt -1 s 1
‘I have to write a letter.’
(150) Vafsi
is-ær-go kay so-Ø?
3 s 2- dur -want where go:sbj -3 s 1
‘Where does he want to go?’
tæmen ær-goa henra=s bǽ-sso-m
I:ob dur -want:pt with=3 s 2 pu -go-1 s 1
‘I wanted to go with him.’
(151) Khoini (YX: 179)
pist-ær=im bé-šu-m. pist-æ bi-m sbj -šu-m.
want:pt - loc =1 s 1 sbj -go-1 s 1 want:pt - loc aux : pt -1 s 1 pu -go-1 s 1
‘I want to go.’ ‘I wanted to go.’
754 Donald L. Stilo
(152) Alviri (YA: 182)

em-e-gu-æ bǽ-š-emæ.
1 s 2- dur -want-aux :3s1 pu -go-1 s 1
‘I want to go.
em-e-gust-æ bǽ-š-emæ.
1 s 2- dur -want:pt -aux :3s1 pu -go-1 s 1
‘I wanted to go.’
EG has a special negative form of ‘can’, lacking in WG: (Pres) mə́nn(y)- (Past)
mə́nnyess-, synonymous with the usual verb of Potentiality:
(153) Lāhijāni
mu nó-ton-əm b-a-m. mə́ny-em bə́-bin-əm.
I neg -can-1 s pu -come-1 s can:neg -1 s pu -cut-1 s
‘I can’t come.’ ‘I can’t cut (it).’
Vafsi modals ‘can’ and ‘must’ take the usual TAMs, may be negated, but do not
take PAMs. (See §6.7 for a comparison of ‘want’ and ‘must’ formed from the same
root.)
(154) æz hæni n-æ-ču* v-á-yme.
I else neg - dur -can pu -come-1 s 1
‘I can’t come (here) any more.’ *(< nǽ-æt-su, root: su ‘be able, can’)
The use of the infinitive in the dependent verb is slightly more common with ‘can’
than with other modals or main control verbs. It may precede or follow the modal.
(155) Lāhijāni
mu asla orə šo-n mə́nne-m
I at.all there go-inf can:neg -1s
‘I can’t go there at all.’
Kelāsi
me-tan-ə́m bé-š-im(æ) ~ ši-æn.
dur -can-1 s 1 pu -go-1 s 1 go:pt -inf
‘I can go.’
(156) Leriki
vïškiy-e-dæ=š o-vašt-e.
can-inf -loc =2s1 pvb -jump-inf
‘You can jump.’
æv hič næm-e zïn-dæ ni-Ø.
he none bend-inf know-loc aux : neg -3 s 1
‘He can’t stoop over at all.’
6.7. Experiencer verbs

Experiencers (exp ) are marked by an Oblique case in Tatic and are coindexed by
Set2. The verbal element is an impersonal 3sg form of either a compound with the
light verbs ‘be’, ‘come’, ‘become’, etc. or a simplex verb (restricted to one verb,
‘want/ love’). Generally, ‘want’ and ‘must’ are formed on the same verb root. In
the sense of 'must' no clitics are used and the subject is in the Direct case.
(157) Vafsi
Experiencer
tæmen im-ær-gó bǽ-ss-om
esdæ i-r-gó bǽ-sse-y
taní is-ær-gó bǽ-sso-(æ)
pron : ob pam 2- dur -want pu -go-pam 1
‘I, you, s/he wants to go’
Non-experiencer
æz ær-gó bǽ-ss-om
tæ ær-gó bǽ-sse-y
an ær-gó bǽ-sso-(æ)
pron : dir dur -want pu -go-pam 1
‘I, you, s/he must go’
(158) Kafteji
mən me-gó=m bé-š-im
tæ me-gó=y bé-š-iš
ja me-gó=š bé-š-i (m.)
ja me-gó=š bé-š-eæ (f.)
pron : ob dur -want-set 2 pu -go-pam 1
‘I, you, he, she wants to go’
æz me-gó bé-š-im
tæ me-gó bé-š-iš
a me-gó bé-š-i (m.)
a me-gó bé-š-eæ (f.)
pron : dir dur -want pu -go-set 1
‘I, you, he, she must go’
Leriki (four types)
Set2 Genitive Genitive+Set2 Oblique
‘be cold’ sárd=ïm=e čïmï sárd=e čïmï sárd=ïm=e mï sárd=e
(1/3sg ) sárd=ïž=e čæy sárd=e čæy sárd=ïž=e æy sárd=e
‘be hungry’ væší=m=e čïmï væší=e čïmï væší=m=e mï væší=e
(1/3sg ) væší=ž=e čæy væší=e čæy væší=ž=e æy væší=e
756 Donald L. Stilo
(159) Asālemi Māsulei

ari=m ni-æ. xǽš=ər=æ b-am višir=i=æ?
late=1 s 2 cop : neg -3 s 1 nice=2 s 2=cop pr - come : pt hungry=2 s 2=cop
‘I’m not late.’ ‘you like’ ‘Are you hungry?’
Type one is well-known from Tehrani Persian (sǽrd=æm=e). The fact that Set2 in
Talyshi encodes Ergative Agents and exp s but not possessors is support for viewing
the Set2 PAMs encoding exp s as Indirect Affectees rather than possessives.
With ‘want/love’, Set2 is generally fronted to the Source noun or other element in
the predicate. In Leriki, the dependent verb may be an infinitive:
(160) Leriki
Fronted Set2
dï tï =ž =æn om-e pi-dæ ?
with you=3 s 2=also come:inf want-pr
‘Does he want to come with you, too?’
Unfronted Set2
piæso pi-dæ=ž=e š-e?
P.N. want-pr =3 s 2 go-inf
‘Does he want to go to Pirasora?’
(161) Kafteji
Fronted Set2 Unfronted Set2
čiči=šon me-go? čiči me-go=šon?
what=3 p 2 dur -want what dur -want=3 p 2
Oblique case, no Set2
jon čiči me-go?
they:op what dur -want
‘What do they want?’
The Experiencer structure of the verb ‘love’ mirrors the Ergative PAM set-up. It
should be noted that this alignment is found in all tenses in the Experiencer struc-
ture, but the Ergative is found only in the Past system:
(162) im-ær-góe-y; im-ær-gó-a-y ( exp .) im-æ-resd-a-y ( ergative )
1 s 2- dur -want-2 s 1 1 s 2- dur -want-pt -2 s 1 1 s 2- dur -send:pt - ppl -2 s 1
‘I love you; I loved you’ ‘I would send you’
The Set2 encoding the Experiencer in the Light Verb type is obligatorily fronted to
the NVE. Fronting farther to the left is not licensed:
(163) Vafsi
riš-isbi-an dìv=esan nǽ-tt-a-Ø æzin aqáte
beard-white-pl face=3 p 2 neg - dur -come-3 s 1 thus word
bæ̀ -kær-ende.
dur -do-3 p 1
‘The elders don’t dare talk like this.’
Caspian languages generally do not coreference an overt exp in the verb complex
since the Set2 PAMs we find in Tatic are not native to these languages. The EXP is
instead marked with the (Direct/Indirect) Objective marker =ra.
(164) Lāhijāni
sada=ji m-a xoš n-á-n-ə. m-a xow a-n-e.
plain=from I-ra nice neg -come-pr -3 s I-ra sleep come-pr -3 s
‘I don’t like the plain one.’ (NJ: 116) ‘I am sleepy.’
(165) Sāravi
xæle me=re xoš e-n-e. šema=re gærm=e?
much I=ra nice come-pr -3 s you:pl =ra hot=cop :3 s
‘I like it very much.’ (SH: 150) ‘Are you hot?’ (SH: 156)
On the other hand, depending on the given dialect or even speaker, these Set2 PAMs
“not licensed” for Caspian languages are clearly being borrowed from Persian, but
so far only in peripheral domains, such as Experiencers and Reflexives (§5.9):
(166) Lāhijāni
amu xow=emun a-n-e. but: in=a xow Ø-ama-Ø.
we sleep=1p2 come-pr -3 s he=ra sleep imf -come:pt -3 s
‘We are sleepy.’ ‘He was (Imperfect) sleepy’ (NJ: 176)
6.8. The verbs ‘BE’

In a restricted cluster of languages in the western Caspian area, we find six dis-
tinct Be-verbs that express the usual Copula functions (equation, etc.), existence,
location-at/in, and are then further subcategorized according to the animacy of the
Subject. Included are: EG, WG, Rāmsari, Tonekāboni, S. Talyshi, and Kafteji-Kelāsi.
Be1 is the Copular enclitic (Set1 PAMs) corresponding to Present PAM suffixes of
main verbs (here: ‘to sleep’), sometimes with a slight variation in vowels:
Lāhijāni Bābolsari Leriki
Present Copula Neg. Present Copula Subjunctive Copula
1sg xósən-əm =əm neəm xəs-eme =eme bḯxït-ïm =im
2sg xósən-i =i ni xəsen-i =i bḯxït-ïš =iš/ž
3sg xósən-ə =ə ne xəsen-e =e bḯxït-ï =e
758 Donald L. Stilo
Māsulei Vafsi Keringāni

Present Copula Past Present Copula Subjunctive Copula
1sg xəs-ə́m =em-æ =em-a ædoss-om =im(e) fes-ene =ene
2sg xəs-í =eš-æ =eš-a ædoss-i =i fes-iše =iše
3sg xəs-ə́ =æ =a ædoss-e =e fes-e =e
Be1 Usage: Be1 is used for equation with predicate nominals and adjectives. It can
also replace all six other Be-verbs. Other tense forms are not considered enclitics.
(167) Lāhijāni (Lā.) Leriki (Le.)
mu maríz=æm čok-æ inson=in
I sick=cop :1 s 1 good-link people=cop :3 p 1
‘I am sick.’ ‘they are good people’
Vafsi (Va.)
tawan mérd-e=nde
we:ob worker-pl =cop :3 p 1
‘they’re our workers’
(168) mu maríz ne-əm ïmru čo ni-Ø?
I sick neg : cop -1 s 1 today four neg : cop -3 s 1
‘I am not sick’ (Lā.) ‘Isn’t today Wednesday?’ (Le.)
kè=dæ n-íam
house=in neg : cop -1 p 1
‘we’re not at home’ (Va.)
(169) mu mariz bu-m. æ:la vïræ be-Ø
I sick cop : pt -1 s 1 great place cop : pt -3 s 1
‘I was sick.’ (Lā.) ‘It was a wonderful place’ (Le.)
zævin liz v-e
ground slippery cop : pt -3 s 1
‘the ground was slippery’ (Va.)
Be2 is a full form equivalent of the Copula formed on an independent root. It is
only used in the Present affirmative in Caspian and most of Tati while other tenses
and the negative forms are replaced by the corresponding forms of Be1. In Talyshi
and some Tati dialects, however, Be2 Present also serves as the basis for the Past
paradigms.
Leriki
Lāhijāni Sāravi Vafsi Pres Past
1sg ís-əm hǽss-eme h-im(e) hḯst-im hḯst-im=be
2sg ís-i hǽss-i h-i hḯst-iš hḯst-iš=be
3sg ís-ə hǽss-e h-e hḯst-e hḯst-e=be
Māsulei
Pres Past
éss-em-æ éss-em=a
éss-eš-æ éss-eš=a
éss-æ éss=a
Be2 Usage: Be2 has three main uses: 1) general existence; 2) replaces Present of Be1
(but not in N. Talyshi) in situations where the enclitic cannot occur: when stressed
for focus or as a single, independent word. It may also replace Be1 in unstressed
uses; 3) languages with no ‘have’-verb use Be2 in predicative possession (§6.9).
Existence
(170) Lāhijāni Gandomābi
yə gúrg-i ís-ə har šow a-n-ə. xoda éss-e
one wolf-indf be2-3 s each night come-pr -3 s God be2-3 s
‘There is a wolf (that) comes (around) every night.’ ‘God is./God
(NJ: 127) exists.’ (YSFN)
(171) Sāravi Leriki
æ-tta mærdi bi-ye. tæbbi dïšmen-on hïst-in.
one-unc man be1:pt -3 s natural enemy-pl be2-3 p 1
‘There was a man.’ (SY: 64) ‘There are natural enemies.’
Standing independently, stressed (in the B examples only)
(172) Lāhijāni Vafsi
(A: tu mariz n-i.) B: čére, ís-əm. (A: vəwsi ke n-e.) B: číra, h-e.
you sick be1:neg -2 s why be2-1 s Vafsi empf be1-3 s 1 why* be2-3 s
‘A: You aren’t sick. B: Yes, I am.’ A: He’s not a Vafsi. B: Yes, he is
*(‘why’ is the affirmative response to a negative question or statement.)
Simply replacing Be1, without any triggering by information structure:
(173) Lāhijāni
ki gú-n-ə u divonə ís-ə?
who say-pr -3 s he crazy be2-3 s
‘Who says he’s crazy?’
Vafsi
vəws æzin qæssæbé-i h-e.
P.N. thus village-indf be2-3 s 1
‘Vafs is this kind of village.’
Be2 is replaced by Be1 in negatives and all other tenses lacking in Be2 (also ex. 171):
760 Donald L. Stilo
(174) Vafsi Leriki

go ko hey n-e. konjo=š? ni-ž.
cow empf no.more be1:neg -3 s 1 where=cop :2 s 1 be1:neg -2 s 1
‘There just aren’t any cows any more.’ ‘Where are you (these days)?
You haven’t been (lit: aren’t)
around.’
Be3 is formed with the past participle of ‘stand’ + Be1 in various corresponding
tenses. In Māsulei, ‘stand’ has merged with ‘stay’ > mænn-, and it thus only has
five Be-verbs.
Lāhijāni
Present Negative Past
1 sg íssam náyssam íssæ bum
2sg íssay náyssay íssæ bi
3sg íssa-Ø náyssa-Ø íssæ bu-Ø
Māsulei
Present Negative Past
mænnem=æ mænnǽ nim=æ mænnem=a
mænneš=æ mænnǽ niš =æ mænneš =a
mænnæ-Ø mænnǽ ni=æ mænn=a
Be3 Usage: Be3 encodes location/existence-at/in but only for animates.
(175) Lāhijāni
Location
kelās-e no-e dæbiristan íssæ bu-m
class-ez 9-ez high.school be3 aux : pt -1 s
‘I was in ninth grade of high school’
Existence
muælləm né-ysæ bu-Ø.
teacher neg -be3 aux : pt -3 s
‘The teacher was not around.’
(176) Māsulei
sæba kæ mænn-eš=æ? tə=ni mǽnn-eš=æ?
tomorrow house be3=aux :2 s 1= aux you=also be3-aux :2 s 1=aux
‘Will you be at home tomorrow?’ ‘Will you also be around?
Be4 is formed with an older adposition *dər ‘in, at’ + Be1 in all tenses and nega-
tives, with or without a synchronic locative postposition. Be4 in its narrow sense
is found in WG, EG, Central Caspian, Southern Talyshi, and some neighboring
Tati languages. On the other hand, it is found in its wider sense quite extensively
beyond this area, as discussed under §6.8.1, ‘Diminishing Number of Be-Verbs’.
Lāhijāni Māsulei
Present Negative Past Present Negative Past
1sg də́rəm də́nnyəm də́bum dərím=æ dəryǽ nim=æ dərím=a
2sg də́ri də́nni də́bi dəríš=æ dəryǽ niš=æ dəríš=a
3sg də́rə də́nnyə də́bu dərí=æ dəryǽ ni=æ dərí=a
Be4 Usage: Be4 functions as location/existence-at/in for non-humans; uses with
humans are quite complex and are not included here due to space limitations:
(177) Lāhijāni
Location-in Existence-in
čai γuri mien də́r=ə. γuri mien čai də́r=ə?
tea teapot in be4=3 s teapot in tea be4-3 s
‘The tea is in the teapot.’ ‘Is there (any) tea in the teapot?’
(178) Māsulei
Location-in
pul če jiv-é delæ derí=æ.
money his pocket-ob in be4-3 s 1
‘The money is in his pocket.’
Existence-in
deg-é dəlæ pəla derí=æ?
pot-ob in rice be4-3 s 1
‘Is there any rice in the pot?’
Be5 is based on the past participle of ‘put’ + Be1 in various corresponding tenses.
Be5 only refers to inanimates, and thus only occurs in the 3rd person.
Lāhijāni Māsulei
Present Negative Past Present Negative Past
3sg hánna-Ø n-ə́nnæ-Ø hánnæ-bu-Ø noǽ-Ø noǽ niæ no-a-Ø
Be5 Usage: Be5 indicates existence and location of inanimates.
(179) Lāhijāni
ja:be* mien va-n-e pul hə́nna bu-n.
box:rez in must-pr -3 s money be5 aux : pt -3 p
‘There must be money in the box.’ *(< ja:bə́-ə)
pul ko hánna-Ø?
money where be5-3 s
‘Where is the money?’
762 Donald L. Stilo
(180) Māsulei Tonekāboni

pəla deg-é delæ no-ǽ? quri kuji nia-Ø?
rice pot-ob in be5-aux :3 s teapot where be5-3 s
‘Is there any rice in the pot?’ ‘Where is the teapot?’ (Stilo FN)
Be6 is the verb ‘stay’ and is rare as a Be-verb as it seems to occur only in the sub-
junctive. Since it only pertains to inanimates, it only occurs in the 3rd singular.
Lāhijāni: Present Negative
3sg bə́-mon-ə ??
Be6 Usage: Be6 indicates existence in reference to inanimates. It is an alternate to
the subjunctive of Be5. I was only able to elicit one example for Lāhijāni, but it
was spontaneously offered without any prompting for the use of this verb vs. Be5.
Parallel examples of this verb and its very specific usage also exist for Rashti.
(181) tin-ə* yek češmé-i íra bú-mon-ə.
can-3 s one spring-indf here pu -stay-3 s
‘There could be a spring here.’ *(tin-ə ~ tan-ə)
6.8.1. Diminishing number of be-verbs

As we move out of the core area of this isogloss, the number of Be-verbs decreases
to three. Mazanderani and Rudbāri have Be1,2,4 —they also extend southward at
least as far as Gaz and Esfahan (see Stilo 2007c, 2007b, resp.)— and in these lan-
guages, Be4 subsumes many functions of the other verbs. For example, Be4 in all
these languages, in addition to inanimate nouns, regularly includes animate and
human nouns in the sense of Existence/Location Existence-in/Location-in, gener-
ally avoided in WG/EG:
(182) Sāravi
æ-tta de-e dele de=ta berar dæ-i-ne.
one-unc village-rez in two-unc brother be4-pt -3 p
‘In a village there were two brothers.’ (SY: 58)
hiške me=je pirtær dæ-ni-ye.
no.one I=from older be4-neg -3s
‘There isn’t anyone here older than me.’ (SH: 150)
(183) Tutkāboni
færda xanæ dǽr-i? bǽle, dǽr-om. marγanæ dæ-ní-æ?
tomorrow house be4-2 s yes be4-1 s egg be4-neg -3 s
‘Are you home tomorrow? Yes, I am.’ ‘Aren’t there any eggs?’
6.9. ‘To have’ and ‘to have not’

In the Caspian area we find two types of predicative possession separated by a
clear isoglottic line. The Caspian family, Southern Talyshi, most Tati languages,
Persian, etc. have a transitive ‘have’ verb. To the west, Central/Northern Talyshi,
Northern Tati, as well as Kurdic, Zazaki, Hawrami, Azeri, Udi, Georgian, and
Semitic have periphrastic predicative possessive constructions formed with exist-
ence verbs (or particles/verboids). This line even separates closely related varie-
ties, e. g., Southern Talyshi from Central and Northern Talyshi or Southern and
some Central Tati from Northern and other Central Tati.
Languages of the Transitive Have-verb Zone

(184) Lāhijāni Sāravi (SY: 58)
či da-n-i? me berar zæn=o væče dar-n-e.
what have-pr -2 s my brother wife=and child have-pr -3 s
‘What do you have?’ ‘My brother has a wife and children.’
(185) Māsulei Vafsi
æz i-llæ sə̀r-æ kitáb dar-ə́m. mašin=iz dir-áme.
I one-unc red-link book have-1 s car=also have-1 p 1
‘I have a red book.’ ‘We also have a car.’
The predicative possessive construction is based on the existence verb Be2 (§6.8).
Some languages, including Iranian Azeri, however, through a process of “Have-
Drift” (Stassen 2005: 475), have adapted somewhat to the ‘have’-verb zone by
incorporating the enclitics into the verb to form an independent paradigm for the
periphrastic verb:
Leriki (Northern Talyshi)
Present Affirm. Negative Past Affirm. Pres. Perfect
hḯst=ïm=e ní=m=e hḯst=ïm=be bǽ=m=e
hḯst=e < =ï=e ní=e hḯst=e=be bǽ: < bǽ=e=e
hḯst=ïž=e ní=ž=e hḯst=ïž=be bǽ=ž=e
Colloquial Azerbaijani
Present Aff. Past Affirm.
var-ïm=di(r) var-ïm=ïdi
var-ïn=di(r) var-ïn=ïdi
var-ï=di(r) var-ï=ïdi
Most Tatic languages mark the possessor argument with the Oblique case and/or
cross-reference the possessor NP with floating Set2 clitics in the predicate.
764 Donald L. Stilo
(186) Kalāsuri (KL: 281) Kajali (KJ: 284)

hæsæn-í here=š üæ hest-æ. pænir=em hest bö́ -:ruš-em.
P.N.-ob three=3 s 2 egg exist-3 s 1 cheese=1 s 2 exist pu -sell-1 s 1
‘Hasan has three eggs.’ ‘I have cheese to sell.’
In Leriki, the possessor is commonly marked with an adposition, an Oblique case
(nouns), or Genitive case (pronouns) and Set2 is optional with an overt Possessor:
(187) Leriki
Alternative formations
kitob=ïm hest(=e). (bæ) mï=ro kitob hest=b-e.
book=1 s 2 exist(=cop :3 s 1) to I:ob =for book exist=cop : pt -3 s 1
‘I have a book.’ ‘I had a book.’
(188) čï mæktæb-í mašin=ïš hḯst=e.
poss school-ob car=3 s 2 exist=cop :3 s 1
‘The school has a car.’
čïmï šæš gïlæ æγïl=ïm hḯst=e.
I:poss six unc child=1 s 2 exist=cop :3 s 1
‘I have six children.’
Keringāni and Northern Talyshi also allow elements within the predicate other
than the possessum to host the Set2 PAMs cross-referencing the possessor:
(189) Leriki
d=æmǽn=ï čï qulloγ hïst=e?
with-we=2 s 2 what service exist =3 s 1
‘What job/service do you have with us?’
Keringāni
hæsæn-e pinju=š pæs hesta biæ .
P.N.-ob fifty=3 s 2 sheep exist cop : pt
‘Hassan had 50 sheep.’ (YSFN)
(190) Karganrudi (Central Talyshi)
hærči servæt hest=əš b-e ,…
whatever wealth exist =3 s 2 cop : pt -3 s 1
‘Whatever wealth she had,…’ (Guizzo: 64)
Harzani
hest=æm=æ here zöræ.
exist =1 s 2=cop :3 s 1 three son
‘I have three sons.’ (AAK: 148)
While examples are not numerous, we can also assume that the fronting of Set2
PAMs in the predicative possessive is not obligatory.
6.10. Valency: transitivizing and detransitivizing
6.10.1. Causatives: transitivization (synthetic and analytic)

Morphological causativization in these languages mostly functions as a transitiv-
izer and occurs only sporadically with a few transitive roots. Space only allows for
a very brief discussion of valency changing and a short list of causativized verbs.
The verbs in Table 6E loosely progress in volitionality from the most unaccu-
sative intransitive type base to the one most common transitive verbs that can be
causativized, ‘to eat’, which is generally only used when associated with hospital-
ity, i. e., ‘they fed us well’, but not for feeding a baby, etc. (See also exs. 81, 139,
214, 237, 290.)
The most common Transitivizing strategy is an analytic one, consisting of
light verb constructions, most often formed with ‘do, make’. This strategy is an
equipollent process discussed under Passivization in the next section. In addition,
causativization of transitives is most commonly encoded analytically in this area,
often using the verb ‘to give’ in the area under investigation. This type will not be
further addressed here.
In Table 6E, the intransitive base can be identified simply by dropping the
causative suffix after the root. In cases where causativization requires a phonolog-
ical change in the root, both roots are given in the format of (preverb)-intransitive
> transitive.
Table 6E: Morphological causative formation

Leriki Asālemi Māsulei Vafsi Lāhijāni Sāravi
rot > let pïx-ovn- gend-en-
rot
blaze, væš- >voš-n- væš- > ? veš-en-
be lit
boil gïl-ovn- gəl-avən- gəl-amən- viš-en-
burn tæv > tov-n- si > su-in- suz-en- suz-end-
break si- > sï-ndïn- čæk- čæk- > čak-ən- æšk-en- šk-en- ešk-end
>čok-ən-
tear, rip zïr-n- ve-zer-amən- æzerr-en- boss-end-
stick dæ-čïk- > ar-čək- > čik-amïn- dæ-duss-en- du-čək-an- de-mass-
čok-n- čək-ovn- (*Rashti) end-
rise, get ez-amən or-eyz-en- vi-ris-on- her-ess-
up end-
sleep hït-ovn- xəs-on- xəs-əmən- hoss-en- xus-on- xos-end-
766 Donald L. Stilo
Leriki Asālemi Māsulei Vafsi Lāhijāni Sāravi

laugh sïr-ovn- xur-an1 xænd-amən- xænd-en-
go næv- > gærd-ən- gærd-en- gærd gerd
around nov-n- >gard-æn4 >gard-en
arrive > ræs- > ros-n- ræs- a-res-amən- ræs-en- rəs-on- res-end-
deliver >ros-en-2
fear tars->tors-ïn- tærs-on-3 tærs-amən- tærs-en- tærs-ən- tærs-end-
run vit-ovn- tell-amən- dæv-en- tæj- >
taj-en-
jump vašt- væz- væz- >vaz- væz-en-
>vošt-ïn- >vaz-ən-1 amən-
graze5 čard- > čord- čær- > čær- > čar-ən- čærr-en-
on- ~ čo-vən- čovən-2 ~ čar-əmən-
eat hard-ovn- hær-an-1 xor-amən- hor-en-
1. Yarshater:1996; 2. Kargānrudi, Guizzo; 3. Tulārudi, Guizzo; 4. Vājārgāhi, Amirzādeh;

5. See ex. 214.
(191) Lāhijāni
Intransitive Base Verb
bu-šo-Ø ta bə-rəs-e arbáb-ə xonə.
pu -go:pt -3 s till pu -arrive-3 s boss-rez house
‘She went until she arrived at the boss’s house.’ (NJ: 143)
Transitivized Equivalent
tu i nam-á un=a rəs-ən-en-i?
you this letter-ra he=ra arrive-caus - pr -2 s
‘Will you deliver this letter to him?’ (NJ: 95)
(192) bə-da amə rahat bú-xus-im. zak-on=a bú-xus-on.
pu -give we comfortable pu -sleep-1 p child-pl = ra pu -sleep-caus
‘Let us sleep peacefully.’ (NJ: 156) ‘Put the children to sleep.’
(NJ: 149)
(193) Sāravi
taze hemdige=jem bǽ-resi-ne.
new one.other-je pu -arrive:pt -3 p
‘They just ran into each other.’ (SH: 149)
me selam=re še berar=jem bǽ-res-en.
my hello-ra self brother=je pu -arrive-caus
‘Send my greetings to your brother.’ (SH: 156)
(194) Leriki
institut-í iminji kurs oræxæ-y.
institute-ob first course end-pt -3 s 1
‘The institute’s first course ended.’
johil-í medinstitut=ïš orox-n-i-yæ.
youth-ob med.school=3 s 2 end-caus - pt - ppl
‘The young man finished med school.’
(195) Māsulei
kərə Ø-si-ə́m. kərə Ø-su-in-ə́m.
prog pr -burn-1 s prog pr -burn-caus -1 s
‘I am burning up (e. g., with fever).’ ‘I am burning (something).’
(196) Vafsi
gá-ye píš-ær-gerd-énde. ga-an píš-gerd-en!
cow-pl pvb - dur -return-3 p 1 cow-op pvb -return-caus
‘The cows are coming back.’ ‘Take/bring the cows back.’
6.10.2. “Passives”: detransitivization (unaccusatives, inchoatives, etc.)

Valency-reducing strategies in simplex verb roots form two isoglosses in the area:
1) the synthetic type using various verbal suffixes, and 2) the analytic type formed
with the past participle and the auxiliary ‘be(come)’. Most dialects of Tati and
Southern and Central Talyshi have a synthetic formation. Northern Talyshi, North-
ern Tati, and all Caspian and Tatoid languages, however, only have analytic for-
mations.
Synthetic passive markers, present/past
Hezārrudi Kajali Asālemi Māsulei Chāli Alviri Vafsi

-ax-/-ax-est -is-/-ist(e)~ista, f -i-/-i-st- -i-/-i-ǽ -i-/-i-æs(s)- -i-/-i-æst -uæ-/-i-a
These voice paradigms incorporate a much wider range than the traditional label
of “Passive” in these languages, such as unaccusative or inchoative, e. g., ‘the bud
opened’, ‘the wall collapsed’, ‘the cup broke’, etc. As the examples below show,
“Passive” is at times appropriate (‘be/get eaten/sold/thrown’), while in other cases
no passive sense is implied at all (‘spills out’, ‘tears’). (See also exs. 15, 84, 142,
330.)
768 Donald L. Stilo
(197) Vafsi
ǽgær vəws há-wešaz-uæ-Ø,,… xun bǽ-riz-i-a-Ø.
if P.N. pvb -open-pass -3 s 1 blood pu -pour-pass - pt -3 s 1
‘If Vafs (village) opens up …’ ‘The blood spilled out.’
(198) Hezārrudi (YT:465)
hærči mi-kær-em, æm ní-mi-zær-ax-e.
whatever dur -do-1 s 1 this neg - dur -tear-pass -3 s 1
‘No matter what I do, this will not tear.’
ču a-škæj-ax-est-Ø.
wood pvb -split-pass - pt -3 s 1
‘The wood split open.’
(199) Kajali (KJ: 283)
divar a-škaj-is-t-Ø tænaf æd-ruj-is-t-Ø.
wall pvb -split-pass - pt -3 s 1 rope pvb -throw-pass - pt -3 s 1
‘The wall split open (and) a rope was thrown down.’
N. Talyshi, N. Tati and all of the Caspian family form the Passive analytically.
tumóm-ə γəza bú-xord-e bu-bo-Ø.
all-ez food pu -eat:pt - ppl pu - aux : pt -3 s
‘All of the food was eaten.’ (see also ex. 135)
Sāravi (SY: 40)
parče-a bæ-rut-e bæ-i-ye.
fabric-pl pu -sell:pt aux - pt -3 s
‘The fabrics were sold.’
(201) Kalāsuri (KL: 279)
em izem onjar-e ber-ondæ ni-Ø.
this firewood mince-ppl be:inf - loc . pr ? aux : neg -3 s 1
‘This firewood won’t get cut/will just not cut.’
6.10.3. Light verb constructions and change of state

Simplex verb roots are clearly in the minority in this area. My current Vafsi
glossary consists of 138 simplex and 534 compound roots, but there are proba-
bly more simplex roots to be identified. The majority of transitive/causative vs.
intransitive/passive pairs use equipollent strategies by which Non-verbal Elements
(NVEs), esp. adjectives and nouns, are combined with Light Verbs, esp. ‘make’ vs.
‘be(come)’, resp.
Compare the following pairs derived from NVEs: (Leriki) kok ‘fat’ > kok kard-
(fat-make) ‘fatten’ vs. kok b- (fat-become) ‘get fat’; (Lāhijāni) sefid ‘white’ > sefid
gud- (white-make) ‘whitewash (wall), bleach (clothes)’ vs. sefid bo- (white-become)
‘blench/pale (of face), turn white (hair)’. Various lesser verb pairs of this type use
other sets of Light Verbs, e. g., Sāravi: šekes he-da- (defeat-give) ‘defeat’ vs. šekes
xord- (defeat-eat) ‘be defeated’; Vafsi: ra veyn- (road-throw) ‘send off’, ra gen-
(road-fall) ‘start off (going, -TR)’; Leriki: bæ æmæl vard- (to-action-bring) ‘realize,
implement, execute’, bæ æmæl omæ- (to-action-come) ‘materialize, be realized’.
Change of State: In an isogloss that includes Gilaki, Central Caspian, S. Talyshi
and all areas as far south as Vafsi, those Causative/Passive pairs formed with the
LVs ‘do’ and ‘become’ (but not other LVs) use the “Change of State” (CS) mor-
pheme =a, encliticized to the NVE (usually an adjective, occasionally a noun). In
punctual tenses either the punctual morpheme bV- or, less commonly, the =a may
be deleted, but both may also occasionally occur together. In most cases, the CS or
the punctual bV- is the only way to distinguish ‘be’ from ‘become’, (since ‘be’ does
not take the bV-: Lah. bu ‘he was’ vs. bú-bu ‘he became’), e. g., (205A) in Māsulei.
Languages marked with an asterisk in this section are outside this isogloss and
lack this morpheme.
Causatives and Passives of Non-Simplex Roots: Light Verb Constructions

Unaccusative, ‘Passive’ (Unacc.)
… harf bə-zən-i,, bidár=a nə́-bə-n-ə.
speech pu - lv -2 s awake=cs neg -be-pr -3 s
‘(If) you talk, he won’t wake up.’
Causative > Transitivized (Trans.)
ín-ə bərarzə ín=a bidár=a gud-ə.
he-rez nephew he-ra awake=cs do:pt -3 s
‘His nephew woke him up.’
(203) *Sāravi (SH: 153–154)
yæx u bæ-yy-e. še ja=re ævez ha-kord-ene.
ice water pu -become:pt -3 s self place=ra change pvb -do:pt -3 p
‘The ice melted.’ (Unacc.) ‘They changed their place.’ (Trans.)
(204) *Leriki
vot-dæ=n vašag=æn pæydo b-æ-Ø
say-pr -3 p 1 lynx=also visible be:pt - perf -3 s 1
‘They say lynx were found (back then).’ (Unacc.)
ro-y pæydo kard-e zïn-dæ-ni-Ø.
road-ob visible do-inf know- pr -neg -3 s 1
‘He can’t find the road.’ (Trans.)
770 Donald L. Stilo
(205) Māsulei
eštæ pa čel=a bæ-Ø. reč=a kærd-əm=æ.
your foot muddy=cs be:pt -3 s 1 straight=cs do:pt =1 s 2=aux
‘Your feet got muddy.’ (Unacc.) ‘I fixed it.’ (Trans.)
(206) Vafsi
kar soay támb=a r-bù-Ø.
work tomorrow finished=cs dur -become-3 s 1
‘The work will be finished tomorrow.’ (Unacc.)
kár=om zu támb=a r-kær-om.
work=1 s 2 early finished-cs dur -do-1 s 1
‘I’ll finish my work early.’ (Trans.)
Note that the adjective plus ‘be’ is the base from which the equipollent Causa-
tive-Passive pairs are derived. Since ‘be’ has no Change of State sense, =a is not
licensed.
(207) Vafsi
Stative Unaccusative
čera rušen be. čera rušen=a we.
lamp lit was lamp lit=cs became
‘The lamp was on.’ ‘The lamp went on.’
Transitive
čera rušen=om=a kærd.
lamp lit=1 s 2=cs did
‘I turned the lamp on.’
7. Alignment and marking of core arguments: nouns, pronouns, PAMs
7.1. Alignment systems and differential object marking (DOM)

Most languages of the area have DOM by which salient Ps—definite or specific
nouns (including indefinite specific, see ex. 83; Vafsi also requires animacy)—
are Accusative-marked. Non-salient Ps remain in the unmarked base form (sg ,
see §5.6).
In ra-marking languages—Caspian, Tatoid, Razajerdi, and Harzani—Nomina-
tive-Accusative alignment occurs in all tenses. Alviri has no case or P-marking.
Tatic split alignment is discussed in §7.1.3. Harzani presents an unusual case:
it has retained the formal Direct and Oblique cases of nouns (now reserved for
possessive marking, §5.4.2) but has =ra -marking of Patients and Recipients in all
tenses.
7.1.1. Nominative-accusative alignment, all tenses; =ra marking, DOM

DOM: Salient, marked Non-salient, unmarked
murγ=a xa-n-i? murγ xa-n-i?
chicken=ra want-pr -2 s chicken want-pr -2 s
‘Do you want the chicken?’ ‘Do you want (a) chicken?’
(209) Sāravi (SY: 62)
ven-e mimun=re kúš-em-be.* parsal ve mimun dašt-e.
he-rez guest=ra kill-pr -1 s last.year he guest have:pt -3 s
‘I will kill his guest.’ *(< kúš-en-me) ‘Last year he had a guest.’
(210) Rudbāri: Tutkāboni
gusfǽnd=æ bə-frúxt-om. gusfænd bə-frúxt-om.
sheep=ra pu -sell:pt -1 s sheep pu -sell:pt -1 s
‘I sold the sheep.’ ‘I sold a sheep.’
(211) Tāleqāni: Gurāni
der-im nan=e dé-m-bænd-im.
be 4-1 p bread=ra pvb - dur -bake-1 p
‘We are baking the bread.’
der-im nan dé-m-bænd-im.
be 4-1 p bread pvb - dur -bake-1 p
‘We are baking bread.’
(212) Harzani (see also ex. 280)
Tati:=ra -marking, Present
te kuhurzani læv=e čök zunest-æ.
you P.N. speech=ra good know-loc ?
‘You know Harzani well.’ (AAK: 108)
=ra -marking, Past
mæn en ödæmi=re yær=mæ
I this person=ra hit:pt =1 s 2
‘I hitpast this person.’ (MH: 308)
(213) Razajerdi (ra -marking; YSFN)
bez=æ bær ku. verg-á=ræ köšt-em.
goat=ra take mountain wolf-pl = ra kill:pt -1 s 2
‘Take the goat to the mountain.’ ‘I killed the wolves.’
772 Donald L. Stilo
(214) Alviri (no Accusative marking, Neutral alignment)

u hætmæn čarva e-čær-enden-æ.
he certainly sheep dur -graze-caus -3s 1
‘He will definitely graze the sheep.’ (HA: 174)
æsb=eš deræxt=eš bæst.
horse=3 s 2 tree=3 s 2 tie:pt
‘He tied his horse to a tree.’ (HA: 310)
7.1.2. Case-marked objects in Tatic languages: present system, DOM

Most forms of Tatic have split alignment with Accusative (Oblique case P-Marking)
in the Present. There are more complex alignments in the Past, treated in §7.1.3.
(215) Leriki
DOM: Salient, marked Non-salient, unmarked
ov-í tát kàrd-æ=š. ov=æn bárd-æ=š
water-ob warm do-pr =2 s 1 water=also take-pr =2 s 1
‘You heat up the water.’ ‘You take some water, too.’
(216) Māsulei
zoæ šellaγ-é pi-er-ə. i-llæ šellaγ mə=ra b-oær-æ.
boy whip-ob pvb -take-3 s 2 one-unc whip I=for sbj -bring-2 p 1
‘The boy picks up the whip.’ ‘Bring me a whip.’ (LM: 46, 32)
(LM: 46, 34)
(217) Vafsi (NB: Salient nouns in Vafsi must be both definite/specific and
animate)
xoda tani laze-y ná Ø-dir-è.
God his son-om keeping sbj -have-3 s 1
‘May God preserve his/her son.’
xoda tæmen lazæ há-do-Ø.
God I:ob son pvb -give-3 s 1
‘May God give me a son.’
(218) Gandomābi (YSFN)
æjem guspænd-é bé-bæ!
that:ob sheep-om pu -take.away
‘Take that sheep away!’
qessab guspænd sær a-ber-e.
butcher sheep head pvb -cut-3 s 1
‘The butcher slaughters a sheep.’
(219) Dikin Marāqei

das-ær=et mi-gir-en. man pikæ æsif á-gir-em.
hand-ob =2 s 2 dur -take-3 p I:ob want apple pvb -take-1 s 1
‘I’ll take your hand.’ ‘I want to get/buy an apple.’
7.1.3. Past tense argument alignments in Tatic

DOM is irrelevant in Ergative alignment since Ps, as Absolutives, are not marked.
Vafsi (in two of its three Past patterns), Māsulei, and some other Tatic varieties
have Double Oblique alignment and DOM thus applies to Past and Present Ps
equally here.
Only nominal alignment is addressed in this section. Pronominal alignment is
covered in §7.2. Set2 PAMs coindexing Past Agents are generally optional in Tatic.
Subjects (S) of intransitive verbs are disregarded in the schemata below. The cell
for Set1 (Direct) PAMs coindexing the P is marked [3] if it is restricted to the 3rd
person. If the language does not distinguish gender, the genders are crossed out
and should be considered common singular.
7.1.3.1. Ergative alignment of nouns and PAMs

The best examples of canonical Ergativity in Tatic are Dikin Marāqei and Vafsi III.
In the former this pattern is predominant, while it is only minor in Vafsi (see
Vafsi I and II below). This pattern in both languages also retains coindexing of P
in the verb in all persons. See §6.5.6.2 for the Objective conjugation in the past
tense in Dikin Marāqei.
NP PAM NP PAM
Agent Patient A P Agent Patient A P
m f p m f p m f p m f p
dir + + + + + + + [3]
ob + + + + + + + +
Dikin Marāqei, Vafsi III Kajali
(220) Dikin Marāqei

killig-ǽ æ zolleg-ín=eš vínd-æn.
girl-of that boy-dp =3 s 2 see:pt -3 p 1
‘The girl saw those boys.’
man=di æ jænek vínd-ian.
I:ob =also that woman see:pt -3 sf 1
‘I saw that woman, too.’
774 Donald L. Stilo
(221) Kajali (KJ: 280)

værg-on pæs-e be-hærd-end.
wolf-op sheep-dp pu -eat:pt -3 p 1
‘The wolves ate the sheeppl.’
mærdæk-i žænég-i-æ be-hind-a.
man-om woman-indf - df pu -see:pt -3 sf 1
‘A man saw a woman.’
Khoynarudi (N. Tati) occasionally marks the Past Agent with the Direct case, but
this pattern seems to be marginal and not consistent.
NP PAM NP PAM
dir (+) (+) + + [3] +
+ +
ob + + (+) (+) + + +
Khoynarudi Leriki
(222) Khoynarudi (YSFN)

Past Agent in Direct case
em ferqæči-é sǽr=čun duræ=pe.
this P.N.-dp head=3 p 2 give:pt = pvb
‘These Ferqachis spoke up (lit: raised their heads).’
Past Agent in Oblique case
ferqæči-ǘn em=čün šahar estǽræ.
P.N.-op this=3 p 2 city take:pt
‘The Ferqachis took this city.’
Northern Talyshi has Ergative alignment in nominal arguments but does not
coindex Ps in the verb in any tense. Ergativity is neutralized in plural forms (223)
since case is not distinguished in the N. Talyshi plural.
(223) ramez-í k’amïl=ïž=æn pe-gi š-e-Ø.
P.N.-ob P.N.=3 s 2=also pvb -take go-pt -3 s 1
‘Ramez took Kamil, too, and left.’
žen-on mol-on=ïžon dušæ=y.
woman-pl cattle-pl =3 p 2 milk:pt = aux
‘The women milked the cows.’
7.1.3.2. The double oblique alignment of salient NPs

NP PAM NP PAM
dir + +
+ +
ob + + + + + + + + + +
Chāli Māsulei
Chāli has both Ergative and Double Oblique alignment with nominal arguments in
the Past system, but the Ergative construction has no co-indexing of Ps in the verb:
(224) Oblique-Direct (Ergative)
æmir ærselán-e kæmær=eš de-bæst.
P.N. P.N.-om belt=3 s 2 pvb -tie:pt
‘Amir Arselan fastened his belt.’ (ST: 105, 161)
Double Oblique (+ DOM)
værziær-o bærr-on=ešo b-ašind.
farmer-op spade-op =3 p 2 pu -throw:pt
‘The farmers threw down (their) spades.’ (ST: 76, 106)
Māsulei seems to have exclusively Double Oblique Alignment (with DOM):
(225) æ mərdæk-ón əštə p-ær venn=a.
that man-op your father-ob see:pt = aux : pt
‘Those men had seen your father.’
mən əm-é tə́=ræ voǽrd=əm=æ.
I-ob this-ob you=for bring:pt =1 s 2= aux
‘I brought this for you.’
7.1.3.3. Vafsi past alignments

It should be noted that saliency in Vafsi requires both Definiteness/Specificity and
Animacy. (See Dikin Marāqei above for the schema of the Vafsi Ergative.)
776 Donald L. Stilo
Vafsi NP PAM NP PAM

dir (+) (+)
ob + + + + + + + + + + + + + (+)
Double Oblique I (SOV) Double Oblique II (OSV)
The Past in Vafsi has three different Alignments for marking core arguments:
1) The most common alignment by far is the Double Oblique I. Various strat-
egies help to disambiguate the two Oblique-marked arguments as discussed
below.
2) Vafsi III (Ergative), requiring OSV order, has become rather marginalized.
3) Also rare is the Double Oblique II (Vafsi II), also obligatorily OSV.
Double oblique I (SOV)

As mentioned, this construction is by far the most frequent of the three Past system
alignments in Vafsi. Due to DOM only salient Ps are Oblique-marked:
(226) Salient Ps Non-salient Ps
tæmen gulle-y=m bǽ-ruttæ. tæmen yey gullæ=m bǽ-ruttæ.
I:ob calf-om =1 s 2 pu -sell:pt I:ob one calf=1 s 2 pu -sell:pt
‘I sold the calf.’ (animate, definite) ‘I sold a calf.’ (animate, indefinite)
The P in Double Obliques I/II is also optionally co-indexed in the verb with Set1:
(227) P not co-indexed
belæxæræ tini kell-i=s hágirættæ.
finally he:ob girl-of 2=3 s 2 pvb -take:pt
‘Finally he married the girl.’
P co-indexed
esdæ zarru-án=i ye-dí-e-nde?
you:ob child-op =2 s 2 pu -see:pt - perf -3 p 1
‘Have you seen the children?’
Double oblique II (OSV, no agreement with A or P)
(228) hæzíri šo do dane kærg-án=oan luás-i yǽ-værdæ.
yesterday night two unc chicken-op =1 p 2 fox-om pu -bring:pt
‘Last night the fox carried off two of our chickens.’
Vafsi III: The Ergative Construction (OSV, agreement with P)

(229) in dár-e æhmǽd-i par=es dǽ-nnia-nde
this tree-dp P.N.-om last.year=3 s 2 pvb -put:pt -3 p 1
‘Ahmad planted these trees last year.’
Vafsi has various strategies for disambiguation of A and P in these constructions:
1) The Agent is always Oblique-marked. Thus if there is only one Oblique it
must be the A. This unambiguous interpretation, therefore, resolves all word
order restrictions, even if there are no PAMs to help with disambiguation:
(230) tini kelj-e hágirætæ.
he:ob girl-df 2 pvb -take:pt
agent patient patient
‘He married a girl.’
molla xə́w-i yæ-vǽrdæ v-e.
P.N. sleep-ob pu -take:pt aux : pt -3 s 1
agent
‘Sleep had overtaken Mullah.’
2) If there is at least one PAM (Set1/Set2) coindexing different persons or nouns
of different number or gender, then word order is no longer an issue;
3) If the A and P are equal in type and sense and there are no PAMs (or they are
also equal), OSV is the default word order that disambiguates A from P;
4) When Set2 is fronted, it is commonly (but not obligatorily) hosted by the P;
5) Fronting of Set2 to the Agent or over an Agent is not licensed. Thus in (227)
above, the nouns hosting the Set2 PAM can only be a P with SOV word order;
(231) Double Oblique II (OSV)
leyle-y merde-y bǽ-diæ-Ø.
boy-om man-om pu -see:pt -3 s 1
‘The man saw the boy.’
‘*The boy saw the man.’
Double Oblique I (SOV) or II (OSV)
kelj-í merde-y bǽ-diæ. (OSV)
merde-y kelj-í =s bǽ-diæ. (SOV)
man-om girl-of 2=3 s 2 pu -see:pt
‘The man saw the girl.’
‘*The girl saw the man.’
While SOV (Double Oblique I) is preferred, if the A and P are equal (both m , f , or
animate pl , or identical 3 sg / pl pronouns), Sets1&2 cannot indicate which noun is
co-indexed. Or if Set2 is unfronted or deleted, the typical Set2 host (P) is no longer
the disambiguator. Only the default OSV word order can distinguish A from P.
778 Donald L. Stilo
(232) Double Oblique II (OSV)

kærg-án=oan luas-án bǽ-værd-ende
kærg-án=oan luas-án b-ísan-værd-ende
kærg-án=oan luas-án bǽ-værdæ ⟷
kærg-án=oan luas-án b-ísan-værdæ ⟷
chicken-op =1 p 2 fox-op pu -3 p 2-take:pt (- set 1)
‘The foxes carried off our chickens.’
Double Oblique I (SOV)
paša-y dana-y =s bǽ-zæ
Double Oblique II (OSV)
dana-y paša-y bǽ-zæ-Ø
dana-y paša-y b-ís-zæ-Ø
P.N.-om P.N.-om pu -3 s 2-hit:pt -3 sm 1
‘Pasha (m.) hit Dana (m.)’
7.2. Pronominal argument-marking and alignment in Tatic

Tatic pronominal alignment is generally mixed since different persons show dif-
ferent marking or Ø-marking. Thus Object-marking in pronouns can be more
complex or less complex than nouns or may exhibit the same level of complexity,
as, for example, in Vafsi pronouns with the same three alignments in the Past as
nouns (§7.1.3.3), including the SOV/OSV issues. As for less morphological com-
plexity, most of Tatic lacks a Direct-Oblique case distinction in the 2sg , 1pl , and
2 pl .
Greater morphological complexity is seen in the cluster of Hezārrudi, Gan-
domābi, Nowkiāni, Kafteji which have innovated two new cases for P-marking
in certain persons creating a four-way case distinction: 1) Present A/S, 2) Past A,
3) Pres. P, and 4) Past P. An additional possessive yields a total of five case forms
in these persons (see §5.2.2). On the other hand, with separate tense-based splits
in both Agent and Patient marking, Alignment here is Nominative-Accusative in
the Present and Tripartite (Haig 2008: 188) in the past. In Tripartite alignment the
agent of a transitive verb (A), the patient of a transitive verb (P), and the single
argument of an intransitive verb (S) are each marked in different ways, viz., A is
ergative, P is accusative and S is absolutive. These varieties are in line with Haig’s
(2008: 189) Universal of Animacy-based splits in Iranian by which the alignment
of SAP-pronouns is never higher on the Ergativity Continuum (p. 187) than that
of nouns.
7.2.1. Tripartite and mixed pronominal inventories

Kafteji, the easternmost of the Tāromi group, shows all four cases in the 3sg , but
the 1sg has merged the case forms for the Present P and Past P with the Genitive.
See §5.2.2 for the Hezārrudi and Kafteji four-way distinction in 1st and 3rd person
singular and also Yarshater (1970) for additional Hezārrudi examples of their use.
(233) Kafteji
Subject and Present Agent Present Patient (Nom-Acc Alignment)
a bə-dævə́st-Ø. tæ æja æ-m-vin-i?
he:dir pu -run:pt -3 sm 1 you he:acc 1 pvb - dur -see-2 s 1
‘He ran.’ ‘Do you see him?’
(234) Past Agent
ja vend=eš.
he:ob see:pt =3 s 2
‘He saw.’
Past Patient (Tripartite Alignment)
jan bə-gǽ ~ jan=əm bə-gǽ.
he:acc 2 pu -catch:pt ~ he:acc 2=1 s 2 pu -catch:pt
‘I caught him/her.’
7.2.2. Mixed pronominal typologies

Dikin Marāqei pronouns show Present Nominative-Accusative (1sg , 3sg / pl ),
Present/Past Neutral (2sg / pl , 1pl ), Past Ergative (3sg / pl ) and Past Double Oblique
(1sg ) alignment.
(235) PRESENT PAST (Optional Set2 A agreement not
shown)
Nom-Acc: æv-an3SFD man1OS m-in- æv-ǽ3SOF æv-in3PD vínd- Ergative
égan3SF in3P
æv-in3PD æv-ǽ3SOF m-in- æv-on3OP æv-an3SFD
ǽn3P vínd-ian3SF
æz1SD æv-í3SOM m-in-én1S man1OS æv3SDM vínd-
æ3SM
Neutral: šömæ2P æmæ1P m-in-ó2P šömæ 2P æmæ1P vínd- Neutral
iom1P
æmæ1P tö2S m-in-óm1P æmæ1P tö2S vínd-iš2S
(Past only) ⇨ æv-on2OP man1OS vínd- Double
in1S Oblique
‘X sees/saw Y’ (All examples SOV; Direct, Oblique, Neutral case is unmarked)
780 Donald L. Stilo
As in Dikin-Marāqei, only 1sg . and 3sg /pl have Oblique case forms in Māsulei
and Asālemi, thus determining Double Oblique and Neutral alignment in Māsulei
with the addition of the Ergative in Asālemi. In Vafsi all persons except 2pl have
Oblique forms. Only in Vafsi and Eštehārdi does the Direct case of the 1sg (æz)
serve as Past Patient in a purely Ergative pattern:
(236) Eštehārdi (ST: 149) Vafsi Ergative (Vafsi III, OSV)
æz=eš bë-rësta-ym. æz tani bǽ-resda-ym
I:dir =3 s 2 send:pt -1 s 1 ‘He sent me.’ I:dir he:ob send:pt -1 s 1
7.2.3. The reassignment of the pronominal accusatives: Leriki

Northern Talyshi pronominal alignment is generally described as differing from the
nominal alignment in the use of an innovated Accusative case for pronouns (except
perhaps 3sg ) to mark the P in all tenses. An analysis of Miller’s texts—restricted
to Leriki (Table 7A) for our present purposes—reveals incipient deviation from
the use of the Accusative in the Past, while my own Leriki texts (Table 7B) show
that this trend now seems to have been completely carried through in the 100 inter-
vening years since Miller’s texts. Additional modern materials may require some
reassessment.
Table 7A: Miller (1930) texts: token count of pronoun marking

Direct Oblique Common Case Accusative Case
Total:169 1sg 3sg 1sg 3sg 2sg 1pl 2pl 3pl 1sg 2sg 3sg * 1pl 2pl 3pl
Pres S/A 28 12 21 9 5 3
Past S 6 3 5
Past A 8 2 4 2
Pres P 12 1 14 7 – 2 3
Past P 8 1 3 6 3 1
Table 7B: Stilo (2002–2004) fieldwork texts: token count of pronoun marking

Direct Oblique Common Case Accusative Case
Total:275 1sg 3sg 1sg 3sg 2sg 1pl 2pl 3pl 1sg 2sg 3sg * 1pl 2pl 3pl
Pres S/A 15 39 4 7 24 6 9
Past S 4 28 21 16 1 2
Past A 1 18 7 2 6 5
Pres P 13 8 1 – 5 9
Past P 11 8 1 2 2 0 0 0 0 0 0
The small number of tokens for this survey could be problematic and comparison
of additional Leriki as well as other dialects from Miller’s 1930 texts (collected
in 1902) vs. Stilo’s 2000’s Northern Talyshi texts will be needed to reach firmer con-
clusions—clearly a goal for future investigation. The points below on pronominal
alignment are derived from Table 7B and thus show the more current (2002, 2004
fieldwork) status of Leriki:
The Direct case forms of the pronouns in the 2000’s texts are used for:
1) Subject of transitives in the Present and all intransitive tenses (but see Point 7
below) as well as, very occasionally, the A of Past transitives (237b).
(237) az kitob-í o-bæ-gïrd-ïn-é=m.
I book-ob pvb - all -turn-caus - inf = aux :1 s 1
‘I will return the book.’
ïm diæ sæ b=æy.
this look take-pt to=he:ob
‘It (the wolf) looked at him (boy).’
2) Patient in the Past system for 3rd singular pronouns:
(238) æv=ïžon maγarǽ=dæ pæydo kard=e.
he=3 p 2 cave=in manifest do:pt = aux
‘They found her in a cave.’
ïm=ïm d=ïštæ čaš-í víndæ.
this=1 s 2 with=self eye-ob see:pt
‘I saw this with my own eyes.’
The Accusative case in the 2000’s texts is used for:
3) the Patient in the Present system. SAP pronouns and 3rd plural show 23 tokens
of the Accusative, and Ø tokens of Oblique or Common cases. The 3rd singular
show 13 tokens of the Oblique case as the only alternative in the Present.
782 Donald L. Stilo
(239) b-æ-zn-e-Ø tïn-í ïsæ? mïn-i azad kárd-æ.

fut - dur -know-inf -3 s 1 you-acc now I-acc free do-pr -3 s 1
‘Will he recognize you now?’ ‘He frees/releases me.’
Caseless Pronouns: Neutral Alignment

4) The 2000’s texts show 5 tokens of the Common case for the Patient in the Past
system (not 1st or 3rd singular) and Ø tokens of the Accusative. Since there is
no Oblique case in these persons, the A, T (i. e. the theme of ditransitive con-
structions), and P are all unmarked.
(240) Agent
b=ï toyfæ=ku=mon æmæ mæs-æ be.
to=this tribe=from=1 p 2 we hear:pt - ppl aux : pt
‘We had heard about this tribe.’
Patient
æmæ=žon γand=e.
we=3 p 2 send:pt = aux
‘They sent us.’
The Oblique case in my own texts collected in the field is used for:
5) 3sg Patient in the Present System as the preferred form. (See Pt. 3 above.)
6) Agent of Past transitives in the 1sg mï (18 tokens) and 3sg æy (7 tokens).
(241) mï xæbæ sæ=y ki…
I:ob news take:pt = aux sub
‘I asked them whether…’
æy=æn vot=ïš=e bæ taher-i.
he:ob =also say:pt =3 s 2= aux to P.N.:ob
‘And he told Taher.’
7) 1sg Patient in the Past System as the preferred form. Thus as with Dikin-
Marāqei and Asālemi (§7.2.2), N. Talyshi has a Double Oblique in the 1sg :
(242) ïm-í mï bárd=ïš=e.
this-ob I-ob take:pt =3 s 2= aux
‘He took me (there).’
ïm-í mḯn=ïž biabru ka=y.
this-ob I-ob =3 s 2 disgraced do:pt = aux
‘He disgraced me.’
8) The tokens of 1sg S-marking in the Present vs. the Past intransitives show
almost mirror-image frequencies:
(243) Present: Direct (78.9 %) Oblique (21.1 %)

az=im. mï noxaš=im e.
I=cop :1 s 1 I:ob sick=cop :1 s 1 emot
‘It’s me.’ ‘Oh, I’m sick.’
(244) Past: Direct (16 %) Oblique (84 %)
az æγïl b-im. mïn=æn ǽštim.
I child cop : pt -1 s 1 I:ob =also rise:pt -1 s 1
‘I was a kid.’ ‘I also got up.’
Differences between Miller’s 1902 texts and Stilo 2000’s texts:
M iller (1930, collected in 1902) S tilo ( fieldwork : 2002, 2004)
1. The 1sg Subject is exclusively The 1 sg Subject is predominantly
Direct in both Present and Past Direct (78.9 %) in Present system
systems
The 1sg Subject is predominantly
Oblique (84 %) in Past system
2. The Accusative case is used in both The Accusative is used exclusively in
Present and Past systems (except the Present system (except 3sg .)
3sg .)
Accusative predominates in the The P in the Past system is

Present, while in the Past, exclusively Oblique- or Common
other than 3sg , marking of P with case-marked; the Accusative seems to
the Oblique (8) or have been ousted in this context
Common (9) case is preferred over
the Accusative (4) case.
7.3. Set1/Set2: pronominal person-agreement markers

Set1 (Direct) and Set2 (Oblique) PAMs have various functions (see Table 5J, §5.3),
but not all languages display the full range. Languages with a tense-sensitive split
in the PAMs generally use Set2 to encode the A in the Past. Only a few languages
use Set1 to co-index the P in all persons in the Past. A few more only have the latter
usage in the 3rd person. Still others do not license any Set1 encoding of P in the
Past.
In addition, Set1 functions as an independent enclitic copula (see Be1, §6.8)
and Set2 is an enclitic form of the pronominal possessive (§4.2, Pt. 14). Based on a
subset of functions of Set1 and Set2, there are nine (A-J, skipping I) language types:
784 Donald L. Stilo
Set2 encodes Past A + + + + + + + + –

Set2 encodes Pres. P + + + + – – – – +
Set1 encodes Past P + (+) [3] [3] [3] [3] – – –
Set2 = Pron. Possessor + + + – + – (–)2 + +
Set2 encodes Past P – – – – – – – – +
TYPE A B 1
C D E F G H J3
1: Set1 encoding of P in the Past is significantly more obligatory in Type A than in B. In

Types C – F, the Set1 is optional in this function and is restricted to the 3rd person, marked
[3] here.
2: Set2 possessive enclitics exist in Northern Talyshi but have a highly restricted usage.
3: Type J has Nominative-Accusative alignment in nouns, pronouns, and PAMs in all tenses
The nine types:
A. Dikin Marāqei: Set2 Past A; Set2 Pres P, Set1 Past P (all persons); Set2 Possessive
(245) Set2 Possessive P Encoding, Pres. Set1 P Encoding, Past
čašm=em, čašm=et dæ-m-fa-n=eš. vínd=im-ian.
eye=1 s 2 eye=2 s 2 pvb - dur -throw-1 s 1=3 s 2 see:pt =1 s 2-3 sf 1.
‘my eye, your eye’ ‘I throw it.’ ‘I saw her.’
B. Vafsi: Set2 Past A; Set2 Pres P, Set1 Past P (all persons); Set2 Possessive
čostǽk=es, čostǽk=esan is-ær-vin-óm b-ísan-die-ym
shoe=3 s 2 shoe=3 p 2 3 s 2 - dur -see-1 s 1 pu -3 p 2-see:pt -1 s 1.
‘his/her, their shoes’ ‘I see him/her’ ‘They saw me’
C. Khoynarudi: Set2 past A; Set2 Pres P, Set1 Past P (sparingly, restricted to 3sg /pl ),
Set2 Possessive; (all: YSFN)
baju=m, zurǽ=r b-ím-kešt-an! em-e=šun kešt-ind.
sister=1 s 2 son=2 s 2 pu -1 s 2 -kill-2 p 1 this-dp =3 p 2 kill- 3 p 1.
‘my sister, your son’ ‘Kill me!’ ‘They killed these ones.’
D. Nowkiāni: Set2 past A; Set2 Pres P; Set1 Past P (restricted); no Set2 Possessive;
Patient Encoding, Present System: Set1 A (suffixed)/Set2 P (encliticized)
(248) P Encoding, Pres. Set1 P Encoding, Past
Ø-xeruš-ém=ešan. béz-i-æ i-gæt=em-a.
dur -sell-1 s 1= 3 p 2 goat-indf - df pvb -take:pt =1 s 2- 3 sf 1.
‘I sell them.’ (YSFN) I bought a goat (f.).’ (YSFN)
E. Asālemi: Set2 past A; no Set2 Pres P; Set1 Past P (restricted to 3sg / pl ); Set2
Possessive
(249) Set1 Past P Encoding,
Set2 Possessive Past A marking omitted
šæví=m, zuǽ=r mən æ merd-e vind-in.
shirt=1 s 2 son=2 s 2 I:ob that man-dp see:pt -3 p 1.
‘my shirt, your son’ ‘I saw those men.’
Set2 A fronted
tə æy-é=r vind-in?
you that-dp =2 s 2 see:pt -3 p 1.
‘Did you see them?’
F. Hezārrudi: Set2 past A; no Set2 Pres P; Set1 Past P (restricted to 3sg /pl ); no Set2
Possessive
(250) Set2 A unfronted/Set1 P(YT: 458)
be-na=m-ende æmbar.
pu -put:pt =1 s 2- 3 p 1. storage.room
‘I put them in the storage room.’
Past A marking omitted/Set1 P (YT: 458)
æd-orund-ende otaq.
pvb -throw:pt - 3 p 1. room
‘X threw them into the room.’
G. Leriki: Set2 A Past; no Set2 Pres P; no Set1 Past P; (Set2 Possessive very mar-
ginal);
Past System: Set2 A (encliticized); Patient Encoding not licensed in any tense
(251) še-m mol-on=ïm dušæ=y.
go:pt -1 s 1 cattle-pl =1 s 2 milk:pt = aux
‘I went (and) milked the animals.’
omæ maštævo niyæ=m o-šánd=e.
come:ppl morning churn-1 s 2 pvb -shake-aux
‘The next morning I churned (it).’
H. Sagzābādi: Set2 A Past; no Set2 Pres P; no Set1 Past P; Set2 Possessive
(252) Possessive Enclitics (AA2: 103) Past system: Set2 = A (AA2: 103)
xwačær=i, dæst=eš ǽsb-e=šun dǽ-bæst dar.
sister=2 s 2 hand=3 s 2 horse-dp =3 p 2 pvb -tie:pt tree
‘your sister (Obliq.), his hand’ ‘They tied the horses up to a tree.’
786 Donald L. Stilo
J. Razajerdi: no Set2 past A; Set2 Pres P; no Set1 Past P; Set2 Possessive; Set2 Past P
(253) Set2 Possessive P Encoding, Pres. P Encoding, Past
bil=em, sær=et (not available) bil bijæ zí-em=eš.
spade=1 s 2 son=2 s 2 spade with hit:pt -1 s 1= 3 s 2.
‘my spade, your son’ ‘I hit it with the spade.’
7.4. Leftward mobility of person-agreement markers (Set1/Set2)

Except for Tatoid, Oskulak Talyshi, and Razajerdi, all under heavy Caspian and/
or Persian influence, Fronting of Set2 A PAMs is universally preferred but not
obligatory in Tatic. By contrast, Set1 clitic fronting is an isogloss in a restricted
area including N. Tati, N. and C. Talyshi, as well as Armenian, Neo-Aramaic, and
Udi (Stilo 2008a).
7.4.1. Fronting of Set2 agent throughout the Tatic family

Set2 clitics encoding the Past A may front to the immediately preverbal word
(“One-Place Fronting”), most often the P, or to other elements farther to the left
(“Distance Fronting”). They may also optionally remain on the verb, or in the right
context be completely deleted. The examples below show this Set2 flexibility in
Tatic:
(254) Talyshi: Leriki
Unfronted
hækim vard-ǽ =m=e.
doctor bring:ppl =1 s 2= aux
‘I have brought a doctor.’
One-place fronting (to object)
xariji dil=ædæ gæymæt =ï sæ=y?
foreign language=in grade=2 s 2 take:pt = aux
‘Have you gotten your grades in“foreign language” (yet)?’
(255) One-place fronting (to non-object)
i birdæm=dæ =š gæt=e mï.
one moment=in=3 s 2 take:pt = aux I:ob
‘It got me all of a sudden.’
Distance fronting (to object)
pul =ïš čey dast=o sæ=y.
money=3 s 2 his hand=from take:pt = aux
‘He took the money out of his hand.’
(256) Distance fronting (to non-object)

bævædæ =mon gïlæ-y hæyæjan dæ-voni
then-1 p 2 unc - indf trouble pvb :pass:pt
‘Then we had (lit: passed) some trouble.’
Set2 deleted
mï mæktub nïvïšt-ǽ be.
I:ob letter write:ppl aux : pt
‘I had written a letter.’
In Asālemi Set2 “… attaches to the left-most element in the phonologically integral
verb phrase” (Paul 2011: 137), i. e., it floats leftward via a Wackernagel rule to a
variety of initial-position elements. Set2 may also optionally remain on the verb
or be deleted:
(257) Distance fronting: Object in 1st position
nælætí=m xəlik-í=næ kəšt=æ.
snake=1 s 2 shovel-ob =with kill:pt = aux
Non-object in 1st position
xəlik-í=næ=m nælætí kəšt=æ.
shovel-ob =with=1 s 2 snake kill:pt = aux
‘I killed the snake with a shovel.’
(258) Unfronted
av-í dəlæ=ka per-aknə́st=əm=æ.
water-ob in=ka pvb -pull:pt =1 s 2= aux
‘I pulled (it) up out of the water.’
Set2 deleted
mən æ merd vind=æ.
I:ob that man see:pt = aux
‘I saw that man.’
Unfronted
sæmæd xan-i čest-é nur =čun =æ.
P.N. P.N.-ob shoe-dp put:pt =3 p 2= aux
‘They put out Samad Khan’s shoes.’
šeme=ru korreγ =em dur=æ.
you:pl =for trouble=1 s 2 give:pt = aux
‘I gave you trouble.’
788 Donald L. Stilo

Set2 deleted
sæmæd xan-i vut=e ke…
P.N. P.N.-ob say= aux sub
‘Samad Khan said that…’
(261) Sagzābādi
Unfronted
num=eš u-jiræt =šun bæ’diæzzæman.
name=3 s 2 pvb -take:pt =3 p 2 P.N.
‘They named him Ba’di-æzzaman.’ (AA2: 103)
bæhram =eš be-zi zæmin.
P.N.=3 s 2 pu -hit:pt ground
‘He knocked Bahram to the ground.’ (AA2: 115)
(262) Distance fronting (to object)
teti =em eštæ-š owǽrd-e.
girl=1 s 2 you-also bring:pt - perf ?
?
‘I have brought you the girl.’ (AA2: 115)

Set2 deleted
bæ’diæzzæman hučči né-va.
P.N. nothing neg -say:pt
‘B. didn’t say anything.’ (AA2: 107)
(263) Nowkiāni (YSFN)
One-place fronting (to object) One-place fronting (to non-object)
tefæng =eš u-gæ. i dæfæ =š be-vint…
rifle=3 s 2 pvb -take:pt one time=3 s 2 pu -see:pt
‘He picked up his rifle.’ ‘Once he saw (that)…’
(264) Nowkiāni (YSFN)
bælkæ číz-i =r mez i-gæ.
perhaps thing-indf =2 s 2 wage pvb -take:pt
‘Perhaps you got something for wages.’
Set2 deleted
čo ma-r æjo be-va.
his mother-ob he:obj pu -say:pt
‘His mother said to him,…’
(265) Vafsi
Unfronted
b-ímresda koleng b -ísan-ard
pu -1 s 2-send:pt pick pu -3 p 2-bring:pt
‘I sent them to bring a pick.’
yey aynæ =s b-ard.
one mirror=3 s 2 pu -bring:pt
‘She brought (him) a mirror.’
(266) Vafsi
One-place fronting (to non-object)
parama ke æz bén =es úgiræ.
a.few emf from root=3 s 2 pvb -take:pt
‘(The flood) even picked up a few(trees) by the roots.’
dàre si-é =s æz ben o.bær árd
tree-ez apple-of =3 s 2 from root out brought
‘It ripped apple trees out by the roots.’
(267) Vafsi
Distance fronting (to non-object) Set2 deleted
hæzíri =m tani bǽ-diæ. zarru-an injuri yá-watt-e.
yesterday=1 s 2 he:ob pu -see:pt child-op thus pu -say:pt - perf
‘I saw him yesterday.’ ‘The guys have said this (lit: thus).’
Set2 fronting placement may help to identify the P in the Double Oblique I (§7.1.3.3).
7.4.2. Clitic restrictions in Vafsi

Set2 clitic stacking is not allowed in Vafsi. If the only available host for the clitic
already has a Set2 as possessive, the second Set2 will remain on the verb. Set2 may
combine, however, with other types of enclitics, e. g., =iz ‘also.
(268) tin-an=iz gá=san b-ísan-košd-æ.
this-pl =also cow-3 p 2 pu -3 s 2-kill:pt - ppl
‘They slaughtered their cows, too.’
(269) Unfronted (optional)
yeki ke=dæ b-ís-æsdæ.
one house=in pu -3 s 2-leave:pt
‘He left one (of them) at home.’
790 Donald L. Stilo
Fronting restricted: clitic stacking

šæ:r=om vel=on kærd-e.
city=1 s 2 loose-1 s 1 do:pt - perf
‘I have abandoned my country.’
See also Experiencer verbs (§6.7) for similar examples of Set2 fronting with ‘to
want’.
7.4.3. Set1 fronting

Set1 clitic fronting in West Iranian is restricted to Northern Tati, Northern and
Central Talyshi, occurring only in analytic tenses: Present, Future, Progressives,
Perfects, and in N. Tati in the Imperfect as well. This fronting is triggered by infor-
mation structure by which the Set1 encliticizes to the stressed focused element of
the sentence, including even the Subject. Set1 fronting is an areal feature also seen
in Armenian, Neo-Aramaic, and Udi and its triggers are basically the same for both
the Iranian and non-Iranian languages mentioned here. Since Set1 clitics are also
the aux or copula, this fronting phenomenon also pertains to both Present and Past
aux forms:
(270) Leriki
Present Future (see also ex. 45)
čič=e vót-dæ? čï kovra=ž b-æ-va?
what=aux :3 s 1 say-loc from where=aux :2 s 1 fut - dur -bring
‘What is he saying?’ Where will you bring (it) from?’
(271) Leriki
Past Progressive
tï čič=b-iž vot-é-dæ?
you what=aux : pt -2 s 1 say-inf -loc
‘What were you saying?’
Past Perfect
tožæ=b-im az omæ kæ.
new=aux : pt -1 s 1 I come:pt house
‘I had just come home.’
(272) Viznei
Present Past Progressive
tə čič=iš kárd-æ? tə čič=b-iš kard-æ?
you what=aux :2 s 1 do-loc you what=aux : pt -2 s 1 do-loc
‘What are you doing?’ (YSFN) ‘What were you doing?’ (YSFN)
(273) Asālemi
Present
æz æ otaq-í=ka=m b-æ-xət.
I that room-ob = ka = aux :1 s 2 pr - dur -sleep:pt
‘I sleep in that room.’
(274) Keringāni
Present Present Perfect
hæsæn=en pül get. nun=em=en pæt-a.
P.N.=aux :3 s 1 money take:pt bread=1 s 2=aux :3 s 1 bake:pt - perf
‘Hassan gets money.’ ‘I have baked bread’ (YSFN)
7.4.4. Simultaneous fronting of Set1 and Set2

Leriki transitive verbs in the Perfect tenses allow for both Set1 and Set2 to front
simultaneously. Since the triggers for fronting are different for each set, they may
front together or separately to different items (see also Keringāni 274, above):
(275) æyo ko kam=ïm =e kard=æ.*
there work little=1 s 2= aux :3 s 1 do=aux
‘I worked there very little.’
*(scrambled version of: kam ko=m=e kard-æ ‘worked very little’)
idaræ=dæ=m =e ko kard=æ.
office=in=1 s 2= aux : perf work do=aux
‘I (have) worked in an office.’
(276) Past Perf., Fronted together Past Perf., Fronted Separately
tarix iminji=m =be nïvïštǽ. tarix=ïm iminji =be nïvïštǽ.
history first-1s 2=aux : pt wrote history-1s 2 first=aux : pt wrote
‘I had registered (lit: written) history as (my) major (lit: first).’
7.5. Ditransitives: recipients and addressees

Most Caspian and Tatic languages exhibit Neutral Ditransitive typology (T=P=R):
the Theme (T), Recipient (R) of Ditransitive verbs, and Patient (P) of monotran-
sitives are identically flagged with =ra in Caspian, Harzani, Razajerdi and Tatoid
and for the most part with the Oblique case in the rest of Tatic. This strategy is also
called the Double Object construction. This typology does not apply to non-salient
Ps and Ts which are unmarked while the R is always flagged with the respective
strategies mentioned, producing an Indirective typology, i. e., P=T≠R (see also
Stilo 2010). There are also other minor adpositional patterns that flag the R, yield-
ing Indirective typology. N. Talyshi of the Republic of Azerbaijan uses the latter
typology exclusively.
792 Donald L. Stilo
Depending on the language, Addressees may be flagged in the same way as Rs

although frequency counts may reveal that some languages show slight differences
in postverbal vs. preverbal word orders of Recipient and Addressees, respectively.
See §9.2 for a discussion of word order issues for Recipients and Addressees.
(277) Lāhijāni
mu bú-furt-əm digə́r=e*.
I pu -sell:pt -1 s other-indf : ra
‘I sold (it) to another (i. e., someone else).’ *(digə́r=e < digə́r-i=a)
(NJ: 61)
mu ketab=a un=a há-da-m
I book=ra he=ra pvb -give:pt -1 s
‘I gave him the book.’
Addressees
(278) be:tær bu-Ø t-æ b-utt-ə bu-m.
better cop : pt -3 s you-ra pu -say:pt aux : pt -1 s
‘It would have been better had I told you.’
tu k-e* b-ut-i?
you who-ra pu -say:pt -2 s
‘Who did you tell?’ (NJ: 55)
(*ki ‘who’ + =a > ke)
(279) Sāravi
Recipient
sabsere dær-e ve=re ešnasek de-n-e.
landlord be 4-3 s he=ra hint give-pr -3 s
‘The landlord is giving him a hint.’ (SY: 60)
Addressee
qeziye=re še hæmsaye=re bæ-ut-e.
issue=ra self neighbor=ra pu -said-3 s
‘He told the situation to his neighbor.’ (SY: 62)
(280) Harzani
Recipient Addressee
en jöb=e te=re ü-müt=mæ. merd=e öt=mæ.
this word=ra I=ra pvb -teach=1 s 2 man=ra say:pt =1 s 2
‘I taught you this word.’ (MH: 466) ‘I said to the man.’ (AAK: 78)
Northern Talyshi flags the Recipient and the Addressee exclusively with the adpo-
sition bæ ‘to’, while Asālemi appears to use both Oblique case and bæ strategies,
and Māsulei seems to use only the case-marking strategy, as do most Tatic varieties.
(281) Leriki
Recipients
b-æ-do=n æy bæ mï?
fut - dur -give=3 p 1 that:ob to I:ob
‘Will they give that to me?’
Addressees
æv-on bæ mï ní-bæ-vot-e=n.
that-pl to I:ob neg - fut - dur -say-inf =3 p 1
‘They won’t tell me.’
(282) Asālemi
sif-í b-æ-da=m æ:mæd-í.
apple-ob fut - dur -give=1 s 1 p.N.-ob
‘I’ll give Ahmad the apple.’
ræis-í bæ mən vat-æ ki…
chief-ob to I:ob say:pt - aux sub
‘The chief told me that…’ (Paul: 191)
(283) Māsulei
pul-e a-do=š-æ mən.
money-ob pvb -give:pt =3s 2 I:ob
‘He gave me the money.’
æ mærdæk-é əčə=y vat=æ?
that man-ob what=2 s 1 say=aux
‘What did you say to that man?’
7.5.1. Vafsi ditransitives

Vafsi has three strategies for ditransitives (see also Stilo 2010 for a fuller discus-
sion of ditransitives in Vafsi). Of these, the two major constructions are:
• the Double Object Construction (DOC) in which the R is Oblique-case marked;
• the Indirect Object Construction (IOC) in which the R is flagged with adposi-
tions.
The DOC, by far the most common ditransitive construction in Vafsi, typically
(but not obligatorily) places the R argument in postverbal position:
794 Donald L. Stilo
(284) in ke t-æsbir-ám esdæ.

this house dur -entrust-1 p 1 you.ob
‘We are turning this house over to you.’ (VF: B5,13)
púl=om há-da hæsǽn-i.
money=1 s 2 pvb -gave P.N.-ob
‘I gave (the) money to Hassan.’
As for the IOC and adpositional flagging, the preposition o occurs in the Vafs sub-
dialect mostly with the verb ‘give’ as an occasional alternate to the DOC:
(285) kellæ dokán-i=s háda o merde-y.
key shop-om =3 s 2 pvb -gave to man-om
‘He gave the man the key to the shop.’
in leyle-y æt-æsbir-om o esdæ.
this boy-om dur -entrust-1 s 1 to you:ob
‘I am entrusting this boy to you.’
In most languages of western Iran, a very limited set of verbs, esp. ‘write’ and
‘send’, require the adpositional flag ‘for’ to mark the Recipient rather than an
allative flag:
(286) Vafsi
ketab=om bæ-restá=ve tani=ra. nawǽ=m bǽ-nivisdæ esdæ=ra.
book=1 s 2 pu -sent=aux he:ob =for book=1 s 2 pu -sent=aux he:ob =for
‘I had sent him a book.’ ‘I wrote you a letter.’
(287) Sāravi (SH: 157)
æt-ta kaqez še zena vesse bæ-nvišt-eme
1-unc letter self wife for pu -write:pt -1 s
‘I wrote my wife a letter.’
The Addressee in Vafsi is most commonly flagged by the preposition dæ or some-
times by the Oblique case as in the DOC. The Addressee is generally preverbal.
(288) dæ tawan hič=es ná-wattæ.
to we.ob nothing=3 s 2 neg -said
‘He didn’t say anything to us.’
e-yz=æm in ær-vaz-om esdæ.
I.dir =also=also this dur -say-1 s 1 you.ob
‘And I will tell you this.’ (B12.5)
The third, or minor, Ditransitive pattern of Vafsi is the PAM Ditransitive Construc-
tion, an indexing strategy that encodes a pronominal R with Set2 PAMs directly on
the verb. With ‘give’, the Set2 clitic is obligatorily fronted to the Theme. Examples:
(289) is- ær-vaz-om. ær-goa ke nešanæ =s bǽ-d-a.

3 s 2-dur -say-1 s 1 dur -want: pt sub address=3 s 2 pu -give-2 p 1
‘I’ll tell him/her.’ ‘You had to give him an address.’
7.6. Other dative-like usages

A few other minor patterns with a Dative sense take Caspian =ra marking or Tatic
Oblique-case marking. They seem to be triggered by lexical items that require
them:
(290) Vafsi
æ-čuæ* esdæ tæmen bǽ-ræs-an-e?
dur -can you:ob I:ob pu -arrive-caus -1 s 1
‘Can he bring you up to me (i. e., make you my equal?)’ *(æ-čuæ <
æt-suæ)
Lāhijāni (NJ: 115)
pəriruz šəmə=rə či bu-Ø?
day.b.yest you:pl =ra what cop : pt -3 s
‘What was wrong with you the day before yesterday?’
(291) Sāravi (SY: 58)
še mar=e mæmnundar bi-ye.
self mother=ra grateful cop : pt -3 s
‘She was grateful to her mother.’
Māsulei
tə mən Ø-mon-e.
you I:ob pr -look.like-2 s
‘You look like me.’
8. Subordination
Subordination types all share the feature of being introduced by the Universal Sub-
ordinator ki/ke (sub ) in their underlying structures. This survey of Subordination
briefly covers Adverbial (‘when, if, since’), Relative, Complement, and Purposive
clauses.
8.1. Word order typology of subordinate clauses

As we shall see in §9, word order issues in the languages of this area indicate a
rather mixed typology. Subordinate clause types also demonstrate an equally mixed
796 Donald L. Stilo
word order typology. They are generally left-branching, but right-branching in a

significant number of major subordinate clause types. What Diessel (2001) shows
for the word order of Persian adverbial clauses holds equally true for most, if not
all, languages of western and central Iran: Conditionals predominately precede the
main clause, Temporals generally do, Causals are mixed, and Purposive (to which
I will add Complement and Relative) clauses always follow the head.
8.2. Adverbial clauses

Adverbial clauses have the following characteristics:
1. They are introduced by a subordinate conjunction, in which case the sub may
be retained or optionally deleted (exs. 292–303);
2. Additionally, the temporal, causal, or conditional subordinate conjunction is
commonly deleted. The sub is then retained, and the semantics of the three
types are all neutralized. With this deletion, if the enclitic sub ki/ke is in initial
position, it must be shifted to the right of the first constituent (exs. 304–313);
3. If the adverbial clause is shifted to after the main clause and if the subordinate
conjunction is deleted, the sub is no longer initial and is not right-shifted;
4. Initial adverbial clauses have a very distinctive intonational contour (also see
pitch levels as discussed in §3.6), marked with (,,) in this chapter: all the usual
word and sentence stresses and pitch of the whole clause are neutralized, the
first syllable of the clause gets a high pitch, and the last syllable of the clause
has an extra-long extension with the pitch consistently rising to extra-high.
5. Right-shifted initial adverbial clauses have a different distinctive contour: all
stresses are again neutralized, the pitch remains flat and at an extra-low level
throughout the clause.
6. The subordinate conjunction and the sub may also be optionally deleted, in
which case the only marker of subordination is intonation (exs. 314–318);
Examples marked with * below indicate that both the adverbial subordinate con-
junction and the Universal Subordinator ki/ke are present.
8.2.1. Temporals
(292) Lāhijāni
və́xti mu ún=a bə-dye-m,, u xott-í du-bu-Ø.
when I he=ra pu -see:past -1 s he sleep:inf be 4- aux : pt -3 s
‘When I saw him, he was sleeping.’
(293) Sāravi (SH: 151)

vǽqti æt kæm-i gæt bæ-yy-e,, še mar-e
when one little-indf big pu -become:pt -3 s self mother-rez
qædr=e fæ:m-en-e.
value=ra understand-pr -3 s
‘When she gets a little older, she will appreciate her mother.’
*(294) Leriki (MT: 114)
váxtï=ki æ moy=š vind=e,…
when=sub that fish=3 s 1 see=aux
‘When he saw that fish,…’
Kajali (KJ: 285)
vǽxti=ke jællad=eš be-hind,…
when=sub executioner=3 s 2 PU-SEE:PT
‘When she saw the executioner,…’
*(295) Vafsi
hǽni=ke vá-nde yeyláq-i séyl-æ b-áw-e,,
when=sub say:pt -3 p 1 winter.camp-om flood-df pu -come:pt -3 s 1
æz bǽ-tærsa-yme.
I pu -fear:pt -1 s 1
‘When they said a flood came at the winter camp, I got scared.’
8.2.2. Conditional clauses

(for additional examples see also: Conditionals, §6.5.12)

agə bə́-xa-m futbol bazí bù-kun-əm,,
if pu -want-1 s soccer game pu -do-1 s
‘If I want to play football,…’
age mu dubare bə́-kye-m,,…
if I again pu -fall-1 s
‘If I fall again,…’
798 Donald L. Stilo
(297) Sāravi
æger te bé-xa-i,, xærim-be*,...
if you pu -want-2 s
‘If you want, I’ll buy (it).’ (< *xærin-me) (SH:143)
æge hič ja mimuni b-úr-em.

if any place party pu -go-1 s
‘If I go to a party anywhere, ...’ (SY:58)

(298) Leriki
ægær merd-í ïštæ nom=ïž sipriš noa bu-Ø,,…
if man-ob self name=3 s 2 elder put aux : sbj -3 s 1
‘If a man has called himself a wiseman,…’
*(299) Vafsi
ǽge ke æz pa=y æt-eyšer-ome,,…
if sub I foot=2s2 dur -squeeze-1 s 1
‘If I squeeze your leg, …’
ča æmíq-i ǽgæ bæ-zæn-ende,,…
well deep-indf if pu -hit-3 p 1
‘If they strike a deep well,…’
8.2.3. Causal clauses

There are two primary types of causal adverbial clauses. The čon, čun, čunki ‘since’
type usually precedes the main clause, while the ‘because’ (Persian: bæráye ínke,
etc.) type generally follows. The latter type is discussed below in §8.3.
čun nəhar šime məymon b-im,, nú-tonəs-im orə
since lunch your:pl guest cop -1 p neg -can:pt -1 p there
bə́-š-im.
pu -go-1 p
‘Since we were your guest for lunch, we couldn’t go there.’
(301) Sāravi (SH: 143)
čon xæssu=me, sere mund-em-be*. (< *munn-en-me)
since tired=cop :1 s 1 house-pr -1 s stay-pr -1 s
‘Since I’m tired, I’ll stay home.’
(302) Māsulei
čon hæva gærmtær=a bo-Ø, mosafer hey
since weather hotter=cs become:pt -3 s 1 traveler keep.on
æræγ kær-ə.
sweat do-3 s 1
‘Since it got hotter, the traveler keeps sweating.’
(303) Vafsi
čun næf ’e xosusi man beyn=dæ nǽ-v-e,, tæmam
since gain-ez private in middle=in neg - cop : pt -3 s 1 all
esteqbal=esan kærd
welcome=3 p 2 do:pt
‘Since there wasn’t any private gain involved, they all welcomed it.’
8.2.4. Subordinate conjunction deleted, ki shifted

Temporals
dišow ki hæmdiga=ra bə-d-im,,…
last.night sub each.other-ra pu -see: pt -1 p
‘Last night when we saw each other,…’
dər-ə=ji ki b-ə́ma-m birun,,
door-ez =from sub pu -come:pt -1 s out
‘When I came out the door,…’
(305) Sāravi (SY: 66)
zena ke sere=je b-urd-e,,…
woman sub house-from pu -go:pt -3 s
‘When the woman left the house,’
(SY: 54)
sob ke bæ-i-ye,,…
morning sub pu -become:pt -3 s
‘When it became morning,’
(306) Leriki (MT: 115)
ï xæbæ-y ki bæ podšo ræsæ-Ø podšo mæyus
this news-? sub to king arrive:pt -3 s 1 king depressed
be-Ø.
become:pt -3 s 1
‘When this news arrived to the king, the king got depressed.’
800 Donald L. Stilo
(307) Hezārrudi (YT: 466)

ævvæl ruze ke æz xordæyal vi-em,, čemen dad
first day sub I child cop : pt -1 s 1 my father
be-mbæ-Ø.
pu -die:pt -3 s 1
‘In the beginning when I was a kid, my father died.’
(308) Vafsi
næzdíke əwdaní ke vǽ-ræsa-nde,, vel=esan kærd.
near-ez village sub pu -arrive:pt -3 p 1 loose=3 p 2 do:pt
‘When they got near the village, they released (it).’
Conditional Clauses
šəmə ki išon-ə ama qa:r n-in, pəs čərə
you:pl sub they-rez with angry neg : cop -2 p then why
išon-ə xonə n-a-n-in?
they-rez house neg -come-pr -2 p
‘If you are not mad at them, then why won’t you come to their house?’
Causal Adverbial Clauses
tu ki riš=a təraš-ən-i, čera ti sibil=a
you sub beard=ra shave-pr -2 s why your mustache=ra
nə-zən-i?
neg -hit-2 s
‘Since you shave your beard, why don’t you shave your moustache?’
(311) Vafsi
in ke soan há-da tæmen,, æz is-æt-ar-om
this sub you:pl pvb -give:pt I:ob I 3 s 2- dur -bring-1 s 1
soan goftošunúd hà-kær-a.
you:pl conversation pvb -do-2 p 1
‘Since you gave me this, I will bring him so you can converse (with him).’
The rightward shifting of the sub may also occur more than one place to the right:
ferqæči-ün em=čün šahar ke estǽræ,,
P.N.-op this=3 p 2 city sub take: pt
‘When the Ferqa people took this city,…’
Sagzābādi (AA2: 107)

lælæ jem gæp=eš ču be-šnowost
nanny this:ob speech=3 s 2 sub pu -hear:pt
‘When the nanny heard this,…’
(313) Vafsi
par awan ke vǽ-ss-iam kongræ…
last.year we sub pu -go:pt -1 p 1 congress
‘Last year when we went to the congress,…’
8.2.5. Subordinate conjunction and ki deleted (temporal, conditional, causal)
(314) Lāhijāni
šom xord-ə=də b-im,, nahid b-ə́ma-Ø ame xonə.
dinner eat-inf = be 4 aux : pt -1 p P.N. pu -come:pt -3 s our house
‘When/while we were eating dinner, Nahid came to our house.’ (NJ,134)
(315) Sāravi (SY: 58)
e-tta tim karess-ene,, sæt-ta tim æmel
one-unc seed plant:pt -3 p 100-unc seed produced
Ø-yæmu-Ø.
imf -come:pt -3 s
‘If they planted one seed, a hundred seeds would grow.’
(316) Leriki
æy bḯ-vind-o-ž, bæ-zn-e=ž?
he:ob sbj -see-opt -2 s 1 fut -know-inf =2 s 1
‘If you see him, will you recognize him?’
(317) Vafsi
in pæs=dæ tt-á-nde,, æz rótt=i m-eyz.
this time=in dur -come-3 p 1 from place=2 s 2 neg : imper -rise
‘This time when they come, you don’t get up from your spot.
(318) Vafsi
šuraw bǽ-zen-ende,, áwæ obǽr ætt-à-Ø.
P.N. pu -hit-3 p 1 water to-out dur -come-3 s 1
‘If they drill in Shuraw, the water will come out.’ (i. e., they’ll strike
water)
802 Donald L. Stilo
8.2.6. Right-shifted adverbial clauses
(319) Lāhijāni
mu n-ə́mə-na bu-m agə bú-donsə
I neg -come:pt - cond aux : pt -1 s if pu -know:pt
b-i-m itorə=y.
aux : pt - irr -1 s this.way=cop :3 s
‘I wouldn’t have come if I had known that it was this way.’
mu bu-šo bu-m ki u b-ə́ma-Ø.
I pu -go:pt aux : pt -1 s sub he pu -come:pt -3 s
‘I had (already) left when he came.’
Leriki
bærk=imon še-dæ ki társ-dæ=mon
strong=1 p 1 go:inf - pr sub fear-pr - pr =1 p 1
‘We are walking fast because we are scared.’
(321) Vafsi
molla-y bawæ ǽ-veš-e ke hoqqæ=s o tawan zæ.
P.N.-om father pu -burn-3 s 1 sub trick=3 s 2 to we:ob hit:pt
‘May Molla’s father burn (in hell) because he has played a trick on us.’
8.3. Right-branching subordinate clauses (main-sub)

Subordinate clauses whose expected word order is main - sub primarily consist of
three types: Causal II, Complement, and Purposive clauses. Relative clauses also
follow the head noun but may precede (as one alternate) in N. Talyshi (338–39).
(322) Leriki (MT: 100)
Causal II
æv=ïš nḯ-znæ=e sǽpinæ æy ki ve bæ
s/he=3 s 2 neg -know=aux because.of that:ob sub very to
yandï ǽ-š-i-n.
each.other dur -go- imf -3 p 1
‘He didn’t recognize her because they looked a lot like each other.’
8.3.1. Complement clauses
(323) Lāhijāni
ama bə́-d-im ki mi pier=əm b-áma-Ø.
we pu -see:pt -1 p sub my father=also pu -come:pt -3 s
‘We saw that my father came in.’
(324) Leriki
vínd=ïž=e ki ïm še-Ø jali.
see=3 s 2= aux sub this go:pt -3 s alone
‘(The wolf) saw that this one went off (from the pack) alone.’
ve saγ be ki tï omǽ=ž.
very healthy be:imper sub you come:pt =2 s 1
‘Thank you very much for having come. (lit: that you have come).’
vin=čun=æ1 asb-ün bær an=æ2 qefel.
see:pt =3 p 2=aux horse-op door be dem= cop :3 s locked
1
(< vind=šun=æ)
2
(for copula type see at §6.5.8.1)
‘They saw that the horses’ (stable) door is locked.’
(326) Vafsi
esdæ ná-watt-æ ke an čiz b-e.
you:ob neg -say:pt - ppl sub that what aux : pt -3 s 1
‘You didn’t say what that was.’
b-ísdi ke nǽ-r-buæ-Ø.
pu -3 s 2-see:pt sub neg -be:3 s 1
‘He saw that it isn’t possible.’
8.3.2. Purposive clauses

Purposive clauses are usually introduced either by a subordinate conjunction that
also means ‘until’ (ta, etc.) or by the sub ke/ki. Both subordinators may also be
deleted. The verb in the Purposive subordinate clause is always in the Present
Subjunctive:
dariyuš i harf-on=a zen-ə ta b-əyn-ə širin či
P.N. this speech-pl = ra hit-3 s until pu -see-3 s P.N. what
gu-n-əy.
say-pr -3 s
‘Dariush is saying these things (words) to see what Shirin will say.’
804 Donald L. Stilo
(328) Sāravi (SH: 150)

višter bæ-xos ta æsæbuni n-ævv-i.
more pu -sleep until angry neg -become-2 s
‘Sleep more so you don’t get angry.’
(SH: 143)
bi-yamu-Ø ke ve=re bǽ-ver-e.
pu -come:pt -3 s sub he=ra pu -take-3 s
‘He came to take him.’
(329) Leriki
čæpæ=mon ka ki gin nə́-bu-n.
fence=1 p 2 do:pt sub lost neg -become-3 s
‘We made a fence so they don’t get lost.’
Sagzābādi (AA2: 105)
be-š-ime da jiæ b-ar-em.
pu -go-1 s 1 until she:ob pu -bring-1 s 1
‘I went to bring her back.’
(330) Vafsi
ær-go vel-an áw-æ há-de-y yáwa
dur -want flower-pl water-df pvb -give-2 s 1 until
nǽ-melesen-u-æ.
neg -wither-pass -3 s 1
‘You have to water the flowers so they don’t dry up (wither).’
(331) æhmǽdi vasettǽ=s kæ̀rdæ-ve ke awan ašdí
P.N. mediator=3 s 2 do:pt = aux : pt sub we peace
hà-kær-am.
pvb -do-3 p 1
‘Ahmad (had) acted as a mediator so that we would make up.’
8.3.3. Postposed relative clauses (see §8.4.1 for preposed)
(332) Sāravi
un merdi=re ke vin-d-i me piyer=e.
that man=ra sub see-pr -2 s my father=cop :3 s
‘The man that you see is my father.’
Vafsi
kelje ke mákk=a w-è, indi=e.
girl sub lost=cs be:pt -3 s here=cop :3 s 1
‘The girl who got lost is here.’
Relative clauses modifying an object are often extraposed to the right, particularly if
they are longer than the verb of the main clause (cf. a similar situation in German).
(333) Lāhijāni
xatire-i bu-Ø ki mu dašt-əm.
memory- restr cop : pt -3 s sub I have:pt -1 s
‘It was a reminiscence that I had.’
ja-ha-yi hanna ki mu nú-šo-m.
place-pl - restr be 5 sub I neg -go:pt -1 s
‘There are places I haven’t gone to’
mu xəyli čiz-on=a don-əm ki gut-ə mənne-m.
I many thing-pl = ra know-1 s sub say-inf can:neg -1 s
‘I know a lot of things that I can’t say.’
Since the subordinator ke is invariable, NPs that have a non-core argument role in
the relative clause (‘whose, with whom’, etc.) are generally replaced by a resump-
tive pronoun, either full form or Set2, encoded in the same role as the underlying
NP:
(335) Vafsi
an merde-y ke nawǽ=s i-r-pærsa, ke ve-Ø?
that man-restr sub name=3 s 2 2 s 2-dur -ask:pt who cop : pt -3 s 1
(nawǽ=s ~ tani nawǽ)
(name=3s2 his name
‘Who was that man whose name you were asking about?’
8.4. Non-finite types of subordination in Leriki

In N. Talyshi temporal and purposive clauses have common non-finite alternatives
built on the infinitive of the main verb and an adposition: temporals with the loc-
ative postposition =ædæ and purposives with the benefactive preposition bo, as in
Leriki:
Temporal
(336) čoštæ hard-í=ædæ ov=ïm hárd=e.
breakfast eat-inf = loc water=1 s 2 eat:pt = aux
While eating breakfast, I drank water.”
tï væy b-i=ædæ, bárd=e?
you there be:inf =in take:pt =2 s 2
‘When you were there, did you take (him/her with you)?’
806 Donald L. Stilo
Purposive clauses
(337) penj dæγiγæ íyo mánd-imon bo hærækæt kard-e ïštæn
five minute there stay-1 p 1 for movement do-inf self
tosp-ïn-e.
warm-caus - inf
‘We stayed there for five minutes to move around (and) warm ourselves.’
8.4.1. Preposed relative clauses

Leriki has both postposed and preposed Relative clauses. The latter variant is a
non-finite type based on the past participle with the same structure for core (S, A,
P) or oblique roles (Possessive, Locative, etc.) of the deleted NP:
Core functions of deleted NP

(338) dï asp-í šæ kæs hḯst=e.
with horse-ob go:ppl person exist:=cop :3 s 1
‘There are people who go by horse.’
čïmï bæ tï doæ kitob kónjo=ye?
I:poss to you give:ppl book where=cop :3 s 1
‘Where is the book I gave you?’
Peripheral functions of deleted NP

(339) ïštï gæp žæ zoæ kïv=e?
you:poss speech hit:ppl boy where=cop :3 s 1
‘Where is the boy you were talking to?’
ko kard-æ vïræ=dæ tï tok-í žæ.
work do-ppl place=in you current-ob hit:pt
‘The (electric) current struck you at the place where you work.’
9. Additional word order issues
Most Tatic and Caspian languages exhibit predominately left-branching typol-

ogy but there are significant exceptions, e. g., right-branching Relative, Com-
plement, and Purposive clauses, consistently postverbal Goals and Recipients,
among others. In all of the languages in this area, esp. Caspian, we find many (and
still on-going) syntactic copies from Persian, e. g., Set2 Oblique enclitics (§6.7),
various right-branching patterns in the NP (the Ezafe, prepositions, etc.). In addi-
tion, all of Tati and most of Talyshi are also in heavy contact with Azeri, which is
predominately left-branching.
We have already discussed the left- and right-branching patterns in subordina-

tion (§8.1) above. In addition, languages such as Vafsi, located in transitional zones
between the northerly left-branching languages and the southerly right-branching
languages (see Stilo 2005), have highly mixed typologies in adpositions (§5.7.1),
genitives (§5.4.3), and, to a lesser extent, adjectives. In the next section we will
see further right-branching and mixed word order typologies of Patients/Objects,
Goals, Recipients and Addressees.
9.1. Postverbal Objects

As attested throughout the examples given in this chapter, the basic word order
of core arguments of the clause is SOV, but Object-Verb word order is not rigid
or absolute. While postverbal objects are commonly heard in speech, they occur
exclusively with definite nouns in these languages (see exs. 239, 255, 281). Fre-
quency counts have not yet been conducted for most of these languages, but we
can take the situation found in Miller’s (1930) Leriki corpus, Lazard’s (1979)
Māsulei corpus and comparable segments of my own Lerik texts and of Amini’s
Tonekāboni texts in Table 9A as typical.
Table 9A: OV/VO word order

Leriki:Miller Leriki:Stilo FN Māsulei:Lazard Tonekāboni: Amini
Def OV 47 88.7 % 32 86.5 % 214 98.6 % 63 96.9 %
Def VO 6 11.3 % 5 13.5 % 3 1.5 % 2 3.1 %
Indef OV 11 100 % 34 100 % 84 100 % 27 100 %
Indef VO 0 0 % 0 0 % 0 0 % 0 0 %
In addition, it is interesting to see just how many non-core arguments are preverbal
in a language such as Vafsi. While frequency counts for postverbal objects have
not yet be done for Vafsi, the following table shows just how common other post-
verbal elements are.
Table 9B: Vafsi preverbal vs. postverbal non-core arguments

Argument Addressee Temporal Ablative Locative Comit/Instr. Bene- Recipient Goal
factive
Preverbal 96.3 % 95.2 % 93.2 % 88.9 % 84.2 % 64.6 % 33.7 % 13.8 %
Postverbal 3.7 % 4.8 % 6.8 % 11.1 % 15.8 % 35.4 % 66.3 % 86.2 %

808 Donald L. Stilo
The most striking observation about this pull chain is the predominant postverbal
nature of Goals vs. the predominant preverbal position of the cluster of Address-
ees, Temporals, Ablatives, Locatives, and Comitative/Instrumentals. Note that the
Locative at 88.9 % preverbal is almost the mirror-image of the Goal at 86.2 % post-
verbal. It is my opinion that the Goals have initiated this pull chain and since
Recipients often share many features with Goals cross-linguistically, as do Bene-
factives with Rs, these arguments followed in the chain as seen in the table. There
is still a rather large gap between the three arguments on the right and all five
others to the left in the table.
9.2. Goals, recipients, and addressees

As we see in the Table 9B for Vafsi, Goals are generally postverbal by near-univer-
sal, yet optional, preference throughout the area. In Borjian’s Kordkheyli texts, for
example, slightly less than 10 % of Goals are in preverbal position. By contrast, I
found no postverbal Locatives in this corpus. Note that Goals, except in Leriki, are
generally unmarked by adpositions or case, i. e., are Ø-marked. Word order, while
not obligatory, is mostly the distinguishing factor.
(340) Lāhijāni
ba:d bú-šo-m xonə. m-a bə́-vərd-I bimarestan.
then pu -go:pt -1 s house I-ra pu -bring:- pt -3 s hospital
‘Then I went home.’ ‘He took me to the hospital.’
(341) Sāravi (SY: 54)
mimun-a burd-ene še sere. men b-ur-em bazar.
guest-pl go:pu : pt -3 p own house I pu -go-1 s market
‘The guests went to their house.’ ‘…(that) I go to the bazaar.’
Goals in Lazard’s corpus are 84.4 % postverbal. Goals in Miller’s Leriki corpus are
87.9 % postverbal, predominantly bæ-marked. The following Leriki examples are
from the texts gathered in my own field work in 2002–2006:
(342) Leriki
še-m bæ sivers bæ væyæ.
go:pt -1 s 1 to P.N. to wedding
‘I went to Si-Vers to a wedding.’
telli tokí=ædæ omæ-Ø iyo?
P.N. dark=in come:pt -3 s 1 here
‘Did Telli come here in the dark?’
(343) Māsulei (LM: 56, IV, 19) (LM: 34, I: 7)

tə Ø-bær-əm čəmən kæ. še-næ če otaγ-e xab.
you pr -take-1 s 1 my house go:pt -3 p 1 his room-ez sleep
‘I will take you to my house.’ ‘They went to his bedroom.’
(344) Vafsi
pæs búri-a vəws! bǽ-resa-Ø bæ yey hæsar.
then come:imper -2 p 1 P.N. pu -arrive:pt -3 sto one yard
‘Then come to Vafs!’ ‘He got to a courtyard.’
The following are some frequency counts for tokens of Recipients and Addressees
from texts in four languages covered in this chapter:
Language Recipients Addressees
Preposed Postposed Preposed Postposed
Kordkheyli 19 90.5 % 2 9.5 % 15 100 % 0 0 %
Māsulei 24 100 % 0 0 % 33 100 % 0 0 %
Leriki 19 60.3 % 14 39.7 % 2 66.7 % 1 33.3 %
Vafsi 32 33.7 % 63 66.3 % 51 96.3 % 2 3.7 %
10. Sample texts with interlinear glossing
The following are four short texts as examples of naturalistic speech in Lāhijāni,
Sāravi, Leriki, and Vafsi.
10.1. Lāhijāni: a reminiscence, Faramarz Vakof

The Lāhijāni /a/ and /ə/ phonemes are transcribed here with subphonemic pronun-
ciations.
zæmán-i ki mu kəlas-e no-e dɛbiristan issæ=bu-m,,
time-restr sub I grade-ez nine-ez high_school be 3= aux : pt -1 s
muællim né-ysæ=bu-Ø,
teacher neg - be 3= aux : pt -3 s
When I was in ninth grade of high school, the teacher wasn’t there,
n-ǽmæ=bu-Ø. libas-ə æmra hæminjuri bú-šo-im
neg -come:pt = aux : pt -3 s clothes-rez with just_like_this pu -go:pt -1 p
yek-tə zæmín-ə dirin ki
one-unc land-rez into sub
he hadn’t come. We went just like that with (our regular) clothes on into a field
that
810 Donald L. Stilo
hævali-ə mædrese bu-Ø. bú-šo-im

proximity- ez school be:pt -3 s pu -go:pt -1 p
was near the school. We went
futbal bazí bù-kon-im. bæ:d æz nim saət
soccer game pu -do-1 p after from half hour
to play football. After half an hour,
ye dæfærí mu bï-kɛt-im=o
one time I pu -fall:pt -1 s =and
all of a sudden I fell down and
mi zanu-e čɛp xéyli dɛrd kùn-i.
my knee-ez left very pain do-3 s
my left knee was hurting badly.
nigá b-ùd-əm bə́-de-m yɛ tike pare sæng bə́-šo-Ø
look pu -do: pt -1 s pu -see:pt -1s one piece piece stone pu -go:pt -3 s
mi zanu miɛn.
my knee in
I looked (and) saw (that) a little piece of stone pierced my knee.
u zæmon tilifon=o i čiz-on né-dašt-im. bæ:d
that time telephone=and this thing-pl neg -have: pt -1 p then
yɛk=tæ æz zak-on-ə
one-unc from child-pl - re
In those days we didn’t have telephones and these things. Then I got on the
tærk-e čærx bə́-ništ-əm æ m-æ bə́-vərd-i bimarestan.
back-ez bike pu -sit: pt -1 s he I-ra pu -bring:- pt -3 s hospital
duktər niga b-ud-i
doctor look pu -do: pt -3 s
back of one of the guys’ bikes (and) he took me to the hospital. The doctor looked
bə-di-Ø xéyli væxim=i.
pu -see:pt -3 s very serious-cop :3 s
and saw that it’s very serious.
bæ:d æz du saet æmæl-e jærahi æmæ bə́-d-im
after from two hour operation-ez surgical we pu -see:pt -1 p
After two hours of surgery, we saw that
ki mi piɛr=əm b-ǽmæ-Ø. m-æ sævar-i mašin
sub my father=also pu -come:pt -3 s I-ra mounted-ez car
b-úd-i bə́-vərd-i xoní.
pu -do:pt -3 s pu -bring:pt -3 s house
our (i. e., my) father also came in. He put me in the car (and) brought (me) home.
mæjbur=a bo-m čar mah æsa-yə æmra rá bù-šo-m,

forced=cs be: pt -1 s four month cane-rez with road pu -go:pt -1 s
I was forced to walk with a cane for four months.
bæ:tha mutəvæje bu-bo-m ki
later attentive pu -be:pt -1 s sub
Later on I realized that
ægæ bə́-xa-m futbol bazí bù-kun-əm, baási xéyli ihtiát
if pu -want-1 s soccer game pu -do-1 s must:pt very caution
if I want to play football, I had to be very careful.
bù-kun-əm. hæmin ihtiat
pu -do-1 s this.very caution
Just this caution
baés bù-bo-Ø mu de nə́-ton-əm futbal bazí bù-kun-əm.
cause pu -be:pt -3 s I more neg -can-1 s soccer game pu -do-1 s
caused me not to be able to play football any more.
čon hæmiše i tɛrs
since always this fear
Since there was always a fear
ye júr-i hǽnnæ=bu-Ø ki ǽge mu dubare bə́-ke-m,,
one way-indf be 5=aux : pt -3 s sub if I again pu -fall-1 s
momken=i bazhæm
possible=cop :3 s again
somehow that if I should fall again, perhaps
i jærahǽt-i ya zæxmí ye číz-i bú-bu-Ø. in
one surgery-indf or injury one thing-indf pu -be:sbj -3 s this
bais bú-bu-Ø ki be-ja=ye
cause pu -be:pt -3 s sub to-place-ez
there would be a surgery or an injury (or) something again. This caused me
in-ki mu herekæt bišter bú-kun-əm,, bišter bú-š-əm
this-sub I movement more pu -do-1 s more pu -go-1 s
kitabxaní=o i jur čiz-on.
library=and this kind thing-pl
to go more to the library and things like this instead of moving around more.
812 Donald L. Stilo
10.2. Sāravi (from Yoshie 1996: 64)
zena ruz-a pæmbe golek-a=re Ø-get-e Ø-ši-ye kele

wife day-pl cotton wad-pl = ra imf -say:pt -3s imf -go:imf -3 s river
lu, un-a=re Ø-reši-ye.
lip it-pl = ra imf -spin:pt -3s
Every day the wife would take the cotton wads, go the river edge (and) spin them.
væk-a kele lu qorqor Ø-kard-ene, ve=re xial=e
frog-pl river lip croak imf -do:pt -3 p s/he=ra imagination=cop :3 s
ke
sub
The frogs were croaking on the river bank (and) she imagined [exp. vb.] that
ven-e kar=je irad ger-n-ene. ve=re læj bæ-it-e
s/he-rez work-from criticism take-pr -3 p s/he=ra spite pu -take:pt -3 s
pæmbe-ha=re dæ-kard-e
cotton-pl = ra pvb -do:pt -3 s
they were (lit: are) criticizing her work. She got spiteful [exp. vb.] (and) dumped
the cotton
kele-e u dele. bá-ut-e bǽ-ir-in, ǽge šema beter
river-rez water inside pu -say:pt -3s pu -take-2 p if you better
bæled=eni bǽ-res-in,
able=cop :2 p pu -spin-2 p
(wads) into the river water. She said, ‘Take them, if you can spin (them) better,
bǽ-res-in! væk-a æz tærs bǽ-pperess-ene u-e dele.
pu -spin-2 p frog-pl from fear pu -jump:pt -3 p water-rez inside
pæmbe golek-a=m
cotton wad-pl =too
you spin (them)! Out of fear, the frogs jumped into the water. And the cotton
wads
b-urd-e u-e tæ. b-urd-e æ-tta ču
pu -go:pt -3s water-rez bottom pu -go:pt -3s one- unc stick
bæ-it-e u=re behæm Ø-zu-Ø
pu -take:pt -3 s water-ra together imf -hit:pt -3 s
went to the bottom of the water. She went, got a stick (and) was stirring up the
water
še pæmbe=re bǽ-ir-e. væk-a=re Ø-gat-e men šeme

own cotton-ra pu -take-3s frog-pl = ra imf -say:pt -3 s I you:pl
sere=re xerab ka-m-be
house=ra ruined do-pr -1 s
to get her cotton (back). She kept saying to the frogs, ‘I’ll ruin your house
[a standard curse]
me pæmbe-ha=re há-d-in!
my cotton-pl = ra pvb -give-2 p
Give me my cotton!
hæminti=ke u=re læt Ø-zu-Ø, ven-e mærdi
just.as=sub water=ra thrash imf -hit:pt -3 s s/he-rez husband
sǽr bæ̀ -resi-ye. bæ-di-ye
nve pu -arrive:pt -3 s pu -see:pt -3 s
‘Just as she was thrashing the water, her husband suddenly arrived. He saw
æ-tta či u-e dele bærq-bærq zæn-d-e.
one-unc what water-rez inside shine-shine hit-pr -3 s
something in the water was (lit: is) shining.
b-úrd-e bǽ-it-e, bǽ-di-ye
pu -go:pt -3s pu -take:pt -3s pu -see:pt -3 s
He went, picked (it) up, (and) saw
xešt-e tela=ye. čæn-ta=re še bǽ-it-e čæn-ta=re
brick-ez gold=cop :3 s few-unc = ra self pu -take:pt -3 s few-unc = ra
há-da-e zena-e sær=o
pvb -give:pt -3 s wife-rez head=and
they were (lit: are) gold bricks. He picked up a few of them himself and a few of
them he put on his wife’s head and
b-úrd-ene sere. mærdi ruz-a xešt-a=re Ø-værd-e
pu -go:pt -3 phouse man day-pl brick-pl = ra imf -take.to:pt -3 s
bazar pul Ø-kard-e,
bazaar money imf -do:pt -3 s
they went home. Every day the man would take the bricks to the bazaar, changed
(them) into money,
Ø-yard-e sendux-e dele ja Ø-da-e. zena=re
imf -bring:pt -3 s chest-rez inside nve imf -give:pt -3 s wife-ra
Ø=gat-e dæs nǽ-zen.
imf -say:pt -3 s hand neg -hit
he would bring (it home and) hide (it) in a chest. He would say to his wife, Don’t
touch (it)!’
814 Donald L. Stilo
10.3. Lerik text: encounters with a Bear, Xanbala Musayev, Pirasora village
mï ïmsor avgust-æ mang=ædæ šæ=b-im bæ višæ.

I:ob this.year August-link month-loc go:pt = aux : pt -1 s 1 to forest
This year in the month of August I had gone to the forest.
dum=o mï glæ čk-æ spæ hḯst=e.
back=abl I:ob unc good-link dog be 2= aux :3 s 1
Behind me is a good dog.
višæ dïlæ=dæ i kæræ=n bæ-n-é=m
forest inside=loc one time=too fut -know-inf =1 s 1
In the forest all of a sudden I see

glæ-y šï be-še-Ø.
unc -indf thing out-go:pt -3 s 1
something came out.
diǽ=m kàrde bæ-n-é=m hïrs=e. ha næisæ
look=1 s 2 do:pt fut -know-inf =1 s 1 bear=3 s 1 yes something
I looked and I see it is a bear. Well, really,
č-ïm-i nav=o vit-e æ-zï-n-im.
abl -dem : prox -ob front= abl run- inf dur -know-neg -1 s 1
I won’t be able to outrun it.
bæ do bé-šü-m,, ïm=æn
to tree pvb -go-1 s 1 dem : prox -too
If I climb a tree, it also
be-bæ-š-e-Ø. bæ-n-é=m hežo om-é-dæ-Ø.
pvb -fut -go-inf -3 s 1 fut -know-inf =1 s 1 always come-inf - loc -1 s 1
will climb up. I see it is still coming.
dï dæhšæti
with fright
Frighteningly
p-i-dæ=ž=e bḯ-kšt-o-Ø odæm-í.
want-inf - loc =3 s 2= aux sbj -kill-opt -3 s 1 person-ob
it was going to kill you (lit: a person).
sïpæ sǽ-gnì-ye č-æy.
dog pvb -fall:pt -3 s 1 abl - dem : dist -ob
Well, the dog went after it.
sïpæ=n kïšt-é-dæ=be. sïpæ=n dàst nḯ-kæšæ-y.

dog=too kill-inf - loc = aux : pt -3 s 1 dog=too hand neg -pull:pt -3 s 1
It was going to kill the dog, too. And the dog did not give up.
vít-e.
flee:pt -3 s 1
It (the bear) ran away.
pešï č-æ-y se čo ruž-í bæ digæ, še-m
then abl -dem : prox -ob three four day-ob to other go:pt -1 s 1
ižæn rást omæ̀-y.
again meeting come:pt -3 s 1
Then three or four days after that, I went (and) it encountered (me) again.
ovči b-ǽ-bæ-y-Ø, tifang=ïm pe-gætæ-bæ-y, az=æn
hunter sbj - dur -be:pt - irr -3 s 1 rifle=1 s 2 pvb -take:pt -be:pt - irr I=too
gásti
maybe
If there had been a hunter, (or) if I had picked up a gun, I maybe
b-ǽ-zn-i-m žæ-y. ænjæx æv… čæy gužd
sbj - dur -know- irr -1 s 1 hit:pt - inf only it dem : dist : gen meat
čæy pi
dem : dist : genfat
could have shot it, too. Only it… it seems they say its meat (and) its fat
güya vót-dæ=n dæmon=e. ovči omæ-y
supposedly say-loc =3 p 1 medicine-cop :3 s 1 hunter come:pt -3 s 1
vot=e ïsæ bḯ-žæn-o-m,,
say-aux now sbj -hit-opt -1 s 1
is supposedly medicinal. A hunter came along (and) said, if I shoot it now,
vot=e gám=e. čæy gužd bú bæ-kà-Ø.
say-aux warm-cop :3 s 1 dem : dist : gen meat smell fut -make:pt -3 s 1
he said, it is hot. The meat will go bad (i. e., will start to smell).
10.4. Vafsi text
buælisina yey řu ræfiq bǽ-v-e yek-i henra æz iran

Avicenna one day friend pu -be-:pt -3 s 1 one-ob with from Iran
æ-čí-nde
dur -go:pt -3 p 1
Avicenna once made friends with someone (as) they were going from Iran
816 Donald L. Stilo
mæmlekæte xareji. b-ísdi yey næfær šaér-i yey

country-ez foreign pu -3 s 2-see:pt one HCL poet-indf one
ketáb-i vær=es=dær=e.
book-indf side=3 s 2= loc = cop :3 s 1
to a foreign country. (There) he saw a poet (and) there was a book next to him.
bawæli in kætab hágiræ hey dær=es-gærd-en-a æz
Avicenna this book pvb -took-pt repet pvb =3 s 2-turn-caus - pt from
ævvæle ketáb-i
first-ez book-ob
Avicenna picked up the book and kept leafing through it from beginning
yawa axære ketáb-i murur=es a-kærd=o hár=esda
until last-ez book-ob going.over=3 s 2 pvb -do:pt =and pvb =3 s 2-give:pt
o saab ketáb-i.
to owner book-ob
to end of the book, and then he gave it back to its owner.
hǽni-ke bǽ-ssi-nde ša-y palu in šaere-y kætab tæqdim
when- sub pu -go:pt -3 p 1 shah-ob side this poet-om book present
kærd-e ša-y.
do:pt - perf ? shah-ob
When they went to the Shah, this poet presented the book to the king.
ša-y hágiræ naw=es bǽ-kærd=o va-Ø,
shah-ob pvb -take:pt look=3s2 pu -do:pt =and say:intr : pt -3 s 1
The Shah took it and looked at it and said,
xéyli næč kætab=e.
very good book=cop :3 s 1
‘This is a really nice book.
bawælisina va, aqa, an kætab tæmen=e,
Avicenna say:intr : pt -3 s 1 sir that book I:ob =cop :3 s 1
tini ne.
he- ob cop : neg :3 s 1
‘Avicenna (then) said, sire, that book is mine, it is not his.’
in šaeræ va-Ø ke an duru r-vaz-e.
this poet say:intr : pt -3 s 1 sub dem : pron : dist lie dur -say-3 s 1
The poet (then) said (that), ‘He is lying!
šeš sal há-nešeste-ym
six year pvb -sit:pt -1s1
For six years I sat (and)
zæ:mæt=om bǽ-keša kætab=om bæ-nnevisde.

trouble-1 s 2 pu -pull:pt book=1 s 2 pu -write:pt
labored (and) I wrote the book.
tæ æte r-vaz-i tæmen=e?
you now dur -say-2 s 1 I:ob = cop :3 s 1
Now you say ‘it’s mine’?
va-Ø, æz hær sætr tæ há-pærs-i ke či
say:intr : pt -3 s 1 from every line you pvb -ask-2 s 1 sub what
nevisdæ, æz pak æz bær
write:pt : ppl I all from memory
He said, ‘Any line you ask me (about) what is written there, I will tell it all
ær-vaz-om veréwn=i=ra.
dur -say-1 s 1 opb II=2 s 2=for
to you by heart.’
æz hær væræq=esan dǽ-gerd-en-a,, buælisina-y tæman
from every sheet=3 p 2 pvb -turn-caus - pt Avicenna-om all
Whatever page they turned to, Avicenna
æz bær
from memory
recited it all by heart.
bǽ-xand. šaer-æ va-Ø ke tæmen ba-watt-e æmma
pu -read:pt poet-def say:pt -3 s 1 sub I:ob pu -say:pt - perf but
The poet said, ‘I have written (lit: said) it but
vir=om=dæ nǽ-mænd-e. ša va-Ø ke kætab pæs
memory=1 s 2=in neg -stay:pt - perf king say:pt -3 s 1 sub book then
tini=e esdæ n-e.
he-ob =cop :3 s 1 you:ob cop : neg :3 s 1
I don’t remember it (by heart).’ The Shah said, ‘then the book is his, not yours.’
bæzáni buælisina va-Ø, næ, kætab tini=e.
then Avicenna say:pt -3 s 1 no book he:ob = cop :3 s 1
Then Avicenna said, ‘no, the book is his.
tæmen man kæšdi=dæ hár=om‑giræ
I:ob inside boat=in pvb=1s2-take:pt
I picked it up on the boat
818 Donald L. Stilo
æz bala yawa pain murur=om kærd-e,

from top until bottom review=1 s 2 do:pt - perf
hefz=om kærd-e.
memorization=1 s 2 do:pt - perf
in the boat (and) leafed through it from top to bottom and memorized it.’
Abbreviations
1p 1pl. PAM cs Change of State

1p1 1pl. PAM, Set1 f feminine
1p2 1pl. PAM, Set2 def Definite
1s 1sg. PAM dem Demonstrative
1s1 1sg. PAM, Set1 df Direct Feminine
1s2 1sg. PAM, Set2 df 2 Direct Feminine II
2p 2pl. PAM dir Direct Case
2p1 2pl. PAM, Set1 dist Distal
2s 2sg. PAM dp Direct Plural Case
2s1 2sg. PAM, Set1 dur Durative
2s2 2sg. PAM, Set2 emf Emphatic
3p 3pl. PAM emot Emotive particle
3p1 3pl. PAM, Set1 empf Emphatic
3p2 3pl. PAM, Set2 ez Ezafe linker
3s 3sg. PAM fem Feminine
3s1 3sg. PAM, Set1 fut Future
3s2 3sg. PAM, Set2 gen Genitive
3 sf 3sg. Feminine imf Imperfect
3 sf 1 3sg. Feminine Set1 imper Imperative
3 sm 3sg. Masculine indf Indefinite
3 sm 1 3sg. Masculine Set1 inf Infinitive
abl Ablative intr Intransitive
acc Accusative irr Irrealis
acc 1 Accusative, type 1 je Polysemous adposition
acc 2 Accusative, type 2 ka Polysemous adposition
ædæ polysemous adposition ku Polysemous adposition
all Allative link General NP linker
aux Copula as Auxiliary loc Locative
be 1 ‘Be’ verb One lv Light Verb
be 2 ‘Be’ verb Two m Masculine
be 3 ‘Be’ verb Three neg Negative
be 4 ‘Be’ verb Four nve Non-verbal element in
be 5 ‘Be’ verb Five light verb construction
be dem ‘Be’ based on Demon- ob Oblique
strative formant obj Objective Case
caus Causative of Feminine Oblique
cond Conditional of 2 Feminine Oblique II
cop Copula om Masculine Oblique
op Plural Oblique prox Proximal

opb II Oblique Pronominal pt Past
Base II (Benefact.) pu Punctual
opt Optative pvb Preverb
pl Plural ra ra Objective marker
os Singular Oblique repet Repetitive particle
pam Person-Agreement Marker restr Restrictive marker
pam 1 PAM Set1 for Relative clauses
pam 2 PAM Set2 rez Reverse Ezafe
pass Passive sbj Subjunctive
pd Direct Plural sd Singular Direct
perf Perfect sdf Sg Direct Feminine
pl Plural set 1 Set of Direct PAMs
po Oblique Plural set 2 Set of Oblique PAMs
poss Possessive sg Singular
ppl Past Participle sof Sg Oblique Feminine
pr Present som Sg Oblique Masculine
pr 1 Present I stat Stative/Resultative
pr 2 Present II sub UniversalSubordinator
prog Progressive temp Temporal
proh Prohibitive unc Universal Numeral
pron Pronoun Classifier
References
(AF=) ‘Abdoli, ‘Ali. 2001/1380. Farhang-e tatbiqi-e tāleši, tāti, āzari [translation missing].
Tehran: šerkat-e sahāmi-e entešār.
‘Abdoli Koluri, ‘Ali. 1992/1371. Nazari be jāme‘e-ye ‘ašāyeri-e tāleš [A glimpse into the
tribal community of the Talesh]. Tehran:Etelā‘āt Publishers.
(AA2=) Al-e Ahmad, Jalal. 1958/1337. Tātnešinhā-ye boluk-e Zahrā [The Tat population of
Zahrā District]. Dāneš: Tehran.
(AA1=) Al-e Ahmad, Jalal. 1973/1352. Owrāzān, 4th edn. Māziār: Tehran.
Amini, Tanja J. 2003. Das Tonekābonī von Šīrūd: Materialien und Untersuchungen zur An-
näherung an einen kaspischen Dialekt. Hamburg: University of Hamburg, unpublished
MA thesis.
Amirzādeh-Vājārgāh, Asghar. 1996/1375. Ārus dāmāt səng-ə siyā bobon (Arus va dāmād
sang-e siāh šodand) [The bride and groom turned into a black stone]. Rašt:Našr-e
Gilakān.
Arkadiev, Peter M. 2008. Poor (Two-term) case systems: Limits of neutralization. In Andrej
Malchukov & Andrew Spencer (eds.), The Oxford handbook on case, 686–99. Oxford:
Asatrian, Garnik & Habib Borjian. 2005. Talish and the Talishis (The state of research).
Iran and the Caucasus 9(1). 43–72.
Bazin, Marcel. 1981. Quelques échantillons de variations dialectales du Tâleši. Studia Iran-
ica 10. 111–124, 269–277.
820 Donald L. Stilo
Borjian, Habib & Maryam Borjian. 2007. Marriage rites in South Caspian villages. Archiv
orientální, vol. 75(2), 191–214.
Borjian, Maryam & Habib Borjian. 2007. Ethno-linguistic materials from rural Mazandaran:
Mysterious memories of a woman. Iran and the Caucasus 11(2). 226–254.
Borjian, Habib & Maryam Borjian. 2008. The last Galesh herdsman: Ethno-linguistic mate-
rials from South Caspian Rainforests. Iranian Studies 41(3). 365–402.
Diessel, Holger. 2001. The ordering distribution of main and adverbial clauses: A typologi-
cal study. Language 77. 433–455.
Grjunberg, Aleksandr L. 1963. Jazyk Severoazerbajdžanskix Tatov [The language of the
northern Azerbaijanian Tat]. Moscow: Izdatel’stvo Akademii Nauk SSSR.
Guizzo, Daniele. 2004. Due dialetti Taleshi di Talesh (Hashtpar). Naples: University of
Naples dissertation.
Haig, Geoffrey. 2008. Alignment change in Iranian languages: A construction grammar
(HA=) Hāšemi, Jahāngir. 1974/1353. Guyeš-e Alviri [The Alviri dialect]. Tehran: Univer-
sity of Tehran dissertation.
Ivanow, Wladimir. 1931. The dialect of Gozärkhon in Alamut. Acta Orientalia 9. 352–369.
Jahāngiri, ’AliAsghar. 1988/1367. Kandelus. Tehran: Mo’assese-ye farhangi-e Jahāngiri.
(NJ=) Jahāngiri, Nāder. 2003. Guyeš-e Gilaki-e Lāhijān, Vol I, vižegihā-ye āvāyi va sāxt-
vāžei [The Gilaki dialect of Lāhijān, Vol. I. The phonological and morphological char-
acteristics]. Tokyo: Research Institute for Languages and Cultures of Asia and Africa.
Kalbāsi, Irān. 1997/1376. Guyeš-e Kelārdašt (Rudbārak) [The dialect of Kelārdašt (Rud-
bārak)]. Tehran: Pažuhešgāh-e ‘olum-e ensāni va motale‘āt-e farhangi.
(AAK=) Kārang, ‘Abdol-‘Ali. 1954/1333. Tāti va Harzani. Tabriz:Esma’il Va’ezpur.
Kaye, Stephen. 2013. Morphomic stems in the northern Talyshi verb: Diachrony and syn-
chrony. In Silvio Cruschina, Martin Maiden & John Charles Smith (eds.), The bound-
aries of pure morphology: Diachronic and synchronic perspectives, 181–208. Oxford:
Lambton, Anne K. S. 1938. Three Persian dialects. James G. Forlong Fund, 18, London:
Royal Asiatic Society.
(LM=) Lazard, Gilbert. 1979. Textes en tâleši de Mâsule. Studia Iranica 8(2). 33–66.
Lazard, Gilbert. 1990. Le dialecte de Rudbâr (Gilân). In Dina Amin & Manouchehr Kasheff
(eds.), Iranica Varia: Papers in honor of Professor Ehsan Yarshater, 110–123. (Acta
Iranica 30. Troisième Série, vol. XVI. Textes et Mémoires). Leiden & Boston: Brill.
Lecoq, Pierre. 1989. Les dialectes du centre de l’Iran. In Rüdiger Schmitt (ed.), Compen-
dium Linguarum Iranicarum, 313–326. Wiesbaden: Reichert.
Farzpur-Māčiāni, M. 1964, 1965. Zabān-o farhang-e Māčiān [The language and lexicon of
the Mačiān], Našriyya-ye Daneškada-ye Adabiyyāt-e Tabriz 16, 1964/1343: 277–296,
451–470; 17, 1965/1344: 109–128, 261–284.
(MT2=) Miller, Boris Vsevolodovič. 1930. Talyšskie teksty [Talyshi texts]. Moscow:
Naučno-issledovatl’skij institut nacional’nyx i e’tničeskix kul’tur narodov vostoka
SSSR.
Miller, Boris Vsevolodovič. 1953. Talyšskij Jazyk [The Talyshi language]. Akademiya
Nauk: Moscow.
(MH=) Mortazavi, Manuchehr. 1954/1333. Noktei cand az zabān-e harzani [Some points on
the Harzani language]. NDATabriz 6. 304–314.
(MH=) Mortazavi, Manuchehr. 1962/1341. Fe‘l dar zabān-e harzani [The verb in the
Harzani language]. NDATabriz 14. 453–488.
(MH=) Mortazavi, Manuchehr. 1963/1342. Fe‘l dar zabān-e harzani [The verb in the
Harzani language]. NDATabriz 15. 61–97.
Nawata, Tatsuo. 1982. The Masal dialect of Talishi. Acta Iranica 22. 93–117.
Pahlavān, Keyvān. 2003/1382. Farhang-e mardom-e Elāšt (zādgāh-e Rezā šāh) va Savād-
kuh [The culture of the people of Elāsht (the birthplace of Rezā Shāh) and Savādkuh].
Tehran: Ārvan/Xāvarzamin.
Paul, Daniel. 2011. A comparative dialectal description of Iranian Taleshi. Manchester:
University of Manchester dissertation.
Pāyandeh, Mahmud. 1973/1352. Masalhā va estelāhāt-e Gil va Deylam [Dictionary of
proverbs and expressions of Gilan and Deylam]. Tehran: Bonyād-e farhang-e Iran.
Pāyandeh, Mahmud. 1987/1366. Farhang-e Gil va Deylam, Fārsi be Gilaki [Dictionary of
Gil and Deylam, Farsi to Gilaki]. Tehran: Amirkabir.
Pirejko, L. A. 1976. Talyšsko-russkij i russko-talyšskij slovar [Talyshi-Russian, Rus-
sian-Talyshi dictionary]. Moskva: Izdatel’stvo Russkij jazyk.
(RR=) Rastorgueva, Vera S., Aza A. Kerimova, Ahmed K. Mamedzade, L. A. Pirejko &
D. I. Èdel’man. 1971. Giljanskij jazyk [The Gilaki language]. Moscow: Nauka.
Rastorgueva, Vera S. & D. I. Èdel’man. 1982. Giljankskij, mazanderanskij (s dialektami
šamerzadi i velatru) [Gilaki, Mazanderani (including the dialects of Shahmirzad and
Velatru)]. In Vera S. Rastorgueva (ed.), Osnovy iranskogo jazykoznanija, Novoiranskie
jazyki:Zapadnaja gruppa, Prikaspijskie jazyki, 447–554. Moscow: Nauka.
Redard, Georges. 1970. Other Iranian languages. In Thomas Sebeok (ed.), Current trends in
linguistics, vol. 6, Linguistics in South West Asia and North Africa, 97–135. The Hague
& Paris: Mouton.
(SH=) Shokri, Guiti. 1995/1374. Guyeš-e Sāri (Māzanderāni) [The dialect of Sāri
(Māzanderāni)]. Tehran: Pažuhešgāh-e ‘olum-e ensāni va motale‘āt-e farhangi.
(SR=) Shokri, Guiti. 2006/1385. Guyeš-e Rāmsari [The Rāsari dialect]. Tehran: Pažu-
hešgāh-e ‘olum-e ensāni va motale‘āt-e farhangi.
Sokolova, Vera. 1953. Očerki po fonetike iranskix jazykov [Sketches of Iranian Phonetics].
Moscow: Izdatl’stvo Akademii Nauk.
Stassen, Leon. 2005. Predicative possession. In Martin Haspelmath, Matthew S. Dryer,
David Gil & Bernard Comrie (eds.), The world atlas of language structures. Oxford:
Stilo, Donald L. 1981. The Tati language group in the sociolinguistic context of northwest-
ern Iran and Transcaucasia. Iranian Studies 14. 137–187.
Stilo, Donald L. 1994. Phonological systems in contact in Iran and Transcaucasia. In Mehdi
Marashi (ed.), Persian studies in North America: studies in honor of Mohammad Ali
Jazayery, 75–94. Bethesda, Maryland: Iranbooks.
Stilo, Donald L. 2001. Gilan, x. Languages. In Ehsan Yarshater (ed.), Encyclopædia Iranica
10.6, 660–668. New York: Center for Iranian Studies, Columbia University.
(VF=) Stilo, Donald L. 2004a. Vafsi folk tales. Wiesbaden: Reichert.
Stilo, Donald L. 2004b. Coordination in three Western Iranian languages: Vafsi, Persian
and Gilaki. In Martin Haspelmath (ed.), Coordination, 269–330. Amsterdam & Phila-
delphia: Benjamins.
Stilo, Donald L. 2005. Iranian as buffer zone between the universal typologies of Turkic
and Semitic. In Éva Ágnes Csató, Bo Isaksson & Carina Jahani (eds.), Linguistic con-
822 Donald L. Stilo
vergence and areal diffusion: Case studies from Iranian, Semitic and Turkic, 35–63.
London & New York: Routledge Curzon.
Stilo, Donald L. 2007a. Isfahan, xxi. provincial dialects. In E. Yarshater (ed.), Encyclopædia
Iranica 14.1, 93–112. New York: Center for Iranian Studies, Columbia University.
Stilo, Donald L. 2007b. Isfahan, xix. Jewish dialect. In E. Yarshater (ed.), Encyclopædia
Stilo, Donald L. 2007c. Isfahan, xxii. Gazi dialect. In E. Yarshater (ed.), Encyclopædia
Stilo, Donald L. 2008a. Two sets of mobile verbal person agreement markers in the North-
ern Talyshi language. In Simin Karimi, Vida Samiian & Donald Stilo (eds.), Aspects
of Iranian linguistics: Papers in honor of Mohammad Reza Bateni, 363–390. Newcas-
tle-upon-Tyne: Cambridge Scholars Press.
Stilo, Donald L. 2008b. Case in Iranian: From reduction and loss to innovation and re-
newal. In Andrej Malchukov & Andrew Spencer (eds.), The Oxford handbook on case,
363–390. Oxford: Oxford University Press.
Stilo, Donald L. 2009. Circumpositions as an areal response: The case study of the Iranian
zone. In Lars Johanson (ed.), Turkic Languages, 3–33. Wiesbaden: Harrassowitz.
Stilo, Donald L. 2010. Ditransitive constructions in Vafsi: A corpus-based study. In An-
drej Malchukov, Martin Haspelmath & Bernard Comrie (eds.), Studies in ditransitive
constructions: A comparative handbook, 243–76. Berlin & New York: De Gruyter
Mouton.
Stilo, Donald L. 2012. Intersection zones, overlapping isoglosses, and ‘fade- out/fade-in’
phenomena in central Iran. In Behrad Aghaei & Mohammad R. Ghanoonparvar (eds.),
Iranian languages and culture: Essays in honor of Gernot Ludwig Windfuhr, 3–33.
Costa Mesa: Mazda.
Stilo, Donald L. 2014. Further Notes on the Iranian Substratum of Azerbaijani Turkish. In
Heidi Stein (ed.), Turkic Language in Iran – Past and Present, 271–293. Wiesbaden:
Harrassowitz.
Stilo, Donald L. 2015. The polygenetic origins of the Talyshi language. In Uwe Bläsing,
Victoria Arakelova & Matthias Weinreich (eds.), Studies on Iran and the Caucasus: In
honour of Garnik Asatrian, 411–453. Leiden & Boston: Brill.
Stilo, Donald L. 2016. On the Non-Persian Iranian Substratum of Azerbaijan. In Éva Á.
Csató, Lars Johanson, András Róna-Tas & Bo Utas (eds.), Turks and Iranians: Interac-
tions in Language and History, 185–234. Wiesbaden: Harrassowitz.
Stilo, Donald L. 2018a. Numeral classifier systems in the Araxes Iran linguistic area. In
William B. McGregor & Søren Wichmann (eds.), The diachrony of classification sys-
tems. Current Issues in Linguistic Theory. Amsterdam: John Benjamins.
Stilo, Donald L. 2018b. Dikin Marāqei Tati of Alamut: an undocumented conservative Tati
language. In Salome Gholami (ed.), Endangered Iranian languages.
Stilo, Donald L. forthcoming a. Mazanderani. Encyclopædia Iranica. New York: Center for
Iranian Studies, Columbia University.
Stilo, Donald L. forthcoming b. Loss vs. expansion of gender in Tatic languages: Kafteji
(Kabatei) and Kelasi.
Stilo, Donald L. to appear. A grammar of Vafsi.
Tāhbāz, Sirus. 1963/1342. Yuš. Dāneškadaye Adabiyyāt, Entešārāte Mo’assesaye Moṭāle’āt
va Taḥqiqāte Ejtemā’i, No. 9, Daftarhāye Monogrāfi 2, University of Tehran.
Windfuhr, Gernot L. 1987. Convergence: Iranian Talyshi and Turkic Azari. In Elena Bashir,
Madhav M. Deshpande & Peter E. Hook (eds.), Selected papers from SALA-7. South
Asian Roundtable Conference, Bloomington, Indiana, 385–405.
Windfuhr, Gernot L. 1989. New West Iranian. In Rüdiger Schmitt (ed.), Compendium Lin-
guarum Iranicarum, 251–62. Wiesbaden: Reichert.
Windfuhr, Gernot L. 1992. Cases. In E. Yarshater (ed.), Encyclopædia Iranica, 5.1, 25–37.
New York: Center for Iranian Studies, Columbia University.
Yarshater, Ehsan. 1959. The Dialect of Shāhrud (Khalkhāl). BSOAS 22. 52–68.
(KJ=) Yarshater, Ehsan. 1960. The Tāti Dialect of Kajal. BSOAS 23. 275–286.
(YA=) Yarshater, Ehsan. 1964. The dialects of Alvir and Vidar. In Georges Redard (ed.),
Indo-Iranica: Mélanges présentés à Georg Morgenstierne, 177–187. Wiesbaden: Har-
rassowitz.
(ST=) Yarshater, Ehsan. 1969. A grammar of southern Tati dialects. The Hague & Paris:
Mouton.
(YT=) Yarshater, Ehsan. 1970. The Tati dialect of Ṭārom. In Mary Boyce & Ilya Gershe-
vitch (eds.), W. B. Henning memorial volume, 451–467. London: Lund Humphries.
(ASA=) Yarshater, Ehsan. 1996. The Tāleshi of Asālem. Studia Iranica 25(1). 83–113.
(YX=) Yarshater, Ehsan. 2003. The Xo’ini Dialect. Persica XIX, Louvain: Peeters. 165–
182.
(KL=) Yarshater, Ehsan. 2005. Tāti Dialect of Kalāsuri. In Dieter Weber (ed.), Languages of
Iran: Past and present. Iranica 8, 269–284. Wiesbaden: Harrassowitz.
Yarshater, Ehsan. to appear b. Karingāni Dialect. (February 18, 2003 draft).
Yarshater, Ehsan. to appear c. Karani Dialect. (May 18, 2003 draft).
(SY=) Yoshie, Satako. 1996. The Sāri dialect. Tokyo: Institute for Languages and Cultures
of Asia and Africa.
Zoka, Yahyā. 1954/1332. Guyeš-e Keringān [The dialect of Keringān]. Tehran: Daneš.
Zoka, Yahyā. 1957/1336. Guyeš-e Galinqaya ya Harzandi [The dialect of Galinqaya]. Far-
hang-e Irānzamin 5, 51–92. Tehran.
824 Donald L. Stilo
In addition to the above sources Dr. Ehsan Yarshater generously provided the author
access to his handwritten field notes on Anbarāni, Dizi, Gandomābi, Jowkandāni,
Karani, Kalāsuri, Karani, Karnaqi, Khoini, Khoynarudi, Lerdi, Nowkiāni, Qal’ei,
Razajerdi, Sayyādlari, Viznei, (Northern Talyshi, Iran), Asālemi, Koluri, Kajali,
Gandomābi, Nowkiāni, Karani Gandomābi, Karnaqi, Dizi for the preparation of
this article.
Stilo field notes: Āmoli, Asālemi, Bābolsari, Charozh, Dikin-Marāqei, Dikin-Pa-

shei, Ezbaram, Gurāni, Jubani, Kafteji, Kelāsi, Khatirābādi, Koluri, Lāhijāni,
Langerudi, Lenkoran dialect, Marāqei, Māsāli, Māsulei, Northern and Southern
Talyshi (Stilo: in press), Northern Talyshi, Lerik, Oskulak Talyshi, Tutkāboni/Tut-
kāboni, Rāmsari, Rashti, Razajerdi, Rostamābādi, Rudbāri, Tonekāboni (unless
otherwise marked), Tutkāboni, Urei, Vafsi, Xatirābādi.
6. Caucasian rim and southern Black Sea coast
6.1. Caucasian rim and southern Black Sea coast:
overview
This section covers three languages: Ossetic (East Iranian, Indo-European),

Romeyka (Hellenic, Indo-European), and Laz (Georgian-Zan, Kartvelian). All
three are spoken in enclaves, separated from their closest genetic relatives and
surrounded by unrelated languages. Geographically, Laz and Romeyka are spoken
in the mountainous Black Sea coastal region in today’s northeastern Turkey. Laz
and Romeyka may have had some mutual influence on each other, and both show
evidence of a shared superstrate influence from Turkish. They also share a broadly
comparable sociolinguistic context: both are endangered minority languages,
spoken by Muslim communities within the Turkish state, with virtually no institu-
tional support or official status. Ossetic, on the other hand, is a recognized regional
language spoken in a politically disputed region of the the Caucasus. In recent
times, its main contact language is Georgian (Kartvelian), but it bears traces of
influence from all three indigenous language families of the Caucasus, as well as
from Turkic (Erschler 2009: 422, and this volume, chapter 6.3, §6). It has some
status as a regional standard, with books and periodicals in an established written
form, and in South Ossetia at least, restricted representation in education as well.
However, for all three languages, the general pattern is that very few children are
acquiring the languages, so their long term prospects of survival are fragile.
In the areal context of this volume, these three are perhaps best characterized
ex negativo as being outside of the “Mesopotamianˮ region (Haig 2017b, Haig and
Khan, this volume, chapter 1), to which e. g. the Semitic and Iranian languages of
northern Iraq, and eastern Anatolia belong (see this volume, sections 2 and 3). This
is perhaps most obviously the case with Ossetic and Laz, which lack many of the
features that typify the languages of the Mesopotamian region (for example, both
lack prepositions, nominal gender, post-nominal position of adjectives and posses-
sors, and both have at least five case marking suffixes, compared to the minimal
presence of suffixal case marking in the Semitic and Iranian languages of Mesopo-
tamia). Romeyka, on the other hand, shares certain features, such as prepositions
and nominal gender, with the languages of the Mesopotamian region, though other
aspects of its morphosyntax have converged with local varieties of Turkish.
In phonology, Ossetic and Laz both exhibit a signature feature of the Caucasus,
namely an additional series of voiceless obstruents, variously referred to as ejective,
or glottalized. In Laz, glottalization is a feature inherited from Kartvelian, though
Lacroix (chapter 6.2, §2.1) notes that glottalization in Laz is “not as strongˮ as in

Standard Georgian. Ossetic on the other hand, has developed glottalized obstru-
ents through language contact, rather than inheritance. There are obvious parallels
here to the spread of pharyngealization from Semitic into neighbouring Iranian
languages; both cases involve what is generally considered a phonetically marked
feature, yet which is apparently readily borrowed (see Blevins (2017: 96–109) for
discussion of comparable cases in terms of the “perceptual magnet effectˮ).
Adposition ordering is a feature that has been shown to be sensitive to areal
influence (Stilo 2005; Haig 2017), though it changes slowly, i. e. it requires pro-
longed and intensive contact before an inherited adpositional type will adapt
wholesale to a neighbouring, distinct, adpositional type. For the languages to the
north and east of our region, we would predict postpositions, and both Laz and
Ossetic conform to these expectations. Romeyka on the other hand retains its
inherited prepositions.
With regard to alignment in case marking and agreement, no clear overall
trend can be discerned. Romeyka is straightforwardly accusative in both its verbal
agreement and nominal case marking system, the only complication being the
typologically unusual (and not fully consistent) overt marking of masculine singu-
lar subjects (regardless of transitivity) through a ‘nominative’ case marker (cf. o
komši-s ‘the neighbour-nom ’, chapter 6.4, Table 3). Laz on the other hand reflects
the well-known complexities of its Kartvelian relatives in case marking and agree-
ment. Subjects may be in the absolutive, ergative, or dative case, depending on
verbal semantics and transitivity, but also on the tense specification of the verb.
While case marking of nominal arguments could be considered to be ergative in
alignment, the extensive use of dative subjects with, e. g. experiencers, or with
verbs with a potential or abilitative derivation, is not really accounted for within
conventional alignment typology. Laz also exhibits the property known as ‘inver-
sion’ in Kartvelian linguistics, whereby the two sets of verbal agreement markers,
Set I and Set II, which are respectively controlled by one argument type, swap
their controllers, depending e. g. on the tense of the verb. For example, with the
present tense verb in (1a), the subject determines agreement by Set I (the suffix -s),
while the object triggers agreement by Set II (m-); in the perfect tense verb in (1b),
by contrast, the subject triggers agreement by Set II features (-u-), while the object
triggers agreement by Set I (b-) (example provided by René Lacroix):
(1) a. m-dzi-om-s b. b-u-dzi-u
ii 1-see-th - i 3 sg i 1- ii 3. perf -see-th
‘he sees me’ ‘he has seen me’
Superficially, this looks rather similar to the tense-sensitive shift in agreement pat-
terns found in both Northeastern Neo-Aramaic (Khan, this volume, chapter 3.4),
and Central Kurdish (Haig, this volume, chapter 3.3), and the similarities have
been noted in the literature (Coghill 2016). For example in transitive clauses
Caucasian rim and southern Black Sea coast: overview 827
in Central Kurdish in the present tense (2a), the subject and object are indexed
through a verbal suffix and a pronominal clitic respectively (glossed in (2b) as
Set I and Set II), while in the past tense (3b), this is reversed:
(2) Central Kurdish
a. nā-yān-bīn-īn b. na-mān-dī-n
neg -3 pl (II)-see. prs -1 pl (I) neg -1 pl (II)-see.pst -3 pl (I)
‘We don’t see them’ ‘We didn’t see them’
However, while a contact scenario for the Kurdish and Neo-Aramaic similarities
appears to be plausible (in view of the close stuctural parallels, and our knowledge
of the histories of the respective speech communities), to what extent contact can
be invoked in the Kartvelian developments remains conjectural. Ossetic, on the
other hand, has lost all trace the ergativity that is presumed to have characterized
middle Iranian (Haig 2017a). The rich case-marking system is entirely accusative,
and is not sensitive to verbal tense or aspect. The Ossetic case system has been
claimed to reflect “Caucasian” influence, but Erschler (2009) points to the diffi-
culty of identifying the source language. However, Erschler (2009) does identify
West Caucasian influence in the development of possessive marking in Ossetic.
Likewise, influence from Kartvelian, or from Slavic, or from both, has been sug-
gested for the system of aspectual preverbs in Ossetic (see also chapter 6.3, §6).
Turning to the traditional features of word order typology, we find that all
languages have pre-nominal genitives and adjectives, in line with Turkic and the
languages of the Caspian region (Stilo, this volume, chapter 5). In this respect,
the languages of this section are distinct from all other Iranian languages consid-
ered in this volume, and from the Semitic languages. With regard to the order of
verb and direct object, both Laz and Ossetic are fairly consistently OV, while the
status of Romeyka in this respect remains somewhat unclear. It seems reasonable
to assume that Romeyka inherited VO word order, but under Turkish influence
may now permit OV, though the conditions for this, and its dialectal distribution
within Romeyka itself, remain to be established. Where we do find consistency,
however, is in the placement of the copula, which in all three languages is clause
final, at least as the unmarked option. Overt, clause-final copulas is a feature that
is common to the entirety of East Anatolia (Haig 2017b), and even extends to
those Arabic dialects of northern Iraq spoken “east of the Tigrisˮ (Prochàzka, this
volume, chapter 3.2, §2.4.1). Position of the copula is not a feature that is consid-
ered in traditional word order typology, but from an areal linguistic perspective, it
may be more sensitive to contact influence than, for example, the order of object
and verb.
One of the striking features of the Iranian and Semitic languages of the Meso-
potamian region is the predominance of finite verb forms in dependent clauses. In
the extreme case, languages of Mesopotamia have virtually no non-finite syntax,
so that even modals such as ‘be able’, ‘want’, or ‘need’, or same-subject phasal
verbs such as ‘begin’ or ‘finish’, require a finite dependent clauses (see Haig and
Khan, this volume, chapter 1). Although the languages of the Caucasian rim do
make extensive use of finite forms in dependent clauses, there is a greater tendency
towards non-finite verb forms. For example in Ossetic, “the infinitive in -ən […] is
used with phasal, modal, emotive, mental, causation, speech and evaluative pred-
icates ˮ (Serdobolskaya 2016: 304). Laz also uses non-finite forms (verbal nouns,
chapter 6.2, §4.6) for the verbs ‘forget’ and ‘want’. Similarly, in Romeyka non-fi-
nite forms are found with certain types of dependent clauses (chapter 6.4, §6.5.2).
Thus the broad generalization is that the strong tendency towards finiteness in
dependent clauses of all types is significantly weaker outside of the Mesopotamian
region.
Finally, we consider a type of dependent clause known to be sensitive to areal
influence, namely relative clauses. The general pattern for all the languages in
the other sections of this volume is for post-nominal, finite relative clauses, at
least for non-subject relativization. This holds even in the Turkic languages of
western Iran and northern Iraq (Bulut, this volume, chapters 3.5 and 4.2), even
though standard Turkish has pre-nominal, participial relative clauses. But in the
languages of the Caucasian rim, the dominance of post-nominal relative clauses
is absent. Ossetic uses a correlative strategy (chapter 6.3, §5.4), which involves
a head noun internal to the relative clause, but additionally represented by some
pronominal element external to the relative clause. In Laz, the dominant strategy is
prenominal (chapter 6.2, §6), and in Romeyka, both prenominal and postnominal
relative clauses appear to be possible (chapter 6.4, §6.5.1). While all three lan-
guages evidently differ in detail in their relativization strategies, common to all is
the possibility of placing the relative clause before the head noun, and in this point,
they clearly differ from the languages south of the Caucasian Rim, where post-
nominal relativization is preferred, and in several languages represents the sole
option.
References
Blevins, Juliette. 2017. Areal sound patterns: From perceptual magnets to stone soup. In
Raymond Hickey (ed.), The Cambridge Handbook of Areal Linguistics, 88–121. Cam-
Coghill, Eleanor. 2016. The Rise and Fall of Ergativity in Aramaic. Oxford: Oxford Uni-
versity Press
Erschler, David. 2009. Possession marking in Ossetic: Arguing for Caucasian influences.
Linguistic Typology 13(3). 417–450.
Haig, Geoffrey. 2017a. Deconstructing Iranian ergativity. In Jessica Coon, Diane Massam
& Lisa Travis (eds.), The Oxford Handbook of Ergativity, 465–500. Oxford: Oxford
University Press.
Haig, Geoffrey. 2017b. Western Asia: East Anatolia as a transition zone. In Raymond
Caucasian rim and southern Black Sea coast: overview 829
Hickey (ed.), The Cambridge Handbook of Areal Linguistics, 396–423. Cambridge:

Serdobolskaya, Natalia. 2016. Semantics of complementation in Ossetic. In Kaspar Boye
& Peter Kehayov (eds.), Complementizer semantics in European languages, 293–339.
Berlin: De Gruyter.
Stilo, Donald. 2005. Iranian as buffer zone between the universal typologies of Turkic and
Semitic. In Éva Ágnes Csató, Bo Isaksson & Carina Jahani (eds.), Linguistic Con-
vergence and Areal Diffusion: Case Studies from Iranian, Semitic and Turkic, 35–63.
London: Routledge.
6.2. Laz
René Lacroix
1. Introduction
Laz belongs to the South Caucasian language family, also known as Kartvelian.
The other three members are Mingrelian, Georgian and Svan. The genetic rela-
tionships between these languages are displayed in Figure 1. Laz and Mingrelian,
which are closely related, were once considered as two varieties of the same lan-
guage, called Zan.
Common Kartvelian
Georgian-Zan
Zan
Svan Georgian Laz Mingrelian
Figure 1: Genetic relationships between the South Caucasian languages
The Laz live predominantly in the mountainous region of northeastern Turkey,

on a strip of land along the shore of the Black Sea, and in Georgia, mainly in the
village of Sarp, which is split by the border between Turkey and Georgia. There
are also Laz villages to the East of the Sea of Marmara in Western Turkey; they
emigrated there from the end of the 19th century onwards.
The number of speakers is not known. According to Andrews (1989: 176), in
the 1965 Turkish census, which is the last official statistic, 26,007 people declared
Laz as their mother tongue and 59,101 as their second language. On the other hand,
Feurstein (1983) estimates the number of speakers at 250,000. Between 2000 and
5000 Laz live in Georgia (Holisky 1991: 397; Kutscher 2008).
The Laz population of Turkey was formerly Christian but converted to Islam in
the 17th century (Toumarkine 1995: 48). The Laz are primarily agriculturists. Tea
cultivation was introduced in the region in the 20th century and holds a prominent
place in the local economy.

Laz
Figure 2: Map of main Laz-speaking locations

831
832 René Lacroix
In Turkey, virtually all speakers are bilingual with Turkish. Nowadays, almost
all children acquire at best a passive knowledge of the language.1 Only among
older adults does one frequently find fluent speakers. Code-switching with Turkish
is widespread, especially among men. Even the speech of Laz who do not code-
switch is interspersed with Turkish elements. Laz is thus a highly endangered lan-
guage.
Laz was never used in school. In 2012, however, the Turkish government made
the decision to offer elective Laz lessons at school, provided enough pupils ask
for it.
Laz is not a written language. However, magazines written partly in Laz have
been published for many years. Furthermore, the recent efforts of some Laz has led
to the publication of novels in this language.
Laz has been partly described and is documented by several collections of
texts (see references in Lacroix 2009a: section 1.2). In addition, a documentation
project funded by HRELP (London) has been undertaken by the author.2 Laz is
divided into three main dialectal areas (see map). Each area is in turn divided
into a number of subdialects. Mutual intelligibility between certain dialects is not
straightforward, but it is not problematic between subdialects of the same area.
This paper describes the Arhavi dialect.
The following sections give basic information on the phonology (§2), the noun
phrase (§3), the verb (§4), simple clauses (§5) and complex clauses (§6). As it
proved impossible to give even the most elementary information on many aspects
of Laz in so few pages, most of the topics touched upon here where chosen either
because of their typological or areal bearing.
2. Phonology
2.1. Vowels and consonants

Laz has five vowels: /a/, /e/, /i/, /o/ and /u/. Vowels /i/, /e/ and /a/ have no clearly
perceptible allophonic variants. /o/ and /u/ have fronted allophones when preceded
by the palatal consonant /j/ in the verbal prefixes gj- and j-. There are no diph-
thongs.
The consonantal system is represented in Table 1.
1
During his one-year stay in the Laz community in 2011, the author met children who
were able to speak Laz in the village of Dikkaya (North-East of Turkey) and teenag-
ers in the village of Kabalak (West of Turkey). İrfan Çağatay (September 2012, p.c.)
reports that he heard children from the village of Topluca (North-East of Turkey) speak
Laz. These are exceptional cases.
2
https://elar.soas.ac.uk/Collection/MPI546814 (accessed 20 November 2017).
Laz 833
Table 1: Arhavi Laz consonants

labial dental or post-alveol. velar or glottal
alveolar or palatal uvular
occlusive voiceless non-glott. p t k
voiceless glottalized p’ t’ k’
voiced b d g
affricate voiceless non-glott. ts tʃ
voiceless glottalized ts’ tʃ’
voiced dz dʒ
fricative voiceless s ʃ x h
voiced v z ʒ ɣ
nasal m n
lateral l
approximant ɹ j
Although the consonants with an apostrophe have been classified as glottalized,

further research needs to be done to assess the exact nature of these consonants.
Glottalization is not as strong in Laz as it is in standard Georgian. When two or
more glottalized consonants follow each other, glottalization is indicated only on
the last one. Thus, metk’otʃi ‘throw it!’ holds for met’k’otʃi. Voiceless non-glottal-
ized stops /p/, /t/ and /k/ are aspirated. In addition to the consonants in Table 1, the
phoneme /f/ is found in Turkish borrowings (e. g. tufeɣi ‘gun’, from Turkish tüfek).
Laz allows up to four consonants in syllable onsets, as can be seen in the fol-
lowing examples:
Two consonants: skiri ‘child’, ktʃini ‘old woman’
Three consonants: tʃxvapa ‘hot’
Four consonants: mskva ‘beautiful’
2.2. Stress
Words other than finite verb forms are normally stressed on the penultimate sylla-
ble: bózo ‘girl’, k’ap’úla ‘back’, okosále ‘broom’, ok’átʃxe ‘then’, héya ‘this one’.
The position of the stress shifts when the word is inflected: bozó-ʃi ‘girl-gen ’,
bozo-pé-ʃi ‘girl-pl - gen ’. Certain final vowels may drop as a result of free varia-
tion, which leads to stress falling on the last syllable: dal-épe ~ dal-ép ‘sisters’.
Stress assignment rules in finite verb forms depend on morphological and
lexical properties. The details will not be given here. Suffice it to say that stress
may fall on any syllable except the final one. Some examples are given in (1–3).
834 René Lacroix
(1) b-i-gzál-are
i 1- mid -leave- fut . i 1/2 sg
‘I’ll leave’
(2) b-o-p’aamit-áp-am
i 1- caus -talk-caus - th
‘I have him talk’
(3) kó-dol-i-kun-es-doe
pv - pv - mid -put_on-aor . i 3. pl - evd
‘they put it on’
Negation is marked by va(r), which precedes the verb immediately and forms one
accentual unit with it:
(4) vá ge-b-ul-u
neg pv - i 1-go_down-th
‘I don’t go down’
3. The noun phrase
3.1. Nominal suffixes and clitics

This section presents the formation of the plural, the possessive determiners, the
cases and the “additive” suffix -ti. These appear to the right of the head of the NP,
in the following order: plural-possessive-case-additive (5).
(5) dʒumal-epe-tʃkimi-ʃi-ti
brother-pl - poss 1 sg - gen - add
‘my brothers’ too’
The plural is formed with the suffixes -pe and -epe, depending on phonological
and lexical properties of the base: nek’na ‘door’, nek’na-pe ‘doors’; pudʒi ‘cow’,
pudʒ-epe ‘cows’. The base of certain nouns ending in /a/ is expanded with -l in
the plural: kva ‘stone’, kval-epe ‘stones’; dʒuma ‘brother’, dʒumal-epe ‘brothers’;
oxordʒa ‘woman’, oxordʒal-epe ‘women’.
The plural suffixes have both affixal and clitical characteristics. As far as
affixal characteristics are concerned, the plural suffixes belong to the domain of
the stress-assignment rule, as already illustrated above: bózo ‘girl’, bozó-pe ‘girl-
pl ’. Furthermore, the choice between -pe and -epe depends in part on lexical infor-
mation, and thus cannot be accounted for entirely in phonological terms. Finally,
the plural triggers allomorphic variation of the base, as it selects the base expanded
with -l, if any.
As far as clitical characteristics are concerned, the plural suffixes can attach
Laz 835
not only to nouns, but also to adjectives, postpositions (6) and finite verb forms
(7). The latter situation occurs in free (headless) relative clauses (see §6.1).
(6) tʃkimi ster-epe i-ster-t’es
1 sg . gen like-pl mid -play-impft . i 3. pl
‘The ones like me [i. e. the children my age] used to play.’ (own data)
(7) si-na tʃk’om-i-pe-k
2 sg - sub eat-aor - pl - erg
‘those you ate’ (K72.137)
Table 2 gives the possessive determiners. Here as elsewhere in Laz, there is no
grammaticalized gender distinction. Three examples for possessive determiners
are provided in (8); note that they do not carry stress.
Table 2: Possessive determiners

1sg -tʃkimi
2sg -skani
3sg -muʃi
1pl -tʃkuni
2pl -tkvani
3pl -mutepeʃi
(8) bée-tʃkuni oxórdʒa-tʃkimi-k ts’ip’il-épe-muʃi-s

child-poss 1 pl woman-poss 1 sg - erg young-pl - poss 3 sg - dat
Table 3 shows the seven cases of Arhavi Laz.
Table 3: Cases
absolutive −
ergative -k
dative -s
genitive -ʃi
allative -ʃe ~ -ʃa
ablative -ʃen
instrumental -ten
Cases have both affixal and clitical characteristics. As regards affixal characteris-
tics, cases belong to the domain of stress assignment, as already illustrated above:
bózo ‘girl’, bozó-ʃi ‘girl-gen ’. On the morphological level, the genitive, allative
and ablative cases trigger a special form of the base of 1st and 2nd person pronouns
(see below).
As regards clitical characteristics, cases are generally not repeated in conjunc-
tions:
836 René Lacroix
(9) bozo do bitʃ’i-s

girl and boy-dat
‘for the girl and the boy’ (K72.139)
Furthermore, cases can attach to different types of words. In headless relative
clauses, for instance, they attach to finite verb forms (10).
(10) ordo-na e-j-sel-asen-s gza ordo
early-sub pv - mid -stand_up-fut . i 3 sg - dat path early
‘The path of that who stands up early ends early.’ (own data)
The last element to be considered here is -ti, glossed add for “additive”, which has
a range of uses. In particular, -ti is used with the meaning ‘and, too’ (11).
(11) hemu-s udʒi-ti k-u-ɣ-u-n
dist . dem - dat ear-add pv - ii 3. appl -have-th - i 3 sg
toli-ti k-u-ɣ-u-n xe-ti k-u-ɣ-u-n

eye-add pv - ii 3. appl -have-th - i 3 sg hand-add pv - ii 3. appl -have-th - i 3 sg
‘She has ears, and eyes, and hands too.’ (K93.130)
Second, the suffix -ti is used to switch from one topic to another, especially in
dialogues (12).
(12) bere-k ma-ja haʃo
child-erg 1 sg -rs thus
haʃo m-a-ɣod-u-ja u-ts’-u
thus ii 1- mid . appl -happen-aor . i 3 sg - rs ii 3. appl -tell-aor . i 3 sg
‘The young man told him: ‘Such and such things happened to me’.
he k’otʃi-k-ti si-na g-a-ɣod-u steri
dist . dem man-erg -add 2 sg - sub ii 2- mid . appl -happen-aor .i 3 sg like
ma-ti heʃo m-a-ɣod-u
1 sg - add thus ii 1- mid . appl -happen-aor . i 3 sg
The man answered: ‘What happened to you happened to me too.
si do ma
2 sg and 1 sg
dʒumal-epe do-b-iv-a-t-ja u-ts’-u he
brother- pl pv - i 1-become-opt -1/2 pl - rs ii 3. appl -tell-i 3 sg dist . dem
k’otʃi-k
man-erg
You and me, let’s become brothers!’
Laz 837
ha bere-k-ti k’aj-a
prox . dem child-erg -add good-rs
The young man said: ‘Good’.’ (Ž.95)
Here again, the element at stake has both affixal and clitical characteristics. As
far as affixal characteristics are concerned, -ti belongs to the domain on which the
stress-assigning rule obtains: túti-k ‘bear-erg ’, tutí-k-ti ‘bear-erg - add ’, tut-epé-
k-ti ‘bear-pl - erg - add ’. On the morphological level, -ti triggers a special singular
absolutive form of the demonstrative pronouns, as shown in Table 4.
Table 4: Partial paradigm of the demonstrative pronouns with and without -ti
proximal demonstrative pronoun distal demonstrative pronoun
singular without -ti with -ti without -ti with -ti
absolutive haja ha-ti heja he-ti
ergative hamu-k hamu-k-ti hemu-k hemu-k-ti
dative hamu-s hamu-s-ti hemu-s hemu-s-ti
As far as clitical characteristics are concerned, -ti can be attached to words of

different categories: nouns, pronouns, postpositions, negation var (see (43) below)
and conditional clause subordinator na.
Several other languages of the area (Turkish, Zazaki and Kurmanjî Kurdish)
have an enclitic particle akin to Laz -ti used as a topic-switch marker (Haig
2001: 207), and similar items are also found in the languages of northern Iraq,
for example North Eastern Neo-Aramaic (Khan, this volume, chapter 3.4) and
Kurdish (Haig, this volume, chapter 3.3).
3.2. Pronouns, demonstratives and interrogatives

Table 5 gives the paradigm of 1st and 2nd person pronouns, which involves some
variation. 3rd person pronouns are identical to demonstratives, and are presented
further below. As can be seen, 1st and 2nd person pronouns have the same form
in the three cases indicating core syntactic roles: absolutive, ergative and dative.
Their alignment is thus neutral (A=O=S).
838 René Lacroix
Table 5: Declension of 1st and 2nd person pronouns

1sg 2sg 1pl 2pl
absolutive
ergative ma ~ man si ~ sin tʃku ~ tʃkun tkva ~ tkvan
dative
genitive tʃkimi skani tʃkuni tkvani
allative tʃkim-de ~ skan-de ~ tʃkun-de ~ tkvan-de ~
tʃkim-da skan-da tʃkun-da tkvan-da
ablative tʃkim-den ~ skan-den ~ tʃkun-den ~ tkvan-den ~
tʃkimi-ʃen skani-ʃen tʃkuni-ʃen tkvani-ʃen
Laz has proximal and distal deictics. Only the demonstrative pronouns and deter-
miners are considered here. The demonstrative pronouns have two bases: one in
the absolutive (proximal haja, distal heja), the other in all remaining cases (proxi-
mal hamu-, distal hemu-). The demonstrative pronouns take the same case suffixes
as nouns (see the partial paradigms in Table 4). Proximal and distal demonstratives
are used as 3rd person pronouns, as can be seen in examples (13) and (14), respec-
tively.
(13) ma-ja hamu-s a muntxa b-u-ts’v-a-ja
1 sg - rs prox . dem - dat one something i 1- ii 3. appl -tell-opt - rs
[A man, a jackal and a snake are talking to each other. Referring to the
man, the jackal says to the snake:] ‘I’ll say something to him.’ (own data)
(14) hemtepe k’ala avi-ʃe b-id-i ʃavʃati-ʃe
dist . dem . pl with hunting-all i 1-go-aor Şavşat-all
‘[Last year, six men came, six Europeans.] I went hunting with them in
Şavşat.’ (own data)
The demonstrative determiners are ha ~ ham (proximal) and he ~ hem (distal).
They do not inflect. Examples can be found in (12) and (54).
Table 6 gives a list of interrogative proforms.
Table 6: Interrogative proforms

mi ‘who?’
mu ‘what?’
so ‘where?’ (location and direction)
solen ~ sole ‘from where? through where?’
mudes ~ mundes ‘when?’
mutʃ’o ‘how?’
nak’o ‘how much? how many?’
Laz 839
Mi ‘who’ and mu ‘what’ decline as nouns and can be pluralized (mipe, mupe).
Interrogative proforms generally occur in focus position, i. e. in front of the verb
(see §5.1), as in example (15).
(15) ma mundes b-ɣur-are
1 sg when i 1-die-fut . i 1/2 sg
‘When will I die?’ (D67.XXVIII)
3.3. Adjective
Laz adjectives do not inflect. They occur on the left of the noun they determine
(see §5.1). Some underived adjectives are given in Table 7.
Table 7: Sample of underived adjectives

didi ‘big, great’ subuki ‘light (of weight)’
tʃ’ut’a ‘small, little, tiny’ mtʃ’ipe ‘slender, thin’
t’abala ‘flat’ mtʃxu ‘thick’
mskva ‘beautiful, handsome’ gundze ‘long’
k’ai ‘good’ mk’ule ‘short’
p’at’i ‘bad, ugly’ aɣne ‘new, young’
utʃa ‘black’ mdʒveʃi ‘old’
ktʃe ‘white’ t’utsa ‘hot’
mtʃ’ita ‘red’ mendra ‘far away’
monk’a ‘heavy’
3.4. Echoic
Laz has a productive derivational mechanism which consists of reduplicating a
word and modifying the reduplicant according to the following rule: when the
word begins with a vowel, /m/ is added in front of it (16); when it begins with one
or several consonants, these are replaced by /m/ (17). The resulting compound
means ‘x and other things related to x’ (18).
(16) ejer ‘saddle’ (Turkish loan) → ejer-mejer
optʃk’omi ‘I ate’ → optʃk’omi-moptʃk’omi
(elicited example)
(17) dadzi ‘thorns’ → dadzi-madzi
tsxeni ‘horse’ → tsxeni-meni
840 René Lacroix
(18) xortsi-mortsi var

meat-echo neg
xvala mkvei tʃ’v-er meʃ-o-bɣ-u-n dolox
only flour burn-part pv - appl -be-th - i 3 sg inside
‘[People cook these ingredients and call it kete.] There’s no meat or
anything like that in it, but only cooked flour.’ (K93.99)
This morphological process, called “echoic” by Haig (2001), is an areal phenom-
enon, attested in Turkish, Kurmandjî Kurdish, Zazaki, Persian, Armenian and
Georgian, as well as in languages of the Balkans (Haig 2001: 208–209). A similar
phenomenon is attested in the Indian linguistic area (Emeneau 1980: 114).
4. The verb
The morphology of Laz finite verbs encodes tense, aspect, mood, evidentiality,
person, valency and locative distinctions. Four prefixes, called “affirmative pre-
verbs”, have a range of additional functions (see below). Table 8 shows the order
of the verbal morphemes.
Table 8: Morphological slots in Laz finite verb forms

-4 preverb
-3 preverb
-2 person marking
-1 valency/TAM
0 root
1 causative
2 causative
3 thematic suffix/TAM
4 TAM
5 TAM
6 person marking
7 TAM/evidentiality
Morphologically, Laz verbs may be divided into two broad classes, on the basis of
the suffix realizing the feature “Set I, 3sg” in the present tense: Class 1 verbs have
-s, while Class 2 verbs have -n. Thematic suffixes as well differ. These suffixes,
glossed “th ”, occur in certain tenses only, like the present (see 19–20) and the
imperfect (see 27); in other tenses, like the aorist and the imperative, no thematic
suffix appears (see 22 and 63, respectively). Class 1 verbs have either no thematic
suffix or one of the thematic suffixes -am, -em, -im, -om, -um and -mer; Class 2
Laz 841
verbs have one of the thematic suffixes -er, -ir or -ur3. These morphological dis-
tinctions correlate with syntactic and semantic distinctions. In particular, all Class
2 verbs are intransitive.
The following sections present person marking, preverbs, tenses, valen-
cy-changing derivations and non-finite verb forms.
4.1. Person marking

Laz verbs are specified for at least one person-number feature and at most two per-
son-number features. These features will be called “Set I” and “Set II”. They are
both realized through prefixes and suffixes (slots -2 and 6 in Table 8). In transitive
verbs, the subject triggers agreement by means of Set I features, while the object
triggers agreement by means of Set II features. Thus in (19), the subject bozok
‘girl’ is represented in the verb by the suffix -s “Set I 3sg”, while the object is rep-
resented by the prefix m- “Set II 1st person”. In the intransitive verb illustrated in
(20), the subject dʒuma-tʃkimi ‘my brother’ triggers agreement by means of Set I
features, realized in this case by the suffix -n “Set I 3sg”. An argument which trig-
gers agreement by Set I features will be called “Set I argument”, and an argument
which triggers Set II features in the verb will be called “Set II argument”.
(19) bozo-k ma m-dzi-om-s
girl-erg 1 sg ii 1-see-th - i 3 sg
‘The girl sees me.’ (own data)
(20) dʒuma-tʃkimi tk’ob-u-n
brother-poss 1 sg hide-th - i 3 sg
‘My brother hides himself.’ (own data)
Table 9 gives the paradigm of affixes realizing Set I features, and Table 10 the par-
adigm of affixes realizing simultaneously Set I and Set II features. No verb form
is specified for Set II features only.4 In both Tables 9 and 10, the choice between
-s, -n and -u on the one hand and between -an, -nan, -es and -n on the other is con-
ditioned by the morphological class to which the verb belongs and by tense. The
dash indicates the position of the stem.
3
These thematic suffixes are realized as -e, -i and -u, respectively, when followed by a
person suffix beginning with /n/.
4
Lacroix (2014) puts forward a historical scenario explaining the origin of the distri-
bution of person-number suffixes in Table 10, the organization of which is difficult to
account for in synchrony.
842 René Lacroix
Table 9: Set I paradigm

Singular Plural
1 b— b—t
2 — —t
3 —s/n/u —an/nan/es/n
Table 10: Sets I-II paradigm

1sg 1pl 2sg 2pl 3sg 3pl
1sg g— b—
1pl g—t b—t
2sg m— —
2pl m—t —t
3sg m—s/n/u/ g—s/n/u —s/n/u —an/nan/es/n
3pl m—an/nan/es/n g—an/nan/es/n —an/nan/es/n
In a clause containing a 3rd person Set I argument and a 3rd person Set II argument,
only one of them can trigger plural agreement. In the transitive construction, the
Set I argument has this ability (see bozopek ‘girls’ and the plural suffix -an in 21a),
while the Set II argument does not (see bitʃ’epe ‘boys’ in 21b).
(21) a. bozo-pe-k bitʃ’i dzi-om-an
girl-pl - erg boy see-th - i 3. pl
‘The girls see the boy.’ (own data)
b. bozo-k bitʃ’-epe dzi-om-s
girl-erg boy-pl see-th - i 3 sg
‘The girl sees the boys.’ (own data)
Consider now examples (22a–c), which illustrate the potential derivation.5 The
argument referring to the participant who can do the action triggers agreement by
Set II features (cf. m- “Set II 1st person” in 22a); it is marked by the dative case (cf.
k’otʃepes ‘men’ in 22b and k’otʃis ‘man’ in 22c). This argument has the ability to
trigger plural agreement (see the suffix -es in 22b), while the Set I argument does
not (see tʃxomepe ‘fish’ in 22c).
5
This derivation is called after its most frequent reading; it has other uses as well (see
§4.4).
Laz 843
(22) a. tʃxomi va m-a-tʃ’op-u

fish neg ii 1- pot -take-aor . i 3 sg
‘I could not catch fish.’ (own data)
b. k’otʃ-epe-s tʃxomi va a-tʃ’op-es
man-pl - dat fish neg pot -take-aor . i 3. pl
‘The men could not catch fish.’ (own data)
c. k’otʃi-s tʃxom-epe va a-tʃ’op-u
man- dat fish-pl neg pot -take-aor . i 3 sg
‘The man could not catch the fish.’ (own data)
The term “inverse construction” will be used to characterize any construction like
the potential, where a Set II 3rd person argument has the ability to trigger plural
agreement. The term “plain construction”, in contrast, will characterize any con-
struction in which a Set I 3rd person argument has the ability to trigger plural agree-
ment. The inverse construction is found in four situations: the potential derivation;
the benefactive derivation, which adds a possessor, beneficiary or maleficiary to
the valency of the verb; the perfect tenses (perfect, pluperfect II and evidential
pluperfect II); and lexicalized verbs and idioms involving an experiencer or a pos-
sessor. All finite verbs in Laz are specified for Set I. One-place verbs in the inverse
construction include a Set I 3rd person marker which does not refer to any argument
(see among others 22a).
In all these constructions, the Set II argument is in the dative case. As it exhib-
its several subject properties (see Lacroix 2009a, section 11.2), it can be consid-
ered as a non-canonical subject.
The term “inverse construction” (or “inversion”) is common in South Cau-
casian linguistics. It conveys the idea that the verbal affixes, which represent the
subject and the complement of a plain construction, are “switched” in the inverse
construction, where they represent the complement and the subject, respectively. A
similar phenomenon can be observed in two other languages of the area, Kurdish
and Aramaic (see Haig and Khan, this volume, chapter 6.1 for discussion).
4.2. Preverbs
There are two types of preverbs in Laz: affirmative and locative. Affirmative pre-
verbs appear in slots -4 and -3. They are related to several characteristics of the
clause. In particular, they never occur in clauses negated with the standard nega-
tion var (hence their name), nor in relative clauses. When several clauses are coor-
dinated, affirmative preverbs tend to occur only on the verb of the last conjunct
(cf. d- in 23).
844 René Lacroix
(23) id-es id-es do d-u-lumdʒ-es

go-aor . i 3.pl go-aor . i 3.pl and pv - ii 3. appl -get_dark-aor . i 3. pl
‘They walked, walked, and got overtaken by night.’ (K93.120)
Affirmative preverbs seem also to be sensitive to information structure. In (24), for
instance, where the object is focused, the verb cannot take the affirmative preverb
-o.
(24) uʃkui p-tʃk’om-i *optʃk’omi
apple i 1-eat-aor
[What did you eat, an apple or a pear?] ‘I ate an apple.’ (own data)
In addition, affirmative preverbs are used for the flexion of certain tenses
(see §4.3). Affirmative preverbs are one of the most intricate issues in the grammar
of Laz. Further research is needed to figure out all the rules which determine their
use.
There are more than thirty locative preverbs, all of which occur in slot -3. Mor-
phologically, these can be simple (me-, mo-, e-, ge-, etc.) or complex (meʃa-, moʃa,
mek’a-, mok’a-, etc.). A few examples are given in (25).
(25) mo-xt-u ‘he came’
ama-xt-u ‘he came in’
gama-xt-u ‘he went out’
e-xt-u ‘he went up’
ge-xt-u ‘he went down’
meʃa-xt-u ‘he went (into a narrow place)’
dolo-xt-u ‘he went down (into a narrow place)
Locative preverbs are not restricted to motion verbs, as can be seen in (26).
(26) ma sanduɣi-s dolo-p-x-er
1 sg coffer-dat pv - i 1-sit-th
‘I’m sitting in the coffer.’ (Q.48)
Locative preverbs may be lexicalized. For instance, the following verbs can be
derived on the root -tʃ-: -tʃ- (without preverb) ‘feed’, gama-tʃ- ‘sell’, me-tʃ- and
mo-tʃ- ‘give’, ge-tʃ- ‘hit’, gela-tʃ- ‘play [an instrument]’.
4.3. Tenses and evidentiality

Tenses have exponents in various slots in the verb, as can be seen in Table 8. A list
of Arhavi Laz tenses is given in Table 11. Tenses in column 2 are formed on those
of column 1 by adding an affirmative preverb; tenses in column 3 are formed on
those of column 1 by adding the evidential suffix -doren (or one of its phonologi-
cally reduced variants); the general evidential imperfect (column 4) is formed by
Laz 845
adding both an affirmative preverb and the evidential suffix to the imperfect. This
is illustrated in (27). The construction of verbs in the perfect, pluperfect II and
evidential pluperfect II tenses is inverse (see §4.1).
Table 11: Tenses

1 2 3 4
present general present evidential present
imperfect general imperfect evidential imperfect general evidential imperfect
aorist evidential aorist
pluperfect I
subjunctive
optative
past optative
imperative
future hypothetical
future
perfect
pluperfect II evidential pluperfect II
(27) imperfect general imperfect

tʃ’ar-um-t’i do-tʃ’ar-um-t’i
write-th - impft pv -write-th - impft
‘you were writing’ ‘you used to write’
evidential imperfect general evidential imperfect
tʃ’ar-um-t’i-doe do-tʃ’ar-um-t’i-doe
write-th - impft -evd pv -write-th - impft - evd
‘you were writing, they say’ ‘you used to write, they say’
The evidential suffix -doren (or one of its variants) occurs in the right-most slot
in finite verb forms. It is used when reporting events related by another person,
which the speaker did not witness:
846 René Lacroix
(28) bere-tʃkimi-k ɣali-k uʃkuri

child-poss 1 sg - erg river-erg apple
gel-i-Ø-me-t’u-ʃi ar teɣi ko-dzir-u-doren
pv - mid -carry_along-th - impft . i 3 sg - temp one unit pv -see-aor . i 3 sg - evd
[A child finds an apple in a river and shows it to his father. The father asks:
‘Where did you find this apple?’ The son answers: ‘I caught it as the river
carried it along.’ The father decides to find the owner of the apple. He says
to a man he meets:] ‘My son saw an apple which was carried along by the
river.’ (Ž.27)
Further, the evidential is used when drawing an inference based on present evi-
dence:
(29) xodʒa do-ɣur-u-doren
hodja pv -die-aor . i 3 sg - evd
[A group of men see a hodja lying motionless on the ground. They say:]
‘The hodja has died!’ (D67.XXVIII)
Finally, verb forms with -doren may have a mirative meaning:
(30) haja nak’o i-monk’an-u-doren
prox . dem how_much mid -get_heavy-aor . i 3 sg - evd
[An old man carries another man he has captured in a basket. At some point,
he puts the basket on the ground and goes away for a while. In his absence,
the captured man slips out of the basket, fills it with stones and manure,
binds it again and goes away. The old man comes back. As he wants to put
the basket on his back again, he says:] ‘How much heavy this has gotten!’
(Ž.42)
4.4. Valency-changing derivations

Laz, as the other South Caucasian languages, has several valency-changing der-
ivations. These have incidences on various parts of verbal morphology: valency
markers (slot -1), causative suffixes (slots 1 and 2), thematic suffixes (slot 3), 3rd
person singular and plural suffixes (slot 6) and choice between the plain and the
inverse construction are affected by these derivations. The main derivations will
be illustrated here with verbs formed on the root -bɣ- and the locative preverb
dolo-. Verbs formed on the root -bɣ- are used when talking about a group of similar
countable objects (apples, blades of grass, lice, flies, sheep, people, djinns, etc.)
or of an amount of matter (ash, plaster, etc). The preverb dolo- indicates a down-
ward movement into a narrow place or position in such a place. For each verb, the
case of the arguments, the Set of person-number features they trigger in the verb
and possible adjuncts are indicated in angle brackets. The argument which has the
Laz 847
ability to trigger plural agreement is typed in bold. The Class of the verb is given
in parentheses.
The first example, (31), contains a plain transitive verb; its valency marker
(slot -1) is o-. The presence of o- in plain transitive verbs is not predictable and
is specified in the lexicon (the verb ‘see’, for instance, doesn’t take this prefix
(see 19)).
(31) plain transitive ‘<erg I> tips <abs II>’ (Class 1)
Xalili-k uʃkui dol-o-bɣ-am-s
Halil-erg apple pv - tr -tip-th - i 3 sg
‘Halil tips the apples.’ (own data)
The following two examples illustrate the middle derivation. Middle verbs are
marked by i- in slot -1. They may belong to Class 1 or Class 2. The verb in example
(32) is transitive and belongs to Class 1. It indicates that the subject is directly
affected or concerned by the event described by the verb (in this case, the subject
is the possessor of the adjunct dʒebis ‘pocket’ and is thus both the initiator and the
goal of the event). Class 1 middle verbs may have other readings.
(32) transitive middle ‘<erg I> tips <abs II> into his <adjunct>’ (Class 1)
Xalili-k uʃkui dʒebi-s dol-i-bɣ-am-s
Halil-erg apple pocket-dat pv - mid -tip-th - i 3 sg
‘Halil tips the apples in his pocket.’ (own data)
The middle verb in (33) belongs to Class 2. The reading of this intransitive verb
is anticausative, but here again, other readings exist. As can be seen, the thematic
suffix (-e) and the Set I 3sg suffix (-n) differ from those of the Class 1 verb in (32).
Middle voice in Laz is presented in more details in Lacroix (2012a).
(33) intransitive middle ‘<abs I> tips’ (Class 2)
uʃkui dol-i-bɣ-e-n
apple pv - mid -tip-th - i 3 sg
‘The apples scatter.’ (own data)
While the middle derivation illustrated in (33) is valency-decreasing, the applica-
tive, of which (34) is an example, adds one argument to the valency of the corre-
sponding plain transitive, making it ditransitive. The applicative argument ( Xalili-s
in this example) is in the dative and triggers Set II agreement.6 It has the seman-
tic role of beneficiary. Other possible semantic roles are maleficiary, possessor,
location, goal and addressee. The valency prefix of this derivation is i- when the
applicative argument is 1st or 2nd person and u- when it is 3rd person.
6
Case marking of the applicative argument is not the same as that of the object, which
is in the absolutive. In consequence, this derivation should better be analyzed as a
‘non-canonical’ applicative.
848 René Lacroix
(34) transitive applicative ‘<erg I> tips <abs> for <dat II>’ (Class 1)
Xalili-s uʃkui dolo-b-u-bɣ-am
Halil-dat apple pv - i 1- ii 3. appl -tip-th
‘I’m tipping apples for Halil.’ (own data)
The benefactive derivation, illustrated in (35), is marked by a in the preroot slot.
The construction includes two arguments, one in the dative and one in the abso-
lutive. The dative argument may have the semantic role of beneficiary, possessor
or maleficiary. It shares many properties with an applicative argument, but differs
from it in one important respect: it has the ability to trigger plural agreement. The
dative argument here is a non-canonical subject.
(35) benefactive ‘<dat II>’s <abs I> tips’ (Class 2)
Xalili-s uʃkui dol-a-bɣ-e-n
Halil-dat apple pv - mid . appl -tip-th - i 3 sg
‘Halil’s apples scatter.’ (own data)
The same prefix a appears in the potential derivation (36), already discussed above.
As mentioned there, this derivation has other readings. In particular, it is used to
indicate that the subject acts accidentally and involuntarily (37).7
(36) potential ‘<dat II> is able to tip <abs I>’ (Class 2)
Xalili-s uʃkui dol-a-bɣ-e-n
Halil-dat apple pv - pot -tip-th - i 3 sg
‘Halil is able to tip the apples.’ (own data)
(37) deagentive ‘<dat II> tips <abs I> accidentally’ (Class 2)
Xalili-s uʃkui ko-dol-a-bɣ-u
Halil-dat apple pv - pv - pot -tip-aor . i 3 sg
‘Halil tipped the apples inadvertently.’ (own data)
The benefactive derivation (35) and the potential/deagentive derivation (36–37)
can be distinguished from one another on the basis of semantic and syntactic
properties. In the benefactive derivation, the referent of the dative argument is
affected by the event described by the verb, but is not its instigator. It cannot be
the addressee of a command. By contrast, the referent of the dative argument in
the potential/deagentive derivation is the instigator or source of the process. In the
case of the deagentive, the dative argument can be the addressee of an order (38).
(38) tabaɣ-epe mo me-g-a-tk’otʃ-ap-u-t’a-s
plate-pl proh pv - ii 2- pot -throw-caus - th - subj - i 3 sg
‘[Throw away everything, but watch out!] Don’t throw away the plates!’
(own data)
7
This “deagentive” reading seems more natural in the aorist tense than in the present.
Laz 849
Historically, both the benefactive and the potential/deagentive derivations result

from a middle-applicative derivation; the dative argument was an applicative argu-
ment, which took on subject properties in the course of time. This explains the
morphological and syntactic similarities between these constructions. The history
of the potential/deagentive derivation is discussed in Lacroix (2009a, section
9.8.6.2.3).
The causative derivation is marked by the suffixes -ap or -in immediately after
the root. -ap is used when deriving a causative from a transitive or intransitive verb
(39), while -in is used only for causatives derived from intransitive verbs. Causa-
tive verbs take the preroot prefix o-.
(39) causative ‘<erg I> has <dat II> tip <abs>’ (Class 1)
Xalili-k Ali-s uʃkui dol-o-bɣ-ap-am-s
Halil-erg Ali-dat apple pv - caus -tip-caus - th - i 3 sg
‘Xalil has Ali scatter apples.’ (own data)
The same root -bɣ- may be used to derive the applicative stative verbs ‘x is some-
where’ and ‘somebody’s x is somewhere, somebody has x somewhere’, where x
is a group of similar countable objects or an amount of matter. These verbs are
characterized by the thematic suffix -u(r). They are intransitive. The first one is
illustrated in (40). Its valency marker is o-8. The applicative argument t’ik’inas
‘basket’ indicates the place where the apples are located. The second verb (41)
co-occurs with two dative NPs. The first one, Xalilis, has the semantic role of
possessor. The second dative NP, t’ik’inas, is an adjunct and indicates the place
where the apples are located. The verb takes the valency marker i- when the Set II
argument is 1st or 2nd person and u- when it is 3rd person. Historically, this con-
struction derives from an applicative construction; hence the similarities between
them.
(40) intransitive locative applicative ‘<abs I> is in <dat II>’ (Class 2)
uʃkui t’ik’ina-s dol-o-bɣ-u-n
apple basket-dat pv - appl -be-th - i 3 sg
‘There are apples in the basket.’ (own data)
(41) intransitive possessive applicative ‘<dat II> has <abs I> in <adjunct>’
(Class 2)
Xalili-s t’ik’ina-s uʃkui dol-u-bɣ-u-n
Halil-dat basket-dat apple pv - ıı 3. appl -tip-th - ı 3 sg
‘Halil has apples in his basket.’ (own data)
8
As is clear by now, the prefix o- in slot -1 has a range of functions: it appears in certain
transitive verbs (31), in all causative verbs (39), and in locative applicative verbs (40).
850 René Lacroix
4.5. Negation
As we saw in section 2.2, negation is expressed by var, which is stressed and
precedes the verb. All pre-root slots can be filled in negative verbs except slot 4,
which hosts the affirmative preverb ko- (see §4.2); this is illustrated in (42). The
additive suffix -ti can be added to the negation, yielding the meaning ‘even not’
(43), which suggests that var is not completely bound to the verb.
(42) var do-m-i-dʒox-i
neg pv - ii 1- appl -call-aor
‘you didn’t call me’ (D37.V)
(43) var-ti ox-a-nk’an-es-doren
neg - add pv - pot -wobble-aor . i 3. pl - evd
‘They could even not manage to wobble him.’ (D67.I)
4.6. Non-finite verbs

Laz has four non-finite verb derivations: verbal noun, general participle, future
participle and negative participle. The morphological slots of non-finite verb
forms are presented in Table 12.
Table 12: Morphological slots in non-finite verb forms

-2 u- (negative participle)
-1 preverb
0 root
1 derivational suffix (-am, -im, -um, -in, -ap, -al, -mal)
2 -u (verbal noun, negative participle), -eri (general participle), -oni (future participle)
There is some variation in the choice of the derivational suffix in slot 1. The future
participle of the verb ‘see’, for instance, can be u-dzir-am-u (52) or u-dzir-ap-u
(53).
The verbal noun inflects like a noun and can appear in the same positions as
nouns. The internal structure of verbal noun constructions involves some varia-
tion: the object of the corresponding transitive finite verb may appear in the geni-
tive or in the absolutive. Both cases are illustrated in sentence (44), where k’andɣu
‘strawberry’ is in the absolutive, while oda ‘room’ is in the genitive.
(44) ma k’andɣu o-gor-u ʃeni mo-p-t-i
1 sg strawberry pv -look_for-vn for pv - i 1-come-aor
oda-ʃi o-kos-u ʃeni var mo-p-t-i
room-gen pv -wipe-vn for neg pv - i 1-come-aor
‘I came here to get strawberries, not to clean a room.’ (D67.IX)
Laz 851
The general participle may be used in an attributive, predicative (45–46) or adver-

bial function (47). The general participle is not syntactically oriented towards
any particular argument. For instance, it determines an argument representing the
object of the corresponding transitive verb in (45), an argument representing the
subject of the corresponding transitive verb in (46) and an argument represent-
ing the subject of the corresponding intransitive verb in (47)9. Furthermore, the
general participle may have a perfective (45–46) or imperfective meaning (47).
(45) uʃkuri tʃk’om-eri t’u
apple eat-part be.impft . i 3 sg
‘The apples were eaten.’ (state) (Ž.4)
(46) ma gjari tʃk’om-eri b-ore
1 sg food eat-part i 1-be
‘I’ve already eaten.’ (Ž.89)
(47) t’orodʒi bozo-ʃe putx-in-ej ko-mo-xt-u
dove girl-all fly-suf - part pv - pv -come-aor . i 3 sg
‘The dove flew towards the girl.’ (D67.V)
The future participle can be used as an attribute (48) or in predicative function
(49).
(48) o-tʃaliʃ-oni jer-epe var m-a-dzir-u-ja
pv -work-fp place-pl neg ii 1- pot -see-aor . i 3 sg - rs
‘I couldn’t find a place to work.’ (Ž.38)
(49) ʒin k’at’i o-kos-oni m-i-ɣ-u-nan
above floor pv -sweep-fp ii 1- appl -have-th - i 3. pl
‘We have to sweep the above floor.’ (D67.IX)
The negative participle can be used as an attribute (50), in predicative (51) or
adverbial function (52). Here again, the internal structure of negative participle
constructions involves some variation: in (52), the object of the corresponding
finite verb, dunja, is in the absolutive, while in (53), it is in the genitive case (arti-
katiʃ).
(50) ar tʃ’ut’a u-gub-u xortsi
one little np -cook-np meat
‘some uncooked meat’ (D67.I)
(51) masuma-na t’u u-tʃil-u t’u
third-sub be.impft . i 3 sg np -marry-np be.impft . i 3 sg
‘The third one was not married.’ (K72.137)
9
The suffix -ej in example (47) is a free variant of -eri.
852 René Lacroix
(52) dunja u-dzir-am-u heʃo i-k’itx-om-t’es-doren

world np -see-suf - np thus mid -study-th - impft . i 3. pl - evd
‘[The school was under the earth.] They studied that way, without seeing
the world.’ (D67.I)
(53) xut ts’ana-s artikati-ʃ u-dzir-ap-u orop-er-ep
five year-dat recipr - gen np -see-suf - np love-part - pl
‘the lovers, who hadn’t seen each other for five years’ (D37.VIII)
5. Simple clauses
5.1. Word order

Basic constituent order is SOV (see, among others, exs. 19, 21 and 31). Depend-
ing on information structure, other orders are possible. The main regularity is that
topicalized terms are fronted while focalized terms occur in immediate preverbal
position. In addition, elements known from preceding context can follow the verb.
Examples can be seen in (66) and (72) below, where the postposed dative argu-
ments are already known from the preceding context.
Arhavi Laz has postpositions and no prepositions (examples can be found in
(54), (71a) and (73)). Demonstratives, adjectives, genitival complements, numer-
als and, most often, relative clauses precede the head nominal (54–55).10
(54) he mskva bozo ʃeni
dist . dem beautiful girl for
‘for that beautiful girl’ (own data)
(55) Nurteni-ʃi sum bee
Nurten-gen three child
‘Nurten’s three children’ (own data)
5.2. Alignment
The subject of an intransitive verb may be in the absolutive, ergative or dative
case. Ergative intransitive subjects are animate and in many cases have control
over the event described by the verb. Dative intransitive subjects occur in inverse
constructions (see §4.1). Absolutive subject intransitive verbs constitute the larger
class. The alignment of these verbs is mixed: case marking follows an ergative
pattern (O and S are in the absolutive and A is in the ergative) while verb agree-
ment follows an accusative pattern (A and S trigger Set I features, while O triggers
10
Relative clauses are discussed in the next section.
Laz 853
Set II features). Examples of absolutive subject intransitive verbs can be found in

(20) and (29); for transitive verbs, see (19) and (31).
5.3. Adjuncts
Adjuncts may be in any case except the ergative. Only the dative, which has a
wide range of functions, will be illustrated here. This case marks different spatial
roles (location (41), goal (56) and source (57)), different time roles (location (58),
duration (59), frequency (60) and time necessary for the fulfillment of an action
(61)) as well as price (62).11
(56) lazi-ʃi oxori-s ar k’intʃi mo-xt-u
Laz-gen house-dat one bird pv -come-aor . i 3 sg
‘A bird came to the Laz’ house.’ (Ž.30)
(57) Axmet’ jemluɣi-s ko-gama-xt-u
Ahmet manger-dat pv - pv -go_out-aor . i 3 sg
‘Ahmet went out of the manger.’ (D37.XII)
(58) k’iʃi-s termoni b-i-kom-t’i-t
winter-dat termoni i 1- mid -do-impft - pl
‘In winter we cooked termoni [a dish].’ (own data)
(59) sum dɣa do sum seri-s tʃ’anda t’u
three day and three night-dat reception be.impft . i 3 sg
‘The reception lasted three days and three nights.’ (Ž.35)
(60) k’at’a ts’ana-s mtvii mt-um-s
each year-dat snow snow-th - i 3 sg
‘It snows every year.’ (own data)
(61) sum dek’ik’e-s aʒlija-ʃ saraji-ʃe ko-mo-xt-u-doren
three minute-dat dragon-gen palace-all pv - pv -come-aor . i 3 sg - evd
‘He came to the dragon’s palace in three minutes.’ (D67.I)
(62) haa xut lia-s e-p-tʃ’op-i
prox . dem five lira-dat pv - i 1-buy-aor
‘I bought it for five liras.’ (own data)
11
In addition to these uses, the dative marks indirect objects (as in the applicative der-
ivation – §4.4), non-canonical subjects (in the inverse construction – §4.1), and the
complement of certain postpositions.
854 René Lacroix
6. Complex clauses

As already mentioned, relative clauses precede the head nominal.12 Their main
verb is finite in the sense that the same form can head independent clauses. It
occurs at the end of the relative clause. The relative clause includes at least one
occurrence of the clitic subordinator na. Relative clauses use a gap strategy: the
position relativized is left empty. All these properties are illustrated in example
(63).
(63) hats’i si [divi-na dʒan-s jer] do-m-o-gur-i
now 2 sg giant-sub lie-i 3 sg place pv - ii 1- tr -teach-imp
‘Now tell me the place where the giant is lying.’ (D37.VIII)
Subjects, direct objects, indirect objects and obliques may be relativized, using the
same strategy.
In free relative clauses (i. e. relatives lacking a head nominal), the plural and
case markers, which normally occur on the head of the NP, are directly attached to
the verb of the relative. Examples can be found in (7) and (10) above.
As we are going to see, the clitic subordinator na appears in other types of
complex clauses. In the speech of some informants, it is attached to the following
word, while other informants attach it to the preceding word in the dependent
clause. The subordinator na may occur more than once in the dependent clause
and may be inserted in an NP which is itself embedded in the relative clause (64).
When repeating a sentence containing na, my consultant often changes its posi-
tion in the clause, which suggests that it is not determined by rigid phonological
or syntactic rules. The subordinator na is examined more exhaustively in Lacroix
(2012b).
(64) he-na ktʃini-na dolo-tk’otʃ-es oxordʒa do
dist . dem - sub old_woman-sub pv -throw-aor . i 3. pl woman and
k’otʃi
man
‘the woman and the man who threw that old woman’ (own data)
The relativizing strategy of Laz is typologically and areally uncommon. Broadly
speaking, Turkic and North Caucasian languages, which mainly use prenominal
participial relatives, contrast with Indo-European and South Caucasian languages,
where postnominal relatives headed by finite verb forms are common. In Laz,
relative clauses are prenominal and headed by finite verb forms. This strategy
12
The corpus of published Arhavi Laz texts includes a few occurrences of another relativ-
ization strategy, where the relative is postnominal (see Lacroix 2009a, section 12.2.6).
Laz 855
thus differs from those found in the other South Caucasian languages and in other
neighboring languages (Lacroix 2009b).
6.2. Complement clauses

Verbal complement clauses use several strategies. In (65), the internal structure
of the complement clause is identical to that of relative clauses (see the preceding
section).
(65) bere-k [mgej-epe-na t’u] k-ox-o-ts’on-u
child-erg wolf-pl - sub be.impft . i 3 sg pv - pv - tr -understand-aor . i 3 sg
‘The child understood that it was the wolves.’ (D67.II)
Other verbs using this strategy are -dzir- ‘see’, -gn- ‘hear’, -n- ‘want’, -ts’v- ‘tell’
and -tʃk- ‘know’.
A second type of complement clause makes use of the complementizer ki,
which originates in Persian and is also used in Turkish, Georgian and several other
languages of the area. Complement clauses with ki occur after the matrix verb.
Phonologically, the complementizer attaches to the last word of the matrix clause
(66).
(66) deli-k u-ts’-u tʃami-s[-ki pudʒi
madman-erg ii 3. appl -tell-aor . i 3 sg pine-dat - compl cow
o-tʃk’om-i]
pv -eat-aor
‘The madman said to the pine: ‘You’ve eaten the cow’.’ (Ž.105)
Other predicates which use the ki strategy are -duʃun- ‘think (about sth.)’, -k’itx-
‘ask’, -gn- ‘hear’, -tkv- ‘say’ and -tsk’ed- ‘see’.
Thirdly, complement clauses may include reported speech marked by one of
the clitics ja, ma or ʃo (depending on the person of the superordinate verb), as in
(67). This strategy is extended to verbs of thought (68).
(67) mi-k g-i-ts’-u [si karmat’e-ʃ nek’na
who-erg ii 2- appl -tell-aor . i 3 sg 2 sg mill-gen door
go-ntsk’-i-ja]
pv -open-imp - rs
‘Who told you to open the door of the mill?’ (D37.II)
(68) tʃkimi ʃeni [i-mt’-asen-ja] g-i-tʃk-i-n-na
1 sg . gen for mid -run_away-fut . i 3 sg - rs ii 2- appl -know-th - i 3 sg -if
‘If you think that I’ll run away…’ (lit. ‘if you think about me: ‘She’ll run
away’’) (Ž.50)
856 René Lacroix
Fourthly, complement clauses corresponding to content questions do not involve

any overt marker. Compare in this respect example (15) with the corresponding
indirect question in (69).
(69) [ma mundes b-ɣur-are] ko-g-i-tʃk-i-n
1 sg when i 1-die-fut . i 1/2 sg pv - ii 2- appl -know-th - i 3 sg
‘You know when I’ll die.’ (D67.XXVIII)
Other predicates which use this construction are oxo-ts’on- ‘understand’,
go-tʃk’ond- ‘forget’, -tkv- ‘say’, -ts’v- ‘tell’, -dzir- ‘see’ and do-gur- ‘learn’.
All the complementation types we have seen so far involve finite verb forms.
Laz has a further complementation strategy, which involves the verbal noun (70).
Other predicates that use the verbal noun strategy are -tʃk’ond- ‘forget’ and -n-
‘want’.
(70) [o-p’aamit-u-s] ko-gj-o-tʃk’-es
pv -talk-vn - dat pv - pv - appl -begin-aor . i 3. pl
‘They began to talk.’ (K72.136)
6.3. Adverbial clauses

Most adverbial clauses have the structure of postpositional phrases in which the
complement of the postposition is a clause rather than an NP. This is illustrated in
(71b), which can be compare with (71a), where the postposition takes a nominal
complement.
(71) a. [noɣa ʃakis] mende-m-o-on-i-t
town until pv - ii 1- tr -take-imp -1/2 pl
‘Take me to town!’ (K72.135)
b. [datʃxur-ja do do-b-i-mɣor-am-t’a ʃakis]
fire-rs and pv - i 1- mid -shout-th - subj until
ge-m-o-ntʃ’-i-t
pv - ii 1- tr -send_down-imp -1/2 pl
‘Send me down till I cry ‘fire!’’ (D37.VIII)
As can be seen, adverbial clauses with ʃakis do not include the subordinator na.
The same structure obtains in adverbial clauses formed with ʃkule ~ ʃk’ule ‘after,
while, when, on the point of’ and doni ‘since’. Adverbial clauses formed with ʃeni
‘in order to, since (cause)’ (ex. 72, to be compared with ex. 54), steri ‘as (com-
parison, manner and time)’ and k’onari ‘as much as’ include the subordinator na.
Laz 857
(72) ma [arka-z-na m-ul-u-nam-pe-na m-a-dzi-a

1 sg back-dat -sub pv -come-th - i 3. pl - pl - sub ii 1- pot -see-opt . i 3 sg
ʃeni] tertsi-z ge-p-x-e guruni-z

for opposite-dat pv - i 1-sit-th donkey-dat
‘I’m sitting backward on my donkey so that I can see those who come
behind me.’ (own data)

Conditional clauses precede the main clause. They are marked either by na or,
much less frequently, by k’on, which appear at the end of the conditional clause
(73).
(73) [tʃkim k’al ko-mo-xt-a-na]
1 sg . gen with pv - pv -come-opt -if
si muntxa g-i-no-n do-g-o-gur-are
2 sg anything ii 2- appl -want-i 3 sg pv - ii 2- tr -teach-fut . i 1/2 sg
‘If you come with me, I’ll teach you anything you like.’ (D37.VI)
It results from the discussion of complex clauses that na can be considered as
a multi-purpose subordinator: it appears in relative, complement and adverbial
clauses, where it occurs inside the dependent clause, and in conditional clauses,
where it occurs at the end of the dependent clause.
The following text is taken from Lacroix (2009a, text 9). Elements in bold are
Turkish loans.
(74) ar k’otʃi hapisane-ʃe ama-xt-u-don ama deli jen
one man prison-all pv -go_in-aor . i 3 sg - evd but mad be.i 3 sg
‘A man went to prison. He is mad.
(75) hek-na i-tʃaliʃ-am-s görevli k’otʃi-k deli-s
there-sub mid -work-th - i 3 sg in_charge man-erg mad-dat
u-ts’u-me-s
ii 3. appl -tell-th - i 3 sg
An employee who works there says to the madman:
858 René Lacroix
(76) si ar ts’ana-s ʒur ts’ana-s otxu ts’ana-s

2 sg one year-dat two year-dat four year-dat
g-o-tsk’-e do iji he ɣorma-s o-tsk’-e
pv - appl -look-th and all_the_time dist . dem hole-dat appl -look-th
“You’ve been looking through this hole constantly for one, two, four years.
(77) a ndɣa-s-ti ma me-m-a-ʃkv-i do
one day-dat - add 1 sg pv - ii 1- mid . appl -let-imp and
ar ma b-o-tsk’ed-a
one 1 sg i 1- appl -look-opt
Let me go there someday, so that I too look at it.”
(78) hemu-k-ti zop’on-s-ki o-tsk’ed-i
dist . dem - erg - add say-i 3 sg - comp appl -look-imp
The madman says: “Look”.
(79) mola-katʃ-e-epe-s-na o-tsk’-e-n k’otʃi-k-ti
pv -shut_up-part - pl - dat - sub appl -look-th - i 3 sg man-erg - add
ɣorma-s o-tsk’ed-u-ʃkule mutu va a-dzi-u
hole-dat appl -look-aor . i 3 sg - temp anything neg pot -see-aor . i 3 sg
When the man in charge of the prisoners looked through the hole, he
couldn’t see anything.
(80) deli-s u-ts’u-me-s görevli-k
mad-dat ii 3. appl -tell-th - i 3 sg in_charge-erg
He said to the madman:
(81) hak mutu va ren do
here anything neg be.i 3 sg and
mu-s o-tsk’-e nak’o ts’ana-s
what-dat appl -look-th how_many year-dat
“There’s nothing here. What have you been looking at for so many years?”
(82) ma nak’o ts’ana-s b-o-tsk’ed-i
1 sg how_many year-dat i 1- appl -look-aor
mutu va m-a-dzi-u do
anything neg ii 1- pot -see-aor . i 3 sg and
“I looked through the hole so many years and was still not able to see
anything.
(83) si ar defa o-tsk’ed-i do mu dzi-ae-ki
2 sg one time appl -look-aor and what see-fut .1/2 sg - comp
You who have looked only once, what could you see?”’
Laz 859
Abbreviations
add additive perf perfective

all allative pl plural
aor aorist poss possessive
appl (non-canonical) applicative pot potential
caus causative proh prohibitive
compl complementizer prox proximal
dat dative pv preverb
dem demonstrative recipr reciprocal
dist distal rs reported speech
echo echoic sg singular
emph / refl emphatic-reflexive sub multi-purpose subordinator
erg ergative subj subjunctive
evd evidential suf suffix lacking a semantic
fp future participle counterpart
fut future temp temporal suffix or postposition
gen genitive th thematic suffix
imp imperative tr transitive
impft imperfect vn verbal noun
mid middle voice 1, 2, 3 1st, 2nd, 3rd person
neg negation i Set I person-number features
np negative participle II Set II person-number features
opt optative x~y x and y are free variants
part general participle
References
References to the sources of examples

In the reference of the examples, I indicate the book from which the example was taken and
the text number, separated by a dot. Thus, “K72.137” holds for K’art’ozia 1972, text 137.
The references of the books are abbreviated as follows:
D37 Dumézil 1937
D67 Dumézil 1967
K72 K’art’ozia 1972
K93 K’art’ozia 1993
Q Q’ipšidze 1939
Ž Žɣent’i 1938
860 René Lacroix
Other references
Andrews, Peter A. 1989. Ethnic Groups in the Republic of Turkey. Compiled and edited
with the assistance of Rüdiger Benninghaus. Wiesbaden: Reichert.
Dumézil, Georges. 1937. Contes Lazes. Paris: Institut d’Ethnologie.
Dumézil, Georges. 1967. Documents anatoliens sur les langues et les traditions du Cau-
case, IV. Récits lazes (dialecte d’Arhavi). Paris: Presses Universitaires de France.
Feurstein, Wolfgang. 1983. Untersuchungen zur materiellen Kultur der Lazen. Freiburg:
Albert-Ludwigs Universität, unpublished MA thesis.
Haig, Geoffrey. 2001. Linguistic diffusion in modern East Anatolia: From top to bottom.
In Alexandra Y. Aikhenvald & R.M.W. Dixon (eds.), Areal diffusion and genetic in-
heritance: Problems in comparative linguistics, 195–224. Oxford: Oxford University
Press.
Holisky, Dee Ann. 1991. Laz. In Alice C. Harris (ed.), The Indigenous Languages of the
Caucasus, vol. 3, 395–472. Delmar & New York: Caravan Books.
K’art’ozia, Guram. 1972. Lazuri t’ekst’ebi [Laz texts]. Tbilisi: Mecniereba.
K’art’ozia, Guram. 1993. Lazuri t’ekst’ebi II [Laz texts 2]. Tbilisi: Mecniereba.
Kutscher, Silvia. 2008. The language of the Laz in Turkey: Contact-induced change or grad-
ual language loss? Turkic Languages 12(1). 82–102.
Lacroix, René. 2009a. Description du dialecte laze d’Arhavi (caucasique du sud, Turquie).
Grammaire et textes. Lyon: Université Lyon 2 dissertation. http://theses.univ-lyon2.fr/
documents/lyon2/2009/lacroix_r (accessed 12 July 2016).
Lacroix, René. 2009b. Laz relative clauses in a typological and areal perspective. In Peter
K. Austin, Oliver Bond, Monik Charette, David Nathan & Peter Sells (eds.), Language
Documentation and Linguistic Theory 2, 205–210. London: SOAS.
Lacroix, René. 2011. Ditransitive constructions in Laz. Linguistic Discovery 9(2). 78–103.
Lacroix, René. 2012a. Laz middle voice. In Gilles Authier & Katharina Haude (eds.), Erga-
tivity, Valency and Voice, 165–198. Berlin & Boston: De Gruyter Mouton.
Lacroix, René. 2012b. The multi-purpose subordinator na in Laz. In Holger Diessel &
Volker Gast (eds.), Clause Linkage in Cross-Linguistic Perspective: Data-Driven Ap-
proaches to Cross-Clausal Syntax, 77–103. Berlin & Boston: De Gruyter Mouton.
Lacroix, René. 2014. Origin of Sets I-II suffixes in South Caucasian through reanalysis.
In Nino Amiridze, Tamar Reseck & Manana Topadze Gäumann (eds.), Advances in
Kartvelian Morphology and Syntax (Studia Typologica 14), 47–78. Berlin: Akademie
Verlag.
Q’ipšidze, Ioseb. 1939. Č’anuri t’ekst’ebi [Laz texts]. Tbilisi: SSRK’ Mecnierebata
Ak’ademiis Sakartvelos Pilialis Gamomcemloba.
Toumarkine, Alexandre. 1995. Les Lazes en Turquie (XIXe-XXe siècles). Istanbul: Isis.
Žɣent’i, Sergi. 1938. Č’anuri t’ekst’ebi, arkabuli k’ilok’avi [Laz texts (dialect of Arhavi)].
Tbilisi: SSRK’ Mecnierebata Ak’ademiis Sakartvelos Pilialis Gamomcemloba.
6.3. Ossetic
David Erschler
1. Introduction
Ossetic is a cover term for two closely related, but not mutually intelligible Eastern
Iranian languages, often called dialects, Iron and Digor1. Ossetic is spoken in
North Ossetia-Alania, an autonomous republic within Russia, and South Ossetia,
a breakaway part of Georgia.
In earlier work, Ossetic was considered a member of the Northeastern Iranian
subbranch of the Iranian (along with Yaghnobi, Sogdian, and a number of other
extinct languages), however, recently doubt has been cast upon the validity of this
subgroup. Moreover, Eastern Iranian languages probably do not form a genetic
unit either, Sims-Williams (1996).
Systematic research on Ossetic started with Sjögren (1844) and Rosen (1846).
Miller (1903) was the first diachronically oriented comprehensive description
of Ossetic morphology. Miller (1881) and Miller and Stackelberg (1891) col-
lected and published Ossetic texts with Russian and German translations. The
grammar sketch Stackelberg (1886) is based on these texts. Other editions of
Ossetic texts with translations into European languages are Christensen (1921)
and Munkácsi (1932). Most of later synchronic research on Ossetic was pub-
lished2 in Russian (e. g., Iron Ossetic grammars Axvlediani 1963/1969 and
Bagaev 1965/1982; monographic treatments Gabaraev (1977) on word formation;
Tekhov (1970) on modality; Kudzoeva (2003) on word order; Tedeev (1989) and
Dzodzikova (2009) on verb structure. Digor is addressed in the grammar sketches
Isaev (1966); Takazov (2009), and the Digor-Russian dictionary Takazov (2003).
Perhaps more accessible to the English-reading audience are Abaev (1964)
(although all language data are presented in the Cyrillic-based orthography), a brief
general sketch in Thordarson (1989); an overview of phonology by Testen (1997),
an overview of word formation Erschler (2015); and a posthumously published
1
For reasons of space, I do not always provide examples from both languages. When
Iron and Digor forms are given simultaneously, they are shown in the following order
and separated by a slash: gɐdə / tikis ‘cat’; otherwise examples are marked with (I) and
(D) respectively. Data for this chapter were collected in the course of my fieldwork
in North Ossetia in 2007–2013. I am grateful to all my consultants for their generous
helpfulness. I have worked particularly much with Arbilana Abaeva, Tsara Dzhanaev,
and Elizaveta Kochieva (Iron); and with Sveta Gatieva, Marina Khamitsaeva, Khasan
Maliev†, Chermen Takazov, Fedar Takazov, and Vera Takazova (Digor).
2
Unpublished dissertations are not listed here, as they are virtually inaccessible even in
Russia.
862 David Erschler
collection of sketches Thordarson (2009). Furthermore, Haspelmath (1997) has an

overview of indefinite pronouns in Iron Ossetic. Some of the recently published
papers in English are Belyaev and Vydrin (2011); Erschler (2009, 2012a, 2012b);
Erschler and Volk (2011); and Tomelleri (2009).
Ossetians are descendants of medieval Alans and, through the Alans, of Sar-
matians. Their settlement in the piedmont of the Caucasus dates back to the first
centuries of the Common Era. Before that, nomadic ancestors of Ossetians wan-
dered in Eurasian steppes between the Danube and the Altay mountains. In early
middle ages, the Christian Alanic kingdom controlled much of the north-western
Caucasus. It was destroyed in the 13th–14th c. by Mongols and Timur, Abaev
and Bailey (1985). Following the kingdom’s destruction, the ancestors of Osse-
tians only survived in high mountain valleys, and, until the early 18th c., data
about them is extremely scarce. By the early 18th c., their territory roughly corre-
sponded to the highlands of the modern South and North Ossetia, Volkova (1974).
A subgroup of Alans, known as Iassi, migrated to Hungary in the 13th or 14th c.
Although they lost the language at some point, their descendants remained a sepa-
rate ethnic group within Hungary at least until the early 20th c.
In the recent past, Ossetians lived in high mountain valleys and mainly prac-
ticed animal husbandry. However, in the course of the last two centuries, the
majority of the population moved to the plains and took up agriculture. Currently,
their lifestyle is considerably urbanized.
The earliest written attestations of Alanic/early Ossetic are an Alanic tomb-
stone inscription from the tenth to twelfth century, a few lines in Alanic in a
twelfth century Byzantine manuscript Theogony by Ioannes Tzetzes, a few four-
teenth or fifteenth century glosses in a Greek manuscript (Alemany 2000; Kam-
bolov 2006: 166–207) and an Iassic word list recorded in Hungary in the 15th
c., Németh (1959). The first “real” texts date back to the late eighteenth to early
nineteenth century (translations from Church Slavonic and Georgian).
Ossetians form the majority of the population in both North and South Ossetia.
Most of them are Iron speakers (modulo the language loss of the last half-cen-
tury), but no precise figures are available. According to the 2010 census, there are
528,000 Ossetians (Iron and Digor) in Russia. Moreover, an unknown number of
Iron speakers live in South Ossetia, probably between 30,000 and 70,000. Also,
a few thousands Ossetians (Iron and Digor) live in Turkey. Digor Ossetians live
mostly in Digor and Iraf districts of North Ossetia. The total population of these
districts according to the 2010 census is about 35,000; this figure can be inter-
preted as a rough estimate of the number of Digor speakers.
Digor has two major dialects, one spoken in the town of Digora and another in
the villages of Chikola, Sheker, Dzagepbarz, Khazhnidon, Toldzgun, and Lesken
(see Figure 1). They differ in phonetics and, probably, to some extent in morpho-
syntax as well. It is the Chikola-Lesken dialect of Digor that is represented in this
chapter. Moreover, there was a mixed Iron-Digor dialect spoken in the Wallagkom
Ossetic 863
valley (Isaev 1966), which is now virtually extinct. Another mixed Iron-Digor

dialect emerged in the village of Ursdon-Sindziqaw. No systematic study of Digor
dialects has been undertaken so far.
Iron has about half a dozen dialects (Bekoev 1985). A major South Ossetian
dialect, Kudar, is sometimes perceived as the third Ossetic language. While pho-
netic differences between Iron dialects are fairly well described, morphosyntactic
differences have not been studied systematically. All dialects use the same orthog-
raphy, which does not reflect the phonetic differences.
Nominally, the majority of North Ossetians and virtually all South Ossetians
are Orthodox Christian, and, in North Ossetia, there exists a sizeable Moslem
minority. In fact, the traditional syncretic religion is still widely practiced.
In North Ossetia, no monolingual speakers remain, and at least 40 % of Osse-
tians are more fluent in Russian than in Ossetic according to Kambolov (2007).
Kambolov’s data are based on respondents’ self-evaluation, and the actual per-
centage of fluent speakers may be significantly lower. In North Ossetia, Ossetic
is now effectively restricted to the private sphere. Street signs, price tags in shops
or announcements in public transportation are all in Russian, which is also the
language of local authorities and courts.
The language situation in South Ossetia is somewhat better in this respect.
While the main language of education is Russian as well, in some rural schools the
language of education is at least partly in Iron for first several years. However, it
then completely switches to Russian and Iron is only taught only as a subject. The
situation with Digor is even worse.
In rural areas, Ossetic is still acquired by children. However, about 50 % of all
Ossetians live in Vladikavkaz, the Russian speaking capital of North Ossetia, and
children raised there usually grow up semispeakers at best. Furthermore, a high
level of unemployment in rural areas causes migration to Vladikavkaz and to inner
regions of Russia. All this makes the long term survival of Ossetic questionable.
Both Iron and Digor are written languages, books and periodicals are published,
and the Vladikavkaz TV broadcasts short programs in Iron and, since recently,
in Digor. Between the mid-1950s and the early 1990s, publishing in Digor was
strongly discouraged.
The modern Ossetic orthography is Cyrillic based, with only one additional
letter Æ/æ for the sound ɐ; the rest of specifically Ossetic sounds are expressed
by digraphs. Ways of transcribing/transliterating Ossetic vary widely across the
literature.
864
David Erschler
Figure 1: Digor and Iron Ossetic locations mentioned in this chapter

Ossetic 865
2. Phonology
2.1. Segmental phonology

Ossetic phonology is described in Testen (1997). A recent instrumental study of
Iron phonetics is Dzakhova (2009). Typically for the area, Ossetic has a series of
ejective consonants p’, t’, c’, č’ and k’, which mostly appear in loans or onomat-
opoeic words, but also developed in some words of Indo-European origin, e. g.
št’alə/st’alu ‘star’, ləšk’/lisk’ɐ ‘nit’. This series of consonants is obviously a bor-
rowing from autochthonous languages of the Caucasus.
Vowel inventory of Digor Ossetic
i u
e o
ɐ
a
Vowel inventory of Iron Ossetic
i u
ə
e o
ɐ
a
There is no phonemic contrast in length. However, /ɐ/, /i/, /u/, and /ə/ are acousti-
cally shorter than the rest of the vowels, Sokolova (1953).

labial coronal velar uvular glottal
bilabial labiodental dental alveolar postalveolar
stop b d(=d̪ ) g, gʷ q qʷ ʔ
p(=pʰ) t(=t̪ ʰ) k(=kʰ), kʷ
p’ t’ k’ k’ʷ
fricative fv sz ž (=ʒ) š (=ʃ) ʁ ʁʷ h
χ χʷ
affricate dz dʒ č(=ʧ)
c(=ts) č’(=ʧ’)
c’(=ts’)
nasal m n
lateral l
trill r
glide w j
866 David Erschler
In this chapter, I will not be using the IPA symbols in parentheses in Table 1.
In standard Iron, /c/ only survives in certain clusters. Glottal consonants are
marginal; the glottal stop /ʔ/ is automatically epenthesized before word-initial /a/,
/ɐ/, and /ə/, and sometimes between vowels. The glottal fricative /h/ is present in
very few words: ho ‘yes’, henər/henur ‘now’, maha ‘I don’t know’, haj-haj ‘cer-
tainly, of course’ and some others.
All consonants are phonetically palatalized before front vowels, in Digor the
palatalization leads to changes in the primary articulation for the sibilants and
affricates: /s/ becomes /š/, /z/ becomes /ž/, and /c/ becomes /č/. If a front vowel
follows a cluster, then all the consonants in the cluster are palatalized: D. kust
‘work’ > kušt-itɐ work-pl . This happens also when an enclitic with a front vowel is
attached to a word: k’os=mi > k’oš=mi cup=abl .1 sg . In Iron, the velar stops, /k/
and /g/, become /č/ and /ʤ/, respectively, before the oblique case marker -ə.
2.2. Phonological processes

Consonant assimilation is only observed when the allative marker -mɐ is attached
to an n-final stem, in which case the /n/ becomes /m/. Ossetic disallows hiatus, and
a variety of epenthetic consonants (usually glides or the glottal stop) are used to
break it up. Word finally, voiced consonants undergo devoicing. Vowel reduction
as such has not been attested, but, in Iron, /ə/ in some positions can be inaudible
in fluent speech.
2.3. Phonotactics
Minimal lexical words have CVC structure. The 2 sg pronoun də/du and some
wh-words are CV (e. g. D. ka ‘who.nom ’; či ‘what.nom / acc ’). Wh-words can
function as non-clitics in fragment questions, but normally they are procliticized
to a verb. The rest of CV and VC entities are either clitics or bound morphemes.
There is no reliable evidence that the notion of syllable as such is relevant
for Ossetic grammar. Bailey (1950) claimed that syllable weight is relevant for
Digor stress, but his stress data appear to be wrong. Initial clusters contain no
more than two consonants št’alə/st’alu ‘star’. In some Iron dialects, initial clusters
are resolved by the prosthetic ə: əšt’alə. In Digor, the prosthetic vowel is ɐ. Final
clusters3 of the form nC, lC, and rC, where C is a non-labial stop, are relatively
common. Other types of cluster are very rare except for a large number of clusters
with the final d/t that appear in past participles. Historically, the -d/-t was the past
participle exponent. In these participles, any consonant except the stops and the
affricates may precede the d/t. Three-consonant clusters are attested only in such
participles; the penultimate consonant then is always s: Digor ɐχst ‘shot’.
3
Due to the limitations of space, I only describe Digor clusters here.
Ossetic 867
2.4. Stress and prosody

Both Digor and Iron lack lexical or grammatical tone. The acoustic correlates of
stress are unknown. Dzaxova (2010) states that in Iron, the acoustic correlate is
a higher pitch on the respective syllable, but this does not hold for all words of
her sample (and the words were recorded without a carrier sentence). For Digor,
no conjectures have been advanced so far and a pilot study carried out by the
author has failed to detect lexical stress. There are no modern studies of prosody
in Ossetic, although some observations can be found in Abaev (1949).
3. Morphology
For an overview of word formation in Ossetic, see Erschler (2016) and the refer-
ences therein. Open word classes are verbs and nouns.

Adjectives and adverbs are morphologically indistinguishable from nouns. Other
classes of nominals include numerals, full pronouns, enclitic pronouns, reflexives
(also used as intensifiers), reciprocals, wh-words, various types of indefinites, and
quantifiers.
Nouns are inflected for number and case; the markers are suffixed to the noun,
and, noun phrases being strictly head-final, to the right edge of noun phrases. There
is no grammatical gender, even in pronouns. Animacy is not marked overtly, but
plays a role in differential object marking. Possessedness is marked by a prefix/
proclitic, which attaches to the left edge of a noun phrase and distinguishes person
and number of the possessor.
(1) jɐ= asə štər wurš bɐχ-t-ɐn (I)
poss .3 sg = this big white horse-pl - dat
‘for these big white horses of his/hers’
The plural marker attaches immediately to the right of the stem. It may trigger a
vowel alternation, which is not completely predictable, and, furthermore, if the
noun ends in a sonorant, the -t- of the plural marker may be geminated: (D) bon
‘day’ vs. bɐn-ttɐ ‘days4’. If a word ends in a cluster, the allomorph of the plural is
-ətɐ/-itɐ, with a possible vowel alternation in the stem: ɐχšt/ɐχst ‘shot’ → ɐχšt-ətɐ/
ɐχšt-itɐ. Plural marking is possible for wh-phrases and some indefinites as well:
D. či-tɐ what-pl ; ka-dɐr-tɐ who-idf - pl lit. ‘someones’. Semantic conditions under
which it arises are unclear.
4
The regular form bon-tɐ exists as well.
868 David Erschler
Personal names may carry the associative plural marker -i-/-e-: Ir. Aslan-i-tɐ
Aslan-ass - pl ‘Aslan and others with him’.
Case is usually marked on the right edge of the word form. The only exceptions
are certain indefinites, in which the case marker either precedes the indefinite
suffix (e. g. the Digor superessive form of či-dɐr what-idf ‘something’ is cɐbɐl-dɐr
what.sup - idf ) or, for plural marked indefinites, occurs several times in the word
form (e. g. the Digor ablative form of ka-dɐr-tɐ who-idf - pl ‘someone-pl ’ is kɐmɐj-
dɐr-t-i / kɐmɐj-dɐr-t-ɐj who.abl - idf - pl - obl / abl .) For nouns, numerals, and quan-
tifiers the case marking is nearly agglutinative, only minor morphonological alter-
nations occur. Pronominal stems show suppletion, see paradigms below. In Digor,
numerals require dedicated numeral declension suffixes inserted between the stem
and the regular case marker, for example duw-e-j ‘two-num - obl ’. Remarkably,
numeral phrases join the numeral class: bɐχ-i horse-obl , but duwɐ bɐχ-e-j two
horse-num - obl .
Case markers differ from postpositions in the following respects: a) case
markers attach to bare noun stems, whereas postpositions require case marking on
nouns (usually the oblique); b) case-marked nouns, unlike postposition comple-
ments, may bind clause-mate reflexives and reciprocals, and control depictives,
Erschler (2012b).
Ossetic does not allow Suffixaufnahme, i. e. stacking of several case suffixes
on a single noun phrase: *NP-case 1- case 2. Case affixes may be suspended, i. e.
one case affix may mark two conjoined noun phrases, Erschler (2012b).
(2) kʷəz ɐmɐ gɐdə-mɐ (I)
dog and cat-all
‘to a cat and a dog’
When a pronoun is coordinated with a noun, suspended affixation works slightly
differently: the first conjunct receives the oblique case, while the second takes the
case relevant to the entire phrase (see paradigms below), as in (3). When two pro-
nouns are coordinated, suspended affixation is ungrammatical.
(3) dɐw/*?də ɐmɐ aslan-ɐn (I)
you.obl /you.nom and Aslan-dat
‘to you and to Aslan’
The Ossetic case system consists of the nominative; accusative/genitive/ines-
sive; dative; ablative; allative; superessive; equative. Iron also has the comitative.
The accusative, genitive and inessive coincide for lexical nouns. The accusative
and the genitive only differ for clitics: (second position) enclitics correspond
to direct objects, whereas proclitics to noun phrases correspond to possessors
(Erschler 2009). The inessive differs from the genitive and accusative for enclitics,
and, in Digor, for numerals and numeral phrases. In this chapter, I gloss the syn-
cretic marker of the accusative, genitive, and inessive as the oblique.
Ossetic 869
Table 2: Noun inflection (Digor)

Consonant-final ɐ-final Other vowel-final
Plurals Singular
Nom bɐχ ‘horse’ bɐχ-tɐ ‘horse’ kizgɐ ‘girl’ Agube ‘male name’
Obl bɐχ-i bɐχ-t-i kižg-i Agube-j
Dat bɐχ-ɐn bɐχ-t-ɐn kizg-ɐn Agube-j-ɐn
Abl bɐχ-ɐj bɐχ-t-ɐj kizg-ɐj Agube-j-ɐj
All bɐχ-mɐ bɐχ-tɐ-mɐ kizgɐ-mɐ Agube-mɐ
Sup bɐχ-bɐl bɐχ-tɐ-bɐl kizgɐ-bɐl Agube-bɐl
Equ bɐχ-aw bɐχ-t-aw kizg-aw Agube-j-aw
Table 3: Inflection of personal pronouns (Digor)

1 2
Sg Pl Sg Pl
Nom ɐz maχ du sumaχ
Obl mɐn maχ dɐw sumaχ
Dat mɐn-ɐn maχ-ɐn dɐw-ɐn sumaχ-ɐn
Abl mɐn-ɐj maχ-ɐj dɐw-ɐj sumaχ-ɐj
Iness –5 – – –
All mɐnmɐ maχ-mɐ dɐw-mɐ sumaχ-mɐ
Sup mɐn-bɐl maχ-bɐl dɐw-bɐl sumaχ-bɐl
Equ mɐn-aw maχ-aw dɐw-aw sumaχ-aw
Table 4: Inflection of deictics.6 (Digor)

Nom a/ajɐ a-tɐ je je-tɐ
Obl aj an-i woj won-i
Dat a-m-ɐn an-ɐn wo-m-ɐn won-ɐn
Abl a-m-ɐj an-ɐj wo-m-ɐj won-ɐj
Iness a-m-i an-ɐm-i wo-m-i won-ɐ-m-i
All a-mɐ an-ɐmɐ wo-mɐ won-ɐ-mɐ
Sup a-bɐl an-ɐbɐl wo-bɐl won-ɐ-bɐl
Equ a-j-aw ani-j-aw wo-j-aw woni-j-aw
5
For the 1st and the 2nd persons, localization “in” can only be expressed with a postpo-
sition.
6
The distal deictics je/jetɐ also serve as the 3rd person pronouns.
870 David Erschler
Table 5: Inflection of enclitics (Digor)

1 2 3
Sg Pl Sg Pl Sg Pl
Nom – – – – – –
Acc mɐ nɐ dɐ wɐ ɐj sɐ
Gen – – – – – –
Dat min nin din win in šin
Abl mi ni di wi ši ši
Iness – – – – ši ši
All mɐmɐ nɐmɐ dɐmɐ wɐmɐ imɐ sɐmɐ
Sup mɐbɐl nɐbɐl dɐbɐl wɐbɐl ibɐl sɐbɐl
Equ – – – – – –
Table 6: Inflection of numerals and numeral phrases (Digor7)

Nom duw-ɐ ‘2’ duwɐ bon-i
insɐj ‘20’ two day-obl
Obl duw-e-j duwɐ bon-e-j
insɐj-e-j two day-num - obl
Abl duw-em-ɐj duwɐ bon-em-ɐj
insɐj-em-ɐj two day-num - abl
Dat duw-em-ɐn #duwɐ bon-em-ɐn8
insɐj-em-ɐn two day-num - dat
Iness duw-em-i #duwɐ bon-em-i
insɐjemi two day-num - ines
All duw-e-mɐ duwɐ bon-e-mɐ
insɐj-em-ɐ two day-num - all
Sup duw-e-bɐl duwɐ bon-e-bɐl
insɐj-e-bɐl two day-num - sup
Equ #duw-e-j-aw #duwɐ bon-e-jaw
#insɐj-aw two day-num - equ
7
In Iron, numerals are declined identically to nouns. In numeral phrases in the nomi-
native, the oblique marking surfaces on the nouns: ɐrtɐ qaž-ə three goose-obl ‘three
geese’. In other cases, the oblique is replaced by the respective case marker: ɐrtɐ qaž-ɐn
three goose-dat .
8
The # sign marks morphologically possible but pragmatically improbable forms.
Ossetic 871

Ossetic exhibits a robust difference between finite and non-finite forms: only finite
forms agree with their subjects in person and number, and show tense and mood
distinction. The aspect, on the other hand, is defined for non-finite forms as well.
For an overview of non-finite forms, see Erschler (2015).
Compared to other languages of the area, Ossetic only shows a rather small
array of finite forms: the present, past and future of the indicative, the impera-
tive, and the three tenses of the subjunctive. Admittedly, for the non-indicative
forms, the terms past/present/future subjunctive are convenient labels rather than
terms meaningfully reflecting the semantics of these forms. The past subjunctive,
besides its use in conditional and counterfactual clauses, can express estimate or
habituality in the past. The latter is also the most common use of the present sub-
junctive in Digor. Unusually for the area, Ossetic lacks grammatical evidentiality.
Except for a handful of roots, preverbless verbs are imperfective, and the per-
fective aspect is marked by preverbs. In Iron, perfective verbs can be imperfectiv-
ized or endowed with the meaning ‘had almost done x’ by inserting the conative
affix -sɐj- between the preverb and the root. In Digor written texts, analogous forms
with -cɐj- are sometimes encountered, but they have been rejected as Iron calques
by the majority ‒ in fact, all but one ‒ of the native speakers I have worked with.
(4) a. južaman mɐ=sigaret-t-ə k’opp fɐ-sɐj-fɐwud
once my=cigarette- pl - obl pack prv - con -finish
kodta (I)
do.pst .3 sg
‘Once, I was running out of cigarettes.’ (Lit. ‘Once, my pack of
cigarettes was ending.’)
(Dzasoxty 2007)
b. o χɐdɐgaj kʷəd=mɐ fɐ-sɐj-roχ
yes by_the_way how=abl .1 sg prv - con -forgetting
kodta (I)
do.pst .3 sg
‘Yes, by the way, I had almost forgotten.’
(Dzasoxty 2007)
Besides these basic forms, there exists a plethora of analytic modal constructions,
see some examples in Belyaev and Vydrin (2011) and in Erschler (2014). A peri-
phrastic form (the converb plus a finite form of the verb ‘to do’, which then carries
all TAM marking) is used when the lexical verb has to be focused or topicalized:
(5) win-gɐ=ba=ɐj ma fɐ-kkɐn-ɐd (D)
see-cvb = ctr = acc .3 sg neg . imp prv -do-imp .3 sg
‘Let him/her never see it.’
(Gurdzibety 2006)
872 David Erschler
A finite verb only agrees with the nominative argument irrespective of the seman-
tic role of the latter. The agreement is in person and number; collective nouns may
trigger plural agreement, whereas plural noun phrases may agree in the singular.
Details of the phenomenon have not been systematically studied.
Only about 300 verbs are simplex (i. e. attach the agreement markers directly
to the stem), the rest are a combination of a nominal part with a light verb, usually
‘to do’ or ‘to be’: Ir. aχʷər kɐn- learning do ‘to study’. Simplex verbs have two
stems, the past and the present, the relationship between which is irregular (e. g.
Ir. sɐw-/səd- ‘go’ vs. lɐw-/lɐwəd- ‘stand’), and they both have to be listed in the
lexicon.
For simplex finite verbs, the maximal structure is preverb-conative suffix-
stem-tense.mood.agreement, of which the preverb is optional, and the conative
suffix may be inserted only in the presence of a preverb. For complex verbs, pre-
verbs, and, possibly, the conative suffix attach to the nominal part, whereas tense-
mood-agreement suffixes are carried by the light verb.
If a verb occurs sentence-initially, then, in Iron, the preverb with the conative
suffix -sɐj- can be separated from the root by a pronominal clitic, as in (6a). In
Digor, this is possible for bare preverbs, (6b). In (6b), ra-jevʁud un ‘to pass (about
time)’ is a complex verb, whose nominal part carries the prefix ra-.
(6) a. fɐ-sɐj=šɐm χɐccɐ kod-ton (I)
prv - con = all .3 pl near do.pst - pst .1 sg
‘I was approaching them.’
(Bedzhyzaty 1995: 55)
b. ra=mɐbɐl jevʁud ɐj fɐzzɐg (D)
prv = sup .1 sg passed be.prs .3 sg autumn
‘I spent the fall.’ (Lit. ‘The fall passed on me.’)
(Aghuzarti 2008)
Verbs fall into two conjugation classes, the difference between which only shows in
past indicative forms: Ir. kod-toj do.pst .3 pl vs. səd-əštə go.pst .3 pl . They are often
called transitive and intransitive in the literature, presumably because the majority
of verbs in each class are transitive and intransitive respectively. However, class
assignment is lexically determined, and there are exceptions. For example, in Iron,
the verbs žar- ‘to sing’ and kɐš- ‘to read’ are morphologically intransitive: žarəd-
əštə sing.pst - pst .3 pl ‘they sang’, kašt-əštə read.pst - pst .3 pl ‘they read’, whereas
rɐj- ‘to bark’ and fɐlzɐʁd- ‘to snow heavily, especially during a blizzard’ are transi-
tive: rɐjd-toj bark.pst - pst .3 pl ‘they barked’; mit fɐlzɐʁ-ta snow snow.pst - pst .3 sg
‘It snowed heavily.’
In Iron, iterativity/habituality in the past is usually marked by the second posi-
tion clitic -ju (in some environments pronounced -iw), whereas Digor mostly uses
the past and present subjunctive forms for this. The habituality in the present is
expressed by preverbs, (7). In the following example, the preverbs and the present
Ossetic 873
indicative morphology are highlighted in boldface. Concerning the relative clause

structure, see §5.4 below.
(7) [nur=dɐr=ma ješke ʣubandi-tɐ ke zɐrdɐ-mɐ
now=emp =still somebody.obl word-pl who. obl heart- all
nɐ=fɐ-ccɐ-ncɐ] [je wotɐ ba-kɐn-uj] (…) (D)
neg = prv -go-prs .3 pl he.nom so prv -do-prs .3 sg
‘Even now, (whenever) somebody’s words would not please one, s/he
would say so (…).’
(Sabajti 2010)
The reflexive, reciprocal, and applicative are not marked on the verb: the reflexive
and reciprocal use dedicated pronouns, whereas the argument introduced by the
applicative is marked by the dative. Unlike in Slavic or Northwestern Caucasian,
Ossetic preverbs do not introduce arguments. The causative is expressed analyti-
cally, as the infinitive of the lexical verb plus the auxiliary ‘to do’: Ir. ba-mbar-ən
kodta prv -understand-inf do. pst .3 sg ‘s/he explained’. The passive is fairly rare;
it is expressed as the past participle or the converb plus the appropriate finite form
of the copula or of the verb ‘to go’.
(8) a. sovetton χecawadɐ terk-i bɐstɐ-bɐl sast
Soviet authority Terek-obl region-sup break. prt . pst
ku (ɐ)r-cud-ɐj (D)
when prv -go.pst - pst .3 sg
‘When the Soviets had been defeated (lit. broken) across Terek
region…’9
b. sk’ola zumɐg-i waʣ-gɐ ɐr-cud-ɐj (D)
school winter-obl leave-cvb prv -go.pst - pst .3 sg
‘The school had to be abandoned in winter.’
Below, Digor verb paradigms are illustrated.
The Paradigm of the Copula
Table 7: Indicative
Present Past Future
1 dɐn an adtɐn adtan woʣan(ɐn) woʤ(in)an
2 dɐ ajtɐ adtɐ adtajtɐ woʣɐnɐ woʤ(in)ajtɐ
3 ɐj / jes10 ɐncɐ adtɐj adtɐncɐ woʣɐ(nɐ)j woʣ(ɐn)ɐncɐ
9
(http://allingvo.ru/DIGOR/bagharati_sozur.htm, accessed 08 July 2017).
10
In prs .3 sg , the final -s may be dropped, and the existential copula is often pronounced
je.
874 David Erschler
Table 8: Imperative
sg pl
2 wo wotɐ
3 wɐd wɐntɐ
Table 9: Subjunctive
Present Past Future
sg pl sg pl sg pl
1 winɐ (wajinɐ)11 wijanɐ (wajanɐ) adtajnɐ adtajanɐ won wɐn

2 wisɐ (wajsɐ) wiajtɐ (wajajtɐ) adtajsɐ adtajajtɐ waj wajtɐ
3 widɐ (wajdɐ) wiwoncɐ (wajwoncɐ) adtajdɐ adtajiwoncɐ wa woncɐ
Paradigms of simplex verbs, Digor. (Unless indicated otherwise, illustrated for ‘to
do’)
Table 10: Indicative

Present, prs stem Future, prs stem
both conjugations both conjugations
1 kɐn-un kɐn-ɐn kɐn-ʣ(ɐn)ɐn12 kɐn-ʤ(in)an
2 kɐn-i(s)13 kɐn-(e)14 tɐ kɐn-ʣ(ɐn)ɐ kɐn-ʤ(in)ajtɐ
3 kɐn-uj kɐn-uncɐ kɐn-ʣ(ɐn)ɐj kɐn-ʣ(ɐn)ɐncɐ
Past, pst stem

“Transitive” conjugation, ‘to do’ “Intransitive” conjugation, ‘to look’
kod-ton kod-tan kas-tɐn kast-an
kod-taj kod-tajtɐ kas-tɐ kast-ajtɐ
kod-ta kod-toncɐ kas-tɐj kast-ɐncɐ
11
wi- and wa- forms of the present subjunctive appear to be completely equivalent, but
wi- forms are much more frequent. Some younger speakers do not recognize the variant
with wa-.
12
The “long” and “short” forms of the future suffix are completely equivalent, in careful
speech and writing it is mostly the longer form that is used.
13
The final -s of prs .2 sg is usually dropped in colloquial speech.
14
-e- is optional here only for this verb.
Ossetic 875
Table 11: Imperative (present stem)

Sg Pl
2 kɐn-ɐ kɐn-(e)tɐ
3 kɐn-ɐd kɐn-ɐntɐ
With some verbs, the imperative is irregular:

cɐw-un ‘go’ imp .2 sg co/*cɐwɐ; imp .2 pl co-tɐ/cɐwetɐ;
kɐw-un ‘to weep’ imp .2 sg ko/kɐwɐ imp .2 pl ko-tɐ /kɐwɐ-tɐ The respective 3rd
person forms are regular.
Table 12: Subjunctive (same for both conjugations)

Present (prs stem) Past (pst stem) Future (prs stem)
1 kɐn-inɐ kɐn-ijanɐ kod-tajnɐ kod-tajanɐ kɐn-on kɐn-an
2 kɐn-isɐ kɐn-ijajtɐ kod-tajsɐ kod-tajajtɐ kɐn-aj kɐn-ajtɐ
3 kɐn-idɐ kɐn-iwoncɐ kod-tajdɐ kod-tajiwoncɐ kɐn-a kɐn-oncɐ
4. Basic clause structure
4.1. Noun phrase and adpositions

Noun phrases are strictly head final and unsplittable, even by clitics. There is no
overt agreement within noun phrases.
(9) mɐ=wəsə dəwwɐ χorž ləmɐn-ɐn (I)
poss .1 sg =that two good friend- dat
‘for these two good friends of mine’
Numerals precede noun phrases they quantify, which stand in the singular.
There is only one preposition, ɐnɐ ‘without’. The majority of postpositions are
transparently related to lexical nouns, e. g. šɐr/sɐr ‘head (n.); on (postp.)’. With
verbs of movement, postpositions necessarily carry plural marking:
(10) χed-i sɐr-*(t)-i ba-wad-tɐn (D)
bridge-obl on-pl - obl prv -pass-pst .1 sg
‘I passed over a bridge.’
Postpositions can be case-marked and assign case, most commonly the oblique, to
their complement noun phrases. For oblique-assigning postpositions,15 the com-
plement can be realized as a possessive proclitic: Ir. mɐ=šɐr-mɐ poss .1 sg =on-all
15
Except təχχɐj/tuχχɐj ‘because of, about’, which only allows non-clitic complements.
876 David Erschler
‘above me’. For postpositions assigning other cases, their complements cannot be
replaced by clitics.
4.2. Verb phrase

Verb phrase, as a surface constituent, does not exist (although there might be argu-
ments for positing it on a more abstract level).
4.3. Constituent order

Non-finite clauses are rigidly verb final; the word order in finite clauses is in
principle free, but SOV largely prevails, modulo the usual disclaimer that clauses
where all arguments are expressed by full noun phrases are relatively infrequent.
Ossetic shows pro-drop, whereas discourse-given non-subject noun phrases are
usually expressed by clitics.
4.4. Argument marking

By default, Ossetic is strictly nominative-accusative. There are no instances of
TAM-based alignment splits. Very few verbs have different argument marking
patterns. These verbs either agree with the stimulus or show the default 3sg agree-
ment. They include: fɐnd- ‘to want’; qɐw-/ʁɐw- ‘to need’, and, in Iron, wərn- ‘to
believe’, which mark their experiencers with the oblique. The verb ɐntɐš-/ɐntɐs-
‘to succeed’ marks the experiencer with the dative. In Digor, the verb ɐnʁez-, lit.
‘to leaven’ is used with the default 3 sg morphology and a dative “subject” in the
meaning “to be permitted to X”. Furthermore, in one of the diatheses, the complex
verb prv -roχ / prv -ironχ cop ‘to forget’ marks the experiencer with the ablative:
(11) fe-ronχ=mi ɐj (D)
prv -forgetting=abl .1 sg be.prs .3 sg
‘I have forgotten it.’
Ossetic shows differential object marking. Referential animate direct objects are
obligatorily marked with the oblique, (12a–b), while inanimate direct objects nor-
mally remain unmarked, (12e) but the details are as yet poorly understood (Thor-
darson 2009: 135–137). As the sentences in (12c–d) show, if the animate object is
non-specific or non-referential, the object stays in the nominative, whereas when
both the subject and the object are inanimate, the latter gets the oblique marking.
Ossetic 877
(12) a. Animate definite direct object

lɐppu-jə ɐddɐ-mɐ ra-šajd-toj (I)
boy-obl outside-all prv -deceive.pst - pst .3 sg
‘They teased the boy outside (by deceit).’
(Qajtyqty 1998)
b. Animate referential direct object
k’ʷəri raždɐr wəm bɐrɐžʤ-ə gal-ə birɐʁ-tɐ
week earlier there Barazg-obl bull-obl wolf-pl
ɐrʁɐvš-toj (I)
slaughter.pst - pst .3 pl
‘A week ago, wolves killed Barazg’s bull there.’
(Qajtyqty 1998)
c. Animate non-referential direct object
kušɐrttag-ɐn=iw š-fəχ-toj gal (I)
slaughtered-dat = hab prv -cook.pst - pst .3 sg bull
‘Usually, they cooked a bull (as the animal for a feast).’
(Qajtyqty 1998)
d. Inanimate subject, inanimate object
wad a-šaš-ta gɐdə bɐlaš-*(ə)(I)
storm prv -break-pst .3 sg cat tree-obl
‘The/a storm broke the/a poplar.’
(Bedzhyzaty 1995)
e. Animate subject, inanimate direct object
lɐppu a-šaš-ta gɐdə bɐlaš-(?ə) (I)
boy prv -break-pst .3 sg cat tree-obl
‘The/a boy broke the/a poplar.’
4.5. Clitics
Enclitic pronouns (together with some other enclitics) form a cluster that obligato-
rily occupies (an appropriately defined) second position in Iron.
(13) nɐ=ta=jən=ɐj radta (I)
neg =again=dat .3 sg = acc .3 sg give.pst .3 sg
‘S/he again didn’t give it to her/him.’
In Digor, the placement of a clitic cluster is somewhat less restricted, although the
acceptability of a sentence decreases when the cluster is shifted rightwards. Clitics
in a cluster are subject to certain restrictions on case-person combinations. The
cluster is normally rigidly ordered, but some speakers allow reordering to improve
certain combinations of case and person. Modern Ossetic does not allow clitic
doubling, unless the doubled noun phrase is on the right edge of the clause. Clitics
may be hosted by non-finite clauses as well, (14).
878 David Erschler
(14) [ɐnqɐlmɐ=dɐm kɐš-gɐ-jɐ] ba-fənɐj dɐn (I)

waiting=all .2 sg look-cvb -abl prv -sleep be.prs .1 sg
‘I fell asleep while waiting for you.’
4.6. Questions
The word order in yes-no and alternative questions does not differ from that in
declarative clauses. In wh-questions, all wh-phrases are obligatorily preverbal
with only second position clitics, negative indefinites, negation markers, and
certain adverbials being able to intervene between the wh-complex and the verb,
see details in Erschler (2012a).
4.7. Negation
Sentential negation is marked by mood-dependent proclitics to the verb. Nega-
tive indefinites are all placed in the immediately preverbal position. They cannot
co-occur with sentential negation markers, see details in Erschler and Volk (2011).
Unlike in other Ossetic varieties, the South Ossetian Kudar dialect, obviously
under recent influences from Georgian, optionally allows for negative indefinites
to co-occur with sentential negation:
(15) nišə (næ) fed-ton16 (Kudar Iron)
nothing neg see.pst - pst .1 sg
‘I saw nothing.’
4.8. Binding
Reflexives and reciprocals can in principle be bound by any clause-mate NP,
although the subject is the preferred binder. Pronominal enclitics are free in the
minimal clause. Possessive proclitics are free from binding restrictions. Full pro-
nouns, including full possessive pronouns, are by default free in the sentence,
(16a). However, if the full pronoun is focused or topicalized, it can have an ante-
cedent within the same sentence, (16b).
(16) a. soslan-mɐi wotɐ kɐs-uj cuma jej/?i(D)
Soslan-all so look-prs .3 sg comp (s)he
mɐ=zɐrdɐ-mɐ cɐw-uj
poss .1 sg =heart-all go-prs .3 sg
‘Soslani thinks that I like himj/?i’
16
The transcription represents the Kudar pronunciation.
Ossetic 879
b. mɐdinɐ ɐrmɐst woji/?j ke warz-uj wotɐ

Madina only s/he.obl comp love- prs .3 sg so
soslan ʁudi kɐn-uj (D)
Soslan thought do-prs .3 sg
‘Soslani thinks that Madina only loves himi/?j’
5. Clause linkage
5.1. Coordination
The conjunctions ɐmɐ/ɐma ‘and’ and fɐlɐ/fal ‘but’ are clause-initial, second posi-
tion clitics then attach to conjunctions, like the Iron jɐm all .3 sg in (17b):
(17) a. nər ɐndɐr ran ba-lχɐd-ta bigʷəz χɐzar
now other place prv -buy.pst - pst .3 sg Bigwyz house
ɐmɐ adon wɐj kɐn-ə (I)
and these_ones.nom sale do-prs .3 sg
‘Now, Bigwyz has bought a house in a different place and is selling
these ones.’
(K’æbysty 1977)
b. mɐn=dɐr jɐ=χi-mɐ ragɐj ždɐχta
I.obl =too poss .3 sg = refl - all since.long attract.pst .3 sg
fantastikɐ fɐlɐ=jɐm mɐ=nəfš nɐ
sci-fi but=all .3 sg poss .1 sg =certainty neg
χašton (I)
carry.pst .1 sg
‘The (genre) of science fiction has long attracted me too, but I did
not dare to approach it.’
(K’æbysty 1977)
Noun phrases (but not clauses) can also be coordinated by encliticizing dɐr to
each one of them, (18a). The usual conjunction ‘and’ can also be used in the pres-
ence of dɐr, (18b).
(18) a. χʷarz-ɐj=dɐr cud-ɐj=dɐr či fɐ-wwid-ton (D)
good-abl =and bad-abl =and what prv -see.pst - pst .1 sg
‘What I have seen of good and of bad’
(Aghuzarti 2008)
b. č’ifɐ=dɐr ɐma wazal=dɐr (D)
dampness=and and cold=and
‘dampness and cold’
(Aghuzarti 2008)
880 David Erschler
There are two conjunctions for ‘or’: kɐnɐ/kenɐ is used when the two conjuncts are
not mutually exclusive, and ɐvi in all other situations. They are placed clause-ini-
tially (or constituent-initially, if constituents smaller than clauses are coordinated):
(19) a. ɐz kiwunugɐ fins-un nɐ=ʁav-un kenɐ ɐndɐr
I.nom book write-inf neg =intend-inf or other
ješti woj χuzɐn (D)
something it.obl similar
‘I am not going to write a book or anything like that.’
(Aghuzarti 2008)
b. də=mɐm ɐsɐg zur-əš ɐvi=mɐ mɐ=quš-tɐ
you=all .2 sg real talk-prs .2 sg or=acc .1 sg poss .1 sg =ear-pl
šaj-ənc? (I)
deceive-prs .3 pl
‘Are you really speaking to me or do my ears deceive me?’
(K’æbysty 1977)
5.2. Subordination
All subordinate clauses allow or even require a proleptic noun phrase in the main
clause. The conditions under which proleptics are obligatory are not yet well
understood. In (20), proleptics are marked with boldface.
(20) a. [sɐmɐj žonənad ba-nk’ar-aj] wəj təχχɐj
in_order_that science prv -feel-sub . fut .2 sg it because
ra-jdaj iwwəl ɐnson-dɐr-ɐj (I)
prv -begin.imp .2 sg it.sup easy-comp - abl
‘In order to internalize the knowledge, you should start from the
easiest.’
(K’æbysty 1977)
b. ɐž nisə ažəmʤən dɐn [zur-ən kɐj
I nothing guilty be.prs .1 sg talk-inf comp
žon-ən ɐmɐ matematikɐ aftɐ χorž kɐj š-aχʷər
know-prs .1 sg and mathematics so good comp prv -learn
kod-ton] wəm (I)
do.pst - pst .1 sg it.ines
‘I am not at all guilty that I can speak and have learned mathematics so
well.’
(K’æbysty 1977)
Ossetic 881
Complex clauses do not show any consequence of tense effects, whereby the tense
of the verb in the main clause would impose restrictions on the tense marking in
the dependent clause. In (21), the past tense of the main verb does not block the
future marking in the dependent clause.
(21) šošlan-mɐ aftɐ kɐš-ə kašt-iš ɐmɐ mɐdinɐ
Soslan-all so look-prs .3 sg look.pst - pst .3 sg and Madina
kɐj ɐrba-sɐwu-zɐniš (I)
comp prv -go-fut .3 sg
‘Soslan thinks/thought that Madina will/would come.’
5.3. Complementation
Complementizers are either strictly preverbal with same qualifications as for
wh-phrases as described in §4.6, (22a), or float between the left edge of the clause
and the verb, Erschler (2012a), (22b). The complement clause usually follows
the matrix verb, but may precede it as well. A finite dependent clause is typically
accompanied by a proleptic in the main one.
(22) a. ragɐj χat-ə [artur ɐmɐ gertrud ɐnɐ
since.long notice-prs .3 sg Arthur and Gertrude without
kɐrɐzi kɐj nɐ=fɐ-raž-ənc] wəj (I)
each_other comp neg =prv -withstand-prs .3 pl it
‘Since long, she’s noticed that Arthur and Gertrude cannot live without
each other.’
(K’æbysty 1977)
b. fɐlɐ dɐ=žardɐ kɐd ɐndɐr-ɐn wa
but poss .2 sg =heart if other-dat be.sub . fut .3 sg
lɐvɐrd (I)
give.prt . pst
‘But if your heart would be given to someone other’
(K’æbysty 1977)
Some verbs allow a null complementizer, as illustrated in (23); wəj it.nom / obl
serves here as a proleptic:
(23) [cpa-izɐr=ma šeri-jə fen-zɐn] [MatrixSwəj ɐnqɐl
this-evening=more Seri-obl see-fut .3 sg it.nom / obl hope
nal wədi] (I)
no_more be.pst .3 sg
‘She already did not think that she would see Seri tonight.’
(Bic’oty 2003)
882 David Erschler
The coordinating conjunction ɐmɐ /ɐma ‘and’ has grammaticalized into a subor-
dinator:
(24) a. bazel-ə qʷədə-tɐ aftɐ nə-ššuj-tɐ štə
Bazel-obl thought-pl so prv -gut-pl be.prs .3 pl
ɐmɐ=zə rajdajɐn-kɐron bɐrɐg nal wəd
and=abl .3 beginning-end sign no_more be.pst .3 sg
‘Bazel’s thoughts got so entangled that it was not possible to find their
beginning or end.’
(K’æbysty 1977)
b. ɐnqɐl dɐn ɐmɐ mɐ=qəʤ-ə
thought be.prs .1 sg and poss .1 sg =grief-obl
nɐ=ba-sɐw-zənɐ
neg =prv -go-fut .2 sg
‘I think/hope that you won’t be angry with me?’
(K’æbysty 1977)
Non-finite verb forms may head subordinate clauses as well, for instance, purpose
clauses are headed by infinitives:
(25) doχtur-tɐ=min barɐ ra-vardt-oncɐ [kust-mɐ
doctor-pl = dat .1 sg right prv -give.pst - pst .3 pl work-all
cɐw-un]-mɐ (D)
go-inf - all
‘Doctors permitted me to go to work.’
(Aghuzarti 2008)
5.4. Relative clauses and multiple relativization

A noun phrase can be relativized from any syntactic position. Relative pronouns,
which are homophonous with wh-words, are immediately preverbal, with the same
qualifications as for wh-phrases. Relative clauses are formed using the correlative
strategy: the head noun ‒ the correlate ‒ follows the relative clause and must carry
the distal demonstrative. In (26), the relative pronouns and the correlates are high-
lighted in boldface:
(26) a. parč-ə sə čəžg-imɐ nəχaš kod-ta
park-obl what girl-com talk do.pst - pst .3 sg
wəj-imɐ jɐ=k’am š-išta
s/he-com poss .3 sg =picture prv -take-pst .3 sg
‘He had his picture taken with the girl that he spoke with in the park.’
Ossetic 883
b. čəžg-imɐ kɐm nəχaš kod-ta wəsə parč-ə

girl-com where talk do.pst - pst .3 sg that parc-obl
jɐ=k’am š-iš-ta
poss .3 sg =picture prv -take.pst - pst .3 sg
‘He had his picture taken in the park where he talked with the girl.’
c. wədon šɐ=institut-mɐ aχʷər-mɐ ra-jš-toj
they poss .3 pl =institute-all learning-all prv -take-pst .3pl
[siri-jə či sɐr-ə] wəsə kɐšgon lɐppu-t-ə
Syria-obl who live-prs .3 sg that Circassian boy- pl - obl
ɐmɐ čəžʤ-ət-ə k’ord-ə (I)
and girl-pl - obl group-obl
‘They accepted to study in their institute a group of Circassian
boys and girls who live in Syria.’17
Given that quantifiers are not very well compatible with demonstratives, to relativ-
ize a universally or existentially quantified noun phrase, one needs to use different
strategies, namely, free relatives or non-finite dependent clauses:
(27) a. [čidɐrittɐr χɐrɐg dar-ə] [alčidɐr=ɐj sɐv-gɐ
whoever donkey own-prs .3 sg everyone=acc .3 sg beat-cvb
kɐn-ə] (I)
do-prs .3 sg
‘Anyone who owns a donkey, beats it.’ (Lit. ‘Whoever owns a donkey,
everyone beats it.’)
b. [dɐrʁɐvš-mɐ sɐw-ɐg išči] gošt-mɐ akumulʲator
Darghavs-all go-nmz someone Gosht-all accumulator
a-laš-zɐn (I)
prv -carry-fut .3 sg
‘Someone who is going to Darghavs will take the accumulator to
Gosht.’
It is also possible here to place the quantified noun phrase sentence-initially and to
use it as a binder for the regular demonstrative:
(28) alə žɐχχkušag=dɐr [χɐrɐg či dar-ə] wəj=jɐ
every peasant=emp donkey who own-prs .3 sg he=acc .3 sg
sɐv-gɐ=dɐr kɐn-ə (I)
beat-cvb =emp do-prs .3 sg
‘Every peasant who owns a donkey beats it.’
Ossetic allows multiple relativization, relative pronouns are then all placed prever-
bally. Such clauses only allow for generic reading:
17
(http://www.iriston.com/nogbon/news.php?newsid=543, accessed 08 July 2017).
884 David Erschler
(29) a. kɐm=ju kɐjimɐ nəχaš kod-ta wəm

where=hab who.com talk do.pst - pst .3 sg there
alkɐjimɐ jɐ=k’am išta (I)
everyone.com poss .3 sg =picture take-pst .3 sg
‘Wherever s/he spoke with anyone, s/he had his picture taken with
them.’
b. *kɐm sə čəžg-imɐ nəχaš kod-ta wəsə parč-ə
where what girl-com talk do.pst - pst .3 sg that park-obl
wəsə čəžg-imɐ jɐ=k’am š-išta (I)
that girl-com poss .3 sg =picture prv -take-pst .3 sg
Intended: ‘When he spoke to a girl in a park, he had his
picture taken with her.’18
6. Language Contact Issues
The typological profile of Ossetic, as it has been described in this chapter, is fairly
different from those of other modern Iranian languages. It is natural to conjecture
that at least some of these properties arose due to language contact. It should be
kept in mind, however, that Ossetians and their ancestors have long been isolated
from the rest of Iranian speakers. The latter occupy a more or less contiguous
territory and have been significantly influenced by Persian language and culture.
Some of the differences could have emerged, therefore, just as independent devel-
opments.
One instance of Caucasian influence is fairly obvious: this is the most natural
way to explain the emergence of ejectives in Ossetic. While virtually ubiquitous
in the Caucasus, where their presence is essentially the only shared areal feature,
Tuite (1999), ejective consonants are fairly uncommon in the rest of Eurasia and
are completely absent from other Iranian languages.
The fact that Ossetic extensively borrowed vocabulary from the neighboring
languages is also well-documented, see Abaev (1958–1995), Bielmeier (1977),
and Thordarson (1999), to name a few.
Contact influences in the realm of morphosyntax are admittedly harder to argue
for. Given that data on earlier stages of Ossetic are virtually absent, the only way to
prove that a given grammatical feature is a result of borrowing or influence from a
specific source is to show that (a) the feature in question is present in Ossetic and
in the source language(s); (b) that the given feature is rare cross-linguistically, or,
at least in the relevant area ‒ i. e. northern Eurasia including the Caucasus.
One grammatical feature often claimed to be an outcome of usually unspeci-
18
It should be emphasized that respective single relatives are fully grammatical, as illus-
trated in (26).
Ossetic 885
fied Caucasian influences is the relatively large Ossetic case system, (Vogt 1945;
Abaev 1949: 99). Sometimes, specifically South Caucasian influences are postu-
lated, Johanson (2008: 500). It is certainly likely that the emergence of the case
system, abnormally large for a Modern Iranian language, was an outcome of some
influence(s). However, neither the size, nor the case inventory are particularly
unusual for northern Eurasia, Erschler (2009: 423), Thordarson (2009: 170). The
fact of Caucasian influences on this system is not implausible, but hardly demon-
strable with any degree of certainty. Moreover, one salient feature of many Cauca-
sian languages ‒ namely, the presence of ergative marking, is entirely absent from
Ossetic.
It seems more promising to conjecture that possessive proclitics, the structure
of wh-questions, and the morphosyntax of negative indefinites were influenced by
neighboring autochthonous languages. Possessive proclitics, unique for Iranian
languages and in general very rare in Eurasia, have been argued to be a result of
West Caucasian influences, Erschler (2009). Obligatory preverbal placement of
wh-phrases is likely to be a South Caucasian influence, Erschler (2012). Finally,
the morphology of negative indefinites and their incompatibility with sentential
negation is likely to be due to South Caucasian influences as well, Erschler (2010).
Interestingly, no instances of potential grammatical influences from Northeast
Caucasian languages have been discovered so far. However, there is one putative
influence in the opposite direction: emergence of phonemic /f/ in Ingush was attrib-
uted to Ossetic influence already by Uslar (1888: 6). Uslar himself, to support this
conjecture, only observes that Chechen lacks /f/ and states that “their (i. e. Ingush)
vernacular was formed under a strong influence of Ossetic language”. The sup-
porting argument is that the phonemic /f/ is absent from almost all the Northeast
Caucasian languages, and from the South Caucasian languages. This is the only
known instance of a plausible Ossetic influence on a neighboring language.
Another potential instance of contact induced development is a two-term
deictic system present in Ossetic, Svan, Mingrelian, and Karachay-Balkar ‒ a fact
noticed by Thordarson (2009). However, another Eastern Iranian language, Yaz-
ghulami, has developed a two-term system without obvious external influences.
Preverbs with spatial sematics are present in virtually all autochthonous lan-
guages of the Caucasus, but it is only in the modern SC languages and in Ossetic
that they (almost) always have a perfectivizing function. It is so in Georgian
(Boeder 2005: 33), as well as for most preverbs in Mingrelian, Harris (1991),
and in Svan, Tuite (1997: 30). Such systems of perfectivizing preverbs strikingly
resemble the Slavic one. Furthermore, Abaev (1965: 60–62) argued that it is the
ancestors of Ossetians came in contact with the Slavs and brought the innova-
tion to the Caucasus. However, Thordarson (1982: 252–253) convincingly argues
against this conjecture. See also the discussion in Tomelleri (2009).
Finally, Mingrelian has an imperfectivizing affix placed in a verb between the
prefixes and the root (Harris 1991), a phenomenon similar to the Ossetic conative
886 David Erschler
suffix -sɐj-/-cɐj-, but this parallel might well be due to chance: for one thing, direct
contacts between Ossetians and Mingrelians are not documented nor are they very
likely for geographical reasons.
For the last 150 years, Ossetic has been under strong Russian influence. Code-
switching is very frequent and virtually any Russian word can be occasionally
used in Ossetic speech.
A sample glossed Digor oral text. Recorded 20 July, 2008 in Sheker, Iraf district,
North Ossetia from Mimonat Gogaeva. Transcribed in the summer of 2009 with
help of Aza Sasieva.
(30) mɐn-ɐn ɐncon-dɐr woʣɐj digoron-aw ʣor-un
I.obl - dat easy-comp be.fut .3 sg Digor-equ speak-inf
‘It will be easier for me to speak in Digor
(31) ɐz lešken-i ra-jgurd-tɐn mijn farast
I Lesken-obl prv -born.pst - pst .1 sg 1000 9
sɐdɐ duwin ɐrtikkag anž-i
100 20 third year-obl
I was born in Lesken, in the year 1923. [A decimal numeral.]
(32) ɐma lešken-i adtɐj hewɐχɐn ɐʁdɐwu-ttɐ
and Lesken-obl be.pst .3 sg such custom-pl
And in Lesken there were the following customs. [Note the singular verb
with the plural of ‘custom’.]
(33) ficcagi-dɐr χestɐr-ɐn kadɐ lɐvard-toncɐ χestɐr ku
first-comp senior-dat respect give.pst - pst .3 pl senior when
fe-štidɐ wɐd ʁɐwamɐ kɐstɐr-tɐ
prv -stand_up. pst - sub . pst .3 sg then needs junior-pl
jewugur-ɐj=dɐr iš-istad-ajoncɐ
together-abl = emp prv -stand_up. pst - sub . pst .3 pl
First of all, they respected seniors, if an elder would stand up, then all the
juniors would have to stand up as well.
(34) χestɐr-ɐn ʁɐwamɐ ɐʁdaw lɐvard-tajoncɐ
elder-dat needs etiquette give.pst -sub . pst .3 pl
(The juniors) would have to behave courteously (according to the etiquette)
towards the seniors.
Ossetic 887
(35) kɐstɐr-ɐn ʣurd-i barɐ n=adtɐj calimmɐ χestɐr

junior-dat word-obl right neg =be.pst .3 sg while elder
ʣurd-i barɐ ravard-tajdɐ wɐd-mɐ
word-obl right give.pst -sub . pst .3 sg then- all
To a junior there would be no right to speak, until the elder gives him right
to speak.
(36) χestɐr ka=j woj kud fɐndɐ widɐ
senior who=be.prs .3 sg s/he.obl how wish be.sub . pst .3 pl
wotɐ ʁɐwamɐ kod-tajoncɐ kɐstɐr-tɐ
so needs do.pst - sub . pst .3 pl junior-pl
As the elder’s wish would be, so should act the juniors.
(37) woj fɐndon ɐnχɐst kod-tajoncɐ
s/he.obl will complete do.pst -sub . pst .3 pl
They would have to fulfill his/her wishes.
(38) berɐ χʷarž-ʤijnɐd-tɐ adtɐj ragi nur=sɐbɐl
many good-abstr -pl be.pst .3 sg earlier now=3 sg . sup
sɐ=k’aχ-t-ɐj fɐ-χatt-ɐncɐ
poss .3 pl =leg-pl - abl prv -tread.pst - pst .3 pl
Originally, there was much good, now they tread upon it with their feet.
(39) ma woni nɐ-bal imiš-ʤinan fal=sɐ imis-un
and they.obl neg -more remember-fut .1 pl but=acc .3 pl remember-inf
ʁɐw-uj nɐwɐg-ɐj=sɐ ɐr-lɐw-un ʁɐw-uj
need-prs .3 sg new-abl = acc .3 pl prv -stand-inf need-prs .3 sg
We do not remember them now, but we must remember them and reinstate
them. [In principle, it should be the causative construction, ɐr-lɐw-un
kɐn-un ʁɐw-uj prv -stand-inf do-inf need-prs .3 sg , but kɐn-un may
sometimes be dropped in the casual speech.]
(40) ɐʁdaw kɐmi nɐ wa womi rasuʁd-ʤijnadɐ
etiquette where neg be.sub . fut .3 sg there beautiful-abstr
ne-kɐd woʣɐnɐj ne-kɐd
neg -when be.fut .3 sg neg -when
Where there will be no etiquette, never will there be any beauty, never.’
888 David Erschler
Abbreviations
abl ablative imp imperative

abstr abstract noun derivation ines inessive
acc accusative inf infinitive
all allative neg negative
aor aorist nmz nominalizer
ass associative plural nom nominative
com comitative num numeral declension
comp complementizer obl oblique
con conative pl plural
cop copula poss possessive
ctr contrastive topic prs present
cvb converb/gerund prt participle
dat dative prv preverb
emp emphatic clitic pst past
equ equative sg singular
fut future sub subjunctive
hab habitual sup superessive
idf indefinite
References
References to the sources of examples

Aghuzarti, Agube. 2008. Zin cardi k’æpxæn [Stairs of Hard Life]. Dzæwugighæu: CI NEI
RPX.
Bedzhyzaty, Chermen. 1995. Uacmystæ [Stories]. Dzæuydžyqæu: Ir.
Bic’oty, Griš. 2003. Kærdædžy st’aly [The Evening Star]. Dzæuydžyqæu: Ir.
Dzasoxty, Muzafer. 2007. Dællag Ir [Lower Ossetia]. Dzæuydžyqæu: Ir.
Gurdzibety, Baris. 2006. Uarzt æma fudænxæ [Love and hate]. Dzæugighæu: Olimp.
K’æbysty, Zaur. 1977. Fæstag majmuli [Last monkey]. Cxinval: Iryston.
Qajtyqty, Azæmæt. 1998. Xod: Taurægh æmæ card [Khod: Legend and life]. Dzæuy-
džyqæu: Ir.
Sabajti, Sulejman. 2010. Radzurdtæ [Stories]. Dzæugighæw: Gassiti Viktori nombæl
rauaghdon-poligrafion kustuat.
Other references
Abaev, Vasilij. 1949. Osetinskij jazyk i fol’klor [Ossetic language and folklore]. Moscow &
Leningrad: Izdatel’stvo AN SSSR.
Abaev, Vasilij. 1958/1973/1979/1989/1995. Istoriko-etimologiˇceskij slovar’ osetinskogo
jazyka [Historical etymological dictionary of Ossetic]. 5 vols. Moskva: Izdatel’stvo
Akademii Nauk SSSR.
Abaev, Vasilij. 1964. A grammatical sketch of Ossetic. Den Haag: Mouton.
Ossetic 889
Abaev, Vasilij. 1965. Skifo-evropejskie izoglossy [Scythian-European isoglosses]. Moscow:

Nauka.
Abaev, Vasilij & H. W. Bailey. 1985. Alans Encyclopædia Iranica 1.8, 801–803. Winona
Lake, IN: Eisenbrauns.
Alemany, Augusti. 2000. Sources on the Alans: A Critical Compilation. Leiden & Boston:
Brill.
Axvlediani, Georgij (ed.). 1963/1969. Grammatika osetinskogo jazyka [A grammar of Os-
setic]. 2 vols. Ordžonikidze: Severo-Osetinskij Naučno-Issledovatel’skij Institut.
Bailey, H. W. 1950. L’accento in osseto digoron. Ricerche Linguistiche I. 58–69.
Bagaev, Nikolaj. 1965/1982. Sovremennyj osetinskij jazyk [Modern Ossetic], part 1, 2. Or-
džonikidze: Ir.
Bekoev, Dmitri. 1985. Ironskij dialekt osetinskogo jazyka [The Iron dialect of Ossetic].
Cxinval: Iryston.
Belyaev, Oleg & Arseniy Vydrin. 2011. Participle-converbs in Iron Ossetic: Syntactic and
Semantic Properties. In Geoffrey Haig, Simin Karimi, Agnes Korn & Pollet Samvelian
(eds.), Topics in Iranian Linguistics (Beiträge zur Iranistik Bd. 34), 117–134. Wies-
baden: Reichert.
Bielmeier, Roland. 1977. Historische Untersuchung zum Erb- und Lehnwortschatzanteil im
ossetischen Grundwortschatz. Frankfurt am Main: Lang.
Boeder, Winfried. 2005. The South Caucasian languages. Lingua 115. 5–89.
Christensen, Arthur Emanuel. 1921. Textes ossètes (Det Kgl. Danske Videnskabernes Sel-
skab. Historisk-filologiske Meddelelser 6/1). Copenhagen: Publisher.
Dzaxova, Veronika. 2009. Fonetičeskie xarakteristiki fonologičeskoj sistemy sovremennogo
osetinskogo (ironskogo) literaturnogo jazyka [Phonetic Characterization of the Phono-
logical System of the Standard Modern Iron Ossetic]. Vladikavkaz: Izdatel’stvo SOGPI.
Dzaxova, Veronika. 2010. Ob osetinskom udarenii [On the Ossetic Stress]. Vestnik RGGU
9/10(52). 9–26.
Dzodzikova, Zaida. 2009. Analitizm v slovoobrazovanii osetinskogo jazyka [Analyticity in
the Ossetic Word Formation]. Vladikavkaz: SOIGSI.
Erschler, David. 2009. Possession marking in Ossetic: Arguing for Caucasian influences.
Linguistic Typology 13(3). 417–450.
Erschler, David. 2010. Otricatel’nye mestoimenija v osetinskom jazyke: areal’nye i tipolog-
icheskie aspekty [Negative Pronouns in Ossetic: Areal and Typological Issues]. Voprosy
Jazykoznanija 2010(2). 84–105.
Erschler, David. 2012a. From preverbal focus to preverbal “left periphery”: On diachronic
and typological aspects of the Ossetic clause structure. Lingua 122. 673–699.
Erschler, David. 2012b. Suspended Affixation and the Structure of Syntax-Morphology In-
terface. Studia Linguistica Hungarica 59. 153–175.
Erschler, David. 2014. A marginal modal form in Digor Ossetic. Acta Linguistica Petropol-
itana X(3). 711–719.
Erschler, David. 2016. Ossetic. In Peter O. Mueller, Ingeborg Ohnheiser, Susan Olsen &
Franz Rainer (eds.), Word-Formation: An International Handbook of the languages of
Europe, 3157–3179. Berlin & Boston: De Gruyter Mouton.
Erschler, David & Vitaly Volk. 2011. On negation, negative concord, and negative imper-
atives in Digor Ossetic. In Geoffrey Haig, Simin Karimi, Agnes Korn & Pollet Sam-
velian (eds.), Topics in Iranian Linguistics (Beiträge zur Iranistik Bd. 34), 135–152.
Wiesbaden: Reichert.
890 David Erschler
Gabaraev, Nikolaj. 1977. Morfologičeskaja struktura slova i slovvobrazovanie v osetin-

skom jazyke [Morphological word structure and word formation in Ossetic]. Tbilisi:
Mecniereba.
Harris, Alice C. 1991. Mingrelian. In Alice C. Harris (ed.), The Indigenous Languages of
the Caucasus, vol. 1, 313–394. Delmar, NY: Caravan Books.
Haspelmath, Martin. 1997. Indefinite Pronouns. Oxford: Clarendon Press.
Isaev, Magomet I. 1966. Digorskij dialekt osetinskogo jazyka [The Digor dialect of Os-
setic]. Moskva: Nauka.
Johanson, Lars. 2008. Case and contact linguistics. In Andrej Malchukov & Andrew Spen-
cer (eds.), The Oxford handbook of case, 494–501. Oxford: Oxford University Press.
Kambolov, Tamerlan. 2007. Jazykovaja situacija i jazykovaja politika v Severnoj Osetii:
Istorija, sovremennost’, perspektivy [Language situation and language policy in North
Ossetia: History, modernity, perspectives]. Vladikavkaz: Ir.
Kudzoeva, Andžela. 2003. Porjadok slov i aktual’noe členenie v osetinskom jazyke [Word
Order and Communicative Structure in Ossetic]. Vladikavkaz: SOGU.
Miller, Wsewolod. 1881. Osetinskie etjudy. Chast’ I. Osetinskie teksty [Ossetic Sketches.
Part 1. Ossetic Texts]. Moscow: Imperatorskij Moskovskij Universitet.
Miller, Wsewolod. 1903. Die Sprache der Osseten. Strassburg: Trübner.
Miller, Wsewolod & Reinhold von Stackelberg. 1891. Fünf ossetische Erzählungen in dig-
orischem Dialect. Mit deutscher Übersetzung, Glossar und Anhang von R. von Stackel-
berg. St. Petersburg: Académie Impériale des Sciences.
Munkácsi, Bernhard. 1932. Blüten der ossetischen Volksdichtung. Keleti Szemle 21. 68–75.
Németh, Gyula. 1959. Eine Wörterliste der Jassen, der ungarländischen Alanen. Abhand-
lungen der Deutschen Akademie der Wissenschaften zu Berlin. Klasse für Sprachen,
Literatur und Kunst 1958(4). Berlin: Akademie-Verlag.
Rosen, Georg. 1846. Ossetische Sprachlehre. Berlin: Lemgo und Detmold.
Sims-Williams, Nicholas. 1996. Eastern Iranian Languages. Encyclopædia Iranica 7.6,
649–652. New York: Center for Iranian Studies, Columbia University. http://www.iran
icaonline.org/articles/eastern-iranian-languages (accessed 23 September 2017).
Sjögren, Andreas Johan. 1844. Iрон Æвзагахур das ist Ossetische Sprachlehre. Nebst
kurzem ossetisch-deutsche und deutsch-ossetische Wörterbuche. St. Petersburg: Kai-
serliche Akademie der Wissenschaften.
Sokolova, Vera. 1953. Očerki po fonetike iranskix jazykov [Sketches of Iranian Phonetics],
part 2. Moscow-Leningrad: Izdatel’stvo AN SSSR.
Stackelberg, Reinhold von. 1886. Beiträge zur Syntax des Ossetischen. Strassburg: Trübner.
Takazov, Fedar. 2003. Digor-urussag dzurduat [Digor-Russian dictionary]. Vladikavkaz:
Alanija.
Takazov, Fedar. 2009. Grammatičeskij očerk osetinskogo (digorskogo) jazyka [A grammar
sketch of Digor Ossetic]. Vladikavkaz: Izdatel’stvo SOGU.
Tedeev, Geras. 1989. Semantičeskaja struktura osetinskogo glagola [The Semgantic Struc-
ture of the Ossetic Verb]. Tbilisi: Mecniereba.
Tekhov, Fedor. 1970. Vyraženie modal’nosti v osetinskom jazyke [Expression of modality in
Ossetic]. Tbilisi: Mecniereba.
Testen, David. 1997. Ossetic Phonology. In Alan S. Kaye (ed.), Phonologies of Asia and
Africa: Including the Caucasus, vol. 1, 707–732. Winona Lake, Indiana: Eisenbrauns.
Thordarson, Fridrik. 1982. Preverbs in Ossetic. In Monumentum Georg Morgenstierne, vol.
2, (Acta Iranica 22), 251–261. Leiden: Brill.
Ossetic 891
Thordarson, Fridrik. 1999. Linguistic contacts between the Ossetes and Kartvelians: A few
remarks. In Helma van den Berg (ed.), Studies in Caucasus linguistics: Selected papers
of the Eighth Caucasian Colloquium, Research School of Asian, African and Amerin-
dian Studies (CNWS), 279–285. Leiden: Universiteit Leiden.
Thordarsson, Fridrik. 2009. Ossetic grammatical studies (Sitzungsberichte der philoso-
phisch-historischen Klasse, Bd. 788, Veröffentlichungen zur Iranistik, Nr. 48). Wien:
Verlag der Österreichischen Akademie der Wissenschaften.
Tomelleri, Vittorio. 2009. The Category of Aspect in Georgian, Ossetic and Russian. Some
areal and typological observations. Faits de langues 2009(1). 245–272.
Tuite, Kevin. 1997. Svan (Languages of the World/Materials 139). München: LINCOM
Europa.
Tuite, Kevin. 1999. The myth of the Caucasian Sprachbund: The case of ergativity. Lingua
108. 1–29.
Uslar, Petr. 1888. Čečenskij jazyk [Chechen]. Tiflis: Izdanie Upravlenija Kavkazskago
Učebnago Okruga.
Vogt, Hans. 1945. Substrat et convergence dans l’évolution linguistique. Remarques sur
l’évolution et la structure de l’armenien, du géorgien, de l’osséte et du turc. Studia
Septentrionalia 2. 213–228.
Volkova, Natalya. 1974. Etničeskij sostav naselenija Severnogo Kavkaza v XVIII – načale
XX veka [The ethnic composition of the population of the North Caucasus in the 18th –
early 19th centuries]. Moscow: Nauka.
6.4. Romeyka
Laurentia Schreiber
1. Introduction
Romeyka is an endangered Indo-European language, belonging to the Hellenic

branch. It is still spoken in situ by Muslim communities on the southern Black Sea
coast in Turkey, and by diaspora speakers in Turkey’s large cities and elsewhere
(see §1.3). A complex set of geographical and historical circumstances have led
to its current sociolinguistic situation, which is outlined in the initial sections of
this chapter. Many structural aspects of Romeyka are still only poorly understood,
though ongoing research is rapidly filling many of the gaps in our knowledge
(Sitaridou 2013, 2014a, 2014b, and references therein). This chapter summarizes
the current state of knowledge, and refers to the available literature for more
detailed coverage.1 The sociolinguistic information in this chapter is based on
Schreiber (2016), Schreiber and Sitaridou (2017).
1.1. Naming conventions

Romeyka is the term that has become widespread in recent research,2 though
“Muslim Pontic”, a term introduced by Mackridge (1987), is used as a technical
term, too (namely by Özkan 2013; partially Brendemoen 2006; Bortone 2009;
see Schreiber and Sitaridou 2017 on glossonomy). The term Romeyka3 is in line
with what many speakers call their language, though others use the term “Rumca”,
especially when speaking in Turkish. As argued by Schreiber and Sitaridou (2017),
the use of either the Greek term Romeyka, or the Turkish term Rumca, corresponds
to the ethnolinguistic vitality of the particular speech community. Regional vari-
ants are referred to with name of the location in which the variety is spoken (e. g.
Romeyka of Of, Romeyka of Sürmene). When referring to other varieties of Pontic
1
I wish to express my sincere gratitude to Dr Hakan Özkan for sharing his data from
Romeyka of Sürmene. I thank Prof Geoffrey Haig for his rich feedback and insightful
discussion of this chapter. Moreover, I am deeply indebted to the Istanbulite Rumca
speaker who generously and patiently provided me an insight into her language. I ded-
icate this chapter to her. Furthermore, I thank Dr Kilu von Prince for her support in the
early stage of this chapter. I acknowledge that all errors are my own.
2
Cf. in particular work undertaken in the British Academy Project ‘Description and Doc-
umentation of the Romeyka varieties in Pontus; Continuity, Contact and Change’, PI:
Dr Ioanna Sitaridou, and consequent publications; www.romeyka.org.
3
For variation in spelling see http://www.romeyka.org/the-romeyka-project/rediscover-
ing-romeyka (accessed 07 August 2017).
Romeyka 893
Greek, the following terms are used: the term “Pontic Greek” without any sup-
plement denotes the language of Christian Pontic Greek speakers in Greece after
1923; “Christian Pontic Greek” is used for the Christian Pontic Greek speakers in
Pontus until 1923.
1.2. Language classification

Romeyka belongs to the Hellenic branch of the Indo-European language family. It
is part of the Asia Minor Greek group where it is considered to form a sub-group
with Cappadocian as opposed to other Asia Minor Greek varieties such as Silliot
(cf. Sitaridou 2014b). Several aspects of Romeyka’s origins remain controversial:
it is unclear how far Romeyka was involved in the development of an Asia Minor
koine (Sitaridou 2014b: 29), and whether Medieval Pontic had diverged into Chris-
tian and Muslim varieties already after Islamization in the 16th/17th century, and
thus before expulsion of the Christian speakers from Turkey in 1923, as argued by
Sitaridou (2014b: 31).
1.3. Sociolinguistic vitality

Assessments of the sociolinguistic vitality of Romeyka vary: “definitely endan-
gered” (Moseley 2010), “seriously/severely endangered” (Moseley 2007), “threat-
ened” (Catalogue of Endangered Languages 2017) and “vigorous” (Lewis, Simons,
and Fennig 2016). These assessments are questionable because they (a) do not dif-
ferentiate between Romeyka and Pontic Greek (hence speaker numbers are inac-
curate), and (b) are based on insufficient data (Schreiber and Sitaridou 2017).
The actual number of Romeyka speakers is not known. The last official number
reports 4,535 speakers and stems from the 1965 Turkish general census (Genel
Nüfus Sayımı), which was the last of its kind accounting for different mother
tongues (reported in Mackridge 1987; Andrews 1989). In general, the estimation
of speaker numbers is difficult due to (a) practical reasons such as internal and
external migration and residence patterns, (b) a biased choice of Turkish as mother
tongue, and (c) mixed Turkish-Romeyka identity at least in the urban communi-
ties. Estimates based on the populations of the villages in Trabzon province where
Romeyka is spoken run to approx. 300,000 speakers (Bilici 2011), but this kind
of estimate is obviously exceedingly coarse-grained (see Schreiber and Sitaridou
2017).
Romeyka is spoken in approximately fifty villages in Trabzon province around
the cities of Sürmene, Of, Çaykara and Tonya (Mackridge 1987; Andrews 1989;
Özkan 2013). Furthermore, since labour migration took place as of the 1950s, there
exist speech communities in urban centres such as Ankara and Istanbul whose lan-
guage vitality differs significantly from rural communities (Schreiber and Sitari-
dou 2017). Since 1960, diaspora communities exist in Germany and other central
894 Laurentia Schreiber
European countries (Özkan 2013). A considerable threat to Pontic Greek as spoken

by Christians in the Black Sea region was the relocation of Christian Pontic Greek
speakers in Greece in the context of the Treaty of Lausanne in 1923. However,
already before the 20th century, Pontic Greeks had migrated to southern Russia
and Caucasia (Moseley 2007) and later to the US, the Ukraine, and Greece (Bren-
demoen 2002).
Romeyka speakers may harbour multiple identity constructs, such as Turkish
national and citizenship identity, Muslim religious identity and, dependent on the
speech community, a sense of cultural Romeyka identity (Sitaridou 2013). The
latter is less pronounced in urban communities, which tend towards assimilation
into a Turkish mainstream (Schreiber 2016). In general, Romeyka speakers deny
any Rum or Greek identity (Sitaridou 2013) and often exhibit conservative Muslim
and nationalistic identification links (Özkan 2013).
All Romeyka speakers are bilingual in Romeyka and Turkish. Depending on
the speech community, the youngest generation acquires Romeyka still as early L2
or, respectively, as heritage language only (Schreiber and Sitaridou 2017).
The development of a written form of Romeyka is not encouraged by Turkish
language education policies, and it generally lacks the range of registers for more
official domains. As a result (and a cause), its use is restricted to informal domains,
and functions now as home and heritage language, with an ongoing decline
(Schreiber 2016). So far, no widely-accepted orthography has been developed
for Romeyka, and we lack a comprehensive descriptive grammar or dictionary,
though documentation is underway (see Fn. 1).
1.4. Dialectal variation

Romeyka is spoken in a mountainous region at the southern Black Sea coast his-
torically known as Pontus. The mountain range of the Pontic Alps reaches some
4,000 metres in altitude, and the rugged and poorly accessible terrain is possibly
a factor in determining regional variation and preservation of archaisms (Özkan
2013). Due to the semi-pastoral lifestyle in the villages and on the mountain pas-
tures (Tr. yayla), a rich lifestyle-specific vocabulary in Romeyka remains, includ-
ing terms for botany, work practices, and products (Schreiber and Sitaridou 2017).
Romeyka arguably consists of three sub-dialects which are mutually intelli-
gible but differ in terms of phonology and lexicon as well as according to some
morphosyntactic features (see Mackridge 1987). However, scholars do not agree
on the number and grouping of sub-dialects. Figure 1 presents three major sub-di-
alectal areas: Romeyka of Of (in previous research also referred to as “Ophitic”,
henceforth ROf; it is not clear whether Romeyka is still spoken in Of nowadays,
cf. Sitaridou 2014b: 32), including Romeyka of Çaykara; Romeyka of Sürmene
(henceforth RSür); and Romeyka of Tonya. Brendemoen (2002: 32) identifies
another sub-dialect around Maçka which he reports to be due to immigration from
Romeyka 895
the Sürmene area similar to that of RSür. ROf is argued to be the most archaic
variety of Romeyka (Mackridge 1987).
The Romeyka varieties spoken in Pontus in turn are more conservative than
Pontic Greek (henceforth PG) due to contact of the latter with Standard Modern
Greek (henceforth SMG) after 1932, but also due the fact that Christian Pontic
Greek in Pontus is assumed to have been in contact with other modern Greek vari-
eties prior to 1923.
1.5. Language contact

Romeyka has been in contact with Turkish probably at least since the Islamiza-
tion waves in the 17th century (Brendemoen 2002) but especially since internal
migration started in the 1960s. Arabic elements were introduced to Romeyka via
Ottoman Turkish. Intense language contact between Romeyka and Turkish (see
Mackridge 1987; Brendemoen 2006) resulted in both lexical and grammatical bor-
rowing and syntactic calquing (Sitaridou 2013).4 Romeyka may also have been
in contact with Armenian Hemshin/Homshetsma, Georgian, and Laz which is sus-
pected of having exerted a substrate influence on Romeyka (Brendemoen 2002;
Sitaridou 2013). Furthermore, Kurdish guest workers must have stayed in the area
at several points in history (Sitaridou 2013) though it is not clear whether this had
any influence.
1.6. Previous studies and methodological background

There are some early grammar(-fragments), focussing on Pontic Greek in Pontus,
in particular Christian Pontic Greek (Deffner 1878; Parcharidis 1880, 1888;
Dawkins 1931, 1937;). As Romeyka is expected to have diverged from Christian
Pontic Greek already before 1923 (Sitaridou 2014b), these descriptions cannot be
uncritically adopted as a baseline for understanding the historical development
of Romeyka. The last comprehensive grammar of PG was presented by Drettas
(1997). We still lack a thorough typological description of many domains of the
grammar of Romeyka although previous publications aimed at summarizing the
features of Romeyka that differ from SMG or other PG dialects (Mackridge 1987;
Bortone 2009; Özkan 2013). During the last years, a considerable body of research
has been carried out on the syntax of Romeyka (see Sitaridou 2013, 2014a, 2014b,
and the British Academy Project mentioned in Fn. 1).
The purpose of this chapter is to summarize the information available on
Romeyka grammar rather than presenting new data. Apart from above mentioned
4
For an investigation of mutual influences of Pontic Greek in Pontus and regional
Turkish varieties, especially with regard to the phonological domain, see Brendemoen
(2002).
896
Laurentia Schreiber
Figure 1: Micro-variation in Romeyka

Romeyka 897
earlier research, the information in this chapter draw on an unpublished grammar

sketch of Özkan (henceforth cited as Özkan, n.d.) as well as on own field work
with a female 81-year old Istanbulite L1 speaker of ROf as spoken in Çaykara.
The following description does not focus on a particular variety of Romeyka but
rather aims to provide general information about particular linguistic features.
When information on sub-dialectal variation is available, differences between
the varieties are pointed out as such.5 The following sections give information
on phonology (§2); nominal inflection (§3); determiners, pronouns, and quantifi-
ers (§4); verbal morphology (§5); and syntax (§6) including the syntax of the noun
phrase (§6.1), prepositional phrases (§6.2), negation and modality (§6.3), word
order in independent (§6.4) and subordinate clauses (§6.5), adverbial (§6.7) and
conditional clauses (§6.8).
2. Phonology
To date, no comprehensive analysis of Romeyka phonology is available, nor is

there a survey of Romeyka-internal phonological variation. The following notes
are gleaned from several sources, and my own fieldwork (see §1.6). Some aspects,
in particular those diachronically divergent from SMG or other Greek varieties,
have been described by Mackridge (1987, 1999), Bortone (2009), and Özkan
(2013). The phonology of Christian Pontic Greek is described by Oikonomidis
(1908). Drettas’ (1997) grammar of Pontic Greek includes a chapter on phonology.
Furthermore, Brendemoen (2002) describes the phonology of the Turkish dialects
of Trabzon with reference to interference with Romeyka.
2.1. Segments
2.1.1. Vowels
Romeyka exhibits, in addition to the five vowels also present in SMG, /a/, /e/,
/i/, /o/, /u/, also the vowels /æ/ and, more rarely, /œ/ (Mackridge 1987; Özkan
2013). The sub-dialect ROf as spoken in Uzungöl (Sarachos) seems to be the
only variety lacking the latter two vowels (Mackridge 1987: 131). According to
Mackridge (1987: 121), the vowels /ɯ/ and /y/ are used in Turkish loans, whereby
Özkan (2013: 142) points out that these vowels often figure differently in the
Trabzon-Turkish dialect, possibly due to contact with Greek (Brendemoen 2002).
5
Given the fact that Romeyka lacks any orthography and that its phonemic inventory has
not been adequately described yet (Mackridge 1987), Romeyka examples are presented
in a provisional broad phonemic transcription; word accent is not indicated.
Unlike in SMG and other Greek varieties, initial unstressed vowels are retained
in Romeyka verbs, e. g. epero ‘take.1sg ’, SMG παίρνω, and nouns, see (1a) and
(1c) (Mackridge 1987; Bortone 2009). In some varieties of the language, also post-
tonic /i/ and /u/ are retained whereas they are dropped in others (1a–b, see also
§2.2) (Mackridge 1987; Özkan 2013).
(1) a. /ospiti/ ROf as spoken in Uzungöl (Sarachos)
b. /ospit/ RSür as spoken in Beşköy
c. σπίτι, /spiti/ SMG
‘house’
In ROf, vowels are lengthened at the end of words and, particular when they are
phrase final (Özkan 2013: 139). At present I assume that there is no phonemic
length distinction, but more detailed analysis would be necessary to demonstrate
this assumption conclusively.
2.1.2. Consonants
The consonantal system of Romeyka is presented in Table 1.
Table 1: The consonantal inventory of Romeyka

Labial Dental Alveolar Post- Velar Uvular Glottal
alveolar
or Palatal
Plosive Voiceless p t k
Voiced b d g
Affricate Voiceless ts tʃ
Voiced dʒ
Fricative Voiceless f θ s ʃ x h
Voiced v ð z ʒ6 ɣ
Nasal m n
Tap ɾ
Lateral l
Approximant j
6
No example of this phoneme (apart from the consonant cluster /dž/) found in the mate-
rial available to me.
Romeyka 899
In essence, the consonant system is Greek with few additional phonemes, i. e. in
particular the palatal fricatives and affricates which exist in Turkish loans and
occur as conditioned variants of palatalized /x/, /g/, and /k/ though not all palatal-
ized velars become palato-alveolars (Mackridge 1987). Özkan (2013: 140–141)
reports for RSür as spoken in Beşköy allophonic variation regarding the place of
articulation of the voiceless fricatives, e. g. /x/ → /χ/, /x/. Romeyka exhibits at
least three affricates /ts/, /tʃ/, and/dʒ/.
The voice distinction is weakened for plosives, with underlying voiceless stops
often realized as voiced, though, according to my own fieldwork, that seems to
hold true to a lesser extent for the velar plosive /k/. Mackridge (1987: 123) claims
for PG that there is, apart from loan words, no distinction of voice in the stops.
Stops are generally considered voiced with fortis articulation and become lenis
after a nasal (Mackridge 1987; Özkan 2013). The lack of a clear distinction with
regard to voicing is reflected in the phonology of the regional Turkish variety
(Brendemoen 2002). The velar stops undergo certain phonetic processes: Unlike in
SMG, palatalized /k/ before a front vowel becomes /tʃ/ and palatalized /g/ becomes
/dʒ/. This process is prevented by a preceding /s/ which becomes /ʃ/, like in škilos
‘dog’ (Mackridge 1999, 1987).
2.2. Syllable structure7

The canonical syllable pattern in Romeyka is (C)(C)V(C)(C). A word consists
minimally of a single vowel, e. g. the definite feminine article i.
All consonants can occur in syllable initial position. Unlike SMG, Romeyka
has retained initial unstressed vowels in nouns and verbs (see §2.1.1) (Mackridge
1987: 122).
Romeyka allows consonant clusters consisting of two consonants mostly
in syllable initial position. Frequently occurring initial consonant clusters are
sequences of fricatives and plosives, fricatives and laterals; some examples are:
tše.ra.to ‘horn’, pa.tsi ‘girl’, sto.ma ‘mouth’, an.dras ‘husband’, kse.ro ‘I know’,
o.pse ‘night’, ški.lo ‘dog’, fti.ra ‘louse’, a.spra ‘star’, ɣli.tši ‘milk’. Consonant
clusters in syllable codas are permitted, e. g. e.mist ‘we’. Note that the assumed
maximum of two consonants in onset or coda can be maintained by considering
certain combinations to be affricates, hence single segments. For example, the /
tš/ in i turtš ‘the Turks’ can be analysed as an affricate, yielding a CVCC syllable.
However, the presence and status of affricates in Greek phonology remains gener-
ally a controversial issue.
Romeyka exhibits, among others, the following vowel sequences: /ia/, /io/, /
ai/, /ae/, /au/, /oi/, e. g. malía ‘hair’, ðío ‘two’, faí ‘bread, food’, erlaepsa ‘I under-
stand’, auros ‘man’, oʉlo ‘sun’. It cannot always be decided whether they should
7
Note that this section is merely based on a 113-item word list from my own fieldwork.
be considered diphthongs (hence filling a syllable nucleus), or whether they should

be considered bisyllabic. The combination /ao/ may arise secondarily through the
elision of an intermediate velar fricative; this topic requires research.
The combinations /ia/, /io/ are generally realized bisyllabically. Unlike in
SMG, they retain their original bi-syllabic structure and resist the tendency to
become /j/ + stressed vowel (Mackridge 1987; Bortone 2009). In plural forms,
also a sequence of three vowels may occur, as in faía ‘bread.pl ’. As in PG, no
contracted verbs exist; the vowels of the diphthongs /ei/ and /ao/ are separated by a
morphemic boundary as in e-is ‘you have’, and are likely to be parted by a glide as
in pa-(ɣ)o ‘I go’ (cf. Drettas 1997: 219). Vowel clusters of two identical vowels do
not occur; the masculine and feminine definite articles o and i are dropped before
vowels (Mackridge 1987).
There is significant sub-dialectal variation in the phonological shape of nouns
that developed out of AG neuters in -ion: In SMG the /on/ ending is lost, whereas
ROf as spoken in Uzungöl (Sarachos) retains an older form with /in/, e. g. omatin
‘eye’, and RSür as spoken in Beşköy drops the final /n/ (Özkan 2013), e. g. omati
‘eye’ (Mackridge 1987: 132). Özkan (2013: 142) states that in varieties dropping
unstressed word-final vowels, i. e. omat ‘eye’, due to this vowel deletion all con-
sonants can appear in word-final position. According to my own data, most nouns
end in a vowel, with only rare instances of voiceless obstruents. It is difficult to
identify weather final devoicing takes place as there is in general hardly distinction
with regard to voicing (§2.1.2).
3. Nominal inflection
Number, gender and case are the main categories of nominal inflection. Nouns
were historically assigned to declensional classes, though the analysis of this
system (or its remnants) in Romeyka remains incomplete. Within the noun phrase,
articles, adjectives, and some numerals agree with the nominal head in gender,
case, and number, although there is variation especially with regard to gender.
3.1. Number
The feature of number in Romeyka has the values singular and plural. The plural
is formed by adding a plural suffix, originally selected according to (grammatical)
gender (see §3.2): neuter nouns inflect with -æ or -a, e. g. ta raši-a ‘the moun-
tains’; and feminines and masculines by -ðæs or -ðes, e. g. i patsi-ðæs ‘the girls’
(see Table 2 for the paradigm).
Romeyka 901
Table 2: Number paradigm of RSür as spoken in Beşköy (based on Özkan 2013: 143–144)

sg pl
m o komšis i komšiðæs ‘the neighbour(s)’

f i mana i manaðæs/manas ‘the mother(s)’
n t=osbi ta osbidæ ‘the house(s)’
An older masculine ending -and (Mackridge 1987; Drettas 1999) seems to have
been superseded by the -ðæs ending, as evident in a double set of plural forms,
e. g. babugand /babugaðæs ‘grandfathers’ (Özkan 2013: 144). The plural in -and is
only used for some nouns often denoting peoples, families, relatives, professions,
and some animals (Özkan 2013: 144), e. g. i turkand ‘the Turks’ but cf. i turtš ‘the
Turks’ in Deffner (1878: 212). A /k/ in the word-final syllable of masculines or fem-
inines may yield a /tš/ plural ending – at least in some varieties, e. g. turtš ‘Turks’;
ineka – inetš ‘women’, presumably reflecting a palatal quality of an underlying
plural marker, perhaps -s. Another plural suffix is -ini, e. g. aurini ‘men’.
The original declension classes have increasingly become difficult to iden-
tify (Drettas 1997) because of the following factors: (i) masculines and femi-
nines have often come to share the same endings (Özkan 2013); (ii) there has
been a tendency to assign masculine and feminine endings only to animate nouns,
whereas all inanimate nouns are, irrespective of their grammatical gender, treated
as neuter (Dawkins 1931; Drettas 1997; see also Karatsareas 2011 for neuterisa-
tion in Cappadocian Greek); (iii) a more general spread of plural neuter forms to
masculine and feminine nouns; (iv) some nouns have two plural declensions, e. g.
mana ‘mother’ in (Table 2). Janse (2002) mentions a possible areal influence of
the neighbouring languages not exhibiting gender distinction on the instability of
Romeyka nominal inflection.
3.2. Gender
For agreement purposes, Romeyka distinguishes three genders: masculine, feminine,
and neuter. Originally, gender was assigned morphologically but shifted toward a
more semantically-oriented assignment based on animacy (see Karatsareas 2014;
see also §4.1.1). This development, starting in Ancient Greek (henceforth AG) and
going further in Romeyka than, for example, in SMG, includes the spread of neuter
forms to masculine and feminine paradigms in both singular and plural declension
of nouns, determiners, and adjectives (2) (Mackridge 1987; Özkan 2013).
(2) (Özkan 2013: 144; RSür as spoken in Beşköy)
o vuðias.m . sg – ta vuðia.n . pl ‘the ox(en)’
The spread of neuter applies especially to the plural of [– hum ] feminine nouns
and inanimate masculines, and it may even extend to [+ hum ] masculine/feminine
nouns. Female proper nouns may be assigned the singular neuter article to (Özkan
2013: 145).
In [– hum ] feminine nouns, also mixed declensions occur consisting of a female
determiner and neuter adjective (Mackridge 1987).
Gender inflection is also a means to integrate Turkish loans: [+ hum ] masculine
nouns add -is (-s following a vowel), while [– hum ] nouns add the neuter suffix -i
(or -in following a consonant), and inanimate nouns ending in a vowel are treated
as feminines without suffix, e. g. i para ‘money’ (Tr. para ‘money’) (Mackridge
1987).
3.3. Case
Romeyka has morphological exponence of nominative, accusative, and genitive
cases, expressed on articles and nouns, and other NP constituents. Note that mor-
phological case distinctions on nouns seem to be entirely neutralized in the plural,
yielding uniform case-neutral plural endings (cf. Table 3). With the articles, case
distinctions are partially retained in the plural, but there seems to be syncretism of
plural accusative and plural genitive forms (see Table 4), though there is dialectal
variation here. The accusative is used for both the direct and the indirect object,
unlike in other Greek dialects where the genitive is used to mark the indirect object
(Mackridge 1987). The genitive expresses nominal determination and possession
(Drettas 1999). It is not yet known how AG declension classes figure in Romeyka
case inflection. Furthermore, there appears to be considerable variation, e. g. ta
peðiði.gen . pl ‘boy.gen . pl ’ instead of ta peðia.gen . pl (Özkan 2013).
Table 3: Nominal declension of RSür as spoken in Beşköy (based on Özkan 2013: 143–144)
sg pl
m nom o komši-s i komši-ðæs ‘the neighbour’

acc ton komši8 tsi komši-ðæs
gen tu9 komši tsi komši-ðæs
f nom i mana i mana-ðæs ‘the mother’
acc ti mana tsi mana-ðæs
gen tsi mana-s tsi mana-ðæs
n nom t[o]=osbi ta osbid-æ ‘the house’
acc t[o]=osbi ta osbid-æ
gen t[u9]=osbit-i ta osbid-æ
8
A word-final /n/ originally occurring in the accusative singular of all genders (in nouns
of certain declension classes) is subject to sub-variation and may appear in some nouns,
e. g. ton tširin ‘the father.acc ’ (Mackridge 1987: 124).
9
Only in ROf as spoken in Sarachos (Mackridge 1999) and RSür (Dawkins 1937), other
varieties use tsi.
Romeyka 903
The declension of some masculine nominative singular nouns (those of the second
declension; in any case only animates) is sensitive to definiteness: the original -os
ending becomes -o(n) when the noun is definite, e. g. škilos, o škilo ‘(the) dog’
(Mackridge 1987: 124; Dawkins 1931: 394, also for other Asia Minor Greek vari-
eties). According to Dawkins (1931), this phenomenon is caused by the merger
of the second and third declension class whereby the nominative of the second
declension in -os is used for indefinite and the nominative of the third declension
in -o(n) for definite nouns. Note that the -o(n) nominative used for definite nouns
coincides with the accusative declension (Dawkins 1937: 31).
Proper nouns denoting male persons are treated like masculine singular nouns:
they take the masculine definite article o, and the -is ending for nominative, e. g.
o Mehmet-is m . sg . nom , although there may be a contact-induced tendency to drop
the article in the nominative in clause-initial position (for an example for the lack
of the article on a clause-initial subject NP see ex. 23 below); this requires more
research. Note, however, that Janse (2008: 23) claims for Cappadocian that the
nominative became associated with indefiniteness due to syncretism with indef-
inite animate object NPs and that therefore even definite masc./fem. subject NPs
do not take a definite article.
According to Table 3, we would expect a proper noun in direct object function
to drop the nominative -s. However, in (3) the ending is also dropped for a proper
noun in subject function. This suggests that the nominative form only occurs if
the constituent concerned is subject, and in sentence-initial position (cf. ex. 41),
though this requires further confirmation (cf., for example, Janse (2004: 14) who
claims for Cappadocian that due to syncretism of the nominative and indefinite
accusative of masc. animate nouns the nominative marker -s has been reanalysed
as an indefiniteness marker).
(3) (Michelioudakis and Sitaridou 2012b: 219; ROf, glosses modified)
Eðotšen=eme(n) o Mehmet-Ø ato(n).
gave.3 sg =me.acc . cl the Mehmet-nom him/it.acc
‘Mehmet gave me this/it.’
4. Determiners, pronouns and quantifiers
4.1. Definite and indefinite articles
4.1.1. Definite articles

Romeyka exhibits three definite articles: the masculine singular article o, the fem-
inine singular article i, and the neuter singular article to (see Table 4). ROf as
spoken in Uzungöl retains an ancient shape of the masculine/neuter genitive singu-
lar article, tu, which is ti/tsi in other Pontic varieties (Mackridge 1987). The mas-
culine accusative plural article in ROf as spoken in Uzungöl is tus which appears
elsewhere in the form tsi/tsu/ti/tin/tis. Especially in the masculine and feminine
plural the articles in Romeyka present an array of different forms (Özkan 2013).
Table 4: Definite articles in Romeyka

sg pl
m nom o i
acc to(n) tsi/tus
gen tsi/tu tsi
f nom i i
acc ti(n) tsi
gen tsi tsi
n nom t(o) t(a)
acc t(o) t(a)
gen tsi/tu t(a)
As regards the structure of a NP, an attributively used adjective preceding the

noun requires its own definite article, so the NP contains two articles: before the
adjective and the noun (cf. 6) (Mackridge 1987). Masculine and feminine definite
articles may be dropped before nouns with syllable-initial vowels.
In general, the definite determiners in Romeyka agree with the head in number,
gender, and case. However, the spread of the neuter plural declension to [–hum ]
feminine and inanimate masculine nouns (see also §3.2), and the spread of neuter
adjective inflection to adjectives qualifying a [–hum ] feminine noun, contribute
towards a mixed system (Mackridge 1987: 128). Michelioudakis and Sitaridou
(2012a: 366, Fn. 3) note that in Romeyka, we find both an inherited grammatical
gender agreement system, and a semantic agreement system, the distribution of
the two being conditioned by properties of the head such as animacy and gender.
Following Karatsareas (2011), they note that position on the Animacy Hierarchy is
relevant, with humans behaving distinctly from non-human nouns (see also Kar-
atsareas 2014). In example (4), we witness the initial article to indicating semantic
agreement (if neuter is considered to reflect non-humanness), while the article
immediately adjacent to the head exhibits syntactic (feminine) agreement. This
would also be in line with Corbett’s (2006: 234) prediction that with stacked
agreement targets, as in (4), the target more distant from the controller will show
semantic agreement, if any target does (see also Karatsareas 2014). Singular defi-
nite articles immediately preceding their heads always exhibit syntactic agreement
in Romeyka (Michelioudakis and Sitaridou 2012a: 366, Fn. 3).
Romeyka 905
(4) (Mackridge 1987: 128; PG, glosses mine)

to kokinon i kosdra
def . art . n red.n def . art . f hen.f
‘the red hen’
4.1.2. Indefinite articles

The indefinite article has the form of the numeral ‘one’ ena and inflects for case.
4.2. Pronominal system
4.2.1. Personal pronouns

Romeyka has both strong and weak personal pronouns, the latter being encliti-
cally attached to the verb (see (3) for an example). Enclitic pronouns only occur
in non-nominative functions. Strong and weak personal pronouns in Romeyka are
presented in Table 5. Note that there is a considerable amount of micro-variation
in the form of the pronouns; the ones presented here are from ROf.10
Table 5: Subject and object pronouns in Romeyka

Subject pronoun Strong object pronoun Weak object pronoun (clitic)
1 sg eɣo emena(n) -m(e)
2 sg esi esena(n) -se
3 sg atos.m ; ato(n(a)).n / m ; 11 -((a)n)æ
ate.f ; ate(n(a)).f
ato.n
1 pl emis(t) emas(una) -mas(una)
2 pl esis(t) esas(una) -sas(una)
3 pl atin atinus(a)/atinuna -ata
Object (accusative) pronouns have emerged from AG enclitic personal pronouns

(Özkan 2013: 143) and are either strong or weak whereby, as stated by Micheli-
oudakis and Sitaridou (2012b: 219), unlike in other Greek varieties, the use of
strong object pronouns seems to be an unmarked option. The third singular object
clitic -æ seems to be the only third-person form with neutralized gender (Miche-
lioudakis and Sitaridou 2012: 219) although Özkan (2013: 146) distinguishes in
RSür between -((a)n)a.m / n and -((a)n)æ.f . In general, the form of the object clitics
10
The data stem from Özkan (n.d.), from Michelioudakis and Sitaridou (2012b), Sitaridou
(2014b) as well as from own fieldwork.
11
Also etšinos ‘this one’ which is actually a demonstrative pronoun.
varies according to whether they are used in transitive or ditransitive verbs, i. e. as
to whether clitic clusters occur (but cf. Michelioudakis and Sitaridou 2012b: 237,
who argue that there are no clitic clusters in Romeyka, as they question the clitic
nature of first and second person object pronouns, the only ones that occur in
clusters).
As Michelioudakis and Sitaridou (2012b: 218) argue, if two object suffixes
occur together, (weak) Person-Case-Constraint (PCC)-like restrictions apply, such
that the third person clitic -æ cannot combine with any other clitic to form a cluster,
see (5a). However, the combination of a first- and a second-person pronoun (strong
PCC) is, unlike in SMG, acceptable (5b).
(5) (Michelioudakis and Sitaridou 2012b: 218; RSür, glosses modified)
a. *O Mehmetis eðotše=m=æ
the Mehmet gave.3sg =me.acc . cl =it.acc . cl
‘Mehmet gave it to me.’
(Michelioudakis and Sitaridou 2012b: 238; RSür, glosses modified)
b. Eðiksane=m=ese. / *eðiksane=s=eme
showed.3pl =me.acc . cl =you.acc / showed.3pl =you.acc . cl =me.acc
‘They showed me to you.’
Furthermore, first- and second-person accusative pronouns cannot be interpreted
as direct objects in combination with third-person pronouns irrespective of their
order (6).
(6) (Michelioudakis and Sitaridou 2012b: 238; RSür, ROf)
Eðiksan(e) æ/aton(a) emenan
showed.3pl him.acc . cl /him.acc me.acc
‘They showed him to me/ *They showed me to him.’
Özkan (2013: 146) notes, though, that the third person neuter pronominal suffix
can be combined with the neuter definite articles to and ta.pl used as direct object
pronouns after imperative forms like in ipe-na-to ‘say it’ when referring to some-
thing previously introduced to the context. For a more detailed discussion of con-
straints of clitic stacking in Romeyka see Michelioudakis and Sitaridou (2012b).
4.2.2. Demonstratives
Demonstratives have yet to be researched in any detail for Romeyka. Demonstra-
tive pronouns in Romeyka such as etšinos ‘this one’ inflect in general for number,
gender, and case. Özkan (n.d.) assumes that the demonstrative pronoun avudos or
afto ‘this’ (cf. Trabzon dialectal habu ‘this’ which has arisen due to Greek influ-
ence, Brendemoen 2002) can only be used as anaphoric pronouns referring to a
previously introduced antecedent, although deictic use seems to be possible, too
(7).
Romeyka 907
(7) (Own fieldwork; ROf)

Avudo to saχan epero.
dem the.acc plate.acc take.1sg
‘I take this plate.’
The 3rd person neuter pronoun seems also to be used as a demonstrative (8).
Ato to kitap temo en.
3sg . n def . art . n book 1sg . poss be.3sg
‘This book is mine.’
If a demonstrative determines a noun phrase, the definite article is obligatory
before the noun.
4.2.3. Possessives
Possessive pronouns in Romeyka are either enclitics, or independent possessive
pronouns (Table 6), which originate from AG possessives (Mackridge 1987;
Bortone 2009; Sitaridou 2014b). The third person possessives derive from demon-
stratives and therefore inflect (in the singular) in accordance with the gender of the
possessor (cf. Drettas 1997).
Table 6: Possessive pronouns and pronominal adjectives in Romeyka

Clitic pronoun Pronominal adjectives
1 sg -(i)m temo(n), tema.n / f . pl
2 sg -(i)s teso(n), tesa.n / f . pl
3 sg -(a)t tatinu.m / n , tatines.f 12
1 pl -(e)mun(a) temetero(n), temetera.n / f . pl
2 pl -(e)suna tesetero(n), tesetera.n / f . pl
3 pl -atun(a) tatinuna12
The initial vowel of the clitic pronouns may change according to the phonological
shape of the noun they are attached to. Prepositions indicating direction, such as s
‘to’, fuse with the possessive pronoun, like in s=temetero > semetero to xorio ‘to
our village’.
12
Drettas (1997) also mentions for Pontic Greek tetšinu.m / n , tetšines.f as 3rd person sin-
gular and tetšinon as 3rd person plural possessives for distant possessors.
Another third-person singular possessive suffix used for inanimate objects,

animals, and babies is -(e)θe, which can apparently be combined with the other
possessive clitics (9) (see also Özkan 2013: 146).
(9) (Sitaridou 2014b: 52; ROf, glosses modified)
Afti i ðulian13 to maθin=emun-eθe ɣola en
this.nom the.nom job.nom the learning=our-its easy be.3sg
‘It is easy for us to learn how to do this job.’
Possessed nouns require a definite article, regardless of whether the possessive
pronoun is independent or clitic (10a–b). Independent possessive pronouns are
generally preposed (10a), as are possessive nouns (11a). A possessed NP can itself
occur recursively as a preposed possessor, as in (11b). A kind of postposed posses-
sor seems also to be possible, as shown in (12).
a. temetero t=ospi
1sg . poss the=house
b. t=ospi=temona
the=house=1 pl . poss
‘our house’
a. tsi Aišes to ketši
the.gen Ayse.gen the.nom goat.nom
‘Ayse’s goat’
b. tsi patsis=im to ketši
the.gen daughter.gen =1sg . poss the.nom goat.nom
‘my daughter’s goat’
t=ospi tin ates
the.nom =house.nom the.acc (?) she.gen
‘her house’
4.2.4. Relative pronouns and relativizers

Relative pronouns, or more generally relativizers, are discussed in §6.5.1.1 below.
A possible candidate for a relative pronoun is a form derived from the neuter
article, d(o)/d=/t=.14 It seems to be used with inanimate, and animate non-human
13
Janse (2002: 221) notes that in some cases the possessor of a genitive phrase occurs
without overt genitive marking, presumably modelled on the Turkish compound-type
of possessive construction.
14
Özkan (2013) reports the form nto in RSür.
Romeyka 909
nouns (Özkan, n.d.) and for the relativization of objects (13), spatial complements,
subjects, and as a free relativizer. Note that this kind of relativization is post-nom-
inal, as opposed to the dominant prenominal strategy discussed in §6.5.1.1 below.
(13) (Gandon 2016: 221; Romeyka, glosses modified)
Eɣrapses et͡ ʃinon do ipamen
record.pst .2 sg that.acc rel say. pst .1 pl
‘Did you record what we have said?’
Variable relativizers are those derived from the interrogative pronouns, e. g. opios
and otinan only used for animates/humans (Özkan, n.d.). Furthermore, Sitaridou
(2014b: 30) notes the use of itina, an ancient relativizer.
4.2.5. Interrogatives
The information in this section stem from Michelioudakis and Sitaridou (2012a)
on wh-fronting in PG and Romeyka. Wh-words in Romeyka are presented in
Table 7. Sitaridou (2014b: 30) mentions furthermore retention of AG hoθen < ὅθεν
‘wherefore’.
Table 7: Interrogatives in Romeyka (adapted from Michelioudakis and Sitaridou

2012a: 356–358)
+ hum –hum
nom pios ‘who’ pios15 ‘what’
kaškiši ‘how many’ pion ‘what’
acc tinan ‘who’ pion ‘what’
motinan ‘with whom’ do ‘what’
kaškišus ‘how many’ doxna(n)16 ‘what’
Other laɣo ‘how’
pote ‘when’
pu merea/tšeka ‘where.side/there’
apoxen ‘from.where’
poso ‘how much’
ya to pio ‘why’
According to Michelioudakis and Sitaridou (2012a: 357), interrogatives in

Romeyka have grammaticalised [±hum ] restrictions but lack number/gender dis-
tinctions. The Turkish loan kaš(i)kiši ‘how many’ is optionally used together with
pios ‘who’ to add an indication of plurality (14).
15
Note that this source is difficult to interpret at this point.
16
In ROf only (Michelioudakis and Sitaridou 2012a: 358).
(14) (Michelioudakis and Sitaridou 2012a: 358; ROf)

Pios kaš(i)kisi erθen?
Who nom . pl came.3sg
‘Who (=many) came?’
All wh-phrases appear obligatorily in the left periphery, where two or more
wh-phrases can be fronted (Michelioudakis and Sitaridou 2012a: 360). Micheli-
oudakis and Sitaridou (2012a: 363) argue that movement is obligatory as wh-ele-
ments are not permitted in situ. Apart from discourse-linked interrogatives, multi-
ple wh-fronting is order-preserving (15).
(15) (Michelioudakis and Sitaridou 2012a: 363; ROf)
Pios tinan pote efilise?
who.nom who.acc . hum when kissed.3sg
‘Who kissed whom and when?’
Romeyka has borrowed the Turkish interrogative particle mI, used in polar ques-
tions, albeit without vowel harmony (16). Like in Turkish, the question particle is
flexible to move for focalization purposes. Note that in this example, the wh-word
laɣo is in situ; whether this is evidence of syntactic influence from Turkish, or has
other causes, cannot be discerned.
Teso i patsi laɣo en, emorfo mi en?
2 sg . poss the.nom daughter.nom how be.3sg beautiful q be.3 sg
‘How is your daughter, is she beautiful?’
4.2.6. Quantifiers and numerals

Quantifiers are at least olos (17) and ul(os) ‘all’ (18). The status of the following is
unclear: kat ‘some’, kanis ‘nobody’, tipu ‘nothing’ (Özkan, n.d.). At least the first
two can take personal clitics (19).
Olon emorfo ospi temo en.
all.acc beautiful house 1sg . poss be.3 sg
‘The most beautiful house is mine.’
So Istanbol op= ine ul Rumdža u= kserone.
in.the Istanbul rel = be.3pl all.nom Rumca neg = know.3 pl
‘The people (who are) in Istanbul all don’t know Rumca.’
Romeyka 911

Eɣo ul=isuna eɣabo.
1sg all=2pl . acc love.1sg
‘I love you all.’
Romeyka numerals exist only for the numbers ‘one’ to ‘five’, the rest is Turkish
(Mackridge 1987). The numeral ena ‘one’ can occur determined, e. g. to ena ‘the
one’. Furthermore, pronominal clitics can be attached to numerals, though the
personal suffix in (20) resembles the Turkish first plural suffix -Iz. Similarly, Tr.
tek ‘one, only’ may be used emphatically, e. g. ena tek ɣarðel ‘just one child’, cf.
Turkish bir tek çocuk.
na troɣume i ði-jiz
fut eat.1pl the two-1pl
‘The two of us will eat.’
Romeyka finite verbs encode person, TAM, and voice. Some functions are
expressed periphrastically by the use of particles, such as the future marking parti-
cle na, and others by enclitic affixes and/or stem alternation. The verbal system of
Romeyka contains many archaisms absent in other Modern Greek dialects such as
the AG aorist infinitive (see Mackridge 1987; Sitaridou 2014b).
5.1. Person agreement

Romeyka verbs are specified for person and number, realized through personal
suffixes agreeing with the subject. The clitic pronouns for objects (cf. Table 5
above) form a distinct paradigm; whether they should be considered “agreement”
is an issue we do not take up here. In what follows, I refer exclusively to the sub-
ject-indexing suffixes as agreement. The paradigms vary according to verb stems.
Some verbs are strong, i. e. they exhibit different stems with regard to tense.
The citation form used here is the first person singular present. Verbs can be
classified according to the endings in this form: verbs ending in -o, e. g. tro(ɣ)o
‘eat’, and verbs ending in -me, e. g. tšimume ‘sleep’, (Table 8 for the full par-
adigm based on own fieldwork). Within these classes, verbs can be subcatego-
rized according to ultimate and penultimate stress, the latter verb class forming the
major group (Drettas 1997: 205).
Table 8: Present tense inflectional paradigm of Romeyka

Verbs in -o Verbs in -me
‘eat’ ‘sleep’
1sg tro-(ɣ)o tšim-ume
2sg tro-s tšim-ase
3sg tro-i tšim-ate
1pl tro-ɣume tšim-umist
2pl tro-te tšim-asten
3pl tro-ɣune tšim-untane
Strong verbs have stems differing in phonologically unpredictable ways, or through

suppletion. They can be divided into two groups: (I) those with an imperfective
stem, used for e. g. the present tense, and a deviating perfective (traditionally
aorist) stem, e. g. elepo ‘see.prs ’, elepa ‘see.ipfv ’, iða ‘see.aor ; (II) those with
a present stem different from the aorist and imperfect stem, e. g. eftao ‘do.prs ’,
epina ‘do.ipfv ’, epika ‘do.aor ’ (Drettas 1997: 205).
5.2. Tense
Romeyka has three morphologically marked tenses: present, imperfect, and aorist.
Other tenses such as the future are formed periphrastically. Remarkably, there
is no periphrastic perfect tense, e. g. with ‘have’ (Mackridge 1987: 127).17 Note,
however, periphrastic present progressives which are discussed in §5.3.
5.2.1. Present
The present tense expresses an imperfective aspect and is used for habitual actions
(21). It is formed on the basis of the imperfective stem. For the paradigm see
Table 8.
Her sabaχtan so mektep paɣo
every morning to.the school go.1sg
‘Every morning I go to school.’
17
See Sitaridou (2014a: 122) for the discussion of a complex predicate consisting of
the verb ‘have’ in past tense followed by an infinitive which resembles a past perfect
though only functioning as a counterfactual.
Romeyka 913
5.2.2. Imperfect
The imperfect expresses an action which happened continuously or habitually in
the past. Like the aorist, it is formed by the vocalic augment /e/ (Bortone 2009),
like in emaireva ‘I used to cook.ipfv ’ vs. emairepsa ‘I cooked.aor ’. As in the
other tenses, the inflectional paradigm depends on the verb class (cf. Drettas 1997
for PG). In the speech of the Istanbulite ROf consultant, however, no imperfect
forms were attested. Instead, the speaker used an aorist in the relevant context, as
in (22):
Mikrina anda=emunest-ten emist-pal so mektep epixame
small(?) time.in=our-? we-foc to.the school go.aor .1 pl
‘When we were small, we used to go to school.’
5.2.3. Aorist
The aorist is used for perfective actions in the past (23). It can be combined with
temporal adverbs or inchoative verbs to mark the beginning of an action carrying
on to the present. Furthermore, it can be used (together with the future particle
na) for imminent actions happening in the near future (24). It is formed on the
basis of perfective aspect, i. e. the aorist stem with the ancient temporal augment
/e/ (Table 9), its forms matching largely with the AG aorist endings (Sitaridou
2014b: 53).
ɵretmenis aso χorio erθe
teacher.nom from.the village come.aor .3 sg
‘The teacher came from the village.’
t=akšam n=arde
the=evening fut =come.aor .3 sg
‘She will come this evening.’
Table 9: Aorist endings in ROf

Verbs in -o Verbs in -me
‘ate’ ‘slept’
1 sg e-fa-ɣa e-tšim-eθa
2 sg e-fa-es18 e-tšim-eθes
3 sg e-fa-e e-tšim-eθi
1 pl e-fa-ɣame e-tšim-eθame
2 pl e-fa-ete e-tšim-eθete
3 pl e-fa-ɣane e-tšim-eθane
5.2.4. Future
There is no morphological future tense in Romeyka. Future is expressed peri-
phrastically by means of the particle na (25a) and the present stem, like in Medie-
val Greek and unlike in SMG, which uses the particle θa. Na is, apart from being
a future marker, also used as a subordinating conjunction after volitives and other
verbs (Sitaridou 2014a; see §6.5.1.3). We nevertheless gloss it with fut through-
out, although this is not always an accurate reflection of the function. For RSür as
spoken in Beşköy, Özkan (2013: 147) also mentions a preverbal particle ha used
to form future in negated sentences (cf. 25b) and interrogatives.
a. Sabahle na pame so dyɣun.
tomorrow fut go.1pl to.the wedding
‘Tomorrow we will go to the wedding.’
b. Utše na pa(ɣ)o
not fut go.1sg
‘I will not go.’
5.3. Aspect
Perfective and imperfective aspect are in Romeyka only realized in the past tense
indicative, i. e. by the aorist and the imperfect respectively (Mackridge 1987;
Sitaridou 2014a). Present and future have no morphological aspectual distinctions
(cf. 26).
(26) (Özkan, n.d.; RSür as spoken in Beşköy, glosses mine)
t=ospi tamir na inete
the=house repair fut become.3sg
‘the house will have been repaired’
18
Not attested in my data.
Romeyka 915
Progressive aspect seems to be realized by two strategies: (i) by the invariable

auxiliary eš(i) + a finite verb form of the present stem for present (27a) and in the
imperfect for past (Özkan 2013). Istanbulite ROf applies the aorist form instead
(27b). Invariable eš(i) resembles the 3sg form of the verb ‘to have’ eš, and could
therefore be a frozen grammaticalized form of ‘have’ (see also Drettas 1997: 334).
Note the use of ‘have’ as an auxiliary for progressive aspect in vernacular Persian
(see Paul, this volume, chapter 4.6).
a. Aso pazar eš erxume.
from.the market aux come.prs .1sg
‘I am coming from the market.’
b. I Ayše aso Istanbol eš erte.
the Ayse from.the Istanbul aux come.aor .3 sg
‘Ayse has been coming from Istanbul.’
In PG, eš + k ‘and’ (tše in Romeyka) + present/imperfect is used to express explicit
processes which are near to completion (Drettas 1997).
(ii) Another periphrastic construction consists of a finite verb + coordinative
tše ‘and’ + finite steko/stekume ‘stand’ or kahome ‘sit’. Both auxiliaries seem to
be used interchangeably whereby steko and stekume seem to constitute a mixed
paradigm in themselves (see Table 10).
Table 10: Periphrastic progressive forms with kahome ‘sit’ and steko/stekume ‘stand’ on
the example of tšimume ‘sleep’19
kahome ‘sit’ steko/stekume ‘stand’
‘sleep’ + ‘and’ + ‘sit’ ‘sleep’ + ‘and’ + ‘stand’
1 sg tšimume tše kahome tšimume tše steko/stekume
2 sg tšimase tše kahese ?
3 sg tšimate tše kahete tšimate tše stitš
1 pl tšimumist tše kahomist ?
2 pl tšimaste tše kahesten ?
3 pl tšimun tše kahontane tšimun tše stekontane/stekune
In the past, both verbs are in the imperfective stem (Özkan 2013); cf. Istanbulite
ROf in (28) which uses the aorist with the present form of the auxiliary steko.
According to Özkan (2013: 148), the construction is used especially for
“longer lasting” progressive actions (see ex. 29) and possibly replicates a Turkish
periphrastic progressive -Ip + dur-, like in çalışıp duruyorum ‘I keep working’ (see
19
For the forms of tše steko/stekume marked by a question mark no information was
available in my data.
also Dawkins 1916: 199 who reports the same construction with ‘stop’ and ‘lie’ in
other AMG varieties).
to vakit erθe (tše stitš) n=efta
the time come.aor .3 sg (and stand.prs .1sg ) fut =make.1sg
to faji
the food
‘The time has come, I will prepare the food.’
So mutfak tšališevo tše steko.
in.the kitchen work.1sg and stand.1sg
‘I am working in the kitchen.’
According to Sitaridou (2014b: 44), the construction with steko bears inchoative
aspect though without a progressive function (cf. Drettas 1997: 336).
5.3.1. Voice
The details of passive morphology in contemporary Romeyka are yet to be estab-
lished. Historically, the categorization of verbs in two classes approximately cor-
responds to a voice distinction: verbs in -o are called actives whereas verbs in -me
are called (medio)passives (Drettas 1997: 205). Whereas ROf seems to lack pas-
sives in general, RSür allows passives (Michelioudakis and Sitaridou 2012b: 236).
The syntax of datives under passivization in RSür has been investigated by Miche-
lioudakis and Sitaridou (2012b). They note that a theme argument can be a regular
subject of a passive verb (30–31a), while a benefactive or recipient cannot advance
to subject under passivization (31b):
To harti eɣrafte tin Aiše
the.nom letter.nom written.pass .3 sg the.acc Aise.acc
‘The letter was written for Aise.’
a. I para tin Aiše eðoste.
the.nom money.nom the.acc Aise.acc given.pass .3 sg
‘The money was given to Aise.’
b. *I Aiše eðoste tin paran
the.nom Aise.nom given.pass .3 sg the.acc money.acc
‘Aise was given the money.’
Romeyka 917
5.3.2. Mood
Romeyka exhibits four moods: Indicative (tros ‘you eat’) and imperative (fa
‘Eat!’) are morphologically marked on the verb whereas subjunctive (na troɣo ‘I
should eat’) and optative (as troɣo ‘let me eat’) are formed by the particles na and
as, respectively, followed by the finite verb (Sitaridou 2014a).
The imperative covers the second persons and differs according to verb class:
verbs in -o such as tro(ɣ)o ‘eat’, fa ‘eat.imp .2 sg ’, fate ‘eat.imp .2 pl ’; verbs in -me
such as tšimume ‘sleep’, tšimeθ ‘sleep.imp .2 sg ’, tšimeθisten ‘sleep.imp .2 pl ’ (see
Drettas 1997: 227–232). Negation is formed by the negation particle mi and – at
least in Istanbulite ROf – the present indicative of the second persons, e. g. mi tros
‘neg eat.prs .2 sg ’. Thus ROf neutralizes the imperative/indicative distinction under
negation. Some verbs in Romeyka retain the ancient imperative ending (-s)on,
e. g. akuso(n) ‘listen’ (Bortone 2009: 84), but also other archaic imperatives such
as ipe ‘say’(Mackridge 1987: 125). On the syntax of modality see §6.3.
5.4. Finiteness
Mackridge (1987: 127, Fn. 17) considers that Romeyka lacks non-finite forms of
the verb, based on the assumption that finiteness equates to “indexing person”.
However, the infinitive found in certain varieties of Romeyka seems to run counter
to this (32) (see Sitaridou 2014b and the discussion in §6.5.2.1). Although there
appears to be no gerund in Romeyka (Sitaridou 2014a), Özkan (n.d.) provides exam-
ples from RSür resembling adverbial past participles which are formed by the suffix
-ta appended to the perfective stem, e. g. ɣelax-ta erθame ‘we came l aughing’.
(32) (Sitaridou 2014a: 122; ROf, glosses modified)
Utš eporesa tšimiθini.
not can.pst .1 sg sleep.inf
‘I could not sleep.’
5.5. Verbal derivation

This section is based on Özkan (2013: 147). Verbal derivation is productive in
Romeyka. A variety of derivational affixes is used for verbalization, causativiza-
tion, or integration of Turkish loans. A root stem to which derivational suffixes
are attached forms the verbal base on which the inflection suffixes are attached. A
single affix can have several functions. For example, the suffix -(i)zo is used: (i) for
verbalization of nouns, e. g. fos ‘light’ → fosizo ‘to be lit/to shine’; (ii) to increase
transitivity or as a causative marker, e. g. maθano ‘to learn’ → maθizo ‘to teach’;
(iii) for the formation of onomatopoetic verbs such as tsatsalizo ‘to weed’; (iv) to
grammatically adapt Turkish verbs like in dajanizo ‘to endure’ from Tr. dayan- ‘to
endure’.
This range of functions matches very closely the and suffix of Kurmanji Kurdish,
discussed in Haig (this volume, chapter 2.3). Another frequent suffix used for inte-
grating Turkish loans is -evo, e. g. tšališevo ‘to work’ from Tr. çalış- ‘to work’.
6. Syntax
6.1. Syntax of the noun phrase

Noun phrases (henceforth NPs) in Romeyka are head-final:
(33) (Constructed example)
o emorfos o peðas
the.m handsome.m the.m boy.m
‘the handsome boy’
The definite article co-occurs with all nouns, including proper nouns, and co-oc-
curs with both demonstratives, and possessives (34). Quantifiers are generally
treated like adjectives and thus require their own article (35).
tatinuna to xorio
3 pl the village
‘their village’
T=ulinuna t=ospidæ mikra ine.
the=all the=houses small be.3 pl
‘Everybody’s houses are small.’
Adjectives in Romeyka precede the noun they modify, and generally agree with
their heads in person and gender, though there may also be semantic agreement
(Michelioudakis and Sitaridou 2012a: 366, Fn. 3). In contrast, Özkan (2013: 145)
suggests that syntactic agreement is not consistent with adnominal adjectives (36),
while predicatively used adjectives exhibit semantic agreement if the noun denotes
a human referent (at least this is how his example can be interpreted). There is
obviously dialectal variation here, but the general tendency for the neuter form to
expand its range appears to be valid here (cf. Karatsareas 2011, 2014).
(36) (Özkan 2013: 145; RSür)
do galo i mana
the.n good.n the.f woman.f
‘the good woman’
Furthermore, a feminine animate noun may be modified by one neuter and one
masculine adjective (Mackridge 1987: 128).
Romeyka 919
6.2. Prepositions and prepositional phrases

Romeyka is a prepositional language. The local prepositions s ‘to, in’ and as
‘from’ merge with the definite articles; s > to, s=o ‘in the’. Most of the prepo-
sitions require accusative case, though the local prepositions may also govern a
genitive-NP (Özkan 2013: 145). Note that in Istanbulite ROf the local prepositions
seem not to take accusative case, but rather the nominative. Me and ja become met
and jat when their complement is a pronoun. For an overview of some prepositions
and their function see Table 11.
Table 11: Prepositions in Romeyka

Preposition Function Equivalent
os temporal until
prin temporal before
s locative in, to
as ablative from, of, by, from … on, since
ja purpose for (RSür)
ðæ purpose for (the sake of) (ROf)
me instrumental with, through, by
6.3. Negation and modality

This section is based on Sitaridou (2014a) on modality in Romeyka. Romeyka
expresses negation by the use of negators with rich allomorphic and cross-dialectal
variation, whereby the allomorphic distribution of the negators correspond to that
of AG (Sitaridou 2014b). The following four negators can be distinguished: (i) utš
for sentential negation in indicative sentences (37a), (ii) mi(n) in imperatives and
wishes (37b), (iii) xe in subjunctives (37c), (iv) mutš in counterfactuals (Sitaridou
2014a: 121; see Table 12 for an overview).
a. Utš eɣabo na troɣo.
neg want.1 sg fut eat.1 sg
‘I don’t want to eat.’
b. Esist mi trote, kalo utš en.
2 pl neg eat.imp .2 pl good neg be.3 sg
‘Don’t eat [that], [it] is not good.’
(Sitaridou 2014a: 123; ROf, glosses modified)
c. Esi θelis eɣo xe na troɣo
you.nom want.2sg I.nom not fut eat.1sg
‘You don’t want me to eat.’
Utš is derived from the ancient pre-verbal negative particle ούκ. Its phonologically
conditioned allomorphic forms are presented in Table 12.
Table 12: Negators in Romeyka (adapted from Table 1 in Sitaridou 2014a: 121)

Negator Conditions Example
utš utš phonologically conditioned utš eɣapo ‘I don’t like’
variation (see Sitaridou
tš tš alo ‘not more’
2014a: 121 for details)
tši tši pao ‘I don’t go’20
u u θelo ‘I don’t want’
utše utše na troɣo ‘I wouldn’t eat’
mi(n) imperatives; wishes mi trote ‘Don’t you eat!’
xe subjunctives (see 37c)
mutš counterfactuals na mutš ixa xasini ‘If I had not lost’21
Contact with Turkish affected the responsive particles used in ROf as spoken in
Uzungöl (Sarachos): whereas Greek ne is used for ‘yes’, ‘no’ is expressed by the
Turkish yok (Mackridge 1987: 134).
The periphrastic subjunctive formed by the particle na and the present stem
of the verb (or the imperfect in a na-clause governed by an aorist verb, Sitaridou
2014a) is used for a variety of constructions, such as volitionals (discussed in
§6.5.1.3 on na-clauses) and conditionals (discussed in §6.8). There is no aorist
subjunctive (but cf. Dawkins 1937 who claims the aorist subjunctive survived in
the PG of Christians from Samsun). Na as a future marker often also seems to
imply a modal meaning (38).
ðo=me nero, na pino.
give.imp .2 sg =1 sg water fut drink.1 sg
‘Give me water, (that) I may drink.’
Optative mood is formed by the particle as and the verb in present tense. It is used
in voluntatives (39) and counterfactuals (40).
Elate, as troɣume!
come.imp .2 pl opt eat.1pl
‘Come, let us eat.’
20
Sitaridou (2014a: 130).
21
Chatzopoulou and Sitaridou (2014, ex. 7).
Romeyka 921

As eftaɣo=se faji, ama utše na troɣo ipes me.
opt make.1 sg =2sg food but neg fut eat.1sg say.2 sg 1sg
‘I had made you food, but you said to me you wouldn’t eat it.’
Romeyka allows for embedded imperatives as complements to verbs of saying
(Sitaridou 2014a) such as in (41) although it is not clear to what extent this con-
struction differs from indirect speech. The order of the subordinate imperative and
the main verb are variable.
O Alis esenan yrapson ipen
the Alis.nom you write.imp .2 sg tell.pst .3 sg
‘Alis ordered you to write.’
6.4. Word order in independent clauses

It remains controversial whether the basic word order in Romeyka is predom-
inantly head-final (42) though permitting VO order (see for example ex. 25a)
(Michelioudakis and Sitaridou 2012b: 216) or whether Romeyka’s canonical order
is SVO, with OV (see for example ex. 27) resulting mostly from focus fronting and
interrogative clauses, as well as from Turkish contact (Neocleous 2017).
(42) (Michelioudakis and Sitaridou 2012b: 216; ROf)
škilon exo
dog.acc have.1sg
‘I have a dog.’
6.4.1. Ditransitives and causative constructions

This section is based on the description of ditransitives and benefactives in
Romeyka in Michelioudakis and Sitaridou (2012b: 231–236). In ditransitives,
recipients are accusative and do not alternate with prepositional phrases (43). Both
orders of direct and indirect object seem to be acceptable, though direct object –
indirect object order appears to be preferred (see Michelioudakis and Sitaridou
2012b).
(43) (Micheloudakis and Sitaridou 2012b: 231; RSür, glosses modified)
To peði eðotše fai ton aðelfo. / *son aðelfo.
the child gave.3sg food the.acc brother.acc / *to.the brother
‘The child gave food to the brother.’
In benefactives, again both word orders are possible (44). Benefactives may alter-
nate with prepositional phrases headed by ðæ ‘for’ (ROf) or ja ‘for’ (RSür).
(44) (Micheloudakis and Sitaridou 2012b: 235; RSür, glosses modified)

Aiše epitše to Mehmet pide /
Aise made.3sg the.acc Mehmet.acc pie.acc /
pide to Mehmet.
pie.acc the.acc Mehmet.acc
‘Aise baked Mehmet a pie.’
Causatives may be expressed periphrastically using a matrix verb ‘let, allow’, and
a complement clause introduced by na (45) (Sitaridou 2014a: 125). Apart from
this, causativity in Romeyka awaits further investigation.
Efikane=sas na skaftete ta xorafæ=suna.
let.pst .3 pl =you fut dig.2pl the fields=his
‘They let you dig his fields.’
6.5. Subordinate clauses

Romeyka exhibits several different strategies for subordination, including finite
and non-finite verbal forms (Sitaridou 2014a, 2014b). The only complementiser
in regular use is the all-purpose modal particle na (see §6.5.1.3; §5.2.4). In what
follows, a broad distinction is drawn between finite and non-finite subordination
distinguishing several sub-types.
6.5.1. Finite subordination
6.5.1.1. Relative clauses

There is little information to date on Romeyka relative pronouns (see §4.2.4 above),
except Gandon (2016: 219–223). Drettas (1997: 365) notes that relativizers in PG
may be analysed as connectors rather than subordinating pronouns. Relativizers in
Romeyka are in general preverbal and different strategies of relativization seem
to apply according to the role of the head noun of the relative clause. Invariable
relativizers are the following, although relative clauses without relativizer are pos-
sible, too (see Gandon 2016):
(i) op (possibly > AG όπου, Gandon 2016: 221) occurs in prenominal relative
clauses, immediately preceding the relative clause verb. It may be used for rela-
tivizing the subject of the relative clause, and can be used as a free relative, as in
(46).22
22
Gandon (2016: 223, Fn. 130) notes the possibly nominalizing function of the free rela-
tivizers. If analysed as a nominalization, the gloss should indicate nmlz . acc .
Romeyka 923

Aso Katoχor op erθe iða=na
from.the Katochor rel came.3sg saw.1sg =3sg
‘I saw the one that came from Katochor.’
(ii) Though likely, it is not fully clear whether p(i)/p(u) (Özkan), pe (> AG όπερ,
Sitaridou 2014b: 30) derive from the same root as op; a clitic p= is also found in
relative clauses:
(47) (Gandon 2016: 222, citing Neocleous, in preparation; Romeyka, glosses
modified)
Opse iða aʎis p=epiren iɲeka
yesterday saw.1sg Alis.nom rel =got.married.3sg woman.acc
‘Yesterday I saw the woman who Ali married.’23
The comparative construction in (48) also exhibits p=, though here it is not attached
to a verb. This kind of construction suggests that p= may also function as a general
nominalizer (‘that of you’). However, this is uncertain; it may also be analysed as
part of the preposition, or it may have contracted with the preposition aso ‘from’
(for an alternative analysis see ex. 69 in §6.7.2).
Temetero t=ospi a=p=esetero trani-dero en
1 pl . poss the=house.nom from=rel =2 pl . poss big-cmp be.3sg
‘Our house is bigger than yours.’; lit. ‘…than that of you.’
Romeyka uses a typologically uncommon strategy for relativization, involving
prenominal, but finite relative clauses (Gandon 2016: 220, citing Neocleous, in
preparation) (49b). Right-branching structures (postnominal relative clauses) may
be possible, too (49a), presumably a reflex of the type widely-attested in AG, and
still maintained in SMG. The prenominal relative clauses (for example exs. 50, 51)
in Romeyka are probably contact-induced by Turkish prenominal relative clauses.
There are also evident similarities to geographically adjacent Laz, which has
prenominal relativization introduced by preverbal subordinators (Lacroix, this
volume, chapter 6.2, §6.1).
(49) (Gandon 2016: 220, citing Neocleous, in preparation, glosses
modified)
a. i patsi [azi maziran d=erθen]
the.nom girl.nom from.the Mazira.acc rel =came.3sg
d=emon d=anepsin eton
the=my.nom the=niece.nom be.ipfv .3 sg
23
I interpret (47) as shown, but Gandon’s translation is slightly different.
b. [asi maziran d=erθen] i patsi

from.the Mazira.acc rel =came.3 sg the.nom girl.nom
d=emon d=anepsin eton
the=my.nom the=niece.nom be.ipfv .3 sg
‘The girl who came from Mazira is my niece’
In Istanbulite Romeyka, relativization may lack an overt relativizer if the head noun
is present (50), and there appears to be a clitic resumptive pronoun on the relative
clause verb (=na). In free relatives, a relativizer op is used (51–52) (see also Gandon
2016: 222 for free relatives), and with a spatial complement, the relativizer (n)do
may be found (53). The verb in the relative clause appears to inflect for subject
number, but we lack detailed information on the kinds of verb forms used here.
Aso Katoχor erθe o Mehmetis iða=na
from.the Katochor came.3sg the Mehmet saw.1sg =3sg
‘I saw Mehmet who came from Katochor.’
Aso Katoχor op erθe iða=na
from.the Katochor rel came.3sg saw.1sg =3sg
‘I saw the one that came from Katochor.’
So Istanbul op ine ul o Rumdža u=kserone.
in.the Istanbul rel be.3pl all the Rumca neg =know.3pl
‘Those (being) in Istanbul all don’t know Rumca.’
Sin džami etšika ndo en t=ospi temo en.
at.the mosque down_there rel be.3sg the=house 1 sg . poss be.3sg
‘The house at the mosque is mine.’
6.5.1.2. Finite complement clauses in verbs of saying

Subordinate clauses introduced by verbs of saying have a finite verb form in the
dependent clause which is either center-embedded as in relatives and embedded
imperatives (54), (55), or post-posed (56), (57). Note that all the available examples
could be interpreted as direct speech (cf. the second-person pronouns in the subor-
dinate clause); to what extent direct and direct speech are differentiated syntacti-
cally remains an open question. With verbs of saying, in general no complementis-
ers are used (Sitaridou 2014a: 128) but note the use of ki as complementiser in (57).
Romeyka 925
(54) (Özkan, n.d.; RSür, glosses mine)

Eɣo nt=eɣapesa=se son kosmo ipa.
I that=loved.1 sg =you to.the.acc world.acc told.1 sg
‘I have told the world that I loved you.’
O Alis esenan ɣrapson ipen.
the.nom Alis.nom you write.imp .2 sg told.3 sg
‘Alis ordered you to write.’
Tin patsi=m ipa epero=se ɣalemi.
the.acc daughter.acc =my told.1 sg take.1sg =you pencil.acc
‘I told my daughter that I would buy her a pencil.’
Eɣo ton džirim=m ipa ki bola eɣabo=se.
I the.acc father.acc =1 sg told.1 sg comp much love.1 sg =you
‘I told my father that I love him much.’
6.5.1.3. na-Clauses
This description of na-clauses is based on Sitaridou (2014a: 123–126). Apart from
its function as a future marker, na is used as a subordinating conjunction (Mack-
ridge 1987: 130). In comparison to SMG, its use in Romeyka is more restricted:
na occurs in nonveridical predicates such as volitionals (58a) and negated present
tense modals (58b), as well as in causatives (see ex. 45), mental perception verbs
such as enespala ‘I forgot’ (58c) and erte so tšefali=m ‘it came to my mind’, and
emotive verbs (58d) (Sitaridou 2014a).
(58) (Examples from Sitaridou 2014a: 123–126; ROf, glosses modified)
a. Utš eθelna n’ emaireva
not want.ipfv .1 sg fut cook.ipfv .1 sg
‘I didn’t want to cook.’
b. U poro n’ almeɣo.
not can.1sg fut milk.1sg
‘I cannot milk (the cows).’
c. Enespala na leɣo ti mami
forget.pst .1 sg fut say.1sg the grandmother.acc
ta xaberæ.
the.acc news.acc
‘I forgot to tell the news to the grandmother.’
d. Exara na mairevo.
be_happy.pst .1 sg fut cook.1sg
‘I was happy I had cooked.’
Note in (58a) that volitional θelo only requires a na-clause as complement when
negated. This does not hold true for the volitional aɣapo ‘I love/like’ which always
requires a na-clause (59) (Sitaridou 2014a: 124).
Eɣo esena eɣabo ne lepo.
I you like.1 sg fut see.1sg
‘I want you to see (that).’
6.5.2. Non-finite subordination

6.5.2.1. Infinitives
According to Sitaridou (2014a: 122), the infinitive in Romeyka is defined as not
bearing any agreement features, and as not occurring independently, but only as
the complement of a superordinate syntactic construction (Özkan 2013: 149). Nor-
mally, the infinitive consists of the aorist stem followed by the infinitival ending
-ini which can also bear the passive stem marker -θ-. The infinitive may have a
distinct nominative subject such as in inflected and personal infinitives (see Sitar-
idou 2014b). Furthermore, it can take either a predicate or a DP as a complement,
it can be coordinated, and it can be modified by adverbs (Sitaridou 2014a: 130).
Infinitives in Romeyka occur (i) in negated past tense modals (60a); (ii) as
complements to the negated past tense volitional utš eθelesa ‘I didn’t want’ (60b);
(iii) in prin ‘before’-clauses (60c) but not with other prepositions such as os ‘until’;
(iv) in counterfactuals such as wishes, exclamatives, and conditionals as a comple-
ment of ‘have’ (60d) (Sitaridou 2014a, 2014b). Unlike in SMG, the infinitive in
Romeyka does not occur in: (i) positive past tense modals, (ii) negated past tense
modals, and (iii) modals in questions (Sitaridou 2014a: 119). Sitaridou (2014a)
argues that the Romeyka infinitive behaves like a negative polarity item which is
licensed by antiveridicality.
(60) (Sitaridou 2014a: 126, 120; exs. 25, 26, 9a, 9b, respectively; ROf,
glosses modified)
a. Utš eporesa tšimiθini.
not can.pst .1 sg sleep.inf
‘I could not sleep.’
b. Utš eθelesa mairepsini.
not want.pst .1 sg cook.inf
‘I didn’t want to cook.’
Romeyka 927
c. Prin spudzisini so mandarin, tsi pao.

before clean.inf at.the barn not go.1sg
‘I am not leaving before I clean the barn.’
d. As išen portpatesini sa rašia.
opt have.pst .3 sg walk.inf to.the mountains
‘S/He should have walked in the mountains.’
The syntactic constructions in which the infinitive occurs vary between dialects
(Mackridge 1999: 102–103). For example, in RSür as spoken in Beşköy, the infin-
itive is not used in prin ‘before’-clauses (Özkan 2013: 149). Furthermore, in this
variety, the infinitive is inflected by active past personal endings added to the
infinitive when occurring after the negated past tense of θelo ‘I want’ and poro ‘I
can’, like in utš eθelesa porpatesna ‘I did not want to walk’ (Özkan 2013: 148).
Finally, inflected infinitives occur as a strategy of nominalization whereby they
are complements to (a) aspectuals such as epiturepsa ‘I finished’, and (b) verbs of
mental perception such as enespala ‘I forgot’ (61). The nominalized infinitive is
used with a, possibly obligatory, complex possessive (e)muneθe which may be a
calque from Turkish nominalizations of the type oku-ma-sı-nı ‘read-inf -3 sg . poss -
acc ’ (Sitaridou 2014a: 130). Furthermore, nominalization by means of inflected
infinitives before adjectives are formed as in (62, cf. also ex. 9) (Sitaridou
2014b: 42).
To tšimiθin=emun-eθe enespala.
the sleep.inf =our-its forget.pst .1 sg
‘I forgot to sleep.’
(62) (Sitaridou 2014b: 42, Romeyka, glosses modified)
Afti i ðulian to maθini=mu ɣola en
this.nom the.nom job.nom the learning=my easy be.3sg
‘It is easy for me to learn how to do this job.’
6.5.2.2. Deverbal nouns

Along with infinitives, Sitaridou (2014a) also recognizes verbal nouns. They may
be used (a) as complements to volitionals such as θelo ‘I want’ (63a), and (b) as
a complement to the phase modal epašlaepsa ‘I started’ (63b) which requires a
deverbal noun introduced by the preposition so ‘to the’ (Sitaridou 2014a: 130).
(63) (Sitaridou 2014a: 131, 130, respectively; ROf, glosses modified)

a. to peði=m to panimon aso xorion θelo.
the child=my the going from.the village want.1sg
‘I want my child to leave the village.’
b. Epašlaepsa pola so ðipsasimo.
start.pst .1 sg lot to.the drinking
‘I started to get very thirsty.’
6.6. Clausal complements lacking an overt complementiser

The following is based on Sitaridou’s (2014a: 127–129) discussion of null com-
plementisers. Several types of complement clause occur without an overt comple-
mentiser. These include the complements of (i) perception verbs (64a); (ii) some
emotive verbs such as efoveθa ‘I feared’ (64b) (but not in all emotive verbs; others
are expressed by na-clauses); (iii) epistemic predicates (64c); (iv) verbs of saying
(see ex. 56 in §6.5.1.2).
(64) (Sitaridou 2014a: 127, 128; exs. 27, 30, 33 respectively; ROf,
glosses modified)
a. Eɣo ekusa o tšopanon ton arko
I.nom hear.pst .1 sg the shepherd.nom the.acc wolf.acc
endoke.
kill.pst .3 sg
‘I heard that the shepherd killed the wolf.’
b. Efoveθa xanis ton para=s.
fear.pst .1 sg lose the.acc money.acc =your
‘I feared you may lose your money.’
c. θaro xastas en.
think.1sg sick be.3sg
‘I think s/he is sick.’
6.7. Adverbs and adverbial clauses
6.7.1. Adverbial clauses of time and purpose

Sitaridou (2014a) discusses the use of the infinitive in prin ‘before’-clauses (see
ex. 60c in §6.5.2.1). Other temporal adjuncts, however, do not select an infinitive
and require rather a na-clause (65). Temporal adverbial clauses are introduced
by an adverbial subordinator which appears at the beginning of the subordinate
clause, e. g. anda ‘when’ or os ‘until’ (65, 66).
Romeyka 929

Os na ‘rte o Mehmetis na permeno=se.
until fut come.3sg the.nom Mehmet. nom fut wait.1sg =you
‘I’ll wait for you until Mehmet comes.’
Anda pašlaevis so fanimon os na piturevis ata,
when start.2sg at.the eating.acc until fut finish.2sg them.acc . cl
u poris na stetšis.
not can.2sg fut stop.2sg
‘Once you start eating, you can’t stop until you finish all of it.’
Adjuncts of purpose after verbs of motion show nominalization (67) (Sitaridou
2014b: 41).
ðæ temon t’ erθan=im, xavitsin eftes me.
for my the coming=my pudding.acc make.2sg me.acc . cl
‘For the sake of my coming, you will make me some pudding.’
6.7.2. Comparative Constructions

Comparative constructions in Romeyka have not been previously described. The
standard of comparison seems to be introduced by the comparative particle as
which has a locative function (68) (or ap, especially with anaphors (69), Drettas
1997). There is no case marking neither on the standard NP nor on the comparee
NP.
(68) (Özkan, n.d.; RSür, glosses mine)
o Mehmetis aso Osmanis alo tranos en.
the Mehmet from.the Osman more big be.3sg
‘Mehmet is bigger than Osman.’
Temetero to χorio ap=esetero emorfi-tero en.
our the village prt =your beautiful-cmp be.3sg
‘Our village is more beautiful than yours.’
While (68) from RSür forms the comparison periphrastically with alo ‘more’, ROf
uses the invariable(?) verbal comparative suffix tero (69) which is derived from
AG verbal stem+o+τερ+inflectional ending. Furthermore, in Istanbulite Romeyka
Turkish fazla ‘more’ is borrowed (70).

Talo fazla tar en.
one more narrow be.3 sg
‘One is narrower.’
The superlative is built periphrastically by olon ‘all’ (71). There seems to be var-
iation with regard to as to whether the preposition as ‘from’ is used before olon.
Olon emorfo o=spi temo en.
all beautiful the=house 1sg . poss be.3 sg
‘The nicest house is mine.’

Table 13 (based on Table 3 in Sitaridou 2014a: 122) summarizes some types of
conditionals in ROf. See (72) for an example of a simple conditional clause.
Table 13: Conditionals in ROf (adapted from Table 3 in Sitaridou 2014a: 122)

If-clause Matrix clause
Nonveridical Possible na + present na + present (=future)
an + present
Irrealis n’ + imperfect n’ + imperfect
an + imperfect
Antiveridical na/ŋ’ + ixa ‘I had’ + infinitive
(counterfactual)
an + ixa ‘I had’ + infinitive ixa ‘I had’ + infinitive
The examples in (73) and (74) show some of the micro-variation in Romeyka
conditionals. In RSür as spoken in Beşköy, the irrealis construction requires an
inflected infinitive with an uninflected auxiliary. Sitaridou (2014a: 55) notes the
possible emergence of a new invariant modality marker ixe from a former auxil-
iary.
(72) (Sitaridou 2014a: 141; ROf)
An eporo, mairevo
if can.1sg cook.1sg
‘If I can, I cook.’
An ixa mairepsini, n‘ etroɣame
if had.1sg cook.inf fut eat.ipfv .1 pl
‘If I had cooked, we would have eaten.’
Romeyka 931
(74) (Sitaridou 2014b: 49; RSür, glosses modified, translation from Sitaridou)

Na ixa mairepsina etroɣamen
fut had.1sg cook.aor . inf .1 sg eat.ipfv .1 pl
‘If I had cooked, we would eat.’
Abbreviations
acc accusative n neuter

aor aorist neg negation
aux auxiliary nmlz nominalization
cl clitic nom nominative
cmp comparative opt optative
comp complementiser pass passive
def . art definite article pl plural
dem demonstrative poss possessive
f feminine prs present
foc focalizer prt particle
fut future pst past (this term subsumes e. g.
gen genitive “past perfect” of Sitaridou
hum human 2014a)
imp imperative q question particle
inf infinitive rel relativizer
ipfv imperfective sg singular
m masculine 1, 2, 3 1st, 2nd, 3rd person
References
Andrews, Peter A. 1989. Ethnic groups in the Republic of Turkey. Compiled and edited with
the assistance of Rüdiger Benninghaus. Wiesbaden: Reichert.
Bilici, Faruk. 2011. Que reste-t-il de la langue et de la culture grecques sur les côtes turques
de la mer Noire? Cahiers Balkaniques 38–39. 325–342.
Bortone, Pietro. 2009. Greek with no history, no standard, no models: Muslim Pontic
Greek. In Alexandra Georgakopoulou & Michael Silk (eds.), Standard Languages and
Language Standards: Greek, Past and Present, 67–89. London: Ashgate.
Brendemoen, Bernt. 2002. The Turkish dialects of Trabzon: Their phonology and historical
development. Wiesbaden: Harrassowitz.
Brendemoen, Bernt. 2006. Aspects of Greek-Turkish language contact in Trabzon. In Hen-
drik Boeschoten & Lars Johanson (eds.), Turkic Languages in Contact, 63–73. Wies-
Chatzopoulou, Katerina & Ioanna Sitaridou. 2014. Jespersen’s cycle for NEG2 and condi-
tional inversion in the history of Greek: Evidence from Romeyka conditionals. Paper
presented at the 16th Diachronic Generative Syntax Conference, Research Institute for
Linguistics, Hungarian Academy of Sciences, Budapest, 3–5 July, 2014. http://www.ny-
tud.hu/digs16/absz/chatzopoulou_sitaridou_abstract.pdf (accessed 13 November 2017).
Corbett, Greville G. 2006. Agreement. Cambridge: Cambridge University Press.

Dawkins, Richard M. 1916. Modern Greek in Asia Minor: A study of the dialects of Silli,
Cappadocia and Pharasa with grammar, texts, translations and glossary. Cambridge:
Dawkins, Richard. M. 1931. Notes on the study of the Modern Greek of Pontos. Byzantion 6.
389–400.
Dawkins, Richard M. 1937. The Pontic dialect of Modern Greek in Asia Minor and Russia.
Transactions of the Philological Society 36. 15–52.
Deffner, Michael. 1878. Die Infinitive in den pontischen Dialekten und die zusammenge-
setzten Zeiten im Neugriechischen. Monatsberichte der Königlich Preussischen Akad-
emie der Wissenschaften zu Berlin, 191–230. Berlin: Buchdruckerei der Königlichen
Akademie der Wissenschaften.
Drettas, Georges. 1997. Aspects pontiques. Paris: Association de Recherches Pluridisci-
plinaires.
Drettas, Georges. 1999. Το ελληνο-ποντιακό διαλεκτικό σύνολο [The Greek-Pontic dialect
group]. In Anastasios-Foivos Christidis, Maria Arapopoulou & Giannoula Giannou-
lopoulou (eds.), Διαλεκτικοί θύλακοι της ελληνικής γλώσσας [Dialect enclaves of the
Greek language], 91–100. Athens: Ministry of National Education and Religious Af-
fairs/Centre for the Greek Language.
Gandon, Ophélie. 2016. La relativisation dans une perspective aréale: l’aire Caucase – Ana-
tolie de l’Est – Iran de l’Ouest. Paris: Université Sorbonne Nouvelle dissertation.
Janse, Mark. 2004. Animacy, definiteness, and case in Cappadocian and other Asia Minor
Greek dialects. Journal of Greek Linguistics 5. 3–26.
Janse, Mark. 2008. Clitic doubling from Ancient to Asia Minor Greek. In Dalina Kallulli &
Liliane Tasmowski (eds.), Clitic Doubling in the Balkan Languages, 165–202. Amster-
dam & Philadelphia: Benjamins.
Karatsareas, Petros. 2011. A study of Cappadocian Greek nominal morphology from a di-
achronic and dialectological perspective. Cambridge: University of Cambridge disser-
tation.
Karatsareas, Petros. 2014. On the diachrony of gender in Asia Minor Greek: The develop-
ment of semantic agreement in Pontic. Language Sciences 43. 77–101.
Lewis, M. Paul, Gary F. Simons & Charles D. Fennig (eds.). 2016. Ethnologue: Languages
of the World, 19th edn. Dallas: SIL International. http://www.ethnologue.com (accessed
29 March 2016).
Mackridge, Peter. 1987. Greek-Speaking Moslems of North-East Turkey: Prolegomena to
study of the Ophitic sub-dialect of Pontic. Byzantine and Modern Greek Studies 11.
115–137.
Mackridge, Peter. 1999. The Greek spoken in the Region of Pontus. In Anastasios-Foivos
Christidis, Maria Arapopoulou & Giannoula Giannoulopoulou (eds.), Διαλεκτικοί
θύλακοι της ελληνικής γλώσσας [Dialect enclaves of the Greek language], 101–105.
Athens: Ministry of National Education and Religious Affairs/Centre for the Greek
Language.
Michelioudakis, Dimitris & Ioanna Sitaridou. 2012a. Multiple wh-fronting across Pontic
Greek dialects. In Angela Ralli, Brian Joseph, Mark Janse & Athanasios Karasimos
(eds.), Proceedings of the International Conference of Modern Greek Dialects and Lin-
guistic Theory 5(1). 353–378.
Romeyka 933
Michelioudakis, Dimitris & Ioanna Sitaridou. 2012b. Syntactic microvariation: Dative con-
structions in Greek. In Ricardo Etxepare & Beatriz Fernández (eds.), Datives in vari-
ation: A micro-comparative perspective, 212–255. Oxford: Oxford University Press.
Moseley, Christopher. 2007. Encyclopedia of the World’s Endangered Languages. London:
Routledge.
Moseley, Christopher. 2010. Atlas of the World’s Languages in Danger, 3rd edn. Paris: UN-
ESCO Publishing. http://www.unesco.org/new/en/communication-and-information/
access-to-knowledge/linguistic-diversity-and-multilingualism-on-internet/atlas-of-lan
guages-in-danger/ (accessed 17 November 2017).
Neocleous, Nicolaos. 2017. The reconstruction of VO and OV alternation in Romeyka. Pa-
per presented at the Conference on Language Contact and Language Change in Western
Asia, Goethe University Frankfurt, 10–12 March, 2017.
Neocleous, Nicolaos. In preparation. Headedness in Romeyka: Continuity, contact and
change in the Hellenic varieties of Pontus. Cambride: University of Cambridge disser-
tation.
Oikonomidis, Demosthenes E. 1908. Lautlehre des Pontischen. Leipzig: A. Deichert’sche
Verlagsbuchhandlung Nachf.
Özkan, Hakan. 2013. The Pontic Greek spoken by Muslims in the villages of Beşköy in the
province of present-day Trabzon. Byzantine and Modern Greek Studies 37(1). 130–150.
Özkan, Hakan. n.d. Grammar sketch of Romeyka. Unpublished draft.
Parcharidis, Ioannis. 1880. Γραμματική της διαλέκτου Τραπεζουντος: Περί επιθέτων,
αντωνυμιών και βαρυτόνων ρημάτων. [Grammar of the dialect of Trabzon]. Χειρόγραφο
335, Κέντρο Ερεύνης των Νεοελληνικών Διαλέκτων και Ιδιωμάτων, Ακαδημία Αθηνών.
Parcharidis, Ioannis. 1888. Συλλoγή ζώντων μνημείων της αρχαίας ελληνικής γλώσσας εν
Όφει. [Collection of living documents of the Ancient Greek language in Ophis] O εν
Kωνσταντινoυπόλει Eλληνικός Φιλoλoγικός Σύλλoγoς 18 (1883–1884). 120–178.
“Pontic”. Catalogue of Endangered Languages. 2017. University of Hawaii at Manoa.
http://www.endangeredlanguages.com/lang/3265 (accessed 17 November 2017).
Schreiber, Laurentia. 2016. Assessing sociolinguistic vitality: An attitudinal study of Rumca
(Romeyka). In Heike Wiese, Heiko F. Marten & Oliver Bunk (eds.), Arbeitspapiere
“Sprache, Variation und Migration”: Studentische Arbeiten, Papier Nr. 4. Potsdam:
Universität Potsdam.
https://www.unipotsdam.de/fileadmin01/projects/svm/Arbeitspapiere/MA_Schreiber_
May.pdf (accessed 17 November 2017).
Schreiber, Laurentia & Ioanna Sitaridou. 2017. Assessing the sociolinguistic vi-
tality of Istanbulite Romeyka: An attitudinal study. Journal of Multilingual
and Multicultural Development 39(1). 1–16. http://www.tandfonline.com/doi/
abs/10.1080/01434632.2017.1301944?journalCode=rmmm20 (accessed 17 November
2017).
Sitaridou, Ioanna. 2013. Greek-speaking enclaves in Pontus today: The documentation and
revitalization of Romeyka. In Mari Jones & Sarah Ogilvie (eds.), Keeping languages
alive: Documentation, pedagogy, and revitalization, 98–112. Cambridge: Cambridge
University Press.
Sitaridou, Ioanna. 2014a. Modality, antiveridicality, and complementation: The Romeyka
infinitive as a negative polarity item. Lingua 148. 118–146.
Sitaridou, Ioanna. 2014b. The Romeyka infinitive: Continuity, contact and change in the
Hellenic varieties of Pontus. Diachronica 31(1). 23–73.
7. Appendix: Basic lexical items in selected
languages of Western Asia
Geoffrey Haig, Geoffrey Khan and Laurentia Schreiber
7.1. Background
The list of items is based on a list of 40 meanings (a sub-set of the Swadesh-100
list), that is used by the Automated Similarity Judgement Program (ASJP), devel-
oped by Søren Wichmann and associates (Wichmann et al. 2016). The program is
designed for automatically classifying languages according to their origins, and
for inferring dates of divergences of different groups (see Holman et al. 2011,
and accompanying peer commentaries for critical assessment of the method). The
compilation includes lists from several chapters in this volume (provided by the
respective authors), as well as lists provided from other sources. In §7.5 below, we
provide a list of sources with accompanying notes on transcriptions etc.
For some meanings, no forms were provided due to e. g. lack of a reliable
response for the item concerned. The relevant cells in the Tables have been left
empty. The list of meanings is arranged alphabetically, except for the personal
pronouns, which are grouped together at the end of the table. Following a sugges-
tion by Don Stilo, an additional four items were added to the Wichmann-list (‘cat’,
‘water-melon’, ‘frog’, ‘why?’) which are known to provide interesting lexical
isoglosses in the languages of the region.
Please note that this is primarily a compilation of lexical items, for use in e. g.
recognizing cognates, or borrowings. It does not claim phonetic accuracy or con-
sistency in the way these items are rendered in the different lists. Given the very
heterogenous nature of the sources, and the differing traditions in the different
disciplines represented in this compilation, imposing a standardized transcription
would have not have been practicable, and we have therefore left the transcriptions
in the manner they were provided to us. For further information on the sources and
conventions for the individual lists, please refer to the accompanying notes under
§7.5 below.
Behd. Gor.I Gor.II Hawr. S. Kurd. Bakht.I Bakht.II
936
w w
blood xīn xīn χ ün winī x eːn hin xin
bone hestī pēša soqan pēša soqan hast, ostoxoʋn estexun
breast sīng sinæ sīna sinæ pestoʋn, mame pestun, mame
cat pišîk pəʃi kitē pəʃi gorbe gorbe
come dihēt mayo miyo āmāy æyeː, æteː oʋeyden oʋeydan
ə
die dimirīt mamro mim ru marday æmreː morden mordan
dog se sipa sɨpæ, ɢæmal tūta sæw say sag
w
drink vedixot moro mɨ ɨru wārday æxwa xarden xardan
7.2. Iranian languages
ear guh goš guʃ goš güʃkæ guš guš

eye čav čam ʧæm čam ʧaw tey tiye
fire agir ārir awɨr ayr awɨr taš taš
fish masī māsāw masæw māsāwī mahi māi māi
frog beq qurwaʁæ qurwāqē qʊrwaqæ baq baġ
full
tijī pɨr pař pɨr por por

hand dest dast dæs das dæs dast dast
hear gulē dibīt mɨʒənawu ēga æʒnæfeː ašniden fahmidan
horn šax ʃax šāx ʃax šāx šāx
knee čok zɨrani zangoł wɨʒɨn zuni zuni
leaf belk walig bærg, glaw gałā, wałg gæɫa par pahr
liver mēlak ʤɨær yahar, jarg ʤɨeːr jiyar jiyar

louse spīh hašpšē sipeː šeš šeš
mountain čīya kūh ku(h) kaš-u-ko keːf koh koh
name nav nāmo nam namē naw esm, nom esm, nom
new nī tazæ tāza tazæ nu naʋ
night šev šaw ʃæw šawē ʃæw šaʋ šaʋ
nose difin pit pit lūta lüt noft neft
one ēk īka yæk yak yæk yak yek, yak
path rēk řēga ra řā reː rah, jahde rah, jāde
person kes kæs kas kæs ādom ādam
see dibīnīt matīto miːnu dīyay æüneː diden didan
skin cerm, cild posta pus pos pus pust pust
star stēr hastāra asaræ hasāra hæsaːræ āstāre āstāre
stone ber kamar kæmær twanī koʧik bard, kočok bard
sun roj hor xwær waratāw, war, xwær aftaʋ aftaʋ
řojyār
tongue ezman zwān zuwan zwān zwaːn zoʋn zun
tooth didan didān diyan diđān diyan dendoʋn dendun
tree dar dār dar dār, diraxt dar dār deraxt
two dū duwa du dwē dʊ do do
Appendix: Basic lexical items in selected languages of Western Asia
water av ā aw āwī aw aʋ aʋ
937
938
water - melon šitī ʃami hannī ʃami hendoʋne hendune

why ? bočî bɨla payčī buːʧæ, æra siče siče
I ez, min amin mɨ amin min mo mo
we em, me ēma imæ ēma imæ imā imā
you ( sg ) tu, te atū tʊ to tʊ to to
you ( pl ) hîn, hîngo šima ʃɨmæ šima iwæ isā šomā
Kumz.1 Oss. Casp.I Casp.II Tal.I Tal.II Tat.I Tat.II

blood xwēm tug/tog xun xun xun xun xun xun
bone xār štɐg/ɐstɐg xaš hesseka astæ xaš xæš æsdoxan
breast sīna riw/rew čučay juju pïšton* juju# čize
cat gurbağ gɐdə/tikis pičə bameši pišik puču peču kúgæ
come āmad/āmas/byāt/tī arba-sɐw-/ a/ama e/amu (v)o/omæ a/om a/ami a-/awe
ɐrba-cɐw-
die murd/murs/mur/tumr mɐl- mærd merd mard mer/mærd mær/d mer/mærd

dog sağ kʷəz/kuj səg sæk sïpæ espæ espæ æsbæ
drink šaraba gid/š. nwaž-/niwaz- xor/xord xard pe-šom xənj/xənt henj/ dær-ašow
gis/š. kē/š. tka hent
ear gōš quš/ʁos guš guš guš guš guš guš
1
See §7.5 below for selected Kumzari etymologies.

eye čum sɐšt/cɐst čušm češ čaš čem čæšm čæmm
fire ātiš art atəš tæš otæš atæš atæš ayr
fish may kɐšag, kɐf/kɐsalgɐ mayi mayi moy moy may mahi
frog šufrāqō χɐfš/χɐpsɐ γurbaγə væk væzæx guzga guzga qurbaγe
full palla zag/iʣag pər per pur pər fet perr
hand dist k’uχ, ‘arm’/k’oχ dəs dæs dast dæs dæs dæsd
hear šnuft/šnawd/šnaws/ quš-/iʁos- ištowsən šnuss mæs/æ mæs/ə mæss æšnæv
šnēw
horn qarn šək’a/šik’a šax šax šox šaγ# šax šax
knee rukbit wɐrag/zɐngɐ zonə zanu zonï, zïng zonu, zəngə zeng ærzane
leaf warq šəf/šifɐ bəlg, vəlg vælg livæ lev liv vælg
liver jōğur jigɐr jigər jiger jigær jigær jigær
louse qar’a šəšt/šistɐ subuj espič sïbïž əspəj esbej esbézæ
mountain aql χoχ/χʷɐnχ ku ku band ku ku ku
name nām nom nom es nom nom, esm nam nawæ
new nō nog/nɐwɐg tazə nu, taze tožæ tazæ nu
night šaw ɐχšɐv/ɐχsɐvɐ šow šu šæv šæv šæv šo
nose nuxrit fənz/fij dəmaγ feni vəni vini veni viníæ
one tā iw/jew yek, i yek i i i yey
path tēra fɐndag ra ru ro ra ra ra

939
940
person ādimī adɐjmag adəm adem odæm adæm adæm adui

see jīr/jīs/maš/tēmiš fen-/win- din/de vin/di vin/vind ven/venn vin/vind vin/di
skin jild, pōṣṭ c’ar/c’arɐ pust pus pus pos puss pusd
star starg (ə)št’alə/(ɐ)st’alu sətarə setare astovæ setare assaræ esdaře
#
stone bard dur/dor səng sæng sïγ səng seg sæng
sun intaf χur/χor xoršid aftab hæši xoršid aftav xor
tongue ẓwān ɐvžag/ɐvzag zəbon zebun zïvon zebn zuan zuan
tooth dnān dɐndag dəndan, dændun dandon gaz gaz dendan
gaz
tree šidrit bɐlaš/bɐlasɐ dar dar do dar dar dar
two dō dəwwɐ/duwwɐ du do dï də de do
water āw don aw, ab u ov av av áwæ
#
water - gaḥḥ χarbəž/χarbuz hindonə hendev- zïmïstoni hindunæ xendunæ henduane
melon (Turkish) ane
why ? pi či sɐmɐn/cɐmɐn čə́rə če boči(ro) əčə́ra čéra ča
(see notes)
I mē ɐž/ɐz ( nom ), mu men az, mï(n) æz, mən æz, mæn æz
mɐn ( obl )
we mā maχ ( nom , obl ) amo ama, ema æmæ æmæ æmæ awan
you ( sg ) tō də/du ( nom ), tə te tï tə te tæ
dɐw ( obl )
you ( pl ) šmā šəmaχ/sumaχ šemo, šema šïmæ šemæ šemæ soan
( nom , obl ) šəmə

Arab.I Arab.II Arab.III Arab.IV CNA NENA.I NENA.II NENA.III NM
blood damm damm damm dam ʾadmo dəmma dəmma dəma dəmā
bone ʕaḏǝ̣ m ʕaḏị m ʕaḏǝ̣ m ʕaḏum garmo ɟarma gɛrma garma germā
+
breast sǝdǝr ṣadir; ṣadǝġ sadur ṣadro sadra; ṣadra; sənga kappā;
female ~ṣǝdǝġ; female female (Kurd.); female
breast: female breast: breast: female breast:
ṯadi breast: bəzza čəčča breast: nannā
dēs mămona
+
cat sannōṛa bisse bazzūni bazzūna qaṭən k̭aṱu qaṭu qaṭu šənārā
come pfv ǧā/ipfv ǧī (pfv ) ǧā (pfv ) Ɂiyya ʾoθe ʾatə ʾaθe (prs ) he (prs ) āṯi (prs )
7.3. Semitic languages
yǝǧi yiǧi (ipfv ) yǝǧi (ipfv ) (pfv ) (prs )

die māt (pfv )/ māt (pfv )/ māt (pfv )/ māt (pfv ) mayəθ mayət mayəθ (prs ) mel (prs ) mayeṯ
ymūt (ipfv ) ymūt ymūt (ipfv ) (prs ) (prs )
(ipfv )
dog kalb čalib kalb čaləb kalbo calba kalba kalba kalbā
drink šǝṛǝb (pfv )/ šrib (pfv )/ šǝġǝb (pfv )/ ʃarab šote šatə šate (prs ) šăte (prs ) šāyeṯ (prs )
yǝšṛab (ipfv ) yišṛab yǝšġab (pfv ) (prs )
(ipfv ) (ipfv )
ear ʔǝḏǝn ʔiḏin ḏān ðān ʾaðno nata naθa năḥalta onnā
+
eye ʕayn ʕēn ʕēn ʕēn ʿayno ʾayna ʾɛna ʾena ina
fire nāṛ nāṛ nāġ nār nuro nuyra nura nura nurā
fish samake šabbūṭ samaki samča nənto nuyna nuna nunila, fonidā
nunilta
941
942
frog ʕaqṛōqa ʕuguṛṛuga ʕaqōqa ʕagruga baqqe p̂ ək̭k̭a pəqqa, qurbaqa paqeṯṯā
(<*ʕaqġōqa) pəqqe (Pers.)
full mǝtli malyān mʕabba matrūs malyo məlya ṃəḷya zmaṭa məli
hand ʔīd ʔīd ʔīd Ɂīd ʾiðo ʾida ʾiða ʾila idā
+
hear sǝmǝʕ (pfv )/ smiʕ sǝmǝʕ (pfv )/ simaʕ šoməʿ šammə šăməʾ(prs ) šăme (prs ) ṣāyeṯ (prs )
yǝsmaʕ (ipfv ) (pfv )/ yǝsmaʕ (pfv ) (prs ) (prs )
yismaʕ (ipfv )
(ipfv )
horn qǝrn ǧarin qǝġǝn girin qarno k̭ana qana šaxa qarn
(Kurd.) (Arab.)
knee rǝgbe rukba rǝkbi rəkba barko bərca bərka bərka burkā
+
leaf waraqa wruga waġqa wirga ṭarfo ṱarpa ṭarəpθa gaḷa par (Pers.)
(Kurd.)
liver qaṣabe (for čabde kǝbǝd čabda kăzăbe jiɟar kawda koza kabdā
dish also the (as a dish kamto cumta
Turkish loan ǧīgar)
ǧīgar is used)
louse qamla gamla qamli gamla qalmo k̭alma qalma qăləmta qamṯā
+
moun - ǧabal ǧbile ǧabal yibal ṭuro ṱuyra ṭura ṭura ṭurā
tain
name ʔǝsǝm ʔisim ʔǝsǝm Ɂisim ʾəšmo šəmma šəmma šəma ešmā
ǝ
new ǧdīd ǧidīd ǧdīd yidīd ḥaθo xata xaθa xala haṯṯā
night layl lēl lēl liəl lalyo lelə lɛle lele lilyā

nose ʔǝnf xušše xašǝm darub nḥire naxira naxira poqa nəhirā
one wēḥǝd wāḥad wēḥǝd wāћid ḥa (m ), xa xa xa ehdā
ḥðo (f )
path dǝrb darib daġǝb darob darbo ʾurxa ʾurxa ʾorxa ohrā
(~ṭarīq)
person šaqṣ šaxaṣ šaxǝṣ nafar zlā́ m naša naša naša barnāšā
see ʔaṛa (pfv )/ šāf (pfv )/ qǝšǝʕ (pfv )/ ʃāf (pfv ) ḥoze xazə xaze (prs ) xăe (prs ) hāzi (prs )
yǝṛa (ipfv ) yšūf (ipfv ) yǝqšaʕ (prs ) (prs )
(ipfv )
(~ġǝšǝʕ
(pfv )/yǝġšaʕ
(ipfv ))
skin ǧǝld ǧilid ǧǝlǝd yilid galdo ɟəlda gəlda məška meškā
star nǝǧme naǧme naǧmi nayma kəkwo cuxva kəxwa kxəwla koḵḇā
stone ḥaǧaṛ ḥǧaṛa ḥaǧaġ ṣaxra kefo cipa kepa kepa gəlaltā
(~ḥǧāġa)
sun šams šamis šamǝs ʃaməs šəmšo šəmša šəmša šəmša šāmeš
ǝ
tongue lsēn lsān lsēn ʔilsān lišono lišana lišana lišana lešānā
tooth ḍǝrs sinn sǝnn (~snēn) ðərəs ʿaršo cica kaka kaka šennā
tree sǝǧara šǧaṛa sǝǧġa ʃijra dawmo ʾilana ʾilana ʾilana dəraxt
(Pers.)
ǝ
two ṯnayn ʔiṯnēne ṯnēn θniən tre tre tre tre tren
water ṃayy mayye ṃāy māy maye miyya miya mae mienā
943
944
water - ǧǝbse dibše šǝmzi rajjiya žabaš šəptiyya šəftiyya šwətya qarambā
melon (Kurd.) (Kurd.) (Kurd.)
why ? layš ʕalēš lēš liəʃ qay k̭a-mu qa-mo ta-ma qamu
I ʔana ʔāni ʔana Ɂāna ʾono ʾana ʾana ʾana anā
we nǝḥne ʔiḥna nǝḥna Ɂiћna ʾaḥna ʾaxnan ʾaxni ʾaxni ani
you ( sg ) ʔǝnt ( m ) ʔinte ( m ) ʔǝnta ( m ) Ɂinta ( m ) hat ʾat, ʾatən ʾati ʾāt āt
ʔǝnti (f ) ʔinti (f ) ʔǝnti (f ) Ɂinti (f )
you ( pl ) ʔǝntǝn ʔintu ( m ) ʔǝntǝn Ɂintum hatu ʾ axtun, ʾ ʾaxtu ʾaxtu atton
ʔintin (f ) (m) axtoxun,
Ɂintan (f ) ʾaxnoxun
Appendix: Basic lexical items in selected languages of Western Asia 945
7.4. Other language families

Tk. Qorveh Laz Rom.
blood qɑn ditsxiri (to) (j)ɘma
bone sümik ili ostuð
breast döʃ gjonksi (ta) gønksa (pl )
cat piʃik k’at’u (i) kata
come ɉelir mo-xt-im-u erχume
die ölir o-ɣur-u χame
dog köpæk dʒoɣori škilo
drink iʧir o-ʃ-u pino
ear qulax udʒi (to) otiam (pos-
sibly poss .1 sg )
eye göz toli (ta) mata (pl )
fire Ɂɔːt datʃxuri apsomo
fish mɑhi ntʃxomi opsar
frog qurbaʁæ mai furno
full dolæ opʃa eromoθe
hand Ɂæl xe (to) šeri
hear Ɂɛʃædir o-gn-u akuɣo
horn ʃɑx nkra tšerato
knee diz burguli (to) ɣonaton
leaf yɑrpɑx butk’a filo
liver ʤægær utʃa-ʃ gui (lit. ‘black heart’)
louse bit mt’i ftira
mountain dɑx germa raší(n)
name Ɂɑd dʒoxo (to) onema
new tɑzæ aɣne jengi (Turkish)
night gɛʤæ seri opse
nose bɨrn tʃxindi (to) miti
one biʤæ ar ena
path yol gza orðomo
946 Geoffrey Haig, Geoffrey Khan and Laurentia Schreiber
Tk. Qorveh Laz Rom.

person kæs k’otʃi (also ‘man’ and ‘husband’) nomat
see görir o-dzir-am-u elepo
skin daræ tk’ebi
star oldüz murutsxi aspra (pl )
stone dɑʃ kva klersi
sun gün mʒora (to) oʉlo
tongue dil nena (organ) (i) ɣlossa
tooth diʃ k’ibiri (to) ðond
tree Ɂɑʁaʧ ndʒa lisar, alat
two ikʲe ʒur ðio
water su tsk’ari nero
water - melon qɑrpɨz k’arpuzi
why ? niyæ mot, mu-ʃeni oðen
I mæn ma eɣo
we biz tʃku emis(tin)
you ( sg ) sæn si esi
you ( pl ) siz tkva esis(tin)
7.5. Notes on sources

Label Variety, Sub-variety Reference
Behd. Northern Kurdish, Behdinī, spoken in Dohuk town- Chap. 3.3, §4
ship, Iraqi Kurdistan
The Behdinī data were provided by Baydaa Mustafa, a female speaker from
Dohuk. Due to difficulties in eliciting citation forms of verbs, all verb forms
are given in the third person indicative present.
Gor.I Gorani, Šabakī, near Mosul, Iraq Chap. 3.3, §5
Gorani data were provided by Parvin Mahmoudveysi, who elicited them
from a young male speaker from the village of Sofaya, between Erbil and
Mosul, who claimed a Šabak identity. However, he was unable to provide
all forms. Due to difficulties in eliciting citation forms of verbs, all verb
forms are given in the third person masculine indicative present.
Gor.II Gorani, spoken in the town of Qorveh, Iran
Recorded by Masoud Mohammadirad in Qorveh, Iran, on 08. 03. 2018 and
transcribed by Geoffrey Haig. The speaker is a retired male, 65 years old,
from a Gorani-speaking family but married to a Central Kurdish speaking
wife, and uses predominantly Central Kurdish in daily life. Verb forms are
third person singular present. The transcription is broadly phonetic, using
IPA symbols except for the following: the front rounded vowel [y] and
the voiced palatal glide [j] are rendered with <ü, y> respectively, while all
rhotics are rendered indiscriminately with <r>.
Hawr. Hawrāmī, spoken in the Hawrāmān region in the Chap. 4.5
northern Zagros mountains in northwestern Iran and
northeastern Iraq
S. Kurd. Southern Kurdish variety of Qorveh, Iran (see Fattah
2000 for Southern Kurdish)
The data were recorded in Qorveh, Iran, on 10. 03. 2018. The speaker is
an educated male speaker (graduate student of linguistics, 29 years old),
who has been living in France since September 2016. Verb forms are third
person singular present. The transcription follows the one used for Gor.II
(Gorani Qorveh).
Bakht.I Bakhtiari, spoken in Masjed Soleymān, Khuzestan Chap. 4.3
Province, Iran (Source: Anonby and Asadi 2014)
Bakht.II Bakhtiari, spoken in Ardal, C&B Province, Iran Chap. 4.3
Kumz. Kumzari, spoken mainly in Kumzar and Khasab, Chap. 4.7
Oman

Although Kumzari is described as a mixed Semitic/Iranian language in
van der Wal Anonby (chapter 4.7) we have included it provisionally under
“Iranian” here, based on the fact that the majority of the lexical items of
this list have Iranian etymologies (approx. 25); selected etymologies are
the following: Semitic etymologies: skin : Arabic incl. Shihhi (Bernabela
2011:41); Mehri gɛ̱d (Rubin 2010:17). tree : Arabic; Mehri śigarēt (Watson
2012:25). horn : Jibbali ḳũhn (Rubin 2014:37). knee : Arabic; Mehri
bark (pl. bīrōk) (Rubin 2018:94). nose : Mehri nǝxrīr (Simeone-Senelle
1997:388); NENA I-II naxira. water - melon : Kmz. gaḥḥ; Arabic ǧiḥḥ
(Holes 2001:83); Mehri gawḥ (Johnstone 1987:127). mountain : Kmz. aql;
Musandam Arabic aql (E.Anonby in prep.)
Uncertain etymologies: sun intāf: Mehri ntaxf ‘we come at sunset’. frog
šufrāqō: Mehri šṣǝfdēt (Johnstone 1987:397). louse qar’a: cf. NENA II
qalma; Musandam Arabic qamla (E.Anonby in prep.). path tēra: Arabic
tariq; Persian rā. bone xār: cf. NENA garma
Oss. Ossetic, sub-dialects of Iron and Digor, spoken Chap. 6.3
in North Ossetia-Alania, an autonomous republic
within Russia, and South Ossetia, a breakaway part
of Georgia
Iron and Digor forms are given simultaneously, they are shown in the fol-
lowing order and separated by a slash: gɐdə/tikis ‘cat’. Note on the dative
of ‘why?’: this is etymologically the dative of sə/či ‘what’. Besides that,
there exists a dedicated lexical item sɐwənnɐ/cɐmɐnnɐ ‘why not’, etymo-
logically sɐwəl-nɐ what.sup - neg and cɐmɐn-nɐ what.dat - neg , respectively,
see Erschler and Volk (2011) for arguments that these are single words. The
personal pronouns are given for the nominative and oblique, other forms
are as follows 1 sg : Dat mɐnɐn; Abl mɐnɐj; All mɐnmɐ; Sup mɐnəl/mɐnbɐl;
Equ mɐnaw; Com mɐnimɐ. 2 sg : Dat dɐw-ɐn; Abl dɐw-ɐj; All dɐw-mɐ; Sup
dɐw-əl/dɐw-bɐl; Equ dɐw-aw; Com dɐw-imɐ. 1 pl : Dat maχ-ɐn; Abl maχ-ɐj;
All maχ-mɐ; Sup maχ-əl/maχ-bɐl; Equ maχ-aw; Com maχ-imɐ. 2 pl : Dat
šəmaχ/sumaχ-ɐn; Abl šəmaχ/sumaχ-ɐj; All šəmaχ/sumaχ-mɐ; Sup šəmaχ-əl/
sumaχ-bɐl; Equ šəmaχ/sumaχ-aw; Com šəmaχ-imɐ.
Casp.I Tatic (Lāhijāni), Eastern Gilaki Chap. 5
Casp.II Tatic (Sāravi), Mazanderani Chap. 5
Tal.I Tatic (Leriki), Northern Talyshi Chap. 5
Selected items (*) from Pirejko (1976) dictionary.
Tal.II Tatic (Māsulei) Chap. 5
Selected items ( ) from Māsāli (Nawata 1982).
#
Tat.I Tatic (Koluri) Chap. 5

Tat.II Tatic (Vafsi) Chap. 5

Arab.I Arabic of Mardin (qǝltu dialect) Chap. 2.4
Arab.II Arabic of Urfa/Ḥarrān (Bedouin dialect) Chap. 2.4
Arab.III Mosul Arabic Chap. 3.2
Arab.IV Khuzestani Arabic, Iran
The data were elicited from a native speaker consultant.
CNA Ṭuroyo (Midin), Central Neo-Aramaic, spoken in the Chap. 2.5
region of Ṭūr ʿAbdīn, southeastern Turkey (Source:
Jastrow 1985 and native speaker consultants)
NENA.I Christian Urmi, North-Eastern Neo-Aramaic, spoken Chap. 2.5
on the plain of Urmi and within the town of Urmi,
Iran (Source: Khan 2016)
NENA.II Christian Barwar, North-Eastern Neo-Aramaic, Chap. 3.4
spoken in the Barwar region of northern Iraq close
to the Turkish border, in a number of villages along
the Be-Xelape River. (Source: Khan 2008)
NENA.III Jewish Sanandaj, North-Eastern Neo-Aramaic, Chap. 4.4
spoken in western Iran (Source: Khan 2009)
NM Khorramshahr, Neo-Mandaic, spoken until recently Chap. 4.4
in the cities of Ahwaz and Khorramshahr, Iran
(Source: Häberl 2009; Mutzafi 2014)
Tk. Qorveh Iran Turkic, dialect of the town of Qorveh, Iran
Recorded in Qorveh, Iran, on 08. 03. 2018, by Geoffrey Haig. The speaker
is an educated male (high school teacher, approx. 30 years old) raised
in a primarily Turkish speaking family in Qorveh. He is trilingual, regu-
larly speaking the local varieties of Turkish and a mixture of Central and
Southern Kurdish, and Persian, plus a little Gorani. His mother (approx.
60 years old, primarily Turkic speaking) was also present at the recording.
Verb forms are third person singular present. A phonemic analysis was
not possible based on this minimal data set, but see Bulut (this volume,
chapter 4.2) for a description of the relatively close dialect of Sonqor. The
transcription is broadly phonetic, using IPA symbols except for the follow-
ing: the two front rounded vowels [ø, y] and the voiced palatal glide [j] are
rendered with <ö, ü, y> respectively, while all rhotics are rendered indis-
criminately with <r>.
Laz Laz (Kartvelian), spoken in northeastern Turkey near Chap. 6.2
the Black Sea, and in Georgia, mainly in the village
of Sarp. The lexical list is based on the variety of
Arhavi in Turkey.

Rom. Romeyka, spoken in north-eastern Turkey in Trabzon Chap. 6.4
Province, data based on the variety of Çaykara (ROf)
For nouns, the article is provided where available. Due to the lack of an
independent infinitive form, verbs are provided in the 1st person singular
which is consensus in Greek philology.
References
Anonby, Erik. In preparation. Dialectology of Musandam Arabic.

Anonby, Erik & Ashraf Asadi. 2014. Bakhtiari studies: phonology, text, lexicon. (Studia
Iranica Upsaliensia 24). Uppsala: Acta Universitatis Upsaliensis.
Erschler, David & Vitaly Volk. 2011. On negation, negative concord, and negative imper-
atives in Digor Ossetic. In Geoffrey Haig, Simin Karimi, Agnes Korn & Pollet Sam-
velian (eds.), Topics in Iranian Linguistics (Beiträge zur Iranistik Bd. 34), 135–152.
Wiesbaden: Reichert.
Fattah, Ismaïl Kamandâr. 2000. Les dialectes kurdes méridionaux: étude linguistique et
dialectologique. (Acta Iranica 37). Leuven: Peeters.
Holes, Clive. 2001. Dialect, culture, and society in Eastern Arabia: Glossary. Leiden &
Boston: Brill.
Holman, Eric W., Cecil H. Brown, Søren Wichmann, André Müller, Viveka Velupillai, Har-
ald Hammarström, Sebastian Sauppe, Hagen Jung, Dik Bakker, Pamela Brown, Oleg
Belyaev, Matthias Urban, Robert Mailhammer, Johann-Mattis List & Dmitry Egorov.
2011. Automated dating of the world’s language families based on lexical similarity.
Current Anthropology 52(6). 841–862.
Häberl, Charles. 2009. The Neo-Mandaic dialect of Khorramshahr. (Semitica Viva 45).
Johnstone, Thomas M. 1987. Mehri lexicon and English-Mehri word-list. London & New
York: Routledge.
Khan, Geoffrey. 2008. The Neo-Aramaic dialect of Barwar. 3 vols. Leiden & Boston: Brill.
gias Press.
Mutzafi, Hezy. 2014. Comparative lexical studies in Neo-Mandaic. (Studies in Semitic
Languages and Linguistics 73). Leiden & Boston: Brill.
Nawata, Tatsuo. 1982. The Masal dialect of Talishi. Acta Iranica 22. 93–117.
Pirejko, Lija A. 1976. Talyšsko-russkij i russko-talyšskij slovar [Talyshi-Russian and Rus-
sian-Talyshi dictionary]. Moskva: Izdatel’stvo Russkij jazyk.
Rubin, Aaron D. 2010. The Mehri language of Oman. Leiden & Boston: Brill.
Rubin, Aaron D. 2018. Omani Mehri: A new grammar with texts. Leiden & Boston: Brill.
Simeone-Senelle, Marie-Claude. 1997. The modern South Arabian languages. In Robert
Hetzron (ed.), The Semitic languages, 378–423. London & New York: Routlege.
Watson, Janet C.E. 2012. The structure of Mehri. Wiesbaden: Harrassowitz.
Wichmann, Søren, Eric W. Holman & Cecil H. Brown (eds.). 2016. The ASJP Database
(version 17).
Index of language names
The index includes names of languages, language groups, varieties, and dialects.
A Arabic, Northern Iraqi (vernacular) 160, 246,

Abivardi [Abiverdi] (variety of Qashqâʾî) 417, 249, 260, 827
434 Arabic, qǝltu [qeltu] dialects 31, 32, 35, 159,
Adiyaman-Urfa, Kurmanjî of 126 160, 162, 164–168, 170–173, 175, 176,
Afshâr [Afshar] dialects → Qashqâʾî (Central 179–183, 187, 244, 248, 249, 263–265,
Oghuz) 949
Akkadian 237, 305, 309 Arabic, Anatolian qǝltu dialects 160, 165–167,
Akre, Kurmanjî of 288–291, 294 172, 175, 176, 179, 180, 188, 265
Alviri 672, 678, 689, 694, 700, 711, 712, 717, Arabic, Šāwī [Šāwi] 159, 188
725–727, 752, 754, 767, 770, 772, 820, Arabic, sedentary-type dialects 31, 159, 244,
823 252, 259, 260
Amêdî, [Amêdi, Amedia] Kurmanjî of 16, 36, 37, Arabic, Syrian 163, 170
137, 306, 312–315, 317–319, 322, 486 Aradhin, Christian NENA 306, 324
Āmoli 719, 824 Aramaic 9, 14, 35–37, 160, 162, 167, 173, 183,
Anbarāni [Anbarani] 660, 712, 824 197, 201, 213, 214, 219, 220, 225–228, 234–
Ankawa, Christian NENA 11, 239, 306, 313, 323, 236, 238, 243, 254, 260, 266, 305, 309, 311,
324, 337, 350 312, 316, 326, 328–330, 341, 342, 344, 350,
Anzali 659 352, 353, 386, 394, 485, 488, 489, 501, 503,
Arabian, South 7, 627–629, 638, 639, 642, 658, 505, 513, 522–524, 526, 532, 625, 828, 843
951 Aramaic, Jewish Babylonian 305
Arabic 3, 7, 8, 10, 11, 13, 16, 19, 20, 22, 23, 28, Aramaic, Middle 350, 513
31, 32, 35–44, 77, 108, 109, 150, 159–177, Aramaic, Central Neo-Aramaic → Central
179–184, 187–189, 196, 198, 226–228, 237, Neo-Aramaic (CNA)
239, 240, 243–250, 252, 254, 259, 260, 263– Aramaic, North-Eastern Neo-Aramaic
266, 270–272, 288, 290, 296, 306, 311, 316, → North-Eastern Neo-Aramaic (NENA)
325, 341, 344, 345, 351, 353, 357, 358, 360– Arbel, Jewish NENA 240, 306, 312, 314, 322–
363, 369, 372, 377, 386, 387, 406, 408, 409, 327, 335, 337, 339, 341, 343, 345, 351
411, 413, 414, 417, 427, 440–442, 450, 484, Armenian 1, 5, 8–10, 14, 16, 18, 19, 21, 24, 25,
485, 490, 492, 495, 524, 526, 541, 570–574, 27, 31–34, 36, 37, 39–43, 45–50, 65, 66, 68,
579, 581, 583, 585–588, 607, 624, 627–629, 70, 72–76, 79, 81, 83, 85, 87, 89, 90, 92–104,
636, 638, 649, 656, 657, 677, 686, 692, 827, 108, 109, 113, 121, 150, 151, 196, 201, 218,
895, 948, 949, 951 238, 336, 352, 357, 406, 748, 786, 790, 840,
Arabic, Anatolian 13, 16, 20, 35, 159, 163, 895
166, 187, 188, 189 Asālemi 671, 672, 677, 685, 686, 691, 694, 698,
Arabic, bedouin-type dialects 31, 42, 159, 702, 712, 713, 715, 716, 718, 723, 724, 731,
164, 181, 184, 188, 196, 243, 244, 252, 259, 736, 737, 739, 743–745, 748, 756, 765–767,
629, 949 780, 782, 785, 787, 791, 793, 824
Arabic, dialects of 10, 11, 13, 16, 23, 28, 31, Ashitha, Christian NENA 33, 35, 39, 43, 44, 197,
32, 35–37, 39, 41, 42, 159, 161, 179, 181, 198, 201–206, 210, 213, 215, 217, 218, 223,
183, 187, 188, 228, 239, 240, 243, 245, 259, 224, 226, 227, 233
263–266, 316, 345, 492, 629, 649, 656, 657, Astara 660, 664
827 Avestan 46, 571
Arabic, Harran-Urfa 162–171, 173–175, 177, Azd 625, 627
179–184 Azerbaijani (Azerbaijan Turkic, Central
Arabic, Jewish Arabic dialects 259, 265 Oghuz) 376, 398, 399, 407, 416, 423–425,
Arabic, Mardin 164–165, 170–171, 177, 949 444, 621, 663, 763, 822, see also Azeri
Arabic, Mesopotamian 45, 187, 243, 264, 265 Turkic
954 Index of language names
Azeri Turkic 12, 33, 196, 200, 228, 229, 259, 200–209, 211–213, 215, 219, 227, 228, 234,
355, 364, 365, 367, 376, 377, 398–400, 406, 941–944, 949
408, 412, 421, 439, 444, 570, 577, 578, 673, Chāli 664, 672–675, 678, 685, 687, 694, 696,
676, 679, 763, 806, see also Azerbaijani 711, 715, 721, 733, 739, 767, 775
Charazei 664
B Charozh 664, 716, 737, 824
Bābolsari 678, 686, 687, 719, 757, 824 Chenār Rudxāni 664
Bāĵaɫānī [Bāǰāɫānī, Bāĵalānī, Bāǰalānī] 297, Chikolen-Lesken [Chikola-Lesken] dialect of
303 Digor Ossetic 862
Bakhtiari [Bakhtiāri] 12, 25, 385–389, 391–393, Chinese 574, 601
395, 419, 445–480, 629, 947, 951 Cudi cluster of Christian NENA 191, 198, 204,
Bākoluri 664 205, 210, 224
Balochi [Baluchi] 6, 25, 627
Barwar, Christian NENA 16, 20, 45, 157, 238– D
240, 306, 312–315, 317–320, 322, 324–327, Darī [Dari] 569, 570, 572, 579, 603–605, 607,
330, 332–334, 337–340, 342–344, 346, 351, 609, 610, 612, 623
500, 501, 949, 951 Derabun, Christian NENA 315
Barətla, Christian NENA 306, 317 Dere, Christian NENA 324
Baṭnaya, Christian NENA 323, 324 Digor Ossetic 861–863, 865–874, 876, 877, 886,
Bayat (variety of Qashqâʾî) 416, 422–425, 430 887, 889, 890, 948, 951
Baz, Christian NENA 199, 204, 205, 209, 210, Dikin-Marāqei 665, 678, 717, 727, 735, 780, 782,
212, 638 824
Baz, Aruntus, Christian NENA 205, 210 Dikin-Pashei [Dikin-Pašei] 665, 674, 675, 824
Baz, Maha Xtaya, Christian NENA 204, 209, Diyana-Zariwaw [Diyana-Zariway] 312, 313–
210, 212 318, 322, 330, 352
Begdilli [Begdili] (variety of Qashqâʾî) 423–425 Dīz, Christian NENA 203
Behdinī [Bahdini, Badini, Badinî, Bahdinī, Dizi 664, 730, 731, 824
Behdini], Northern Kurdish 13, 24, 34, 40, 41, Dobe, Jewish NENA 323, 337
43, 44, 106, 108, 114, 119, 121, 125–127, 129, Dohuk, Jewish NENA of 306, 314, 323
132, 134, 137, 138, 145, 267, 269, 272, 280, Dohuk, Kurmanjî of 138, 290, 291, 303
287, 292, 294, 304, 377, 388, 947 Domari 3, 28, 29
Bēṣpən, Christian NENA 198
Betanure, Jewish NENA 306, 310, 312, 314, 323, E
324, 336, 352 Elāšti [Elašti] 666, 668–670
Bēwyānī (variety of Gorani) 297 Elazığ [Elaziğ], Kurmanjî of 127
Bidaro, Christian NENA 313, 318 Elbistan, Kurmanjî of 121, 127, 136, 139, 141,
Bingird, Sorani of 271, 285 149
Bohtan, cluster of Christian NENA 14, 16, 32, 34, Erzurum, Kurmanjî of 127
35, 40–42, 191, 194, 198–205, 207, 210, 211, Eštehārdi [Eshtehārdi] 724, 738, 780
216–218, 220, 221, 224, 225, 228, 233, 234
Bokan, Jewish NENA 481, 496, 508 F
Bədyal, Christian NENA 310, 313, 314, 318, 319, Fârsî [Farsi, Fārsi] 405, 406, 434, 485, 569, 570,
324, 335, 337, 339 573, 579, 580, 582, 589, 600–602, 605, 607,
609, 610, 612, 616, 620–624, 821
C Fumani 659
Cappadocian 893, 901, 903, 932
Caucasian, South 9, 830, 843, 846, 854, 855, 860, G
885, 889 Galeshi [Galeši, Gāleši] 663, 666, 668–670, 675
Caucasian Tat 659, 664, 691, 819, 820, 823, 578, Gandomābi 664, 678, 723, 724, 730, 731, 736,
938–940, 948 738, 759, 772, 778, 824
Central Neo-Aramaic (CNA) 6, 10, 14, 16, 17, Gawraǰūyī [Gawřajūyī] (variety of Gorani) 296,
20, 31, 32, 34–37, 41, 42, 190, 194, 197, 198, 533, 541
Index of language names 955
Gāzarxāni (Tatoid) 674, 675 369, 370, 371, 374, 376, 383, 385, 386, 387,
Georgian 9, 10, 196, 748, 763, 825, 826, 830, 388, 389, 390, 391, 392, 393, 394, 395, 398,
833, 840, 855, 862, 878, 885, 891, 895 400, 401, 411, 413, 414, 415, 416, 419, 420,
Geylavāni 664 423, 426, 427, 428, 432, 433, 434, 435, 436,
Gīlakī [Gilaki] 18, 19, 577, 578, 658–660, 665, 437, 438, 439, 444, 445, 447, 458, 460, 472,
673–677, 679, 688, 691, 692, 697, 709, 716– 473, 478, 479, 480, 484, 485, 494, 503, 522,
718, 729, 769, 820, 821, 948 523, 533, 538, 566, 567, 568, 569, 570, 571,
Gorani [Gūrānī, Gūranī, Gurānī, Gurāni] 7, 13, 572, 573, 575, 576, 577, 578, 579, 601, 621,
15, 237, 267, 269, 270, 274, 278, 281, 295– 622, 623, 624, 625, 627, 629, 638, 639, 657,
302–304, 357, 368, 385, 410, 413, 426, 533, 658, 659, 660, 664, 667, 708, 711, 716, 720,
534, 536, 537, 540, 566, 567, 947, 949 730, 731, 763, 778, 790, 820, 821, 822, 823,
Gulf Arabic 629, 657 825, 826, 827, 828, 829, 861, 884, 885, 889,
Gullī, Kurmanjî of 291 890, 936, 948, 951
Gulpašan, Christian NENA 200 Iranian, Western 124, 385, 494, 623–625, 629,
Gurāni (variety of Tatoid) 674, 675, 700, 703, 639, 658, 821
714, 728, 735, 771, 824 Iranic → Iranian
Gurchāni (Vafsi) 665, 676 Iraq Turkic → Turkic, Iraq
Iron Ossetic 861, 862, 865, 889
H Iron Ossetic, Kudar dialect 863, 878
Hakkari cluster of Christian NENA 191, 199,
201, 203–206, 209, 210, 212 J
Halmun, Christian NENA 204, 224 Jāfī 534, 537, 538
Hamziye, Christian NENA 324 Jamshidābādi 673
Harbole,Christian NENA 205 Jilu, Christian NENA 11, 199, 203, 206, 233, 234
Harzani [Harzandi] 40, 664, 671, 672, 676, 689, Jowkandāni 712, 727, 728, 824
698, 710–712, 718, 720, 728, 731, 733, 739, Jubani 673–675, 700, 739, 824
744, 749–751, 764, 770, 771, 791–793, 820,
821, 823 K
Hashtpari 660, 820 Kabatei [Kafteji] 664, 665, 671–675, 678, 685,
Hassana, Christian NENA 198, 210 694, 696, 705, 711, 712, 717, 727, 728, 737,
Hawrami [Hawrāmi, Hawrāmī] 10, 15, 18, 755–757, 778, 779, 822, 824
25, 267, 295, 296, 298, 299, 301, 385–394, Kābolī [Kaboli] 610
533–539, 541–543, 546, 553, 554, 566, 567, Kajali 664, 671, 672, 678, 717, 723, 737, 738,
763, 947 764, 767–769, 773, 774, 797, 824
Hertevin, Christian NENA 14, 16, 34, 35, 198, Kakaʼī [Kākayī] (variety of Gorani) 297, 533
200, 201, 203, 205, 210, 211, 216–218, 221, Kalāsari [Kalāsuri] 40, 664, 671, 685, 694, 703,
234, 526 712, 718, 721, 725–728, 731, 733, 764, 768,
Hezārrudi 664, 678, 695, 717, 730, 731, 736, 769, 823, 824
767–769, 778, 779, 785, 800, 801 Kalhor of Bayâdestân 424, 435, 436
Hobyot 629 Kalhorī 577
Kandelusi 666
I Kandūlayī [Kändûläî] (variety of Gorani) 296,
Indo-Aryan 639 444, 533, 5, 567
Indo-European 53–60, 104 Karamlesh, Christian NENA 323, 324
Indo-Iranian 625, 627 Karani 664, 671, 672, 707, 723–725, 736, 738,
Iranian languages 3, 5, 6, 7, 8, 10, 11, 12, 13, 15, 823, 824
16, 18, 20, 21, 22, 24, 25, 26, 28, 29, 40, 45, Kargānrudi [Karganrudi] 764, 766
94, 106, 108, 118, 124, 130, 151, 156, 157, Karnaqi 664, 730, 731, 738, 824
158, 189, 195, 214, 227, 229, 237, 238, 239, Kartvelian 5, 8, 10, 21, 29, 241, 401, 825–827,
240, 241, 260, 267, 269, 272, 273, 274, 276, 830, 860, 949
278, 280, 285, 287, 295, 296, 298, 299, 303, Kelārdašti [Kelārdashti] 660, 666, 673, 674, 675,
304, 306, 327, 354, 362, 363, 364, 366, 368, 692, 693, 718, 720, 721, 729–731, 735
Kelāsi [Kelasi] 664, 665, 673–675, 678, 685, 687, Kurdish, Central (Sorani) 10, 11, 18, 25, 41,
694, 703, 715, 727, 736, 738, 752, 754, 757, 137, 139, 143, 213, 237–241, 267, 269–274,
822, 824 276–278, 280–285, 287, 288, 291–293, 298,
Kerend, Jewish NENA 481, 496, 497, 503, 508, 299, 302, 304, 316, 322, 325, 358, 369, 370,
512, 532 485, 501, 534, 536, 537, 566, 826, 827, 947
Keringāni [Keringani] 664, 671, 710–713, 718, Kurdish, Southern 3, 15, 106, 274, 278, 281,
720, 721, 725–727, 730, 731, 744, 751, 758, 296, 298–300, 368, 460, 947, 949
764, 791 Kurmanjî [Kurmanji, Kurmandji, Kurmanci,
Khalaj 19, 398, 399, 405, 407, 411, 415, 416, Kurmandjî Kurmānğí] 10, 29, 31, 34–37,
419, 422, 424, 430, 431, 433, 434, 438, 39–45, 106–111, 113, 115, 117, 118, 120–122,
439 124, 126, 127, 130, 131, 133–139, 141, 144,
Khalaj of Bayâdestân 19, 407, 415, 422, 424, 145, 147–152, 156–158, 184, 189, 237–240,
430, 431, 433, 434, 438, 439 267, 270, 281, 287, 289, 291, 303, 314, 316,
Khatirābādi 824 369, 427, 673, 699, 837, 840, 918, see also
Khoini [Xo’ini] 694, 698, 717, 723, 727, 728, Kurdish, Northern
730, 731, 733, 739, 753, 823, 824 Kurmanjî, Southeastern (SEK) 40, 42–44,
Khorâsân-Turkic (East Oghuz) 398 133–137, 139, 140, 142–146, 148, 150, 151
Khoynarudi 664, 711, 712, 733, 774, 784, 787, Kurmanjî, Southern (SK) 133–136, 139, 144,
788, 800, 801, 803, 824 148, 149
Kilit 659, 664 Kurmanjî, Western (WK) 108, 126, 127, 131,
Koluri 664, 671, 672, 677, 678, 685, 687, 691, 133–136, 138, 139, 141, 145–151
698, 711, 715–717, 720, 723, 724, 727, 728,
730, 731, 736, 748, 819, 824, 948 L
Kordkheyli 678, 686, 697, 747, 808, 809 Lāhijāni [Lahijāni, Lāhijani] 19, 659, 660, 666,
Koy Sanjak [Koy Sanjaq, Koy Sanǰaq], Jewish 667, 669, 670, 677–679, 682, 685, 686, 688,
NENA 296–298, 305, 306, 312, 313, 315, 316, 691–693, 697, 701, 703–705, 708, 709, 713,
323, 352 715, 718, 719, 720, 721, 723, 725–727, 729–
Kumzari [Kumzārī, Komzari, Kamazara] 1, 7, 15, 732, 734, 735, 737, 738, 741–745, 747, 749–
22, 24, 25, 385, 387, 389, 390, 395, 458, 480, 752, 754, 757–763, 765, 766, 768, 769, 771,
571, 625, 627–636, 638–651, 653, 656, 658, 792, 793, 795–803, 805, 808, 809, 824, 948
947, 948 Lākei [Lakei] 673
Kurdish 3, 6–18, 20, 24, 25, 28, 31–35, 37–39, Laki [Lakī] 300, 303, 577
41, 44, 45, 106–110, 113, 115–119, 121, Langerudi 659, 666, 734, 735, 743, 745, 749, 824
122, 126, 129–131, 134, 135, 137, 139, 143, Lārestānī 658
148, 150–152, 156–158, 160, 162, 163, 177, Latin 46, 163, 357, 574, 676
181–184, 188, 189, 194, 196, 200, 204, 213, Latin, Low 574
226–228, 237–240, 243, 244, 246, 251, 259, Laz 8, 10, 13, 16, 19, 21, 24–26, 241, 825–828,
260, 267, 269–274, 276, 278, 280, 281, 284, 830, 832, 833, 835, 837–841, 843, 844, 846,
287–291, 293, 294, 296, 298–304, 310, 311, 847, 850, 852–854, 856, 860, 895, 923, 945,
314, 316, 321–323, 325, 332, 337, 341, 344, 946, 949
345, 351, 352, 354, 355, 357–361, 368–370, Laz, dialect of Arhavi-Fındıklı 832, 833, 835,
374, 375, 377, 383, 388, 389, 392, 400, 406, 844, 852, 854, 860, 949
407, 410, 412, 413, 426–428, 434, 437, 441, Laz, dialect of Hopa 72
445, 460, 468, 482, 485, 490, 501, 509, 526, Lerdi 664, 733, 824
532–534, 536–538, 553, 561, 566, 571, 577, Leriki 660, 663, 677–679, 683–687, 690, 694,
601, 665, 699, 705, 826, 827, 837, 840, 843, 695, 698, 702–704, 706, 708–714, 716, 719,
895, 918, 947, 949 722–728, 731, 732, 734, 739–741, 743–746,
Kurdish, Northern (Kurmanjî) 8, 10, 13, 16, 748, 751–760, 763–769, 772, 774, 780, 781,
17, 24, 25, 106, 131, 150, 156, 238, 267, 785, 786, 790, 791, 793, 797–799, 801–809,
270, 271, 273, 278, 280, 284, 287–289, 814, 824, 948
291, 298, 370, 388, 468, 705, 947, see also Levandavili 663
Kurmanjî lišana deni Jewish NENA 191, 306
Lorī [Luri, Lori] 407, 445, 446, 452, 478, 479, Nerwa, Christian NENA 11, 306, 312, 313, 353
571, 629 Nerwar, Jewish NENA 310, 314, 323, 324
Lori, Northern 445 North-Eastern Neo-Aramaic (NENA) 6, 8–17,
Lori [Luri], Southern 445, 452, 629 19–23, 26–28, 31–44, 157, 190, 191, 194–207,
Lənkəran [Lenkoran] dialect 660, 663, 824 209–228, 230, 235, 238, 241, 269, 292,
305–307, 309–317, 319–322, 324, 328–334,
M 336, 345, 350, 351, 385–397, 460, 481–486,
Māčiāni 670 488–494, 496–505, 507–513, 515, 516, 518,
Mačo [Māčo] (variety of Gorani) 297–299, 301, 521, 522, 525–527, 531, 532, 826, 941–944,
534 946, 948, 949
Mandaic 305, 393, 397, 485, 488, 502, 526, 532 NENA, Christian (dialects) 9, 13, 191, 194,
→ Neo-Mandaic 196, 198, 203, 206, 215, 222, 223, 306, 308,
Mandaic, Classical 393, 485 311, 388, 390–392, 482, 484, 486, 488, 489,
Mangesh, Christian NENA 324 491
Maraš [Maraş], Kurmanjî of 52, 156 NENA, Jewish (dialects) 194, 199, 241, 269,
Mardin, Kurmanjî of 34, 116, 139, 141, 145, 152, 306, 307, 311, 316, 387, 388, 390–394, 481,
159, 160 488, 491, 507, 508, 511, 512, see also trans-
Mar-Yaqo, Christian NENA 324 Zab Jewish NENA
Māsāli 671, 672, 675, 685, 824, 948 Nowkiāni [Nowkiani] 664, 671, 678, 689, 720,
Masally 659, 660, 663 721, 725, 727, 728, 778, 784, 788, 824
Maṣlāwi Arabic → Moṣul, Arabic of
Māsulei 660, 671, 672, 677, 678, 681, 685, 694, O
703, 710, 715–717, 720, 721, 723, 725, 731, Oghuz, Central (Turkic) 358, 399, 439
736, 737, 739, 741, 744–748, 756, 758–763, Oghuz, South (Turkic) 399, 405
765–767, 769, 770, 772, 773, 775, 780, 793, Ophitic 894, 932
795, 799, 807, 809, 824, 948 Oskulaki 665, 673
+
Mawana, Christian NENA 13, 34, 202, 204, 213 Ossetic 8, 12, 16, 24–26, 825–829, 861–863,
Māzandarānī [Mazanderani, Māzanderānī, 865–868, 871, 873, 876–878, 883–891, 948,
Mazandarani] 577, 624, 659, 660, 677, 678, 951
691–693, 697, 709, 714, 715, 719, 735, 741,
747, 762, 821, 822, 948 P
Mazṛa, Christian NENA 218 Pahlavī [Pahlavi] 572, 623
Mehri 627, 629, 638, 658, 948, 951 Pārsī [Parsi, Pārsīg] 572, 622, 623
Middle Persian → Persian, Middle Parthian 274, 571, 572
Midyat, Kurmanjî of 107, 133, 144, 148, 152 Pashto 6, 570, 577, 579
Mingrelian 830, 885, 890 Pāwayī 536
Mlaḥsô [Mlaḥso] 32, 41, 42, 45, 190, 195, 197, Persian 3, 18, 22, 24, 25, 27, 65, 77, 85, 87,
198, 204–207, 209, 213, 220, 221, 234, 526 118, 176, 196, 228, 229, 234, 243, 274, 300,
Mongolian 4, 574 303, 362, 366, 368–370, 375, 377, 385–387,
Moṣul, Arabic of 23, 240, 246, 247, 248, 250, 389–393, 395, 396, 401, 405–409, 411–415,
252, 260–265, 949 418, 424–426, 428, 429, 433, 434, 436–442,
Mukri, Sorani of 12, 34, 270–272, 282, 304 445, 447, 449, 450, 458, 460, 464, 472, 473,
Muš [Muş], Kurmanjî of 8, 132, 159 477, 479, 480, 484, 485, 490, 501, 502, 504,
509, 514, 520, 524–526, 533, 534, 537, 538,
N 569–593, 595–597, 600–604, 607, 608, 614,
Nargezine-Xarjawa, Christian NENA 323 617, 621–625, 627–629, 637, 657, 658, 664,
Nāvrudi 663 665, 667, 668, 673, 677, 679, 687–689, 691,
NENA → North-Eastern Neo-Aramaic 692, 697, 702, 704, 717, 735, 738, 756, 757,
Neo-Mandaic 19, 21, 22, 385–388, 390, 393–395, 763, 786, 796, 798, 799, 806, 820, 821,
481–485, 487–493, 495, 497, 498, 500–502, 840, 855, 884, 885, 915, 948, 949, see also
504, 513–515, 517, 519, 521, 522, 524, 526, Fārsī
529, 531, 532, 949, 951 Persian, Classical 576, 578, 582, 592, 623
Persian, Colloquial 24, 27, 370, 389, 390, 437, Rumca → Romeyka
538, 576, 578, 583, 586–588, 590, 592, 596, Rumdža → Romeyka
614, 616, 620, 621, 624 Rumta, Christian NENA 201
Persian, Early Judaeo- 392, 580, 624 Russian 45, 100, 194, 196, 570, 575, 861, 863,
Persian, Early New 576, 580, 582, 584, 624 886, 887, 891
Persian, Judaeo- 392, 572, 580, 624 Rustaqa, Jewish NENA 306, 323, 335–337, 351
Persian, Middle (MP) 274, 571–573, 584, 624,
627, 637 S
Persian, Middle, Manichaean 584 Šabakī (variety of Gorani) 296, 298–300, 533,
Persian, Middle, Zoroastrian 572, 584 947
Persian, Modern 368–370, 401, 405, 407, 408, Safî Khânî (variety of Qashqâʾî) 416, 417,
409, 411, 413, 436, 479, 588, 593, 604 423–425, 430
Persian, New (NP) 571–573, 576, 580, 582– Sagzābādi 664, 676, 740, 785, 788, 801, 804
585, 590, 596, 602, 624 Saljuqian Turkic (historical form) 398–400
Persian, Old 571 Šammar dialects 243, 244, 246
Persian, Standard 385, 390, 538, 576, 580, Šamməsdin-Gawar cluster of Christian
581, 583, 586, 588, 589, 608 NENA 191, 202–204, 213
Peshabur, Christian NENA 313, 322, 324 Sanandaj, Christian NENA 389–392, 482, 484,
Piždar, Sorani of 271, 285 485, 487–494, 497, 497, 499, 501, 503–505,
Pontic Greek 893–895, 897, 903, 907, 932, 933 509–512, 516–519, 521–526
Pontic Greek, Muslim → Romeyka Sanandaj, Jewish NENA 10, 15, 19–21, 23,
351, 386, 387, 389, 393, 394, 397, 481, 482,
Q 485–492, 494–497, 499–501, 503, 506–512,
Qal’ei 824 517–519, 521–524, 527, 532, 949, 951
Qar Hasan, Jewish NENA 508 Sangesari 280
Qaraqosh, Christian NENA 10, 11, 240, 306, 309, Săqəz, Jewish NENA 481, 494, 497
311–313, 315–324, 326, 328, 329, 333, 334, Sāravi 660, 666, 668–670, 677, 678, 683,
337, 340–345, 351 685–687, 692, 693, 697, 698, 701, 703–705,
Qashqâʾî (South Oghuz) 368, 383, 399, 405, 407, 707–709, 711, 715, 719, 721, 727, 730–732,
409–414, 416–418, 421, 423–426, 430, 444 735, 736, 741–743, 745, 747, 749–751, 753,
Qočanəṣ, Christian NENA 201 757–759, 762, 763, 765, 766, 768, 769, 771,
792–795, 797–799, 801, 804, 808, 809, 812,
R 948
Rāmsari [Ramsari] 659, 666, 669, 670, 688, 692, Sarlī 267
693, 709, 721, 730–732, 734, 735, 743, 745, Sayyādlari 824
747–749, 757, 821, 824 Semitic 3–7, 10, 14, 20, 21, 25, 27, 29, 31, 32,
Rashti [Rašti] 666, 668–670, 673–675, 677–679, 42, 113, 150, 159, 189, 234–241, 243, 254,
685, 688, 697, 711, 717, 720, 725–727, 730, 264–266, 272, 292, 305, 328, 350, 351, 353,
731, 734, 736, 738, 741, 751, 762, 765, 824 361–363, 366, 371, 385, 387, 388, 397, 401,
Razajerdi 664, 678, 689, 694, 700, 703, 720, 727, 410, 411, 531, 532, 625, 627, 630, 632, 635,
729, 739, 747, 770, 771, 786, 791, 824 643, 658, 763, 821, 822, 825–827, 829, 941,
Rewandiz, Sorani of 292 948, 951
Romeyka 12, 13, 16, 20, 21, 825–828, 892–895, Semitic, South 625
897–907, 909–912, 914–927, 929–931, 933, Semnānī [Semnani] 571, 577
950 Šēxānī [Şēxānī] (variety of Gorani) 298, 533
Rostamābādi 673, 824 Šēx Bizini 106
Rōžbayānī [Rožbayānī] (variety of Gorani) 533 Shahrāni 673
Rudbāri 659, 665, 667, 673–675, 700, 716, 717, Shāli 664
720, 727, 728, 730, 731, 734, 739, 762, 771, Shāndermeni 664
824 Shihhi [Shihhuh, Shihuh] 627–629, 656, 658, 948
Ruma, Christian NENA 14, 34, 42, 191, 203, 207, Shōsh-u-Sharmə, Christian NENA 323
216–218, 228 Siāvarudi 664
Sinjar, Kurmanjî of [Sinjarī] 288, 290 Turkic, Iraq 14, 18, 383, 400, 409–411, 416,
Sorani → Kurdish, Central 418, 422, 423, 429, 434
Sulemaniyya, Christian NENA 305, 313, 315, Turkish 3, 7, 8, 10, 13, 18, 20, 28, 40, 65, 72, 74,
323, 324, 333, 487 77, 92, 95, 109, 121, 122, 138, 148–152, 156,
Sulemaniyya, Jewish NENA 15, 241, 312, 319, 160, 162, 163, 176, 177, 181–184, 186, 188,
321, 323, 324, 327, 329–332, 334, 335, 339– 189, 195, 196, 228, 229, 235, 240, 259, 260,
342, 344, 345, 482, 505, 510 264, 296, 303, 306, 309, 310, 315, 336, 345,
Sulemaniyya, Sorani of 270–273, 277–279 355, 357, 358, 360–366, 368, 371–374, 376,
Sumerian 237 377, 382, 383, 396, 399, 400, 406, 408, 413,
Surči [Surčī], Kurmanjî of 281, 291 416, 418, 419, 421–425, 427, 431, 432, 436,
Svan 830, 885, 891 443, 444, 574, 575, 663, 822, 825, 827, 828,
Syriac 196, 208, 219, 305, 310, 311, 351, 352, 830, 832, 833, 837, 839, 840, 855, 857, 892–
489 895, 897, 899, 902, 908–911, 915, 917, 918,
Šəno, Jewish NENA 191, 213 920, 921, 923, 927, 929, 931, 942, 949
Turkish, Ottoman (South Oghuz Turkic) 259,
T 345, 355, 363, 396, 400, 574, 895
Ṭabarī (New Tabarī) 572, 621 Turkish of Turkey (West Oghuz) 357, 399,
Tahāromi 664 400, 406, 418, 422, 423, 425, 436
Tajik [Tojiki Tājīkī] 458, 569, 570, 579, 580, 582, Turkmanja [Turkman] 237, 354, 355, 357–359,
593, 596, 597, 602–607, 609–612, 614, 616, 361, 363, 367, 374, 375, 377, 381, 383, 384,
617, 624 398, 400, see also Turkic, Iraq
Tākestāni 664, 675 Turkmen [Türkmen] 108, 354, 368, 382, 383,
Tāleš-Dolabi 664 398, 399, 405, 407
Tālešī [Tāleshi, Taleshi] 571, 578, 659, 819–821, Ṭūrōyō [Turoyo] 234–236, 305, 312
821, 823, 951 Tutkāboni 673–675, 700, 717, 728, 730, 731,
Telkepe, Christian NENA 318, 323, 324, 334, 734, 762, 771, 824
350, 352 Txuma cluster of Christian NENA 191, 198, 218
Tikab, Jewish NENA 508 Tən, Christian NENA 232, 323
Tisqopa, Christian NENA 323
Tiyare cluster of Christian NENA 191, 197–199, U
201–206, 210, 213, 215, 224, 226–228, 306 Umra d-Shish, Christian NENA 224, 234, 324
Tiyare, Lower, Christian NENA 191, 198 Urei 666, 668, 824
Tiyare, Upper, Christian NENA 191, 199, 218 Urmi, Christian NENA 17, 37–39, 191, 200–207,
Tonekābonī [Tonekāboni] 677, 688, 692, 693, 210–215, 218, 222, 224–227, 229, 230, 494
717–719, 721, 727, 730–732, 734, 743, 745, Urmi, Jewish NENA 191, 199, 202, 206, 210,
749, 757, 762, 807, 819, 824 216, 218, 222–225, 227, 228, 486, 487, 526
trans-Zab Jewish NENA 21, 26, 191, 199, 213, Uzbek 108, 570, 577, 607, 612, 616, 617
222, 223, 235, 238, 239, 241, 306, 314, 319,
321, 322, 326, 327, 339, 345, 352, 481, 486, V
492, 516, 518, see also NENA, Jewish Vafsī [Vafsi] 25, 29, 468, 480, 577, 658–660, 665,
Tur Abdin [Ṭūr ʿAbdīn, Tur ‘Abdîn], Kurmanjî 672, 676–678, 680, 681, 685–687, 691, 694,
of 31, 129 695, 698–703, 705, 709–711, 713–715, 717,
Turkî [Turki] (Iran-Turkic) 354, 398–401, 405– 719–723, 725, 728, 730–732, 738, 740, 742,
407, 412, 439, 441 743, 746, 751–755, 757–760, 763, 765–770,
Turkic languages 3, 5, 7, 8, 11–16, 18–21, 24–26, 772, 773, 775–778, 780, 784, 789, 793–795,
28, 29, 31, 33, 40, 157, 237–241, 260, 286, 797–805, 807–809, 815, 821, 822, 824, 948
337, 345, 354, 355, 357, 358, 360–371, 374, Vājārgāhi 670, 743, 766
376, 377, 382, 383, 385–390, 392–396, 398– Velātrui 660, 666, 669, 670, 678
401, 405–430, 432–439, 441, 443, 444, 446, Vidari 717
574, 575, 577, 578, 607, 610, 657, 821, 822, Viznei 671, 690, 694, 712, 790, 824
825, 827–829, 854, 860, 931, 949
Turkic, Iran 18, 24, 26, 360, 367, 371, 949
W Z
Wārmāwa, Sorani of 278, 286 Zakho, Jewish NENA 306, 313–315, 323, 324,
350
X Zakho, Kurmanjî of 137, 287, 289–291
Xatirābādi [Khatirābādi, Xatirabadi] 666, Zardayāna (variety of Gorani) 296, 298, 533, 540,
824 541
Xošābari 664 Zazaki 3, 106, 109, 149, 157, 158, 280, 281, 622,
Xərpa, Christian NENA 313, 323, 337 623, 763, 837, 840
Zincirli, Kurmanjî of 151

Index of place names
A Arbīl [Arbel, Arbil] (Iraq) 187, 237, 238, 240,
Adana (Turkey) 133 244, 264, 292, 297, 298, 306, 312, 314,
Adhaim (Iraq) 245, 297, 308 322–327, 335, 337, 339, 341, 343, 345, 351,
Adiyaman-Urfa (Turkey) 126 354, 355, 359–361, 370, 427, 947
Afghanistan 354, 399, 444, 569, 570, 577, 579, Ardabil (Iran) 578, 664
582, 623, 657 Ardal (Iran) 447, 457, 460, 461, 467, 468,
Ağrı (Turkey) 133 471–473, 480, 947
Aħmađ Āwā (Iraq) 534 Ardon (Russia) 864
Ahwaz [Ahvāz, Ahwāz, Aḥwāz] (Iran) 305, 385, Arhavi (Turkey) 832, 833, 835, 844, 852, 854,
481, 482, 484, 492, 504, 513, 514, 516, 525, 860, 949
532, 949 Armenia 31, 33, 65, 79, 80, 85, 87, 89, 93, 97, 98,
Akbudak (Turkey) 133 108, 123, 195, 196, 201
Akre [Aqra, ʕAqra] (Iraq) 187, 264, 244, Arpačī (Iraq) 298
288–291, 294, 351 Artvin (Turkey) 831
al-Qamishli (Syria) → Qamishli Asālem (Iran) 660, 663, 823
Alagir (Russia) 864 Asia Minor 13, 27, 52, 573, 893, 903, 932
Alazani river 864 Asparēz 534, 536
al-Baʿʿāj [al-Baʕʕāǧ] (Iraq) 245, 259 As-Sulaymāniyya (Iraq) → Sulemaniyya
Aleppo (Syria) 3, 28, 184, 189, 260 Astara (Azerbaijan) 660, 664
al-Hasakah (Syria) → Hasakah Australia 195, 310, 484, 569
ʿAlī Bāpīr (Iraq) 297 Awbar (Iraq) → Ambar
ʿAlî Qûrchî (south of Arak) (Iran) 412 Awroman [Awromān] (Iran) 296, 303, 566, 567
ʿAlī Sarāy (Iraq) 297 Azerbaijan 40, 190, 234, 355, 359, 362, 363, 365,
Aligudarz (Iran) 445 366, 383, 396, 398–400, 405, 406, 410, 412,
Almaty (Kazakhstan) 195, 196 413, 416, 419, 424, 428, 429, 436, 481, 659,
al-Mawṣil [al-Mauṣil] → Mosul (Iraq) 660, 663, 664, 724, 791, 822
Alqosh (Iraq) 306, 312, 323, 324, 350, 352 Āzex [İdil] (Turkey) 189
Alvir (Iran) 711, 823
Amadiye [Amêdi, Amedia, Amediye] (Iraq) 16, B
36, 37, 137, 290, 291, 306, 312–315, 317–319, Baʾadra [Badr] (Iraq) 290, 294, 354
322, 486 Ba'ashiqah (Iraq) 245
Ambar 82, 297 Badinan (Iraq) 116, 120
Āmol (Iran) 661 Baghdad [Bagdad] (Iraq) 194, 263–266, 354, 355
Anatolia 3, 5, 8, 12, 13, 16, 25, 27, 28, 31, 34, Bahzani (Iraq) 245
39, 41, 100, 106–109, 115, 123, 132, 134, 135, Bakragar (Iraq) 298
150, 156, 159, 162, 170, 181, 182, 188, 190, Baksan (Russia) 864
242, 244, 250, 303, 350, 355, 358, 361–363, Bamō (Iraq) 298
366, 367, 372, 383, 385, 386, 388, 398, 399, Bāne (Iran) 297
401, 410, 413, 416, 421, 424, 428, 429, 436, Bānī Binok (Iraq) 534
444, 825, 827, 828, 860 Baʿqūba (Iraq) 297
Anbarān (Iran) 663 Bārāmāwā (Iran) 534
Ankara (Turkey) 382, 444, 893 Bārānī Xwārū (Iran) 297
Ankawa (Iraq) 11, 239, 306, 313, 323, 324, 337, Bardī ʿAlī Xwārū (Iraq) 297
350 Barətla (Iraq) 306, 317
Antep-Adiyaman (Turkey) 117 Barwar (Iraq) 16, 20, 45, 157, 238–240, 306,
Anzali (Iran) 659 312–315, 317–320, 322, 324–327, 330, 332–
Aqra [ʕAqra] (Iraq) → Akre 334, 337–340, 342–344, 346, 351, 500, 501,
Arab Emirates 569, 625, 657 949, 951
Aradhin (Iraq) 306, 324 Barwari Bala (Iraq) 306
962 Index of place names
Barxu (Iran) 297 Chehr (Iran) 300

Barzan (Iraq) 306 Chehreqān [Chehreghān] (Iran) 661
Başkale (Turkey) 191 Chelgerd (Iran) 446
Batman (Turkey) 159 Chenār Rudxān [Chenār Rudkhān] (Iran) 664
Baxa Kon (Iraq) 534 Chikola (Russia) 862
Bayâdestân (Iran) 19, 407, 409, 412, 413, 415, China 574, 601, 623, 698
416, 418, 419, 421–424, 427, 429–439, 441, Cizre [Jazira] (Turkey) 107, 109, 187, 191, 269
444 Crimea 52, 53, 92, 574
Bayǰī (Iraq) 297 Cudi mountain [Cudi Daği] (Turkey) 191, 194,
Bədəlan [Badalan] (Azerbaijan) 661 195
Bədyal (Iraq) 306, 310
Beşköy (Turkey) 898–902, 914, 927, 930, 933 D
Beslan (Russia) 864 Dagā Šēxān (Iraq) 534
Betanure (Iraq) 306, 310, 312, 314, 323, 324, Dagāgā (Iran) 534
336, 352 Daɫ 534
Beth Arbaye (Turkey) 159 Daɫamarz 534
Bēwyān (Iran) 297 Damascus (Syria) 260, 305
Bēwyānī Gawra (Iraq) 297 Dandan Uiliq (China) 572
Bičūk (Iraq) 297 Dāquq (Iraq) 297
Bijar (Iran) 481 Dara (Iran) 534
Biɫbar 534 Dārā (Iraq) 297
Bingird (Iraq) 271, 285 Daragaz (Azeri exclave) (Iran) 399, 405
Bingöl (Turkey) 135 Daragözü (Turkey) 35, 164, 170, 181, 187
Bisotun (Iran) 300 Dāraka (Iran) 297
Bitlis (Turkey) 97 Darē Mař (Iraq) 534
Bīyāra (Iran) 534 Darmūr (Iran) 534
Bīyāra (Iraq) 534 Darwēšān (Iran) 534
Bohtan → Botan Dastenā (Iran) 461
Bokan (Iran) 481, 496, 508 Dēr Moɫē (Iran) 534, 536
Borb (Turkey) 90, 191 Derigne (Iraq) 306
Borīdar (Iran) 534 Dersim [Tunceli] (Turkey) 53, 74, 94–96, 123,
Borūǰerd (Iran) 297 127, 149, 156, 303
Botan [Bohtan] (Turkey) 14, 16, 32, 34, 35, Dēwaznāw (Iran) 534
40–42, 107, 109, 158, 191, 194, 291 Dibba [Daba] (United Arab Emirates) 625, 627
Bukan [Būkān] (Iran) 297 Dicle (Turkey) 107
Bukhara (Uzbekistan) 569 Digora (Russia) 862
Dikin (-Marāqei) (Iran) 665, 687, 694, 698, 705,
C 729, 735, 773, 775, 779, 784, 822
California (United States of America) 27, 97, 98, Diyana → Soran
102, 104, 195, 532 Diyarbakır [Diyarbakir] (Turkey) 106, 190, 421
Čamčamāł (Iraq) 298 Diyarbakir Province 190
Čamšīdar (Iran) 534 Diz (Iran) 111, 945
Caucasus 1, 3, 5, 8, 9, 26, 27, 85, 156, 195, 196, Dizāwar (Iran) 534
568, 819, 820, 822, 825, 860, 862, 865, 884, Dizlī (Dezlī) (Iran) 534
885, 890, 891 Dobe (Iraq) 306, 323, 337
Çaykara (Turkey) 893, 894, 897, 950 Dohuk [Duhok, Dohok] (Iraq) 11, 137, 138, 287,
Central Asia 72, 97, 100, 398, 573, 575, 622 289–291, 293, 303, 306, 314, 323, 947
Central Province (Iran) 399, 406, 407 Dorud (Iran) 445
Ceylanpınar (Turkey) 164 Dukān (Iraq) 297, 298
Chāl (Iran) 661 Dūšamnān (Iran) 297
Challa [Čāl] (Turkey) 191, 195, 233, 310 Dzagepbarz (Russia) 862
Charaze (Iran) 661
Index of place names 963
E Hakkari [Hakkâri, Hekari] (Turkey) 10, 108, 116,

Elāšt [Elāsht] (Iran) 821 158, 191, 193, 199, 201, 203–206, 209, 210,
Elaziğ (Turkey) 127 212, 287
Elbistan (Turkey) 121, 127, 136, 139, 141, Halabja [Halabǰa, Ḥalabja] (Iraq) 267, 295, 296,
149 301, 306, 351, 394, 397, 532, 534
Erbil → Arbil Hamadan (Iran) 406, 410, 416, 423, 578
Erzincan (Turkey) 107 Hāna (Iraq) 534
Erzurum (Turkey) 127 Hāna Garmaɫa (Iran) 534
Esfahan → Isfahan Harran (Turkey) 163, 164, 173
Eslāmābād (Iran) 297 Harsin (Iran) 27, 300, 302
Euphrates 1, 160, 237, 243, 248 Harzan [Harzand] (Iran) 711
Hasakah (Syria) 107, 161
F Hašamēz (Iran) 534
Fars, Province of 400, 416, 417 Hashtpar (Iran) 660, 820
Fereydun Shahr (Iran) 402 Hasköy (Turkey) 164, 173, 183, 189
Fuman (Iran) 659 Hawdiyan (Iraq) 314
Furqān (Iraq) 298 Hawrāmān [Hawraman, Hawromān, Awromān]
(Iran) 296, 297, 385, 386, 533, 534, 536–538,
G 540, 541, 566, 568, 947
Gahvāre (Iran) 296 Hawrāmān Žāwaro (Iran) 534
Galēn (Iran) 534 Hawrāmān-i Luhon [Luhōn] (Iran) 296, 303,
Galûgâh (Azeri exclave) (Iran) 399, 405 534
Gandomāb (Iran) 672 Hawrāmān-i Taxt (Iran) 296, 534
Gardabani (Georgia) 194 Ḥayy ʿAskarī (Iraq) 297
Ġarǰa Kōyī (Iraq) 297 Hezārrud (Iran) 672
Gawar [Gever] (official name Yüksekova)
(Turkey) 191 I
Gawraĵū [Gawraǰū] (Iran) 296, 297, 302, 304, Ilam [Īlām] (Iran) 536
534, 537, 567 India 95, 573–576, 621
Gazakān (Iraq) 297 Iori river 864
Gāzarxān [Gozärkhon] (Iran) 674, 675, 820 Iraf (district, Ossetia) (Russia) 862, 886, 887
Gaziantep (Turkey) 151 Iran 1, 8, 10, 12–14, 18–21, 24–27, 29, 31, 32,
Georgia 9, 12, 65, 194, 195, 303, 388, 400, 413, 34, 40, 65, 106, 108, 156, 190, 191, 195, 196,
830, 861, 948, 949 198, 199, 202, 206, 213, 216–218, 222–224,
Germany 195, 553, 609, 618, 620, 893 226–229, 238, 241, 267, 270, 273, 295, 296,
Geylavān (Iran) 662 298, 302, 304–306, 309, 311, 316, 331, 332,
Gilan (Iran) 577, 578, 663–665, 821 354, 358, 360–369, 371, 372, 377, 378, 383,
Girāɫ (Iran) 534 385–388, 392–396, 398–401, 405–407, 409,
Girdəni [Girdani] (Azerbaijan) 661 410, 412, 413, 416–419, 421, 423–425,
Gořgayī (Iran) 534 427–430, 433, 436, 444–447, 478, 480–482,
Gōřī Ginūž (Iraq) 297 484, 486, 507, 510, 516, 518, 525, 526,
Great Britain 573, 575 532–534, 536–538, 566–580, 582, 608,
Great Zab river 238, 306 620–625, 627, 649, 657–660, 663, 664, 687,
Gullī (Iraq) 291 693, 794, 796, 815, 819–824, 828, 932, 947,
Guɫp (Iraq) 534 949
Gurān (5) (Iran) 661 Iraq 1, 8, 12–14, 16, 18–21, 24, 25, 34, 36, 37,
Gurchān (Iran) 665, 676 41, 106–108, 116, 121, 126, 134, 137, 160,
162, 188, 190, 191, 194, 196, 199, 204–206,
H 209–215, 217, 218, 222, 223, 225, 237, 240–
Haftgel (Iran) 446 244, 246, 248, 252, 254, 263–265, 267, 269,
Ḥāǰī Miṣṭafā (Iraq) 297 270, 273, 281, 287, 295, 296, 298, 299, 301,
Haǰīǰ (Iran) 534 305, 306, 309–317, 319–321, 323, 324, 329,
331–334, 336, 342, 345, 350, 352, 354, 355, Kermanshah Province (Iran) 10, 27, 302, 385,
357, 358, 360–364, 366, 367, 369, 372, 377, 386, 394, 396, 481, 537
382, 383, 385, 388, 389, 395, 398–401, 407, Khabur river 194
409–416, 418, 419, 421–429, 434–436, 444, Khanaqin → Xanaqīn
481, 482, 484, 487, 504, 510, 516, 518, 521, Khasab (Oman) 625, 627, 628, 947
533, 534, 537, 563, 569, 825, 827, 828, 837, Khazhnidon (Russia) 862
947, 949 Khoin [Xo’in] (Iran) 661
Isfahan [Esfahan] (Iran) 366, 385, 399–401, 406, Khorasan [Khorâsân] (Iran) 108, 157, 400, 405,
412, 415, 416, 423, 424, 438, 439, 445, 479, 582
762, 822 Khorramshahr (Iran) 19, 21, 481, 482, 484, 487,
Israel 195, 196, 246, 309, 311, 352, 484, 569 489–493, 504, 513–515, 517–522, 524, 525,
Istanbul (Turkey) 157, 187, 195, 357, 383, 444, 529, 531, 949, 951
860, 893, 910, 911, 915, 924 Khoynarud (Azerbaijan) 661
Izeh (Iran) 445 Khuzestan [Khuzestān] Province (Iran) 385, 445,
478, 479, 481, 484, 485, 490, 492, 947
J Kifrī [Kifri] (Iraq) 298, 421
Jamshidābād (Iran) 662 Kilit, Ordubad (Azerbaijan) 659, 664
Jazira → Cizre Kinderib [Söğütlü] (Turkey) 188
Jowkandān (Iran) 663 Kirkuk [Kirkūk] (Iraq) 194, 265, 354, 355,
JubʿAdin (Syria) 305 358–361, 380, 381, 383, 536
Juban (Iran) 673 Kolur [Qolur] (Iran) 663
Kordestan Province (Iran) 10, 385, 386, 394, 396,
K 406, 481
Kabarlū (Iraq) 297 Kordxeyl [Kordkheyl] (Iran) 668
Kabate [Kafti] (Iran) 665 Koy Sanjak [Koy Sanjaq, Koy Sanǰaq]
Kabul (Afghanistan) 399, 412, 531, 607, 612 (Iraq) 296–298, 305, 306, 312, 313, 315, 316,
Kajal (Iran) 711, 823 323, 352
Kalār (Iraq) 297, 298 Kōya (Iraq) 297, 298
Kalāsur (Iran) 661, 672 Kozluk (Turkey) 13
Kalǰī 534 Krosnodar (Russia) 194
Kamāɫa 534 Kūhdašt (Iran) 297
Kandelus (Iran) 669, 670, 820 Kuhrang (Iran) 457, 480
Kandūla [Kandūle] (Iran) 297, 534, 537 Kumzar [Komzar, Kamzar] (Oman) 625, 627,
Kānī Māz (Iraq) 297, 298 947
Kānī Sīrna (Iraq) 297 Kur river 864
Karakoçan (Turkey) 122, 123
Karan (Iran) 662 L
Kařāwa (Iran) 534 Lāhijan [Lahijan, Lāhijān] (Iran) 659, 668, 820
Kargānrud (Iran) 663 Lake Urmi [Urmī, Lake Urmia, Lake Urmiye]
Karnaq (Iran) 662 (Iran) 85, 190, 267, see also Urmi
Kars (Turkey) 120, 399 Lake Van (Turkey) 16, 108, 191
Kaymna (Iran) 534 Langerud (Iran) 659
Kayseri (Turkey) 133 Larak Island (Iran) 625
Kelārdašt [Kelardasht] (Iran) 659, 820 Laylān (Iraq) 298
Kelās [Kalās] (Iran) 584, 665 Lenkoran [Lənkəran] (Azerbaijan) 660, 663, 824
Kerend (Iran) 481, 496, 497, 503, 508, 512, 532 Lerd (Iran) 662
Keringān (Iran) 671, 823 Lerik (Azerbaijan) 663, 716, 807, 814, 824
Kerman (Iran) 399 Lesken (Russia) 862, 886, 887
Kermanshah [Kermanšāh, Kermānšāh, Levandavil (Iran) 663
Kermānshāh] (Iran) 10, 27, 296, 302, 374, Little Zab river (Iraq) 245, 268
385, 386, 394, 396, 416, 481, 534, 537, 541, Lordegān (Iran) 461
577, 578, 623 Lotf-âbâd (Azeri exclave) (Iran) 399, 405
M Nāwa (Iran) 534

Ma’rib (Yemen) 625 Nawsūd (Iran) 385, 534, 536–539, 541, 542, 544,
Mačiān [Māchiān] (Iran) 669, 670, 820 548, 549, 554
Maçka (Turkey) 894 Naysāna (Iran) 534
Mahabad [Mahābād] (Iran) 191, 270 Nerwa (Iraq) 11, 306, 310, 312–314, 323, 324,
Maka (Oman) 625 353
Maku (Iran) 133 Nineveh plains (Iraq) 296
Malā Řaḥmān (Iraq) 297 Nodša [Notša] (Iran) 534, 537, 541, 549
Malatya (Turkey) 133 Noubarân (Iran) 435, 441, 442
Maʿlula (Syria) 305 Nowkiān (Iran) 662
Mandali (Iraq) 14, 357, 363, 427 Nowsūd (Iran) 297
Maraš [Mereš, Maraş, Maras] (Turkey) 52, 95, Nūrābād (Iran) 297
96, 116, 148, 156, 166 Nuṣaybīn [Nasibin] (Turkey) 159
Mardin (Turkey) 14, 16, 31–35, 38, 39, 106, Nwēn 534
116, 139, 141, 145, 152, 159, 160, 163–172,
174–180, 187, 189, 190, 196, 949 O
Marīvān [Marīwān] (Iran) 534, 538, 541 Of (Turkey) 892, 894, 896
Māsāl [Masal] (Iran) 660, 664, 707, 711, 821, 951 Oman 385, 625, 636, 638, 649, 656–658, 947,
Masally (Azerbaijan) 659, 660, 663 951
Masjed Soleymān [Masjed Soleyman] (Iran) 385, Oskulak (Iran) 664, 665, 673, 786, 824
445, 447, 449, 457, 947
Māsule [Mâsule] (Iran) 660, 664, 820 P
Maʿtīq (Iraq) 297 Pakistan 569
Mazandaran (Iran) 577, 578, 820 Para (Turkey) 8, 33, 128, 202, 314, 368, 379, 902,
Māzībin (Iran) 534 916, 928
Mazun (Oman) 625 Pareh Sar (Iran) 662
Medina (Saudi Arabia) 625 Pave [Paveh, Pāwa, Pāve] (Iran) 296, 297, 534,
Mesopotamia 1, 4, 65, 159, 160, 237, 243, 269, 537, 568, 573
305, 352, 355, 571, 825, 827 Pāyravand (Iran) 300
Mēxās (Iraq) 297 Persian Gulf 623, 657, 717
Midin (Turkey) 208, 209, 949 Persis [Fārs] (Iran) 571
Midyat (Turkey) 107, 133, 144, 148, 152, 160, Peshtətuk [Peshtatuk] (Azerbaijan) 661
171, 190, 195, 207–209, 213 Pīrānšahr (Iran) 297
Mlahso [Mlaḥso, Mlaḥsô] (Turkey) 32, 41, 42, Pirasora [Pirəsorə] (Azerbaijan) 756, 814
45, 190, 195, 197, 198, 204–207, 209, 212, Piždar (Iraq) 271, 285
213, 220, 221, 234, 526
Miǰūryānī Ambar (Iran) 297 Q
Mosul (Iraq) 23, 222, 240, 243, 244, 246–250, Qabul → Kabul
252–260, 269, 270, 295, 296–298, 306, 309– Qādir Karam (Iraq) 398
311, 325, 334, 340, 345, 351, 352, 355, 357, Qaɫāgā (Iran) 534
533, 947, 949 Qaɫaǰē (Iran) 534
Muš [Muş] (Turkey) 8, 39, 50, 52, 54, 61, 62, 79, Qałama (Iraq) 297
80, 90, 94, 100, 102, 132, 159 Qalʿat Ḥasanʾābād (Iran) 482
Musandam (waliyat, province) (Oman) 385, 625, Qaleh Gözäldareh [Qalʿä Gözäldärä] (Iran)
627, 658, 948, 951 424
Musandam peninsula (Oman) 625 Qāmīš (Iraq) 297, 298
Mutki (Turkey) 182 Qamishli (Syria) 133, 268, 307
Qarāmīn (Iraq) 297
N Qaraqosh (Iraq) 10, 11, 240, 306, 309, 311–313,
Naltchik (Russia) 864 315–324, 326, 328, 329, 333, 334, 337, 340–
Narwē (Iran) 534 345, 351
Nāvrud (Iran) 663 Qarluq (Iran) 402
Qarṭmīn (Yavantepe) (Turkey) 36, 37, 43, 44, Sargat (Iraq) 534
173, 180, 181 Sāri (Iran) 821, 823
Qašqa (Iraq) 298 Šār-i Hawrāmān (Iran) 534
Qasr-e Širin [Qaşr-e Şīrīn] (Iran) 481 Sarqałā (Iran) 297
Qazvīn [Qazvin, Qazvîn] (Iran) 399, 400, 410, Sarqizil (Iraq) 297
482, 485, 578, 664, 665, 673 Sar-u Pīrī 534
Qolur → Kolur Šarya (Iraq) 290
Qom (Iran) 406 Sason (Turkey) 13
Qorve [Qorveh] (Iran) 945–947, 949 Šawak (Iraq) 298
Qorveh (Iran) 945–947, 949 Say Miṣafā (Iraq) 297
Quetta (Pakistan) 569 Šāy Tōtyā (Iran) 297
Quɫē (Iraq) 534 Sayyādlar [Sayyadlar] (Iran) 663, 712
Sē Girdkān (Iraq) 297, 298
R Şemzinan [Şemdinli, Şemzînan] (Turkey) 13,
Řamazān (Iraq) 297 33–35, 41, 108, 120–122, 125, 132, 135,
Rāmsar (Iran) 659, 668 137–139, 144–146, 148
Rasht (Iran) 659 Senna (Iran) → Sanandaj
Razajerd (Iran) 661 Šəno (official name Ushnuye) (Iran) 191, 213
Řazāw (Iran) 534 Sərək [Sarak] (Azerbaijan) 661
Rewandiz [Ruwanduz] (Iraq) 292, 306, 323 Šēxān (Iran) 534
Rezvānshahr (Iran) 662 Shahe (Iraq) 310
Rostamābād (Iran) 662 Shahrān (Iran) 662
Rudbār [Rudbâr] (Iran) 659, 660, 664, 665, 667, Shahr-e Kord (Iran) 445, 479
673, 820 Shāl (Iran) 662
Ruma (Turkey) 14, 34, 42, 191, 203, 207, 216– Shāndermen (Iran) 664
218, 228 Shapat (Turkey) 314
Rustaqa (Iraq) 306, 323, 335–337, 351 Shaqlawa (Iraq) 306, 323, 334
Sheker (Russia) 862, 886, 887
S Shino (Iran) 107
Sablagh → Mahabad Shiraz [Shirâz] (Iran) 400, 434
Sagzābād (Iran) 661 Shirqat (Iraq) 245
Sainqala (Iran) 481 Shwata (Turkey) 191
Salamas [Salmâs] (Iran) 13, 34, 40, 191, 195, Siāvarud (Iran) 662
204, 235, 396 Siirt (Turkey) 13, 35, 159, 162, 165, 170–172,
Samarkand (Uzbekistan) 569 187, 191, 234
Šamməsdin (Turkey) 191, 202–204, 213 Silēn (Iran) 534
Šāmyān (Iran) 534 Sinjar (Iraq) 244, 290
Sanandaj [Sanandaǰ] (Iran) 10, 15, 19–21, Sinjar mountains (Iraq) 244
23, 269, 306, 351, 386, 387, 389–394, 397, Sipasar (Iraq) 298
481, 482, 484–497, 499–501, 503–512, Şırnak (Turkey) 187, 191
516–519, 521–524, 526, 527, 532, 534, ' Sivas (Turkey) 106
577, 949, 951 Siverek (Turkey) 173, 188
Sandu (Iraq) 310 Solduz (official name Naghade) (Iran) 191
Şanlıurfa [Urfa, Edessa] (Turkey) 159, 162–165, Sonqor (Province of Kordestan) (Iran) 368, 399,
173, 182, 184, 188, 264, 416, 949 406, 407, 409, 412, 413, 415, 416, 421, 422,
Săqəz [Saqqez] (Iran) 481, 494, 497 428, 429, 434, 437–441, 949
Sarachos (Turkey) 897, 898, 900, 902–904, Soran [Diyana] (Iraq) 133
920 Sosakān (Iraq) 534
Sar Āqā Seyyed (Iran) 448, 449 Šošmē (Iran) 534
Sar Pol-e Zahāb (Iran) 296, 297, 533 Soviet Union 195, 570
Sarak [Sərək] (Azerbaijan) 664 Sufaya (Iraq) 297
Sardasht (Iran) 107 Sulak river 864
Sulemaniyya (Iraq) 15, 107, 270–273, 277– Tur Abdin [Ṭūr ʿAbdīn, Tur ‘Abdîn] (Turkey)
279, 297,305, 306, 312, 313, 315, 319, 321, 129, 162, 181
323, 324, 327, 330–335, 339–342, 344, 345, Turkey 1, 10, 14, 31, 40, 65, 72, 95, 102, 104,
351, 394, 397, 481, 482, 487, 505, 510, 532, 106–109, 114, 116, 122, 137, 145, 156, 157,
535 159, 162, 187, 188, 190, 191, 193–196, 198,
Surči [Surčī] (Iraq) 281, 291 204, 218, 228, 264, 287, 290, 291, 303, 305,
Sürmene (Turkey) 892–895 306, 309, 310, 312, 314, 315, 355, 357, 383,
Sweden 195, 196, 484 398–400, 406, 418, 422, 423, 425, 436, 481,
482, 569, 570, 825, 830, 832, 860, 862, 892,
T 893, 931, 932, 949, 950
Tabaristan (Iran) 572, 622 Tutkābon (Iran) 662
Tabriz (Iran) 12, 388, 400, 406, 407, 413, 416, Tūz Xurmātū [Tuzkhurmatı, Tuzkhurmati, Tuz
820 Khurmatu] (Iraq) 355, 358–360, 376, 378, 421
Tahārom (Iran) 662 Tuzlagözü (Turkey) 170
Tajikistan 569, 570, 577, 579, 582
Tākestān (Iran) 663, 664 U
Tal’afer [Talafer, Telafer, Tel’afer] (Iraq) 354, United Arab Emirates 625, 657
355, 358 Ure (Iran) 660, 669, 670
Tāɫawān (Iran) 534 Urfa → Şanlıurfa
Tāleš-Dolab (Iran) 664 Urmi [Urmia] (Iran) 9, 12, 13, 17, 18, 28, 33,
Tallkayf [Tall Kayf] (Iraq) 246 34, 37–40, 42, 45, 59, 64, 81, 82, 85, 86, 94,
Tangī Ḥamām (Iran) 297 190, 191, 194–196, 198–207, 210–216, 218,
Tapa Čarmē (Iraq) 297 222–230, 235, 267, 309, 311, 316, 351, 352,
Taqtaq (Iran) 297, 298 386–388, 395–397, 413, 481, 482, 486, 487,
Tāw Doɫaw (Iran) 534 494, 526, 949, 951, see also Lake Urmi
Tawēɫa (Iraq) 534 Ursdon-Sindziqaw (Russia) 863
Tawrēbar (Iran) 534 USA (United States of America) 195, 484, 569
Tāzāwā (Iran) 534 Usarāʾ al-Mafqūdīn (Iraq) 297
Tbilisi (Georgia) 45, 195, 303, 860, 890 Uzbekistan 569, 570
Tehran (Iran) 195, 401, 431, 478–480, 484, 541, Uzungöl → Sarachos
566, 567, 578, 581, 596, 613, 615, 621, 624,
663, 704, 819–823 V
Telkepe (Iraq) 306, 318, 323, 324, 334, 350, Vafs (Iran) 663, 676, 699, 703, 759, 768, 769,
352 794, 809
Terek River 864 Vājārgāh (Iran) 670
Tergawar (Iran) 191 Van (Turkey) 1, 7, 15, 16, 22, 24, 33, 51, 52, 59,
Tēžtēž, (Iran) 534 64, 80–85, 90, 93, 97, 98, 102–104, 106–108,
Tifɫīya (Iran) 534 117, 191, 387, 389–391, 395, 421, 457, 458,
Tigris 1, 10, 19, 23, 31, 103, 190, 194, 228, 237, 480, 625, 628, 632, 639, 649, 656, 891, 948
243, 244, 246, 247, 252, 305, 313, 827 Varto (Turkey) 133
Tikrit [Tikrīt] (Iraq) 264, 309 Varzob (Tajikistan) 607
Tillo (Turkey) 160, 162–164, 177, 178, 180–183 Velatru [Velātru] (Azerbaijan) 821
Toldzgun (Russia) 862 Vidar (Iran) 823
Tonekabon (Shahsavār) (Iran) 659 Viranşehir (Turkey) 164
Tonya (Turkey) 893, 894 Vizne (Iran) 663
Topzawa [Tōpzāwa] (Iraq) 298, 301 Vladikavkaz (Russia) 863, 889, 890
Trabzon (Turkey) 72, 893, 897, 906, 931, 933,
950 W
Tskhinval (South Ossetia, Georgia) 864 Wallagkom valley (Russia) 862
Tūlaban (Iraq) 297 Wardak (Iraq) 297
Tuləguvan [Tulaguvan] (Azerbaijan) 661 Wārmāwa (Iraq) 278, 286
Tunceli → Dersim Wasē (Iran) 534
Waysīā 534 Yerevan (Armenia) 45, 92–105

Wēła (Iraq) 298 Yünlüce → Mlahso
West Azerbaijan province (Iran) 190, 396, 481
Z
X Zab → Great Zab river
Xānagā (Iran) 534 Zagros (mountains) (Iran) 269, 385, 445, 478,
Xanaqīn [Khanaqin, Xānaqīn] (Iraq) 295–298, 533, 947
306, 357, 427 Zakho (Iraq) 137, 287, 289–291, 306, 313–315,
Xatirābād [Khatirābād] (Iran) 661 323, 324, 350
Xirosa (Iran) 534 Zalka (Iran) 534
Xorramābād (Iran) 297 Zaɫm (Iraq) 534
Xošābar (Iran) 664 Zanjan [Zanǰān] (Iran) 410, 578
Xošāw (Iran) 534 Zaqar (Iraq) 297
Xurmāl 534 Zarda [Zarde] (Iran) 296, 297, 304, 534, 537, 567
Zēbārī (tribal confederation) 288
Y Zincirli (Turkey) 151
Yemen 625 Žīwār (Iran) 534
Yengi Tis‘în (neighbourhood in Kirkuk) Zom (Iran) 534
(Iraq) 358, 360 Žūrū (Iraq) 297

Index of subjects
A 852, 876, see also accusative (alignment),
ablative 68, 83, 364, 372, 382, 415, 418, 419, double oblique alignment
429, 430, 443, 594, 707, 807, 818, 835, 838, alignment splits 770, 772
868, 876, 888, 919 allophone 9, 197, 312, 360, 362, 408, 409, 411,
accusative 44, 68, 77, 79, 83, 92, 131, 207, 217, 448, 449, 485, 539, 581, 582, 677–679, 832
221, 241, 330, 364, 382, 393, 395, 408, 418, analytical attribution 253
419, 426, 443, 526, 772, 778, 780–783, 818, animacy 585, 597, 697, 708, 757, 770, 775, 867,
826, 827, 852, 868, 888, 902–906, 919, 921, 901, 904, 932
931 aorist 70, 71, 74–76, 78, 80, 89, 91, 365–367,
accusative (alignment) 44, 330, 331, 334, 393, 369, 372, 382, 396, 421–424, 430, 443, 840,
395, 526, 770, 771, 784 845, 848, 859, 888, 911–915, 920, 926, 931
accusative (case) 780, 781, 783, 919 applicative 847–849, 853, 859, 873
active (voice) 42, 169, 171, 186, 188, 219, 220, Arabic script 270, 417, 570, 572, 573, 607
251, 255, 263, 328, 332, 393, 506, 513, 632, areal convergence 9, see also convergence,
927 contact
adjective 37, 38, 118, 173, 220, 369, 389, 419, areal diffusion 8, 10, 12, 13, 15, 27, 29, 156, 502,
428, 429, 465, 466, 477, 504, 514, 517, 519, 526, 822, 829, 860
543, 545, 548, 565, 611, 612, 630, 631, 643, aspect 41, 126, 138, 140, 145, 149, 183, 220,
701, 702, 769, 770, 839, 902, 904, 918 221, 252, 282, 302, 371, 372, 391, 396, 420,
adjective, comparative → comparative 431, 438, 439, 456, 457, 513, 514, 532, 549,
adjective, superlative → superlative 632, 633, 636, 637, 642, 648, 657, 733, 827,
adposition 115, 120, 131, 155, 282, 283, 394, 840, 871, 891, 912–916, see also imperfective,
428, 544, 696, 705, 760, 764, 805, 818, 826 perfective, progressive
adverb 419, 456, 545, 548, 551, 585, 593, 607, aspiration 9, 316, 363, 410, 412, 540
630, 631, 644, 685 assimilation 13, 83, 179, 210, 238, 271, 300, 363,
adverbial 226, 227, 343, 371, 395, 396, 429, 454, 364, 407, 408, 414–416, 418, 420, 421, 494,
470, 471, 525, 586, 596, 631, 647–649, 687, 495, 509, 543, 579, 630, 719, 866, 894
795, 796, 798–802, 820, 851, 856, 857, 897, attributive 27, 37, 43, 44, 239, 372, 389, 397,
917, 928 433, 504, 545, 851
adverbial clause 226, 227, 371, 395, 396, 470, auxiliary 129, 149, 170, 332, 341, 391, 422, 437,
471, 525, 647–649, 687, 796, 798–802, 820, 460, 461, 477, 519, 603, 609, 636, 738, 767,
856, 857, 928 818, 873, 915, 930, 931
agent 44, 241, 372, 373, 394, 420, 422, 423, 429,
432, 544, 547–550, 552, 609, 631, 689, 696, B
727, 773–779, 782, 786 benefactive 18, 122, 123, 143, 282, 284, 369,
agentive 218, 332, 505, 507, 508, 513 393, 594, 597, 636, 805, 807, 843, 848, 849,
agreement 44, 116, 125, 126, 131, 145, 221, 241, 916
261, 264, 267, 273, 282–286, 291, 294, 298, bilabial(s) 113, 137, 164, 247, 287, 361, 407, 408,
315, 325, 331, 510, 520, 543, 565, 584, 603, 409, 410, 414, 540, 627, 629, 865
776, 777, 779, 822, 826, 841–843, 847, 848, bilingualism 181, 196, 246, 300, 577, 578, 677,
852, 872, 875, 876, 901, 904, 911, 918, 926, 704
932 binding (syntax) 151, 878
agreement, in number 116, 291, 294, 584, 603 borrowing 6–8, 18, 26, 28, 29, 150, 151, 156,
agreement trigger 835, 842, 843, 846–848, 852, 157, 240, 251, 345, 351, 532, 566, 583, 664,
872 689, 865, 884, 885, 895
alignment 6, 28, 44, 131, 156, 241, 285, 303, breaking of vowel 319, 388, 491, 492
330–332, 334, 335, 345, 350, 383, 393–395, broken (Arabic) plural 7, 8, 586, 587
464, 478, 507, 525, 526, 552, 553, 689, 693,
698, 711, 756, 770–782, 784, 820, 826, 837,
970 Index of subjects
C 371, 372, 375, 394, 429, 435, 436, 455,

case 15–17, 37, 40, 44, 50, 73, 114–116, 118, 119, 471, 544, 547, 548, 551, 591, 594, 601,
120, 131–133, 135, 137, 147, 166, 239, 249, 614, 636–639, 641, 642, 647, 648, 650,
273, 285, 291, 293, 301, 331, 332, 364, 365, 795, 796, 802, 803, 806, 843, 853,
368, 370–373, 377, 394, 396, 413, 417–419, 855–857, 875, 881, 919, 922–924,
426–429, 431, 432, 437, 451, 553, 543–545, 926–928
547, 553, 571, 585, 587, 594, 631, 687, 689– complement clause 226, 227, 276, 284, 344, 371,
691, 694, 696–701, 709, 710, 755, 756, 763, 372, 375, 429, 435, 471, 647, 648, 650, 803,
764, 770, 772, 774, 778–783, 791, 793, 794– 855, 856, 881, 922–924, 928
796, 808, 825, 826, 827, 834–838, 841–843, complementizer 155, 284, 302, 471, 477, 553,
846, 847, 850–854, 866–868, 870, 875, 876, 621, 829, 855, 859, 881, 888, see also comple-
877, 885, 900, 902, 904–906, 919, 929 ment clause
case, see also ablative, accusative, dative, complex predicate 22, 130, 134, 151, 156, 171,
genitive, instrumental 273, 293, 912, see also light verb
case form 37, 691, 700, 709, 778–781 compound 275, 302, 377, 437, 450, 454, 514,
case marking 114, 115, 116, 166, 239, 249, 520, 545, 546, 565, 602–604, 608, 609, 624,
331, 332, 370, 394, 417, 451, 533, 689, 698, 632, 635, 636, 638, 645, 702, 706, 734, 737,
772, 783, 793, 795, 825–827, 847, 852, 854, 738, 755, 768, 769, 839, 908
866, 868, 870, 875, 929 compounding 702
case morphology 16, 17, 364, 377, 418 compound marker 546, 565
case suffixes 147, 368, 372, 373, 415, 417, conditional 70, 149, 150, 155, 177, 187, 227,
426, 429, 431, 432, 838, 868 258, 279, 292, 302, 344, 345, 382, 435, 438,
case system 585, 690, 691, 694, 696, 819, 827, 439, 444, 523, 525, 552, 608, 610, 618–620,
868, 885 649, 688, 723, 724, 729, 732, 742–746,
causative 124, 169, 330, 351, 505, 513, 548, 690, 748–752, 796, 797, 800, 801, 818, 837,
765, 768, 769, 818, 840, 846, 849, 859, 873, 857, 871, 897, 930, 931
887, 917, 921 contact, language → language contact
Christians 12, 28, 45, 99, 160, 162, 163, 190, 191, contact-induced change 6, 22, 25, 26, 31, 150,
194, 195, 235, 244, 246, 305, 309–311, 314, 358, 361, 363, 368, 385, 386, 401, 407,
344, 352, 357, 397, 481, 482, 485, 894, 920, 410–412, 479, 860, 903, 923
951 contact-phenomena 181, 259
citations 468, 471 continuous 6, 51, 69, 96, 171, 252, 293, 350, 457,
clause chains 636, 647–649 551, 578
cleft construction 337 converb 871, 873, 888
clitic 15, 17–19, 22, 35–37, 44, 117, 118, 125, convergence 6, 9, 13–15, 17, 25, 29, 35–38, 40,
126, 130, 135, 147, 150, 240, 251, 273, 274, 42, 44, 45, 228, 238, 240–242, 298, 316, 321,
279–283, 285, 286, 293, 299–302, 304, 330, 332, 345, 383, 385, 388, 390, 393–396, 407,
368, 369, 391, 426, 434, 452, 464, 477, 512, 444, 446, 447, 490, 526, 534, 687, 821, 822,
515, 526, 541, 546, 547, 552–554, 556, 584, 829, 891
591–594, 603, 730, 731, 786, 789, 790, 794, coordination 465, 468, 469, 620, 646, 647, 657,
827, 854, 872, 877, 888, 905–908, 911, 923, 658, 821, 879
924, 931, 932 copied Iranian specifiers in {+AkA} or {+i} 15,
clitic doubling 877, 932 368, 374, 389, 426, 434, 543, 545
clitic pronoun → pronoun, clitic copied Iranian enclitic personal pronouns 368,
coda 319, 449, 450, 629, 630, 634, 652, 653, 426
899 copula 18, 20, 39–44, 50, 51, 69, 74, 78, 86,
code-switching 12, 151, 156, 311, 345, 621, 832, 87, 92, 126, 139, 140, 145, 151, 155, 172,
886, 887 181, 183, 186, 207–211, 216–218, 220–223,
comparative (adjective) 7, 8, 151, 370, 419, 426, 235, 240, 252, 253, 263, 266, 301, 302, 320,
584, 585, 589, 632, 643, 923, 929 333–339, 341, 351, 366, 382, 390–392, 414,
complement 18, 22, 115, 120, 122, 123, 125, 416, 420–422, 424, 425, 443, 455, 460–462,
226, 227, 276, 281, 283, 284, 337, 340, 344, 509–512, 515–519, 526, 547, 551, 552, 565,
Index of subjects 971
584, 603, 605–607, 613, 614, 621, 695, 718, 243, 244, 247–249, 252–254, 259, 260, 263–
725, 733, 734, 757, 758, 783, 790, 803, 818, 266, 270–273, 281, 285–290, 294, 296, 303,
827, 873, 888 306, 310, 311, 313–315, 317, 320, 322–324,
copying 8, 182, 368, 433, see also borrowing, 328, 335, 339, 343, 345, 346, 350–352, 361,
copied 368, 382, 383, 385, 387, 390–392, 394, 396–
counterfactuals 743, 751, 919, 920, 926 398, 412, 413, 444, 445, 447–449, 457, 467,
468, 471–473, 479–482, 487, 492, 493, 504,
D 505, 510, 514–516, 521, 524–527, 529, 531,
dative (case) 18, 68, 83, 92, 330, 332, 364, 367, 532, 534, 567, 577, 607, 628, 656–659, 663–
368, 370–372, 382, 393, 394, 418, 419, 426– 667, 670, 673, 676, 711, 724, 757, 820–824,
428, 431, 443, 505, 515, 795, 826, 835, 837, 832, 860, 862, 863, 878, 889, 890, 897, 932,
838, 842, 843, 847–849, 852, 853, 859, 868, 933, 949, 951
873, 876, 888, 933, 948 dialect continuum 107, 134, 673
definite article 17, 35, 92, 172–174, 179, 180, diaspora 106, 109, 157, 234, 237, 246, 385, 574,
182, 186, 207, 209, 212, 213, 263, 274, 322, 892, 893
323, 368, 389, 426, 452, 501, 502, 588, 900, differential object marking (DOM) 302, 462, 689,
903, 904, 906–908, 918, 919, 931 6693, 770–773, 775–777, 867, 876
definiteness 15, 16, 21, 114, 239, 273–275, 284, diphthong 13, 34, 62–64, 204, 210, 318, 491, 492,
389, 451, 452, 465, 478, 543, 545, 546, 585, 494, 495, 497, 581
588, 642, 703, 775, 903, 932 direct case 115, 116, 120, 131–133, 147, 293,
deictic copula 39, 41, 43, 209, 211, 334 544, 696, 755, 774, 780, 781, 818
deixis 16, 35, 39, 208, 210, 211, 321, 322, 388, direct object 5, 18, 21–26, 130–132, 169, 176,
389, 496–498 221, 223, 240, 241, 250, 254, 280, 281, 283,
delabialization (of rounded vowels) 363, 396, 411 301, 302, 333, 334, 345, 368, 373, 394, 426,
demonstrative 16, 17, 35–39, 116–118, 120, 122, 435, 452, 464, 465, 467, 518–520, 547, 548,
139, 141, 155, 184, 186, 214, 234, 263, 274, 588, 590, 591, 595, 597–600, 614, 635, 638,
275, 284, 300, 302, 304, 323, 324, 327, 364, 693, 710, 827, 854, 868, 876, 877, 903, 906,
365, 374, 375, 382, 388, 389, 394, 408, 419, 921
443, 452, 454, 455, 462, 465, 496–498, 503, direct object marking 467, 590, 595, 597
542, 543, 545–547, 554, 555, 565, 588, 590, ditransitive 791, 793, 921
592, 612, 616, 621, 630, 631, 643, 656, 692, documentation 295, 658, 832, 860, 892, 894, 933
693, 733, 818, 837, 838, 852, 859, 882, 883, double oblique alignment 689, 773, 775–780,
905–907, 918, 931 782, 789
demonstrative pronoun 16, 17, 35, 38, 214, drawl (suprasegmental intonation pattern) 367,
234, 300, 323, 324, 327, 364, 365, 374, 375, 426, 433
388, 389, 394, 408, 419, 455, 462, 496, 497, dual number 166, 174, 186, 249, 263, 588, 632
503, 546, 547, 5590, 592, 631, 837, 838, durative 252, 584, 606, 611, 621, 712, 715–718,
905, 906 729–731, 734, 736, 818
derivational verbal stem 215, 219, 220, 328, 329
determiner 116, 284, 902 E
deverb 632, 633, 635, 638, 642, 643, 645, 648 echoic 839, 840, 859
devoicing 61–64, 90, 112, 127, 165, 248, 492, ejective 10, 825, 865, 884, 885
540, 543, 866, 900 emphatic 315–317, 367, 408, 419, 452, 583, 593,
devoicing of consonants 248 605, 621, 627, 628, 631, 646, 818, 888
dialect 8–10, 12, 13, 18, 19, 21, 23, 25, 28, enclitic indefinite marker 182
32–35, 40, 42, 44, 45, 47, 48, 51–53, 62–66, endangered language 3, 163, 246, 270, 310, 385,
69, 70, 72, 73, 75, 77, 79–81, 85–89, 92–105, 484, 533, 822, 825, 832, 892, 893, 933
107–109, 116, 121, 122, 125, 129, 132–134, equative (case) 338, 365, 372, 418, 419, 868, 888
137–139, 141, 145–147, 149, 152, 156, 157, ergative 6, 44, 131, 132, 156, 241, 294, 331, 332,
159, 160, 163, 164, 169, 170, 172, 173, 177, 345, 393–395, 507, 508, 510, 511, 526, 689,
179, 181–183, 187, 188, 190, 191, 195–201, 711, 756, 773–780, 826, 835, 837, 838, 852,
203, 210, 217, 220, 224, 230, 233–235, 241, 853, 859, 885
ergative agreement 241 518, 570, 577, 598, 603, 605, 606, 609–611,
ergative alignment 44, 331, 332, 345, 393, 615–619, 621, 628, 633, 665, 678, 688, 712,
394, 526, 711, 773, 774 713, 715, 716, 718, 724–727, 737, 781, 790,
ergativity 6, 108, 131, 132, 156, 157, 235, 241, 818, 845, 850, 851, 859, 871, 873–875, 881,
285, 332, 350, 351, 394, 395, 531, 532, 689, 888, 911–914, 920, 925, 930, 931
711, 773, 774, 778, 827, 828, 891
evidentiality 183, 395, 479, 639, 656, 840, 844, G
871 geminate 583
evidential/inferential copula in {-ImIš} 366, gender 15, 16, 35, 108, 114, 117–120, 139–142,
424 146, 147, 157, 167, 180, 207, 208, 210, 239,
experiencer 150, 220, 286, 287, 547, 551, 603, 249, 254, 273, 291, 293, 301, 321–326, 330,
693, 696, 755, 756, 790, 843, 876 335, 337, 389, 451, 497, 501, 502, 510, 515,
experiencer verbs 755, 790 516, 533, 543–545, 550, 552, 555, 557, 571,
ezāfe [ezafe, ezâfe, izafe] 17, 37, 38, 40, 42–44, 585, 631, 685, 687, 689, 690, 692, 694, 699,
114, 115, 117–121, 133, 137, 139–142, 145– 701, 773, 777, 822, 825, 835, 867, 900–902,
147, 149, 150, 152, 155, 156, 214, 239, 240, 904–907, 909, 918, 932
273, 275, 284, 291, 293, 294, 302, 303, 327, gender, grammatical 114, 273, 451, 533, 543,
371, 382, 394, 428, 429, 443, 451, 453, 454, 544, 689, 692, 867, 900, 901, 904
465, 466, 477, 503, 504, 523, 531, 533, 544– gender, biological 114
546, 565, 572, 584, 589, 594–596, 601, 602, genitive (case) 17, 36–38, 44, 51, 68, 73, 74,
611, 612, 617, 621, 624, 689, 691, 697–702, 83, 92, 186, 187, 189, 212, 214, 239, 253,
806, 818, 819 263–265, 327, 364, 373, 382, 394, 416–419,
ezâfe construction 44, 115, 117–120, 133, 139, 426, 443, 500, 503, 504, 524, 544, 689, 691,
239, 275, 371, 418, 428, 504, 697, 699, 701 694–696, 699, 710, 755, 764, 779, 818, 835,
838, 850, 851, 859, 868, 902, 903, 908, 931
F glottalization 8–10, 825, 826, 833
feminine (gender) 16, 114, 115, 119, 141, 155, goal 5, 23–26, 115, 122, 130, 131, 241, 280, 283,
167, 172, 174, 179, 180, 186, 213, 214, 248, 340, 370, 394, 467, 468, 518, 520, 595, 613,
249, 253, 263, 321, 324, 325, 500–502, 544, 636, 645, 696, 781, 806–808, 847, 853
545, 548, 551, 552, 554, 555, 557, 565, 624, goal of motion verb 23, 25, 130, 241, 280,
678, 684, 685, 689, 702, 818, 819, 899–904, 467, 468, 595, 613
918, 931
finite 18, 50, 125, 140, 168, 171, 241, 250, 251, H
255, 277, 328, 369, 371, 374–376, 396, 431– habitual 40, 41, 171, 218, 221, 223, 224, 239,
433, 435, 438, 439, 456, 457, 459, 548, 549, 252, 278, 292, 333, 340, 365, 396, 457, 515,
551, 552, 607, 615, 617, 627, 635, 685, 725, 604, 608, 633, 636, 729, 732, 888, 912
827, 828, 833, 835–837, 840, 843, 845, 850, head 17, 36–38, 44, 118–120, 172–174, 179,
851, 854, 856, 871–873, 876, 881, 911, 915, 180, 212–214, 226, 227, 239, 253, 254, 258,
917, 922–924 275, 284, 326, 327, 342, 343, 368, 371, 372,
focal present 365, 366, 396, 421–423 374, 426, 428, 432–434, 465, 466, 503, 504,
focus 3, 4, 20, 151, 164, 186, 209, 219, 237, 243, 522–525, 545, 546, 555, 611, 612, 617, 691,
338, 428, 446, 457, 516, 533, 600, 663, 687, 697–702, 708, 796, 802, 828, 834, 852, 854,
759, 839, 889, 897, 921 875, 882, 900, 904, 922–924
fronting (of vowels) 13, 15, 34, 61, 81, 90, 137, head-marking 691, 697
204, 238, 290, 318, 410, 413, 614, 756, 764,
777, 786–791, 921 I
fronting (of stops and affricates) 271, 359, 360, imperative 39, 125, 127, 155, 169, 176, 183, 186,
407 215, 216, 218, 223, 224, 239, 240, 251, 255,
future 4, 26, 41, 50, 99, 109, 125, 126, 128, 129, 277, 300, 302, 328–330, 337, 340, 366, 369,
147, 149, 155, 170, 183, 186, 218, 221, 223, 376, 382, 425, 435–439, 443, 455–459, 462,
224, 239, 252, 276, 278, 292, 295, 302, 332, 463, 477, 505, 506, 513, 514, 516, 521, 526,
333, 340, 367, 423, 425, 435, 447, 511, 515, 547, 549–551, 565, 604–606, 621, 633–635,
656, 710, 715, 720–722, 818, 840, 845, 859, Iranicization (of vowel inventory) 363, 399, 411,
871, 874, 875, 888, 906, 917, 921, 931 428, 433
imperfective 92, 126, 144, 148, 155, 186, 241, irrealis verb form 40, 127, 218, 223, 224, 226,
279, 382, 457, 459, 477, 513–516, 518, 520, 227, 276, 291, 293, 332, 333, 341, 343, 344,
521, 526, 549, 851, 871, 912, 914, 915, 931 390, 391, 439, 509, 511–513, 518, 521, 523,
impossibility (Turkic verbal suffix) 366, 376, 525, 526, 604, 608, 611, 621, 633–635, 648–
425 650, 656, 729–731, 743–745, 749, 751, 818,
inalienable possession 212, 214, 326, 631, 635 930, see also realis
inchoative 767, 913, 916 isophone (Intra-Turkic, areal) 359, 360, 416
incorporation 7, 125, 144, 145, 156
indefinite 21, 68, 73, 115, 116, 119, 139, 141, J
142, 152, 155, 174, 179, 182, 186, 253, 254, Jews 162, 183, 194, 195, 244, 247, 252, 253, 264,
274, 275, 302, 382, 390, 418, 419, 452, 477, 305, 306, 309–311, 327, 351, 352, 482, 484,
502, 504, 519, 522, 525, 545, 565, 588, 598, 485, 494, 572
599, 612, 613, 621, 631, 642, 703, 704, 708,
709, 723, 770, 776, 777, 818, 862, 868, 888, K
890, 903, 905 ki (complementizer, particle, of Iranian origin)
indefinite pronoun 418, 419, 862 148–150, 226, 368, 374, 375, 418, 433–435,
indefinite marker 119, 174, 182, 502, 709 617, 724, 740, 744, 747, 795–805, 809–811,
indefinite suffix 152, 275, 504, 519 924, 925
indefiniteness 114–116, 137, 152, 273–275, 389, kinship (terms) 212, 214, 325, 326, 418, 689
390, 452, 478, 501, 502, 543–546, 584, 585,
588, 903 L
index 18, 45, 163, 187, 241, 276, 286, 302, 369 labialization 288, 362, 411, 412, 493
indexing 18, 125, 283–286, 552, 794, 917 language academy (National Iranian) 577,
indirect object 23, 221, 333, 340, 370, 392, 464, 579
467, 547, 553, 591, 592, 596, 597, 613, 693, language change 4, 480, 933
696, 793, 853, 854, 902, 921 language choice 303, 621, 623
infinitive 40, 50, 51, 74, 111, 112, 124, 144, 215, language contact 3–8, 9, 11–13, 14, 16, 18, 20–
216, 218, 220, 273, 276, 284, 294, 328–330, 22, 25, 26, 31, 32, 42, 108, 121, 150, 181–184,
336, 337, 382, 506, 512, 548, 565, 606, 610, 196–198, 214, 226, 228, 237, 240, 259, 260,
621, 688, 714, 718, 725, 733, 734, 737, 752, 298, 300, 312, 344, 345, 358, 361–363, 366,
754, 756, 805, 818, 828, 873, 888, 911, 912, 368, 370, 377, 385, 386, 388, 389, 395, 407,
917, 926–928, 930–933, 950 410–413, 423, 426, 427, 436, 442, 485, 492,
inflection 7, 15, 42, 44, 166–168, 210, 212, 509, 524, 526, 534, 639, 663, 667, 679, 806,
215, 216, 240, 249, 250, 275, 324, 328, 335, 825–827, 884–886, 895–897, 903, 920, 921,
390, 417, 497, 500–502, 504, 505, 509, 514, 923
515, 526, 633–635, 642, 648, 869, 870, 897, language documentation 295, 860
900–902, 904, 917 language endangerment → endangered language
instrumental 68, 122, 364, 372, 382, 396, 418, language loss 860, 862
419, 443, 594, 596, 636, 835, 865, 919 language shift 447, 624
interrogative 116, 301, 462, 517, 630, 631, 689, laryngeal 9, 33, 198, 200, 201, 238, 312, 314,
694, 838, 839, 909–911, 921 319, 387, 488, 489, 676
intonation 222, 230, 338, 346, 367, 426, 433, 450, laryngeal settings 200
451, 492, 516, 521, 527, 647, 687, 796 lenition 12, 138, 238, 271, 333, 386, 540, 541,
intransitivity 125, 131, 132, 144, 216, 217, 220, 630
282, 286, 293, 301, 329, 330, 331, 332, 335, lexicon 10, 15, 27, 48, 100, 150, 151, 181, 184,
336, 339, 394, 395, 505, 506, 507, 508, 510, 196, 199, 202, 228, 259, 265, 271, 298, 302,
511, 526, 549, 550, 552, 632, 687, 689, 690, 304, 305, 311, 345, 351, 358, 359, 376, 377,
711, 727, 728, 729, 739, 740, 741, 742, 743, 396, 399, 407, 409, 414, 439, 446, 457, 478,
765, 766, 768, 769, 773, 778, 781, 818, 841, 479, 485, 526, 578, 588, 627, 628, 657, 667,
847, 849, 851, 852, 853, 872, 874 672, 676, 820, 847, 872, 894, 951
light verb 22, 125, 130, 145, 148, 229, 273, 293, Mughal (Empire) 574–576, 621
345, 377, 464, 465, 509, 514, 520, 548, 602, multilingualism 3, 21, 196, 237, 354, 401, 405,
632, 635, 737, 755, 756, 765, 768, 769, 818, 406, 409, 444, 538, 933, see also bilingualism
872, see also complex predicate multiple relativization 882, 883
light verb construction 125, 130, 145, 464, multi-purpose subordinator 857, 859, 860
465, 514, 548, 602, 765, 768, 769, 818
lingua franca 573, 575, 623, 625 N
linguistic area 4, 27, 28, 822, 840, see also nasal 66, 72, 164, 247, 271, 359, 364, 409, 415,
Sprachbund, convergence zone 418, 448, 540–542, 581, 583, 590, 629, 719,
loanwords 10, 92, 100, 114, 138, 151, 181, 182, 833, 865, 898, 899
196, 228, 238, 259, 260, 290, 314, 325, 345, nasalization 449
361, 362, 386, 387, 389, 396, 409, 490, 492, national language 570, 574, 577, 579
495, 500, 502, 526, 541, 581, 583–585, 587, negation 7, 92, 125–127, 129, 138, 142, 144, 145,
588, 668, 686 148, 155, 175, 186, 224, 255, 263, 276–278,
locative 40, 43, 50, 79, 336, 364, 372, 377, 382, 281, 302, 366, 382, 425, 443, 462, 463, 526,
390, 391, 415, 418, 429, 430, 443, 517, 636, 542, 549, 551, 565, 584, 585, 604, 621, 627,
696, 705–707, 716, 718, 733–735, 737, 760, 630, 636, 638, 650, 710, 715, 834, 837, 843,
805–808, 818, 840, 843, 844, 846, 849, 919, 850, 859, 878, 885, 889, 897, 917, 919, 931,
929 951
neuter (gender) 900–904, 906–908, 918, 931
M nominalization 371, 432, 922, 923, 927, 929, 931
macro-areas 4, 5, 28 nominative (case) 15, 68, 73, 92, 217, 218, 233,
majhul (long mid) vowels 447 240, 323, 330, 349, 389, 394, 419, 501, 531,
masculine (gender) 16, 92, 114–116, 119, 137, 692, 826, 868, 870, 872, 876, 888, 902, 903,
141, 155, 167, 180, 186, 249, 263, 291, 302, 919, 926, 931, 948
321, 324, 325, 501, 544, 545, 548, 550, 552, noun class 502, 589, 903
555, 557, 565, 587, 689, 701, 818, 819, 826, noun phrase (NP) 17, 22, 36, 37, 116–118, 120,
900–904, 918, 931, 947 121, 139, 147, 239, 273, 275, 280, 281, 286,
metathesis 143, 364, 412, 434, 486 292, 394, 428, 452, 453, 465, 467, 468, 507,
middle voice 847, 859, 860 518, 524, 545–547, 551, 553, 566, 586, 601,
migration 65, 150, 160, 194, 195, 354, 398, 482, 611, 612, 614, 631, 639, 642, 648, 690, 691,
863, 893, 895, 933 763, 773–777, 805, 806, 818, 832, 834, 849,
minority language 3, 163, 303, 357, 401, 406 578, 854, 856, 859, 867, 868, 872, 875–880, 882,
579, 825 883, 897, 900, 902–904, 907, 908, 918, 929
mirative 457, 458, 633–635, 653, 656, 846 number 15, 116, 118, 120, 125, 139, 141, 207,
mixed language 625 239, 249, 254, 273, 274, 276, 291, 293, 294,
modality 149, 155, 276, 375, 423, 435, 436, 567, 321, 330, 335, 337, 389, 451, 505, 510, 543,
604, 633, 636, 861, 890, 897, 917, 919, 930, 544, 546, 547, 552, 557, 584, 585, 587, 594,
933 603, 623, 631, 632, 634, 636, 637, 642, 694,
mood 127, 138, 218, 420, 425, 436, 438, 439, 743, 777, 859, 867, 871, 872, 900, 901, 904,
456, 514, 549, 632, 637, 642, 648, 840, 871, 906, 909, 911, 924
872, 917, 920 number agreement 116, 291, 294, 584, 603
morphology 7, 15–17, 29, 35, 51, 68, 69, 72, 74,
77–80, 83–86, 88, 90, 91, 99, 108, 114, 124, O
143, 145, 150, 166, 168, 187, 196, 207, 215, object 5, 18, 21–24, 26, 43, 44, 115, 125, 131,
228, 235, 249, 250, 264, 267, 273, 276, 291, 132, 168, 169, 209, 221, 223, 240, 241, 250,
300, 303, 321, 328, 345, 358, 359, 364, 365, 254, 264, 280, 281, 283, 286, 294, 301, 331,
368, 377, 388, 396, 407, 417, 418, 420, 447, 333–335, 340, 345, 368, 370, 393–395, 419,
451, 452, 455, 465, 496, 504, 505, 526, 533, 426, 435, 451, 452, 455, 462, 464, 465, 467,
543, 585, 588, 602, 632, 634, 656, 657, 684– 477, 500, 507–511, 514, 515, 518–520, 524,
687, 690–692, 694, 820, 840, 846, 860, 861, 544, 547, 548, 550, 553, 566, 590–593, 595,
867, 871, 873, 876, 885, 897, 911, 916, 932 597–599, 602, 613, 614, 621, 632, 635, 636,
638, 650, 689, 691–693, 696, 733, 770, 786– 293, 547, 552, 553, 603, 606, 608, 621, 727,
789, 791, 793, 805, 826, 827, 841, 844, 847, 773, 827, 881, 912, 914, 926, 927
850, 851, 867, 876, 877, 902, 903, 905, 906, patient 258, 688, 689, 696, 721, 727, 773–782,
921, see also direct object, indirect object 784, 785, 791
object agreement 520, 727 perfect 41, 42, 44, 92, 140, 145, 148, 149, 155,
oblique 15, 17, 37, 44, 114–117, 119, 120, 131, 167–169, 171, 175, 183, 216, 217, 220, 221,
132, 135, 137, 147, 148, 155, 233, 239, 291, 240, 250–252, 255, 279, 302, 334–336, 341,
293, 301, 302, 323, 349, 370, 382, 389, 393, 351, 366, 367, 369, 392, 394, 395, 424, 456,
394, 426, 427, 501, 503, 505, 507, 515, 531, 458–460, 477, 511, 515, 519, 526, 532, 552,
533, 544, 547, 553, 554, 565, 586, 650, 689– 557, 560, 565, 585, 606–609, 611, 633–635,
691, 694–702, 727, 755, 756, 763, 764, 770, 645, 648, 650, 656, 663, 685, 710, 738–743,
772–783, 789, 791, 793, 794, 806, 818, 819, 747, 749–751, 763, 790, 791, 819, 826, 843,
866, 868, 870, 875, 876, 888, 948 845, 912, 931
oblique case 17, 37, 44, 114–116, 119, 120, perfective 6, 41, 42, 44, 186, 216, 217, 220, 240,
131, 132, 135, 137, 147, 239, 291, 293, 301, 241, 263, 331, 333, 335, 336, 382, 394, 395,
370, 394, 426, 533, 544, 547, 553, 689–691, 443, 455, 513–515, 520, 526, 552, 851, 859,
696–700, 755, 756, 763, 764, 770, 772, 774, 871, 912–914, 917
778, 780–782, 791, 793, 794, 795, 866, 868 personal pronoun → pronoun, personal
official language 3, 162, 196, 237, 244, 357, 398, person marking 42, 125, 126, 129, 145–147, 241,
405, 485, 538, 570, 572, 574, 575, 579 277–279, 299, 304, 330, 335, 455, 457–461,
onset, syllable 112, 272, 318, 319, 341, 361, 448, 466, 840, 841, 843
449, 833, 899 Persophonie 622
optative 176, 255, 376, 382, 383, 436–439, 443, pharyngeal 10, 11, 32, 33, 113, 114, 139, 156,
608, 689, 721, 723–725, 730, 731, 753, 819, 198–201, 271, 288, 312–316, 345, 361, 363,
845, 859, 917, 920, 931 386, 387, 410, 485, 486, 488–490, 540, 541,
orthography 109–111, 121, 152, 288, 289, 311, 583, 628, 629
365, 447, 478, 539, 542, 627, 861, 863, 894, pharyngealization 9, 11, 33, 113, 114, 150, 201,
897 202, 315, 316, 386, 387, 486, 490, 826
Ottoman (Empire) 1, 383, 574 phoneme 9, 11, 32, 33, 109, 110, 113, 139,
Ottoman culture 259 164, 165, 182, 197, 198, 235, 247, 248, 271,
Ottoman government 309, 355 272, 287, 312, 316, 316–318, 360–362, 387,
OV (word order, including SOV) 5, 21–24, 130, 408–411, 539, 540–542, 581–583, 627–630,
241, 273, 281, 339, 428, 467, 614, 627, 776– 676–679, 809, 833, 898, 899
779, 807, 827, 852, 876, 921, 933 phonemicized allophones 197, 312, 539, see
also allophone
P phrasal verbs 182, 229, 345, 509, 602
palatalization 12, 33, 200, 238, 271, 314, 388, pluperfect 155, 279, 460, 552, 607, 618, 843, 845
396, 413, 417, 676, 717, 866 plural 11, 15, 16, 35, 73, 74, 83, 85, 92, 114,
participle 40–42, 50, 69, 78, 86, 92, 137, 155, 116, 117, 119, 131, 139, 141, 142, 147, 155,
168–171, 183, 186, 215–217, 220, 240, 251, 166–169, 186, 213, 214, 239, 249, 263, 291,
252, 263, 278, 279, 328–330, 332, 334, 335, 299, 302, 321, 323–326, 359, 361, 364–366,
341, 366, 367, 372–374, 382, 392, 429, 432, 369, 382, 414, 416, 417, 420–423, 429, 443,
435, 458, 460, 477, 506, 510, 511, 513, 517, 451–453, 455, 477, 486, 497, 498, 501, 502,
519, 548, 552, 557, 565, 602, 606–610, 621, 515, 542, 544–550, 552, 557, 565, 584–589,
648, 650, 685, 718, 738, 741, 743, 760, 761, 603, 605, 621, 631, 633, 637, 642, 645, 656,
767, 806, 819, 850, 851, 859, 866, 873, 888 688–692, 700, 703, 721, 774, 781, 818, 819,
passive 169, 219, 220, 251, 255, 263, 273, 328, 834, 842, 843, 846–848, 854, 859, 867, 868,
332, 366, 405, 406, 513, 548, 603, 608–610, 872, 875, 886–888, 900–902, 904, 907, 911,
632, 689, 767–769, 819, 832, 873, 916, 926, 931
931 politeness, system of 578, 590, 603
past (tense) 7, 115, 125–127, 131, 135, 146–148, possessive 11, 14, 18, 40, 85, 87, 151, 211, 212,
171, 241, 252, 276, 281, 283, 284, 286, 290, 239, 273, 287, 302, 323, 324, 359, 364, 367,
369, 372, 373, 377, 382, 393, 416–421, 426– Q

428, 431, 432, 443, 512, 548, 551, 584, 591, quantifier 630, 708
631, 637, 641–643, 645, 679, 688, 689, 691, question 50, 128, 163, 166, 186, 187, 259, 299,
692, 699, 702, 709, 710, 763, 764, 770, 778, 367, 425, 426, 588, 599, 614, 631, 650, 656,
783–786, 789, 806, 819, 827, 834, 835, 849, 759, 856, 884, 885, 906, 910, 911, 915, 924,
859, 875, 878, 885, 888, 907, 908, 927, 931 931
possessor 85, 115, 117–119, 132, 133, 148, 281,
282, 284, 286, 371, 392, 393, 418, 427, 547, R
548, 593, 642, 691, 696–700, 763, 764, 784, realis verb form 218, 227, 344, 391, 512, 513,
843, 847–849, 867, 907, 908 515, 516, 525, 630, 633–635, 652, 653, 656,
post-constituent negation 627, 638 see also irrealis
potential 278, 365, 366, 370, 373, 382, 425, 432, reciprocal 117, 377, 382, 859, 873
439, 608, 745, 826, 842, 843, 848, 849, 859, reconstruction (historical) 21, 48, 207, 209, 486,
885 see also impossibility (Turkic verbal 933
suffix) reflexive 117, 119, 131, 148, 151, 155, 263, 300,
pragmatics 478, 479, 622 302, 367, 419, 420, 455, 546, 547, 565, 593,
predicative 18, 44, 137, 139, 393, 759, 763, 764, 594, 621, 631, 656, 709, 710, 873
821, 851 relative pronoun 173, 179, 181, 631, 648, 882,
preposed relative clauses 806 883, 908, 922, 923
preposition 39, 68, 112, 122, 123, 129, 131, 134, relativizer 118, 156, 186, 303, 909, 922–924,
135, 139, 142, 143, 253, 254, 257, 280, 283, 931
284, 330, 336, 340, 370, 393, 428, 454, 465, resumptive 524, 553, 924
466, 468, 505, 515, 520, 525, 543, 545, 553, resumptive pronoun 524, 924
589, 595, 596, 601, 630, 636, 641, 642, 705, re-syllabification 166
706, 718, 794, 805, 875, 919, 923, 927, 930 rhotacism 364, 412
present (tense) 51, 96, 115, 125, 140, 143, 146, root 47, 93, 113, 287, 328, 361, 377, 412, 414,
147, 170, 241, 252, 277, 278, 281, 293, 298, 455, 482, 634, 643, 688, 711–714, 718, 720,
301, 394, 422, 544, 547, 549, 552, 553, 604– 722, 733, 742, 743, 754, 755, 758, 765, 789,
606, 621, 687, 688, 716–720, 729, 734, 737, 840, 844, 846, 849, 850, 871, 872, 885, 917,
826, 827, 840, 912, 920, 925 923
preverb 144, 464, 548, 591, 604, 688, 712–716, rounding (vowels) 13, 14, 15, 111, 138, 204, 238,
728–731, 765, 819, 840, 844–846, 850, 859, 272, 289, 300, 362, 363, 396, 410, 411, 417,
871, 872, 888 420, 487, 582, 678, 947, 949
pro-drop 590, 613, 876
progressive 40, 41, 51, 74, 126, 140, 145, 170, S
218, 220, 221, 239, 252, 278, 336, 337, 341, Sassanian (Empire) 243, 625, 666
364, 365, 391, 396, 414, 415, 421, 457, 460, semantics 7, 286, 293, 304, 457, 566, 631, 632,
461, 511, 512, 515, 518, 603, 609, 633, 688, 639, 657, 796, 826, 829, 871
718, 729, 732–738, 790, 819, 915, 916 sociolinguistics 158, 162, 163, 188, 196, 246,
progressive present 170, 252, 396, 421, 511 311, 357, 398, 446, 478, 485, 537, 569, 577,
proleptic 880, 881 624, 821, 825, 892, 893, 933
pronoun, personal 117, 119, 167, 172, 175, 209, sound shift 50, 61, 248
250, 273, 274, 280, 291, 367, 368, 408, 419, sound shift r > ġ 248
420, 426, 427, 498, 515, 526, 546, 589, 631, SOV (word order) → OV
869, 905, 935, 948, see also demonstrative, specifier 368, 477
indefinite, interrogative, relative, resumptive Sprachbund 27, 891, see also linguistic area
pronoun, clitic 18, 22, 240, 273, 274, 281– stranding 123
283, 286, 330, 368, 369, 464, 526, 541, 547, stress 15, 118, 206, 207, 222, 230, 249, 300, 320,
552, 553, 556, 584, 591, 593, 907, 911, see 321, 338, 346, 450–452, 477, 492–495, 498,
also copied Iranian enclitic pronouns 516, 527, 539–544, 549, 584, 585, 605, 607,
622, 630, 642, 684–686, 688, 715, 720, 738–
740, 742, 743, 833, 835, 866, 867, 889, 911
subject 51, 74, 111, 113, 115, 117, 119, 120, 125, 150–152, 172, 176, 187, 222, 225, 236, 241,
128, 131, 132, 137, 141, 146–148, 151, 166, 252, 256, 260, 282, 284, 298, 301, 326, 338,
172, 179, 182, 217, 218, 241, 250, 281, 286, 341, 345, 358, 367, 371, 391, 392, 394, 396,
292, 293, 330–332, 334, 335, 338, 342, 368, 399, 429, 430, 433, 444, 447, 464, 465, 516,
374, 393–395, 414, 417, 464, 466, 477, 481, 521, 526, 580, 602, 611, 615, 621, 622, 627,
498, 505, 507, 510, 511, 515, 520, 521, 547, 636, 638, 642, 657, 827, 860, 890, 895, 897,
549, 550, 552, 587, 590–592, 601, 613, 614, 916–918, 931
631, 632, 636, 645, 649, 650, 678, 695, 696,
752, 755, 757, 779, 781, 783, 790, 826, 827, T
841, 843, 847–849, 851–853, 863, 876–878, topic 275, 282, 452, 466, 543, 659, 665, 744, 836,
902, 903, 905, 911, 916, 922–924, 926 837, 888, 900
subject marking 149, 464, 511 topicalization 599
subject, non-canonical 132, 150, 286, 287, transitional zone 5, 355, 400, 807
293, 843, 848, 853 transitive 44, 115, 125, 131, 132, 135, 216–218,
subject relativization 286, 828 220, 221, 241, 276, 281–283, 285, 286, 293,
subjunctive 40, 41, 50, 51, 69, 70, 74, 112, 125– 302, 329–332, 334, 336, 339, 394, 395,
129, 155, 239, 277–279, 284, 291, 292, 302, 477, 505–508, 510, 511, 544, 548–550, 552,
376, 390, 436–439, 456–458, 462, 463, 477, 553, 595, 599, 632, 687, 689, 690, 711, 721,
542, 549, 551, 552, 565, 604, 606, 610, 611, 727–731, 739–743, 763, 765, 768–770, 778,
615, 617, 619–621, 678, 685, 688, 710, 718, 791, 826, 841, 842, 847–851, 853, 859, 872,
720–725, 727, 730–732, 741–753, 757, 758, 874, 906
762, 803, 819, 845, 859, 871, 872, 874, 875, transitivity 126, 220, 329, 331, 552, 826,
888, 917, 920 917
subordination 25, 26, 241, 429, 465, 468, 607, Turkoman (of Iraq) 244, 246, 259, 311, 345
620, 630, 636, 647, 648, 795, 796, 805, 807,
880, 922, 923, 926 U
suffix 7, 8, 11, 14–17, 35, 44, 87, 114, 116, 118, unaccusative 144, 220, 508, 510, 511, 526, 767,
124, 137, 139, 147, 148, 152, 166–168, 174, 769, 770
179–181, 183, 210, 212–214, 226, 227, 233, unergative 507, 508
239, 240, 248, 249, 251, 252, 274, 275, 279,
283, 284, 286, 291, 292, 300, 322, 326, 327, V
330, 333, 336, 349, 364, 366, 369, 372, 373, valency 765, 840, 843, 846, 847, 849, 860
376, 377, 389, 390, 393, 414, 416, 418–420, valency-changing 846
422, 423, 425, 427, 428, 436, 451–453, 456, verb 6, 7, 13, 18–26, 40, 41, 50, 51, 70, 74–76,
458–460, 495, 499, 501, 502, 504, 513, 515, 80, 84, 86, 91, 112, 115, 116, 122, 125–132,
519, 520, 524, 526, 531, 543, 550–554, 134–136, 139, 144, 145, 148, 149, 159, 166,
584–586, 588, 589, 608, 615, 624, 630–633, 170–172, 175, 176, 181, 182, 223, 226, 227,
643, 644, 656, 687, 690–692, 699, 716, 719, 229, 234, 240, 241, 244, 249, 252, 264, 271,
729–731, 733, 765, 826, 827, 834, 836, 837, 273, 276–281, 283, 286, 292–294, 297–299,
840–842, 844, 845, 847, 849–851, 859, 868, 328, 331–335, 337, 340, 341, 345, 363, 365,
872, 874, 886, 887, 900–902, 906, 907, 911, 366, 368–371, 373–377, 390–392, 394, 395,
917, 918, 929 414, 420, 422, 425–433, 435–439, 446, 450,
superlative (adjective) 370, 419, 426, 584, 930 455–470, 477, 498, 505–512, 514–516, 518–
SVO (word order) → VO 520, 525, 526, 548–553, 560, 563, 587, 590,
syllable 48, 90, 108, 111–113, 143, 165, 202, 203, 592, 595, 599, 601–603, 605–607, 609, 610,
205, 249, 272, 317–321, 361, 363, 365, 367, 613, 615, 617, 618, 627, 630–638, 642, 644,
387, 397, 410, 412, 426, 433, 449, 450, 454, 645, 647–653, 656, 684, 687–690, 710–714,
477, 488, 489, 492–495, 539, 542, 543, 549, 716, 718, 720, 723, 725, 727, 729–731, 735–
573, 583, 584, 629, 630, 686, 720, 733, 796, 738, 740, 742–748, 750–752, 754–757, 759,
833, 866, 867, 899–901 762, 763, 765–769, 773–778, 786, 787, 789,
syntax 7, 16, 22, 24, 26, 27, 38, 44, 71, 76, 794, 803, 805, 818, 820, 821, 826–828, 832–
78, 80, 84, 87, 89, 91, 120, 130, 131, 148, 837, 839–841, 843–852, 854–856, 861, 866,
871–874, 876, 878, 881, 882, 885–887, 890, 326, 328–330, 360–363, 365, 386–388, 396,
905, 911–913, 915–917, 920–924, 947, 949 408–414, 417, 420–422, 433, 447–452, 454,
verbal modifier 170, 183, 252, 257, 260 455, 459, 465, 468, 477, 485, 487, 488, 491–
verb phrase (VP) 139, 281–283, 302, 468, 469, 495, 498, 500, 518, 539–545, 547, 549, 550,
602, 627, 635, 636, 645, 787, 876 557, 580–584, 591, 605, 628–630, 633, 637,
verb stem 7, 112, 114, 278, 279, 363, 365, 414, 643, 656, 676–683, 687, 690, 692, 708, 715,
422, 462, 505, 506, 548, 633, 637, 648, 690, 717–721, 733, 737, 739, 757, 832, 833, 839,
710, 711, 716, 740, 911 865–867, 869, 898–900, 902, 904, 907, 910,
VO (word order, including SVO) 5, 21–23, 28, 911, 947, 949
244, 339, 807, 827, 921, 933 vowel harmony 8, 363, 396, 715, 910, 911
voice (verbal) 220, 632, 637, 767, 847, 859, 860,
899, 911, 916, see also middle voice W
voice, voiced (phonation) 10–12, 50, 61, 66, 77, Wackernagel rule 787
79, 81, 88, 90, 96, 98, 103, 113, 137, 164, 165, wh-word 910, 911
200, 201, 238, 247, 271, 312–315, 360, 363, word order 5, 21–24, 27–29, 130, 134, 156, 176,
386, 387, 407–410, 412, 414, 448, 449, 488, 187, 223, 241, 264, 273, 280, 370, 371, 428,
489, 540, 541, 543, 564, 581, 582, 627, 628, 518, 612–614, 622, 688, 777, 792, 793, 795,
676, 677, 833, 866, 898, 899, 947, 949 796, 802, 806–808, 827, 852, 861, 876, 878,
voiceless (phonation) 10, 50, 79, 108, 113, 150, 890, 897, 921
164, 238, 247, 271, 288, 407, 448, 449, 488,
540, 541, 581, 583, 627, 676, 677, 825, 833, Y
898–900 Yezidi 140, 237, 246
volitive/volitional 41, 218, 926
vowel 12–15, 33, 34, 39, 41, 61, 74, 81, 83, 85, Z
90, 109–112, 115, 122, 126, 129, 137–139, Zagros-d 413
143, 150, 165–168, 182, 197, 199, 200,
202–206, 218, 219, 238, 248–250, 1, 272, ʔ
277, 279, 288–290, 300, 312, 314, 316–320, ʔimāla 165, 248

(The World of Linguistics 6) Geoffrey Haig, Geoffrey Khan - The Languages and Linguistics of Western Asia - An Areal Perspective-De Gruyter (2018)

Uploaded by

Copyright:

Available Formats

(The World of Linguistics 6) Geoffrey Haig, Geoffrey Khan - The Languages and Linguistics of Western Asia - An Areal Perspective-De Gruyter (2018)

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

(The World of Linguistics 6) Geoffrey Haig, Geoffrey Khan - The Languages and Linguistics of Western Asia - An Areal Perspective-De Gruyter (2018)

Uploaded by

Copyright:

Available Formats

What is the title of the book discussed?

What is the title of the book discussed?

What topics are covered in the book?

What topics are covered in the book?

The Languages and Linguistics of Western Asia

Library of Congress Cataloging-in-Publication Data

Bibliographic information published by the Deutsche Nationalbibliothek

© 2019 Walter de Gruyter GmbH & Co. KG, Berlin/Boston

Geoffrey Haig and Geoffrey Khan (editors) November 2018

1. Introduction · Geoffrey Haig and Geoffrey Khan . . . . . . . . . . . 1

2. Eastern Anatolia and northwestern Iran . . . . . . . . . . . . . . . . . . . 31

3. Northern Iraq . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 237

4. Western Iran . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 385

5. The Caspian region and south Azerbaijan: Caspian and

6. Caucasian rim and southern Black Sea coast . . . . . . . . . . . . . . . 825

6.1. Caucasian rim and southern Black Sea coast: overview ·

7. Appendix: Basic lexical items in selected languages of

Index of language names . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 953

Index of place names . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 961

Index of subjects . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 969

1. Aims and scope of the volume

This volume contains a series of descriptions of around 20 languages (or language

to indicate the approximate centre of the respective language variety. Language

focus on a synchronic description of the respective language or language group,

2. Western Asia from an areal typological perspective

On this coarse-grained view, our “Western Asia” would be assigned to “Eurasia”,

3. Areal patterns in phonology and morphosyntax across Western Asia

This section discusses a selection of features from phonology and morphosyntax

sample we draw on is by no means comprehensive, it is nevertheless possible to

3.1. Language contact: general considerations

due to influence from an Iranian contact language, e. g. precursors of Kurdish, or

found in Nakh-Daghestanian languages (e. g. Archi, Lak), Northwestern Cauca-

of the strengthening of laryngeals to preserve morphological distinctions, e. g.

following a vowel or sonorant undergoes lenition, know as “Zagros d”, result-

to have a similar function of making clear distinctions in a paradigm, e. g. ḥaft

3.3.1. The marking of definiteness

3.3.2. Commonalities in pronominal and case morphology

A case of the convergence of the syntactic patterns of demonstratives is as

3.3.3. Clitic pronouns indexing verbal objects

3.4. Constituent order

3.4.1. Position of copular elements

(5) Gilaki, Lāhijāni (Stilo, chapter 5, ex. 01)

(12) Romeyka (Schreiber, chapter 6.4, ex. 8)

3.4.2. The relative ordering of direct object and verb

(15) čāre aḇd-etton … qam tā jisər

(20) […] xa-dána tórta šăqə̀l-Ø-wa-la.|

3.5. Other issues in contact-induced syntactic change

3.6. Provisional remarks on ease of syntactic borrowing in the Western Asian

heritance: Problems in comparative linguistics, 195–224. Oxford: Oxford University

1.1. Loss of Semitic interdental fricatives

1.2. Unaspirated stops

1.3. Pharyngeals and pharyngealized consonants

1.8. Vowel fronting and raising

2.2. Noun phrase

2.2.1. Genitive markers

(4) Christian Urmi

Kurdish: yê spî

2.4. Attention-drawing particle d-

2.5. Verbal constructions

Geoffrey Haig and Geoffrey Khan (editors) November 2018