Romanization System For Khmer
Romanization System For Khmer
This system is based on the modified 1959 Service Géographique Khmère (SGK)
system. There are a small number of distinctions between the systems, which are listed at
the end of this system.
While most Cambodian toponyms consist of Khmer lexical items, word division
is not generally indicated, and Khmer diacritical marks are often omitted. Reference
sources should be consulted in cases of uncertainty.
CONSONANT CHARACTERS
and SUBSCRIPT CONSONANT FORMS
When a Khmer graphic cluster contains two consecutive consonants, the second is
generally written below the base character in a special subscript form, sometimes called a
“foot”. Most base consonant characters have a corresponding subscript form. The list
below shows the consonant characters and, in the right-hand columns, the form of each
consonant when it appears as a “foot”.
2 ខ ឃ ◌ខ ◌ឃ kh
3 ង ◌ង ng
4 ច ជ ◌ច ◌ជ ch
5 ឆ ឈ ◌ឆ ◌ឈ chh
6 ញ ◌ញ nh
7 ដ ឌ ◌្ដ ◌្ឌ d
8 ឋ ឍ ◌្ឋ ◌ th
9 ថ ធ ◌ថ ◌ធ th
10 ណ ន ◌ ◌ន n
11 ត ទ ◌ ◌ទ t
1
Consonant Characters Consonant “Feet” Romanization
â series ô series â series ô series
12 ប ◌ប b, p3
13 ព ◌ព p
14 ផ ភ ◌ផ ◌ភ ph
15 ម ◌ម m
16 យ ◌យ y
17 រ ◌្រ r
18 ឡ ល ◌្ល l
19 វ ◌្វ v
20 ស ◌ s
21 ហ ◌្ហ h
22 អ ◌្អ ’4
subscript forms ◌្ដ and ◌ធ usually represent the characters ដ (d) and ធ (th), respectively,
rather than ត (t) and ឋ (th): ក្ដី → kdei, កនធយ → kânthéay, but ក ន្ដប់ → kântráb.
[This last example employs 2 subscript characters on a single base consonant
character. In such cases, the base consonant character is romanized first, followed by the
character below the base, and then the character to the side.] Consultation of a reference
source may sometimes be necessary to establish which consonant character is represented
by the subscript form.
The subscript form usually determines the series of the vocalic nucleus that
follows: e.g. ខពង → khpông and ល្អ → l’â. The base consonant character determines the
vocalic series only if the subscript form is a “weak” consonant (either a nasal consonant
character (ង, ញ, ណ, ន, ម → ng, nh, n, n, m) or one of the following consonant
characters យ, រ, ល, វ → y, r, l, v), and the base consonant character is not one of these
2
VOWEL CHARACTERS
The Roman-script vowel letters in the â series columns follow a romanized
syllable-initial consonant letter or consonant letter plus subscript form (represented by ◌)
of that series: ក → kâ, ្រក → kra. The Roman-script vowel letters in the ô series
columns follow a romanized syllable-initial ◌ of that series: គ → kô, ្រគ → kréa. Eight
vowel characters are not differentiated as to series and may, therefore, follow a
romanized syllable-initial ◌ of either series: ែក → kê, ែគ → kê, ែ្រគ → krê. A Khmer ◌
1 ◌ â ô ◌◌់ á ó
2 ◌ា a éa ◌ា ◌់ ◌័◌ ă5 eă, oă5
3 ឥ ĕ ◌ិ ĕ ĭ
4 ឦ ei ◌ី ei i
5 ◌ឹ œ̆ œ̆
6 ◌ឺ œ œ
7 ឧ ŏ, ŭ9 ◌ុ ŏ ŭ
8 ◌ូ o u
9 ◌ួ uŏ uŏ
10 េ◌ aeu eu
11 េ◌ œă œă
12 េ◌ iĕ iĕ
13 េ◌ é é
14 ឯ ê ែ◌ ê ê
15 ឰ ai ៃ◌ ai ey
3
Independent Dependent
Characters Characters
Normal Shortened
â series ô series â series ô series
16 ឱ (ឲ) aô េ◌ aô oŭ
17 ឳ au េ◌ au ŏu
18 ឪ âu
19 ឫ rœ̆
20 ឬ rœ
21 ឭ lœ̆
22 ឮ lœ
DIACRITICAL MARKS
1 ◌៊2,3 4 ◌៌6
2 ◌៉2,3 5 ◌៏7
3 ◌់5 6 ◌៍8
NUMERALS
០ ១ ២ ៣ ៤ ៥ ៦ ៧ ៨ ៩
0 1 2 3 4 5 6 7 8 9
4
NOTES
and ប (ba), changes it to the ô series: ហ៊឵ង → héang (see also Note 3). The
ប and ប with the diacritical mark ◌៉ are romanized as p in the â series, rather
(’): រក្អ
័ ក → k’êk, ចេង្អ ត → châng’iĕt, រអិល → rô’ĕl, អ្វី → ’vei. In word-
initial position before a vowel letter, however, the apostrophe is optional: អ឵ង
→ ’ang or ang.
5 The diacritical mark ◌់ appears only in two combinations: ◌◌់ (example: បត់
◌័◌. In the â series, both ◌ា ◌់ and ◌័◌ are romanized ă: ច឵ក់ → chăk, ច័ក →
chăk. In the ô series, both are romanized eă when followed by k, ng, or h;
otherwise they are romanized oă: រព឵ក់ → rôpeăk, ម឵ត់ → moăt, វងក ័ →
veăngk, ភ័ព្វ → phoăpv.
6 The combination ◌៌ is romanized rC, where C represents any romanized
5
consonant is vowelled, i.e., followed by â in the â series or by ô in the ô series:
តំណ៏ → tâmnâ, ពម៏ → pômô.
8 The diacritical mark ◌៍, which appears above characters that are not
The BGN/PCGN system differs from that of the 1959 Service Géographique Khmère
(SGK) system in the following respects. In the BGN/PCGN system:
4. The graphic combinations ◌ា ◌់ and ◌័◌ are romanized eă when followed by k, ng,
or h; otherwise, they are romanized oă.
5. The independent character ឧ is romanized ŏ or ŭ.
6. Diacritical marks and numerals have been added.
7. Notes, which are essential to the application of the system, have been provided.