Lojban Book
Lojban Book
Lojban Book
12. Bound variable pro-sumti and pro-bridi: the da-series and the bu'a-series
The following cmavo are discussed in this section:
da KOhA da-series something-1
de KOhA da-series something-2
di KOhA da-series something-3
bu'a GOhA bu'a-series some-predicate-1
bu'e GOhA bu'a-series some-predicate-2
bu'i GOhA bu'a-series some-predicate-3
Bound variables belong to the predicate-logic part of Lojban, and are listed her
e for completeness only. Their semantics is explained in Chapter 16. It is worth
mentioning that the Lojban translation of Example 1.2 is:
12.1) la djan. cu lafti
da poi grana ku'o
gi'e desygau da
John raised
something-1 which is-a-stick
and shake-did something-1
John picked up a stick and shook it.
13. Pro-sumti and pro-bridi cancelling
The following cmavo are discussed in this section:
da'o DAhO cancel all pro-sumti/pro-bridi
How long does a pro-sumti or pro-bridi remain stable? In other words, once we kn
ow the referent of a pro-sumti or pro-bridi, how long can we be sure that future
uses of the same cmavo have the same referent? The answer to this question depe
nds on which series the cmavo belongs to.
Personal pro-sumti are stable until there is a change of speaker or listener, po
ssibly signaled by a vocative. Assignable pro-sumti and pro-bridi last indefinit
ely or until rebound with ``goi'' or ``cei''. Bound variable pro-sumti and pro-b
ridi also generally last until re-bound; details are available in Chapter 16.
Utterance pro-sumti are stable only within the utterance in which they appear; s
imilarly, reflexive pro-sumti are stable only within the bridi in which they app
ear; and ``ke'a'' is stable only within its relative clause. Anaphoric pro-sumti
and pro-bridi are stable only within narrow limits depending on the rules for t
he particular cmavo.
Demonstrative pro-sumti, indefinite pro-sumti and pro-bridi, and sumti and bridi
questions potentially change referents every time they are used.
However, there are ways to cancel all pro-sumti and pro-bridi, so that none of t
hem have known referents. (Some, such as ``mi'', will acquire the same referent
as soon as they are used again after the cancellation.) The simplest way to canc
el everything is with the cmavo ``da'o'' of selma'o DAhO, which is used solely f
or this purpose; it may appear anywhere, and has no effect on the grammar of tex
ts containing it. One use of ``da'o'' is when entering a conversation, to indica
te that one's pro-sumti assignments have nothing to do with any assignments alre
ady made by other participants in the conversation.
In addition, the cmavo ``ni'o'' and ``no'i'' of selma'o NIhO, which are used pri
marily to indicate shifts in topic, may also have the effect of canceling pro-su
mti and pro-bridi assignments, or of reinstating ones formerly in effect. More e
xplanations of NIhO can be found in Chapter 19.
14. The identity predicate: du
The following cmavo is discussed in this section:
du GOhA identity
The cmavo ``du'' has the place structure:
x1 is identical with x2, x3, ...
and appears in selma'o GOhA for reasons of convenience: it is not a pro-bridi. `
`du'' serves as mathematical ``='', and outside mathematical contexts is used fo
r defining or identifying. Mathematical examples may be found in Chapter 18.
The main difference between
14.1) ko'a du le nanmu
it-1 is-identical-to the man
and
14.2) ko'a mintu le nanmu
it-1 is-the-same-as the man
is this defining nature. Example 14.1 presumes that the speaker is responding to
a request for information about what ``ko'a'' refers to, or that the speaker in
some way feels the need to define ``ko'a'' for later reference. A bridi with ``
du'' is an identity sentence, somewhat metalinguistically saying that all attach
ed sumti are representations for the same referent. There may be any number of s
umti associated with ``du'', and all are said to be identical.
Example 14.2, however, predicates; it is used to make a claim about the identity
of ``ko'a'', which presumably has been defined previously.
Note: ``du'' historically is derived from ``dunli'', but ``dunli'' has a third p
lace which ``du'' lacks: the standard of equality.
15. lujvo based on pro-sumti
There exist rafsi allocated to a few cmavo of selma'o KOhA, but they are rarely
used. (See Section 16 for a complete list.) The obvious way to use them is as in
ternal sumti, filling in an appropriate place of the gismu or lujvo to which the
y are attached; as such, they usually stand as the first rafsi in their lujvo.
Thus ``donta'a'', meaning ``you-talk'', would be interpreted as ``tavla be do'',
and would have the place structure
15.1) t1 talks to you about subject t3 in language t4
since t2 (the addressee) is already known to be ``do''.
On the other hand, the lujvo ``donma'o'', literally ``you-cmavo'', which means `
`a second person personal pronoun'', would be interpreted as ``cmavo be zo do'',
and have the place structure:
15.2) c1 is a second person pronoun in language c4
since both the c2 place (the grammatical class) and the c3 place (the meaning) a
re obvious from the context ``do''.
An anticipated use of rafsi for cmavo in the ``fo'a'' series is to express terjv
o which can't be expressed in a convenient rafsi form, because they are too long
to express, or are formally inconvenient (fu'ivla, cmene, and so forth.) An exa
mple would be:
15.3) fo'a goi le kulnrsu,omi .i lo fo'arselsanga
x6 stands for Finnish-culture. An x6-song
Finally, lujvo involving ``zi'o'' are also possible, and are fully discussed in
Chapter 12. In brief, the convention is to use the rafsi for ``zi'o'' as a prefi
x immediately followed by the rafsi for the number of the place to be deleted. T
hus, if we consider a beverage (something drunk without considering who, if anyo
ne, drinks it) as a ``se pinxe be zi'o'', the lujvo corresponding to this is ``z
ilrelselpinxe'' (deleting the second place of ``se pinxe''). Deleting the x1 pla
ce in this fashion would move all remaining places up by one. This would mean th
at ``zilpavypinxe'' has the same place structure as ``zilrelselpinxe'', and ``lo
zilpavypinxe'', like ``lo zilrelselpinxe'', refers to a beverage, and not to a
non-existent drinker.
The pro-bridi ``co'e'', ``du'', and ``bu'a'' also have rafsi, which can be used
just as if they were gismu. The resulting lujvo have (except for ``du''-based lu
jvo) highly context-dependent meanings.
16. KOhA cmavo by series
mi-series: mi I (rafsi: ``mib'') do you (rafsi: ``don'' and ``doi'') mi'o yo
u and I mi'a I and others, we but not you ma'a you and I and others do'o you and
others ko you-imperative
ti-series: ti this here; something nearby (rafsi: ``tif'') ta that there; so
mething distant (rafsi: ``taz'') tu that yonder; something far distant (rafsi: `
`tuf'')
di'u-series: di'u the previous utterance de'u an earlier utterance da'u a mu
ch earlier utterance di'e the next utterance de'e a later utterance da'e a much
later utterance dei this very utterance do'i some utterance
ko'a-series: ko'a it-1; 1st assignable pro-sumti ko'e it-2; 2nd assignable p
ro-sumti ko'i it-3; 3rd assignable pro-sumti ko'o it-4; 4th assignable pro-sumti
ko'u it-5; 5th assignable pro-sumti fo'a it-6; 6th assignable pro-sumti (rafsi:
``fo'a'') fo'e it-7; 7th assignable pro-sumti (rafsi: ``fo'e'') fo'i it-8; 8th
assignable pro-sumti (rafsi: ``fo'i'') fo'o it-9; 9th assignable pro-sumti fo'u
it-10; 10th assignable pro-sumti
ri-series: ri (repeats the last sumti) ra (repeats a previous sumti) ru (rep
eats a long-ago sumti)
zo'e-series: zo'e the obvious value zu'i the typical value zi'o the nonexist
ent value (rafsi: ``zil'')
vo'a-series: vo'a x1 of this bridi vo'e x2 of this bridi vo'i x3 of this bri
di vo'o x4 of this bridi vo'u x5 of this bridi
da-series: da something-1 (rafsi: ``dav''/``dza'') de something-2 di somethi
ng-3
others: ke'a relativized sumti ma sumti question ce'u abstraction focus
17. GOhA and other pro-bridi by series
broda-series (not GOhA): broda is-1; 1st assignable pro-bridi brode is-2; 2n
d assignable pro-bridi brodi is-3; 3rd assignable pro-bridi brodo is-4; 4th assi
gnable pro-bridi brodu is-5; 5th assignable pro-bridi
go'i-series: go'i (repeats the last bridi) go'a (repeats a previous bridi) g
o'u (repeats a long-ago bridi) go'e (repeats the last-but-one bridi) go'o (repea
ts a future bridi) nei (repeats the current bridi) no'a (repeats the next outer
bridi)
bu'a-series: bu'a some-predicate-1 (rafsi: ``bul'') bu'e some-predicate-2 bu
'i some-predicate-3
others: co'e has the obvious relationship (rafsi: ``com''/``co'e'') mo bridi
question du identity: x1 is identical to x2, x3 ... (rafsi: ``dub''/``du'o'')
17. Other cmavo discussed in this chapter
goi GOI pro-sumti assignment (ko'a-series)
cei CEI pro-bridi assignment (broda-series)
ra'o RAhO pro-sumti/pro-bridi update
soi SOI reciprocity
se'u SEhU soi terminator
da'o DAhO cancel all pro-sumti/pro-bridi
Chapter 8
Relative Clauses, Which Make sumti Even More Complicated
1. What are you pointing at?
The following cmavo are discussed in this section:
poi NOI restrictive relative clause introducer
ke'a GOhA relative pro-sumti
ku'o KUhO relative clause terminator
Let us think about the problem of communicating what it is that we are pointing
at when we are pointing at something. In Lojban, we can refer to what we are poi
nting at by using the pro-sumti ``ti'' if it is nearby, or ``ta'' if it is somew
hat further away, or ``tu'' if it is distant. (Pro-sumti are explained in full i
n Chapter 7.)
However, even with the assistance of a pointing finger, or pointing lips, or wha
tever may be appropriate in the local culture, it is often hard for a listener t
o tell just what is being pointed at. Suppose one is pointing at a person (in pa
rticular, in the direction of his or her face), and says:
1.1) ti cu barda
This-one is-big.
What is the referent of ``ti''? Is it the person? Or perhaps it is the person's
nose? Or even (for ``ti'' can be plural as well as singular, and mean ``these on
es'' as well as ``this one'') the pores on the person's nose?
To help solve this problem, Lojban uses a construction called a ``relative claus
e''. Relative clauses are usually attached to the end of sumti, but there are ot
her places where they can go as well, as explained later in this chapter. A rela
tive clause begins with a word of selma'o NOI, and ends with the elidable termin
ator ``ku'o'' (of selma'o KUhO). As you might suppose, ``noi'' is a cmavo of sel
ma'o NOI; however, first we will discuss the cmavo ``poi'', which also belongs t
o selma'o NOI.
In between the ``poi'' and the ``ku'o'' appears a full bridi, with the same synt
ax as any other bridi. Anywhere within the bridi of a relative clause, the pro-s
umti ``ke'a'' (of selma'o KOhA) may be used, and it stands for the sumti to whic
h the relative clause is attached (called the ``relativized sumti''). Here are s
ome examples before we go any further:
1.2) ti poi ke'a prenu ku'o cu barda
This-thing such-that( IT is-a-person ) is-large.
This thing which is a person is big.
This person is big.
1.3) ti poi ke'a nazbi ku'o cu barda
This-thing such-that( IT is-a-nose ) is-large.
This thing which is a nose is big.
This nose is big.
1.4) ti poi ke'a nazbi kapkevna ku'o
cu barda
This-thing such-that( IT is-a-nose-type-of skin-hole )
is-big.
These things which are nose-pores are big.
These nose-pores are big.
In the literal translations throughout this chapter, the word ``IT'', capitalize
d, is used to represent the cmavo ``ke'a''. In each case, it serves to represent
the sumti (in Examples 1.2 through 1.4, the cmavo ``ti'') to which the relative
clause is attached.
Of course, there is no reason why ``ke'a'' needs to appear in the x1 place of a
relative clause bridi; it can appear in any place, or indeed even in a sub-bridi
within the relative clause bridi. Here are two more examples:
1.5) tu poi le mlatu pu lacpu ke'a ku'o
cu ratcu
That-distant-thing such-that( the cat [past] drags IT )
is-a-rat.
That thing which the cat dragged is a rat.
What the cat dragged is a rat.
1.6) ta poi mi djica le nu mi ponse ke'a [kei] ku'o
cu bloti
That-thing such-that( I desire the event-of( I own IT ) )
is-a-boat.
That thing that I want to own is a boat.
In Example 1.6, ``ke'a'' appears in an abstraction clause (abstractions are expl
ained in Chapter 11) within a relative clause.
Like any sumti, ``ke'a'' can be omitted. The usual presumption in that case is t
hat it then falls into the x1 place:
1.7) ti poi nazbi cu barda
This-thing which is-a-nose is-big.
almost certainly means the same thing as Example 1.3. However, ``ke'a'' can be o
mitted if it is clear to the listener that it belongs in some place other than x
1:
1.8) tu poi le mlatu pu lacpu cu ratcu
That-distant-thing which the cat drags is-a-rat
is equivalent to Example 1.4.
As stated before, ``ku'o'' is an elidable terminator, and in fact it is almost a
lways elidable. Throughout the rest of this chapter, ``ku'o'' will not be writte
n in any of the examples unless it is absolutely required: thus, Example 1.2 can
be written:
1.9) ti poi prenu cu barda
That which is-a-person is-big.
That person is big.
without any change in meaning. Note that ``poi'' is translated ``which'' rather
than ``such-that'' when ``ke'a'' has been omitted from the x1 place of the relat
ive clause bridi. The word ``which'' is used in English to introduce English rel
ative clauses: other words that can be used are ``who'' and ``that'', as in:
1.10) I saw a man who was going to the store.
and
1.11) The building that the school was located in is large.
In Example 1.10 the relative clause is ``who was going to the store'', and in Ex
ample 1.11 it is ``that the school was located in''. Sometimes ``who'', ``which'
', and ``that'' are used in literal translations in this chapter in order to mak
e them read more smoothly.
2. Incidental relative clauses
The following cmavo are discussed in this section:
noi NOI incidental relative clause introducer
There are two basic kinds of relative clauses: restrictive relative clauses intr
oduced by ``poi'', and incidental (sometimes called simply ``non-restrictive'')
relative clauses introduced by ``noi''. The difference between restrictive and i
ncidental relative clauses is that restrictive clauses provide information that
is essential to identifying the referent of the sumti to which they are attached
, whereas incidental relative clauses provide additional information which is he
lpful to the listener but is not essential for identifying the referent of the s
umti. All of the examples in Section 1 are restrictive relative clauses: the inf
ormation in the relative clause is essential to identification. (The title of th
is chapter, though, uses an incidental relative clause.)
Consider the following examples:
2.1) le gerku poi blanu cu barda
The dog which is-blue is-large.
The dog which is blue is large.
2.2) le gerku noi blanu cu barda
The dog incidentally-which is-blue is-large.
The dog, which is blue, is large.
In Example 2.1, the information conveyed by ``poi blanu'' is essential to identi
fying the dog in question: it restricts the possible referents from dogs in gene
ral to dogs that are blue. This is why ``poi'' relative clauses are called restr
ictive. In Example 2.2, on the other hand, the dog which is referred to has pres
umably already been identified clearly, and the relative clause ``noi blanu'' ju
st provides additional information about it. (If in fact the dog hasn't been ide
ntified clearly, then the relative clause does not help identify it further.)
In English, the distinction between restrictive and incidental relative clauses
is expressed in writing by surrounding incidental, but not restrictive, clauses
with commas. These commas are functioning as parentheses, because incidental rel
ative clauses are essentially parenthetical. This distinction in punctuation is
represented in speech by a difference in tone of voice. In addition, English res
trictive relative clauses can be introduced by ``that'' as well as ``which'' and
``who'', whereas incidental relative clauses cannot begin with ``that''. Lojban
, however, always uses the cmavo ``poi'' and ``noi'' rather than punctuation or
intonation to make the distinction.
Here are more examples of incidental relative clauses:
2.3) mi noi jdice cu zvati
I who-incidentally am-a-judge am-at [some-place].
I, a judge, am present.
In this example, ``mi'' is already sufficiently restricted, and the additional i
nformation that I am a judge is being provided solely for the listener's edifica
tion.
2.4) xu do viska le mi karce noi blabi
[True?] You see my car incidentally-which is-white.
Do you see my car, which is white?
In Example 2.4, the speaker is presumed to have only one car, and is providing i
ncidental information that it is white. (Alternatively, he or she might have mor
e than one car, since ``le karce'' can be plural, in which case the incidental i
nformation is that each of them is white.) Contrast Example 2.5 with a restricti
ve relative clause:
2.5) xu do viska le mi karce poi blabi
[True?] You see my car which is-white.
Do you see my car that is white?
Do you see my white car?
Here the speaker probably has several cars, and is restricting the referent of t
he sumti ``le mi karce'' (and thereby the listener's attention) to the white one
only. Example 2.5 means much the same as Example 2.6, which does not use a rela
tive clause:
2.6) xu do viska le mi blabi karce
[True?] You see my white car.
Do you see my car, the white one?
So a restrictive relative clause attached to a description can often mean the sa
me as a description involving a tanru. However, ``blabi karce'', like all tanru,
is somewhat vague: in principle, it might refer to a car which carries white th
ings, or even express some more complicated concept involving whiteness and car-
ness; the restrictive relative clause of Example 2.5 can only refer to a car whi
ch is white, not to any more complex or extended concept.
3. Relative phrases
The following cmavo are discussed in this section:
pe GOI restrictive association
po GOI restrictive possession
po'e GOI restrictive intrinsic possession
po'u GOI restrictive identification
ne GOI incidental association
no'u GOI incidental identification
ge'u GEhU relative phrase terminator
There are types of relative clauses (those which have a certain selbri) which ar
e frequently wanted in Lojban, and can be expressed using a shortcut called a re
lative phrase. Relative phrases are introduced by cmavo of selma'o GOI, and cons
ist of a GOI cmavo followed by a single sumti.
Here is an example of ``pe'', plus an equivalent sentence using a relative claus
e:
3.1) le stizu pe mi cu blanu
The chair associated-with me is-blue.
My chair is blue.
3.2) le stizu poi ke'a srana mi
cu blanu
The chair such-that( IT is-associated-with me)
is-blue.
In Example 3.1 and Example 3.2, the link between the chair and the speaker is of
the loosest kind.
Here is an example of ``po'':
3.3) le stizu po mi cu xunre
The chair specific-to me is red.
3.4) le stizu poi
ke'a se steci srana mi cu xunre
The chair such-that
(IT is-specifically associated-with me) is-red.
Example 3.3 and Example 3.4 contrast with Example 3.1 and Example 3.2: the chair
is more permanently connected with the speaker. A plausible (though not the onl
y possible) contrast between Example 3.1 and Example 3.3 is that ``pe mi'' would
be appropriate for a chair the speaker is currently sitting on (whether or not
the speaker owned that chair), and ``po mi'' for a chair owned by the speaker (w
hether or not he or she was currently occupying it).
As a result, the relationship expressed between two sumti by ``po'' is usually c
alled ``possession'', although it does not necessarily imply ownership, legal or
otherwise. The central concept is that of specificity (``steci'' in Lojban).
Here is an example of ``po'e'', as well as another example of ``po'':
3.5) le birka po'e mi cu spofu
The arm intrinsically-possessed-by me is-broken
3.6) le birka poi
jinzi ke se steci srana mi
cu spofu
The arm which
is-intrinsically (specifically associated-with) me
is-broken
3.7) le botpi po mi cu spofu
The bottle specific-to me is-broken
Example 3.5 and Example 3.6 on the one hand, and Example 3.7 on the other, illus
trate the contrast between two types of possession called ``intrinsic'' and ``ex
trinsic'', or sometimes ``inalienable'' and ``alienable'', respectively. Somethi
ng is intrinsically (or inalienably) possessed by someone if the possession is p
art of the possessor, and cannot be changed without changing the possessor. In t
he case of Example 3.5, people are usually taken to intrinsically possess their
arms: even if an arm is cut off, it remains the arm of that person. (If the arm
is transplanted to another person, however, it becomes intrinsically possessed b
y the new user, though, so intrinsic possession is a matter of degree.)
By contrast, the bottle of Example 3.7 can be given away, or thrown away, or los
t, or stolen, so it is possessed extrinsically (alienably). The exact line betwe
en intrinsic and extrinsic possession is culturally dependent. The U.S. Declarat
ion of Independence speaks of the ``inalienable rights'' of men, but just what t
hose rights are, and even whether the concept makes sense at all, varies from cu
lture to culture.
Note that Example 3.5 can also be expressed without a relative clause:
3.8) le birka be mi cu spofu
The arm of-body me is broken
reflecting the fact that the gismu ``birka'' has an x2 place representing the bo
dy to which the arm belongs. Many, but not all, cases of intrinsic possession ca
n be thus covered without using ``po'e'' by placing the possessor into the appro
priate place of the description selbri.
Here is an example of ``po'u'':
3.9) le gerku po'u le mi pendo cu cinba mi
The dog which-is my friend kisses me.
3.10) le gerku poi du le mi pendo cu cinba mi
The dog which = my friend kisses me.
The cmavo ``po'u'' does not represent possession at all, but rather identity. (N
ote that it means ``poi du'' and its form was chosen to suggest the relationship
.)
In Example 3.9, the use of ``po'u'' tells us that ``le gerku'' and ``le mi pendo
'' represent the same thing. Consider the contrast between Example 3.9 and:
3.11) le mi pendo po'u le gerku cu cinba mi
My friend which-is the dog kisses me.
The facts of the case are the same, but the listener's knowledge about the situa
tion may not be. In Example 3.9, the listener is presumed not to understand whic
h dog is meant by ``le gerku'', so the speaker adds a relative phrase clarifying
that it is the particular dog which is the speaker's friend.
Example 3.11, however, assumes that the listener does not know which of the spea
ker's friends is referred to, and specifies that it is the friend that is the do
g (which dog is taken to be obvious). Here is another example of the same contra
st:
3.12) le tcadu po'u la nu,iork
The city of New York [not another city].
3.13) la nu,iork po'u le tcadu
New York the city (not the state or some other New York)
The principle that the possessor and the possessed may change places applies to
all the GOI cmavo, and allows for the possibility of odd effects:
3.14) le kabri pe le mi pendo cu cmalu
The cup associated-with my friend is small.
My friend's cup is small
3.15) le mi pendo pe le kabri cu cmalu
My friend associated-with the cup is small.
My friend, the one with the cup, is small.
Example 3.14 is useful in a context which is about my friend, and states that hi
s or her cup is small, whereas Example 3.15 is useful in a context that is prima
rily about a certain cup, and makes a claim about ``my friend of the cup'', as o
pposed to some other friend of mine. Here the cup appears to ``possess'' the per
son! English can't even express this relationship with a possessive --- ``the cu
p's friend of mine'' looks like nonsense --- but Lojban has no trouble doing so.
Finally, the cmavo ``ne'' and ``no'u'' stand to ``pe'' and ``po'u'', respectivel
y, as ``noi'' does to ``poi'' --- they provide incidental information:
3.16) le blabi gerku ne mi cu batci do
The white dog, incidentally-associated-with me,
bites you.
The white dog, which is mine, bites you.
In Example 3.16, the white dog is already fully identified (after all, presumabl
y the listener know which dog bit him or her!). The fact that it is yours is mer
ely incidental to the main bridi claim.
Distinguishing between ``po'u'' and ``no'u'' can be a little tricky. Consider a
room with several men in it, one of whom is named Jim. If you don't know their n
ames, I might say:
3.17) le nanmu no'u la djim. cu terpemci
The man, incidentally-who-is Jim, is-a-poet.
The man, Jim, is a poet.
Here I am saying that one of the men is a poet, and incidentally telling you tha
t he is Jim. But if you do know the names, then
3.18) le nanmu po'u la djim. cu terpemci
The man who-is Jim is-a-poet.
The man, the one named Jim, is a poet.
is appropriate. Now I am using the fact that the man I am speaking of is Jim in
order to pick out which man I mean.
It is worth mentioning that English sometimes over-specifies possession from the
Lojban point of view (and the point of view of many other languages, including
ones closely related to English). The idiomatic English sentence
3.19) The man put his hands in his pockets.
seems strange to a French- or German-speaking person: whose pockets would he put
his hands into? and even odder, whose hands would he put into his pockets? In L
ojban, the sentence
3.20) le nanmu cu punji le xanci le daski
The man puts the hand at-locus-the pocket.
is very natural. Of course, if the man is in fact putting his hands into another
's pockets, or another's hands into his pockets, the fact can be specified.
Finally, the elidable terminator for GOI cmavo is ``ge'u'' of selma'o GEhU; it i
s almost never required. However, if a logical connective immediately follows a
sumti modified by a relative phrase, then an explicit ``ge'u'' is needed to allo
w the connective to affect the relativized sumti rather than the sumti of the re
lative phrase. (What about the cmavo after which selma'o GOI is named? It is dis
cussed in Chapter 7, as it is not semantically akin to the other kinds of relati
ve phrases, although the syntax is the same.)
4. Multiple relative clauses: ``zi'e''
zi'e ZIhE relative clause joiner
Sometimes it is necessary or useful to attach more than one relative clause to a
sumti. This is made possible in Lojban by the cmavo ``zi'e'' (of selma'o ZIhE),
which is used to join one or more relative clauses together into a single unit,
thus making them apply to the same sumti. For example:
4.1) le gerku poi blabi zi'e poi batci le nanmu cu klama
The dog which is white and which bites the man goes.
The most usual translation of ``zi'e'' in English is ``and'', but ``zi'e'' is no
t really a logical connective: unlike most of the true logical connectives (whic
h are explained in Chapter 14), it cannot be converted into a logical connection
between sentences.
It is perfectly correct to use ``zi'e'' to connect relative clauses of different
kinds:
4.2) le gerku poi blabi zi'e noi
le mi pendo cu ponse ke'a cu klama
The dog that-is( white) and incidentally-such-that(
my friend owns IT) goes.
The dog that is white, which my friend owns,
is going.
In Example 4.2, the restrictive clause ``poi blabi'' specifies which dog is refe
rred to, but the incidental clause ``noi le mi pendo cu ponse'' is mere incident
al information: the listener is supposed to already have identified the dog from
the ``poi blabi''. Of course, the meaning (though not necessarily the emphasis)
is the same if the incidental clause appears first.
It is also possible to connect relative phrases with ``zi'e'', or a relative phr
ase with a relative clause:
4.3) le botpi po mi zi'e poi blanu cu spofu
The bottle specific-to me and which-is blue is-broken.
My blue bottle is broken.
Note that if the colloquial translation of Example 4.3 were ``My bottle, which i
s blue, is broken'', then ``noi'' rather than ``poi'' would have been correct in
the Lojban version, since that version of the English implies that you do not n
eed to know the bottle is blue. As written, Example 4.3 suggests that I probably
have more than one bottle, and the one in question needs to be picked out as th
e blue one.
4.4) mi ba zutse le stizu pe mi
zi'e po do zi'e poi xunre
I [future] sit-in the chair associated-with me and
specific-to you and which-is red.
I will sit in my chair (really yours), the red one.
Example 4.4 illustrates that more than two relative phrases or clauses can be co
nnected with ``zi'e''. It almost defies colloquial translation because of the ve
ry un-English contrast between ``pe mi'', implying that the chair is temporarily
connected with me, and ``po do'', implying that the chair has a more permanent
association with you. (Perhaps I am a guest in your house, in which case the cha
ir would naturally be your property.)
Here is another example, mixing a relative phrase and two relative clauses, a re
strictive one and a non-restrictive one:
4.5) mi ba citka le dembi pe mi
zi'e poi cpana le mi palta
zi'e noi do dunda ke'a mi
I [future] eat the beans associated-with me
and which are-upon my plate
and which-incidentally you gave IT to-me.
I'll eat my beans that are on my plate, the ones
you gave me.
5. Non-veridical relative clauses: ``voi''
voi NOI non-veridical relative clause introducer
There is another member of selma'o NOI which serves to introduce a third kind of
relative clause: ``voi''. Relative clauses introduced by ``voi'' are restrictiv
e, like those introduced by ``poi''. However, there is a fundamental difference
between ``poi'' and ``voi'' relative clauses. A ``poi'' relative clause is said
to be veridical, in the same sense that a description using ``lo'' or ``loi'' is
: it is essential to the interpretation that the bridi actually be true. For exa
mple:
5.1) le gerku poi blabi cu klama
The dog which is-white goes.
it must actually be true that the dog is white, or the sentence constitutes a mi
scommunication. If there is a white dog and a brown dog, and the speaker uses ``
le gerku poi blabi'' to refer to the brown dog, then the listener will not under
stand correctly. However,
5.2) le gerku voi blabi cu klama
the dog which-I-describe-as white goes
puts the listener on notice that the dog in question may not actually meet objec
tive standards (whatever they are) for being white: only the speaker can say exa
ctly what is meant by the term. In this way, ``voi'' is like ``le''; the speaker
's intention determines the meaning.
As a result, the following two sentences
5.3) le nanmu cu ninmu
That-which-I-describe-as a-man is-a-woman.
The ``guy'' is actually a gal.
5.4) ti voi nanmu cu ninmu
This-thing which-I-describe-as a-man is-a-woman.
mean essentially the same thing (except that Example 5.5 involves pointing thank
s to the use of ``ti'', whereas Example 5.4 doesn't), and neither one is self-co
ntradictory: it is perfectly all right to describe something as a man (although
perhaps confusing to the listener) even if it actually is a woman.
6. Relative clauses and descriptors
So far, this chapter has described the various kinds of relative clauses (includ
ing relative phrases). The list is now complete, and the rest of the chapter wil
l be concerned with the syntax of sumti that include relative clauses. So far, a
ll relative clauses have appeared directly after the sumti to which they are att
ached. This is the most common position (and originally the only one), but a var
iety of other placements are also possible which produce a variety of semantic e
ffects.
There are actually three places where a relative clause can be attached to a des
cription sumti: after the descriptor (``le'', ``lo'', or whatever), after the em
bedded selbri but before the elidable terminator (which is ``ku''), and after th
e ``ku''. The relative clauses attached to descriptors that we have seen have oc
cupied the second position. Thus Example 5.1, if written out with all elidable t
erminators, would appear as:
6.1) le gerku poi blabi ku'o ku cu klama vau
the( dog which( is-white ) ) goes.
The dog which is white is going.
Here ``ku'o'' is the terminator paired with ``poi'' and ``ku'' with ``le'', and
``vau'' is the terminator of the whole bridi.
When a simple descriptor using ``le'', like ``le gerku'', has a relative clause
attached, it is purely a matter of style and emphasis where the relative clause
should go. Therefore, the following examples are all equivalent in meaning to Ex
ample 6.1:
6.2) le poi blabi ku'o gerku cu klama
The such-that (it-is-white) dog goes.
6.3) le gerku ku poi blabi cu klama
The( dog ) which is-white goes.
Example 6.1 will seem most natural to speakers of languages like English, which
always puts relative clauses after the noun phrases they are attached to; Exampl
e 6.2, on the other hand, may seem more natural to Finnish or Chinese speakers,
who put the relative clause first. Note that in Example 6.2, the elidable termin
ator ``ku'o'' must appear, or the selbri of the relative clause (``blabi'') will
merge with the selbri of the description (``gerku''), resulting in an ungrammat
ical sentence. The purpose of the form appearing in Example 6.3 will be apparent
shortly.
As is explained in detail in Chapter 6, two different numbers (known as the ``in
ner quantifier'' and the ``outer quantifier'') can be attached to a description.
The inner quantifier specifies how many things the descriptor refers to: it app
ears between the descriptor and the description selbri. The outer quantifier app
ears before the descriptor, and specifies how many of the things referred to by
the descriptor are involved in this particular bridi. In the following example,
6.4) re le mu prenu cu klama le zarci
Two-of the five persons go to-the market.
Two of the five people [that I have in mind]
are going to the market.
``mu'' is the inner quantifier and ``re'' is the outer quantifier. Now what is m
eant by attaching a relative clause to the sumti ``re le mu prenu''? Suppose the
relative clause is ``poi ninmu'' (meaning ``who are women''). Now the three pos
sible attachment points discussed previously take on significance.
6.5) re le poi ninmu ku'o mu prenu
cu klama le zarci
Two of the such-that([they] are-women )
five persons go to-the market.
Two women out of the five persons go to the market.
6.6) re le mu prenu poi ninmu [ku]
cu klama le zarci
Two of the (five persons which are-women)
go to-the market.
Two of the five women go to the market.
6.7) re le mu prenu ku poi ninmu
cu klama le zarci
(Two of the five persons ) which are-women
go to-the market.
Two women out of the five persons go to the market.
As the parentheses show, Example 6.6 means that all five of the persons are wome
n, whereas Example 6.7 means that the two who are going to the market are women.
How do we remember which is which? If the relative clause comes after the expli
cit ``ku'', as in Example 6.7, then the sumti as a whole is qualified by the rel
ative clause. If there is no ``ku'', or if the relative clause comes before an e
xplicit ``ku'', then the relative clause is understood to apply to everything wh
ich the underlying selbri applies to.
What about Example 6.5? By convention, it means the same as Example 6.7, and it
requires no ``ku'', but it does typically require a ``ku'o'' instead. Note that
the relative clause comes before the inner quantifier.
When ``le'' is the descriptor being used, and the sumti has no explicit outer qu
antifier, then the outer quantifier is understood to be ``ro'' (meaning ``all'')
, as is explained in Chapter 6. Thus ``le gerku'' is taken to mean ``all of the
things I refer to as dogs'', possibly all one of them. In that case, there is no
difference between a relative clause after the ``ku'' or before it. However, if
the descriptor is ``lo'', the difference is quite important:
6.8) lo prenu ku noi blabi
cu klama le zarci
(Some persons) incidentally-which are-white
go to-the market.
Some people, who are white, go to the market.
6.9) lo prenu noi blabi [ku]
cu klama le zarci
Some (persons incidentally-which are-white)
go to-the market.
Some of the people, who by the way are white,
go to the market.
Both Examples 6.8 and 6.9 tell us that one or more persons are going to the mark
et. However, they make very different incidental claims. Now, what does ``lo pre
nu noi blabi'' mean? Well, the default inner quantifier is ``ro'' (meaning ``all
''), and the default outer quantifier is ``su'o'' (meaning ``at least one''). Th
erefore, we must first take all persons, then choose at least one of them. That
one or more people will be going.
In Example 6.8, the relative clause described the sumti once the outer quantifie
r was applied: one or more people, who are white, are going. But in Example 6.9,
the relative clause actually describes the sumti before the outer quantificatio
n is applied, so that it ends up meaning ``First take all persons --- by the way
, they're all white''. But not all people are white, so the incidental claim bei
ng made here is false.
The safe strategy, therefore, is to always use ``ku'' when attaching a ``noi'' r
elative clause to a ``lo'' descriptor. Otherwise we may end up claiming far too
much.
When the descriptor is ``la'', indicating that what follows is a selbri used for
naming, then the positioning of relative clauses has a different significance.
A relative clause inside the ``ku'', whether before or after the selbri, is reck
oned part of the name; a relative clause outside the ``ku'' is not. Therefore,
6.10) mi viska la nanmu poi terpa le ke'a xirma [ku]
I see that-named( ``man which fears the of-IT horse'' ).
I see Man Afraid Of His Horse.
says that the speaker sees a person with a particular name, who does not necessa
rily fear any horses, whereas
6.11) mi viska la nanmu ku poi terpa le ke'a xirma.
I see that-named( ``Man'' ) which fears the of-IT horse.
I see the person named ``Man'' who is afraid of his horse.
refers to one (or more) of those named ``Man'', namely the one(s) who are afraid
of their horses.
Finally, so-called indefinite sumti like ``re karce'', which means almost the sa
me as ``re lo karce'' (which in turn means the same as ``re lo ro karce''), can
have relative clauses attached; these are taken to be of the outside-the-``ku''
variety. Here is an example:
6.12) mi ponse re karce [ku] poi xekri
I possess two cars which-are black.
The restrictive relative clause only affects the two cars being affected by the
main bridi, not all cars that exist. It is ungrammatical to try to place a relat
ive clause within an indefinite sumti (that is, before an explicitly expressed t
erminating ``ku''.) Use an explicit ``lo'' instead.
7. Possessive sumti
In Examples 2.4 through 2.6, the sumti ``le mi karce'' appears, glossed as ``my
car''. Although it might not seem so, this sumti actually contains a relative ph
rase. When a sumti appears between a descriptor and its description selbri, it i
s actually a ``pe'' relative phrase. So
7.1) le mi karce cu xunre
my car is-red.
and
7.2) le pe mi karce cu xunre
the (associated-with me) car is-red.
mean exactly the same thing. Furthermore, since there are no special considerati
ons of quantifiers here,
7.3) le karce pe mi cu xunre
The car associated-with me is-red
means the same thing as well. A sumti like the one in Example 7.1 is called a ``
possessive sumti''. Of course, it does not really indicate possession in the sen
se of ownership, but like ``pe'' relative phrases, indicates only weak associati
on; you can say ``le mi karce'' even if you've only borrowed it for the night. (
In English, ``my car'' usually means ``le karce po mi'', but we do not have the
same sense of possession in ``my seat on the bus''; Lojban simply makes the weak
er sense the standard one.) The inner sumti, ``mi'' in Example 7.1, is correspon
dingly called the ``possessor sumti''.
Historically, possessive sumti existed before any other kind of relative phrase
or clause, and were retained when the machinery of relative phrases and clauses
as detailed in this chapter so far was slowly built up. When preposed relative c
lauses of the Example 7.2 type were devised, possessive sumti were most easily v
iewed as a special case of them.
Although any sumti, however complex, can appear in a full-fledged relative phras
e, only simple sumti can appear as possessor sumti, without a ``pe''. Roughly sp
eaking, the legal possessor sumti are: pro-sumti, quotations, names and descript
ions, and numbers. In addition, the possessor sumti may not be preceded by a qua
ntifier, as such a form would be interpreted as the unusual ``descriptor + quant
ifier + sumti'' type of description. All these sumti forms are explained in full
in Chapter 6.
Here is an example of a description used in a possessive sumti:
7.4) le le nanmu ku karce cu blanu
The (associated-with-the man) car is blue.
The man's car is blue.
However, this system is not ideal for all purposes. For one thing, it is
verbose. The native lerfu words are often quite long, and with ``bu'' added
they become even longer: the worst-case Greek lerfu word would be
``.Omikron. bu'', with four syllables and two mandatory pauses. In addition,
alphabets that are used by many languages have separate sets of lerfu words
for each language, and which set is Lojban to choose?
The alternative plan, therefore, is to use a shift word similar to those introdu
ced in Section 3. After the appearance of such a shift word, the regular lerfu w
ords are re-interpreted to represent the lerfu of the alphabet now in use. After
a shift to the Greek alphabet, for example, the lerfu word ``ty'' would represe
nt not Latin ``t'' but Greek ``tau''. Why ``tau''? Because it is, in some sense,
the closest counterpart of ``t'' within the Greek lerfu system. In principle it
would be all right to map ``ty.'' to ``phi'' or even ``omega'', but such an arb
itrary relationship would be extremely hard to remember.
Where no obvious closest counterpart exists, some more or less arbitrary choice
must be made. Some alien lerfu may simply not have any shifted equivalent, forci
ng the speaker to fall back on a ``bu'' form. Since a ``bu'' form may mean diffe
rent things in different alphabets, it is safest to employ a shift word even whe
n ``bu'' forms are in use.
Shifts for several alphabets have been assigned cmavo of selma'o BY:
lo'a Latin/Roman/Lojban alphabet
ge'o Greek alphabet
je'o Hebrew alphabet
jo'o Arabic alphabet
ru'o Cyrillic alphabet
The cmavo ``zai'' (of selma'o LAU) is used to create shift words to still other
alphabets. The BY word which must follow any LAU cmavo would typically be a name
representing the alphabet with ``bu'' suffixed:
5.2) zai .devanagar. bu
Devanagari (Hindi) alphabet
5.3) zai .katakan. bu
Japanese katakana syllabary
5.4) zai .xiragan. bu
Japanese hiragana syllabary
Unlike the cmavo above, these shift words have not been standardized and probabl
y will not be until someone actually has a need for them. (Note the ``.'' charac
ters marking leading and following pauses.)
In addition, there may be multiple visible representations within a single alpha
bet for a given letter: roman vs. italics, handwriting vs. print, Bodoni vs. Hel
vetica. These traditional ``font and face'' distinctions are also represented by
shift words, indicated with the cmavo ``ce'a'' (of selma'o LAU) and a following
BY word:
5.5) ce'a .xelveticas. bu
Helvetica font
5.6) ce'a .xancisk. bu
handwriting
5.7) ce'a .pavrel. bu
12-point font size
The cmavo ``na'a'' (of selma'o BY) is a universal shift-word cancel: it returns
the interpretation of lerfu words to the default of lower-case Lojban with no sp
ecific font. It is more general than ``lo'a'', which changes the alphabet only,
potentially leaving font and case shifts in place.
Several sections at the end of this chapter contain tables of proposed lerfu wor
d assignments for various languages.
6. Accent marks and compound lerfu words
Many languages that make use of the Latin alphabet add special marks to some of
the lerfu they use. French, for example, uses three accent marks above vowels, c
alled (in English) ``acute'', ``grave'', and ``circumflex''. Likewise, German us
es a mark called ``umlaut''; a mark which looks the same is also used in French,
but with a different name and meaning.
These marks may be considered lerfu, and each has a corresponding lerfu word in
Lojban. So far, no problem. But the marks appear over lerfu, whereas the words m
ust be spoken (or written) either before or after the lerfu word representing th
e basic lerfu. Typewriters (for mechanical reasons) and the computer programs th
at emulate them usually require their users to type the accent mark before the b
asic lerfu, whereas in speech the accent mark is often pronounced afterwards (fo
r example, in German ``a umlaut'' is preferred to ``umlaut a'').
Lojban cannot settle this question by fiat. Either it must be left up to default
interpretation depending on the language in question, or the lerfu-word compoun
ding cmavo ``tei'' (of selma'o TEI) and ``foi'' (of selma'o FOI) must be used. T
hese cmavo are always used in pairs; any number of lerfu words may appear betwee
n them, and the whole is treated as a single compound lerfu word. The French wor
d ``été'', with acute accent marks on both ``e'' lerfu, could be spelled as:
6.1) tei .ebu .akut. bu foi ty. tei .akut. bu .ebu foi
( ``e'' acute ) ``t'' ( acute ``e'' )
and it does not matter whether ``akut. bu'' appears before or after ``.ebu''; th
e ``tei ... foi'' grouping guarantees that the acute accent is associated with t
he correct lerfu. Of course, the level of precision represented by Example 6.1 w
ould rarely be required: it might be needed by a Lojban-speaker when spelling ou
t a French word for exact transcription by another Lojban-speaker who did not kn
ow French.
This system breaks down in languages which use more than one accent mark on a si
ngle lerfu; some other convention must be used for showing which accent marks ar
e written where in that case. The obvious convention is to represent the mark ne
arest the basic lerfu by the lerfu word closest to the word representing the bas
ic lerfu. Any remaining ambiguities must be resolved by further conventions not
yet established.
Some languages, like Swedish and Finnish, consider certain accented lerfu to be
completely distinct from their unaccented equivalents, but Lojban does not make
a formal distinction, since the printed characters look the same whether they ar
e reckoned as separate letters or not. In addition, some languages consider cert
ain 2-letter combinations (like ``ll'' and ``ch'' in Spanish) to be letters; thi
s may be represented by enclosing the combination in ``tei ... foi''.
In addition, when discussing a specific language, it is permissible to make up n
ew lerfu words, as long as they are either explained locally or well understood
from context: thus Spanish ``ll'' or Croatian ``lj'' could be called ``libu'', b
ut that usage would not necessarily be universally understood.
Section 19 contains a table of proposed lerfu words for some common accent marks
.
7. Punctuation marks
Lojban does not have punctuation marks as such: the denpa bu and the slaka bu ar
e really a part of the alphabet. Other languages, however, use punctuation marks
extensively. As yet, Lojban does not have any words for these punctuation marks
, but a mechanism exists for devising them: the cmavo ``lau'' of selma'o LAU. ``
lau'' must always be followed by a BY word; the interpretation of the BY word is
changed from a lerfu to a punctuation mark. Typically, this BY word would be a
name or brivla with a ``bu'' suffix.
Why is ``lau'' necessary at all? Why not just use a ``bu''-marked word and annou
nce that it is always to be interpreted as a punctuation mark? Primarily to avoi
d ambiguity. The ``bu'' mechanism is extremely open-ended, and it is easy for Lo
jban users to make up ``bu'' words without bothering to explain what they mean.
Using the ``lau'' cmavo flags at least the most important of such nonce lerfu wo
rds as having a special function: punctuation. (Exactly the same argument applie
s to the use of ``zai'' to signal an alphabet shift or ``ce'a'' to signal a font
shift.)
Since different alphabets require different punctuation marks, the interpretatio
n of a ``lau''-marked lerfu word is affected by the current alphabet shift and t
he current font shift.
8. What about Chinese characters?
Chinese characters (``han4zi4'' in Chinese, ``kanji'' in Japanese) represent an
entirely different approach to writing from alphabets or syllabaries. (A syllaba
ry, such as Japanese hiragana or Amharic writing, has one lerfu for each syllabl
e of the spoken language.) Very roughly, Chinese characters represent single ele
ments of meaning; also very roughly, they represent single syllables of spoken C
hinese. There is in principle no limit to the number of Chinese characters that
can exist, and many thousands are in regular use.
It is hopeless for Lojban, with its limited lerfu and shift words, to create an
alphabet which will match this diversity. However, there are various possible wa
ys around the problem.
First, both Chinese and Japanese have standard Latin-alphabet representations, k
nown as ``pinyin'' for Chinese and ``romaji'' for Japanese, and these can be use
d. Thus, the word ``han4zi4'' is conventionally written with two characters, but
it may be spelled out as:
8.1) .y'y.bu .abu ny. vo zy. .ibu vo
``h'' ``a'' ``n'' 4 ``z'' ``i'' 4
The cmavo ``vo'' is the Lojban digit ``4''. It is grammatical to intersperse dig
its (of selma'o PA) into a string of lerfu words; as long as the first cmavo is
a lerfu word, the whole will be interpreted as a string of lerfu words. In Chine
se, the digits can be used to represent tones. Pinyin is more usually written us
ing accent marks, the mechanism for which was explained in Section 6.
The Japanese company named ``Mitsubishi'' in English is spelled the same way in
romaji, and could be spelled out in Lojban thus:
8.2) my. .ibu ty. sy. .ubu by. .ibu sy. .y'y.bu .ibu
``m'' ``i'' ``t'' ``s'' ``u'' ``b'' ``i'' ``s'' ``h'' ``i''
Alternatively, a really ambitious Lojbanist could assign lerfu words to the indi
vidual strokes used to write Chinese characters (there are about seven or eight
of them if you are a flexible human being, or about 40 if you are a rigid comput
er program), and then represent each character with a ``tei'', the stroke lerfu
words in the order of writing (which is standardized for each character), and a
``foi''. No one has as yet attempted this project.
9. lerfu words as pro-sumti
So far, lerfu words have only appeared in Lojban text when spelling out words. T
here are several other grammatical uses of lerfu words within Lojban. In each ca
se, a single lerfu word or more than one may be used. Therefore, the term ``lerf
u string'' is introduced: it is short for ``sequence of one or more lerfu words'
'.
A lerfu string may be used as a pro-sumti (a sumti which refers to some previous
sumti), just like the pro-sumti ``ko'a'', ``ko'e'', and so on:
9.1) .abu prami by.
A loves B
In Example 9.1, ``.abu'' and ``by.'' represent specific sumti, but which sumti t
hey represent must be inferred from context.
Alternatively, lerfu strings may be assigned by ``goi'', the regular pro-sumti a
ssignment cmavo:
9.2) le gerku goi gy. cu xekri .i gy. klama le zdani
The dog, or G, is black. G goes to the house.
There is a special rule that sometimes makes lerfu strings more advantageous tha
n the regular pro-sumti cmavo. If no assignment can be found for a lerfu string
(especially a single lerfu word), it can be assumed to refer to the most recent
sumti whose name or description begins in Lojban with that lerfu. So Example 9.2
can be rephrased:
9.3) le gerku cu xekri. .i gy. klama le zdani
The dog is black. G goes to the house.
(A less literal English translation would use ``D'' for ``dog'' instead.)
Here is an example using two names and longer lerfu strings:
9.4) la stivn. mark. djonz. merko
.i la .aleksandr. paliitc. kuzNIETsyf. rusko
.i symyjy. tavla .abupyky. bau la lojban.
Steven Mark Jones is-American.
Alexander Pavlovitch Kuznetsov is-Russian.
SMJ talks-to APK in Lojban.
Perhaps Alexander's name should be given as ``ru'o.abupyky'' instead.
What about
9.5) .abu dunda by. cy.
A gives B C
Does this mean that A gives B to C? No. ``by. cy.'' is a single lerfu string, al
though written as two words, and represents a single pro-sumti. The true interpr
etation is that A gives BC to someone unspecified. To solve this problem, we nee
d to introduce the elidable terminator ``boi'' (of selma'o BOI). This cmavo is u
sed to terminate lerfu strings and also strings of numerals; it is required when
two of these appear in a row, as here. (The other reason to use ``boi'' is to a
ttach a free modifier --- subscript, parenthesis, or what have you --- to a lerf
u string.) The correct version is:
9.6) .abu [boi] dunda by. boi cy. [boi]
A gives B to C
where the two occurrences of ``boi'' in brackets are elidable, but the remaining
occurrence is not. Likewise:
9.7) xy. boi ro [boi] prenu cu prami
X all persons loves.
X loves everybody.
requires the first ``boi'' to separate the lerfu string ``xy.'' from the digit s
tring ``ro''.
10. References to lerfu
The rules of Section 9 make it impossible to use unmarked lerfu words to refer t
o lerfu themselves. In the sentence:
10.1) .abu. cu lerfu
A is-a-letteral.
the hearer would try to find what previous sumti ``.abu'' refers to. The solutio
n to this problem makes use of the cmavo ``me'o'' of selma'o LI, which makes a l
erfu string into a sumti representing that very string of lerfu. This use of ``m
e'o'' is a special case of its mathematical use, which is to introduce a mathema
tical expression used literally rather than for its value.
10.2) me'o .abu cu lerfu
the-expression ``a'' is-a-letteral.
Now we can translate Example 1.1 into Lojban:
10.4) dei vasru vo lerfu
po'u me'o .ebu
this-sentence contains four letterals
which-are the-expression ``e''.
This sentence contains four ``e''s.
Since the Lojban sentence has only four ``e'' lerfu rather than fourteen, the tr
anslation is not a literal one --- but Example 10.4 is a Lojban truth just as Ex
ample 1.1 is an English truth. Coincidentally, the colloquial English translatio
n of Example 10.4 is also true!
The reader might be tempted to use quotation with ``lu ... li'u'' instead of ``m
e'o'', producing:
10.4.5) lu .abu li'u cu lerfu
[quote] .abu [unquote] is-a-letteral.
(The single-word quote ``zo'' cannot be used, because ``.abu'' is a compound cma
vo.) But Example 10.4 is false, because it says:
10.5) The word ``.abu'' is a letteral
which is not the case; rather, the thing symbolized by the word ``.abu'' is a le
tteral. In Lojban, that would be:
10.6) la'e lu .abu li'u cu lerfu
The-referent-of [quote] .abu [unquote] is-a-letteral.
which is correct.
11. Mathematical uses of lerfu strings
This chapter is not about Lojban mathematics, which is explained in Chapter 18,
so the mathematical uses of lerfu strings will be listed and exemplified but not
explained.
A lerfu string as mathematical variable:
11.1) li .abu du li by. su'i cy.
the-number a equals the-number b plus c
a = b + c
A lerfu string as function name (preceded by ``ma'o'' of selma'o MAhO):
11.2) li .y.bu du li ma'o fy. boi xy.
the-number y equals the number the-function f of x
y = f(x)
Note the ``boi'' here to separate the lerfu strings ``fy'' and ``xy''.
A lerfu string as selbri (followed by a cmavo of selma'o MOI):
11.3) le vi ratcu ny.moi le'i mi ratcu
the here rat is-nth-of the-set-of my rats
This rat is my Nth rat.
A lerfu string as utterance ordinal (followed by a cmavo of selma'o MAI):
11.4) ny.mai
Nthly
A lerfu string as subscript (preceded by ``xi'' of selma'o XI):
11.5) xy. xi ky.
x sub k
A lerfu string as quantifier (enclosed in ``vei ... ve'o'' parentheses):
11.6) vei ny. [ve'o] lo prenu
( ``n'' ) persons
The parentheses are required because ``ny. lo prenu'' would be two separate sumt
i, ``ny.'' and ``lo prenu''. In general, any mathematical expression other than
a simple number must be in parentheses when used as a quantifier; the right pare
nthesis mark, the cmavo ``ve'o'', can usually be elided.
All the examples above have exhibited single lerfu words rather than lerfu strin
gs, in accordance with the conventions of ordinary mathematics. A longer lerfu s
tring would still be treated as a single variable or function name: in Lojban, `
`.abu by. cy.'' is not the multiplication ``a x b x c'' but is the variable ``ab
c''. (Of course, a local convention could exist that made the value of a variabl
e like ``abc'', with a multi-lerfu-word name, equal to the values of the variabl
es ``a'', ``b'', and ``c'' multiplied together.)
There is a special rule about shift words in mathematical text: shifts within ma
thematical expressions do not affect lerfu words appearing outside
mathematical expressions, and vice versa.
12. Acronyms
An acronym is a name constructed of lerfu. English examples are ``DNA'', ``NATO'
', ``CIA''. In English, some of these are spelled out (like ``DNA'' and ``CIA'')
and others are pronounced more or less as if they were ordinary English words (
like ``NATO''). Some acronyms fluctuate between the two pronunciations: ``SQL''
may be ``ess cue ell'' or ``sequel''.
In Lojban, a name can be almost any sequence of sounds that ends in a consonant
and is followed by a pause. The easiest way to Lojbanize acronym names is to glu
e the lerfu words together, using ``''' wherever two vowels would come together
(pauses are illegal in names) and adding a final consonant:
12.1) la dyny'abub. .i la ny'abuty'obub.
.i la cy'ibu'abub. .i la sykybulyl.
.i la .ibubymym. .i la ny'ybucyc.
DNA. NATO.
CIA. SQL.
IBM. NYC.
There is no fixed convention for assigning the final consonant. In Example 12.1,
the last consonant of the lerfu string has been replicated into final position.
Some compression can be done by leaving out ``bu'' after vowel lerfu words (exce
pt for ``.y.bu'', wherein the ``bu'' cannot be omitted without ambiguity). Compr
ession is moderately important because it's hard to say long names without intro
ducing an involuntary (and illegal) pause:
12.2) la dyny'am. .i la ny'aty'om.
.i la cy'i'am. .i la sykybulym.
.i la .ibymym. .i la ny'ybucym.
DNA. NATO.
CIA. SQL.
IBM. NYC.
In Example 12.2, the final consonant ``m'' stands for ``merko'', indicating the
source culture of these acronyms.
Another approach, which some may find easier to say and which is compatible with
older versions of the language that did not have a ``''' character, is to use t
he consonant ``z'' instead of ``''':
12.3) la dynyzaz. .i la nyzatyzoz.
.i la cyzizaz. .i la sykybulyz.
.i la .ibymyz. .i la nyzybucyz.
DNA. NATO.
CIA. SQL.
IBM. NYC.
One more alternative to these lengthy names is to use the lerfu string itself pr
efixed with ``me'', the cmavo that makes sumti into selbri:
12.4) la me dy ny. .abu
that-named what-pertains-to ``d'' ``n'' ``a''
This works because ``la'', the cmavo that normally introduces names used as sumt
i, may also be used before a predicate to indicate that the predicate is a (mean
ingful) name:
12.5) la cribe cu ciska
that-named ``Bear'' writes
Bear is a writer
Example 12.5 does not of course refer to a bear (``le cribe'' or ``lo cribe'') b
ut to something else, probably a person, named ``Bear''. Similarly, ``me dy ny.
.abu'' is a predicate which can be used as a name, producing a kind of acronym w
hich can have pauses between the individual lerfu words.
13. Computerized character codes
Since the first application of computers to non-numerical information, character
sets have existed, mapping numbers (called ``character codes'') into selected l
erfu, digits, and punctuation marks (collectively called ``characters''). Histor
ically, these character sets have only covered the English alphabet and a few se
lected punctuation marks. International efforts are now underway to create a uni
fied character set that can represent essentially all the characters in essentia
lly all the world's writing systems. Lojban can take advantage of these encoding
schemes by using the cmavo ``se'e'' (of selma'o BY). This cmavo is conventional
ly followed by digit cmavo of selma'o PA representing the character code, and th
e whole string indicates a single character in some computerized character set:
13.1) me'o se'ecixa cu lerfu
la .asycy'i'is. loi merko rupnu
the-expression [code] 36 is-a-letteral
in-set ASCII
for-the-mass-of American currency-units.
The character code 36 in ASCII represents
American dollars.
``$'' represents American dollars.
Understanding Example 13.1 depends on knowing the value in the ASCII character s
et (one of the simplest and oldest) of the ``$'' character. Therefore, th
e ``se'e'' convention is only intelligible to those who know the underlying char
acter set. For precisely specifying a particular character, however, it has the
advantages of unambiguity and (relative) cultural neutrality, and therefore Lojb
an provides a means for those with access to descriptions of such character sets
to take advantage of them.
As another example, the Unicode character set (also known as ISO 10646) represen
ts the international symbol of peace, an inverted trident in a circle, using the
base-16 value 262E. In a suitable context, a Lojbanist may say:
13.2) me'o se'erexarerei sinxa le ka panpi
the-expression [code] 262E is-a-sign-of
the quality-of being-at-peace
When a ``se'e'' string appears in running discourse, some metalinguistic convent
ion must specify whether the number is base 10 (as above) or some other base, an
d which character set is in use.
14. List of all auxiliary lerfu-word cmavo
cmavo selma'o meaning
bu BU makes previous word into
a lerfu word
ga'e BY upper case shift
to'a BY lower case shift
tau LAU case-shift next lerfu word only
lo'a BY Latin/Lojban alphabet shift
ge'o BY Greek alphabet shift
je'o BY Hebrew alphabet shift
jo'o BY Arabic alphabet shift
ru'o BY Cyrillic alphabet shift
se'e BY following digits are
a character code
na'a BY cancel all shifts
zai LAU following lerfu word
specifies alphabet
ce'a LAU following lerfu word
specifies font
lau LAU following lerfu word
is punctuation
tei TEI start compound lerfu word
foi FOI end compound lerfu word
Note that LAU cmavo must be followed by a BY cmavo or the equivalent, where ``eq
uivalent'' means: either any Lojban word followed by ``bu'', another LAU cmavo (
and its required sequel), or a ``tei ... foi'' compound cmavo.
15. Proposed lerfu words --- introduction
The following sections contain tables of proposed lerfu words for some of the st
andard alphabets supported by the Lojban lerfu system. The first column of each
list is the lerfu (actually, a Latin-alphabet name sufficient to identify it). T
he second column is the proposed name-based lerfu word, and the third column is
the proposed lerfu word in the system based on using the cmavo of selma'o BY wit
h a shift word.
These tables are not meant to be authoritative (several authorities within the L
ojban community have niggled over them extensively, disagreeing with each other
and sometimes with themselves). They provide a working basis until actual usage
is available, rather than a final resolution of lerfu word problems. Probably th
e system presented here will evolve somewhat before settling down into a final,
conventional form.
For Latin-alphabet lerfu words, see Section 2 (for Lojban) and Section 5 (for no
n-Lojban Latin-alphabet lerfu).
16. Proposed lerfu words for the Greek alphabet
alpha .alfas. bu .abu
beta .betas. bu by
gamma .gamas. bu gy
delta .deltas. bu dy
epsilon .Epsilon. bu .ebu
zeta .zetas. bu zy
eta .etas. bu .e'ebu
theta .tetas. bu ty. bu
iota .iotas. bu .ibu
kappa .kapas. bu ky
lambda .lymdas. bu ly
mu .mus. bu my
nu .nus. bu ny
xi .ksis. bu ksis. bu
omicron .Omikron. bu .obu
pi .pis. bu py
rho .ros. bu ry
sigma .sigmas. bu sy
tau .taus. bu ty
upsilon .Upsilon. bu .ubu
phi .fis. bu py. bu
chi .xis. bu ky. bu
psi .psis. bu psis. bu
omega .omegas. bu .o'obu
rough .dasei,as. bu .y'y
smooth .psiles. bu xutla bu
17. Proposed lerfu words for the Cyrillic alphabet
The second column in this listing is based on the historical names of the letter
s in Old Church Slavonic. Only those letters used in Russian are shown; other la
nguages require more letters which can be devised as needed.
a .azys. bu .abu
b .bukys. bu by
v .vedis. bu vy
g .glagolis. bu gy
d .dobros. bu dy
e .iestys. bu .ebu
zh .jivet. bu jy
z .zemlias. bu zy
i .ije,is. bu .ibu
short i .itord. bu
k .kakos. bu ky
l .liudi,ies. bu ly
m .myslites. bu my
n .naciys. bu ny
o .onys. bu .obu
p .pokois. bu py
r .riytsis. bu ry
s .slovos. bu sy
t .tvriydos. bu ty
u .ukys. bu .ubu
f .friytys. bu fy
kh .xerys. bu xy
ts .tsis. bu tsys. bu
ch .tcriyviys. bu tcys. bu
sh .cas. bu cy
shch .ctas. bu ctcys. bu
hard sign .ier. bu jdari bu
yeri .ierys. bu .y.bu
soft sign .ieriys. bu ranti bu
reversed e .ecarn. bu
yu .ius. bu .iubu
ya .ias. bu .iabu
18. Proposed lerfu words for the Hebrew alphabet
aleph .alef. bu .alef. bu
bet .bet. bu by
gimel .gimel. bu gy
daled .daled. bu dy
he .xex. bu .y'y
vav .vav. bu vy
zayin .zai,in. bu zy
khet .xet. bu xy. bu
tet .tet. bu ty. bu
yud .iud. bu .iud. bu
kaf .kaf. bu ky
lamed .LYmed. bu ly
mem .mem. bu my
nun .nun. bu ny
samekh .samex. bu samex. bu
ayin .ai,in. bu .ai,in bu
pe .pex. bu py
tzadi .tsadik. bu tsadik. bu
quf .kuf. bu ky. bu
resh .rec. bu ry
shin .cin. bu cy
sin .sin. bu sy
taf .taf. bu ty.
dagesh .daGEC. bu daGEC. bu
hiriq .xirik. bu .ibu
tzeirekh .tseirex. bu .eibu
segol .seGOL. bu .ebu
qubbutz .kubuts. bu .ubu
qamatz .kamats. bu .abu
patach .patax. bu .a'abu
sheva .cyVAS. bu .y.bu
kholem .xolem. bu .obu
shuruq .curuk. bu .u'ubu
19. Proposed lerfu words for some accent marks and multiple letters
This list is intended to be suggestive, not complete: there are lerfu such as Po
lish ``dark'' l and Maltese h-bar that do not yet have symbols.
acute .akut. bu or
.pritygal. bu [pritu galtu]
grave .grav. bu
or .zulgal. bu [zunle galtu]
circumflex .cirkumfleks. bu
or .midgal. bu [midju galtu]
tilde .tildes. bu
macron .makron. bu
breve .brevis. bu
over-dot .garmoc. bu [gapru mokca]
umlaut/trema relmoc. bu [re mokca]
over-ring .garjin. bu [gapru djine]
cedilla .seDIlys. bu
double-acute .re'akut. bu [re akut.]
ogonek .ogoniek. bu
hacek .xatcek. bu
ligatured fi tei fy. ibu foi
Danish/Latin ae tei .abu .ebu foi
Dutch ij tei .ibu jy. foi
German es-zed tei sy. zy. foi
20. Proposed lerfu words for radio communication
There is a set of English words which are used, by international agreement, as l
erfu words (for the English alphabet) over the radio, or in noisy situations whe
re the utmost clarity is required. Formally they are known as the ``ICAO Phoneti
c Alphabet'', and are used even in non-English-speaking countries.
This table presents the standard English spellings and proposed Lojban versions.
The Lojbanizations are not straightforward renderings of the English sounds, bu
t make some concessions both to the English spellings of the words and to the Lo
jban pronunciations of the lerfu (thus ``carlis. bu'', not ``tcarlis. bu'').
Alfa .alfas. bu
Bravo .bravos. bu
Charlie .carlis. bu
Delta .deltas. bu
Echo .ekos. bu
Foxtrot .fokstrot. bu
Golf .golf. bu
Hotel .xoTEL. bu
India .indias. bu
Juliet .juliet. bu
Kilo .kilos. bu
Lima .limas. bu
Mike .maik. bu
November .novembr. bu
Oscar .oskar. bu
Papa .paPAS. bu
Quebec .keBEK. bu
Romeo .romios. bu
Sierra .sieras. bu
Tango .tangos. bu
Uniform .Uniform. bu
Victor .viktas. bu
Whiskey .uiskis. bu
X-ray .eksreis. bu
Yankee .iankis. bu
Zulu .zulus. bu
Chapter 18
lojbau mekso: Mathematical Expressions in Lojban
1. Introductory
lojbau mekso (``Lojbanic mathematical-expression'') is the part of the Lojban la
nguage that is tailored for expressing statements of a mathematical character, o
r for adding numerical information to non-mathematical statements. Its formal de
sign goals include:
1)
representing all the different forms of expression used by mathematicians in
their normal modes of writing, so that a reader can unambiguously read off math
ematical text as written with minimal effort and expect a listener to understand
it;
2)
providing a vocabulary of commonly used mathematical terms which can readily
be expanded to include newly coined words using the full resources of Lojban;
3)
permitting the formulation, both in writing and in speech, of unambiguous ma
thematical text;
4)
encompassing all forms of quantified expression found in natural languages,
as well as encouraging greater precision in ordinary language situations than na
tural languages allow.
Goal 1 requires that mekso not be constrained to a single notation such as Polis
h notation or reverse Polish notation, but make provision for all forms, with th
e most commonly used forms the most easily used.
Goal 2 requires the provision of several conversion mechanisms, so that the boun
dary between mekso and full Lojban can be crossed from either side at many point
s.
Goal 3 is the most subtle. Written mathematical expression is culturally unambig
uous, in the sense that mathematicians in all parts of the world understand the
same written texts to have the same meanings. However, international mathematica
l notation does not prescribe unique forms. For example, the expression
1.1) 3x + 2y
contains omitted multiplication operators, but there are other possible interpre
tations for the strings ``3x'' and ``2y'' than as mathematical multiplication. T
herefore, the Lojban verbal (spoken and written) form of Example 1.1 must not om
it the multiplication operators.
The remainder of this chapter explains (in as much detail as is currently possib
le) the mekso system. This chapter is by intention complete as regards mekso com
ponents, but only suggestive about uses of those components --- as of now, there
has been no really comprehensive use made of mekso facilities, and many matters
must await the test of usage to be fully clarified.
2. Lojban numbers
The following cmavo are discussed in this section:
pa PA 1
re PA 2
ci PA 3
vo PA 4
mu PA 5
xa PA 6
ze PA 7
bi PA 8
so PA 9
no PA 0
The simplest kind of mekso are numbers, which are cmavo or compound cmavo. There
are cmavo for each of the 10 decimal digits, and numbers greater than 9 are mad
e by stringing together the cmavo. Some examples:
2.1) pa re ci
one two three
123
one hundred and twenty three
2.2) pa no
one zero
10
ten
2.3) pa re ci vo mu xa ze bi so no
one two three four five six seven eight nine zero
1234567890
one billion, two hundred and thirty-four million,
five hundred and sixty-seven thousand,
eight hundred and ninety.
Therefore, there are no separate cmavo for ``ten'', ``hundred'', etc.
There is a pattern to the digit cmavo (except for ``no'', 0) which is worth expl
aining. The cmavo from 1 to 5 end in the vowels ``a'', ``e'', ``i'', ``o'', ``u'
' respectively; and the cmavo from 6 to 9 likewise end in the vowels ``a'', ``e'
', ``i'', and ``o'' respectively. None of the digit cmavo begin with the same co
nsonant, to make them easy to tell apart in noisy environments.
3. Signs and numerical punctuation
The following cmavo are discussed in this section:
ma'u PA positive sign
ni'u PA negative sign
pi PA decimal point
fi'u PA fraction slash
ra'e PA repeating decimal
ce'i PA percent sign
ki'o PA comma between digits
A number can be given an explicit sign by the use of ``ma'u'' and ``ni'u'', whic
h are the positive and negative signs as distinct from the addition, subtraction
, and negation operators. For example:
3.1) ni'u pa
negative-sign 1
-1
Grammatically, the signs are part of the number to which they are attached. It i
s also possible to use ``ma'u'' and ``ni'u'' by themselves as numbers; the meani
ng of these numbers is explained in Section 8.
Various numerical punctuation marks are likewise expressed by cmavo, as illustra
ted in the following examples:
3.2) ci pi pa vo pa mu
three point one four one five
3.1415
(In some cultures, a comma is used instead of a period in the symbolic version o
f Example 3.2; ``pi'' is still the Lojban representation for the decimal point.)
3.3) re fi'u ze
two fraction seven
2/7
Example 3.3 is the name of the number two-sevenths; it is not the same as ``the
result of 2 divided by 7'' in Lojban, although numerically these two are equal.
If the denominator of the fraction is present but the numerator is not, the nume
rator is taken to be 1, thus expressing the reciprocal of the following number:
3.4) fi'u ze
fraction seven
1/7
3.5) pi ci mu ra'e pa vo re bi mu ze
point three five repeating one four two eight five seven
.35142857142857...
Note that the ``ra'e'' marks unambiguously where the repeating portion ``142857'
' begins.
3.6) ci mu ce'i
three five percent
35%
3.7) pa ki'o re ci vo ki'o mu xa ze
one comma two three four comma five six seven
1,234,567
(In some cultures, spaces are used in the symbolic representation of Example 3.7
; ``ki'o'' is still the Lojban representation.)
It is also possible to have less than three digits between successive ``ki'o''s,
in which case zeros are assumed to have been elided:
3.8) pa ki'o re ci ki'o vo
one comma two three comma four
1,023,004
In the same way, ``ki'o'' can be used after ``pi'' to divide fractions into grou
ps of three:
3.9) pi ki'o re re
point comma two two
.022
3.10) pi pa ki'o pa re ki'o pa
point one comma one two comma one
.101012001
4. Special numbers
The following cmavo are discussed in this section:
ci'i PA infinity
ka'o PA imaginary i, sqrt(-1)
pai PA pi (approx 3.14159...)
te'o PA exponential e (approx 2.71828...)
fi'u PA golden ratio, phi,
(1 + sqrt(5))/2 (approx. 1.61803...)
The last cmavo is the same as the fraction sign cmavo: a fraction sign with neit
her numerator nor denominator represents the golden ratio.
Numbers can have any of these digit, punctuation, and special-number cmavo of Se
ctions 2, 3, and 4 in any combination:
4.1) ma'u ci'i
+Â¥
4.2) ci ka'o re
3i2 (a complex number equivalent to ``3 + 2i'')
Note that ``ka'o'' is both a special number (meaning ``i'') and a number punctua
tion mark (separating the real and the imaginary parts of a complex number).
4.3) ci'i no
infinity zero
Ã
0 (a transfinite cardinal)
The special numbers ``pai'' and ``te'o'' are mathematically important, which is
why they are given their own cmavo:
4.4) pai
pi
4.5) te'o
e
However, many combinations are as yet undefined:
4.6) pa pi re pi ci
1.2.3
4.7) pa ni'u re
1 negative-sign 2
Example 4.5 is not ``1 minus 2'', which is represented by a different cmavo sequ
ence altogether. It is a single number which has not been assigned a meaning. Th
ere are many such numbers which have no well-defined meaning; they may be used f
or experimental purposes or for future expansion of the Lojban number system.
It is possible, of course, that some of these ``oddities'' do have a meaningful
use in some restricted area of mathematics. A mathematician appropriating these
structures for specialized use needs to consider whether some other branch of ma
thematics would use the structure differently.
More information on numbers may be found in Sections 8 to 12.
5. Simple infix expressions and equations
The following cmavo are discussed in this section:
du GOhA equals
su'i VUhU plus
vu'u VUhU minus
pi'i VUhU times
te'a VUhU raised to the power
ny. BY letter ``n''
vei VEI left parenthesis
ve'o VEhO right parenthesis
Let us begin at the beginning: one plus one equals two. In Lojban, that sentence
translates to:
5.1) li pa su'i pa du li re
the-number one plus one equals the-number two.
1 + 1 = 2
Example 5.1, a mekso sentence, is a regular Lojban bridi that exploits mekso fea
tures. ``du'' is the predicate meaning ``x1 is mathematically equal to x2''. It
is a cmavo for conciseness, but it has the same grammatical uses as any brivla.
Outside mathematical contexts, ``du'' means ``x1 is identical with x2'' or ``x1
is the same object as x2''.
The cmavo ``li'' is the number article. It is required whenever a sentence talks
about numbers as numbers, as opposed to using numbers to quantify things. For e
xample:
5.2) le ci prenu
the three persons
requires no ``li'' article, because the ``ci'' is being used to specify the numb
er of ``prenu''. However, the sentence
5.3) levi sfani cu grake li ci
this fly masses-in-grams the-number three
This fly has a mass of 3 grams.
requires ``li'' because ``ci'' is being used as a sumti. Note that this is the w
ay in which measurements are stated in Lojban: all the predicates for units of l
ength, mass, temperature, and so on have the measured object as the first place
and a number as the second place. Using ``li'' for ``le'' in Example 5.2 would p
roduce
5.4) li ci prenu
the-number 3 is-a-person
which is grammatical but nonsensical: numbers are not persons.
The cmavo ``su'i'' belongs to selma'o VUhU, which is composed of mathematical op
erators, and means ``addition''. As mentioned before, it is distinct from ``ma'u
'' which means the positive sign as an indication of a positive number:
5.5) li ma'u pa su'i ni'u pa
du li no
the-number positive-sign one plus negative-sign one
equals the-number zero
+1 + -1 = 0
Of course, it is legal to have complex mekso on both sides of ``du'':
5.6) li mu su'i pa
du li ci su'i ci
the-number five plus one
equals the-number three plus three
5 + 1 = 3 + 3
Why don't we say ``li mu su'i li pa'' rather than just ``li mu su'i pa''? The an
swer is that VUhU operators connect mekso operands (numbers, in Example 5.6), no
t general sumti. ``li'' is used to make the entire mekso into a sumti, which the
n plays the roles applicable to other sumti: in Example 5.6, filling the places
of a bridi.
By default, Lojban mathematics is like simple calculator mathematics: there is n
o notion of ``operator precedence''. Consider the following example, where ``pi'
i'' means ``times'', the multiplication operator:
5.7) li ci su'i vo pi'i mu
du li reci
the-number three plus four times five
equals the-number two-three
3 + 4 Ã 5 = 23
Is the Lojban version of Example 5.7 true? No! ``3 + 4 Ã 5'' is indeed 23, because
the usual conventions of mathematics state that multiplication takes precedence
over addition; that is, the multiplication ``4 Ã 5'' is done first, giving 20, and
only then the addition ``3 + 20''. But VUhU operators by default are done left t
o right, like other Lojban grouping, and so a truthful bridi would be:
5.8) li ci su'i vo pi'i mu
du li cimu
the-number three plus four times five
equals the-number three-five
3 + 4 Ã 5 = 35
Here we calculate 3 + 4 first, giving 7, and then calculate 7 Ã 5 second, leading t
o the result 35. While possessing the advantage of simplicity, this result viola
tes the design goal of matching the standards of mathematics. What can be done?
There are three solutions, all of which will probably be used to some degree. Th
e first solution is to ignore the problem. People will say ``li ci su'i vo pi'i
mu'' and mean 23 by it, because the notion that multiplication takes precedence
over addition is too deeply ingrained to be eradicated by Lojban parsing, which
totally ignores semantics. This convention essentially allows semantics to domin
ate syntax in this one area.
(Why not hard-wire the precedences into the grammar, as is done in computer prog
ramming languages? Essentially because there are too many operators, known and u
nknown, with levels of precedence that vary according to usage. The programming
language 'C' has 13 levels of precedence, and its list of operators is not even
extensible. For Lojban this approach is just not practical. In addition, hard-wi
red precedence could not be overridden in mathematical systems such as spreadshe
ets where the conventions are different.)
The second solution is to use explicit means to specify the precedence of operat
ors. This approach is fully general, but clumsy, and will be explained in Sectio
n 20.
The third solution is simple but not very general. When an operator is prefixed
with the cmavo ``bi'e'' (of selma'o BIhE), it becomes automatically of higher pr
ecedence than other operators not so prefixed. Thus,
5.9) li ci su'i vo bi'e pi'i mu
du li reci
the-number three plus four-times-five
equals the-number two-three
3 + 4 Ã 5 = 23
is a truthful Lojban bridi. If more than one operator has a ``bi'e'' prefix, gro
uping is to the right; multiple ``bi'e'' prefixes on a single operator are not a
llowed.
In addition, of course, Lojban has the mathematical parentheses ``vei'' and ``ve
'o'', which can be used just like their written equivalents ``('' and ``)'' to g
roup expressions in any way desired:
5.10) li vei ny. su'i pa ve'o pi'i vei ny. su'i pa [ve'o]
du li ny. [bi'e] te'a re su'i re bi'e pi'i ny. su'i pa
the-number ( ``n'' plus one ) times ( ``n'' plus one )
equals the-number n-power-two plus two-times-``n'' plus 1
(n + 1)(n + 1) = n2 + 2n + 1
There are several new usages in Example 5.10: ``te'a'' means ``raised to the pow
er'', and we also see the use of the lerfu word ``ny'', representing the letter
``n''. In mekso, letters stand for just what they do in ordinary mathematics: va
riables. The parser will accept a string of lerfu words (called a ``lerfu string
'') as the equivalent of a single lerfu word, in agreement with computer-science
conventions; ``abc'' is a single variable, not the equivalent of ``a à b à c''. (Of c
ourse, a local convention could state that the value of a variable like ``abc'',
with a multi-lerfu name, was equal to the values of the variables ``a'', ``b'',
and ``c'' multiplied together.)
The explicit operator ``pi'i'' is required in the Lojban verbal form whereas mul
tiplication is implicit in the symbolic form. Note that ``ve'o'' (the right pare
nthesis) is an elidable terminator: the first use of it in Example 5.10 is requi
red, but the second use (marked by square brackets) could be elided. Additionall
y, the first ``bi'e'' (also marked by square brackets) is not necessary to get t
he proper grouping, but it is included here for symmetry with the other one.
6. Forethought operators (Polish notation, functions)
The following cmavo are discussed in this section:
boi BOI numeral/lerfu string terminator
va'a VUhU negation/additive inverse
pe'o PEhO forethought flag
ku'e KUhE forethought terminator
py. BY letter ``p''
xy. BY letter ``x''
zy. BY letter ``z''
ma'o MAhO convert operand to operator
fy. BY letter ``f''
The infix form explained so far is reasonable for many purposes, but it is limit
ed and rigid. It works smoothly only where all operators have exactly two operan
ds, and where precedences can either be assumed from context or are limited to j
ust two levels, with some help from parentheses.
But there are many operators which do not have two operands, or which have a var
iable number of operands. The preferred form of expression in such cases is the
use of ``forethought operators'', also known as Polish notation. In this style o
f writing mathematics, the operator comes first and the operands afterwards:
6.1) li su'i paboi reboi ci[boi] du li xa
the-number the-sum-of one two three equals the-number six
sum(1,2,3) = 6
Note that the normally elidable number terminator ``boi'' is required after ``pa
'' and ``re'' because otherwise the reading would be ``pareci'' = 123. It is not
required after ``ci'' but is inserted here in brackets for the sake of symmetry
. The only time ``boi'' is required is, as in Example 6.1, when there are two co
nsecutive numbers or lerfu strings.
Forethought mekso can use any number of operands, in Example 6.1 three. How do w
e know how many operands there are in ambiguous circumstances? The usual Lojban
solution is employed: an elidable terminator, namely ``ku'e''. Here is an exampl
e:
6.2) li py. su'i va'a ny. ku'e su'i zy du li xy.
the-number ``p'' plus negative-of( ``n'' ) plus ``z''
equals the-number ``x''
p + -n + z = x
where we know that ``va'a'' is a forethought operator because there is no operan
d preceding it.
``va'a'' is the numerical negation operator, of selma'o VUhU. In contrast, j ``v
u'u'' is not used for numerical negation, but only for subtraction, as it always
has two or more operands. Do not confuse ``va'a'' and ``vu'u'', which are opera
tors, with ``ni'u'', which is part of a number.
In Example 6.2, the operator ``va'a'' and the terminator ``ku'e'' serve in effec
t as parentheses. (The regular parentheses ``vei'' and ``ve'o'' are NOT used for
this purpose.) If the ``ku'e'' were omitted, the ``su'i zy'' would be swallowed
up by the ``va'a'' forethought operator, which would then appear to have two op
erands, ``ny'' and ``su'i zy.'', where the latter is also a forethought expressi
on.
Forethought mekso is also useful for matching standard functional notation. How
do we represent ``z = f(x)''? The answer is:
6.3) li zy du li ma'o fy.boi xy.
the-number z equals the-number the-operator f x
z = f(x)
Again, no parentheses are used. The construct ``ma'o fy.boi'' is the equivalent
of an operator, and appears in forethought here (although it could also be used
as a regular infix operator). In mathematics, letters sometimes mean functions a
nd sometimes mean variables, with only the context to tell which. Lojban chooses
to accept the variable interpretation as the default, and uses the special flag
``ma'o'' to mark a lerfu string as an operator. The cmavo ``xy.'' and ``zy.'' a
re variables, but ``fy.'' is an operator (a function) because ``ma'o'' marks it
as such. The ``boi'' is required because otherwise the ``xy.'' would look like p
art of the operator name. (The use of ``ma'o'' can be generalized from lerfu str
ings to any mekso operand: see Section 21.)
When using forethought mekso, the optional marker ``pe'o'' may be placed in fron
t of the operator. This usage can help avoid confusion by providing clearly mark
ed ``pe'o'' and ``ku'e'' pairs to delimit the operand list. Examples 6.1 to 6.3,
respectively, with explicit ``pe'o'' and ``ku'e'':
6.4) li pe'o su'i paboi reboi ciboi ku'e du li xa
6.5) li py. su'i pe'o va'a ny. ku'e su'i zy du li xy.
6.6) li zy du li pe'o ma'o fy.boi xy. ku'e
Note: When using forethought mekso, be sure that the operands really are operand
s: they cannot contain regular infix expressions unless parenthesized with ``vei
'' and ``ve'o''. An earlier version of the complex Example 17.6 came to grief be
cause I forgot this rule.
7. Other useful selbri for mekso bridi
So far our examples have been isolated mekso (it is legal to have a bare mekso a
s a sentence in Lojban) and equation bridi involving ``du''. What about inequali
ties such as ``x < 5''? The answer is to use a bridi with an appropriate selbri,
thus:
7.1) li xy. mleca li mu
the-number x is-less-than the-number 5
Here is a partial list of selbri useful in mathematical bridi:
du x1 is identical to x2, x3, x4, ...
dunli x1 is equal/congruent to x2 in/on
property/quality/dimension/quantity x3
mleca x1 is less than x2
zmadu x1 is greater than x2
dubjavme'a x1 is less than or equal to x2
[du ja mleca, equal or less]
dubjavmau x1 is greater than or equal to x2
[du ja zmadu, equal or greater]
tamdu'i x1 is similar to x2
[tarmi dunli, shape-equal]
turdu'i x1 is isomorphic to x2
[stura dunli, structure-equal]
cmima x1 is a member of set x2
gripau x1 is a subset of set x2
[girzu pagbu, set-part]
na'ujbi x1 is approximately equal to
[namcu jibni, number-near]
terci'e x1 is a component with function x2
of system x3
Note the difference between ``dunli'' and ``du''; ``dunli'' has a third place th
at specifies the kind of equality that is meant. ``du'' refers to actual identit
y, and can have any number of places:
7.2) py. du xy.boi zy.
``p'' is-identical-to ``x'' ``z''
p = x = z
Lojban bridi can have only one predicate, so the ``du'' is not repeated.
Any of these selbri may usefully be prefixed with ``na'', the contradictory nega
tion cmavo, to indicate that the relation is false:
7.3) li re su'i re na du li mu
the-number 2 + 2 is-not equal-to the-number 5.
2 + 2 ½ 5
As usual in Lojban, negated bridi say what is false, and do not say anything abo
ut what might be true.
8. Indefinite numbers
The following cmavo are discussed in this section:
ro PA all
so'a PA almost all
so'e PA most
so'i PA many
so'o PA several
so'u PA a few
no'o PA the typical number of
da'a PA all but (one) of
piro PA+PA the whole of/all of
piso'a PA+PA almost the whole of
piso'e PA+PA most of
piso'i PA+PA much of
piso'o PA+PA a small part of
piso'u PA+PA a tiny part of
pino'o PA+PA the typical portion of
rau PA enough
du'e PA too many
mo'a PA too few
pirau PA+PA enough of
pidu'e PA+PA too much of
pimo'a PA+PA too little of
Not all the cmavo of PA represent numbers in the usual mathematical sense. For e
xample, the cmavo ``ro'' means ``all'' or ``each''. This number does not have a
definite value in the abstract: ``li ro'' is undefined. But when used to count o
r quantify something, the parallel between ``ro'' and ``pa'' is clearer:
8.1) mi catlu pa prenu
I look-at one person
8.2) mi catlu ro prenu
I look-at all persons
Example 8.1 might be true, whereas Example 8.2 is almost certainly false.
The cmavo ``so'a'', ``so'e'', ``so'i'', ``so'o'', and ``so'u'' represent a set o
f indefinite numbers less than ``ro''. As you go down an alphabetical list, the
magnitude decreases:
8.3) mi catlu so'a prenu
I look-at almost-all persons
8.4) mi catlu so'e prenu
I look-at most persons
8.5) mi catlu so'i prenu
I look-at many persons
8.6) mi catlu so'o prenu
I look-at several persons
8.7) mi catlu so'u prenu
I look-at a-few persons
The English equivalents are only rough: the cmavo provide space for up to five i
ndefinite numbers between ``ro'' and ``no'', with a built-in ordering. In partic
ular, ``so'e'' does not mean ``most'' in the sense of ``a majority'' or ``more t
han half''.
Each of these numbers, plus ``ro'', may be prefixed with ``pi'' (the decimal poi
nt) in order to make a fractional form which represents part of a whole rather t
han some elements of a totality. ``piro'' therefore means ``the whole of'':
8.8) mi citka piro lei nanba
I eat the-whole-of the-mass-of bread
Similarly, ``piso'a'' means ``almost the whole of''; and so on down to ``piso'u'
', ``a tiny part of''. These numbers are particularly appropriate with masses, w
hich are usually measured rather than counted, as Example 8.8 shows.
In addition to these cmavo, there is ``no'o'', meaning ``the typical value'', an
d ``pino'o'', meaning ``the typical portion'': Sometimes ``no'o'' can be transla
ted ``the average value'', but the average in question is not, in general, a mat
hematical mean, median, or mode; these would be more appropriately represented b
y operators.
8.9) mi catlu no'o prenu
I look-at a-typical-number-of persons
8.10) mi citka pino'o lei nanba
I eat a-typical-amount-of the-mass-of bread.
``da'a'' is a related cmavo meaning ``all but'':
8.11) mi catlu da'a re prenu
I look-at all-but two persons
8.12) mi catlu da'a so'u prenu
I look-at all-but a-few persons
Example 8.12 is similar in meaning to Example 8.3.
If no number follows ``da'a'', then ``pa'' is assumed; ``da'a'' by itself means
``all but one'', or in ordinal contexts ``all but the last'':
8.13) ro ratcu ka'e citka da'a ratcu
all rats can eat all-but-one rats.
All rats can eat all other rats.
(The use of ``da'a'' means that Example 8.13 does not require that all rats can
eat themselves, but does allow it. Each rat has one rat it cannot eat, but that
one might be some rat other than itself. Context often dictates that ``itself''
is, indeed, the ``other'' rat.)
As mentioned in Section 3, ``ma'u'' and ``ni'u'' are also legal numbers, and the
y mean ``some positive number'' and ``some negative number'' respectively.
8.14) li ci vu'u re du li ma'u
the-number 3 - 2 = some-positive-number
8.15) li ci vu'u vo du li ni'u
the-number 3 - 4 = some-negative-number
8.16) mi ponse ma'u rupnu
I possess a-positive-number-of currency-units.
All of the numbers discussed so far are objective, even if indefinite. If there
are exactly six superpowers (``rairgugde'', ``superlative-states'') in the world
, then ``ro rairgugde'' means the same as ``xa rairgugde''. It is often useful,
however, to express subjective indefinite values. The cmavo ``rau'' (enough), ``
du'e'' (too many), and ``mo'a'' (too few) are then appropriate:
8.17) mi ponse rau rupnu
I possess enough currency-units.
Like the ``so'a''-series, ``rau'', ``du'e'', and ``mo'a'' can be preceded by ``p
i''; for example, ``pirau'' means ``a sufficient part of.''
Another possibility is that of combining definite and indefinite numbers into a
single number. This usage implies that the two kinds of numbers have the same va
lue in the given context:
8.18) mi viska le rore gerku
I saw the all-of/two dogs.
I saw both dogs.
8.19) mi speni so'ici prenu
I am-married-to many/three persons.
I am married to three persons
(which is ``many'' in the circumstances).
Example 8.19 assumes a mostly monogamous culture by stating that three is ``many
''.
9. Approximation and inexact numbers
The following cmavo are discussed in this section:
ji'i PA approximately
su'e PA at most
su'o PA at least
me'i PA less than
za'u PA more than
The cmavo ``ji'i'' (of selma'o PA) is used in several ways to indicate approxima
te or rounded numbers. If it appears at the beginning of a number, the whole num
ber is approximate:
9.1) ji'i vo no
approximation four zero
approximately 40
If ``ji'i'' appears in the middle of a number, all the digits following it are a
pproximate:
9.2) vo no ji'i mu no
four zero approximation five zero
roughly 4050
(where the ``four thousand'' is exact,
but the ``fifty'' is approximate)
If ``ji'i'' appears at the end of a number, it indicates that the number has bee
n rounded. In addition, it can then be followed by a sign cmavo (``ma'u'' or ``n
i'u''), which indicate truncation towards positive or negative infinity respecti
vely.
9.3) re pi ze re ji'i
two point seven two approximation
2.72 (rounded)
9.4) re pi ze re ji'i ma'u
two point seven two approximation positive-sign
2.72 (rounded up)
9.5) re pi ze pa ji'i ni'u
two point seven one approximation negative-sign
2.71 (rounded down)
Examples 9.3 through 9.5 are all approximations to ``te'o'' (exponential e). ``j
i'i'' can also appear by itself, in which case it means ``approximately the typi
cal value in this context''.
The four cmavo ``su'e'', ``su'o'', ``me'i'', and ``za'u'', also of selma'o PA, e
xpress inexact numbers with upper or lower bounds:
9.6) mi catlu su'e re prenu
I look-at at-most two persons
9.7) mi catlu su'o re prenu
I look-at at-least two persons
9.8) mi catlu me'i re prenu
I look-at less-than two persons
9.9) mi catlu za'u re prenu
I look-at more-than two persons
Each of these is a subtly different claim: Example 9.7 is true of two or any gre
ater number, whereas Example 9.9 requires three persons or more. Likewise, Examp
le 9.6 refers to zero, one, or two; Example 9.8 to zero or one. (Of course, when
the context allows numbers other than non-negative integers, ``me'i re'' can be
any number less than 2, and likewise with the other cases.) The exact quantifie
r, ``exactly 2, neither more nor less'' is just ``re''. Note that ``su'ore'' is
the exact Lojban equivalent of English plurals.
If no number follows one of these cmavo, ``pa'' is understood: therefore,
9.10) mi catlu su'o prenu
I look-at at-least [one] person
is a meaningful claim.
Like the numbers in Section 8, all of these cmavo may be preceded by ``pi'' to m
ake the corresponding quantifiers for part of a whole. For example, ``pisu'o'' m
eans ``at least some part of''. The quantifiers ``ro'', ``su'o'', ``piro'', and
``pisu'o'' are particularly important in Lojban, as they are implicitly used in
the descriptions introduced by the cmavo of selma'o LA and LE, as explained in C
hapter 6. Descriptions in general are outside the scope of this chapter.
10. Non-decimal and compound bases
The following cmavo are discussed in this section:
ju'u VUhU to the base
dau PA hex digit A = 10
fei PA hex digit B = 11
gai PA hex digit C = 12
jau PA hex digit D = 13
rei PA hex digit E = 14
vai PA hex digit F = 15
pi'e PA compound base point
In normal contexts, Lojban assumes that all numbers are expressed in the decimal
(base 10) system. However, other bases are possible, and may be appropriate in
particular circumstances.
To specify a number in a particular base, the VUhU operator ``ju'u'' is suitable
:
10.1) li pa no pa no ju'u re du li pa no
the-number 1010 base 2 equals the-number 10
Here, the final ``pa no'' is assumed to be base 10, as usual; so is the base spe
cification. (The base may also be changed permanently by a metalinguistic specif
ication; no standard way of doing so has as yet been worked out.)
Lojban has digits for representing bases up to 16, because 16 is a base often us
ed in computer applications. In English, it is customary to use the letters A-F
as the base 16 digits equivalent to the numbers ten through fifteen. In Lojban,
this ambiguity is avoided:
10.2) li daufeigai ju'u paxa du li rezevobi
the-number ABC base 16 equals the-number 2748
10.3) li jaureivai ju'u paxa du li cimuxaze
the-number DEF base 16 equals the-number 3567
Note the pattern in the cmavo: the diphthongs ``au'', ``ei'', ``ai'' are used tw
ice in the same order. The digits for A to D use consonants different from those
used in the decimal digit cmavo; E and F unfortunately overlap 2 and 4 --- ther
e was simply not enough available cmavo space to make a full differentiation pos
sible. The cmavo are also in alphabetical order.
The base point ``pi'' is used in non-decimal bases just as in base 10:
10.4) li vai pi bi ju'u paxa du li pamu pi mu
the-number F.8 base 16 equals the-number 15.5
Since ``ju'u'' is an operator of selma'o VUhU, it is grammatical to use any oper
and as the left argument. Semantically, however, it is undefined to use anything
but a numeral string on the left. The reason to make ``ju'u'' an operator is to
allow reference to a base which is not a constant.
There are some numerical values that require a ``base'' that varies from digit t
o digit. For example, times represented in hours, minutes, and seconds have, in
effect, three ``digits'': the first is base 24, the second and third are base 60
. To express such numbers, the compound base separator ``pi'e'' is used:
10.5) ci pi'e rere pi'e vono
3:22:40
Each digit sequence separated by instances of ``pi'e'' is expressed in decimal n
otation, but the number as a whole is not decimal and can only be added and subt
racted by special rules:
10.6) li ci pi'e rere pi'e vono su'i pi'e ci pi'e cici
du li ci pi'e rexa pi'e paci
the-number 3:22:40 plus :3:33 equals the-number 3:26:13
3:22:40 + 0:3:33 = 3:26:13
Of course, only context tells you that the first part of the numbers in Example
10.5 and Example 10.6 is hours, the second minutes, and the third seconds.
The same mechanism using ``pi'e'' can be used to express numbers which have a ba
se larger than 16. For example, base-20 Mayan mathematics might use digits from
``no'' to ``paso'', each separated by ``pi'e'':
10.7) li pa pi'e re pi'e ci ju'u reno du li vovoci
the-number 1;2;3 base 20 equals the-number 443
Carefully note the difference between:
10.8) pano ju'u reno
the-digit-10 base 20
which is equal to ten, and:
10.9) pa pi'e no ju'u reno
1;0 base 20
which is equal to twenty.
Both ``pi'' and ``pi'e'' can be used to express large-base decimal fractions:
10.10) li pa pi'e vo pi ze ju'u reno du li re vo pi ci mu
the-number 1;4.7 base 20 equals the-number 24.35
``pi'e'' is also used where the base of each digit is vague, as in the numbering
of the examples in this chapter:
10.11) dei jufra panopi'epapamoi
This-utterance is-a-sentence-type-of 10;11th-thing.
This is Sentence 10.11.
11. Special mekso selbri
The following cmavo are discussed in this section:
mei MOI cardinal selbri
moi MOI ordinal selbri
si'e MOI portion selbri
cu'o MOI probability selbri
va'e MOI scale selbri
me ME make sumti into selbri
me'u MEhU terminator for ME
Lojban possesses a special category of selbri which are based on mekso. The simp
lest kind of such selbri are made by suffixing a member of selma'o MOI to a numb
er. There are four members of MOI, each of which serves to create number-based s
elbri with specific place structures.
``-mei'' creates cardinal selbri. The basic place structure is:
x1 is a mass formed from the set x2 of n members, one or more of which is/ar
e x3
A cardinal selbri interrelates a set with a given number of members, the mass fo
rmed from that set, and the individuals which make the set up. The mass argument
is placed first as a matter of convenience, not logical necessity.
Some examples:
11.1) lei mi ratcu cu cimei
those-I-describe-as-the-mass-of my rats are-a-threesome.
My rats are three.
I have three rats.
Here, the mass of my rats is said to have three components; that is, I have thre
e rats.
Another example, with one element this time:
11.2) mi poi pamei cu cusku dei
I who am-an-individual express this-sentence.
In Example 11.2, ``mi'' refers to a mass, ``the mass consisting of me''. Persona
l pronouns are vague between masses, sets, and individuals.
However, when the number expressed before ``-mei'' is an objective indefinite nu
mber of the kind explained in Section 8, a slightly different place structure is
required:
x1 is a mass formed from a set x2 of n members, one or more of which is/are
x3, measured relative to the set x4.
An example:
11.3) lei ratcu poi zvati le panka cu so'umei
fo lo'i ratcu
the-mass-of rats which are-in the park are a-fewsome
with-respect-to the-set-of rats.
The rats in the park are a small number
of all the rats there are.
In Example 11.3, the x2 and x3 places are vacant, and the x4 place is filled by
``lo'i ratcu'', which (because no quantifiers are explicitly given) means ``the
whole of the set of all those things which are rats'', or simply ``the set of al
l rats.''
11.4) le'i ratcu poi zvati le panka cu se so'imei
The-set-of rats which-are in the park is-a manysome.
There are many rats in the park.
In Example 11.4, the conversion cmavo ``se'' swaps the x1 and the x2 places, so
that the new x1 is the set. The x4 set is unspecified, so the implication is tha
t the rats are ``many'' with respect to some unspecified comparison set.
More explanations about the interrelationship of sets, masses, and individuals c
an be found in Chapter 6.
``-moi'' creates ordinal selbri. The place structure is:
x1 is the (n)th member of set x2 when ordered by rule x3
Some examples:
11.5) ti pamoi le'i mi ratcu
This-one is the first-of the rats associated-with me.
This is my first rat.
11.6) ta romoi le'i mi ratcu
That is-the-allth-of the rats associated-with me.
That is my last rat.
11.7) mi raumoi le velskina porsi
I am-enough-th-in the movie-audience sequence
I am enough-th in the movie line.
Example 11.7 means, in the appropriate context, that my position in line is suff
iciently far to the front that I will get a seat for the movie.
``-si'e'' creates portion selbri. The place structure is:
x1 is an (n)th portion of mass x2
Some examples:
11.8) levi sanmi cu fi'ucisi'e lei mi djedi cidja
This-here meal is-a-slash-three-portion-of my day-food.
This meal is one-third of my daily food.
``-cu'o'' creates probability selbri. The place structure is:
event x1 has probability (n) of occurring under conditions x2
The number must be between 0 and 1 inclusive. For example:
11.9) le nu lo sicni cu sedja'o cu pimucu'o
The event of a coin being a head-displayer
has probability .5.
``-va'e'' creates a scale selbri. The place structure is:
x1 is at scale position (n) on the scale x2
If the scale is granular rather than continuous, a form like ``cifi'uxa'' (3/6)
may be used; in this case, 3/6 is not the same as 1/2. Here is an example:
11.10) le vi rozgu cu sofi'upanova'e xunre
This rose is 8/10-scale red
This rose is 8 out of 10 on the scale of redness.
This rose is very red.
When the quantifier preceding any MOI cmavo includes the subjective numbers ``ra
u'', ``du'e'', or ``mo'a'' (enough, too many, too few) then an additional place
is added for ``by standard''. For example:
11.11) le ratcu poi zvati le panka cu du'emei fo mi
The rats which-are in the park are too-many
by-standard me.
There are too many rats in the park for me.
The extra place (which for ``-mei'' is the x4 place labeled by ``fo'') is provid
ed rather than using a BAI tag such as ``ma'i'' because a specification of the s
tandard for judgment is essential to the meaning of subjective words like ``enou
gh''.
This place is not normally explicit when using one of the subjective numbers dir
ectly as a number. Therefore, ``du'e ratcu'' means ``too many rats'' without spe
cifying any standard.
It is also grammatical to substitute a lerfu string for a number:
11.12) ta ny.moi le'i mi ratcu
that is-nth-of the-set-of my rats
That is my nth rat.
More complex mekso cannot be placed directly in front of MOI, due to the resulti
ng grammatical ambiguities. Instead, a somewhat artificial form of expression is
required.
The cmavo ``me'' (of selma'o ME) has the function of making a sumti into a selbr
i. It is often used to make a name into a predicate:
11.13) ti me la nu,IORK. [me'u]
this-here pertains-to what-I-call ``New York''.
This is New York (or is New York-related).
A whole ``me'' construction can have a member of MOI added to the end to create
a complex mekso selbri:
11.14) ta me li ny. su'i pa me'u moi le'i mi ratcu
that is the-number n plus one-th-of the-set-of my rats.
That is my (n+1)-th rat.
Here the mekso ``ny. su'i pa'' is made into a sumti (with ``li'') and then chang
ed into a mekso selbri with ``me'' and ``me'u moi''. The elidable terminator ``m
e'u'' is required here in order to keep the ``pa'' and the ``moi'' separate; oth
erwise, the parser will combine them into the compound ``pamoi'' and reject the
sentence as ungrammatical.
It is perfectly possible to use non-numerical sumti after ``me'' and before a me
mber of MOI, producing strange results indeed:
11.15) le nu mi nolraitru
cu me le'e snime bolci
be vi la xel. cu'o
The event-of me being-a-nobly-superlative-ruler
has-the-stereotypical snow type-of-ball
at Hell probability.
I have a snowball's chance in Hell of being king.
Note: the elidable terminator ``boi'' is not used between a number and a member
of MOI. As a result, the ``me'u'' in Example 11.14 could also be replaced by a `
`boi'', which would serve the same function of preventing the ``pa'' and ``moi''
from joining into a compound.
12. Number questions
The following cmavo is discussed in this section:
xo PA number question
The cmavo ``xo'', a member of selma'o PA, is used to ask questions whose answers
are numbers. Like most Lojban question words, it fills the blank where the answ
er should go. (See Chapter 19 for more on Lojban questions.)
12.1) li re su'i re du li xo
the-number 2 plus 2 equals the-number what?
What is 2 + 2?
12.2) le xomoi prenu cu darxi do
the what-number-th person hit you?
Which person [as in a police lineup] hit you?
``xo'' can also be combined with other digits to ask questions whose answers are
already partly specified. This ability could be very useful in writing tests of
elementary arithmetical knowledge:
12.3) li remu pi'i xa du li paxono
the-number 25 times 6 equals the-number 1?0
to which the correct reply would be ``mu'', or 5. The ability to utter bare numb
ers as grammatical Lojban sentences is primarily intended for giving answers to
``xo'' questions. (Another use, obviously, is for counting off physical objects
one by one.)
13. Subscripts
The following cmavo is discussed in this section:
xi XI subscript
Subscripting is a general Lojban feature, not used only in mekso; there are many
things that can logically be subscripted, and grammatically a subscript is a fr
ee modifier, usable almost anywhere. In particular, of course, mekso variables (
lerfu strings) can be subscripted:
13.1) li xy.boixici du li xy.boixipa su'i xy.boixire
the-number x-sub-3 equals the-number x-sub-1 plus x-sub-2
x3 = x1 + x2
Subscripts always begin with the flag ``xi'' (of selma'o XI). ``xi'' may be foll
owed by a number, a lerfu string, or a general mekso expression in parentheses:
13.2) xy.boixino
x0
13.3) xy.boixiny.
xn
13.4) xy.boixi vei ny. su'i pa [ve'o]
xn+1}
Note that subscripts attached directly to lerfu words (variables) generally need
a ``boi'' terminating the variable. Free modifiers, of which subscripts are one
variety, generally require the explicit presence of an otherwise elidable termi
nator.
There is no standard way of handling superscripts (other than those used as expo
nents) or for subscripts or superscripts that come before the main expression. I
f necessary, further cmavo could be assigned to selma'o XI for these purposes.
The elidable terminator for a subscript is that for a general number or lerfu st
ring, namely ``boi''. By convention, a subscript following another subscript is
taken to be a sub-subscript:
13.5) xy.boi xi by.boi xi vo
xb4
See Example 17.10 for the standard method of specifying multiple subscripts on a
single object.
More information on the uses of subscripts may be found in Chapter 19.
14. Infix operators revisited
The following cmavo are discussed in this section:
tu'o PA null operand
ge'a VUhU null operator
gei VUhU exponential notation
The infix operators presented so far have always had exactly two operands, and f
or more or fewer operands forethought notation has been required. However, it is
possible to use an operator in infix style even though it has more or fewer tha
n two operands, through the use of a pair of tricks: the null operand ``tu'o'' a
nd the null operator ``ge'a''. The first is suitable when there are too few oper
ands, the second when there are too many. For example, suppose we wanted to expr
ess the numerical negation operator ``va'a'' in infix form. We would use:
14.1) li tu'o va'a ny. du
li no vu'u ny.
the-number (null) additive-inverse n equals
the-number zero minus n
-n = 0 - n
The ``tu'o'' fulfills the grammatical requirement for a left operand for the inf
ix use of ``va'a'', even though semantically none is needed or wanted.
Finding a suitable example of ``ge'a'' requires exhibiting a ternary operator, a
nd ternary operators are not common. The operator ``gei'', however, has both a b
inary and a ternary use. As a binary operator, it provides a terse representatio
n of scientific (also called ``exponential'') notation. The first operand of ``g
ei'' is the exponent, and the second operand is the mantissa or fraction:
14.2) li cinonoki'oki'o
du li bi gei ci
the-number three-zero-zero-comma-comma
equals the-number eight scientific three.
300,000,000 = 3 Ã 108
Why are the arguments to ``gei'' in reverse order from the conventional symbolic
notation? So that ``gei'' can be used in forethought to allow easy specificatio
n of a large (or small) imprecise number:
14.3) gei reno
(scientific) two-zero
1020
Note, however, that although 10 is far and away the most common exponent base, i
t is not the only possible one. The third operand of ``gei'', therefore, is the
base, with 10 as the default value. Most computers internally store so-called ``
floating-point'' numbers using 2 as the exponent base. (This has nothing to do w
ith the fact that computers also represent all integers in base 2; the IBM 360 s
eries used an exponent base of 16 for floating point, although each component of
the number was expressed in base 2.) Here is a computer floating-point number w
ith a value of 40:
14.4) papano bi'eju'u re
gei pipanopano bi'eju'u re ge'a re
(one-one-zero base 2)
scientific (point-one-zero-one-zero base 2) with-base 2
.10102 Ã 21102
15. Vectors and matrices
The following cmavo are discussed in this section:
jo'i JOhI start vector
te'u TEhU end vector
pi'a VUhU matrix row combiner
sa'i VUhU matrix column combiner
A mathematical vector is a list of numbers, and a mathematical matrix is a table
of numbers. Lojban considers matrices to be built up out of vectors, which are
in turn built up out of operands.
``jo'i'', the only cmavo of selma'o JOhI, is the vector indicator: it has a synt
ax reminiscent of a forethought operator, but has very high precedence. The comp
onents must be simple operands rather than full expressions (unless parenthesize
d). A vector can have any number of components; ``te'u'' is the elidable termina
tor. An example:
15.1) li jo'i paboi reboi te'u su'i jo'i ciboi voboi
du li jo'i voboi xaboi
the-number array( one, two ) plus array( three, four)
equals the-number array( four, six)
(1,2) + (3,4) = (4,6)
Vectors can be combined into matrices using either ``pi'a'', the matrix row oper
ator, or ``sa'i'', the matrix column operator. The first combines vectors repres
enting rows of the matrix, and the second combines vectors representing columns
of the matrix. Both of them allow any number of arguments: additional arguments
are tacked on with the null operator ``ge'a''.
Therefore, the ``magic square'' matrix
8 1 6 3 5 7 4 9 2
can be represented either as:
15.2) jo'i biboi paboi xa pi'a jo'i ciboi muboi ze
ge'a jo'i voboi soboi re
the-vector (8 1 6) matrix-row the-vector (3 5 7)
, the-vector (4 9 2)
or as
15.3) jo'i biboi ciboi vo sa'i jo'i paboi muboi so
ge'a jo'i xaboi zeboi re
the-vector (8 3 4) matrix-column the-vector (1 5 9)
, the-vector (6 7 2)
The regular mekso operators can be applied to vectors and to matrices, since gra
mmatically both of these are expressions. It is usually necessary to parenthesiz
e matrices when used with operators in order to avoid incorrect groupings. There
are no VUhU operators for the matrix operators of inner or outer products, but
appropriate operators can be created using a suitable symbolic lerfu word or str
ing prefixed by ``ma'o''.
Matrices of more than two dimensions can be built up using either ``pi'a'' or ``
sa'i'' with an appropriate subscript numbering the dimension. When subscripted,
there is no difference between ``pi'a'' and ``sa'i''.
16. Reverse Polish notation
The following cmavo is discussed in this section:
fu'a FUhA reverse Polish flag
So far, the Lojban notational conventions have mapped fairly familiar kinds of m
athematical discourse. The use of forethought operators may have seemed odd when
applied to ``+'', but when applied to ``f('' they appear as the usual functiona
l notation. Now comes a sharp break. Reverse Polish (RP) notation represents som
ething completely different; even mathematicians don't use it much. (The only co
mmon uses of RP, in fact, are in some kinds of calculators and in the implementa
tion of some programming languages.)
In RP notation, the operator follows the operands. (Polish notation, where the o
perator precedes its operands, is another name for forethought mekso of the kind
explained in Section 6.) The number of operands per operator is always fixed. N
o parentheses are required or permitted. In Lojban, RP notation is always explic
itly marked by a ``fu'a'' at the beginning of the expression; there is no termin
ator. Here is a simple example:
16.1) li fu'a reboi ci su'i
du li mu
the-number (RP!) two, three, plus
equals the-number five.
The operands are ``re'' and ``ci''; the operator is ``su'i''.
Here is a more complex example:
16.2) li fu'a reboi ci pi'i
voboi mu pi'i su'i
du li rexa
the-number (RP!) (two, three, times),
(four, five, times), plus
equals the-number two-six
Here the operands of the first ``pi'i'' are ``re'' and ``ci''; the operands of t
he second ``pi'i'' are ``vo'' and ``mu'' (with ``boi'' inserted where needed), a
nd the operands of the ``su'i'' are ``reboi ci pi'i'', or 6, and ``voboi mu pi'i
'', or 20. As you can see, it is easy to get lost in the world of reverse Polish
notation; on the other hand, it is especially easy for a mechanical listener (w
ho has a deep mental stack and doesn't get lost) to comprehend.
The operands of an RP operator can be any legal mekso operand, including parenth
esized mekso that can contain any valid syntax, whether more RP or something mor
e conventional.
In Lojban, RP operators are always parsed with exactly two operands. What about
operators which require only one operand, or more than two operands? The null op
erand ``tu'o'' and the null operator ``ge'a'' provide a simple solution. A one-o
perand operator like ``va'a'' always appears in a reverse Polish context as ``tu
'o va'a''. The ``tu'o'' provides the second operand, which is semantically ignor
ed but grammatically necessary. Likewise, the three-operand version of ``gei'' a
ppears in reverse Polish as ``ge'a gei'', where the ``ge'a'' effectively merges
the 2nd and 3rd operands into a single operand. Here are some examples:
16.3) li fu'a ciboi muboi vu'u
du li fu'a reboi tu'o va'a
The-number (RP!) (three, five, minus)
equals the-number (RP!) two, null, negative-of.
3 - 5 = -2
16.4) li cinoki'oki'o du
li fu'a biboi
ciboi panoboi ge'a gei
the-number 30-comma-comma equals
the-number (RP!) 8,
(3, 10, null-op), exponential-notation
30,000,000 = 3 Ã 108
17. Logical and non-logical connectives within mekso
The following cmavo are discussed in this section:
.abu BY letter ``a''
by BY letter ``b''
cy BY letter ``c''
fe'a VUhU nth root of (default square root)
lo'o LOhO terminator for LI
As befits a logical language, Lojban has extensive provision for logical connect
ives within both operators and operands. Full details on logical and non-logical
connectives are provided in Chapter 14. Operands are connected in afterthought
with selma'o A and in forethought with selma'o GA, just like sumti. Operators ar
e connected in afterthought with selma'o JA and in forethought with selma'o GUhA
, just like tanru components. This parallelism is no accident.
In addition, A+BO and A+KE constructs are allowed for grouping logically connect
ed operands, and ``ke ... ke'e'' is allowed for grouping logically connected ope
rators, although there are no analogues of tanru among the operators.
Despite the large number of rules required to support this feature, it is of rel
atively minor importance in the mekso scheme of things. Example 17.1 exhibits af
terthought logical connection between operands:
17.1) vei ci .a vo ve'o prenu cu klama le zarci
( three or four ) people go-to the market.
Example 17.2 is equivalent in meaning, but uses forethought connection:
17.2) vei ga ci gi vo ve'o prenu cu klama le zarci
( either 3 or 4 ) people go-to the market.
Note that the mekso here are being used as quantifiers. Lojban requires that any
mekso other than a simple number be enclosed in parentheses when used as a quan
tifier. This rule prevents ambiguities that do not exist when using ``li''.
By the way, ``li'' has an elidable terminator, ``lo'o'', which is needed when a
``li'' sumti is followed by a logical connective that could seem to be within th
e mekso. For example:
17.3) li re su'i re du
li vo lo'o .onai lo nalseldjuno namcu
the-number two plus two equals
the-number four or else a non-known number.
Omitting the ``lo'o'' would cause the parser to assume that another operand foll
owed the ``.onai'' and reject ``lo'' as an invalid operand.
Simple examples of logical connection between operators are hard to come by. A c
ontrived example is:
17.4) li re su'i je pi'i re du li vo
the-number two plus and times two equals the-number four.
2 + 2 = 4 and 2 Ã 2 = 4.
The forethought-connection form of Example 17.4 is:
17.5) li re ge su'i gi pi'i re
du li vo
the-number two both plus and times two
equals the-number four.
Both 2 + 2 = 4 and 2 Ã 2 = 4.
Here is a classic example of operand logical connection:
17.6) go li .abu bi'epi'i vei xy. te'a re ve'o
su'i by. bi'epi'i xy. su'i cy.
du li no
gi li xy. du li vei va'a by. ku'e
su'i ja vu'u fe'a
vei by. bi'ete'a re vu'u
vo bi'epi'i .abu bi'epi'i cy. ve'o [ku'e] ve'o
fe'i re bi'epi'i .abu
if-and-only-if the-number ``a''-times-( ``x'' power two )
plus ``b''-times-``x'' plus ``c''
equals the-number zero
then the-number x equals the-number [ the-negation-of( b )
plus or minus the-root-of
( ``b''-power-2 minus
four-times-``a''-times-``c'' ) ]
divided-by two-times-``a''.
Iff ax2 + bx + c = 0,
then x =-b ± Ã
(b2 - 4ac)
-----------------------
2a
Note the mixture of styles in Example 17.6: the negation of b and the square roo
t are represented by forethought and most of the operator precedence by prefixed
``bi'e'', but explicit parentheses had to be added to group the numerator prope
rly. In addition, the square root parentheses cannot be removed here in favor of
simple ``fe'a'' and ``ku'e'' bracketing, because infix operators are present in
the operand. Getting Example 17.6 to parse perfectly using the current parser t
ook several tries: a more relaxed style would dispense with most of the ``bi'e''
cmavo and just let the standard precedence rules be understood.
Non-logical connection with JOI and BIhI is also permitted between operands and
between operators. One use for this construct is to connect operands with ``bi'o
'' to create intervals:
17.7) li no ga'o bi'o ke'i pa
the-number zero (inclusive) from-to (exclusive) one
[0,1)
the numbers from zero to one, including zero
but not including one
Intervals defined by a midpoint and range rather than beginning and end points c
an be expressed by ``mi'i'':
17.8) li pimu ga'o mi'i ke'i pimu
the-number 0.5 plus-or-minus 0.5
which expresses the same interval as Example 17.7. Note that the ``ga'o'' and ``
ke'i'' still refer to the endpoints, although these are now implied rather than
expressed. Another way of expressing the same thing:
17.9) li pimu su'i ni'upimu bi'o ma'upimu
the-number 0.5 plus [-0.5 from-to +0.5]
Here we have the sum of a number and an interval, which produces another interva
l centered on the number. As Example 17.9 shows, non-logical (or logical) connec
tion of operands has higher precedence than any mekso operator.
You can also combine two operands with ``ce'o'', the sequence connective of selm
a'o JOI, to make a compound subscript:
17.10) xy. xi vei by. ce'o dy. [ve'o]
``x'' sub ( ``b'' sequence ``d'' )
xb,d}
18. Using Lojban resources within mekso
The following cmavo are discussed in this section:
na'u NAhU selbri to operator
ni'e NIhE selbri to operand
mo'e MOhE sumti to operand
te'u TEhU terminator for all three
One of the mekso design goals requires the ability to make use of Lojban's vocab
ulary resources within mekso to extend the built-in cmavo for operands and opera
tors. There are three relevant constructs: all three share the elidable terminat
or ``te'u'' (which is also used to terminate vectors marked with ``jo'i'').
The cmavo ``na'u'' makes a selbri into an operator. In general, the first place
of the selbri specifies the result of the operator, and the other unfilled place
s specify the operands:
18.1) li na'u tanjo te'u
vei pai fe'i re [ve'o] du li ci'i
the-number the-operator tangent
( pi / 2 ) = the-number infinity
tan(pi/2) = ¥
``tanjo'' is the gismu for ``x1 is the tangent of x2'', and the ``na'u'' here ma
kes it into an operator which is then used in forethought.
The cmavo ``ni'e'' makes a selbri into an operand. The x1 place of the selbri ge
nerally represents a number, and therefore is often a ``ni'' abstraction, since
``ni'' abstractions represent numbers. The ``ni'e'' makes that number available
as a mekso operand. A common application is to make equations relating pure dime
nsions:
18.2) li ni'e ni clani [te'u] pi'i
ni'e ni ganra [te'u] pi'i
ni'e ni condi te'u
du li ni'e ni canlu
the-number quantity-of length times
quantity-of width times
quantity-of depth
equals the-number quantity-of volume.
Length à Width à Depth = Volume
The cmavo ``mo'e'' operates similarly to ``ni'e'', but makes a sumti (rather tha
n a selbri) into an operand. This construction is useful in stating equations in
volving dimensioned numbers:
18.3) li mo'e re ratcu su'i mo'e re ractu
du li mo'e vo danlu
the-number two rats plus two rabbits
equals the-number four animals
2 rats + 2 rabbits = 4 animals.
Another use is in constructing Lojbanic versions of so-called ``folk quantifiers
'', such as ``a pride of lions'':
18.4) mi viska vei mo'e lo'e lanzu ve'o cinfo
I see ( the-typical family )-number-of lions.
I see a pride of lions.
19. Other uses of mekso
The following cmavo are discussed in this section:
me'o LI the mekso
nu'a NUhA operator to selbri
mai MAI utterance ordinal
mo'o MAI higher order utterance ordinal
roi ROI quantified tense
So far we have seen mekso used as sumti (with ``li''), as quantifiers (often par
enthesized), and in MOI and ME-MOI selbri. There are a few other minor uses of m
ekso within Lojban.
The cmavo ``me'o'' has the same grammatical use as ``li'' but slightly different
semantics. ``li'' means ``the number which is the value of the mekso ...'', whe
reas ``me'o'' just means ``the mekso ...'' So it is true that:
19.1) li re su'i re du li vo
the-number two plus two equals the-number four
2 + 2 = 4
but false that:
19.2) me'o re su'i re du me'o vo
the-mekso two plus two equals the-mekso four
``2 + 2'' = ``4''
since the expressions ``2 + 2'' and ``4'' are not the same. The relationship bet
ween ``li'' and ``me'o'' is related to that between ``la djan.'', the person nam
ed John, and ``zo .djan.'', the name ``John''.
The cmavo ``nu'a'' is the inverse of ``na'u'', and allows a mekso operator to be
used as a normal selbri, with the place structure:
x1 is the result of applying (operator) to x2, x3, ...
f or as many places as may be required. For example:
19.3) li ni'umu cu nu'a va'a li ma'umu
the-number -5 is-the-negation-of the-number +5
uses ``nu'a'' to make the operator ``va'a'' into a two-place bridi.
Used together, ``nu'a'' and ``na'u'' make it possible to ask questions about mek
so operators, even though there is no specific cmavo for an operator question, n
or is it grammatical to utter an operator in isolation. Consider Example 19.4, t
o which Example 19.5 is one correct answer:
19.4) li re na'u mo re du li vo
the-number two what-operator? two equals the-number four
2 ? 2 = 4
19.5) nu'a su'i
plus
In Example 19.4, ``na'u mo'' is an operator question, because ``mo'' is the selb
ri question cmavo and ``na'u'' makes the selbri into an operator. Example 19.5 m
akes the true answer ``su'i'' into a selbri (which is a legal utterance) with th
e inverse cmavo ``nu'a''. Mechanically speaking, inserting Example 19.5 into Exa
mple 19.4 produces:
19.6) li re na'u nu'a su'i re du li vo
the-number two (the-operator the-selbri plus)
two equals the-number four
where the ``na'u nu'a'' cancels out, leaving a truthful bridi.
Numerical free modifiers, corresponding to English ``firstly'', ``secondly'', an
d so on, can be created by suffixing a member of selma'o MAI to a digit string o
r a lerfu string. (Digit strings are compound cmavo beginning with a cmavo of se
lma'o PA, and containing only cmavo of PA or BY; lerfu strings begin with a cmav
o of selma'o BY, and likewise contain only PA or BY cmavo.) Here are some exampl
es:
19.7) pamai
firstly
19.8) remai
secondly
19.9) romai
all-ly
lastly
19.10) ny.mai
nth-ly
19.11) pasomo'o
nineteenthly (higher order)
Section 19
The difference between ``mai'' and ``mo'o'' is that ``mo'o'' enumerates larger s
ubdivisions of a text. Each ``mo'o'' subdivision can then be divided into pieces
and internally numbered with ``mai''. If this chapter were translated into Lojb
an, it would number each section with ``mo'o''.
A numerical tense can be created by suffixing a digit string with ``roi''. This
usage generates tenses corresponding to English ``once'', ``twice'', and so on.
This topic belongs to a detailed discussion of Lojban tenses, and is explained f
urther in Chapter 10.
Note: the elidable terminator ``boi'' is not used between a number and a member
of MAI or ROI.
20. Explicit operator precedence
As mentioned earlier, Lojban does provide a way for the precedences of operators
to be explicitly declared, although current parsers do not understand these dec
larations.
The declaration is made in the form of a metalinguistic comment using ``ti'o'',
a member of selma'o SEI. ``sei'', the other member of SEI, is used to insert met
alinguistic comments on a bridi which give information about the discourse which
the bridi comprises. The format of a ``ti'o'' declaration has not been formally
established, but presumably would take the form of mentioning a mekso operator
and then giving it either an absolute numerical precedence on some pre-establish
ed scale, or else specifying relative precedences between new operators and exis
ting operators.
In future, we hope to create an improved machine parser that can understand decl
arations of the precedences of simple operators belonging to selma'o VUhU. Origi
nally, all operators would have the same precedence. Declarations would have the
effect of raising the specified cmavo of VUhU to higher precedence levels. Comp
lex operators formed with ``na'u'', ``ni'e'', or ``ma'o'' would remain at the st
andard low precedence; declarations with respect to them are for future implemen
tation efforts. It is probable that such a parser would have a set of ``commonly
assumed precedences'' built into it (selectable by a special ``ti'o'' declarati
on) that would match mathematical intuition: times higher than plus, and so on.
21. Miscellany
A few other points:
``se'' can be used to convert an operator as if it were a selbri, so that its ar
guments are exchanged. For example:
21.1) li ci se vu'u vo
du li pa
the-number three (inverse) minus four
equals the-number one.
3 subtracted from 4 equals 1.
The other converters of selma'o SE can also be used on operators with more than
two operands, and they can be compounded to create (probably unintelligible) ope
rators as needed.
Members of selma'o NAhE are also legal on an operator to produce a scalar negati
on of it. The implication is that some other operator would apply to make the br
idi true:
21.2) li ci na'e su'i vo du li pare
the-number 3 non-plus 4 equals the-number 12
21.3) li ci to'e vu'u re du li mu
the-number 3 opposite-of-minus 2 equals the-number 5
The sense in which ``plus'' is the opposite of ``minus'' is not a mathematical b
ut rather a linguistic one; negated operators are defined only loosely.
``la'e'' and ``lu'e'' can be used on operands with the usual semantics to get th
e referent of or a symbol for an operand. Likewise, a member of selma'o NAhE fol
lowed by ``bo'' serves to scalar-negate an operand, implying that some other ope
rand would make the bridi true:
21.4) li re su'i re du li na'ebo mu
the-number 2 plus 2 equals the-number non-5.
2 + 2 = something other than 5.
The digits 0-9 have rafsi, and therefore can be used in making lujvo. Additional
ly, all the rafsi have CVC form and can stand alone or together as names:
21.5) la zel. poi gunta
la tebes. pu nanmu
those-named ``Seven'' who attack
that-named ``Thebes'' [past] are-men
The Seven Against Thebes were men.
Of course, there is no guarantee that the name ``zel.'' is connected with the nu
mber rafsi: an alternative which cannot be misconstrued is:
21.6) la zemei poi gunta
la tebes. pu nanmu
those-named-the Sevensome who attack
Thebes [past] are-men.
Certain other members of PA also have assigned rafsi: ``so'a'', ``so'e'', ``so'i
'', ``so'o'', ``so'u'', ``da'a'', ``ro'', ``su'e'', ``su'o'', ``pi'', and ``ce'i
''. Furthermore, although the cmavo ``fi'u'' does not have a rafsi as such, it i
s closely related to the gismu ``frinu'', meaning ``fraction''; therefore, in a
context of numeric rafsi, you can use any of the rafsi for ``frinu'' to indicate
a fraction slash.
A similar convention is used for the cmavo ``cu'o'' of selma'o MOI, which is clo
sely related to ``cunso'' (probability); use a rafsi for ``cunso'' in order to c
reate lujvo based on ``cu'o''. The cmavo ``mei'' and ``moi'' of MOI have their o
wn rafsi, two each in fact: ``mem''/``mei'' and ``mom''/``moi'' respectively.
The grammar of mekso as described so far imposes a rigid distinction between ope
rators and operands. Some flavors of mathematics (lambda calculus, algebra of fu
nctions) blur this distinction, and Lojban must have a method of doing the same.
An operator can be changed into an operand with ``ni'enu'a'', which transforms
the operator into a matching selbri and then the selbri into an operand.
To change an operand into an operator, we use the cmavo ``ma'o'', already introd
uced as a means of changing a lerfu string such as ``fy.'' into an operator. In
fact, ``ma'o'' can be followed by any mekso operand, using the elidable terminat
or ``te'u'' if necessary.
There is a potential semantic ambiguity in ``ma'o fy. [te'u]'' if ``fy.'' is alr
eady in use as a variable: it comes to mean ``the function whose value is always
'f'''. However, mathematicians do not normally use the same lerfu words or stri
ngs as both functions and variables, so this case should not arise in practice.
22. Four score and seven: a mekso problem
Abraham Lincoln's Gettysburg Address begins with the words ``Four score and seve
n years ago''. This section exhibits several different ways of saying the number
``four score and seven''. (A ``score'', for those not familiar with the term, i
s 20; it is analogous to a ``dozen'' for 12.) The trivial way:
22.1) bize
eight seven
87
Example 22.1 is mathematically correct, but sacrifices the spirit of the English
words, which are intended to be complex and formal.
22.2) vo pi'i reno su'i ze
four times twenty plus seven
4 Ã 20 + 7
Example 22.2 is also mathematically correct, but still misses something. ``Score
'' is not a word for 20 in the same way that ``ten'' is a word for 10: it contai
ns the implication of 20 objects. The original may be taken as short for ``Four
score years and seven years ago''. Thinking of a score as a twentysome rather th
an as 20 leads to:
22.3) mo'e voboi renomei su'i ze
the-number-of four twentysomes plus seven
In Example 22.3, ``voboi renomei'' is a sumti signifying four things each of whi
ch are groups of twenty; the ``mo'e'' and ``te'u'' then make this sumti into a n
umber in order to allow it to be the operand of ``su'i''.
Another approach is to think of ``score'' as setting a representation base. Ther
e are remnants of base-20 arithmetic in some languages, notably French, in which
87 is ``quatre-vingt-sept'', literally ``four-twenties-seven''. (This fact make
s the Gettysburg Address hard to translate into French!) If ``score'' is the rep
resentation base, then we have:
22.4) vo pi'e ze ju'u reno
four ; seven base 20
4720
Overall, Example 22.3 probably captures the flavor of the English best. Example
22.1 and Example 22.2 are too simple, and Example 22.4 is too tricky. Neverthele
ss, all four examples are good Lojban. Pedagogically, these examples illustrate
the richness of lojbau mekso: anything that can be said at all, can probably be
said in more than one way.
23. mekso selma'o summary
Except as noted, each selma'o has only one cmavo.
BOI elidable terminator for numerals
and lerfu strings
BY lerfu for variables and functions
(see Chapter 17)
FUhA reverse-Polish flag
GOhA includes ``du'' (mathematical equality)
and other non-mekso cmavo
JOhI array flag
KUhE elidable terminator for forethought mekso
LI mekso articles (li and me'o)
MAhO make operand into operator
MOI creates mekso selbri
(moi, mei, si'e, and cu'o, see Section 11)
MOhE make sumti into operand
NAhU make selbri into operator
NIhE make selbri into operand
NUhA make operator into selbri
PA numbers (see Section 25)
PEhO optional forethought mekso marker
TEhU elidable terminator for
NAhU, NIhE, MOhE, MAhO, and JOhI
VEI left parenthesis
VEhO right parenthesis
VUhU operators (see Section 24)
XI subscript flag
24. Complete table of VUhU cmavo, with operand structures
The operand structures specify what various operands (labeled a, b, c, ...) mean
. The implied context is forethought, since only forethought operators can have
a variable number of operands; however, the same rules apply to infix and RP use
s of VUhU.
su'i
plus (((a + b) + c) + ...) pi'i times (((a à b) à c) à ...) vu'u minus (((a - b) - c
- ...) fe'i divided by (((a / b) / c) / ...) ju'u number base numeral string ``
a'' interpreted in the base b pa'i ratio the ratio of a to b, a:b fa'i reciproca
l of/multiplicative inverse 1 / a gei scientific notation b à (c [default 10] to th
e a power) ge'a null operator (no operands) de'o logarithm log a to base b (defa
ult 10 or e as appropriate) te'a to the power/exponential a to the b power fe'a
nth root of/inverse power b'th root of a (default square root: b = 2) cu'a absol
ute value/norm | a | ne'o factorial a! pi'a matrix row vector combiner (all oper
ands are row vectors) sa'i matrix column vector combiner (all operands are colum
n vectors) ri'o integral integral of a with respect to b over range c sa'o deriv
ative derivative of a with respect to b of degree c (default 1) fu'u non-specifi
c operator (variable) si'i sigma summation summation of a using variable b over
range c va'a negation of/additive inverse - a re'a matrix transpose/dual a*
25. Complete table of PA cmavo: digits, punctuation, and other numbers.
Decimal digits:
no, pa, re, ci, vo, mu, xa, ze, bi, so 0, 1, 2, 3, 4, 5, 6, 7, 8, 9 rafsi: n
on, pav, rel, cib, von, mum, xav, zel, biv, soz
Hexadecimal digits:
dau, fei, gai, jau, rei, vai A/10, B/11, C/12, D/13, E/14, F/15
Special numbers:
pai, ka'o, te'o, ci'i pi, imaginary i, exponential e, infinity
Number punctuation:
pi, ce'i, fi'u decimal point, percentage, fraction (not division) rafsi: piz
, cez, fi'u (from frinu; see Section 20)
pi'e, ma'u, ni'u mixed-base point, plus sign (not addition), minus sign (not
subtraction)
ki'o, ra'e thousands comma, repeating-decimal indicator
ji'i, ka'o approximation sign, complex number separator
Indefinite numbers:
ro, so'a, so'e, so'i, so'o, so'u, da'a all, almost all, most, many, several,
few, all but rafsi: rol, soj, sor or so'i, sos, sot, daz
su'e, su'o at most, at least rafsi: su'e, su'o
me'i, za'u less than, more than
no'o the typical number
Subjective numbers:
rau, du'e, mo'a enough, too many, too few
Miscellaneous:
xo, tu'o number question, null operand
26. Table of MOI cmavo, with associated rafsi and place structures
mei x1 is a mass formed from a set x2
of n members, one or more of which is/are x3,
[measured relative to the set x4/by standard x4]
rafsi: mem, mei
moi x1 is the (n)th member of set x2
when ordered by rule x3 [by standard x4]
rafsi: mom, moi
si'e x1 is an (n)th portion of mass x2
[by standard x3]
rafsi: none
cu'o event x1 has probability (n) of occurring
under conditions x2 [by standard x3]
rafsi: cu'o (borrowed from cunso; see Section 20)
Chapter 19
Putting It All Together: Notes on the Structure of Lojban Texts
$Revision: 4.3 $
mkhtml: 1.1
1. Introductory
This chapter is incurably miscellaneous. It describes the cmavo that specify the
structure of Lojban texts, from the largest scale (paragraphs) to the smallest
(single words). There are fewer examples than are found in other chapters of thi
s book, since the linguistic mechanisms described are generally made use of in c
onversation or else in long documents.
This chapter is also not very self-contained. It makes passing reference to a gr
eat many concepts which are explained in full only in other chapters. The altern
ative would be a chapter on text structure which was as complex as all the other
chapters put together. Lojban is a unified language, and it is not possible to
understand any part of it (in full) before understanding every part of it (to so
me degree).
2. Sentences: I
The following cmavo are discussed in this section:
.i I sentence separator
Since Lojban is audio-visually isomorphic, there needs to be a spoken and writte
n way of signaling the end of a sentence and the start of the following one. In
written English, a period serves this purpose; in spoken English, a tone contour
(rising or falling) usually does the job, or sometimes a long pause. Lojban use
s a single separator: the cmavo ``.i'' (of selma'o I):
2.1) mi klama le zarci .i do cadzu le bisli
I go to-the store. You walk on-the ice.
The word ``separator'' should be noted. ``.i'' is not normally used after the la
st sentence nor before the first one, although both positions are technically gr
ammatical. ``.i'' signals a new sentence on the same topic, not necessarily by t
he same speaker. The relationship between the sentences is left vague, except in
stories, where the relationship usually is temporal, and the following sentence
states something that happened after the previous sentence.
Note that although the first letter of an English sentence is capitalized, the c
mavo ``.i'' is never capitalized. In writing, it is appropriate to place extra s
pace before ``.i'' to make it stand out better for the reader. In some styles of
Lojban writing that have been used so far, every ``.i'' is placed at the beginn
ing of a line, possibly leaving space at the end of the previous line.
An ``.i'' cmavo may or may not be used when the speaker of the following sentenc
e is different from the speaker of the preceding sentence, depending on whether
the sentences are felt to be connected or not.
An ``.i'' cmavo can be compounded with a logical or non-logical connective (a je
k or joik), a modal or tense connective, or both: these constructs are explained
in Chapter 9, Chapter 10, and Chapter 14. In all cases, the ``.i'' comes first
in the compound. Attitudinals can also be attached to an ``.i'' if they are mean
t to apply to the whole sentence: see Chapter 13.
There exist a pair of mechanisms for binding a sequence of sentences closely tog
ether. If the ``.i'' (with or without connectives) is followed by ``bo'' (of sel
ma'o BO), then the two sentences being separated are understood to be more close
ly grouped than sentences connected by ``.i'' alone.
Similarly, a group of sentences can be preceded by ``tu'e'' (of selma'o TUhE) an
d followed by ``tu'u'' (of selma'o TUhU) to fuse them into a single unit. A comm
on use of ``tu'e ... tu'u'' is to group the sentences which compose a poem: the
title sentence would precede the group, separated from it by ``.i''. Another use
might be a set of directions, where each numbered direction might be surrounded
by ``tu'e ... tu'u'' and contain one or more sentences separated by ``.i''. Gro
uping with ``tu'e'' and ``tu'u'' is analogous to grouping with ``ke'' and ``ke'e
'' to establish the scope of logical or non-logical connectives (see Chapter 14)
.
3. Paragraphs: NIhO
The following cmavo are discussed in this section:
ni'o NIhO new topic
no'i NIhO old topic
da'o DAhO cancel cmavo assignments
The paragraph is a concept used in writing systems for two purposes: to indicate
changes of topic, and to break up the hard-to-read appearance of large blocks o
f text on the page. The former function is represented in both spoken and writte
n Lojban by the cmavo ``ni'o'' and ``no'i'', both of selma'o NIhO. Of these two,
``ni'o'' is the more common. By convention, written Lojban is broken into parag
raphs just before any ``ni'o'' or ``no'i'', but a very long passage on a single
topic might be paragraphed before an ``.i''. On the other hand, it is convention
al in English to start a new paragraph in dialogue when a new speaker starts, bu
t this convention is not commonly observed in Lojban dialogues. Of course, none
of these conventions affect meaning in any way.
A ``ni'o'' can take the place of an ``.i'' as a sentence separator, and in addit
ion signals a new topic or paragraph. Grammatically, any number of ``ni'o'' cmav
o can appear consecutively and are equivalent to a single one; semantically, a g
reater number of ``ni'o'' cmavo indicates a larger-scale change of topic. This f
eature allows complexly structured text, with topics, subtopics, and sub-subtopi
cs, to be represented clearly and unambiguously in both spoken and written Lojba
n. However, some conventional differences do exist between ``ni'o'' in writing a
nd in conversation.
In written text, a single ``ni'o'' is a mere discursive indicator of a new subje
ct, whereas ``ni'oni'o'' marks a change in the context. In this situation, ``ni'
oni'o'' implicitly cancels the definitions of all pro-sumti of selma'o KOhA as w
ell as pro-bridi of selma'o GOhA. (Explicit cancelling is expressed by the cmavo
``da'o'' of selma'o DAhO, which has the free grammar of an indicator -- it can
appear almost anywhere.) The use of ``ni'oni'o'' does not affect indicators (of
selma'o UI) or tense references, but ``ni'oni'oni'o'', indicating a drastic chan
ge of topic, would serve to reset both indicators and tenses. (See Section 8 for
a discussion of indicator scope.)
In spoken text, which is inherently less structured, these levels are reduced by
one, with ``ni'o'' indicating a change in context sufficient to cancel pro-sumt
i and pro-bridi assignment. On the other hand, in a book, or in stories within s
tories such as ``The Arabian Nights'', further levels may be expressed by extend
ing the ``ni'o'' string as needed. Normally, a written text will begin with the
number of ``ni'o'' cmavo needed to signal the largest scale division which the t
ext contains. ``ni'o'' strings may be subscripted to label each context of disco
urse: see Section 6.
``no'i'' is similar in effect to ``ni'o'', but indicates the resumption of a pre
vious topic. In speech, it is analogous to (but much shorter than) such English
discursive phrases as ``But getting back to the point ... ''. By default, the to
pic resumed is that in effect before the last ``ni'o''. When subtopics are neste
d within topics, then ``no'i'' would resume the previous subtopic and ``no'ino'i
'' the previous topic. Note that ``no'i'' also resumes tense and pro-sumti assig
nments dropped at the previous ``ni'o''.
If a ``ni'o'' is subscripted, then a ``no'i'' with the same subscript is assumed
to be a continuation of it. A ``no'i'' may also have a negative subscript, whic
h would specify counting backwards a number of paragraphs and resuming the topic
found thereby.
4. Topic-comment sentences: ZOhU
The following cmavo is discussed in this section:
zo'u ZOhU topic/comment separator
The normal Lojban sentence is just a bridi, parallel to the normal English sente
nce which has a subject and a predicate:
4.1) mi klama le zarci
I went to the market
In Chinese, the normal sentence form is different: a topic is stated, and a comm
ent about it is made. (Japanese also has the concept of a topic, but indicates i
t by attaching a suffix; other languages also distinguish topics in various ways
.) The topic says what the sentence is about:
4.2) zhe4 xiao1xi2 wo3 zhi1dao le
this news I know [perfective]
As for this news, I knew it.
I've heard this news already.
The wide space in the first two versions of Example 4.2 separate the topic (``th
is news'') from the comment (``I know already'').
Lojban uses the cmavo ``zo'u'' (of selma'o ZOhU) to separate topic (a sumti) fro
m comment (a bridi):
4.3) le nuzba zu'o mi ba'o djuno
The news : I [perfective] know.
Example 4.3 is the literal Lojban translation of Example 4.2. Of course, the top
ic-comment structure can be changed to a straightforward bridi structure:
4.4) mi ba'o djuno le nuzba
I [perfective] know the news.
Example 4.4 means the same as Example 4.3, and it is simpler. However, often the
position of the topic in the place structure of the selbri within the comment i
s vague:
4.5) le finpe zo'u citka
the fish : eat
Is the fish eating or being eaten? The sentence doesn't say. The Chinese equival
ent of Example 4.5 is:
4.6) yu2 chi1
fish eat
which is vague in exactly the same way.
Grammatically, it is possible to have more than one sumti before ``zo'u''. This
is not normally useful in topic-comment sentences, but is necessary in the other
use of ``zo'u'': to separate a quantifying section from a bridi containing quan
tified variables. This usage belongs to a discussion of quantifier logic in Lojb
an (see Chapter 16), but an example would be:
4.7) roda poi prenu ku'o su'ode zo'u
de patfu da
for-all X which-are-persons, there-exists-a-Y such-that
Y is the father of X.
Every person has a father.
The string of sumti before ``zo'u'' (called the ``prenex'': see Chapter 16) may
contain both a topic and bound variables:
4.8) loi patfu
roda poi prenu ku'o su'ode zo'u
de patfu da
for-the-mass-of fathers
for-all X which-are-persons, there-exists-a-Y such-that
Y is the father of X.
As for fathers, every person has one.
To specify a topic which affects more than one sentence, wrap the sentences in `
`tu'e ... tu'u'' brackets and place the topic and the ``zo'u'' directly in front
. This is the exception to the rule that a topic attaches directly to a sentence
:
4.9) loi jdini zo'u tu'e do ponse .inaja do djica [tu'u]
the-mass-of money : ( [if] you possess, then you want )
Money: if you have it, you want it.
Note: In Lojban, you do not ``want money''; you ``want to have money'' or someth
ing of the sort, as the x2 place of ``djica'' demands an event. As a result, the
straightforward rendering of Example 4.8 without a topic is not:
4.10) do ponse loi jdini .inaja do djica ri
You possess money only-if you desire its-mere-existence
where ``ri'' means ``loi jdini'' and is interpreted as ``the mere existence of m
oney'', but rather:
4.11) do ponse loi jdini .inaja do djica tu'a ri
You possess money only-if you desire something-about it
namely, the possession of money. But topic-comment sentences like Example 4.9 ar
e inherently vague, and this difference between ``ponse'' (which expects a physi
cal object in x2) and ``djica'' is ignored. See Example 9.3 for another topic/co
mment sentence.
The subject of an English sentence is often the topic as well, but in Lojban the
sumti in the x1 place is not necessarily the topic, especially if it is the nor
mal (unconverted) x1 for the selbri. Thus Lojban sentences don't necessarily hav
e a ``subject'' in the English sense.
5. Questions and answers
The following cmavo are discussed in this section:
xu UI truth question
ma KOhA sumti question
mo GOhA bridi question
xo PA number question
ji A sumti connective question
ge'i GA forethought connective question
gi'i GIhA bridi-tail connective question
gu'i GUhA tanru forethought connective question
je'i JA tanru connective question
pei UI attitude question
fi'a FA place structure question
cu'e CUhE tense/modal question
pau UI question premarker
Lojban questions are not at all like English questions. There are two basic type
s: truth questions, of the form ``Is it true that ... '', and fill-in-the-blank
questions. Truth questions are marked by preceding the bridi, or following any p
art of it specifically questioned, with the cmavo ``xu'' (of selma'o UI):
5.1) xu do klama le zarci
[True or false?] You go to the store
Are you going to the store/Did you go to the store?
(Since the Lojban is tenseless, either colloquial translation might be correct.)
Truth questions are further discussed in Chapter 15.
Fill-in-the-blank questions have a cmavo representing some Lojban word or phrase
which is not known to the questioner, and which the answerer is to supply. Ther
e are a variety of cmavo belonging to different selma'o which provide different
kinds of blanks.
Where a sumti is not known, a question may be formed with ``ma'' (of selma'o KOh
A), which is a kind of pro-sumti:
5.2) ma klama le zarci
[What sumti?] goes-to the store
Who is going to the store?
Of course, the ``ma'' need not be in the x1 place:
5.3) do klama ma
You go-to [what sumti?]
Where are you going?
The answer is a simple sumti:
5.4) le zarci
The store.
A sumti, then, is a legal utterance, although it does not by itself constitute a
bridi -- it does not claim anything, but merely completes the open-ended claim
of the previous bridi.
There can be two ``ma'' cmavo in a single question:
5.5) ma klama ma
Who goes where?
and the answer would be two sumti, which are meant to fill in the two ``ma'' cma
vo in order:
5.6) mi le zarci
I, to the store.
An even more complex example, depending on the non-logical connective ``fa'u'' (
of selma'o JOI), which is like the English ``and ... respectively'':
5.7) ma fa'u ma klama ma fa'u ma
Who and who goes where and where, respectively?
An answer might be
5.8) la djan. la marcas. le zarci le briju
John, Marsha, the store, the office.
John and Marsha go to the store and the office,
respectively.
(Note: A mechanical substitution of Example 5.8 into Example 5.7 produces an ung
rammatical result, because ``* ... le zarci fa'u le briju'' is ungrammatical Loj
ban: the first ``le zarci'' has to be closed with its proper terminator ``ku'',
for reasons explained in Chapter 14. This effect is not important: Lojban behave
s as if all elided terminators have been supplied in both question and answer be
fore inserting the latter into the former. The exchange is grammatical if questi
on and answer are each separately grammatical.)
Questions to be answered with a selbri are expressed with ``mo'' of selma'o GOhA
, which is a kind of pro-bridi:
5.9) la lojban. mo
Lojban [what selbri?]
What is Lojban?
Here the answerer is to supply some predicate which is true of Lojban. Such ques
tions are extremely open-ended, due to the enormous range of possible predicate
answers. The answer might be just a selbri, or might be a full bridi, in which c
ase the sumti in the answer override those provided by the questioner. To limit
the range of a ``mo'' question, make it part of a tanru.
Questions about numbers are expressed with ``xo'' of selma'o PA:
5.10) do viska xo prenu
You saw [what number?] persons.
How many people did you see?
The answer would be a simple number, another kind of non-bridi utterance:
5.11) vomu
Forty-five.
Fill-in-the-blank questions may also be asked about: logical connectives (using
cmavo ``ji'' of A, ``ge'i'' of GA, ``gi'i'' of GIhA, ``gu'i'' of GUhA, or ``je'i
'' of JA, and receiving an ek, gihek, ijek, or ijoik as an answer) --- see Chapt
er 14; attitudes (using ``pei'' of UI, and receiving an attitudinal as an answer
) --- see Chapter 13; place structures (using ``fi'a'' of FA, and receiving a cm
avo of FA as an answer) --- see Chapter 9; tenses and modals (using ``cu'e'' of
CUhE, and receiving any tense or BAI cmavo as an answer) --- see Chapter 9 and C
hapter 10.
Questions can be marked by placing ``pau'' (of selma'o UI) before the question b
ridi. See Chapter 13 for details.
The full list of non-bridi utterances suitable as answers to questions is:
any number of sumti (with elidable terminator ``vau'', see Chapter 6) an ek
or gihek (logical connectives, see Chapter 14) a number, or any mathematical exp
ression placed in parentheses (see Chapter 18) a bare ``na'' negator (to negate
some previously expressed bridi), or corresponding ``ja'a'' affirmer (see Chapte
r 15) a relative clause (to modify some previously expressed sumti, see Chapter
8) a prenex/topic (to modify some previously expressed bridi, see Chapter 16) li
nked arguments (beginning with ``be'' or ``bei'' and attached to some previously
expressed selbri, often in a description,see Chapter 5)
At the beginning of a text, the following non-bridi are also permitted:
one or more names (to indicate direct address without ``doi'', see Chapter 6
) indicators (to express a prevailing attitude, see Chapter 13) ``nai'' (to vagu
ely negate something or other, see Chapter 15)
Where not needed for the expression of answers, most of these are made grammatic
al for pragmatic reasons: people will say them in conversation, and there is no
reason to rule them out as ungrammatical merely because most of them are vague.
6. Subscripts: XI
The following cmavo is discussed in this section:
xi XI subscript
The cmavo ``xi'' (of selma'o XI) indicates that a subscript (a number, a lerfu s
tring, or a parenthesized mekso) follows. Subscripts can be attached to almost a
ny construction and are placed following the construction (or its terminator wor
d, which is generally required). They are useful either to extend the finite cma
vo list to infinite length, or to make more refined distinctions than the standa
rd cmavo list permits. The remainder of this section mentions some places where
subscripts might naturally be used.
Lojban gismu have at most five places:
6.1) mi cu klama le zarci le zdani
le dargu le karce
I go to-the market from-the house
via-the road using-the car.
Consequently, selma'o SE (which operates on a selbri to change the order of its
places) and selma'o FA (which provides place number tags for individual sumti) h
ave only enough members to handle up to five places. Conversion of Example 6.1,
using ``xe'' to swap the x1 and x5 places, would produce:
6.2) le karce cu xe klama le zarci
le zdani le dargu mi
The car is-a-transportation-means to-the market
from-the house via-the road for-me.
And reordering of the place structures might produce:
6.3) fo le dargu fi le zdani fa mi fe le zarci
fu le karce cu klama
Via the road, from the house, I, to the market,
using-the car, go.
Examples 6.1 to 6.3 all mean the same thing. But consider the lujvo ``nunkla'',
formed by applying the abstraction operator ``nu'' to ``klama'':
6.4) la'edi'u cu nunkla
mi le zarci le zdani
le dargu le karce
The-referent-of-the-previous-sentence is-an-event-of-going
by-me to-the market from-the house
via-the road using-the car.
Example 6.4 shows that ``nunkla'' has six places: the five places of ``klama'' p
lus a new one (placed first) for the event itself. Performing transformations si
milar to that of Example 6.2 requires an additional conversion cmavo that exchan
ges the x1 and x6 places. The solution is to use any cmavo of SE with a subscrip
t:
6.5) le karce cu sexixa nunkla
mi le zarci le zdani le dargu
la'edi'u
The car is-a-transportation-means-in-the-event-of-going
by-me to-the market via-the road
which-is-referred-to-by-the-last-sentence.
Likewise, a sixth place tag can be created by using any cmavo of FA with a subsc
ript:
6.6) fu le dargu fo le zdani fe mi
fa la'edi'u fi le zarci
faxixa le karce cu klama
Via the road, from the house, by me,
the-referent-of-the-last-sentence, to the market,
using the car, is-an-event-of-going.
Examples 6.4 to 6.6 also all mean the same thing, and each is derived straightfo
rwardly from any of the others, despite the tortured nature of the English gloss
es. In addition, any other member of SE or FA could be substituted into ``sexixa
'' and ``faxixa'' without change of meaning: ``vexixa'' means the same thing as
``sexixa''.
Lojban provides two groups of pro-sumti, both belonging to selma'o KOhA. The ko'
a-series cmavo are used to refer to explicitly specified sumti to which they hav
e been bound using ``goi''. The da-series, on the other hand, are existentially
or universally quantified variables. (These concepts are explained more fully in
Chapter 16.) There are ten ko'a-series cmavo and 3 da-series cmavo available.
If more are required, any cmavo of the ko'a-series or the da-series can be subsc
ripted:
6.7) daxivo
X sub 4
is the 4th bound variable of the 1st sequence of the da-series, and
6.8) ko'ixipaso
something-3 sub 18
is the 18th free variable of the 3rd sequence of the ko'a-series. This conventio
n allows 10 sequences of ko'a-type pro-sumti and 3 sequences of da-type pro-sumt
i, each with arbitrarily many members. Note that ``daxivo'' and ``dexivo'' are c
onsidered to be distinct pro-sumti, unlike the situation with ``sexixa'' and ``v
exixa'' above. Exactly similar treatment can be given to the bu'a-series of selm
a'o GOhA and to the gismu pro-bridi ``broda'', ``brode'', ``brodi'', ``brodo'',
and ``brodu''.
Subscripts on lerfu words are used in the standard mathematical way to extend th
e number of variables:
6.9) li xy.boixipa du li xy.boixire su'i xy.boixici
The-number x-sub-1 equals the-number x-sub-2 plus x-sub-3
$x1 = x2 + x3$
and can be used to extend the number of pro-sumti as well, since lerfu strings o
utside mathematical contexts are grammatically and semantically equivalent to pr
o-sumti of the ko'a-series. (In Example 6.9, note the required terminator ``boi'
' after each ``xy.'' cmavo; this terminator allows the subscript to be attached
without ambiguity.)
Names, which are similar to pro-sumti, can also be subscripted to distinguish tw
o individuals with the same name:
6.10) la djan. xipa cusku lu mi'enai do li'u la djan. xire
John1 expresses ``I-am-not you'' to John2.
Subscripts on tenses allow talking about more than one time or place that is des
cribed by the same general cmavo. For example, ``puxipa'' could refer to one poi
nt in the past, and ``puxire'' a second point (earlier or later).
You can place a subscript on the word ``ja'a'', the bridi affirmative of selma'o
NA, to express so-called fuzzy truths. The usual machinery for fuzzy logic (sta
tements whose truth value is not merely ``true'' or ``false'', but is expressed
by a number in the range 0 to 1) in Lojban is the abstractor ``jei'':
6.11) li pimu jei mi ganra
The-number .5 is-the-truth-value-of my being-broad
However, by convention we can attach a subscript to ``ja'a'' to indicate fuzzy t
ruth (or to ``na'' if we change the amount):
6.12) mi ja'a xipimu ganra
I truly-sub-.5 am-broad
Finally, as mentioned in Section 2, ``ni'o'' and ``no'i'' cmavo with matching su
bscripts mark the start and the continuation of a given topic respectively. Diff
erent topics can be assigned to different subscripts.
Other uses of subscripts will doubtless be devised in future.
7. Utterance ordinals: MAI
The following cmavo are discussed in this section:
mai MAI utterance ordinal, -thly
mo'o MAI higher order utterance ordinal
Numerical free modifiers, corresponding to English ``firstly'', ``secondly'', an
d so on, can be created by suffixing ``mai'' or ``mo'o'' of selma'o MAI to a num
ber or a lerfu string. Here are some examples:
7.1) mi klama pamai le zarci .e remai le zdani
I go-to (firstly) the store and (secondly) the market.
This does not imply that I go to the store before I go to the market: that meani
ng requires a tense. The sumti are simply numbered for convenience of reference.
Like other free modifiers, the utterance ordinals can be inserted almost anywhe
re in a sentence without affecting its grammar or its meaning.
Any of the Lojban numbers can be used with MAI: ``romai'', for example, means ``
all-thly'' or ``lastly''. Likewise, if you are enumerating a long list and have
forgotten which number is wanted next, you can say ``ny.mai'', or ``Nthly''.
The difference between ``mai'' and ``mo'o'' is that ``mo'o'' enumerates larger s
ubdivisions of a text; ``mai'' was designed for lists of numbered items, whereas
``mo'o'' was intended to subdivide structured works. If this chapter were trans
lated into Lojban, it might number each section with ``mo'o'': this section woul
d then be introduced with ``zemo'o'', or ``Section 7.''
8. Attitude scope markers: FUhE/FUhO
The following cmavo are discussed in this section:
fu'e FUhE open attitudinal scope
fu'o FUhO close attitudinal scope
Lojban has a complex system of ``attitudinals'', words which indicate the speake
r's attitude to what is being said. The attitudinals include indicators of emoti
on, intensity markers, discursives (which show the structure of discourse), and
evidentials (which indicate ``how the speaker knows''). Most of these words belo
ng to selma'o UI; the intensity markers belong to selma'o CAI for historical rea
sons, but the two selma'o are grammatically identical. The individual cmavo of U
I and CAI are discussed in Chapter 13; only the rules for applying them in disco
urse are presented here.
Normally, an attitudinal applies to the preceding word only. However, if the pre
ceding word is a structural cmavo which begins or ends a whole construction, the
n that whole construction is affected by the attitudinal:
8.1) mi viska le blanu .ia zdani [ku]
I see the blue [belief] house.
I see the house, which I believe to be blue.
8.2) mi viska le blanu zdani .ia [ku]
I see the blue house [belief].
I see the blue thing, which I believe to be a house.
8.3) mi viska le .ia blanu zdani [ku]
I see the [belief] blue house
I see what I believe to be a blue house.
8.4) mi viska le blanu zdani ku .ia
I see (the blue house ) [belief]
I see what I believe to be a blue house.
An attitudinal meant to cover a whole sentence can be attached to the preceding
``.i'', expressed or understood:
8.5) [.i] .ia mi viska le blanu zdani
[belief] I see the blue house
I believe I see a blue house.
or to an explicit ``vau'' placed at the end of a bridi.
Likewise, an attitudinal meant to cover a whole paragraph can be attached to ``n
i'o'' or ``no'i''. An attitudinal at the beginning of a text applies to the whol
e text.
However, sometimes it is necessary to be more specific about the range of one or
more attitudinals, particularly if the range crosses the boundaries of standard
Lojban syntactic constructions. The cmavo ``fu'e'' (of selma'o FUhE) and ``fu'o
'' (of selma'o FUhO) provide explicit scope markers. Placing ``fu'e'' in front o
f an attitudinal disconnects it from what precedes it, and instead says that it
applies to all following words until further notice. The notice is given by ``fu
'o'', which can appear anywhere and cancels all in-force attitudinals. For examp
le:
8.6) mi viska le fu'e .ia blanu zdani fu'o ponse
I see the [start] [belief] blue house [end] possessor
I see the owner of what I believe to be a blue house.
Here, only the ``blanu zdani'' portion of the three-part tanru ``blanu zdani pon
se'' is marked as a belief of the speaker. Naturally, the attitudinal scope mark
ers do not affect the rules for interpreting multi-part tanru: ``blanu zdani'' g
roups first because tanru group from left to right unless overridden with ``ke''
or ``bo''.
Other attitudinals of more local scope can appear after attitudinals marked by F
UhE; these attitudinals are added to the globally active attitudinals rather tha
n superseding them.
9. Quotations: LU, LIhU, LOhU, LEhU
The following cmavo are discussed in this section:
lu LU begin quotation
li'u LIhU end quotation
lo'u LOhU begin error quotation
le'u LEhU end error quotation
Grammatically, quotations are very simple in Lojban: all of them are sumti, and
they all mean something like ``the piece of text here quoted'':
9.1) mi pu cusku lu mi'e djan [li'u]
I [past] express [quote] I-am John [unquote]
I said, ``I'm John''.
But in fact there are four different flavors of quotation in the language, invol
ving six cmavo of six different selma'o. This being the case, quotation deserves
some elaboration.
The simplest kind of quotation, exhibited in Example 9.1, uses the cmavo ``lu''
(of selma'o LU) as the opening quotation mark, and the cmavo ``li'u'' (of selma'
o LIhU) as the closing quotation mark. The text between ``lu'' and ``li'u'' must
be a valid, parseable Lojban text. If the quotation is ungrammatical, so is the
surrounding expression. The cmavo ``li'u'' is technically an elidable terminato
r, but it's almost never possible to elide it except at the end of text.
The cmavo ``lo'u'' (of selma'o LOhU) and ``le'u'' (of selma'o LEhU) are used to
surround a quotation that is not necessarily grammatical Lojban. However, the te
xt must consist of morphologically correct Lojban words (as defined in Chapter 4
), so that the ``le'u'' can be picked out reliably. The words need not be meanin
gful, but they must be recognizable as cmavo, brivla, or cmene. Quotation with `
`lo'u'' is essential to quoting ungrammatical Lojban for teaching in the languag
e, the equivalent of the * that is used in English to mark such errors:
9.2) lo'u mi du do du la djan. le'u
na tergerna la lojban.
[quote] mi du do du la djan. [unquote]
is-not a-grammatical-structure-in Lojban.
Example 9.2 is grammatical even though the embedded quotation is not. Similarly,
``lo'u'' quotation can quote fragments of a text which themselves do not consti
tute grammatical utterances:
9.3) lu le mlatu cu viska le finpe li'u zo'u
lo'u viska le le'u
cu selbasti .ei
lo'u viska lo le'u
[quote] le mlatu cu viska le finpe [unquote] :
[quote] viska le [unquote]
is-replaced-by [obligation!]
[quote] viska lo [unquote].
In the sentence ``le mlatu viska le finpe'',
``viska le'' should be replaced by ``viska lo''.
Note the topic-comment formulation (Section 4) and the indicator applying to the
selbri only (Section 8). Neither ``viska le'' nor ``viska lo'' is a valid Lojba
n utterance, and both require ``lo'u'' quotation.
Additionally, pro-sumti or pro-bridi in the quoting sentence can refer to words
appearing in the quoted sentence when ``lu ... li'u'' is used, but not when ``lo
'u ... le'u'' is used:
9.4) la tcarlis. cusku lu le ninmu cu morsi li'u
.iku'i ri jmive
Charlie says [quote] the woman is-dead [unquote].
However, the-last-mentioned is-alive.
Charlie says ``The woman is dead'', but she is alive.
In Example 9.4, ``ri'' is a pro-sumti which refers to the most recent previous s
umti, namely ``le ninmu''. Compare:
9.5) la tcarlis. cusku lo'u le ninmu cu morsi le'u
.iku'i ri jmive
Charlie says [quote] le ninmu cu morsi [unquote].
However, the-last-mentioned is-alive.
Charlie says ``le ninmu cu morsi'', but he is alive.
In Example 9.5, ``ri'' cannot refer to the referent of the alleged sumti ``le ni
nmu'', because ``le ninmu cu morsi'' is a mere uninterpreted sequence of Lojban
words. Instead, ``ri'' ends up referring to the referent of the sumti ``la tcarl
is.'', and so it is Charlie who is alive.
The metalinguistic erasers ``si'', ``sa'', and ``su'', discussed in Section 13,
do not operate in text between ``lo'u'' and ``le'u''. Since the first ``le'u'' t
erminates a ``lo'u'' quotation, it is not directly possible to have a ``lo'u'' q
uotation within another ``lo'u'' quotation. However, it is possible for a ``le'u
'' to occur within a ``lo'u ... le'u'' quotation by preceding it with the cmavo
``zo'', discussed in Section 10. Note that ``le'u'' is not an elidable terminato
r; it is required.
10. More on quotations: ZO, ZOI
The following cmavo are discussed in this section:
zo ZO quote single word
zoi ZOI non-Lojban quotation
la'o ZOI non-Lojban name
The cmavo ``zo'' (of selma'o ZO) is a strong quotation mark for the single follo
wing word, which can be any Lojban word whatsoever. Among other uses, ``zo'' all
ows a metalinguistic word to be referenced without having it act on the surround
ing text. The word must be a morphologically legal (but not necessarily meaningf
ul) single Lojban word; compound cmavo are not permitted. For example:
10.1) zo si cu lojbo valsi
``si'' is a Lojbanic word.
Since ``zo'' acts on a single word only, there is no corresponding terminator. B
revity, then, is a great advantage of ``zo'', since the terminators for other ki
nds of quotation are rarely or never elidable.
The cmavo ``zoi'' (of selma'o ZOI) is a quotation mark for quoting unlexable (no
n-Lojban) text. Its syntax is ``zoi X. text .X'', where X is a Lojban word (call
ed the delimiting word) which is separated from the quoted text by pauses, and w
hich is not found in the written text or spoken phoneme stream. It is common, bu
t not required, to use the lerfu word (of selma'o BY) which corresponds to the L
ojban name of the language being quoted:
10.2) zoi gy. John is a man .gy. cu glico jufra
``John is a man'' is an English sentence.
where ``gy'' stands for ``glico''. Other popular choices of delimiting words are
``.kuot.'', a Lojban name which sounds like the English word ``quote'', and the
word ``zoi'' itself. Another possibility is a Lojban word suggesting the topic
of the quotation.
Within written text, the Lojban written word used as a delimiting word may not a
ppear, whereas within spoken text, the sound of the delimiting word may not be u
ttered. This leads to occasional breakdowns of audio-visual isomorphism: Example
10.3 is fine in speech but ungrammatical as written, whereas Example 10.4 is co
rrect when written but ungrammatical in speech.
10.3) ?mi djuno fi le valsi po'u zoi gy. gyrations .gy.
I know about the word which-is ``gyrations''.
10.4) ?mi djuno fi le valsi po'u zoi jai. gyrations .jai
I know about the word which-is ``gyrations''.
The text ``gy'' appears in the written word ``gyrations'', whereas the sound rep
resented in Lojban by ``jai'' appears in the spoken word ``gyrations''. Such bor
derline cases should be avoided as a matter of good style.
It should be noted particularly that ``zoi'' quotation is the only way to quote
rafsi, specifically CCV rafsi, because they are not Lojban words, and ``zoi'' qu
otation is the only way to quote things which are not Lojban words. (CVC and CVV
rafsi look like names and cmavo respectively, and so can be quoted using other
methods.) For example:
10.5) zoi ry. sku .ry. cu rafsi zo cusku
``sku'' is a rafsi of ``cusku''.
(A minor note on interaction between ``lo'u ... le'u'' and ``zoi'': The text bet
ween ``lo'u'' and ``le'u'' should consist of Lojban words only. In fact, non-Loj
ban material in the form of a ``zoi'' quotation may also appear. However, if the
word ``le'u'' is used either as the delimiting word for the ``zoi'' quotation,
or within the quotation itself, the outer ``lo'u'' quotation will be prematurely
terminated. Therefore, ``le'u'' should be avoided as the delimiting word in any
``zoi'' quotation.)
Lojban strictly avoids any confusion between things and the names of things:
10.6) zo .bab. cmene la bab.
The-word ``Bob'' is-the-name-of the-one-named Bob.
In Example 10.6, ``zo .bab.'' is the word, whereas ``la bab.'' is the thing name
d by the word. The cmavo ``la'e'' and ``lu'e'' (of selma'o LAhE) convert back an
d forth between references and their referents:
10.7) zo .bab. cmene la'e zo .bab.
The-word ``Bob'' is-the-name-of
the-referent-of the-word ``Bob''.
10.8) lu'e la bab. cmene la bab.
A-symbol-for Bob is-the-name-of Bob.
Examples 10.6 through 10.8 all mean approximately the same thing, except for dif
ferences in emphasis. Example 10.9 is different:
10.9) la bab. cmene la bab.
Bob is the name of Bob.
and says that Bob is both the name and the thing named, an unlikely situation. P
eople are not names.
(In Examples 10.6 through 10.7, the name ``bab.'' was separated from a preceding
``zo'' by a pause, thus: ``zo .bab.''. The reason for this extra pause is that
all Lojban names must be separated by pause from any preceding word other than `
`la'', ``lai'', ``la'i'' (all of selma'o LA) and ``doi'' (of selma'o DOI). There
are numerous other cmavo that may precede a name: of these, ``zo'' is one of th
e most common.)
The cmavo ``la'o'' also belongs to selma'o ZOI, and is mentioned here for comple
teness, although it does not signal the beginning of a quotation. Instead, ``la'
o'' serves to mark non-Lojban names, especially the Linnaean binomial names (suc
h as ``Homo sapiens'') which are the internationally standardized names for spec
ies of animals and plants. Internationally known names which can more easily be
recognized by spelling rather than pronunciation, such as ``Goethe'', can also a
ppear in Lojban text with ``la'o'':
10.10) la'o dy. Goethe .dy. cu me la'o ly. Homo sapiens .ly.
Goethe is a Homo sapiens.
Using ``la'o'' for all names rather than Lojbanizing, however, makes for very cu
mbersome text. A rough equivalent of ``la'o'' might be ``la me zoi''.
11. Contrastive emphasis: BAhE
The following cmavo are discussed in this section:
ba'e BAhE emphasize next word
za'e BAhE next word is nonce
English often uses strong stress on a word to single it out for contrastive emph
asis, thus
11.1) I saw George.
is quite different from
11.2) I saw George.
The heavy stress on ``George'' (represented in writing by italics) indicates tha
t I saw George rather than someone else. Lojban does not use stress in this way:
stress is used only to help separate words (because every brivla is stressed on
the penultimate syllable) and in names to match other languages' stress pattern
s. Note that many other languages do not use stress in this way either; typicall
y word order is rearranged, producing something like
11.4) It was George whom I saw.
In Lojban, the cmavo ``ba'e'' (of selma'o BAhE) precedes a single word which is
to be emphasized:
11.4.5) mi viska la ba'e .djordj.
I saw the-one-named [emphasis] ``George''.
I saw George.
Note the pause before the name ``djordj.'', which serves to separate it unambigu
ously from the ``ba'e''. Alternatively, the ``ba'e'' can be moved to a position
before the ``la'', which in effect emphasizes the whole construct ``la djordj.''
:
11.5) mi viska ba'e la djordj.
I saw [emphasis] the-one-named ``George''.
I saw George.
Marking a word with a cmavo of BAhE does not change the word's grammar in any wa
y. Any word in a bridi can receive contrastive emphasis marking:
11.6) ba'e mi viska la djordj.
I, no one else, saw George.
11.7) mi ba'e viska la djordj.
I saw (not heard or smelled) George.
Emphasis on one of the structural components of a Lojban bridi can also be achie
ved by rearranging it into an order that is not the speaker's or writer's usual
order. Any sumti moved out of place, or the selbri when moved out of place, is e
mphatic to some degree.
For completeness, the cmavo ``za'e'' should be mentioned, also of selma'o BAhE.
It marks a word as possibly irregular, non-standard, or nonce (created for the o
ccasion):
11.8) mi klama la za'e. .albeinias
I go-to so-called Albania
marks a Lojbanization of an English name, where a more appropriate standard form
might be something like ``la ctiipyris.'', reflecting the country's name in Alb
anian.
Before a lujvo or fu'ivla, ``za'e'' indicates that the word has been made up on
the spot and may be used in a sense that is not found in the unabridged dictiona
ry (when we have an unabridged dictionary!).
12. Parenthesis and metalinguistic commentary: TO, TOI, SEI
The following cmavo are discussed in this section:
to TO open parenthesis
to'i TO open editorial parenthesis
toi TOI close parenthesis
sei SEI metalinguistic bridi marker
The cmavo ``to'' and ``toi'' are discursive (non-mathematical) parentheses, for
inserting parenthetical remarks. Any text whatsoever can go within the parenthes
es, and it is completely invisible to its context. It can, however, refer to the
context by the use of pro-sumti and pro-bridi: any that have been assigned in t
he context are still assigned in the parenthetical remarks, but the reverse is n
ot true.
12.1) doi lisas. mi djica le nu
to doi frank. ko sisti toi
do viska le mlatu
O Lisa, I desire the event-of
( O Frank, [imperative] stop! )
you see the cat.
Lisa, I want you to (Frank! Stop!) see the cat.
Example 12.1 implicitly redefines ``do'' within the parentheses: the listener is
changed by ``doi frank.'' When the context sentence resumes, however, the old l
istener, Lisa, is automatically restored.
There is another cmavo of selma'o TO: ``to'i''. The difference between ``to'' an
d ``to'i'' is the difference between parentheses and square brackets in English
prose. Remarks within ``to ... toi'' cmavo are implicitly by the same speaker, w
hereas remarks within ``to'i ... toi'' are implicitly by someone else, perhaps a
n editor:
12.2) la frank. cusku lu mi prami do
to'isa'a do du la djein. toi li'u
Frank expresses ``I love you
[you = Jane]''
The ``sa'a'' suffix is a discursive cmavo (of selma'o UI) meaning ``editorial in
sertion'', and indicating that the marked word or construct (in this case, the e
ntire bracketed remark) is not part of the quotation. It is required whenever th
e ``to'i ... toi'' remark is physically within quotation marks, at least when sp
eaking to literal-minded listeners; the convention may be relaxed if no actual c
onfusion results.
Note: The parser believes that parentheses are attached to the previous word or
construct, because it treats them as syntactic equivalents of subscripts and oth
er such so-called ``free modifiers''. Semantically, however, parenthetical remar
ks are not necessarily attached either to what precedes them or what follows the
m.
The cmavo ``sei'' (of selma'o SEI) begins an embedded discursive bridi. Comments
added with ``sei'' are called ``metalinguistic'', because they are comments abo
ut the discourse itself rather than about the subject matter of the discourse. T
his sense of the term ``metalinguistic'' is used throughout this chapter, and is
not to be confused with the sense ``language for expressing other languages''.
When marked with ``sei'', a metalinguistic utterance can be embedded in another
utterance as a discursive. In this way, discursives which do not have cmavo assi
gned in selma'o UI can be expressed:
12.3) la frank. prami sei la frank. gleki la djein.
Frank loves (Frank is happy) Jane.
Using the happiness attitudinal, ``.ui'', would imply that the speaker was happy
. Instead, the speaker attributes happiness to Frank. It would probably be safe
to elide the one who is happy, and say:
12.4) la frank. prami sei gleki la djein.
Frank loves (he is happy) Jane.
The grammar of the bridi following ``sei'' has an unusual limitation: the sumti
must either all precede the selbri, or must be glued into the selbri with ``be''
and ``bei'':
12.5) la frank. prami sei gleki be fa la suzn. la djein.
Frank loves (Susan is happy) Jane.
This restriction allows the terminator cmavo ``se'u'' to almost always be elided
.
Since a discursive utterance is working at a ``higher'' level of abstraction tha
n a non-discursive utterance, a non-discursive utterance cannot refer to a discu
rsive utterance. Specifically, the various back-counting, reciprocal, and reflex
ive constructs in selma'o KOhA ignore the utterances at ``higher'' metalinguisti
c levels in determining their referent. It is possible, and sometimes necessary,
to refer to lower metalinguistic levels. For example, the English ``he said'' i
n a conversation is metalinguistic. For this purpose, quotations are considered
to be at a lower metalinguistic level than the surrounding context (a quoted tex
t cannot refer to the statements of the one who quotes it), whereas parenthetica
l remarks are considered to be at a higher level than the context.
Lojban works differently from English in that the ``he said'' can be marked inst
ead of the quotation. In Lojban, you can say:
12.6) la djan. cusku lu mi klama le zarci li'u
John expresses ``I go to-the store''.
which literally claims that John uttered the quoted text. If the central claim i
s that John made the utterance, as is likely in conversation, this style is the
most sensible. However, in written text which quotes a conversation, you don't w
ant the ``he said'' or ``she said'' to be considered part of the conversation. I
f unmarked, it could mess up the anaphora counting. Instead, you can use:
12.7) lu mi klama le zarci
seisa'a la djan. cusku be dei li'u
`` I go to-the store
( John expresses this-sentence ) ''
``I go to the store'', said John.
And of course other orders are possible:
12.8) lu seisa'a la djan. cusku be dei mi klama le zarci
John said, ``I go to the store''.
12.9) lu mi klama seisa'a la djan cusku le zarci
``I go'', John said, ``to the store''.
Note the ``sa'a'' following each ``sei'', marking the ``sei'' and its attached b
ridi as an editorial insert, not part of the quotation. In a more relaxed style,
these ``sa'a'' cmavo would probably be dropped.
The elidable terminator for ``sei'' is ``se'u'' (of selma'o SEhU); it is rarely
needed, except to separate a selbri within the ``sei'' comment from an immediate
ly following selbri (or component) outside the comment.
13. Erasure: SI, SA, SU
The following cmavo are discussed in this section:
si SI erase word
sa SA erase phrase
su SU erase discourse
The cmavo ``si'' (of selma'o SI) is a metalinguistic operator that erases the pr
eceding word, as if it had never been spoken:
13.1) ti gerku si mlatu
This is-a-dog, er, is-a-cat.
means the same thing as ``ti mlatu''. Multiple ``si'' cmavo in succession erase
the appropriate number of words:
13.2) ta blanu zdani si si xekri zdani
That is-a-blue house, er, er, is-a-black house.
In order to erase the word ``zo'', it is necessary to use three ``si'' cmavo in
a row:
13.3) zo .bab. se cmene zo si si si la bab.
The-word ``Bob'' is-the-name-of the word ``si'', er, er, Bob.
The first use of ``si'' does not erase anything, but completes the ``zo'' quotat
ion. Two more ``si'' cmavo are then necessary to erase the first ``si'' and the
``zo''.
Incorrect names can likewise cause trouble with ``si'':
13.4) mi tavla fo la .esperanto
si si .esperanton.
I talk in-language that-named ``and'' ``speranto'',
er, er, Esperanto.
The Lojbanized spelling ``.esperanto'' breaks up, as a consequence of the Lojban
morphology rules (see Chapter 4) into two Lojban words, the cmavo ``.e'' and th
e undefined fu'ivla ``speranto''. Therefore, two ``si'' cmavo are needed to eras
e them. Of course, ``.e speranto'' is not grammatical after ``la'', but recognit
ion of ``si'' is done before grammatical analysis.
Even more messy is the result of an incorrect ``zoi'':
13.5) mi cusku zoi fy. gy. .fy.
si si si si zo .djan
I express [foreign] [quote] ``sy'' [unquote],
er, er, er, er, ``John''.
In Example 13.5, the first ``fy'' is taken to be the delimiting word. The next w
ord must be different from the delimiting word, and ``gy.'', the Lojban name for
the letter ``g'', was chosen arbitrarily. Then the delimiting word must be repe
ated. For purposes of ``si'' erasure, the entire quoted text is taken to be a wo
rd, so four words have been uttered, and four more ``si'' cmavo are needed to er
ase them altogether. Similarly, a stray ``lo'u'' quotation mark must be erased w
ith ``fy. le'u si si si'', by completing the quotation and then erasing it all w
ith three ``si'' cmavo.
What if less than the entire ``zo'' or ``zoi'' construct is erased? The result i
s something which has a loose ``zo'' or ``zoi'' in it, without its expected sequ
els, and which is incurably ungrammatical. Thus, to erase just the word quoted b
y ``zo'', it turns out to be necessary to erase the ``zo'' as well:
13.6) mi se cmene zo .djan.
si si zo .djordj.
I am-named-by the-word ``John'',
er, er, the-word ``George''.
The parser will reject ``zo .djan. si .djordj.'', because in that context ``djor
dj.'' is a name (of selma'o CMENE) rather than a quoted word.
Note: The current machine parser does not implement ``si'' erasure.
As the above examples plainly show, precise erasures with ``si'' can be extremel
y hard to get right. Therefore, the cmavo ``sa'' (of selma'o SA) is provided for
erasing more than one word. The cmavo following ``sa'' should be the starting m
arker of some grammatical construct. The effect of the ``sa'' is to erase back t
o and including the last starting marker of the same kind. For example:
13.7) mi viska le sa .i mi cusku zo .djan.
I see the ... I say the-word ``John''.
Since the word following ``sa'' is ``.i'', the sentence separator, its effect is
to erase the preceding sentence. So Example 13.7 is equivalent to:
13.8) mi cusku zo .djan.
Another example, erasing a partial description rather than a partial sentence:
13.9) mi viska le blanu zdan. sa le xekri zdani
I see the blue hou ... the black house.
In Example 13.9, ``le blanu zdan.'' is ungrammatical, but clearly reflects the s
peaker's original intention to say ``le blanu zdani''. However, the ``zdani'' wa
s cut off before the end and changed into a name. The entire ungrammatical ``le'
' construct is erased and replaced by ``le xekri zdani''.
Note: The current machine parser does not implement ``sa'' erasure. Getting ``sa
'' right is even more difficult (for a computer) than getting ``si'' right, as t
he behavior of ``si'' is defined in terms of words rather than in terms of gramm
atical constructs (possibly incorrect ones) and words are conceptually simpler e
ntities. On the other hand, ``sa'' is generally easier for human beings, because
the rules for using it correctly are less finicky.
The cmavo ``su'' (of selma'o SU) is yet another metalinguistic operator that era
ses the entire text. However, if the text involves multiple speakers, then ``su'
' will only erase the remarks made by the one who said it, unless that speaker h
as said nothing. Therefore ``susu'' is needed to eradicate a whole discussion in
conversation.
Note: The current machine parser does not implement either ``su'' or ``susu'' er
asure.
14. Hesitation: Y
The following cmavo is discussed in this section:
.y. Y hesitation noise
Speakers often need to hesitate to think of what to say next or for some extra-l
inguistic reason. There are two ways to hesitate in Lojban: to pause between wor
ds (that is, to say nothing) or to use the cmavo ``.y.'' (of selma'o Y). This re
sembles in sound the English hesitation noise written ``uh'' (or ``er''), but di
ffers from it in the requirement for pauses before and after. Unlike a long paus
e, it cannot be mistaken for having nothing more to say: it holds the floor for
the speaker. Since vowel length is not significant in Lojban, the ``y'' sound ca
n be dragged out for as long as necessary. Furthermore, the sound can be repeate
d, provided the required pauses are respected.
Since the hesitation sound in English is outside the formal language, English-sp
eakers may question the need for a formal cmavo. Speakers of other languages, ho
wever, often hesitate by saying (or, if necessary, repeating) a word (``este'' i
n some dialects of Spanish, roughly meaning ``that is''), and Lojban's audio-vis
ual isomorphism requires a written representation of all meaningful spoken behav
ior. Of course, ``.y.'' has no grammatical significance: it can appear anywhere
at all in a Lojban sentence except in the middle of a word.
15. No more to say: FAhO
The following cmavo is discussed in this section:
fa'o FAhO end of text
The cmavo ``fa'o'' (of selma'o FAhO) is the usually omitted marker for the end o
f a text; it can be used in computer interaction to indicate the end of input or
output, or for explicitly giving up the floor during a discussion. It is outsid
e the regular grammar, and the machine parser takes it as an unconditional signa
l to stop parsing unless it is quoted with ``zo'' or with ``lo'u ... le'u''. In
particular, it is not used at the end of subordinate texts quoted with ``lu ...
li'u'' or parenthesized with ``to ... toi''.
16. List of cmavo interactions
The following list gives the cmavo and selma'o that are recognized by the earlie
st stages of the parser, and specifies exactly which of them interact with which
others. All of the cmavo are at least mentioned in this chapter. The cmavo are
written in lower case, and the selma'o in UPPER CASE.
``zo'' quotes the following word, no matter what it is.
``si'' erases the preceding word unless it is a ``zo''.
``sa'' erases the preceding word and other words, unless the preceding word is a
``zo''.
``su'' is the same as ``sa'', but erases more words.
``lo'u'' quotes all following words up to a ``le'u'' (but not a ``zo le'u'').
``le'u'' is ungrammatical except at the end of a ``lo'u quotation.
ZOI cmavo use the following word as a delimiting word, no matter what it is, but
using ``le'u'' may create difficulties.
``zei'' combines the preceding and the following word into a lujvo, but does not
affect ``zo'', ``si'', ``sa'', ``su'', ``lo'u'', ZOI cmavo, ``fa'o'', and ``zei
''.
BAhE cmavo mark the following word, unless it is ``si'', ``sa'', or ``su'', or u
nless it is preceded by ``zo''. Multiple BAhE cmavo may be used in succession.
``bu'' makes the preceding word into a lerfu word, except for ``zo'', ``si'', ``
sa'', ``su'', ``lo'u'', ZOI cmavo, ``fa'o'', ``zei'', BAhE cmavo, and ``bu''. Mu
ltiple ``bu'' cmavo may be used in succession.
UI and CAI cmavo mark the previous word, except for ``zo'', ``si'', ``sa'', ``su
'', ``lo'u'', ZOI, ``fa'o'', ``zei'', BAhE cmavo, and ``bu''. Multiple UI cmavo
may be used in succession. A following ``nai'' is made part of the UI.
``.y.'', ``da'o'', ``fu'e'', and ``fu'o'' are the same as UI, but do not absorb
a following ``nai''.
17. List of Elidable Terminators
The following list shows all the elidable terminators of Lojban. The first colum
n is the terminator, the second column is the selma'o that starts the correspond
ing construction, and the third column states what kinds of grammatical construc
ts are terminated. Each terminator is the only cmavo of its selma'o, which natur
ally has the same name as the cmavo.
be'o BE sumti attached to a tanru unit
boi PA/BY number or lerfu string
do'u COI/DOI vocative phrases
fe'u FIhO ad-hoc modal tags
ge'u GOI relative phrases
kei NU abstraction bridi
ke'e KE groups of various kinds
ku LE/LA description sumti
ku'e PEhO forethought mekso
ku'o NOI relative clauses
li'u LU quotations
lo'o LI number sumti
lu'u LAhE/NAhE+BO sumti qualifiers
me'u ME tanru units formed from sumti
nu'u NUhI forethought termsets
se'u SEI/SOI metalinguistic insertions
te'u various mekso conversion constructs
toi TO parenthetical remarks
tu'u TUhE multiple sentences or paragraphs
vau (none) simple bridi or bridi-tails
ve'o VEI mekso parentheses
Chapter 20
A Catalogue of selma'o
The following paragraphs list all the selma'o of Lojban, with a brief explanatio
n of what each one is about, and reference to the chapter number where each is e
xplained more fully. As usual, all selma'o names are given in capital letters (w
ith ``h'' serving as the capital of ``''') and are the names of a representative
cmavo, often the most important or the first in alphabetical order. One example
is given of each selma'o: for selma'o which have several uses, the most common
use is shown.
selma'o A (Chapter 14)
Specifies a logical connection (e.g. ``and'', ``or'', ``if''), usually between s
umti.
la djan. .a la djein. klama le zarci John and/or Jane goes to the store.
Also used to create vowel lerfu words when followed with ``bu''.
selma'o BAI (Chapter 9)
May be prefixed to a sumti to specify an additional place, not otherwise present
in the place structure of the selbri, and derived from a single place of some o
ther selbri.
mi tavla bau la lojban. I speak in-language Lojban.
selma'o BAhE (Chapter 19)
Emphasizes the next single word, or marks it as a nonce word (one invented for t
he occasion.
la ba'e .djordj. klama le zarci George goes to the store. It is George who g
oes to the store.
selma'o BE (Chapter 5)
Attaches sumti which fill the place structure of a single unit making up a tanru
. Unless otherwise indicated, the sumti fill the x2, x3, and successive places i
n that order. BE is most useful in descriptions formed with LE. See BEI, BEhO.
mi klama be ta troci I am-a-(goer to-that) type-of-trier. I try to go to tha
t place.
selma'o BEI (Chapter 5)
Separates multiple sumti attached by BE to a tanru unit.
mi klama be le zarci bei le zdani be'o troci I am-a-(goer to-the store from-
the home) type-of-trier. I try to go from the home to the market.
selma'o BEhO (Chapter 5)
Elidable terminator for BE. Terminates sumti that are attached to a tanru unit.
mi klama be le zarci be'o troci I am-a-(goer to-the market) type-of-trier. I
try to go to the market.
selma'o BIhE (Chapter 18)
Prefix attached to a mathematical operator to mark it as higher priority than ot
her mathematical operators, binding its operands more closely.
li ci bi'e pi'u vo su'i mu du li paze The-number 3 [priority] times 4 plus 5
equals the-number 17. 3 * 4 + 5 = 17
selma'o BIhI (Chapter 14)
Joins sumti or tanru units (as well as some other things) to form intervals. See
GAhO.
mi ca sanli la drezdn. bi'i la frankfurt. I [present] stand-on-surface Dresd
en [interval] Frankfurt. I am standing between Dresden and Frankfurt.
selma'o BO (Chapter 5, Chapter 15, Chapter 18)
Joins tanru units, binding them together closely. Also used to bind logically or
non-logically connected phrases, sentences, etc. BO is always high precedence a
nd right-grouping.
ta cmalu nixli bo ckule That is-a-small type-of (girl type-of school). That
is a small school for girls.
selma'o BOI (Chapter 18)
Elidable terminator for PA or BY. Used to terminate a number (string of numeric
cmavo) or lerfu string (string of letter words) when another string immediately
follows.
li re du li vu'u voboi re The-number two equals the-number the-difference-of
four-and two.
selma'o BU (Chapter 17)
A suffix which can be attached to any word, typically a word representing a lett
er of the alphabet or else a name, to make a word for a symbol or a different le
tter of the alphabet. In particular, attached to single-vowel cmavo to make word
s for vowel letters.
.abu .ebu .ibu .obu .ubu .ybu a, e, i, o, u, y.
selma'o BY (Chapter 17)
Words representing the letters of the Lojban alphabet, plus various shift words
which alter the interpretation of other letter words.
.abu tavla .by le la .ibymym. skami A talks-to B about-the of-IBM computers.
A talks to B about IBM computers.
selma'o CAI (Chapter 13)
A particle which indicates the intensity of an emotion: maximum, strong, weak, o
r not at all. Typically follows another particle which specifies the emotion.
.ei cai mi klama le zarci [Obligation!] [Intense!] I go-to the market. I mus
t go to the market.
selma'o CAhA (Chapter 10)
Specifies whether a bridi refers to an actual fact, a potential (achieved or not
), or merely an innate capability.
ro datka ka'e flulimna all ducks [capability] are-float-swimmers All ducks h
ave the capability of swimming by floating.
selma'o CEI (Chapter 7)
Assigns a selbri definition to one of the five pro-bridi gismu: ``broda'', ``bro
de'', ``brodi'', ``brodo'', or ``brodu'', for later use.
ti slasi je mlatu bo cidja lante gacri cei broda .i le crino broda cu barda
.i le xunre broda cu cmalu This is a plastic cat-food can cover, or thingy. The
green thingy is large. The red thingy is small.
selma'o CEhE (Chapter 14, Chapter 16)
Joins multiple terms into a termset. Termsets are used to associate several term
s for logical connectives, for equal quantifier scope, or for special constructs
in tenses.
mi ce'e do pe'e je la djan. ce'e la djeimyz. cu pendo I [,] you [joint] and
John [,] James are-friends-of. I am a friend of you, and John is a friend of Jam
es.
selma'o CO (Chapter 5)
When inserted between the components of a tanru, inverts it, so that the followi
ng tanru unit modifies the previous one.
mi troci co klama le zarci le zdani I am-a-trier of-type (goer to-the market
from-the house). I try to go to the market from the house.
selma'o COI (Chapter 6, Chapter 13)
When prefixed to a name, description, or sumti, produces a vocative: a phrase wh
ich indicates who is being spoken to (or who is speaking). Vocatives are used in
conversational protocols, including greeting, farewell, and radio communication
. See DOI, DOhU.
coi .djan. Greetings, John.
selma'o CU (Chapter 9)
Separates the selbri of a bridi from any sumti which precede it. Never strictly
necessary, but often useful to eliminate various elidable terminators.
le gerku cu klama le zarci The dog goes to-the store.
selma'o CUhE (Chapter 10)
Forms a question which asks when, where, or in what mode the rest of the bridi i
s true. See PU, CAhA, TAhE, and BAI.
do cu'e klama le zarci You [When/Where?] go to-the store? When are you going
to the store?
selma'o DAhO (Chapter 7)
Cancels the assigned significance of all sumti cmavo (of selma'o KOhA) and bridi
cmavo (of selma'o GOhA).
selma'o DOI (Chapter 13)
The non-specific vocative indicator. May be used with or without COI. No pause i
s required between ``doi'' and a following name. See DOhU.
doi frank. mi tavla do O Frank, I speak-to you. Frank, I'm talking to you.
selma'o DOhU (Chapter 13)
Elidable terminator for COI or DOI. Signals the end of a vocative.
coi do'u Greetings [terminator] Greetings, O unspecified one!
selma'o FA (Chapter 9)
Prefix for a sumti, indicating which numbered place in the place structure the s
umti belongs in; overrides word order.
fa mi cu klama fi la .atlantas. fe la bastn. fo le dargu fu le karce x1= I g
o x3= Atlanta x2= Boston x4= the road x5= the car. I go from Atlanta to Boston v
ia the road using the car.
selma'o FAhA (Chapter 10)
Specifies the direction in which, or toward which (when marked with MOhI) or alo
ng which (when prefixed by VEhA or VIhA) the action of the bridi takes place.
le nanmu zu'a batci le gerku The man [left] bites the dog. To my left, the m
an bites the dog.
selma'o FAhO (Chapter 19)
A mechanical signal, outside the grammar, indicating that there is no more text.
Useful in talking to computers.
selma'o FEhE (Chapter 10)
Indicates that the following interval modifier (using TAhE, ROI, or ZAhO) refers
to space rather than time.
ko vi'i fe'e di'i sombo le gurni You-imperative [1-dimensional] [space] [reg
ularly] sow the grain. Sow the grain in a line and evenly!
selma'o FEhU (Chapter 9)
Elidable terminator for FIhO. Indicates the end of an ad hoc modal tag: the tagg
ed sumti immediately follows.
mi viska do fi'o kanla [fe'u] le zunle I see you [modal] eye: the left-thing
I see you with the left eye.
selma'o FIhO (Chapter 9)
When placed before a selbri, transforms the selbri into a modal tag, grammatical
ly and semantically equivalent to a member of selma'o BAI.
mi viska do fi'o kanla le zunle I see you with eye the left-thing I see you
with my left eye.
selma'o FOI (Chapter 17)
Signals the end of a compound alphabet letter word that begins with TEI. Not an
elidable terminator.
tei .ebu .akut. bu foi ( ``e'' ``acute'' ) the letter ``e'' with an acute ac
cent
selma'o FUhA (Chapter 18)
Indicates that the following mathematical expression is to be interpreted as rev
erse Polish (RP), a mode in which mathematical operators follow their operands.
li fu'a reboi re[boi] su'i du li vo the-number [RP!] two, two, plus equals t
he-number four 2 + 2 = 4
selma'o FUhE (Chapter 19)
Indicates that the following indicator(s) of selma'o UI affect not the preceding
word, as usual, but rather all following words until a FUhO.
mi viska le fu'e .ia blanu zdani fu'o ponse I see the [start] [belief] blue
house [end] possessor I see the owner of a blue house, or what I believe to be o
ne.
selma'o FUhO (Chapter 19)
Cancels all indicators of selma'o UI which are in effect.
mi viska le fu'e .ia blanu zdani fu'o ponse I see the [start] [belief] blue
house [end] possessor I see the owner of what I believe to be a blue house.
selma'o GA (Chapter 14)
Indicates the beginning of two logically connected sumti, bridi-tails, or variou
s other things. Logical connections include ``both ... and'', ``either ... or'',
``if ... then'', and so on. See GI.
ga la djan. nanmu gi la djeimyz. ninmu Either John is a man or James is a wo
man (or both).
selma'o GAhO (Chapter 14)
Specifies whether an interval specified by BIhI includes or excludes its endpoin
ts. Used in pairs before and after the BIhI cmavo, to specify the nature of both
the left- and the right-hand endpoints.
mi ca sanli la drezdn. ga'o bi'i ga'o la frankfurt. I [present] stand Dresde
n [inclusive] [interval] [inclusive] Frankfurt. I am standing between Dresden an
d Frankfurt, inclusive of both.
selma'o GEhU (Chapter 8)
Elidable terminator for GOI. Marks the end of a relative phrase. See KUhO.
la djan. goi ko'a ge'u blanu John (referred to as #1) is-blue.
selma'o GI (Chapter 14)
Separates two logically or non-logically connected sumti, tanru units, bridi-tai
ls, or other things, when the prefix is a forethought logical connective involvi
ng GA, GUhA, or JOI.
ge la djan. nanmu gi la djeimyz. ninmu (It is true that) both John is a man
and James is a woman.
selma'o GIhA (Chapter 14)
Specifies a logical connective (e.g. ``and'', ``or'', ``if'') between two bridi-
tails: a bridi-tail is a selbri with any associated following sumti, but not inc
luding any preceding sumti.
mi klama le zarci gi'e nelci la djan. I go-to the market and like John.
selma'o GOI (Chapter 8)
Specifies the beginning of a relative phrase, which associates a subordinate sum
ti (following) to another sumti (preceding). See GEhU, NOI.
la djan. goi ko'a cu blanu John (referred to as #1) is blue.
selma'o GOhA (Chapter 7)
A general selma'o for all cmavo which can take the place of brivla. There are se
veral groups of these.
A: mi klama le zarci B: mi go'i A: I'm going to the market. B: Me, too.
selma'o GUhA (Chapter 14)
Indicates the beginning of two logically connected tanru units. Takes the place
of GA when forming logically-connected tanru. See GI.
la .alis. gu'e ricfu gi blanu Alice is both rich and blue.
selma'o I (Chapter 19)
Separates two sentences from each other.
mi klama le zarci .i mi klama le zdani I go-to the market. I go-to the offic
e.
selma'o JA (Chapter 14)
Specifies a logical connection (e.g. ``and'', ``or'', ``if'') between two tanru
units, mathematical operands, tenses, or abstractions.
ti blanu je zdani This is-blue and a-house.
selma'o JAI (Chapter 9)
When followed by a tense or modal, creates a conversion operator attachable to a
selbri which exchanges the modal place with the x1 place of the selbri. When al
one, is a conversion operator exchanging the x1 place of the selbri (which shoul
d be an abstract sumti) with one of the places of the abstracted-over bridi.
mi jai gau galfi le bitmu skari I am-the-actor-in modifying the wall color.
I act so as to modify the wall color. I change the color of the wall.
selma'o JOI (Chapter 14)
Specifies a non-logical connection (e.g. together-with-as-mass, -set, or -sequen
ce) between two sumti, tanru units, or various other things. When immediately fo
llowed by GI, provides forethought non-logical connection analogous to GA.
la djan. joi la .alis. cu bevri le pipno John massed-with Alice carry the pi
ano.
selma'o JOhI (Chapter 18)
Indicates that the following mathematical operands (a list terminated by TEhU) f
orm a mathematical vector (one-dimensional array).
li jo'i paboi reboi te'u su'i jo'i ciboi voboi du li jo'i voboi xaboi the-nu
mber array( one, two ) plus array( three, four) equals the-number array( four, s
ix) (1,2) + (3,4) = (4,6)
selma'o KE (Chapter 5)
Groups everything between itself and a following KEhE for purposes of logical co
nnection, tanru construction, or other purposes. KE and KEhE are not used for ma
thematical (see VEI and VEhO) or discursive (see TO and TOI) purposes.
ta ke melbi cmalu ke'e nixli ckule That is-a-( pretty little ) girl school.
That is a school for girls who are pretty in their littleness.
selma'o KEI (Chapter 11)
Elidable terminator for NU. Marks the end of an abstraction bridi.
la djan. cu nu sonci kei djica John is-an-(event-of being-a-soldier) type-of
desirer. John wants to be a soldier.
selma'o KEhE (Chapter 5)
Elidable terminator for KE. Marks the end of a grouping.
ta ke melbi cmalu ke'e nixli ckule That is-a-( pretty little ) girl school.
That is a school for girls who are pretty in their littleness.
selma'o KI (Chapter 10)
When preceded by a tense, makes it ``sticky'', so that it applies to all further
bridi until reset by another appearance of KI. When alone, eliminates all stick
y tenses.
selma'o KOhA (Chapter 7)
A general selma'o which contains all cmavo which can substitute for sumti. These
cmavo are divided into several groups.
le blanu zdani goi ko'a cu barda .i ko'a na cmamau ti The blue house (referr
ed to as #1) is big. #1 is-not smaller-than this-thing.
selma'o KU (Chapter 6, Chapter 10)
Elidable terminator for LE and some uses of LA. Indicates the end of a descripti
on sumti. Also used after a tense or modal to indicate that no sumti follows, an
d in the compound NA+KU to indicate natural language-style negation.
le prenu ku le zdani ku klama The person, to-the house, goes. The person goe
s to the house.
selma'o KUhE (Chapter 18)
Elidable terminator for PEhO: indicates the end of a forethought mathematical ex
pression (one in which the operator precedes the operands).
li pe'o su'i reboi reboi re[boi] ku'e du li xa The number [forethought] the-
sum-of two two two [end] equals the-number six.
selma'o KUhO (Chapter 8)
Elidable terminator for NOI. Indicates the end of a relative clause.
le zdani poi blanu ku'o barda The house which is-blue is-big.
selma'o LA (Chapter 5)
Descriptors which change name words (or selbri) into sumti which identify people
or things by name. Similar to LE. May be terminated with KU if followed by a de
scription selbri.
la kikeros. du la tulis. Cicero is Tully.
selma'o LAU (Chapter 17)
Cmavo which combine with the following alphabetic letter to represent a single m
arker: change from lower to upper case, change of font, punctuation, etc.)
tau sy. .ibu [single-shift] ``s'' ``i'' Si (chemical symbol for silicon)
selma'o LAhE (Chapter 6)
Qualifiers which, when prefixed to a sumti, change it into another sumti with re
lated meaning. Qualifiers can also consist of a cmavo from selma'o NAhE plus BO.
See LUhU.
mi viska la'e zoi kuot. A Tale of Two Cities .kuot I see that-represented-by
the-text `` A Tale of Two Cities ''. I see the book ``A Tale of Two Cities''.
selma'o LE (Chapter 6)
Descriptors which make selbri into sumti which describe or specify things that f
it into the x1 place of the selbri. See LA, KU.
le gerku klama le zdani The dog goes-to the house.
selma'o LEhU (Chapter 19)
Indicates the end of a quotation begun with LOhU. Not an elidable terminator.
lo'u mi du do du mi le'u cu na lojbo drani [quote] mi du do du mi [unquote]
is-not Lojbanically correct. ``mi du do du mi'' is not correct Lojban.
selma'o LI (Chapter 18)
Descriptors which change numbers or other mathematical expressions into sumti wh
ich specify numbers or numerical expressions. See LOhO.
li re su'u re na du li vo su'i vo the-number 2 minus 2 not equals the-number
4 plus 4. $2 - 2 <> 4 + 4$
selma'o LIhU (Chapter 19)
Elidable terminator for LU. Indicates the end of a text quotation.
mi cusku lu mi klama le zarci li'u I express [quote] I go-to the market [end
quote].
selma'o LOhO (Chapter 18)
Elidable terminator for LI. Indicates the end of a mathematical expression used
in a LI description.
li vo lo'o li ci lo'o cu zmadu The-number 4 [end number], the number 3 [end
number], is greater. $4 > 3$
selma'o LOhU (Chapter 19)
Indicates the beginning of a quotation (a sumti) which is grammatical as long as
the quoted material consists of Lojban words, whether they form a text or not.
Terminated by LEhU.
do cusku lo'u mi du do du ko'a le'u You express [quote] mi du do du ko'a [en
d quote]. You said, ``mi du do du ko'a''.
selma'o LU (Chapter 19)
Indicates the beginning of a quotation (a sumti) which is grammatical only if th
e quoted material also forms a grammatical Lojban text. Terminated by LIhU.
mi cusku lu mi klama le zarci li'u I express [quote] I go-to the market [end
quote].
selma'o LUhU (Chapter 6)
Elidable terminator for LAhE and NAhE+BO. Indicates the end of a qualified sumti
.
mi viska la'e lu barda gerku li'u lu'u I see the-referent-of [quote] big dog
[end quote] [end ref] I saw ``Big Dog'' [not the words, but a book or movie].
selma'o MAI (Chapter 18, Chapter 19)
When suffixed to a number or string of letter words, produces a free modifier wh
ich serves as an index number within a text.
pamai mi pu klama le zarci 1-thly, I [past] go to-the market. First, I went
to the market.
selma'o MAhO (Chapter 18)
Produces a mathematical operator from a letter or other operand. See TEhU, VUhU.
ma'o fy. boi xy. [operator] f x f(x)
selma'o ME (Chapter 5, Chapter 18)
Produces a tanru unit from a sumti, which is applicable to the things referenced
by the sumti. See MEhU.
ta me la ford. karce That is-a-Ford-type car That's a Ford car.
selma'o MEhU (Chapter 5)
The elidable terminator for ME. Indicates the end of a sumti converted to a tanr
u unit.
ta me mi me'u zdani That's a me type of house.
selma'o MOI (Chapter 5)
Suffixes added to numbers or other quantifiers to make various numerically-based
selbri.
la djan. joi la frank. cu bruna remei John in-a-mass-with Frank are-a-brothe
r-type-of twosome. John and Frank are two brothers.
selma'o MOhE (Chapter 18)
Produces a mathematical operand from a sumti; used to make dimensioned units. Te
rminated by TEhU.
li mo'e re ratcu su'i mo'e re ractu du li mo'e vo danlu the-number two rats
plus two rabbits equals the-number four animals 2 rats + 2 rabbits = 4 animals.
selma'o MOhI (Chapter 10)
A tense flag indicating movement in space, in a direction specified by a followi
ng FAhA cmavo.
le verba mo'i ri'u cadzu le bisli The child [movement] [right] walks-on the
ice. The child walks toward my right on the ice.
selma'o NA (Chapter 14, Chapter 15)
Contradictory negators, asserting that a whole bridi is false (or true).
mi na klama le zarci It is not true that I go to the market.
Also used to construct logical connective compound cmavo.
selma'o NAI (Chapter 14, Chapter 15)
Negates the previous word, but can only be used with certain selma'o as specifie
d by the grammar.
selma'o NAhE (Chapter 15)
Scalar negators, modifying a selbri or a sumti to a value other than the one sta
ted, the opposite of the one stated, etc. Also used with following BO to constru
ct a sumti qualifier; see LAhE.
ta na'e blanu zdani That is-a-non- blue house. That is a house which is othe
r than blue.
selma'o NAhU (Chapter 18)
Creates a mathematical operator from a selbri. See VUhU.
li na'u tanjo te'u vei pai fe'i re [ve'o] du li ci'i the-number the-operator
tangent ( pi / 2 ) = the-number infinity tan(pi/2) = infinity
selma'o NIhE (Chapter 18)
Creates a mathematical operand from a selbri, usually a ``ni'' abstraction. Term
inated by TEhU.
li ni'e ni clani [te'u] pi'i ni'e ni ganra [te'u] pi'i ni'e ni condi te'u du
li ni'e ni canlu the-number quantity-of length times quantity-of width times qu
antity-of depth equals the-number quantity-of volume. Length x Width x Depth = V
olume
selma'o NIhO (Chapter 19)
Marks the beginning of a new paragraph, and indicates whether it contains old or
new subject matter.
selma'o NOI (Chapter 8)
Relative clause introducers. The following bridi modifies the preceding sumti. T
erminated by KUhO. See GOI.
le zdani poi blanu cu cmalu The house which is blue is small.
selma'o NU (Chapter 11)
Abstractors which, when prefixed to a bridi, create abstraction selbri. Terminat
ed by KEI.
la djan. cu djica le nu sonci [kei] John desires the event-of being-a-soldie
r.
selma'o NUhA (Chapter 18)
Creates a selbri from a mathematical operator. See VUhU.
li ni'umu cu nu'a va'a li ma'umu the-number -5 is-the-negation-of the-number
+5
selma'o NUhI (Chapter 14, Chapter 16)
Marks the beginning of a termset, which is used to make simultaneous claims invo
lving two or more different places of a selbri.
mi klama nu'i le zarci le briju nu'u .e le zdani le ckule [nu'u] I go [start
] to-the market from-the office [joint] and to-the house from-the school.
selma'o NUhU (Chapter 14)
Elidable terminator for NUhI. Marks the end of a termset, and is also used in th
e middle of one to mark the termset joint (a logical or non-logical connective).
mi klama nu'i le zarci le briju nu'u .e le zdani le ckule [nu'u] I go [start
] to-the market from-the office [joint] and to-the house from-the school.
selma'o PA (Chapter 18)
Digits and related quantifiers (some, all, many, etc.).
mi speni re ninmu I am-married-to two women.
selma'o PEhE (Chapter 14)
Precedes a logical or non-logical connective that joins two termsets. Termsets (
see CEhE) are used to associate several terms for logical connectives, for equal
quantifier scope, or for special constructs in tenses.
mi ce'e do pe'e je la djan. ce'e la djeimyz. cu pendo I [,] you [joint] and
John [,] James are-friends-of. I am a friend of you, and John is a friend of Jam
es.
selma'o PEhO (Chapter 18)
An optional signal of forethought mathematical operators, which precede their op
erands. Terminated by KUhE.
li vo du li pe'o su'i reboi re The-number four equals the-number [forethough
t] sum-of two two
selma'o PU (Chapter 10)
Specifies simple time directions (future, past, or neither).
mi pu klama le zarci I [past] go-to the market. I went to the market.
selma'o RAhO (Chapter 7)
The pro-bridi update flag: changes the meaning of sumti implicitly attached to a
pro-bridi (see GOhA) to fit the current context rather than the original contex
t.
A: mi ba lumci le mi karce B: mi go'i A: I [future] wash my car. B: I do-the
-same (i.e. wash A's car). A: mi ba lumci le mi karce B: mi go'i ra'o A: I [futu
re] wash my car. B: I do-the-corresponding-thing (i.e. wash B's car).
selma'o ROI (Chapter 10)
A particle which, when suffixed to a number, makes an extensional tense (e.g. on
ce, twice, many times).
mi reroi klama le zarci I twice go-to the market.
selma'o SA (Chapter 19)
A particle which erases the previous phrase or sentence.
mi klama sa do klama le zarci I go, er, you go-to the market.
selma'o SE (Chapter 5)
Particles which convert a selbri, rearranging the order of places by exchanging
the x1 place with a specified numbered place.
le zarci se klama mi The market is-gone-to by me.
Also used in constructing connective and modal compound cmavo.
selma'o SEI (Chapter 19)
Marks the beginning of metalinguistic insertions which comment on the main bridi
. Terminated by SEhU.
la frank. prami sei gleki [se'u] la djein. Frank loves (he is happy) Jane.
selma'o SEhU (Chapter 19)
Elidable terminator for SEI and SOI. Ends metalinguistic insertions.
la frank. prami sei gleki se'u la djein. Frank loves (he is happy) Jane.
selma'o SI (Chapter 19)
A particle which erases the previous single word.
mi si do klama le zarci I, er, you go to-the market.
selma'o SOI (Chapter 7)
Marks reciprocity between two sumti (like ``vice versa'' in English).
mi prami do soi mi I love you [reciprocally] me. I love you and vice versa.
selma'o SU (Chapter 19)
A particle which closes and erases the entire previous discourse.
selma'o TAhE (Chapter 10)
A tense modifier specifying frequencies within an interval of time or space (reg
ularly, habitually, etc.).
le verba ta'e klama le ckule The child habitually goes to-the school.
selma'o TEI (Chapter 17)
Signals the beginning of a compound letter word, which acts grammatically like a
single letter. Compound letter words end with the non-elidable selma'o FOI.
tei .ebu .akut. bu foi ( ``e'' ``acute'' ) the letter ``e'' with an acute ac
cent
selma'o TEhU (Chapter 18)
Elidable terminator for JOhI, MAhO, MOhE, NAhU, or NIhE. Marks the end of a math
ematical conversion construct.
li jo'i paboi reboi te'u su'i jo'i ciboi voboi du li jo'i voboi xaboi the-nu
mber array( one, two ) plus array( three, four) equals the-number array( four, s
ix) (1,2) + (3,4) = (4,6)
selma'o TO (Chapter 19)
Left discursive parenthesis: allows inserting a digression. Terminated by TOI.
doi lisas. mi djica le nu to doi frank. ko sisti toi do viska le mlatu O Lis
a, I desire the event-of ( O Frank, [imperative] stop! ) you see the cat. Lisa,
I want you to (Frank! Stop!) see the cat.
selma'o TOI (Chapter 19)
Elidable terminator for TO. The right discursive parenthesis.
doi lisas. mi djica le nu to doi frank. ko sisti toi do viska le mlatu O Lis
a, I desire the event-of ( O Frank, [imperative] stop! ) you see the cat. Lisa,
I want you to (Frank! Stop!) see the cat.
selma'o TUhE (Chapter 19)
Groups multiple sentences or paragraphs into a logical unit. Terminated by TUhU.
xagmau zo'u tu'e ganai cidja gi cnino .i ganai vanju gi tolci'o [tu'u] is-be
st : [start] If food, then new. If wine, then old. As for what is best: if food,
then new [is best]; if wine, then old [is best].
selma'o TUhU (Chapter 19)
Logical terminator for TUhE. Marks the end of a multiple sentence group.
selma'o UI (Chapter 13)
Particles which indicate the speaker's emotional state or source of knowledge, o
r the present stage of discourse.
.ui la djan. klama [Happiness!] John is-coming. Hurrah! John is coming!
selma'o VA (Chapter 10)
A tense indicating distance in space (near, far, or neither).
le nanmu va batci le gerku The man [medium distance] bites the dog. Over the
re the man is biting the dog.
selma'o VAU (Chapter 14)
Elidable terminator for a simple bridi, or for each bridi-tail of a GIhA logical
connection.
mi dunda le cukta [vau] gi'e lebna lo rupnu vau do [vau] I (give the book) a
nd (take some currency-units) to/from you.
selma'o VEI (Chapter 18)
Left mathematical parenthesis: groups mathematical operations. Terminated by VEh
O.
li vei ny. su'i pa ve'o pi'i vei ny. su'i pa [ve'o] du li ny. [bo] te'a re s
u'i re bo pi'i ny. su'i pa the-number ( ``n'' plus one ) times ( ``n'' plus one
) equals the-number n-power-two plus two-times-``n'' plus 1 $(n + 1)(n + 1) = n2
+ 2n + 1$
selma'o VEhA (Chapter 10)
A tense indicating the size of an interval in space (long, medium, or short).
selma'o VEhO (Chapter 19)
Elidable terminator for VEI: right mathematical parenthesis.
li vei ny. su'i pa ve'o pi'i vei ny. su'i pa [ve'o] du li ny. [bo] te'a re s
u'i re bo pi'i ny. su'i pa the-number ( ``n'' plus one ) times ( ``n'' plus one
) equals the-number n-power-two plus two-times-``n'' plus 1 $(n + 1)(n + 1) = n2
+ 2n + 1$
selma'o VIhA (Chapter 10)
A tense indicating dimensionality in space (line, plane, volume, or space-time i
nterval).
le verba ve'a vi'a cadzu le bisli The child [medium space interval] [2-dimen
sional] walks-on the ice. In a medium-sized area, the child walks on the ice.
selma'o VUhO (Chapter 8)
A mechanism for attaching relative clauses or phrases to a whole (possibly conne
cted) sumti, rather than simply to the leftmost portion of the sumti.
la frank. ce la djordj. vu'o noi gidva cu zvati le kumfa Frank [in-set-with]
George, which are-guides, are-in the room. Frank and George, who are guides, ar
e in the room.
selma'o VUhU (Chapter 18)
Mathematical operators (e.g. $+$, $-$). See MAhO.
li mu vu'u re du li ci The-number 5 minus 2 equals the-number 3. 5 - 2 = 3
selma'o XI (Chapter 18)
The subscript marker: the following number or lerfu string is a subscript for wh
atever precedes it.
xy. xi re x sub 2 $x2$
selma'o Y (Chapter 19)
Hesitation noise: content-free, but holds the floor or continues the conversatio
n. It is different from silence in that silence may be interpreted as having not
hing more to say.
doi .y. .y. djan O, uh, uh, John!
selma'o ZAhO (Chapter 10)
A tense modifier specifying the contour of an event (e.g. beginning, ending, con
tinuing).
mi pu'o damba I [inchoative] fight. I'm on the verge of fighting.
selma'o ZEI (Chapter 4)
A morphological glue word, which joins the two words it stands between into the
equivalent of a lujvo.
ta xy. zei kantu kacma That is-an-(X - ray) camera That is an X-ray camera.
selma'o ZEhA (Chapter 10)
A tense indicating the size of an interval in time (long, medium, or short).
mi puze'a citka I [past] [short interval] eat. I ate for a little while.
selma'o ZI (Chapter 10)
A tense indicating distance in time (a long, medium or short time ago or in the
future).
mi puzi citka I [past] [short distance] eat. I ate a little while ago.
selma'o ZIhE (Chapter 8)
Joins multiple relative phrases or clauses which apply to the same sumti. Althou
gh it is generally translated with ``and'', it is not considered a logical conne
ctive.
mi ponse pa gerku ku poi blabi zi'e noi mi prami ke'a I own one dog such-tha
t it-is-white and such-that-incidentally I love it. I own a dog that is white an
d which, incidentally, I love. I own a white dog, which I love.
selma'o ZO (Chapter 19)
Single-word quotation: quotes the following single Lojban word.
zo si cu lojbo valsi The-word ``si'' is-a-Lojbanic word.
selma'o ZOI (Chapter 19)
Non-Lojban quotation: quotes any text using a delimiting word (which can be any
single Lojban word) placed before and after the text. The delimiting word must n
ot appear in the text, and must be separated from the text by pauses.
zoi kuot. Socrates is mortal .kuot. cu glico jufra The-text `` Socrates is m
ortal '' is-an-English sentence.
selma'o ZOhU (Chapter 16, Chapter 19)
Separates a logical prenex from a bridi or group of sentences to which it applie
s. Also separates a topic from a comment in topic/comment sentences.
su'o da poi remna ro da poi finpe zo'u da prami de For-at-least-one X which
is-a-man, for-all Ys which are-fish : X loves Y There is a man who loves all fis
h.
Chapter 21
Formal Grammars
The following two listings constitute the formal grammar of Lojban. The first ve
rsion is written in the YACC language, which is used to describe parsers, and ha
s been used to create a parser for Lojban texts. This parser is available from t
he Logical Language Group. The second listing is in Extended Backus-Naur Form (E
BNF) and represents the same grammar in a more human-readable form. (In case of
discrepancies, the YACC version is official.) There is a cross-reference listing
for the EBNF format that shows, for each selma'o and rule, which rules refer to
it.
1. YACC Grammar of Lojban
/* LOJBAN MACHINE GRAMMAR, 3RD BASELINE AS OF 10 JANUARY 1997
WHICH IS ORIGINAL BASELINE 20 JULY 1990 INCORPORATING JC'S TECH FIXES 1-28
THIS DRAFT ALSO INCORPORATES CHANGE PROPOSALS 1-47 DATED 29 DECEMBER 1996
THIS DOCUMENT IS EXPLICITLY DEDICATED TO THE PUBLIC DOMAIN
BY ITS AUTHOR, THE LOGICAL LANGUAGE GROUP INC.
CONTACT THAT ORGANIZATION AT 2904 BEAU LANE, FAIRFAX VA 22031 USA
U.S. PHONE: 703-385-0273
INTL PHONE: +1 703 385-0273
grammar.300 */
/* The Lojban machine parsing algorithm is a multi-step process. The YACC machin
e grammar presented here is an amalgam of those steps, concatenated so as to all
ow YACC to verify the syntactic ambiguity of the grammar. YACC is used to genera
te a parser for a portion of the grammar, which is LALR1 (the type of grammar th
at YACC is designed to identify and process successfully), but most of the rest
of the grammar must be parsed using some language-coded processing.
Step 1 - Lexing
From phonemes, stress, and pause, it is possible to resolve Lojban unambiguously
into a stream of words. Any machine processing of speech will have to have some
way to deal with 'non-Lojban' failures of fluent speech, of course. The resolve
d words can be expressed as a text file, using Lojban's phonetic spelling rules.
The following steps, assume that there is the possibility of non-Lojban text wit
hin the Lojban text (delimited appropriately). Such non-Lojban text may not be r
educible from speech phonetically. However, step 2 allows the filtering of a pho
netically transcribed text stream, to recognize such portions of non-Lojban text
where properly delimited, without interference with the parsing algorithm.
Step 2 - Filtering
From start to end, performing the following filtering and lexing tasks using the
given order of precedence in case of conflict:
a. If the Lojban word ``zoi'' (selma'o ZOI) is identified, take the following Lo
jban word (which should be end delimited with a pause for separation from the fo
llowing non-Lojban text) as an opening delimiter. Treat all text following that
delimiter, until that delimiter recurs *after a pause*, as grammatically a singl
e token (labelled 'anything_699' in this grammar). There is no need for processi
ng within this text except as necessary to find the closing delimiter.
b. If the Lojban word ``zo'' (selma'o ZO) is identified, treat the following Loj
ban word as a token labelled 'any_word_698', instead of lexing it by its normal
grammatical function.
c. If the Lojban word ``lo'u'' (selma'o LOhU) is identified, search for the clos
ing delimiter ``le'u'' (selma'o LEhU), ignoring any such closing delimiters abso
rbed by the previous two steps. The text between the delimiters should be treate
d as the single token 'any_words_697'.
d. Categorize all remaining words into their Lojban selma'o category, including
the various delimiters mentioned in the previous steps. In all steps after step
2, only the selma'o token type is significant for each word.
e. If the word ``si'' (selma'o SI) is identified, erase it and the previous word
(or token, if the previous text has been condensed into a single token by one o
f the above rules).
f. If the word ``sa'' (selma'o SA) is identified, erase it and all preceding tex
t as far back as necessary to make what follows attach to what precedes. (This r
ule is hard to formalize and may receive further definition later.)
g. If the word 'su' (selma'o SU) is identified, erase it and all preceding text
back to and including the first preceding token word which is in one of the selm
a'o: NIhO, LU, TUhE, and TO. However, if speaker identification is available, a
SU shall only erase to the beginning of a speaker's discourse, unless it occurs
at the beginning of a speaker's discourse. (Thus, if the speaker has said someth
ing, two ``su'''s are required to erase the entire conversation.
Step 3 - Termination
If the text contains a FAhO, treat that as the end-of-text and ignore everything
that follows it.
Step 4 - Absorption of Grammar-Free Tokens
In a new pass, perform the following absorptions (absorption means that the toke
n is removed from the grammar for processing in following steps, and optionally
reinserted, grouped with the absorbing token after parsing is completed).
a. Token sequences of the form any - (ZEI - any) ..., where there may be any num
ber of ZEIs, are merged into a single token of selma'o BRIVLA.
b. Absorb all selma'o BAhE tokens into the following token. If they occur at the
end of text, leave them alone (they are errors).
c. Absorb all selma'o BU tokens into the previous token. Relabel the previous to
ken as selma'o BY.
d. If selma'o NAI occurs immediately following any of tokens UI or CAI, absorb t
he NAI into the previous token.
e. Absorb all members of selma'o DAhO, FUhO, FUhE, UI, Y, and CAI into the previ
ous token. All of these null grammar tokens are permitted following any word of
the grammar, without interfering with that word's grammatical function, or causi
ng any effect on the grammatical interpretation of any other token in the text.
Indicators at the beginning of text are explicitly handled by the grammar.
Step 5 - Insertion of Lexer Lexemes
Lojban is not in itself LALR1. There are words whose grammatical function is det
ermined by following tokens. As a result, parsing of the YACC grammar must take
place in two steps. In the first step, certain strings of tokens with defined gr
ammars are identified, and either
a. are replaced by a single specified 'lexer token' for step 6, or
b. the lexer token is inserted in front of the token string to identify it uniqu
ely.
The YACC grammar included herein is written to make YACC generation of a step 6
parser easy regardless of whether a. or b. is used. The strings of tokens to be
labelled with lexer tokens are found in rule terminals labelled with numbers bet
ween 900 and 1099. These rules are defined with the lexer tokens inserted, with
the result that it can be verified that the language is LALR1 under option b. af
ter steps 1 through 4 have been performed. Alternatively, if option a. is to be
used, these rules are commented out, and the rule terminals labelled from 800 to
900 refer to the lexer tokens *without* the strings of defining tokens. Two set
s of lexer tokens are defined in the token set so as to be compatible with eithe
r option.
In this step, the strings must be labelled with the appropriate lexer tokens. Or
der of inserting lexer tokens *IS* significant, since some shorter strings that
would be marked with a lexer token may be found inside longer strings. If the to
kens are inserted before or in place of the shorter strings, the longer strings
cannot be identified.
If option a. is chosen, the following order of insertion works correctly (it is
not the only possible order): A, C, D, B, U, E, H, I, J, K, M ,N, G, O, V, W, F,
P, R, T, S, Y, L, Q. This ensures that the longest rules will be processed firs
t; a PA+MAI will not be seen as a PA with a dangling MAI at the end, for example
.
Step 6 - YACC Parsing
YACC should now be able to parse the Lojban text in accordance with the rule ter
minals labelled from 1 to 899 under option 5a, or 1 to 1099 under option 5b. Com
ment out the rules beyond 900 if option 5a is used, and comment out the 700-seri
es of lexer-tokens, while restoring the series of lexer tokens numbered from 900
up.
*/
%token A_501 /* eks; basic afterthought logical connectives */
%token BAI_502 /* modal operators */
%token BAhE_503 /* next word intensifier */
%token BE_504 /* sumti link to attach sumti to a selbri */
%token BEI_505 /* multiple sumti separator between BE, BEI */
%token BEhO_506 /* terminates BE/BEI specified descriptors */
%token BIhI_507 /* interval component of JOI */
%token BO_508 /* joins two units with shortest scope */
%token BRIVLA_509 /* any brivla */
%token BU_511 /* turns any word into a BY lerfu word */
%token BY_513 /* individual lerfu words */
%token CAhA_514 /* specifies actuality/potentiality of tense */
%token CAI_515 /* afterthought intensity marker */
%token CEI_516 /* pro-bridi assignment operator */
%token CEhE_517 /* afterthought term list connective */
%token CMENE_518 /* names; require consonant end, then pause no
LA or DOI selma'o embedded, pause before if
vowel initial and preceded by a vowel */
%token CO_519 /* tanru inversion */
%token COI_520 /* vocative marker permitted inside names; must
always be followed by pause or DOI */
%token CU_521 /* separator between head sumti and selbri */
%token CUhE_522 /* tense/modal question */
%token DAhO_524 /* cancel anaphora/cataphora assignments */
%token DOI_525 /* vocative marker */
%token DOhU_526 /* terminator for DOI-marked vocatives */
%token FA_527 /* modifier head generic case tag */
%token FAhA_528 /* superdirections in space */
%token FAhO_529 /* normally elided 'done pause' to indicate end
of utterance string */
%token FEhE_530 /* space interval mod flag */
%token FEhU_531 /* ends bridi to modal conversion */
%token FIhO_532 /* marks bridi to modal conversion */
%token FOI_533 /* end compound lerfu */
%token FUhE_535 /* open long scope for indicator */
%token FUhO_536 /* close long scope for indicator */
%token GA_537 /* geks; forethought logical connectives */
%token GEhU_538 /* marker ending GOI relative clauses */
%token GI_539 /* forethought medial marker */
%token GIhA_541 /* logical connectives for bridi-tails */
%token GOI_542 /* attaches a sumti modifier to a sumti */
%token GOhA_543 /* pro-bridi */
%token GUhA_544 /* GEK for tanru units, corresponds to JEKs */
%token I_545 /* sentence link */
%token JA_546 /* jeks; logical connectives within tanru */
%token JAI_547 /* modal conversion flag */
%token JOI_548 /* non-logical connectives */
%token KEhE_550 /* right terminator for KE groups */
%token KE_551 /* left long scope marker */
%token KEI_552 /* right terminator, NU abstractions */
%token KI_554 /* multiple utterance scope for tenses */
%token KOhA_555 /* sumti anaphora */
%token KU_556 /* right terminator for descriptions, etc. */
%token KUhO_557 /* right terminator, NOI relative clauses */
%token LA_558 /* name descriptors */
%token LAU_559 /* lerfu prefixes */
%token LAhE_561 /* sumti qualifiers */
%token LE_562 /* sumti descriptors */
%token LEhU_565 /* possibly ungrammatical text right quote */
%token LI_566 /* convert number to sumti */
%token LIhU_567 /* grammatical text right quote */
%token LOhO_568 /* elidable terminator for LI */
%token LOhU_569 /* possibly ungrammatical text left quote */
%token LU_571 /* grammatical text left quote */
%token LUhU_573 /* LAhE close delimiter */
%token ME_574 /* converts a sumti into a tanru_unit */
%token MEhU_575 /* terminator for ME */
%token MOhI_577 /* motion tense marker */
%token NA_578 /* bridi negation */
%token NAI_581 /* attached to words to negate them */
%token NAhE_583 /* scalar negation */
%token NIhO_584 /* new paragraph; change of subject */
%token NOI_585 /* attaches a subordinate clause to a sumti */
%token NU_586 /* abstraction */
%token NUhI_587 /* marks the start of a termset */
%token NUhU_588 /* marks the middle and end of a termset */
%token PEhE_591 /* afterthought termset connective prefix */
%token PU_592 /* directions in time */
%token RAhO_593 /* flag for modified interpretation of GOhI */
%token ROI_594 /* converts number to extensional tense */
%token SA_595 /* metalinguistic eraser to the beginning of
the current utterance */
%token SE_596 /* conversions */
%token SEI_597 /* metalinguistic bridi insert marker */
%token SEhU_598 /* metalinguistic bridi end marker */
%token SI_601 /* metalinguistic single word eraser */
%token SOI_602 /* reciprocal sumti marker */
%token SU_603 /* metalinguistic eraser of the entire text */
%token TAhE_604 /* tense interval properties */
%token TEI_605 /* start compound lerfu */
%token TO_606 /* left discursive parenthesis */
%token TOI_607 /* right discursive parenthesis */
%token TUhE_610 /* multiple utterance scope mark */
%token TUhU_611 /* multiple utterance end scope mark */
%token UI_612 /* attitudinals, observationals, discursives */
%token VA_613 /* distance in space-time */
%token VAU_614 /* end simple bridi or bridi-tail */
%token VEhA_615 /* space-time interval size */
%token VIhA_616 /* space-time dimensionality marker */
%token VUhO_617 /* glue between logically connected sumti
and relative clauses */
%token XI_618 /* subscripting operator */
%token Y_619 /* hesitation */
%token ZAhO_621 /* event properties - inchoative, etc. */
%token ZEhA_622 /* time interval size tense */
%token ZEI_623 /* lujvo glue */
%token ZI_624 /* time distance tense */
%token ZIhE_625 /* conjoins relative clauses */
%token ZO_626 /* single word metalinguistic quote marker */
%token ZOI_627 /* delimited quote marker */
%token ZOhU_628 /* prenex terminator (not elidable) */
%token BIhE_650 /* prefix for high-priority MEX operator */
%token BOI_651 /* number or lerfu-string terminator */
%token FUhA_655 /* reverse Polish flag */
%token GAhO_656 /* open/closed interval markers for BIhI */
%token JOhI_657 /* flags an array operand */
%token KUhE_658 /* MEX forethought delimiter */
%token MAI_661 /* change numbers to utterance ordinals */
%token MAhO_662 /* change MEX expressions to MEX operators */
%token MOI_663 /* change number to selbri */
%token MOhE_664 /* change sumti to operand, inverse of LI */
%token NAhU_665 /* change a selbri into an operator */
%token NIhE_666 /* change selbri to operand; inverse of MOI */
%token NUhA_667 /* change operator to selbri; inverse of MOhE */
%token PA_672 /* numbers and numeric punctuation */
%token PEhO_673 /* forethought (Polish) flag */
%token TEhU_675 /* closing gap for MEX constructs */
%token VEI_677 /* left MEX bracket */
%token VEhO_678 /* right MEX bracket */
%token VUhU_679 /* MEX operator */
%token any_words_697 /* a string of lexable Lojban words */
%token any_word_698 /* any single lexable Lojban words */
%token anything_699 /* a possibly unlexable phoneme string */
/* The following tokens are the actual lexer tokens. The _900 series
tokens are duplicates that allow limited testing of lexer rules in the
context of the total grammar. They are used in the actual parser, where
the 900 series rules are found in the lexer. */
%token lexer_A_701 /* flags a MAI utterance ordinal */
%token lexer_B_702 /* flags an EK unless EK_BO, EK_KE */
%token lexer_C_703 /* flags an EK_BO */
%token lexer_D_704 /* flags an EK_KE */
%token lexer_E_705 /* flags a JEK */
%token lexer_F_706 /* flags a JOIK */
%token lexer_G_707 /* flags a GEK */
%token lexer_H_708 /* flags a GUhEK */
%token lexer_I_709 /* flags a NAhE_BO */
%token lexer_J_710 /* flags a NA_KU */
%token lexer_K_711 /* flags an I_BO (option. JOIK/JEK lexer tags)*/
%token lexer_L_712 /* flags a PA, unless MAI (then lexer A) */
%token lexer_M_713 /* flags a GIhEK_BO */
%token lexer_N_714 /* flags a GIhEK_KE */
%token lexer_O_715 /* flags a modal operator BAI or compound */
%token lexer_P_716 /* flags a GIK */
%token lexer_Q_717 /* flags a lerfu_string unless MAI (then lexer_A)
*/
%token lexer_R_718 /* flags a GIhEK, not BO or KE */
%token lexer_S_719 /* flags simple I */
%token lexer_T_720 /* flags I_JEK */
%token lexer_U_721 /* flags a JEK_BO */
%token lexer_V_722 /* flags a JOIK_BO */
%token lexer_W_723 /* flags a JOIK_KE */
/* %token lexer_X_724 /* null */
%token lexer_Y_725 /* flags a PA_MOI */
%start text_0
%%
text_0 : text_A_1
| indicators_411 text_A_1
| free_modifier_32 text_A_1
| cmene_404 text_A_1
| indicators_411 free_modifier_32 text_A_1
| NAI_581 text_0
;
text_A_1 : JOIK_JEK_422 text_B_2
/* incomplete JOIK_JEK without preceding I */
/* compare note on paragraph_10 */
| text_B_2
;
text_B_2 : I_819 text_B_2
| I_JEK_820 text_B_2
| I_BO_811 text_B_2
| para_mark_410 text_C_3
| text_C_3
;
text_C_3 : paragraphs_4
/* Only indicators which follow certain selma'o:
cmene, TOI_607, LU_571, and the lexer_K and lexer_S I_roots and compounds,
and at the start of text(_0), will survive the lexer; all other valid ones
will be absorbed. The only strings for which indicators generate a
potential ambiguity are those which contain NAI. An indicator cannot be
inserted in between a token and its negating NAI, else you can't tell
whether it is the indicator or the original token being negated. */
| /* empty */
/* An empty text is legal; formerly this was handled by the explicit
appearance of FAhO_529, but this is now absorbed by the preparser. */
;
paragraphs_4 : paragraph_10
| paragraph_10 para_mark_410 paragraphs_4
;
paragraph_10 : statement_11
| fragment_20
| paragraph_10 I_819 statement_11
| paragraph_10 I_819 fragment_20
| paragraph_10 I_819
/* this last fixes an erroneous start to a sentence,
and permits incomplete JOIK_JEK after I, as well
in answer to questions on those connectives */
;
statement_11 : statement_A_12
| prenex_30 statement_11
;
statement_A_12 : statement_B_13
| statement_A_12 I_JEK_820 statement_B_13
| statement_A_12 I_JEK_820
;
statement_B_13 : statement_C_14
| statement_C_14 I_BO_811 statement_B_13
| statement_C_14 I_BO_811
;
statement_C_14 : sentence_40
| TUhE_447 text_B_2 TUhU_gap_454
| tag_491 TUhE_447 text_B_2 TUhU_gap_454
;
fragment_20 : EK_802
| NA_445
| GIhEK_818
| quantifier_300
| terms_80 VAU_gap_456 /* answer to ma */
/* mod_head_490 requires both gap_450 and VAU_gap_456
but needs no extra rule to accomplish this */
| relative_clauses_121
| links_161
| linkargs_160
| prenex_30
;
bridi_tail_50 : bridi_tail_A_51
| bridi_tail_A_51 GIhEK_KE_814 bridi_tail_50
KEhE_gap_466 tail_terms_71
;
bridi_tail_A_51 : bridi_tail_B_52
| bridi_tail_A_51 GIhEK_818 bridi_tail_B_52
tail_terms_71
;
bridi_tail_B_52 : bridi_tail_C_53
| bridi_tail_C_53 GIhEK_BO_813 bridi_tail_B_52
tail_terms_71
;
bridi_tail_C_53 : gek_sentence_54
| selbri_130 tail_terms_71
;
gek_sentence_54 : GEK_807 subsentence_41
GIK_816 subsentence_41 tail_terms_71
| tag_491 KE_493 gek_sentence_54 KEhE_gap_466
| NA_445 gek_sentence_54
;
tail_terms_71 : terms_80 VAU_gap_456
| VAU_gap_456
;
terms_80 : terms_A_81
| terms_80 terms_A_81
;
terms_A_81 : terms_B_82
| terms_A_81 PEhE_494 JOIK_JEK_422 terms_B_82
;
terms_B_82 : term_83
| terms_B_82 CEhE_495 term_83
;
term_83 : sumti_90
| modifier_84
| term_set_85
| NA_KU_810
;
modifier_84 : mod_head_490 gap_450
| mod_head_490 sumti_90
;
term_set_85 : NUhI_496 terms_80 NUhU_gap_460
| NUhI_496 GEK_807 terms_80 NUhU_gap_460
GIK_816 terms_80 NUhU_gap_460
;
sumti_90 : sumti_A_91
| sumti_A_91 VUhO_497 relative_clauses_121
;
sumti_A_91 : sumti_B_92
| sumti_B_92 EK_KE_804 sumti_90 KEhE_gap_466
| sumti_B_92 JOIK_KE_823 sumti_90 KEhE_gap_466
;
sumti_B_92 : sumti_C_93
| sumti_B_92 JOIK_EK_421 sumti_C_93
;
sumti_C_93 : sumti_D_94
| sumti_D_94 EK_BO_803 sumti_C_93
| sumti_D_94 JOIK_BO_822 sumti_C_93
;
sumti_D_94 : sumti_E_95
| GEK_807 sumti_90 GIK_816 sumti_D_94
;
sumti_E_95 : sumti_F_96
| sumti_F_96 relative_clauses_121
/* indefinite sumti */
| quantifier_300 selbri_130 gap_450
| quantifier_300 selbri_130
gap_450 relative_clauses_121
;
sumti_F_96 : sumti_G_97
/* outer-quantified sumti */
| quantifier_300 sumti_G_97
;
sumti_G_97 : qualifier_483 sumti_90 LUhU_gap_463
| qualifier_483 relative_clauses_121
sumti_90 LUhU_gap_463
/*sumti grouping, set/mass/individual conversion */
/*also sumti scalar negation */
| anaphora_400
| LA_499 cmene_404
| LA_499 relative_clauses_121 cmene_404
| LI_489 MEX_310 LOhO_gap_472
| description_110
| quote_arg_432
;
tanru_unit_150 : tanru_unit_A_151
| tanru_unit_150 CEI_444 tanru_unit_A_151
;
tanru_unit_A_151 : tanru_unit_B_152
| tanru_unit_B_152 linkargs_160
;
tanru_unit_B_152 : bridi_valsi_407
| KE_493 selbri_C_133 KEhE_gap_466
| SE_480 tanru_unit_B_152
| JAI_478 tag_491 tanru_unit_B_152
| JAI_478 tanru_unit_B_152
| ME_477 sumti_90 MEhU_gap_465
| ME_477 sumti_90 MEhU_gap_465 MOI_476
| NUhA_475 MEX_operator_374
| NAhE_482 tanru_unit_B_152
| NU_425 subsentence_41 KEI_gap_453
;
/* Main entry point for MEX; everything but a number must be in parens. */
quantifier_300 : number_812 BOI_gap_461
| left_bracket_470 MEX_310 right_bracket_gap_471
;
/* Entry point for MEX used after LI; no parens needed, but LI now has an
elidable terminator. (This allows us to express the difference between
``the expression a + b'' and ``the expression (a + b)''_) */
/* This rule supports left-grouping infix expressions and reverse Polish
expressions. To handle infix monadic, use a null operand; to handle
infix with more than two operands (whatever that means) use an extra
operator or an array operand. */
MEX_310 : MEX_A_311
| MEX_310 operator_370 MEX_A_311
| FUhA_441 rp_expression_330
;
/* Support for right-grouping (short scope) infix expressions with BIhE. */
MEX_A_311 : MEX_B_312
| MEX_B_312 BIhE_439 operator_370 MEX_A_311
;
/* Support for forethought (Polish) expressions. These begin with a
forethought flag, then the operator and then the argument(s). */
MEX_B_312 : operand_381
| operator_370 MEX_C_313 MEX_gap_452
| PEhO_438 operator_370 MEX_C_313 MEX_gap_452
;
MEX_C_313 : MEX_B_312
| MEX_C_313 MEX_B_312
;
rp_operand_332 : operand_381
| rp_expression_330
;
operand_381 : operand_A_382
| operand_A_382 EK_KE_804 operand_381 KEhE_gap_466
| operand_A_382 JOIK_KE_823 operand_381 KEhE_gap_466
;
operand_A_382 : operand_B_383
| operand_A_382 JOIK_EK_421 operand_B_383
;
operand_B_383 : operand_C_385
| operand_C_385 EK_BO_803 operand_B_383
| operand_C_385 JOIK_BO_822 operand_B_383
;
operand_C_385 : quantifier_300
| lerfu_string_817 BOI_gap_461
/* lerfu string as operand - classic math variable */
| NIhE_428 selbri_130 TEhU_gap_473
/* quantifies a bridi - inverse of -MOI */
| MOhE_427 sumti_90 TEhU_gap_473
/* quantifies a sumti - inverse of LI */
| JOhI_431 MEX_C_313 TEhU_gap_473
| GEK_807 operand_381 GIK_816 operand_C_385
| qualifier_483 operand_381 LUhU_gap_463
;
/* _400 series constructs are mostly specific strings, some of which may
also be used by the lexer; the lexer should not use any reference to
terminals numbered less than _400, as they have grammars composed on
non-deterministic strings of selma'o. Some above _400 also are this
way, so care should be taken; this is especially true for those that
reference free_modifier_32. */
anaphora_400 : KOhA_555
| KOhA_555 free_modifier_32
| lerfu_string_817 BOI_gap_461
;
cmene_404 : cmene_A_405
| cmene_A_405 free_modifier_32
;
cmene_A_405 : CMENE_518 /* pause */
| cmene_A_405 CMENE_518 /* pause*/
/* multiple CMENE are identified morphologically (by the lexer) -- separated by
consonant & pause */
;
bridi_valsi_407 : bridi_valsi_A_408
| bridi_valsi_A_408 free_modifier_32
;
bridi_valsi_A_408 : BRIVLA_509
| PA_MOI_824
| GOhA_543
| GOhA_543 RAhO_593
;
para_mark_410 : NIhO_584
| NIhO_584 free_modifier_32
| NIhO_584 para_mark_410
;
indicators_411 : indicators_A_412
| FUhE_535 indicators_A_412
;
indicators_A_412 : indicator_413
| indicators_A_412 indicator_413
;
indicator_413 : UI_612
| CAI_515
| UI_612 NAI_581
| CAI_515 NAI_581
| Y_619
| DAhO_524
| FUhO_536
;
DOI_415 : DOI_525
| COI_416
| COI_416 DOI_525
;
COI_416 : COI_A_417
| COI_416 COI_A_417
;
COI_A_417 : COI_520
| COI_520 NAI_581
;
JOIK_EK_421 : EK_802
| JOIK_806
| JOIK_806 free_modifier_32
;
JOIK_JEK_422 : JOIK_806
| JOIK_806 free_modifier_32
| JEK_805
| JEK_805 free_modifier_32
;
XI_424 : XI_618
| XI_618 free_modifier_32
;
NU_425 : NU_A_426
| NU_425 JOIK_JEK_422 NU_A_426
;
NU_A_426 : NU_586
| NU_586 NAI_581
| NU_586 free_modifier_32
| NU_586 NAI_581 free_modifier_32
;
MOhE_427 : MOhE_664
| MOhE_664 free_modifier_32
;
NIhE_428 : NIhE_666
| NIhE_666 free_modifier_32
;
NAhU_429 : NAhU_665
| NAhU_665 free_modifier_32
;
MAhO_430 : MAhO_662
| MAhO_662 free_modifier_32
;
JOhI_431 : JOhI_657
| JOhI_657 free_modifier_32
;
quote_arg_432 : quote_arg_A_433
| quote_arg_A_433 free_modifier_32
;
quote_arg_A_433 : ZOI_quote_434
| ZO_quote_435
| LOhU_quote_436
| LU_571 text_0 LIhU_gap_448
;
/* The quoted material in the following three terminals must be identified by
the lexer, but no additional lexer processing is needed. */
ZOI_quote_434 : ZOI_627 any_word_698
/*pause*/ anything_699 /*pause*/ any_word_6
98
;
/* 'pause' is morphemic, represented by '.' The lexer assembles anything_699 */
ZO_quote_435 : ZO_626 any_word_698
;
/* 'word' may not be a compound; but it can be any valid Lojban selma'o value,
including ZO, ZOI, SI, SA, SU. The preparser will not lex the word per its
normal selma'o. */
LOhU_quote_436 : LOhU_569 any_words_697 LEhU_565
;
/* 'words' may be any Lojban words, with no claim of grammaticality; the
preparser will not lex the individual words per their normal selma'o;
used to quote ungrammatical Lojban, equivalent to the * or ? writing
convention for such text. */
/* The preparser needs one bit of sophistication for this rule. A
quoted string should be able to contain other quoted strings - this is
only a problem for a LOhU quote itself, since the LEhU clossing this
quote would otherwise close the outer quotes, which is incorrect. For
this purpose, we will cheat on the use of ZO in such a quote (since this
is ungrammatical text, it is a sign ignored by the parser). Use ZO to
mark any nested quotation LOhU. The preparser then will absorb it by
the ZO rule, before testing for LOhU. This is obviously not the
standard usage for ZO, which would otherwise cause the result to be a
sumti. But, since the result will be part of an unparsed string anyway,
it doesn't matter. */
/* It may be seen that any of the ZO/ZOI/LOhU trio of quotation markers
may contain the powerful metalinguistic erasers. Since these quotations
are not parsed internally, these operators are ignored within the quote.
To erase a ZO, then, two SI's are needed after giving a quoted word of
any type. ZOI takes four SI's, with the ENTIRE BODY OF THE QUOTE
treated as a single 'word' since it is one selma'o. Thus one for the
quote body, two for the single word delimiters, and one for the ZOI. In
LOhU, the entire body is treated as a single word, so three SI's can
erase it. */
/* All rule terminator names with 'gap' in them are potentially
elidable, where such elision does not cause an ambiguity. This is
implemented through use of the YACC 'error' token, which effectively
recovers from an elision. */
FIhO_437 : FIhO_532
| FIhO_532 free_modifier_32
;
PEhO_438 : PEhO_673
| PEhO_673 free_modifier_32
;
BIhE_439 : BIhE_650
| BIhE_650 free_modifier_32
;
SEI_440 : SEI_597
| SEI_597 free_modifier_32
;
FUhA_441 : FUhA_655
| FUhA_655 free_modifier_32
;
BEI_442 : BEI_505
| BEI_505 free_modifier_32
;
CO_443 : CO_519
| CO_519 free_modifier_32
;
CEI_444 : CEI_516
| CEI_516 free_modifier_32
;
NA_445 : NA_578
| NA_578 free_modifier_32
;
BE_446 : BE_504
| BE_504 free_modifier_32
;
TUhE_447 : TUhE_610
| TUhE_610 free_modifier_32
;
LIhU_gap_448 : LIhU_567
| error
;
gap_450 : KU_556
| KU_556 free_modifier_32
| error
;
front_gap_451 : CU_521
| CU_521 free_modifier_32
;
MEX_gap_452 : KUhE_658
| KUhE_658 free_modifier_32
| error
;
KEI_gap_453 : KEI_552
| KEI_552 free_modifier_32
| error
;
TUhU_gap_454 : TUhU_611
| TUhU_611 free_modifier_32
| error
;
VAU_gap_456 : VAU_614
| VAU_614 free_modifier_32
| error
;
/* redundant to attach a free modifier on the following */
DOhU_gap_457 : DOhU_526
| error
;
FEhU_gap_458 : FEhU_531
| FEhU_531 free_modifier_32
| error
;
SEhU_gap_459 : SEhU_598
| error
/* a free modifier on a discursive should be somewhere within the discursive.
See SEI_440 */
;
NUhU_gap_460 : NUhU_588
| NUhU_588 free_modifier_32
| error
;
BOI_gap_461 : BOI_651
| BOI_651 free_modifier_32
| error
;
sub_gap_462 : BOI_651
| error
;
LUhU_gap_463 : LUhU_573
| LUhU_573 free_modifier_32
| error
;
GEhU_gap_464 : GEhU_538
| GEhU_538 free_modifier_32
| error
;
MEhU_gap_465 : MEhU_575
| MEhU_575 free_modifier_32
| error
;
KEhE_gap_466 : KEhE_550
| KEhE_550 free_modifier_32
| error
;
BEhO_gap_467 : BEhO_506
| BEhO_506 free_modifier_32
| error
;
TOI_gap_468 : TOI_607
| error
;
KUhO_gap_469 : KUhO_557
| KUhO_557 free_modifier_32
| error
;
left_bracket_470 : VEI_677
| VEI_677 free_modifier_32
;
right_bracket_gap_471 : VEhO_678
| VEhO_678 free_modifier_32
| error
;
LOhO_gap_472 : LOhO_568
| LOhO_568 free_modifier_32
| error
;
TEhU_gap_473 : TEhU_675
| TEhU_675 free_modifier_32
| error
;
right_br_no_free_474 : VEhO_678
| error
;
NUhA_475 : NUhA_667
| NUhA_667 free_modifier_32
;
MOI_476 : MOI_663
| MOI_663 free_modifier_32
;
ME_477 : ME_574
| ME_574 free_modifier_32
;
JAI_478 : JAI_547
| JAI_547 free_modifier_32
;
BO_479 : BO_508
| BO_508 free_modifier_32
;
SE_480 : SE_596
| SE_596 free_modifier_32
;
FA_481 : FA_527
| FA_527 free_modifier_32
;
NAhE_482 : NAhE_583
| NAhE_583 free_modifier_32
;
qualifier_483 : LAhE_561
| LAhE_561 free_modifier_32
| NAhE_BO_809
;
NOI_484 : NOI_585
| NOI_585 free_modifier_32
;
GOI_485 : GOI_542
| GOI_542 free_modifier_32
;
subscript_486 : XI_424 number_812 sub_gap_462
| XI_424 left_bracket_470 MEX_310
right_br_no_free_474
| XI_424 lerfu_string_817 sub_gap_462
;
ZIhE_487 : ZIhE_625
| ZIhE_625 free_modifier_32
;
LE_488 : LE_562
| LE_562 free_modifier_32
;
LI_489 : LI_566
| LI_566 free_modifier_32
;
mod_head_490 : tag_491
| FA_481
;
tag_491 : tense_modal_815
| tag_491 JOIK_JEK_422 tense_modal_815
;
ZOhU_492 : ZOhU_628
| ZOhU_628 free_modifier_32
;
KE_493 : KE_551
| KE_551 free_modifier_32
;
PEhE_494 : PEhE_591
| PEhE_591 free_modifier_32
;
CEhE_495 : CEhE_517
| CEhE_517 free_modifier_32
;
NUhI_496 : NUhI_587
| NUhI_587 free_modifier_32
;
VUhO_497 : VUhO_617
| VUhO_617 free_modifier_32
;
SOI_498 : SOI_602
| SOI_602 free_modifier_32
;
LA_499 : LA_558
| LA_558 free_modifier_32
;
utterance_ordinal_801 : lexer_A_905
;
EK_802 : lexer_B_910
| lexer_B_910 free_modifier_32
;
EK_BO_803 : lexer_C_915
| lexer_C_915 free_modifier_32
;
EK_KE_804 : lexer_D_916
| lexer_D_916 free_modifier_32
;
JEK_805 : lexer_E_925
;
JOIK_806 : lexer_F_930
;
GEK_807 : lexer_G_935
| lexer_G_935 free_modifier_32
;
GUhEK_808 : lexer_H_940
| lexer_H_940 free_modifier_32
;
NAhE_BO_809 : lexer_I_945
| lexer_I_945 free_modifier_32
;
NA_KU_810 : lexer_J_950
| lexer_J_950 free_modifier_32
;
I_BO_811 : lexer_K_955
| lexer_K_955 free_modifier_32
;
number_812 : lexer_L_960
;
GIhEK_BO_813 : lexer_M_965
| lexer_M_965 free_modifier_32
;
GIhEK_KE_814 : lexer_N_966
| lexer_N_966 free_modifier_32
;
tense_modal_815 : lexer_O_970
| lexer_O_970 free_modifier_32
| FIhO_437 selbri_130 FEhU_gap_458
;
GIK_816 : lexer_P_980
| lexer_P_980 free_modifier_32
;
lerfu_string_817 : lexer_Q_985
;
GIhEK_818 : lexer_R_990
| lexer_R_990 free_modifier_32
;
I_819 : lexer_S_995
| lexer_S_995 free_modifier_32
;
I_JEK_820 : lexer_T_1000
| lexer_T_1000 free_modifier_32
;
JEK_BO_821 : lexer_U_1005
| lexer_U_1005 free_modifier_32
;
JOIK_BO_822 : lexer_V_1010
| lexer_V_1010 free_modifier_32
;
JOIK_KE_823 : lexer_W_1015
| lexer_W_1015 free_modifier_32
;
PA_MOI_824 : lexer_Y_1025
;
/* The following rules are used only in lexer processing. They have been
tested for ambiguity at various levels in the YACC grammar, but are in
the recursive descent lexer in the current parser. The lexer inserts
the lexer tokens before the processed strings, but leaves the original
tokens. */
lexer_A_905 : lexer_A_701 utt_ordinal_root_906
;
utt_ordinal_root_906 : lerfu_string_root_986 MAI_661
| number_root_961 MAI_661
;
simple_JOIK_JEK_957 : JOIK_806
| JEK_805
;
/* no freemod in this version; cf. JOIK_JEK_422 */
/* this reference to a version of JOIK and JEK
which already have the lexer tokens attached
prevents shift/reduce errors. The problem is
resolved in a hard-coded parser implementation
which builds lexer_K, before lexer_S, before
lexer_E and lexer_F. */
number_root_961 : PA_672
| number_root_961 PA_672
| number_root_961 lerfu_word_987
;
lexer_M_965 : lexer_M_713 GIhEK_root_991 BO_508
| lexer_M_713 GIhEK_root_991 simple_tag_971 BO_508
;
simple_tense_modal_972 : simple_tense_modal_A_973
| NAhE_583 simple_tense_modal_A_973
| KI_554
| CUhE_522
;
simple_tense_modal_A_973: modal_974
| modal_974 KI_554
| tense_A_977
;
modal_974 : modal_A_975
| modal_A_975 NAI_581
;
modal_A_975 : BAI_502
| SE_596 BAI_502
;
tense_A_977 : tense_B_978
| tense_B_978 KI_554
;
tense_B_978 : tense_C_979
| CAhA_514
| tense_C_979 CAhA_514
;
/* specifies actuality/potentiality of the bridi */
/* puca'a = actually was */
/* baca'a = actually will be */
/* bapu'i = can and will have */
/* banu'o = can, but won't have yet */
/* canu'ojebapu'i = can, hasn't yet, but will */
tense_C_979 : time_1030
/* time-only */
/* space defaults to time-space reference space */
| space_1040
/* can include time if specified with VIhA */
/* otherwise time defaults to the time-space reference time */
| time_1030 space_1040
/* time and space - If space_1040 is marked with
VIhA for space-time the tense may be self-contradictory */
/* interval prop before space_time is for time distribution */
| space_1040 time_1030
;
lexer_P_980 : lexer_P_716 GIK_root_981
;
GIK_root_981 : GI_539
| GI_539 NAI_581
;
lexer_Q_985 : lexer_Q_717 lerfu_string_root_986
;
lerfu_string_root_986 : lerfu_word_987
| lerfu_string_root_986 lerfu_word_987
| lerfu_string_root_986 PA_672
;
lerfu_word_987 : BY_513
| LAU_559 lerfu_word_987
| TEI_605 lerfu_string_root_986 FOI_533
;
GIhEK_root_991 : GIhA_541
| SE_596 GIhA_541
| NA_578 GIhA_541
| GIhA_541 NAI_581
| SE_596 GIhA_541 NAI_581
| NA_578 GIhA_541 NAI_581
| NA_578 SE_596 GIhA_541
| NA_578 SE_596 GIhA_541 NAI_581
;
time_1030 : ZI_624
| ZI_624 time_A_1031
| time_A_1031
;
time_A_1031 : time_B_1032
| time_interval_1034
| time_B_1032 time_interval_1034
;
time_B_1032 : time_offset_1033
| time_B_1032 time_offset_1033
;
time_offset_1033 : time_direction_1035
| time_direction_1035 ZI_624
;
time_interval_1034 : ZEhA_622
| ZEhA_622 time_direction_1035
| time_int_props_1036
| ZEhA_622 time_int_props_1036
| ZEhA_622 time_direction_1035 time_int_props_1036
;
time_direction_1035 : PU_592
| PU_592 NAI_581
;
time_int_props_1036 : interval_property_1051
| time_int_props_1036 interval_property_1051
;
space_1040 : space_A_1042
| space_motion_1041
| space_A_1042 space_motion_1041
;
space_offset_1045 : space_direction_1048
| space_direction_1048 VA_613
;
space_intval_1046 : space_intval_A_1047
| space_intval_A_1047 space_direction_1048
| space_int_props_1049
| space_intval_A_1047 space_int_props_1049
| space_intval_A_1047 space_direction_1048
space_int_props_1049
;
space_intval_A_1047 : VEhA_615
| VIhA_616
| VEhA_615 VIhA_616
;
space_direction_1048 : FAhA_528
| FAhA_528 NAI_581
;
space_int_props_1049 : space_int_props_A_1050
| space_int_props_1049 space_int_props_A_1050
;
space_int_props_A_1050 : FEhE_530 interval_property_1051
;
/* This terminal gives an interval size in space-time (VEhA), and possibly a
dimensionality of the interval. The dimensionality may also be used
with the interval size left unspecified. When this terminal is used for the
spacetime origin, then barring any overriding VIhA, a VIhA here defines
the dimensionality of the space-time being discussed. */
*/
%%
2. EBNF Grammar of Lojban
Lojban Machine Grammar, EBNF Version, 3rd Baseline as of 10 January 1997
This document is explicitly dedicated to the public domain by its author, the Lo
gical Language Group Inc. Contact that organization at: 2904 Beau Lane, Fairfax
VA 22031 USA 703-385-0273 (intl: +1 703 385 0273)
Explanation of notation: All rules have the form:
namenumber = bnf-expression