Programming Languages
Version 0.7
Mike Grant
Zachary Palmer
Scott Smith
http://www.cs.jhu.edu/~scott/pl/book/dist
Contents

Preface

1 Introduction
  1.1 The Pre-History of Programming Languages
  1.2 A Brief Early History of Languages
  1.3 This Book

2 Operational Semantics
  2.1 A First Look at Operational Semantics
  2.2 BNF grammars and Syntax
    2.2.1 Operational Semantics for Logic Expressions
    2.2.2 Abstract Syntax
    2.2.3 Operational Semantics and Interpreters
  2.3 The F[ Programming Language
    2.3.1 F[ Syntax
    2.3.2 Variable Substitution
    2.3.3 Operational Semantics for F[
    2.3.4 The Expressiveness of F[
    2.3.5 Russell's Paradox and Encoding Recursion
    2.3.6 Call-By-Name Parameter Passing
    2.3.7 F[ Abstract Syntax
  2.4 Operational Equivalence
    2.4.1 Defining Operational Equivalence
    2.4.2 Properties of Operational Equivalence
    2.4.3 Examples of Operational Equivalence
    2.4.4 The λ-Calculus

7 Concurrency
  7.1 Overview
    7.1.1 The Java Concurrency Model
  7.2 The Actor Model and AF[V
    7.2.1 Syntax of AF[V
    7.2.2 An Example
    7.2.3 Operational Semantics of Actors
    7.2.4 The Local Rules
    7.2.5 The Global Rule
    7.2.6 The Atomicity of Actors

Bibliography

Index
Preface
• The OCaml Manual [16], in particular the first two sections of Part I and
the first two sections of Part IV.
The OCaml manual is complete but terse. Hickey’s book may be your antidote
if you want a more descriptive explanation than that provided in the manual.
The FbDK

Complementing the book is the F[ Development Kit, FbDK. It is a set of Caml utilities and interpreters for designing and experimenting with the toy F[ and F[SR languages defined in the book. It is available from the book homepage at http://www.cs.jhu.edu/~scott/pl/book/dist, and is documented in Appendix A.
Background Needed

The book assumes familiarity with the basics of Caml, including the module system (but not the objects, the "O" in OCaml). Beyond that there is no absolute prerequisite, but knowledge of C, C++, and Java is helpful because many of the topics in this book are implemented in these languages. The compiler presented in chapter 8 produces C code as its target, and so a very basic knowledge of C will be needed to implement the compiler. More nebulously, a certain "mathematical maturity" greatly helps in understanding the concepts, some of which are deep. For this reason, previous study of mathematics, formal logic and other foundational topics in Computer Science such as automata theory, grammars, and algorithms will be a great help.
Chapter 1
Introduction
Chapter 2

Operational Semantics
the formal specification of the language that is used when writing compilers and
interpreters, and it allows us to rigorously verify things about the language.
where each “form” above describes a particular language form – that is, a string
of terminals and non-terminals. A term in the language is a string of terminals
which matches the description of one of these rules (traditionally the first).
For example, consider the language Sheep. Let {S} be the set of nontermi-
nals, {a, b} be the set of terminals, and the grammar definition be:
S ::= b | Sa
That is, any string starting with the character b and followed by zero or more
a characters is a term in Sheep. The following are examples that are not terms
in Sheep:
• a: Terms in Sheep must start with a b.
• bbaaa: Sheep does not allow multiple b characters in a term.
• baah: h is not a terminal in Sheep.
• Saaa: S is a non-terminal in Sheep. Terms may not contain non-terminals.
Another way of expressing a grammar is by the use of a syntax diagram.
Syntax diagrams describe the grammar visually rather than in a textual form.
For example, the following is a syntax diagram for the language Sheep:
[Syntax diagram for Sheep: the non-terminal S leads either to the terminal b, or to S followed by the terminal a.]
The above syntax diagram describes all terms of the Sheep language. To
generate a form of S, one starts at the left side of the diagram and moves until
one reaches the right. The rectangular nodes represent non-terminals while the
rounded nodes represent terminals. Upon reaching a non-terminal node, one
must construct a term using that non-terminal to proceed.
As another example, consider the language Frog. Let {F, G} be the set of
nonterminals, {r, i, b, t} be the set of terminals, and the grammar definition be:
F ::= rF | iG
G ::= bG | bF | t
Note that this is a mutually recursive definition. Note also that each production
rule defines a syntactic category. Terms in Frog include, for example, it, ibit, and ribbit. The syntax diagrams for Frog are as follows:

[Syntax diagrams for Frog: F leads to r followed by F, or to i followed by G; G leads to b followed by G, to b followed by F, or to the terminal t.]
2.2.1 Operational Semantics for Logic Expressions

The grammar of our boolean language is:

e ::= v | Not e | e And e | e Or e | e Implies e
v ::= True | False
Note that the syntax above breaks tradition somewhat by using lower-case letters for non-terminals. Terminals are printed in fixed-width font. The rationale for this is consistency with the metavariables we will be using in the operational semantics below; it will become clear shortly.
We can now discuss the operational semantics of the boolean language. Op-
erational semantics are written in the form of logic rules, which are written as a
series of preconditions above a horizontal line and the conclusion below it. For
example, the logic rule
Red(x)   Shiny(x)
----------------- (Apple Rule)
Apple(x)
indicates that if a thing is red and shiny, then that thing is an apple. This is, of
course, not true; many red, shiny things exist which are not apples. Nonetheless,
it is a valid logical statement. In our work, we will be defining logical rules per-
taining to a programming language; as a result, we have control over the space
in which the rules are constructed. We need not necessarily concern ourselves
with intuitive sense so long as the programming language has a mathematical
foundation.

1 Throughout the book we use syntax very similar to Caml in our toy languages, but with
the convention of capitalizing keywords to avoid potential conflicts with the Caml language.
Operational semantics rules discuss how pieces of code evaluate. For exam-
ple, let us consider the And rule. We may define the following rule for And:

------------------------ (And Rule (Try 1))
True And False ⇒ False

This rule indicates that the boolean language code True And False evalu-
ates to False. The absence of any preconditions above the line means that no
conditions must be met; this operational semantics rule is always true. Rules
with nothing above the line are termed axioms since they have no preconditions
and so the conclusion always holds.
As a rule, though, it isn’t very useful. It only evaluates a very specific
program. This rule does not describe how to evaluate the program True And
True, for instance. In order to generalize our rules to describe a full language and
not just specific terms within the language, we must make use of metavariables.
To maintain consistency with the above BNF grammar, we use metavariables
starting with e to represent expressions and metavariables starting with v to
represent values. We are now ready to make an attempt at describing every
aspect of the And operator using the following new rule:

------------------------------------------ (And Rule (Try 2))
v1 And v2 ⇒ the logical and of v1 and v2

Using this rule, we can successfully evaluate True And False, True And
True, and so on. Note that we have used a textual description to indicate the
value of the expression v1 And v2 ; this is permitted, although most rules in
more complex languages will not use such descriptions.
We very quickly encounter limitations in our approach, however. Consider
the program True And (False And True). If we tried to apply the above rule
to that program, we would have v1 = True and v2 = (False And True). These
two values cannot be applied to logical and as (False and True) is not a
boolean value; it is an expression. Our boolean language rule does not allow for
cases in which the operands to And are expressions. We therefore make another
attempt at the rule:
e1 ⇒ v1    e2 ⇒ v2
------------------------------------------ (And Rule (Try 3))
e1 And e2 ⇒ the logical and of v1 and v2
This rule is almost precisely what we want; in fact, the rule itself is complete.
Intuitively, this rule says that e1 And e2 evaluates to the logical and of the
values represented by e1 and e2 . But consider again the program True And
False, which we expect to evaluate to False. We can see that e1 = True and
that e2 = False, but our evaluation relation does not relate v1 or v2 to any
value. This is because, strictly speaking, we do not know that True ⇒ True.
Of course, we would like that to be the case and, since we are in the process
of defining the language, we can make it so. We simply need to declare it in an
operational semantics rule.
--------- (Value Rule)
v ⇒ v
The value rule above is an axiom declaring that any value always evaluates
to itself. This satisfies our requirement and allows us to make use of the And
rule. Using this formal logic approach, we can now prove that True And (False
And True) ⇒ False as follows:

               False ⇒ False   True ⇒ True
               --------------------------- (And Rule)
True ⇒ True    False And True ⇒ False
------------------------------------------ (And Rule)
True And (False And True) ⇒ False
One may read the above proof tree as an explanation as to why True
And (False And True) evaluates to False. We can choose to read that proof
as follows: “True And (False And True) evaluates to False by the And rule
because we know True evaluates to True, that False And True evaluates to
False, and that the logical and of true and false is false. We know that False
And True evaluates to False by the And rule because True evaluates to True,
False evaluates to False, and the logical and of true and false is false.”
An equivalent and similarly informal format for the above is:
True And ( False And True ) ⇒ False, because by the And rule
True ⇒ True, and
( False And True ) ⇒ False, the latter because
True ⇒ True, and
False ⇒ False
The important thing to note about all three of these representations is that
they are describing a proof tree. The proof tree consists of nodes which rep-
resent the application of logical rules with preconditions as their children. To
complete our boolean language, we define the ⇒ relation using a complete set
of operational semantics rules:
--------- (Value Rule)
v ⇒ v

e ⇒ v
------------------------------ (Not Rule)
Not e ⇒ the negation of v

e1 ⇒ v1    e2 ⇒ v2
------------------------------------------ (And Rule)
e1 And e2 ⇒ the logical and of v1 and v2
The rules for Or and Implies are left as an exercise to the reader (see
Exercise 2.4).
These rules form a proof system as is found in mathematical logic. Logical
rules express incontrovertible logical truths. A proof of e ⇒ v amounts to
constructing a sequence of rule applications such that, for any given application
of a rule, the items above the line appeared earlier in the sequence and such
that the final rule application is e ⇒ v. A proof is structurally a tree, where
each node is a rule, and the subtree rules have conclusions which exactly match
what the parent’s assumptions are. For a proof tree of e ⇒ v, the root rule has
as its conclusion e ⇒ v. Note that all leaves of a proof tree must be axioms. A
tree with a non-axiom leaf is not a proof.
Notice how the above proof tree is expressing how this logic expression could
be computed. A proof of e ⇒ v corresponds closely to how the execution of e
produces the value v as its result. The only difference is that "execution" starts
with e and produces the v, whereas a proof tree describes a relation between e
and v, not a function from e to v.
Lemma 2.2. The boolean language is normalizing: For all boolean expressions
e, there is some value v where e ⇒ v.
to compile; the second divergent program compiles but runs forever. Through-
out this text, we will treat both of these forms of divergence in the same way.
A program is divergent if it does not produce a value; we do not care why.
2.2.2 Abstract Syntax

The abstract syntax of the boolean language can be captured by the following
Caml type:

type boolexp =
    True | False
  | Not of boolexp
  | And of boolexp * boolexp
  | Or of boolexp * boolexp
  | Implies of boolexp * boolexp;;
To understand how the abstract and concrete syntax relate, consider the
following examples:
Example 2.1.
Concrete: True
Abstract: True

Example 2.2.
Concrete: True And False
Abstract: And(True, False)

Example 2.3.
Concrete: (True And False) Implies ((Not True) And False)
Abstract: Implies(And(True, False), And(Not(True), False))
The translation from concrete to abstract syntax, written ⟦e⟧, may be defined
as follows:

⟦True⟧ = True
⟦False⟧ = False
⟦Not e⟧ = Not(⟦e⟧)
⟦e1 And e2⟧ = And(⟦e1⟧, ⟦e2⟧)
⟦e1 Or e2⟧ = Or(⟦e1⟧, ⟦e2⟧)
⟦e1 Implies e2⟧ = Implies(⟦e1⟧, ⟦e2⟧)
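Going the other direction, from abstract syntax back to a concrete string, is simple enough to write directly; here is a sketch over the boolexp type above (the function name unparse is our own):

let rec unparse e =
  match e with
    True -> "True"
  | False -> "False"
  | Not e0 -> "Not (" ^ unparse e0 ^ ")"
  | And (e1, e2) -> "(" ^ unparse e1 ^ " And " ^ unparse e2 ^ ")"
  | Or (e1, e2) -> "(" ^ unparse e1 ^ " Or " ^ unparse e2 ^ ")"
  | Implies (e1, e2) -> "(" ^ unparse e1 ^ " Implies " ^ unparse e2 ^ ")"

Fully parenthesizing the output sidesteps any question of precedence or associativity, which is exactly the issue discussed next.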
A particularly astute reader will have noticed that the parentheses in the
expressions handled until now were not explicitly addressed. This is because
parentheses and other parsing meta-operators are not traditionally mentioned
in the operational semantics of a language. Such operators merely have the effect
of grouping operations. For example, let us assume that binary operations in our
boolean language are left-associative; thus, the expressions True Or True And
False and (True Or True) And False are equivalent. Consider the following
examples:
Example 2.4.
Concrete: True Or True And False
Abstract: And(Or(True, True), False)

Example 2.5.
Concrete: True Or (True And False)
Abstract: Or(True, And(True, False))
The expression in example 2.4 will evaluate to False because one must
evaluate the Or operation first and then evaluate the And operation using the
result. Example 2.5, on the other hand, performs the operations in the opposite
order. Note that in both examples, though, the parentheses themselves are no
longer overtly present in the abstract syntax. This is because they are implicitly
represented in the structure of the AST; that is, the AST in example 2.5 would
not have the shape that it has if the parentheses were not present in the concrete
syntax of the form.
In short, parentheses merely change how expressions are grouped. In example 2.5, the only rule we can match to the entire expression is the Or rule; the And rule obviously can't match because the left parenthesis would be part of e1 while the right parenthesis would be part of e2 (and expressions with unmatched parentheses make no sense). Similarly but less obviously, example 2.4 can only match the And rule; the associativity implicitly forces the Or rule to happen first, giving the And operator that entire expression to evaluate.
Observe that, in the above code, we have changed very little. We modified the eval function to be recursive. We also added a call to eval for each of the operands to the And operation. That call alone is sufficient to fix the problem; the process of evaluating those arguments represents the e1 ⇒ v1 and e2 ⇒ v2 preconditions on the And rule, while the use of the resulting values in the tuple causes the match to be against v1 and v2 rather than e1 and e2. The above code is a faithful implementation of the value rule and the And rule.

We can now complete the boolean language interpreter by continuing the eval function in the same form:
let rec eval exp =
  match exp with
    True -> True
  | False -> False
  | Not(exp0) -> (match eval exp0 with
        True -> False
      | False -> True)
  | And(exp0, exp1) -> (match (eval exp0, eval exp1) with
        (True, True) -> True
      | (_, False) -> False
      | (False, _) -> False)
  | Or(exp0, exp1) -> (match (eval exp0, eval exp1) with
        (* the Or case, analogous to And, completes the evaluator *)
        (False, False) -> False
      | (True, _) -> True
      | (_, True) -> True)
  | Implies(exp0, exp1) -> (match (eval exp0, eval exp1) with
        (False, _) -> True
      | (True, True) -> True
      | (True, False) -> False)
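As a quick sanity check, the evaluator can be exercised in the Caml top-level; the expected results are given in comments:

eval (And (True, Not False));;                (* evaluates to True *)
eval (Implies (False, And (True, True)));;    (* evaluates to True *)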
The only difference between the operational semantics and the interpreter is
that the interpreter is a function. We start with the bottom-left expression in
a rule, use the interpreter to recursively produce the value(s) above the line in
the rule, and finally compute and return the value below the line in the rule.
Note that the boolean language interpreter above faithfully implements its
operational semantics: e ⇒ v if and only if eval ⟦e⟧ returns ⟦v⟧ as its result.
We will go back and forth between these two forms throughout the book. The
operational semantics form is used because it is independent of any particular
programming language. The interpreter form is useful because we can interpret
real programs for nontrivial numbers of steps, something that is difficult to do
“on paper” with an operational semantics.
Definition 2.3 (Metacircular Interpreter). A metacircular interpreter is an interpreter for (possibly a subset of) a language x that is written in language x.
Metacircular interpreters give you some idea of how a language works, but
suffer from the non-foundational problems implied in Exercise 2.5. A metacir-
cular interpreter for Lisp (that is, a Lisp interpreter written in Lisp) is a classic
programming language theory exercise.
• If the algorithm is given arguments for which the function is not defined,
it must either produce a clear error or otherwise not terminate. (That is,
it must not appear to have produced an incorrect value for the function if
no such value is defined.)
2.3.1 F[ Syntax
We will take the same approach in defining F[ as we did in defining the boolean
language above. We start by describing the grammar of the F[ language to
define its concrete syntax; the abstract syntax is deferred until Section 2.3.7.
We can define the grammar of F[ using the following BNF:

e ::= x | v | e e
    | e + e | e - e | e = e
    | e And e | e Or e | Not e
    | If e Then e Else e
    | Let Rec f x = e1 In e2

v ::= n | True | False | Function x -> e    (n an integer, x a variable)
3 This does not guarantee that the F[ representation will be pleasant. Programs written in
F[ to perform even fairly simplistic computations such as determining if one number is less
than another are excruciating, as we will see shortly.
a very simple and accurate description method, and that is what operational semantics is all about – describing clearly and unambiguously how programs are to compute.
should not evaluate to Function x -> 3 since the inner x is bound by the inner
parameter. To correctly formalize this notion, we need to make the following
definitions.
Definition 2.6 (Variable Occurrence). A variable use x occurs in e if x ap-
pears somewhere in e. Note we refer only to variable uses, not definitions.
Definition 2.7 (Bound Occurrence). Any occurrences of variable x in the ex-
pression
Function x -> e
are bound, that is, any free occurrences of x in e are bound occurrences in this
expression. Similarly, in the expression
Let Rec f x = e1 In e2
Function x -> x + 1
Example 2.8.
x, y, and z are all bound in the body of this function. x and y are
bound by their respective function declarations, and z is bound by the Let
statement. Note that F[ does not contain Let as syntax, but it can be
defined as a macro, in Section 2.3.4 below, and from that it is clear that
binding rules work similarly for Functions and Let statements.
Example 2.9.
x is bound in the body of this function. Note that both x usages are
bound to the inner variable x.
Definition 2.9 (Closed Expression). An expression e is closed if it contains
no free variable occurrences. All programs we execute are closed (no link-time
errors).
Of the examples above, Examples 2.6, 2.8, and 2.9 are closed expressions.
Example 2.7 is not a closed expression.
Now that we have an understanding of bound and free variables, we can give
a formal definition of variable substitution.
Definition 2.10 (Variable Substitution). The variable substitution of x for e0
in e, denoted e[e0 /x], is the expression resulting from the operation of replacing
all free occurrences of x in e with e0 . For now, we assume that e0 is a closed
expression.
Here is an equivalent inductive definition of substitution:

x[v/x] = v
x′[v/x] = x′                                          for x ≠ x′
(Function x -> e)[v/x] = (Function x -> e)
(Function x′ -> e)[v/x] = (Function x′ -> e[v/x])     for x ≠ x′
n[v/x] = n                                            for n ∈ Z
True[v/x] = True
False[v/x] = False
(e1 + e2)[v/x] = e1[v/x] + e2[v/x]
(e1 And e2)[v/x] = e1[v/x] And e2[v/x]
...
While this may not seem like an illuminating realization, the fact that this is mathematically discernible gives us a starting point for more complex substitutions. Consider the following example.
Example 2.10.
Expression:
(Function x -> Function y -> (x + x + y)) 5
Substitution:
(Function y -> (x + x + y))[5/x]
= (Function y -> (x + x + y)[5/x])
= Function y -> (x[5/x] + x[5/x] + y[5/x])
= Function y -> (5 + 5 + y)
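Substitution is also directly implementable. Looking ahead to the Caml representation of F[ in Section 2.3.7, the following sketch computes e[v/x] for a closed v, showing a few representative cases:

let rec subst e v x =               (* computes e[v/x], assuming v is closed *)
  match e with
    Var (Ident y) -> if y = x then v else e
  | Function (Ident y, body) ->
      if y = x then e               (* the inner binding shadows x; leave body alone *)
      else Function (Ident y, subst body v x)
  | Appl (e1, e2) -> Appl (subst e1 v x, subst e2 v x)
  | Plus (e1, e2) -> Plus (subst e1 v x, subst e2 v x)
  | Int n -> Int n                  (* constants are untouched *)
  | other -> other                  (* the remaining cases follow the same pattern *)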
α-conversion
In Example 2.9, we saw that it is possible for two variables to share the same
name. The variables themselves are, of course, distinct and follow the same rules
of scope in F[ as they do in Caml. But reading expressions which make frequent
use of the same variable name for different variables can be very disorienting.
For example, consider the following expression.
Let Rec f x =
If x = 1 Then
(Function f -> f (x - 1)) (Function x -> x)
Else
f (x - 1)
In f 100
How does this expression evaluate? It is a bit difficult to tell simply by looking
at it because of the tricky bindings. We can make it much easier to understand
by using different names. α-conversion is the process of replacing a variable
definition and all occurrences bound to it with a variable of a different name.
Example 2.11.
Function x -> x + 1
becomes
Function z -> z + 1
Example 2.11 shows a simple case in which x is renamed to z. For cases in which the same variable name is used numerous times, we can use the same approach. Consider Example 2.12, in which the inner variable x is α-converted to z.
Example 2.12.
Function x -> Function x -> x
becomes
Function x -> Function z -> z
Let Rec f x =
If x = 1 Then
(Function z -> z (x - 1)) (Function y -> y)
Else
f (x - 1)
In f 100
2.3.3 Operational Semantics for F[

--------- (Value Rule)
v ⇒ v
We can also define boolean operations for F[ in the same way as we did for the boolean language above. Note, however, that not all values in F[ are booleans. Fortunately, our definition of the rules addresses this for us, as there is (for example) no logical and of the values 5 and 3. That is, we know that these rules only apply to F[ boolean values because they use operations which are only defined for F[ boolean values.
e ⇒ v
------------------------------ (Not Rule)
Not e ⇒ the negation of v

e1 ⇒ v1    e2 ⇒ v2
------------------------------------------ (And Rule)
e1 And e2 ⇒ the logical and of v1 and v2

...
We can also define operations over integers in much the same way. For sake
of clarity, we will explicitly restrict these rules such that they operate only on
expressions which evaluate to integers.
e1 ⇒ v1, e2 ⇒ v2   where v1, v2 ∈ Z
------------------------------------------ (+ Rule)
e1 + e2 ⇒ the integer sum of v1 and v2

e1 ⇒ v1, e2 ⇒ v2   where v1, v2 ∈ Z
------------------------------------------------ (- Rule)
e1 - e2 ⇒ the integer difference of v1 and v2
As with the boolean rules, observe that these rules allow the ⇒ relation to
be applied recursively: 5 + (4 - 3) can be evaluated using the + rule because
4 - 3 can be evaluated using the - rule first.
These rules allow us to write F[ programs containing boolean expressions
or F[ programs containing integer expressions, but we currently have no way to
combine the two. There are two mechanisms we use to mix the two in a pro-
gram: conditional expressions and comparison operators. The only comparison
operator in F[ is the = operator, which compares the values of integers. We
define the = rule as follows.
e1 ⇒ v1, e2 ⇒ v2   where v1, v2 ∈ Z
------------------------------------------------------- (= Rule)
e1 = e2 ⇒ True if v1 and v2 are identical, else False
Note that the = rule is defined only where v1 and v2 are integers. Due to this constraint, the expression True = True is not evaluatable in F[. This is, of course, a matter of choice; as a language designer, one may choose to remove that constraint and allow boolean values to be compared directly. To formalize this, however, a change to the rules would be required.
e1 ⇒ v1    e2 ⇒ v2    e3 ⇒ v3
------------------------------------------------------- (Flawed If Rule)
If e1 Then e2 Else e3 ⇒ v2 if v1 is True, v3 otherwise
It seems that this rule allows us to evaluate many of the conditional expressions we want to evaluate. But consider, for instance, an expression such as If True Then 0 Else (True + True). Under the flawed rule above, this expression cannot evaluate at all: the Else branch True + True evaluates to no value, yet the rule demands values for both branches. The fix is to split the rule in two, so that only the branch that is actually selected is evaluated:
e1 ⇒ True, e2 ⇒ v2
----------------------------- (If True Rule)
If e1 Then e2 Else e3 ⇒ v2

e1 ⇒ False, e3 ⇒ v3
----------------------------- (If False Rule)
If e1 Then e2 Else e3 ⇒ v3
Again, the key difference between these two rules is that they have different
sets of preconditions. Note that the If True rule does not evaluate e3 , nor
does the If False rule evaluate e2 . This allows the untraveled logic paths to
contain unevaluatable expressions without necessarily preventing the expression
containing them from evaluating.
Application
We are now ready to approach one of the most difficult F[ rules: application.
How can we formalize the evaluation of an expression like (Function x ->
x + 1) (5 + 2)? We saw in Section 2.3.2 that we can evaluate a function
?
------------ (Application Rule (Part 1))
e1 e2 ⇒ v

e1 ⇒ v1    e2 ⇒ v2    ?
------------------------- (Application Rule (Part 2))
e1 e2 ⇒ v

e1 ⇒ Function x -> e    e2 ⇒ v2    ?
------------------------------------- (Application Rule (Part 3))
e1 e2 ⇒ v
4 Actually, some languages would perform substitution before evaluating the expression,
but F[ and most traditional languages do not. Discussion of this approach is handled in
Section 2.3.6.
F[ Recursion
We now have a very complete set of rules for the F[ language. We do not,
however, have a rule for Let Rec. As we will see, Let Rec is not actually
necessary in basic F[; it is possible to build recursion out of the rules we already
have. Later, however, we will create variants of F[ with type systems in which
it will be impossible for that approach to recursion to work. For that reason as
well as our immediate convenience, we will define the Let Rec rule now.
Again, we start with an iterative approach. We know that we want to be able
to evaluate the Let Rec expression, so we provide metavariables to represent
the components of the expression itself.
?
----------------------------- (Recursive Application Rule (Part 1))
Let Rec f x = e1 In e2 ⇒ v
Let Rec f x =
If x = 1 Then
1
Else
f (x - 1) + x
In f 5
If we focus on the last line (In f 5), we can see that we want the body
of the recursive function to be applied to the value 5. We can write our rule
accordingly by replacing f with the function’s body. We must make sure to use
the same metavariable to represent the function’s argument in order to allow
the new function body's variable to be captured. We reach the following rule.

e2[(Function x -> e1)/f] ⇒ v
------------------------------ (Recursive Application Rule (Part 2))
Let Rec f x = e1 In e2 ⇒ v

We can test our new rule by applying it to our recursive summation above.
Let Rec f x = If x = 1 Then 1 Else f (x-1) + x In f 5 ⇒ v, because
  (Function x -> If x = 1 Then 1 Else f (x-1) + x) 5 ⇒ v, because
    Function x -> ··· ⇒ Function x -> ···, and 5 ⇒ 5, and
    If 5 = 1 Then 1 Else f (5-1) + 5 ⇒ v, because
      5 = 1 ⇒ False, and
      f (5-1) + 5 ⇒ v, because
        5 ⇒ 5, and f (5-1) ⇒ v′, because
          ???
As foreshadowed by its label above, our recursion rule is not yet complete.
When we reach the evaluation of f (5-1), we are at a loss; f is not bound.
Without a binding for f, we have no way of repeating the recursion.
In addition to replacing the invocation of f with function application in
e2 , we need to ensure that any occurrences of f in the body of the function
itself are bound. To what do we bind them? We could try to replace them with
function applications as well, but this leads us down a rabbit hole; each function
application would require yet another replacement. We can, however, replace
applications of f in e1 with recursive applications of f by reintroducing the Let
Rec syntax. We thus arrive at the following recursive application rule:

e2[(Function x -> e1[(Let Rec f x = e1 In f)/f])/f] ⇒ v
--------------------------------------------------------- (Let Rec Rule)
Let Rec f x = e1 In e2 ⇒ v
Our rule set so far is summarized below.

--------- (Value Rule)
v ⇒ v

e ⇒ v
------------------------------ (Not Rule)
Not e ⇒ the negation of v

e1 ⇒ v1    e2 ⇒ v2
------------------------------------------ (And Rule)
e1 And e2 ⇒ the logical and of v1 and v2

e1 ⇒ v1, e2 ⇒ v2   where v1, v2 ∈ Z
------------------------------------------ (+ Rule)
e1 + e2 ⇒ the integer sum of v1 and v2

e1 ⇒ v1, e2 ⇒ v2   where v1, v2 ∈ Z
------------------------------------------------------- (= Rule)
e1 = e2 ⇒ True if v1 and v2 are identical, else False

e1 ⇒ True, e2 ⇒ v2
----------------------------- (If True Rule)
If e1 Then e2 Else e3 ⇒ v2

e1 ⇒ False, e3 ⇒ v3
----------------------------- (If False Rule)
If e1 Then e2 Else e3 ⇒ v3

e1 ⇒ Function x -> e,   e2 ⇒ v2,   e[v2/x] ⇒ v
------------------------------------------------ (Application Rule)
e1 e2 ⇒ v
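To connect these rules to running code, here is a sketch of the corresponding clauses of an F[ interpreter over the Caml representation of Section 2.3.7, reusing the subst sketch from earlier; Let Rec and several operators are omitted:

let rec eval e =
  match e with
    Int n -> Int n                          (* Value Rule *)
  | Bool b -> Bool b
  | Function (i, body) -> Function (i, body)
  | Plus (e1, e2) ->                        (* + Rule *)
      (match (eval e1, eval e2) with
         (Int n1, Int n2) -> Int (n1 + n2)
       | _ -> failwith "+ on non-integers")
  | Equal (e1, e2) ->                       (* = Rule *)
      (match (eval e1, eval e2) with
         (Int n1, Int n2) -> Bool (n1 = n2)
       | _ -> failwith "= on non-integers")
  | If (e1, e2, e3) ->
      (match eval e1 with
         Bool true -> eval e2               (* If True Rule: e3 never evaluated *)
       | Bool false -> eval e3              (* If False Rule: e2 never evaluated *)
       | _ -> failwith "If condition is not a boolean")
  | Appl (e1, e2) ->                        (* Application Rule *)
      (match eval e1 with
         Function (Ident x, body) -> eval (subst body (eval e2) x)
       | _ -> failwith "application of a non-function")
  | _ -> failwith "this sketch does not handle that case"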
Let us consider a few examples of proof trees using the F[ operational se-
mantics.
Example 2.14.
Expression:
If 3 = 4 Then 5 Else 4 + 2
Proof:
3 ⇒ 3    4 ⇒ 4              4 ⇒ 4    2 ⇒ 2
----------------- (= Rule)  ------------- (+ Rule)
3 = 4 ⇒ False               4 + 2 ⇒ 6
---------------------------------------------- (If False Rule)
If 3 = 4 Then 5 Else 4 + 2 ⇒ 6
Example 2.15.
Expression:
(Function x -> If 3 = x Then 5 Else x + 2) 4
Proof:

                                                        by Example 2.14
Function x -> ··· ⇒ Function x -> ···    4 ⇒ 4    If 3 = 4 Then 5 Else 4 + 2 ⇒ 6
--------------------------------------------------------------------------------- (Application Rule)
(Function x -> If 3 = x Then 5 Else x + 2) 4 ⇒ 6
Example 2.16.
Expression:
(Function f -> Function x -> f(f(x)))(Function y -> y - 1) 4
Proof (in the informal indented format):

(Function f -> Function x -> f(f(x))) (Function y -> y - 1) 4 ⇒ 2, because
  (Function f -> Function x -> f(f(x))) (Function y -> y - 1)
    ⇒ Function x -> (Function y -> y - 1) ((Function y -> y - 1) x), and
  4 ⇒ 4, and
  (Function y -> y - 1) ((Function y -> y - 1) 4) ⇒ 2, because
    (Function y -> y - 1) 4 ⇒ 3, because 4 ⇒ 4 and 4 - 1 ⇒ 3, and
    (Function y -> y - 1) 3 ⇒ 2, because 3 ⇒ 3 and 3 - 1 ⇒ 2
# Let Rec f x =
If x = 1 Then 1 Else x + f (x - 1)
In f 3;;
==> 6
# Let Rec f x =
If x = 1 Then 1 Else x + f (x - 1)
In f;;
==> Function x ->
If x = 1 Then
1
Else
x + (Let Rec f x =
If x = 1 Then
1
Else
x + (f) (x - 1)
In
f) (x - 1)
Proof. By inspection of the rules, at most one rule can apply at any time.
Proof. To show that a language is not normalizing, we simply show that there is some e such that there is no v with e ⇒ v. Let e be (Function x -> x x) (Function x -> x x). Then e ⇏ v for any v. Thus, F[ is not normalizing.
The expression in this proof is a very interesting one which we will examine
in more detail in Section 2.3.5. It does not evaluate to a value because each step
in its evaluation produces itself as a precondition. This is roughly analogous to
trying to prove proposition A by using A as a given.
In this case, the expression does not evaluate because it never runs out
of work to do. This is not the only kind of non-normalizing expression which
appears in F[; another kind consists of those expressions for which no evaluation
rule applies. For example, (4 3) is a simpler expression that is non-normalizing.
No rule exists to evaluate (e1 e2 ) when e1 is not a function expression.
Both of these cases look like divergence as far as the operational semantics
are concerned. In the case of an interpreter, however, these two kinds of ex-
pressions behave differently. The expression which never runs out of work to
do typically causes an interpreter to loop infinitely (as most interpreters are
not clever enough to realize that they will never finish). The expressions which
attempt application to non-functions, on the other hand, cause an interpreter
to provide a clear error in the form of an exception. This is because the er-
ror here is easily detectable; the interpreter, in attempting to evaluate these
expressions, can quickly discover that none of its available rules apply to the
expression and respond by raising an exception to that effect. Nonetheless, both
are theoretically equivalent.
2.3.4 The Expressiveness of F[

Logical Combinators First, there are the classic logical combinators, simple functions for recombining data.
Tuples Tuples and lists are encodable from just functions, and so they are not needed as primitives in a language. Of course for an efficient implementation you would want them to be primitives; thus doing this encoding is simply an exercise to better understand the nature of functions and tuples. We will define a 2-tuple (pairing) constructor; from a pair you can get an n-tuple by building it from pairs. For example, (1, (2, (3, 4))) represents the 4-tuple (1, 2, 3, 4). First, we need to define a pair constructor, pr. A first (i.e., slightly buggy) approximation of the constructor is as follows.
pr (e1, e2) =def Function x -> x e1 e2
We use the notation a =def b to indicate that a is an abbreviation for b. For example, we might have a problem in which the incrementer function is commonly used; it would make sense, then, to define incx =def Function x -> x + 1. Note that such abbreviations do not change the underlying meaning of the expression; it is simply for convenience. The same concept applies to macro definitions in programming languages. By creating a new macro, one is not changing the math behind the programming language; one is merely defining a more terse means of expressing a concept.
Based on the previous definition of pr, we can define the following macros
for projection:
left (e) =def e (Function x -> Function y -> x)
right (e) =def e (Function x -> Function y -> y)

pr (e1, e2) =def (Function e1 -> Function e2 -> Function x -> x e1 e2) e1 e2
This way, instead of textually substituting for e1 and e2 directly, we pass them in as functions. This allows the interpreter to evaluate e1 and e2 to values before passing them in, and also ensures that e1 and e2 are closed expressions. This eliminates the capture problem, because any occurrence of x is either bound by a function declaration inside e1 or e2, or was bound outside the entire pr expression, in which case it must have already been replaced with a value at the time that the pr subexpression is evaluated. Variable capture is an annoying problem that we will see again in Section 2.4.

Now that we have polished our definitions, let's look at an example of how to use these encodings. First, let's create the pair p as (4, 5).
p =def pr (4, 5) ⇒ Function x -> x 4 5
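The encoding is easy to experiment with in Caml, where functions are also first-class (the function names here are ours):

let pr l r = fun x -> x l r         (* a pair is a function awaiting a selector *)
let left p = p (fun x -> fun y -> x)
let right p = p (fun x -> fun y -> y)

let p = pr 4 5;;
left p + right p;;                  (* evaluates to 9 *)

Note that nothing stops us from applying p to something other than a selector; this is exactly the "applying a pair like a function" misbehavior revisited in Chapter 3.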
This encoding works, and has all the expressiveness of real tuples. There are, nonetheless, a few problems with it.
Lists Lists can also be implemented via pairs. In fact, pairs of pairs are technically needed because we need a flag to mark the end of the list. The list [1; 2; 3] is represented by pr (pr(false,1), pr (pr(false,2), pr (pr(false,3), emptylist))) where emptylist =def pr(pr(true,0),0). The true/false flag is used to mark the end of the list: only the empty list is flagged true. The implementation is as follows.
cons (x, y) =def pr(pr(Bool false, x), y)
emptylist =def pr(pr(Bool true, Int 0), Int 0)
head x =def right(left x)
tail x =def right x
isempty l =def (left (left l))
length =def Let Rec len x =
    If isempty(x) Then 0 Else 1 + len (tail x) In len
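To see how the projections work together, here is a worked unfolding of the macros above, extracting the first element of a one-element list:

head (cons (1, emptylist))
  = right (left (pr (pr (Bool false, 1), emptylist)))
  ⇒ right (pr (Bool false, 1))       since left selects a pair's first component
  ⇒ 1                                since right selects a pair's second component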
In addition to tuples and lists, there are several other concepts from Caml
that we can encode in F[. We review a few of these encodings below. For
brevity and readability, we will switch back to the concrete syntax.
(Let x = e In e′) =def (Function x -> e′) e

For example,

e; e′ =def (Function n -> e′) e,

where n is chosen so as not to be free in e′. This will first execute e, throw away the value, and then execute e′, returning its result as the final result of e; e′.
Freeze e =def Function n -> e
Thaw e =def e 0
We need to make sure that n is a fresh variable so that it is not free in e. Note that the 0 in the application could be any value. Freeze e freezes e, keeping it from being computed. Thaw e starts up a frozen computation. As an example, consider

Let f = Freeze (2 + 3) In (Thaw f) + (Thaw f)

This expression has the same value as the equivalent expression without the freeze and thaw, but the 2 + 3 is evaluated twice. Again, in a pure functional language the only difference is that freezing and thawing is less efficient. In a language with side-effects, if the frozen expression causes a side-effect, then the freeze/thaw version of the function may produce results different from those of the original function, since the frozen side-effects will be applied as many times as they are thawed.
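The same idea can be tried in Caml with an explicit dummy parameter; the print statement makes the repeated evaluation visible (this small sketch is ours):

let thaw e = e 0                            (* Thaw e  =  e 0 *)
let frozen = fun _ -> (print_string "thawing\n"; 2 + 3)   (* Freeze (2 + 3) *)

let result = thaw frozen + thaw frozen;;
(* prints "thawing" twice; result is 10 *)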
2.3.5 Russell's Paradox and Encoding Recursion

Recall from Lemma 2.2 that a corollary to the existence of this expression is that F[ is not normalizing. This computation is odd in some sense. (x x) is a function being applied to itself. There is a logical paradox at the heart of this non-normalizing computation, namely Russell's Paradox.
Russell’s Paradox
In Frege’s set theory (circa 1900), sets were written as predicates P (x). We
can view predicates as single-argument functions which return a boolean value:
true if the argument is in the set represented by the predicate and false if it is
not. Testing membership in a set is done via application of the predicate. For
example, consider the predicate
Function x -> (x = 2 Or x = 3 Or x = 5)
This predicate represents the integer set {2, 3, 5} since it will return True for
any of the elements in that set and False for all other arguments. If we were
to extend F[ to include a native integer less-than operator, the predicate

Function x -> x < 2

would represent an infinitely-sized set containing all integer values less than 2
(as F[ still has no notion of real numbers). In general, given a predicate P
representing a set S,
e ∈ S iff P e ⇒ True
P =def Function x -> Not(x x).
(Function x -> Not (x x)) (Function x -> Not (x x))
evaluates just as
Not ((Function x -> Not (x x)) (Function x -> Not (x x)))
which evaluates just as
Not (Not ((Function x -> Not (x x)) (Function x -> Not (x x))))
and so on, each step wrapping another Not around the original expression.
We know that Not (Not (e)) evaluates to the same value as e.5 We can see that we're going in circles. Again, this statement tells us that P P ⇒ True if and only if P P ⇒ False.
This is not how Russell viewed his paradox, but it has the same core structure; it is simply rephrased in terms of computation, and not set theory. The computational realization of the paradox is that the predicate doesn't compute to true or false, so it's not a sensible logical statement. Russell's discovery of this paradox in Frege's set theory shook the foundations of mathematics. To solve this problem, Russell developed his ramified theory of types, which is the ancestor of types in programming languages. The program

(function x -> not (x x)) (function x -> not (x x))

is not typeable in Caml for the same reason the corresponding predicate is not typeable in Russell's ramified theory of types. Try typing the above code into the Caml top-level and see what happens.
More information on Russell’s Paradox may be found in [15].
expressions are hard to prove. In Section 2.4, we will explore a formal means of determining
if two expressions are equivalent.
summate0 =def Function this -> Function arg ->
    If arg = 0 Then 0 Else arg + this this (arg - 1)
Note the use of this this (arg - 1). The first use of this names the function
to be applied; the second use of this is one of the arguments to that function.
The argument this allows the recursive call to invoke the function again, thus
allowing us to recurse as much as we need.
We can now sum the integers {0, 1, . . . , 7} with the expression
summate0 summate0 7
summate0 always expects its first argument this to be itself. It can then use
one copy for the recursive call (the first this) and pass the other copy on for
future duplication. So summate0 summate0 “primes the pump”, so to speak, by
giving the process an initial extra copy of itself.
Better yet, recall that currying allows us to obtain the inner function without
applying it. In essence, a function with multiple arguments could be partially
evaluated, with some arguments fixed and others waiting for input. We can use
this to our advantage to define the summate function we want:
summate =def summate0 summate0
This allows us to hide the self-passing from the individual using our summate
function, which cleans things up considerably. We can summarize the entire
process as follows using the Let macro we described before:
summate =def Let summ = Function this -> Function arg ->
        If arg = 0 Then 0 Else arg + this this (arg - 1)
    In Function arg -> summ summ arg
We now have a model for defining recursive functions without the use of
the Let Rec operator. This means that untyped languages with no built-in
recursion can still be Turing-complete. While this is an accomplishment, we
can do even better; we can abstract the idea of self-passing almost entirely out
of the body of the function itself.
almostY =def Function body ->
    Let fun = Function this -> Function arg ->
        body this arg
    In Function arg -> fun fun arg
summate =def almostY (Function this -> Function arg ->
    If arg = 0 Then 0 Else arg + this this (arg - 1))
The true Y-combinator actually goes one step further and passes this (this) as argument, not just this, simplifying what we pass to Y:

combY =def Function body ->
    Let fun = Function this -> Function arg -> body (this this) arg
    In Function arg -> fun fun arg

summate =def combY (Function this -> Function arg ->
    If arg = 0 Then 0 Else arg + this (arg - 1))
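As an aside, Caml's type system rejects the raw self-application this this, much as Russell's theory rejects x x. Wrapping the self-referential function in a variant gives the type checker a name to tie the knot with; the following is a sketch of summate in Caml without let rec (the type and names are ours):

type self = Fold of (self -> int -> int)

let summate0 = Fold (fun (Fold this) arg ->
  if arg = 0 then 0 else arg + this (Fold this) (arg - 1))

let summate arg = match summate0 with Fold f -> f summate0 arg;;

summate 7;;   (* evaluates to 28 *)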
2.3.6 Call-By-Name Parameter Passing

(3 - 2) is not evaluated until we are inside the body of the function where it is thawed, and it is then evaluated two separate times. This is precisely the behavior of call-by-name parameter passing, so Freeze and Thaw can encode it by this means. The fact that (3 - 2) is executed twice shows the main weakness of call by name, namely repeated evaluation of the function argument.
Lazy or call-by-need evaluation is a version of call-by-name that caches
evaluated function arguments the first time they are evaluated so it doesn’t
have to re-evaluate them in subsequent uses. Haskell [14, 7] is a pure functional
language with lazy evaluation.
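Caml itself is eager, but its Lazy module provides call-by-need suspensions directly; compare the output here with the freeze/thaw sketch above, which re-evaluated its suspension on every thaw:

let arg = lazy (print_string "evaluating\n"; 3 - 2)

let result = Lazy.force arg + Lazy.force arg;;
(* prints "evaluating" only once; result is 2 *)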
2.3.7 F[ Abstract Syntax

The F[ AST can be represented in Caml with types along the following lines; the constructors shown are the ones used in the examples below:

type ident = Ident of string

type expr =
  Var of ident | Function of ident * expr | Appl of expr * expr |
  Letrec of ident * ident * expr * expr |
  Plus of expr * expr | Minus of expr * expr | Equal of expr * expr |
  And of expr * expr | Or of expr * expr | Not of expr |
  If of expr * expr * expr | Int of int | Bool of bool
One important point here is the existence of the ident type. Notice where
ident is used in the expr type: as variable identifiers, and as function param-
eters for Function and Let Rec. The ident type attaches additional semantic
information to a string, indicating that the string specifically represents an iden-
tifier.
Note, though, that ident is used in two different ways: to signify the dec-
laration of a variable (such as in Function (Ident "x",...)) and to signify
the use of a variable (such as in Var(Ident "x")). Observe that the use of
a variable is an expression and so can appear in the AST anywhere that any
expression can appear. The declaration of a variable, on the other hand, is not
an expression; variables are only declared in functions. In this way, we are able
to use the Caml type system to help us keep the properties of the AST nodes
straight.
For example, consider the following AST:

Plus
  Ident "x"
  Int 5

At first glance, it might appear that this AST represents the F[ expression x + 5. However, the AST above cannot actually exist using the variants we defined above. The Plus variation accepts two expressions upon construction and Ident is not a variant; thus, the equivalent Caml code Plus(Ident "x", Int 5) would not even typecheck. The F[ expression x + 5 is represented instead by the AST

Plus
  Var
    Ident "x"
  Int 5

that is, by Plus(Var(Ident "x"), Int 5).
Example 2.17.
Concrete: 1 + 2
Abstract: Plus(Int 1, Int 2)

Example 2.18.
Concrete: True Or False
Abstract: Or(Bool true, Bool false)
Example 2.20.
Concrete: (Function x -> x + 1) 5
Abstract: Appl(Function(Ident "x", Plus(Var(Ident "x"), Int 1)), Int 5)
Example 2.21.
Concrete: (Function x -> Function y -> x + y) 4 5
Abstract:
Appl(
  Appl(
    Function(
      Ident "x",
      Function(
        Ident "y",
        Plus(
          Var(Ident "x"),
          Var(Ident "y")
        ))),
    Int 4),
  Int 5)
Example 2.22.
Concrete:
Let Rec fib x =
If x = 1 Or x = 2 Then 1 Else fib (x - 1) + fib (x - 2)
In fib 6
Abstract:
Letrec(Ident "fib", Ident "x", If(Or(Equal(Var(Ident "x"), Int
1), Equal(Var(Ident "x"), Int 2)), Int 1, Plus(Appl(Var(Ident
"fib"), Minus(Var(Ident "x"), Int 1)), Appl(Var(Ident "fib"),
Minus(Var(Ident "x"), Int 2)))), Appl(Var(Ident "fib"), Int 6))
Notice how lengthy even simple expressions can become when represented
in the abstract syntax. Review the above examples carefully, and try some ad-
ditional examples of your own. It is important to be able to comfortably switch
between abstract and concrete syntax when writing compilers and interpreters.
2.4 Operational Equivalence

Function x -> e ≅ Function z -> (Function x -> e) z,   for z not free in e

Thaw (Freeze e) ≅ e
In both examples, one of the expressions may be replaced by the other without
ill effects (besides perhaps changing execution time), so we say they are equiv-
alent. To write formal proofs, however, we will need to develop a more rigorous
definition of equivalence.
46 CHAPTER 2. OPERATIONAL SEMANTICS
We wish to study equivalence for possibly open programs, because there are good equivalences such as x + 1 - 1 ≅ x + 0. We define "at any place" by the notion of a program context, which is, informally, an F[ program with some holes (•) in it. Using this informal definition, testing if e1 ≅ e2 would be roughly equivalent to performing the following steps (for all possible programs and all possible holes, of course).
1. Select any context, that is, any F[ program with a hole • in it.

2. Place e1 in the hole and run the program.

3. Place e2 in the same hole and run the program.

4. Compare the two outcomes; if one of the two terminates and the other does not, then e1 and e2 are not equivalent.

5. Repeat steps 1-4 for every possible context. If none of these infinitely many contexts produces different results, then e1 is equivalent to e2.
C =def (Function z -> Function x -> •) z
Another way to phrase this definition is that two expressions are equivalent if in any possible context, C, one terminates if the other does. We call this operational equivalence because it is based on the interpreter for the language, or rather it is based on the operational semantics. The most interesting, and perhaps nonintuitive, part of this definition is that nothing is said about the relationship between v and v′. In fact, they may be different in theory. However, intuition tells us that v and v′ must be very similar, since equivalence holds for any possible context.
The only problem with this definition of equivalence is its “incestuous”
nature—there is no absolute standard of equivalence removed from the lan-
guage. Domain theory is a mathematical discipline which defines an algebra
of programs in terms of existing mathematical objects (complete and continu-
ous partial orders). We are not going to discuss domain theory here, mainly
because it does not generalize well to programming languages with side effects.
[17] explores the relationship between operational semantics and domain theory.
Definition 2.16 (Reflexivity).

e ≅ e

Definition 2.17 (Symmetry).

e ≅ e′ if e′ ≅ e

Definition 2.18 (Transitivity).

e ≅ e″ if e ≅ e′ and e′ ≅ e″

Definition 2.19 (Congruence).

C[e] ≅ C[e′] if e ≅ e′

Definition 2.20 (β-Equivalence).

((Function x -> e) v) ≅ (e[v/x])

provided v is closed (if v had free variables they could be captured when v is placed deep inside e).

Definition 2.21 (η-Equivalence).

(Function x -> e) ≅ ((Function z -> Function x -> e) z)

Definition 2.22 (α-Equivalence).

(Function x -> e) ≅ ((Function y -> e){y/x})
Definition 2.23.

(n + n′) ≅ the sum of n and n′

Definition 2.24.

Definition 2.25.

If e ⇒ v then e ≅ v
Lemma 2.5. 2 ≇ 3.

Proof. By example. Let C =def If • = 2 Then 0 Else (0 0). C[2] ⇒ 0 while C[3] ⇏ v for any v. Thus, by definition, 2 ≇ 3.
Note that, in the above proof, we used the expression (0 0). This expression
cannot evaluate; the application rule applies only to functions. As a result,
this expression makes an excellent tool for intentionally making code get stuck
when building a proof about operational equivalence. Other similar get-stuck
expressions exist, such as True + True, Not 5, and the ever-popular (Function
x -> x x)(Function x -> x x).
It should be clear that we can use the approach in Lemma 2.5 to prove that
any two integers which are not equal are not operationally equivalent. But can
we make a statement about a non-value expression?
Lemma 2.6. x ≇ x + 1 - 1.
At first glance, this inequivalence may seem counterintuitive. But the proof
is fairly simple:
Proof. By example. Let C =def (Function x -> •) True. Then C[x] ⇒ True. C[x + 1 - 1] ≡ (Function x -> x + 1 - 1) True, which cannot evaluate because no rule allows us to evaluate True + 1. Thus, by definition, x ≇ x + 1 - 1.
Lemma 2.7. If e ⇏ v for any v, e′ ⇏ v′ for any v′, and both e and e′ are closed expressions, then e ≅ e′. For example, (0 0) ≅ (Function x -> x x) (Function x -> x x).
Exercises
Exercise 2.1. How would you change the Sheep language to allow the terms
bah, baah, · · · without excluding any terms which are already allowed?
Exercise 2.2. Is the term it in the Frog language? Why or why not?
Exercise 2.3. Is it possible to construct a term in Frog without using the
terminal t? If so, give an example. If not, why not?
Exercise 2.4. Complete the definition of the operational semantics for the
boolean language described in section 2.2.1 by writing the rules for Or and
Implies.
Exercise 2.5. Why not just use interpreters and forget about the operational
semantics approach?
Chapter 3

Tuples, Records, and Variants
In Chapter 2 we saw that, using a language with only functions and application,
we could represent advanced programming constructs such as tuples and lists.
However, we pointed out that these encodings have fundamental problems, such
as a low degree of efficiency, and the fact that they necessarily expose their
details to the programmer, making them difficult and dangerous to work with
in practice. Recall how we could take our encoding of a pair from Chapter 2
and apply it like a function; clearly the wrong behavior. In this chapter we look
at how we can build some of these advanced features into the language, namely
tuples and records, and we conclude the chapter by examining variants.
3.1 Tuples
One of the most fundamental forms of data aggregation in programming is the
notion of pairing. With pairs, or 2-tuples, almost any data structure can be
represented. Tripling can be represented as (1, (2, 3)), and in general n-tuples
can be represented with pairs in a similar fashion. Records and C-style structs
can be represented with sets (n-tuples) of (label, value)-pairs. Even objects
can be built up from pairs, but this is stretching it (just as encoding pairs as
functions was stretching it).
In Chapter 2, we showed an encoding of pairs based on functions. There were
two problems with this representation of pairs. First of all, the representation
was inefficient. More importantly, the behavior of pairs was slightly wrong,
because we could apply them like functions. To really handle pairs correctly,
we need to add them directly to the language. We can add pairs to F[ in a
fairly straightforward manner. We show how to add pair functionality to the
interpreter, and leave the operational semantics for pairs as an exercise for the
reader.
First, we extend the expr type in our interpreter to include the following.
type expr =
...
| Pr of expr * expr | Left of expr | Right of expr
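The corresponding new clauses in the eval function might look like the following sketch (the error messages are our own):

  | Pr (e1, e2) -> Pr (eval e1, eval e2)     (* eager: evaluate both components *)
  | Left e -> (match eval e with
        Pr (v1, _) -> v1
      | _ -> failwith "Left applied to a non-pair")
  | Right e -> (match eval e with
        Pr (_, v2) -> v2
      | _ -> failwith "Right applied to a non-pair")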
Notice that our pairs are eager, that is, the left and right components of the pair
are evaluated, and must be values for the pair itself to be considered a value.
For example, (2, 3+4) ⇒ (2, 7). Caml tuples exhibit this same behavior.
Also notice that our space of values is now bigger. It includes:

• the existing F[ values: integers, booleans, and functions; and
• pairs (v1, v2) in which v1 and v2 are values.
Exercise 3.1. How would we write our interpreter to handle pairs in a non-
eager way? In other words, what would need to be in the interpreter so that
(e1 , e2 ) was considered a value (as opposed to only (v1 , v2 ) being considered a
value)?
Now that we have 2-tuples, encoding 3-tuples, 4-tuples, and n-tuples is easy.
We simply do them as (1, (2, (3, (. . . , n)))). As we saw before, lists can be
encoded as n-tuples.
3.2 Records
Records are a variation on tuples in which the fields have names. Records have
several advantages over tuples. The main advantage is the named field. From
a software engineering perspective, a named field “zipcode” is far superior to
“the third element in the tuple.” The order of the fields of a record is arbitrary,
unlike with tuples.
Records are also far closer to objects than tuples are. We can encode object
polymorphism via record polymorphism. Record polymorphism is discussed
in Section 3.2.1. The motivation for using records to encode objects is that
a subclass is composed of a superset of the fields of its parent class, and yet
instances of both classes may be used in the context of the superclass. Similarly,
record polymorphism allows records to be used in the context of a subset of their
fields, and so the mapping is quite natural. We will use records to model objects
in Chapter 5.
Our F[ records will have the same syntax as Caml records. That is, records
are written as {l1 =e1 ; l2 =e2 ; . . . ; ln =en }, and selection is written as e.lk ,
which selects the value labeled lk from record e. We use l as a metavariable
ranging over labels, just as we use e as a metavariable indicating an expression;
an actual record is for instance {x=5; y=7; z=6}, so x here is an actual label.
If records are always statically known to be of fixed size, that is, if they are
known to be of fixed size at the time we write our code, then we may simply
map the labels to integers, and encode the record as a tuple. For instance, the record {x=5; y=7; z=6} could be encoded as the tuple (5, (7, 6)), with x mapped to the first position, y to the second, and z to the third.

Obviously, this makes for ugly, hard-to-read code, but for C-style structs, it works. But in the case where records can shrink and grow, this encoding is fundamentally too weak. C++ structs can be subtypes of one another, so fields that are not declared may, in fact, be present at runtime.
On the other hand, pairs can be encoded as records quite nicely. The pair (3,
4) can simply be encoded as the record {l=3; r=4}. More complex pairs, such
as those used to represent lists, can also be encoded as records. For example,
the pair (3, (4, (5, 6))), which represents the list [3; 4; 5; 6], can be
encoded as the record {l=3; r={l=4; r={l=5; r=6}}}.
A variation of this list encoding is used in the mergesort example in Sec-
tion 4.3.2. This variation encodes the above list as {l=3; r={l=4; r={l=5;
r={l=6; r=emptylist}}}}. This encoding has the nice property that the val-
ues are always contained in the l fields, and the rest of the list is always con-
tained in the r fields. This is much closer to the way real languages such as
Caml, Scheme, and Lisp represent lists (recall how we write statements like let
(first::rest) = mylist in Caml).
Next, we need a way to represent the record itself. Records may be of arbitrary
length, so a list of (label, expression)-pairs is needed. In addition, we need a way
to represent selection. The F[R expr type now looks like the following.
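In outline (a sketch; we choose the constructor names Lab, Record, and Select here):

type label = Lab of string

type expr =
  ...
| Record of (label * expr) list
| Select of label * expr

Under this sketch, the record {size=7; weight=255} would be represented as Record [(Lab "size", Int 7); (Lab "weight", Int 255)], and selection e.size as Select(Lab "size", ...) with the record expression in the second position.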
Let’s look at some concrete to abstract syntax examples for our new lan-
guage.
Example 3.1.
{size=7; weight=255}
Example 3.2.
e.size
Notice that our interpreter correctly handles {}, the empty record, by having it compute to itself since it is, by definition, a value.
# {one = 1; two = 2;
three = 2 + 1; four = (Function x -> x + x) 2};;
==> {one=1; two=2; three=3; four=4}
Next, let’s try a more interesting example, where we use records to encode
lists. Note that we define emptylist as -1. The function below sums all values
in a list (assuming it has a list of integers).
# Let emptylist = 0 - 1 In
Let Rec sumlist list =
If list = emptylist Then
0
Else
(list.l) + sumlist (list.r) In
sumlist {l=1; r={l=2; r={l=3; r={l=4; r=emptylist}}}};;
==> 10
3.3 Variants
We have been using variants in Caml, as the types for expressions expr. Now we study untyped variants more closely. Caml actually has two (incompatible) forms of variant, regular variants and polymorphic variants. In the untyped context we are working in, the Caml polymorphic variants are more appropriate and we will use that form.
We briefly contrast the two forms of variant in Caml for readers unfamil-
iar with polymorphic variants. Recall that in Caml, regular variants are first
declared as types
type feeling =
Vaguely of feeling | Mixed of feeling * feeling |
Love of string | Hate of string | Happy | Depressed
which allows Vaguely(Happy), Mixed(Vaguely(Happy),Hate("Fred")), etc.
Polymorphic variants require no type declaration; thus, for the above we can di-
rectly write ‘Vaguely(‘Happy), ‘Mixed(‘Vaguely(‘Happy),‘Hate("Fred")),
etc. The ‘ must be prefixed to each variant name, indicating that it is a polymorphic variant name.
Let’s look at some concrete to abstract syntax examples for our new lan-
guage.
Example 3.3.
‘Positive(4)
Example 3.4.
Match e With
‘Positive(x) -> 1 | ‘Negative(y) -> -1 | ‘Zero(p) -> 0
Note that in this example we can't just have a variant Zero, since 0-ary variants
are not allowed and a dummy argument must be supplied. Multiple-argument
variants may be encoded by a single-argument variant over a pair or record
(since we have neither pairs nor records in F[V, the only recourse is the encoding
of pairs used in F[ in Section 2.3.4).
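Assuming hypothetical abstract syntax constructors Variant of name * expr and Match of expr * (name * ident * expr) list (the FbDK's actual declarations may differ), Example 3.3 would correspond to Variant (Name "Positive", Int 4), and Example 3.4 to a Match node carrying the translation of e together with a list of (variant name, bound identifier, body) triples.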
In addition, our definition of F[V values must also be extended from the F[
ones to include variants: n(v) is a value, provided v is. To define the meaning
of F[V execution, we extend the operational semantics of F[ with the following
two rules:
e ⇒ v
(Variant Rule)
n(e) ⇒ n(v)
e ⇒ nj(vj), ej[vj/xj] ⇒ v
(Match Rule)
(Match e With n1(x1) -> e1 | . . . | nj(xj) -> ej | . . . | nm(xm) -> em) ⇒ v
The Variant rule constructs a new variant labeled n; its argument is eagerly
evaluated to a value, just as in Caml: `Positive(3+2) ⇒ `Positive(5). The
Match rule first computes the expression e being matched to a variant nj(vj),
then looks up that variant in the match, finding nj(xj) -> ej, and then
evaluates ej with its variable xj given the value of the variant argument.
Example 3.5.
Exercise 3.2. Extend the F[V syntax and operational semantics so the Match
expression always has a final match of the form "| _ -> e". Is this
Match strictly more expressive than the old one, or not?
Variants and Records are Duals Here we see how the definition of a record
is modeled as a use of a variant, and a use of a record is the definition of a variant.
Variants are the dual of records: a record is this field and that field and
that field; a variant is this field or that field or that field. Since they are duals,
defining a record looks something like using a variant, and defining a variant
looks like using a record.
Variants can directly encode records and vice-versa, in a programming anal-
ogy of how De Morgan's Laws allow logical and to be encoded in terms of or, and
vice-versa: p Or q = Not(Not p And Not q); p And q = Not(Not p Or Not q).
Variants can be encoded using records as follows.
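As a sketch of one standard such encoding (our own illustration), a variant value becomes a function that picks its handler out of a record, and a Match becomes the application of such a record of handlers:

(* `Positive(4) is encoded as: *)
Function handlers -> handlers.positive 4

(* Match e With `Positive(x) -> e1 | `Negative(y) -> e2 is encoded as: *)
e {positive = Function x -> e1; negative = Function y -> e2}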
Chapter 4
Side Effects: State and Exceptions
We will now leave the world of pure functional programming, and begin consid-
ering languages with side-effects. For now we will focus solely on two particular
side-effects, state and exceptions; there are, however, many other kinds of
side-effects that we will not treat here.
4.1 State
Languages like F[, F[R, and F[V are pure functional languages. Once we add
any kind of side-effect to a language it is not pure functional anymore. Side-
effects are non-local, meaning they can affect other parts of the program. As
an example, consider the following Caml code.
let x = ref 9 in
let f z = x := !x + z in
x := 5; f 5; !x
This expression evaluates to 10: the assignment x := 5 replaces the initial
contents of the cell, and the call f 5 then adds 5 to it, so the final dereference
!x yields 10. To predict this outcome we had to examine the entire body of code
to see which side-effects influence the outcome of which expressions. Therefore,
it is a good programming moral to use side-effects only when they are strongly
needed.
Let us begin by informally discussing the semantics of references. In essence,
when a reference is created in a program, a cell is created that contains the
specified value, and the reference itself is a pointer to that cell. A good metaphor
for these reference cells is to think of each one as a junction box. Then, each
assignment is a path into the junction, and each read, or dereference, is a path
going out of the junction. The reference cell, or junction, sits to the side of
the program, and allows distant parts of the program to communicate with one
another. This “distant communication” can be a useful programming paradigm,
but again it should be used sparingly. Figure 4.1 illustrates this metaphor.
In the world of C++ and Java, non-const (or non-final) global variables are
the most notorious form of reference. While globals make it easy to do certain
tasks, they generally make programs difficult to debug, since they can be altered
anywhere in the code.
The F[S language extends F[ with the following constructs for state:
• Referencing, Ref e.
• Assignment, e := e'.
• Dereferencing, !e.
• Cell names, c.
Finally, we are ready to write the evaluation rules for F[S. We will need to
modify all of the F[ rules with the store in mind (recall that our evaluation
rule is now ⟨e, S0⟩ ⇒ ⟨v, S⟩, not simply e ⇒ v). We do this by threading
the store along the flow of control. Doing this introduces a great deal more
dependency between the rules, even the ones that do not directly manipulate
the store. We will rewrite the function application rule for F[S to illustrate the
types of changes that are needed in the other rules.
⟨e1, S1⟩ ⇒ ⟨Function x -> e, S2⟩, ⟨e2, S2⟩ ⇒ ⟨v2, S3⟩, ⟨e[v2/x], S3⟩ ⇒ ⟨v, S4⟩
(Function Application)
⟨e1 e2, S1⟩ ⇒ ⟨v, S4⟩
Note how the store here is threaded through the different evaluations, showing
how changes in the store in one place propagate to the store in other places,
and in a fixed order that reflects the intended evaluation order. The rules for
our new memory operations are as follows.
⟨e, S1⟩ ⇒ ⟨v, S2⟩
(Reference Creation)
⟨Ref e, S1⟩ ⇒ ⟨c, S2{c ↦ v}⟩, for c ∉ Dom(S2)
⟨e, S1⟩ ⇒ ⟨c, S2⟩
(Dereference)
⟨!e, S1⟩ ⇒ ⟨v, S2⟩, where S2(c) = v
⟨e1, S1⟩ ⇒ ⟨c, S2⟩, ⟨e2, S2⟩ ⇒ ⟨v, S3⟩
(Assignment)
⟨e1 := e2, S1⟩ ⇒ ⟨v, S3{c ↦ v}⟩
These rules can be tricky to evaluate because the store needs to be kept up
to date at all points in the evaluation. Let us look at a few example expressions
to get a feel for how this works.
Example 4.1.
Example 4.2.
F[S Interpreters
Just as we had to modify the evaluation relation in our F[S operational seman-
tics to support state, writing an F[S interpreter will also require some additional
work. There are two obvious approaches to take, and we will treat them both
below. The first approach involves mimicking the operational semantics and
defining evaluation on an expression and a store together. This approach yields
a functional interpreter in which eval(e, S0 ) for expression e and initial state
S0 returns the tuple (v, S), where v is the resulting value and S is the final
state.
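A minimal sketch of this first approach (our own illustration over a drastically simplified expression type; the FbDK interpreter is far richer):

type expr =
  | Int of int
  | Plus of expr * expr
  | Ref of expr
  | Get of expr
  | Set of expr * expr
  | Cell of int                        (* a store cell name *)

module StoreMap = Map.Make (Int)
type store = expr StoreMap.t

(* cells are never removed, so one past the current size is fresh *)
let fresh_cell s = StoreMap.cardinal s + 1

(* eval takes the store as an argument and returns the updated store *)
let rec eval (e : expr) (s0 : store) : expr * store =
  match e with
  | Int _ | Cell _ -> (e, s0)
  | Plus (e1, e2) ->
      let v1, s1 = eval e1 s0 in
      let v2, s2 = eval e2 s1 in       (* thread the store left to right *)
      (match v1, v2 with
       | Int n1, Int n2 -> (Int (n1 + n2), s2)
       | _ -> failwith "type error")
  | Ref e ->
      let v, s1 = eval e s0 in
      let c = fresh_cell s1 in
      (Cell c, StoreMap.add c v s1)
  | Get e ->
      (match eval e s0 with
       | Cell c, s1 -> (StoreMap.find c s1, s1)
       | _ -> failwith "type error")
  | Set (e1, e2) ->
      (match eval e1 s0 with
       | Cell c, s1 ->
           let v, s2 = eval e2 s1 in
           (v, StoreMap.add c v s2)
       | _ -> failwith "type error")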
The second and more efficient design involves keeping track of state in a
global, mutable dictionary structure. This is generally how real implementations
work. This approach results in more familiar evaluation semantics, namely
eval e returns v. Obviously such an interpreter is no longer functional, but
rather, imperative. We would like such an interpreter to faithfully implement
the operational semantics of F[S, and thus we would ideally want a theorem
that states that this approach is equivalent to the first approach. Proving such
a theorem would be difficult, however, mainly because our proof would rely
on the operational semantics of Caml, or whatever implementation language
we chose. We will therefore take it on good faith that the two approaches are
indeed equivalent.
module Store = (* assumed module header; the body is only sketched *)
struct
  let empty = (* initial empty store *)
  let fresh = (* returns a fresh Cell name *)
    let count = ref 0 in
    function () -> ( count := !count + 1; Cell(!count) )
    (* Note: this is not purely functional!  It is quite
     * difficult to make fresh purely functional. *)
  let modify(s,c,v) = (* ... *)
end
module FbSEvalFunctor =
functor (Store : STORE) ->
struct
(* ... *)
end
Side-Effecting Operators
Now that we have a mutable store, our code has properties beyond the value
that is returned, namely, side-effects. Operators such as sequencing (;), and
While- and For loops now become relevant. These syntactic concepts are easily
defined as macros, so we will not add them to the official F[S syntax. The
macro definitions are as follows.
e1; e2 = (Function x -> e2) e1
While e Do e' = Let Rec f x = If e Then f e' Else 0 In f 0
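For instance, under the sequencing macro the expression x := 5; !x abbreviates (Function d -> !x) (x := 5) for a fresh variable d (the macro's bound variable must not occur free in e2); since arguments are evaluated eagerly, the assignment happens before the dereference.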
Exercise 4.1. Why are sequencing and loop operations irrelevant in pure func-
tional languages like F[?
Consider the following expression, which builds a cyclical store:
Let x = Ref 0 In x := x
This is the simplest possible store cycle, where a cell directly points to itself.
This type of store is illustrated in Figure 4.2.
Exercise 4.2. In the above example, what does !!!!!!!!x return? Can a store
cycle like the one above be written in Caml? Why or why not? (Hint: What
is the type of such an expression?)
A more subtle form of a store cycle is when a function is placed in the cell,
and the body of the function refers to that cell. Consider the following example.
Let c = Ref 0 In
c := (Function x -> If x = 0 Then 0 Else 1 + !c(x-1));
!c(10)
Exercise 4.3. Tying the knot can be written in Caml, but not directly as above.
How must the reference be declared for this to work? Why can we create this
sort of cyclic store, but not the sort described in Exercise 4.2?
Now we’ll use tying the knot. Because F[SR does not include a sequencing
operation, we use the encoding presented in Section 4.1.1.
Many languages distinguish expressions based on whether they are on the left
or right side of an assignment operator. An l-value occurs on the left side of
the assignment, and represents a memory location. An r-value occurs to the
right of an assignment, or elsewhere in the code, and represents an actual value.
Consider the C/C++/Java assignment statement x = x + 1. The x on the
left of the assignment is an l-value, and the x in x + 1 is an r-value. In Caml,
we would write a similar expression as x := !x + 1. Thus, Caml is explicit
about which is being referred to, the cell or the value. For a Java l-value, the cell is
needed to perform the store operation, and for a Java r-value, the value of the
cell is needed, which is why, in Caml, we must write !x.
l-values and r-values are distinct. Some expressions may be both l-values
and r-values, such as x and x[3]. Other values may only be r-values, such as
5, (0 == 1), and sin(3.14159). Therefore, l-values and r-values are expressed
differently in the language grammar; specifically, l-values are a subset of r-values.
There are some shortcomings to such languages. For example, in Caml we can
say f(3) := 1, which assigns the value 1 to the cell returned by the function f.
Expressions of this sort are invalid in Java, because f(3) is not an l-value. We
could revise the F[S grammar to use this more restrictive notion, where l-values
must only be variables, in the following way:
type expr =
...
| Get of expr | Set of ident * expr | Ref of expr
For the variable on the left side of the assignment, we would need the address,
and not the contents, of the variable.
A final issue with the standard notion of state in C and C++ is the problem
of uninitialized variables. Because variables are not required to be initialized,
runtime errors due to these uninitialized variables are possible. Note that the
Ref syntax of F[S and Caml requires that the variable be explicitly initialized
as part of its declaration.
Once the root set is established, the garbage collector marks all cells not in
the root set as initially free. Then, it recursively traverses the memory graph
starting at the root set, and marks cells that are reachable from the root set
as not free. Thus, at the end of the traversal, all cells that are in a different
connected component from every cell in the root set remain marked free,
and these cells may be reused. There are many different ways to reuse memory,
but we will not go into detail here.
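A sketch of the marking traversal in Caml (our own illustration: cells are named by integers, and refs_of c lists the cells that cell c points to):

module IntSet = Set.Make (Int)

(* returns the set of cells reachable from the root set; every cell
   outside this set may be reused *)
let mark (refs_of : int -> int list) (roots : int list) : IntSet.t =
  let rec visit marked c =
    if IntSet.mem c marked then marked
    else List.fold_left visit (IntSet.add c marked) (refs_of c)
  in
  List.fold_left visit IntSet.empty roots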
Code can't be copied around like this, so a scheme like the one presented above is
necessary.
There is a possibility for some anomalies with this approach, however. Specif-
ically, there is a problem when a function is returned as the result of another
function application, and the returned function has local variables in it. Con-
sider the following example (our illustration):
(Function x -> Function y -> x + y) 3
Evaluating this application produces Function y -> x + y,
but it would be a mistake to simply return this as the correct value, because we
would lose the fact that x was bound to 3. The solution is that when a function
value is returned, the closure of that function is returned.
For the case above, the correct value to return is the closure pairing Function y -> x + y with an environment recording that x is bound to 3.
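Caml function values behave exactly this way:

let make_adder x = fun y -> x + y ;;
let add3 = make_adder 3 ;;  (* add3 is a closure recording x = 3 *)
add3 4 ;;                   (* ==> 7 *)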
This closure view of function values is critical when writing a compiler, since
compilers should not be doing substitutions of code on the fly. Compilers are
discussed in Chapter 8.
In Section 4.4 and Chapters 5 and 6, we will study the language features
missing from F[SR, namely objects, exceptions, and types. In Chapter 8, we
will discuss translations for F[SR, but will not include these other language
features. The abstract syntax type for F[SR is as follows.
type expr =
Var of ident | Function of ident * expr | Appl of expr * expr |
Letrec of ident * ident * expr * expr | Plus of expr * expr |
Minus of expr * expr | Equal of expr * expr |
And of expr * expr | Or of expr * expr | Not of expr |
If of expr * expr * expr | Int of int | Bool of bool |
Ref of expr | Set of expr * expr | Get of expr | Cell of int |
Record of (label * expr) list | Select of label * expr |
Let of ident * expr * expr
In the next two sections we will look at some nontrivial “real-world” F[SR
programs to illustrate the power we can get from this simple language. We
begin by considering a function to compute factorials, and conclude
the chapter by examining an implementation of the merge sort algorithm.
(*
* First we encode multiplication for nonnegative
* integers. Notice the currying in the Let Rec construct.
* Multiplication is encoded in the obvious way: repeated
* addition.
*)
Let Rec mult x = Function y ->
If y = 0 Then
0
Else
x + (mult x (y - 1)) In
(*
* Now that we have multiplication, factorial is easy to
* define.
*)
Let Rec fact x =
If x = 0 Then
1
Else
mult x (fact (x - 1)) In
fact 7
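Running this program through the F[SR interpreter should produce ==> 5040, since 7! = 5040.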
The great thing about this approach is that every other value of z is negative,
and we can simply pass along a boolean value to represent the sign of z. If z
is nonnegative, the next iteration of the loop inverts it and subtracts 1. If z is
negative, the next iteration simply inverts it. Note that we can invert the sign
of a number x in F[SR by simply writing 0 - x. Now, armed with our ideas
about lesseq, let us start writing our merge sort.
Let cons = Function elt -> Function seq -> {l=elt; r=seq} In
(mergesort (halves.right)) In
The problem with goto is that it is too raw. The paradigm of jumping around be-
tween labels is not all that useful in the context of functions. It is also inherently
dangerous, as one may inadvertently jump into the middle of a function that
is not even executing, or skip past variable initializations. In addition, with a
rich enough set of other control operators, goto really doesn’t provide any more
expressiveness, at least not meaningful expressiveness.
The truth is that control operators are really not needed at all. Recall
that F[, and the lambda calculus for that matter, are already Turing-complete.
Control operators are therefore just conveniences that make programming easier.
It is useful to think of control operators as “meta-operators,” that is, operators
that act on the evaluation process itself.
Consider, for example, the following expression, which uses a Return operator:
(Function x ->
(If x = 0 Then 5 Else Return (4 + x)) - 8) 4
Since x will not be 0 when the function is applied, the Return statement will
get evaluated, and execution should stop immediately, not evaluating the “-
8.” The problem is that evaluating the above statement means evaluating
(Return (4 + 4)) - 8.
But we know that the subtraction rule works by evaluating the left and right
hand sides of this expression to values, and performing integer subtraction on
them. Clearly that doesn't work in this case, and so we need special rules for
subtraction with a Return in one of the subexpressions.
First, we need to add Returns to the value space of F[ and provide an
appropriate evaluation rule:
e ⇒ v
(Return)
Return e ⇒ Return v
Next, we need the special subtraction rules, one for when the Return is on the
left side, and one for when the Return is on the right side.
e ⇒ Return v
(- Return Left)
e - e' ⇒ Return v
e ⇒ v, e' ⇒ Return v'
(- Return Right)
e - e' ⇒ Return v'
Notice that these subtraction rules evaluate to Return v and not simply v.
This means that the Return is “bubbled up” through the subtraction operator.
We need to define similar return rules for every F[ operator. Using these new
rules, it is clear that
Return (4 + 4) - 8 ⇒ Return 8.
Returning to our original example:
(Function x ->
(If x = 0 Then 5 Else Return (4 + x)) - 8) 4
A few other special application rules are needed for the cases when either the
function or the argument itself is a Return.
e1 ⇒ Return v
(Appl. Return Function)
e1 e2 ⇒ Return v
e1 ⇒ v1 , e2 ⇒ Return v
(Appl. Return Arg.)
e1 e2 ⇒ Return v
Of course, we still need the original function application rule (see Section 2.3.3)
for the case that function execution implicitly returns a value by dropping off
the end of its execution.
e ⇒ Return v
(Double Return)
Return e ⇒ Return v
Of course, we would need to restrict the original Return rule to the case where
v is not itself of the form Return v'. With this rule, instead of returning from
two levels of function calls, the second Return actually interrupts and bubbles
through the first. Of course, double returns are not a common construct, and
these rules will not be used often in practice.
For comparison, this Return behavior can be expressed directly in Caml using exceptions (given a declaration exception Return of int):
(function x ->
try
(if x = 0 then 5 else raise (Return (4 + x))) - 8
with
Return n -> n) 4;;
In F[X, evaluating an expression Raise (#xn e) bubbles up the exception #xn.
This is the generalization of the value class Return v from above. xn is a
metavariable representing an exception
name. An exception contains two pieces of data: a name, and an argument.
The argument can be an arbitrary expression. Although we only allow single-
argument exceptions, zero-valued or multi-valued versions of exceptions can be
4.4. EXCEPTIONS AND OTHER CONTROL OPERATIONS 81
easily encoded by, for example, supplying the exception with a record argument
with zero, one, or several fields. We also add to F[X the expression
Try e With #xn x -> e'
Note that Try binds free occurrences of x in e'. Also notice that the F[X Try
syntax differs slightly from the Caml syntax in that Caml allows an arbitrary
pattern-match expression in the With clause. We allow only a single clause that
matches all values of x.
F[X is untyped, so exceptions do not need to be declared. We use the “#”
symbol to designate exceptions in the concrete syntax, for example, #MyExn.
Below is an example of an F[X expression (an illustration of our own):
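Try
  (Function x -> If x = 0 Then Raise (#Found x) Else x - 1) 0
With #Found n ->
  n + 5

Here the application raises #Found with argument 0; the Try catches it, binds n to 0, and the whole expression computes to 5.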
The rules for Raise and Try are derived from the return rule and the ap-
plication return rule respectively. Raise “bubbles up” an exception, just like
Return bubbled itself up. Try is the point at which the bubbling stops, just like
function application was the point at which a Return stopped bubbling. The
operational semantics of exceptions are as follows.
e ⇒ Raise (#xn v)
(- Raise Left)
e - e' ⇒ Raise (#xn v)
Note that we must handle the unusual case of when a Raise bubbles through a
Raise, something that will not happen very often in practice. The rule is very
much like the “- Raise Left” rule above.
e ⇒ Raise (#xn v)
(Raise Raise)
Raise e ⇒ Raise (#xn v)
After the function application and the evaluation of the If statement, we are
left with
which is
Chapter 5
Object-Oriented Language Features
• Everyday objects are active, that is, they are not fully controlled by us.
Objects have internal and evolving state. For instance, a running car is an
active object, and has a complex internal state including the amount of gas
remaining, the engine coolant and transmission fluid levels, the amount of
battery power, the oil level, the temperature, the degree of wear and tear
on the engine components, etc.
(After all, it is really the car sending a message to us. We view ourselves as the “sender,”
however, because we initiated the check; that is, in a sense, we asked the car how much gas
was remaining.)
• Everyday objects may be nested, that is, objects may be made up of several
smaller object components, and those components may themselves have
smaller components. Once again, a car is a perfect example of a nested
object, being made up of smaller objects including the engine and the
transmission. The transmission is also a nested object, including an input
shaft, an output shaft, a torque converter, and a set of gears.
• Everyday objects are, for the most part, uniquely named. Cars are uniquely
named by license plates or vehicle registration numbers.
• Everyday objects may be self-aware, in the sense that they may intention-
ally interact with themselves, for instance a dog licking his paw.
• Objects support messages, which are named pieces of code that are tied
to a particular object. In this way objects are communicative, and groups
of objects accomplish tasks by sending messages to each other.
There are several additional features objects commonly have. Classes are
nearly always present in languages with objects. Classes are not required: it
is perfectly valid to have objects without classes. The language Self [20, 21, 2]
has no classes; instead it has prototype objects, which are copied to do the
duty of classes. Important concepts of classes include creation, inheritance,
method overriding, superclass access, and dynamic dispatch. We will address
these concepts later.
Information hiding for fields and methods is another feature that most
object-oriented languages have, generally in the form of public, private, and
protected keywords.
Other aspects of objects include object types and modules. Types are dis-
cussed briefly, but modules are beyond the scope of this book. We also ignore
method overloading, as it is simply syntactic sugar and adds only to readabil-
ity, not functionality. Recall that overloading a method means there are two
methods with the same name, but different type signatures. Overriding means
redefining the behavior of a superclass’s method in a subclass. Overriding is
discussed in Section 5.1.5.
Let point = {
x = 4;
y = 3;
magnitude = Function -> . . . ;
iszero = Function -> . . .
} In . . .
(The diagram in Figure 5.2 is technically a class diagram; for our purposes we will view it
simply as the induced object.)
solution we will use is the same one that C++ uses behind the scenes: we pass
the object itself as an argument to the function. Let’s revise our first encoding
to the following one (assuming we’ve defined a sqrt function).
Let point = {
x = 4;
y = 3;
magnitude = Function this -> Function ->
sqrt this.x + this.y;
iszero = Function this -> Function ->
((this.magnitude) this {}) = 0
} In . . .
There are a few points of interest about the above example. First of all,
the object itself needs to be explicitly passed as an argument when we send
a message. For example, to send the magnitude message to point, we would
write
point.magnitude point {}
For convenience, we can use the abbreviation obj <- method to represent the
message (obj.method obj).
Even inside the object’s methods, we need to pass this along when we call
another method. iszero illustrates this point in the way it calls magnitude.
This encoding of self-reference is called the self-application encoding. There
are a number of other encodings, and we will treat some of them below.
There is a problem with this encoding, though; our object is still immutable.
An object with immutable fields can be a good thing in many cases. For ex-
ample, windows that can’t be resized and sets with fixed members can be im-
plemented with immutable fields. In general, though, we need our objects to
support mutable fields. Luckily this problem has an easy solution. Consider the
following revision of our point encoding, which uses Refs to represent fields.
Let point = {
x = Ref 4;
y = Ref 3;
magnitude = Function this -> Function ->
sqrt !(this.x) + !(this.y);
iszero = Function this -> Function ->
(this.magnitude this {}) = 0;
setx = Function this -> Function newx -> this.x := newx;
sety = Function this -> Function newy -> this.y := newy
} In . . .
To set x to 12, we can write either point <- setx 12, or we can directly
change the field with (point.x) := 12. To access the field, we now write
!(point.x), or we could define a getx method to abstract the dereferencing.
This strategy gives us a faithful encoding of simple objects. In the next few
sections, we will discuss how to encode some of the more advanced features of
objects.
Let eqpoint = {
(* all the code from point above: x, y, magnitude . . . *)
equal = Function this -> Function apoint ->
!(this.x) = !(apoint.x) And !(this.y) = !(apoint.y)
} In eqpoint <- equal({
x = Ref 3;
y = Ref 7;
(* . . . *)
})
The object passed to equal needs only to define x and y. Object polymor-
phism in our embedded language is thus more powerful than in C++ or Java:
C++ and Java look at the type of the arguments when deciding what is allowed
to be passed to a method. Subclasses can always be passed to methods that
take the superclass, but nothing else is allowed. In our encoding, which is closer
to Smalltalk, any object is allowed to be passed to a function or method as long
as it supports the messages needed by the function.
One potential difficulty with object polymorphism is that of dispatch: since
we don’t know the form of the object until runtime, we do not know exactly
where the correct methods will be laid out in memory. Thus hashing may be
required to look up methods in a compiled object-oriented language. This is
exactly the same problem that arises when writing the record-handling code in
our F[SR compiler (see Chapter 8), again illustrating the similarities between
objects and records.
pointInterface.setx 5
This solution has a flaw, though. Methods that return this re-expose the hidden
fields and methods of the full object. Consider the following example.
Let pointImpl = {
(* . . . *)
sneaky = Function this -> Function -> this
} In Let pointInterface = {
magnitude = pointImpl.magnitude pointImpl;
setx = pointImpl.setx pointImpl;
sety = pointImpl.sety pointImpl;
sneaky = pointImpl.sneaky pointImpl
} In pointInterface.sneaky {}
The sneaky method returns the full pointImpl object instead of the
pointInterface version. To remedy this problem, we need to change the encoding
so that we don’t have to pass this every time we invoke a method. Instead, we
will give the object a pointer to itself from the start.
point.magnitude {}
The method getThis still returns this, but this contains the public parts only.
Note that for this encoding to work, privateThis can be used only as a target
for messages, and can not be returned.
The disadvantage of this encoding is that this will need to be applied to
itself every time it’s used inside the object body. For example, instead of writing
(point.getThis {}).magnitude {}
These encodings are relatively simple. Classes and inheritance are more
difficult to encode, and are discussed below. Object typing is also particularly
difficult, and is covered in Chapter 6.
5.1.4 Classes
Classes are foremost templates for creating objects. A class is, in essence, an
object factory. Each object that is created from a class must have its own unique
set of instance variables (with the exception of static fields, which we will treat
later).
It is relatively easy to produce a simple encoding of classes. Simply freeze
the code that creates the object, and thaw it to create each new object. Ignoring
information hiding, the encoding looks as follows.
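A sketch of the shape this takes (our own illustration, built from the point encoding above):

Let pointClass = Function ->
  {
    x = Ref 4;
    y = Ref 3;
    magnitude = Function this -> Function ->
      sqrt !(this.x) + !(this.y);
    setx = Function this -> Function newx -> this.x := newx;
    sety = Function this -> Function newy -> this.y := newy
  }
In . . .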
We can define new pointClass to be pointClass {}. Some typical code which
creates and uses instances might look as follows.
point1 and point2 will have their own x and y values, since Ref creates new
store cells each time it is thawed. The same freeze and thaw trick can be applied
to our encoding of information hiding to get hiding in classes. The difficult part
of encoding classes is encoding inheritance, which we discuss in the next section.
5.1.5 Inheritance
As a rule, about 80 percent of the utility of objects is realized in the concepts
we have talked about above, namely
94 CHAPTER 5. OBJECT-ORIENTED LANGUAGE FEATURES
• Objects that encapsulate data and code under a single name to achieve
certain functionality.
• Polymorphic objects.
The other 20 percent of the utility comes from inheritance. Inheri-
tance allows related objects to be defined that share some common code. Inher-
itance can be encoded in F[SR by using the following scheme. In the subclass,
we create an instance of the superclass object, and keep the instance around
as a “slave.” We use the slave to access methods and fields of the superclass
object. Thus, the subclass object only contains new and overridden methods.
Real object oriented languages tend not to implement inheritance this way for
reasons of efficiency (imagine a long inheritance chain in which a method call
has to be delegated all the way back to the top of the hierarchy before it can
be invoked). Still, the encoding is good enough to illustrate the main points
of inheritance. For example, consider the following encoding of ColorPoint, a
subclass of Point, illustrated in Figure 5.3.
Let pointClass = . . . In
Let colorPointClass = Function ->
Let super = pointClass {} In {
x = super.x; y = super.y;
color = Ref {red = 45; green = 20; blue = 20};
magnitude = Function this -> Function ->
mult(super.magnitude this {})(this.brightness this {});
brightness = Function this -> Function ->
(* compute brightness. . . *)
setx = super.setx; sety = super.sety
} In . . .
There are several points of interest in this encoding. First of all, notice
that to inherit methods and fields from the superclass, we explicitly link them
together (i.e. x, y, setx, and sety). To override a method in the superclass, we
simply redefine it in the subclass instead of linking to the superclass; magnitude
is an example of this. Also notice that we can still invoke superclass methods
from the subclass. For instance, magnitude invokes super.magnitude in its
body. Notice how super.magnitude is passed this instead of super as its
argument. This has to do with dynamic dispatch, which we will address now.
getLength = (super.getLength);
} In . . .
Let pointClass = {
x = Ref newx;
y = Ref newy;
magnitude = Function this -> Function ->
sqrt ((!(this.x)) + (!(this.y)))
};
xdefault = 4;
ydefault = 3
} In
Notice how the class method newWithXY is actually responsible for building
the point object. new simply invokes newWithXY with some default values
that are stored as class fields. This is a very clean way to encode multiple
constructors.
Perhaps the most interesting thing about the encoding is how classes with static
fields start to look like our original encoding of simple objects. Look closely—
notice that class methods take an argument class as their first parameter, for
the exact same reason that regular methods take this as a parameter. So in
fact, pointClass is really just another primitive object that happens to be able
to create objects.
Viewing classes as objects is the dominant paradigm in Smalltalk. Java has
some support for looking at classes as objects through the reflection API as well.
Even in languages like C++ that don't view classes as objects, design patterns
such as the Factory patterns [12] capture this notion of objects creating objects.
This encoding is truly in the spirit of object-oriented programming, and
it is a clean, and particularly satisfying way to think about classes and static
members.
Primitive objects are not strictly necessary, but we include them in F[OB
because supporting them requires very little work. The value
returned by the expression new aClass is an object, which means that an object
is a first class expression. As long as our concrete syntax allows us to directly
define primitive objects, no additional work is needed in the interpreter.
An interesting consequence of having primitive objects is that we
could get rid of functions entirely (though not methods). Functions could simply be
encoded as methods of primitive objects. For this reason, object-oriented lan-
guages that support primitive objects have many of the advantages of higher-
order functions.
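For instance, a function such as Function x -> x + 1 could be represented as a one-method primitive object (a sketch in F[OB syntax; apply is a name of our choosing):

Object
  Inst
  Meth
    apply = Function x -> x + 1

An application f 3 then becomes the message send f <- apply 3.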
Let pointClass =
Class Extends EmptyClass
Inst
x = 3;
y = 4
Meth
magnitude = Function -> sqrt(x + y);
setx = Function newx -> x := newx;
sety = Function newy -> y := newy
In Let colorPointClass =
Class Extends pointClass
Inst
x = 3;
y = 4;
(* A use of a primitive object: *)
color = Object
Inst
Meth red = 45; green = 20; blue = 20
Meth
magnitude =
Function ->
mult(Super <- magnitude {})(This <- brightness);
(* An unnormalized brightness metric *)
brightness = Function ->
color <- red + color <- green + color <- blue;
setx = Super <- setx; (* explicitly inherit *)
sety = ...; setcolor = ...
There is a lot going on with this syntax, so let’s take some time to point out
some of the major elements. First of all, notice that This and Super are special
“reserved variables.” In our F[ encodings, we had to write “Function this
-> ” and pass this as an explicit parameter. Now, self-awareness happens
implicitly, and This is a reference to self.
Note the use of a primitive object to define the color field of the color-
PointClass. The red, green, and blue values are implemented as methods,
since fields are always “private.”
In our previous encodings we used one style when defining the base class, and
another different style when defining the subclass. In F[OB we use the same
syntax for both by always specifying the superclass. To allow for base classes
that do not inherit from any other class, we allow a class to extend EmptyClass,
which is simply a special class that does not define anything.
F[OB instance variables use the l/r-value form of state that was discussed
in Section 4.1.3. There is no need to explicitly use the ! operator to get the
value of an instance variable. F[OB instance variables are therefore mutable,
not following the Caml convention that all variables are immutable. Note that
method arguments are still immutable. There are thus two varieties of vari-
able: immutable method parameters, and mutable instances. Since it is clear
which variables are instances and which are not, there is no great potential for
confusion. It is the job of the parser to distinguish between instance variables
and regular variables. An advantage of this approach is that it keeps instance
variables from being directly manipulated by outsiders.
Method bodies are generally functions, but need not be; they can be any
immutable, publicly available value. For example, immutable instances can be
considered methods (see the color primitive object in the example above).
Note that we still have to explicitly inherit methods. This is not the cleanest
syntax, but it simplifies the interpreter and facilitates the translation to
F[SR discussed in Section 5.2.3.
Also, there is no constructor function. new is used to create new instances,
and, following Smalltalk, initialization is done explicitly by writing an initialize
method.
For simplicity, F[OB does not support static fields and methods, nor does
it take the “classes as objects” view discussed in the previous section.
Here is a rough sketch of the interpreter. This interpreter is not complete, but
it gives a general idea of what one should look like.
This code sketch for the interpreter does a nice job of illustrating the roles of
This and Super. We only substitute for this when we send a message. That’s
because This is a dynamic construct, and we don’t know what it will be until
runtime (see the discussion of dynamic dispatching in Section 5.1.6). On the
other hand, Super is a static construct, and is known at the time we write the
code, which allows us to substitute for Super as soon as the Class expression
is evaluated.
Let pointClass =
Function -> Let super = (Function -> {}) {} In {
inst = {
x = Ref 3;
y = Ref 4
};
meth = {
magnitude = Function this -> Function ->
sqrt ((!(this.inst.x)) + (!(this.inst.y)));
setx = Function this -> Function newx ->
(this.inst.x) := newx;
sety = Function this -> Function newy ->
(this.inst.y) := newy
}
}
In Let colorPointClass =
Function -> Let super = pointClass {} In {
inst = {
x = Ref 3;
y = Ref 4;
color = Ref ({inst = {}; meth = {
red = Function this -> 45;
green = Function this -> 20;
blue = Function this -> 20
}})
};
meth = {
magnitude = Function this -> Function ->
mult ((super.meth.magnitude) this {})
((this.meth.brightness) this {});
brightness = Function this -> Function ->
(((!(this.inst.color)).meth.red) this) +
(((!(this.inst.color)).meth.green) this) +
(((!(this.inst.color)).meth.blue) this);
setx = Function this -> Function newx ->
(super.meth.setx) this newx;
sety = Function this -> Function newy ->
(super.meth.sety) this newy;
setcolor = Function this -> Function c ->
(this.inst.color) := c
}
} In
sqrt is not so easily defined. Since we're more concerned with the behavior of
the objects than with numerical accuracy, we can just write a dummy sqrt
function that returns its argument:
Let sqrt = Function x -> x In
Now, try running it with the F[SR file-based interpreter. Our dummy sqrt
function returns 7 for the point version of magnitude, and the colorPoint
magnitude multiplies that result by the sum of the brightness (85 in this case).
The result is
$ FbSR fbobFBsr.fbsr
==> 595
After writing a Caml version of toFbSR for the abstract syntax, an F[OB
compiler is trivially obtained by combining toFbSR with the functions defined
in Chapter 8.
Finally, there are several other ways to handle these kinds of encodings.
More information about encoding objects can be found in [10].
Chapter 6
Type Systems
If we define a function f expecting an integer argument
and then call (f "abc"), the result is a runtime type error. Similarly, a
Smalltalk expression that sends an object a message it does not support
results in a “message not supported” exception when run. Both of these runtime
errors could have been detected before runtime if the languages supported static
type systems.
C++ code that indexes beyond the bounds of an array
executes unknown and potentially harmful effects, but the equivalent Java code
will throw an ArrayIndexOutOfBoundsException at runtime, because array
access is checked by a dynamic type system.
These are just a few examples of the kinds of problems that type systems
are designed to address. In this chapter we discuss such type systems, as well
as algorithms to infer and to check types.
There are also several newer dimensions of types that are currently active re-
search areas.
• Effect types: the type “int -x,y-> int” indicates that variables x and
y will be assigned to in this function. Java's throws clauses for methods are
a form of effect types.
• Concrete class analysis: for variable x:Point, a concrete class analysis
produces a set such as {Point, ColorPoint, DataPoint}. This means at
runtime x could either be a Point, a ColorPoint, or a DataPoint (and,
nothing else). This is useful in optimization.
• Typed Assembly Language [3, 18]: put types on assembly-level code and
have a type system that guarantees no unsafe pointer operations.
• Logical assertions in types: int -> { x:int | odd(x) } for a function
returning odd numbers.
#include <iostream>
class Calculation {
public: virtual int f(int x) { return x; }
};
class Person {
public: virtual char *getName() { return "Mike"; }
};
int main() {
  // hypothetical reconstruction, in the spirit of the Java version below:
  // an unchecked cast lets us invoke getName on an object that lacks it
  Calculation *c = new Calculation();
  Person *p = (Person *)c;
  std::cout << p->getName() << std::endl;
  return 0;
}
The code compiles with no errors, but when we run it the output is “ã¿.”
But if we compile it with optimization, the result is “Ãà.” Run it on a different
computer and it may result in a segmentation fault. Use a different compiler,
and the results may be completely exotic and unpredictable. The point is that
because we’re working with an untyped language, there are no dynamic checks
in place, and meaningless code like this results in undefined behavior.
Contrast this behavior with that of the equivalent piece of Java code. Recall
that Java’s dynamic type system checks the type of the object when a cast is
made. We will use approximately the same code:
class Calculation {
public int f(int x) { return x; }
}
class Person {
public String getName() { return "Mike"; }
}
class Main {
public static void main(String[] args) {
Object o = new Calculation();
System.out.println(((Person) o).getName());
}
}
At the other end of the spectrum, every subexpression and its identifier
must be decorated with its type. This is too extreme, however, and makes the
language unusable. Instead of writing
Function x -> x + 1,
we would have to decorate every subexpression with its type, writing something
like (Function x:Int -> ((x:Int + 1:Int):Int)) : Int -> Int.
For TF[ and TF[SRX, we will concentrate on the C and Pascal view of
explicit type information. We specify function argument and return types, and
declared variable types, and allow the rest to be inferred.
We should also illustrate the difference between type checking and type
inference. In any typed language, the compiler should typecheck the program
before generating code. Type inference algorithms infer types and check
that the program body has no type errors. Caml is an example of this.
A type checker generally just checks the body is well-typed given the types
listed on declarations. This is how C, C++, and Java work, although technically
they are also inferring some types, such as the type of 3+4.
In any case it must be possible to run the type inference or type checking
algorithm quickly. Caml in theory can take exponential time to infer types, but
in practice it is linear.
Type Systems
Type systems are rule-based formal systems that are similar to operational
semantics. Type systems rigorously and formally specify which programs have
which types, and have a strong and deep parallel with formal logic (recall our
discussion of Russell’s Paradox in Section 2.3.5). Type systems are generally a
set of rules about type assertions.
The expressions of TF[ are almost identical to those of F[, except that we
must explicitly decorate functions with type information about the argument.
For example, in the concrete syntax we write
Function x:Int -> x + 1
(Hypothesis)
Γ ` x : τ for Γ(x) = τ
(Int)
Γ ` n : Int for n an integer
(Bool)
Γ ` b : Bool for b True or False
112 CHAPTER 6. TYPE SYSTEMS
Γ ` e : Int, Γ ` e' : Int
(+)
Γ ` e + e' : Int
Γ ` e : Int, Γ ` e' : Int
(-)
Γ ` e - e' : Int
Γ ` e : Int, Γ ` e' : Int
(=)
Γ ` e = e' : Bool
These rules are fairly straightforward. For addition and subtraction, the
operands must typecheck to Ints, and the result of the expression is an Int.
Equality is similar, but typechecks to a Bool. Note that equality will only
typecheck with integer operands, not boolean ones. The And, Or, and Not rules
are similar, and their definition should be obvious.
The If rule is a bit more complicated. Clearly, the conditional part of
the expression must typecheck to a Bool. But what about the Then and Else
clauses? Since the value of the whole expression may come from either branch,
both branches must be given the same type, as the following rule requires.
Γ ` e : Bool, Γ ` e' : τ, Γ ` e'' : τ
(If)
Γ ` If e Then e' Else e'' : τ
We have now covered all the important rules except for functions and appli-
cation. The Function rule is a bit different from other rules, because functions
introduce new variables. The type of the function body depends on the type
of the variable itself, and will not typecheck unless that variable is in Γ, the
type environment. To represent this in our rule, we need to perform the type
assertion with the function’s variable appended to the type environment. We
do this in the following way.
Γ, x : τ ` e : τ'
(Function)
Γ ` (Function x:τ -> e) : τ -> τ'
Notice the use of the type constructor -> to represent the type of the entire
function expression. This type constructor should be familiar, as the Caml type
system uses the same notation. In addition, the Function rule includes the
addition of an assumption to Γ. We assume the function argument, x, is of type
τ , and add this assumption to the environment Γ to derive the type of e. The
Application rule follows from the Function rule:
Γ ` e : τ -> τ', Γ ` e' : τ
(Application)
Γ ` e e' : τ'
we then have
` f 5 True : Int
because, by the Application rule,
` f : Int -> Bool -> Int (which we derived above)
` 5 : Int by the Int rule
and thus
` f 5 : Bool -> Int by the Application rule.
Given this and
` True : Bool by the Bool rule
we can get
` f 5 True : Int by the Application rule.
Now that we have a type system for the TF[ language, what can we do
with it? We can detect whether or not our programs are well-typed, but what
does it mean if they are? The answer comes in the form of the following type
soundness theorem.
Lemma 6.1. typecheck faithfully implements the TF[ type system.
The TF[SRX language includes just about every piece of syntax we have used
up to now, except for F[OB's classes and objects and F[V's variants. Here is
the abstract syntax defined in terms of a Caml type.
type expr =
Var of ident
| Function of ident * dtype * expr
| Letrec of ident * ident * dtype * expr * dtype * expr
| Appl of expr * expr
| Plus of expr * expr | Minus of expr * expr
| Equal of expr * expr | And of expr * expr
| Or of expr * expr | Not of expr
| If of expr * expr * expr | Int of int | Bool of bool
| Ref of expr | Set of expr * expr | Get of expr
| Cell of int | Record of (label * expr) list
| Select of label * expr | Raise of expr * dtype
| Try of expr * string * ident * expr
| Exn of string * expr
and dtype =
Int | Bool | Arrow of dtype * dtype
| Rec of (label * dtype) list | Rf of dtype | Ex of dtype
Next, we will define the type rules for TF[SRX. All of the TF[ type rules
apply, and so we can move directly to the more interesting rules. Let’s begin
by tackling recursion. What we’re really typing is the In clause, but we need to
ensure that the rest of the expression is also well-typed.
Γ, f : τ -> τ', x : τ ` e : τ', Γ, f : τ -> τ' ` e' : τ''
(Let Rec)
Γ ` (Let Rec f x:τ = e:τ' In e') : τ''
Γ ` e1 : τ1 , . . . , Γ ` en : τn
(Record)
Γ ` {l1 = e1 ; . . . ; ln = en } : {l1 : τ1 ; . . . ; ln : τn }
Γ ` e : {l1 : τ1 ; . . . ; ln : τn }
(Projection)
Γ ` e.li : τi for 1 ≤ i ≤ n
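For instance (our own small example), Γ ` {x = 3 + 4; b = True} : {x:Int; b:Bool} by the Record rule, and then Γ ` {x = 3 + 4; b = True}.x : Int by the Projection rule.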
We’ll also need to be able to type side effects. We can type Ref expressions
with the special type τ Ref. Set and Get expressions easily follow.
Γ ` e : τ
(Ref)
Γ ` Ref e : τ Ref
Γ ` e : τ Ref, Γ ` e' : τ
(Set)
Γ ` e := e' : τ
Γ ` e : τ Ref
(Get)
Γ ` !e : τ
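For instance, ` Ref 5 : Int Ref by the Ref rule, so ` !(Ref 5) : Int by the Get rule; and if Γ ` x : Int Ref, then Γ ` (x := !x + 1) : Int by the Get, +, and Set rules.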
Finally, the other kind of side effects we need to type are exceptions. To type
an exception itself, we will simply use the type Exn. This allows all exceptions
to be typed in the same way. For example, code such as #MyExn 3 will
typecheck, and has type Exn. We do this to allow maximum flexibility.
Because they alter the flow of evaluation, Raise expressions should always
typecheck, provided the argument typechecks to an Exn type. It is difficult
to know what type to give a raise expression, though. Consider the following
example (an illustration of the issue):
If b Then 3 Else Raise (#Ex 5)
Assuming b has type Bool, this expression should typecheck to type Int. From
our If rule, however,
we know that the Then and Else clause must have the same type. We infer,
therefore, that the Raise expression must have type Int for the If to typecheck.
In Section 6.6.2 we see how to handle this inference automatically. For now, we
will simply type Raise expressions with the arbitrary type τ . Note that this is
a perfectly valid thing for a type rule to do, but it is difficult to implement in
an actual typechecker.
Next, notice that the With clause of the Try expression is very much like a
function. Just as we did with functions, we will need to decorate the identifier
with type information as well. However, as we see below, this decoration can
be combined with the final kind of type decoration, which we will discuss now.
Consider the expression
The type of this expression is clearly Int. But suppose the example were mod-
ified a bit.
This expression will also type to Int. But suppose we were to evaluate the
expression using our operational semantics for exceptions. When the exception
is raised, False will be substituted for x, which could cause a runtime type
error. The problem is that our operational semantics is ignorant of the type of
the exception argument.
We can solve this problem without changing the operational semantics, how-
ever. Suppose, instead of writing #Ex False, we wrote #Ex@Bool False. The
@Bool would be used by the type rules to verify that the argument is indeed
a Bool, and the interpreter will simply see the string “#Ex@Bool”, which will
be used to match with the With clause. This also eliminates the need for type
decoration on the With clause identifier, since it serves the same purpose. In
a sense, this is very much like overloading a method in Java or C++. When a
method is overloaded, the type and the method name are needed to uniquely
identify the correct method. Our final exception syntax looks like this:
This expression typechecks and has type Int. When evaluated, the result is
Raise #Ex@Bool False, i.e. the exception is not caught by the With clause.
This is the behavior we want.
Now that we’ve got a type-friendly syntax worked out, let’s move on to the
actual type rules. They are fairly straightforward.
Γ ` e : τ
(Exception)
Γ ` #xn@τ e : Exn
Γ ` e : Exn
(Raise)
Γ ` (Raise e) : τ for arbitrary τ
Γ ` e : τ, Γ, x : τ' ` e' : τ
(Try)
Γ ` (Try e With #xn@τ' x -> e') : τ
Therefore, by the Try rule, we deduce the type Int for the original expression.
Exercise 6.2. Why are there no type rules for cells?
Exercise 6.3. How else could we support recursive functions in TF[SRX with-
out using Let Rec, but still requiring that recursive functions properly type-
check? Prove that your solution typechecks.
Exercise 6.4. Attempt to type some of the untyped programs we have studied
up to now, for example, the Y-combinator, Let, sequencing abbreviations, a
recursive factorial function, and the encoding of lists. Are there any that can
not typecheck at all?
Exercise 6.5. Give an example of a non-recursive F[SR expression that eval-
uates properly to a value, but does not typecheck when written in TF[SRX.
6.5 Subtyping
The type systems that we covered above are reasonably adequate, but there are
still many types of programs that have no runtime errors that will nonetheless
not typecheck. The first extension to our standard type systems that we would
like to consider is what is known as subtyping. The main strength of subtyping
is that it allows record and object polymorphism to typecheck.
Subtypes should already be a familiar concept from Java and C++. Sub-
classes are subtypes, and extending or implementing an interface gives a sub-
type.
6.5.1 Motivation
Let us motivate subtypes with an example. Consider a function (our illustration)
Function x:{l:Int} -> x.l
This function takes as an argument a record with field l of type Int. In the
untyped F[R language the record passed into the function could also include
other fields besides l, and the call
(Function x:{l:Int} -> x.l) {l=4; m=6}
would generate no run-time errors. However, this would not type-check by our
TF[SRX rules: the function argument type is different from the type of the
value passed in.
The solution is to re-consider record types such as {m:Int; n:Int} to mean
a record with at least the m and n fields of type Int, but possibly other fields as
well, of unknown type. Think about the previous record operations and their
types: under this interpretation of record typing, the Record and Projection
rules both still make sense. The old rules are still sound, but we need a new
rule to reflect this new understanding of record types:
Γ ` e : {l1 : τ1 ; . . . ; ln : τn }
(Sub-Record')
Γ ` e : {l1 : τ1 ; . . . ; lm : τm } for m < n
This rule is valid, but it’s not as good as we could do. To see why, consider
another example,
Here the function f should, informally, take a record with at least x and y
fields, but also should accept records where additional fields are present. Let us
try to type the function F .
The Sub-Record' rule is of no help here either. What we need is a rule that says
that a function with a record type argument may have fields added to its record
argument type, as those fields will be ignored:
Γ ` e : {l1 : τ1 ; . . . ; ln : τn } -> τ
(Sub-Function')
Γ ` e : {l1 : τ1 ; . . . ; ln : τn ; . . . ; lm : τm } -> τ
Using this rule, F G will indeed typecheck. The problem is that we still need
other rules. Consider records inside of records:
should still be a valid typing since the y field will be ignored. However, there is
no type rule allowing this typing either.
Γ ` e : τ, ` τ <: τ'
(Sub)
Γ ` e : τ'
We also need to make our subtyping operator reflexive and transitive. This
can be accomplished with the following two rules.
(Sub-Refl)
` τ <: τ
` τ <: τ', ` τ' <: τ''
(Sub-Trans)
` τ <: τ''
Our rule for subtyping records needs to do two things. It needs to ensure
that if a record B is the same as record A with some additional fields, then B
is a subtype of A. It also needs to handle the case of records within records. If
B’s fields are all subtypes of A’s fields, then B should also be a subtype of A.
We can reflect this concisely in a single rule as follows.
` τ1 <: τ1', . . . , τn <: τn'
(Sub-Record)
` {l1 : τ1 ; . . . ; ln : τn ; . . . ; lm : τm } <: {l1 : τ1' ; . . . ; ln : τn' }
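For example, ` {x:Int; y:Int; z:Bool} <: {x:Int; y:Int} by this rule (using Sub-Refl for each shared field), so a record with an extra z field may be used wherever the two-field record type is expected.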
The function rule must also do two things. If functions A and B are equiv-
alent except that B returns a subtype of what A returns, then B is a subtype
of A. However, if A and B are the same except that B’s argument is a subtype
of A’s argument, then A is a subtype of B. Simply put, for a function to be a
subtype of another function, it has to take less and give more. The rule should
make this clear:
` τ1' <: τ1, ` τ2 <: τ2'
(Sub-Function)
` τ1 -> τ2 <: τ1' -> τ2'
From our discussions and examples in the previous section, it should be clear
that this more general set of rules will work.
Γ ` e : τ -> τ', Γ ` e' : τ
(Application)
Γ ` e e' : τ'
Principality means that no other typing will let more uses of the program
typecheck, so the principal typing will always be best. The desired property of
our type inference algorithm is that it will always infer principal types.
EF[ types during inference are going to include an extra set of constraining
equations, E, which constrain the behavior of the type variables. Type judg-
ments for EF[ are thus of the form Γ ` e : τ \E, the same as before but tacking
a set of equations on the side. Each member of E is an equation like 'a = 'c.
Equational types will be used to aid inference. Here is an outline of the overall
approach.
It is a fact that if there are no inconsistencies in the equations, they can always
be simplified to give an equation-free type.
Definition 6.4. (Equational Type) An equational type is a type of the form
is an equational type. If you think about it, this is really the same as the type
which cannot be a type since it implies that functions and booleans are the
same type. Such equation sets are deemed inconsistent, and will be equated
with failure of the type inference process. There are also possibilities for circular
(self-referential) types that don’t quite look inconsistent:
Caml disallows such types, and we will also disallow them initially. These types
can’t be simplified away, and that is the main reason why Caml disallows them:
users of the language would have to see some type equations.
(Hypothesis)
Γ ` x : τ \∅ for Γ(x) = τ
(Int)
Γ ` n : Int\∅ for n an integer
(Bool)
Γ ` b : Bool\∅ for b a boolean
The rules for +, -, and = also look similar to their TF[ counterparts, but
now we must take the union of the equations of each of the operands to be the
set of equations for the type of the whole expression, and add an equation to
reflect the type of the operands.
Γ ` e : τ \E, Γ ` e' : τ' \E'
(+)
Γ ` e + e' : Int\E ∪ E' ∪ {τ = Int, τ' = Int}
Γ ` e : τ \E, Γ ` e' : τ' \E'
(-)
Γ ` e - e' : Int\E ∪ E' ∪ {τ = Int, τ' = Int}
Γ ` e : τ \E, Γ ` e' : τ' \E'
(=)
Γ ` e = e' : Bool\E ∪ E' ∪ {τ = Int, τ' = Int}
The And, Or, and Not rules are defined in a similar way. The rule for If is
also similar to the TF[ If rule. Notice, though, that we do not immediately
infer a certain type like we did in the previous rules. Instead, we infer a type
'd, and equate 'd to the types of the Then and Else clauses.
Γ ` e : τ \E, Γ ` e' : τ' \E', Γ ` e'' : τ'' \E''
(If)
Γ ` (If e Then e' Else e'') : 'd\E ∪ E' ∪ E'' ∪ {τ = Bool, τ' = τ'' = 'd}
Finally, we are ready for the function and application rules. Functions no
longer have explicit type information, but we may simply choose a type 'a as
the function argument, and include it in the equations later. The application
rule also picks an 'a type, and adds an equation with 'a as the right hand side
of a function type. The rules should make this clear.
Γ, x : 'a ` e : τ \E
(Function)
Γ ` (Function x -> e) : 'a -> τ \E
Γ ` e : τ \E, Γ ` e' : τ' \E'
(Application)
Γ ` e e' : 'a\E ∪ E' ∪ {τ = τ' -> 'a}
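To see these rules in action, consider typing Function x -> x + 1 (our own small example). Choosing 'a for x, the Hypothesis rule gives x : 'a\∅ and the Int rule gives 1 : Int\∅, so the + rule types the body as Int\{'a = Int, Int = Int}; the Function rule then yields

Γ ` (Function x -> x + 1) : 'a -> Int\{'a = Int, Int = Int}

and solving the (consistent) equations produces the expected type Int -> Int.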
These rules almost directly define the equational type inference procedure:
the proof can pretty much be built from the bottom (leaves) on up. Each
equation added denotes two types that should be equal.
Once all equations have been generated, we check for any inconsistent equations,
such as Int = Bool, that denote type errors.
The following algorithm computes the equational closure of a set E.

1. For each equation of the form τ0 -> τ0' = τ1 -> τ1' in E, add τ0 = τ1 and τ0' = τ1' to E.

2. For each pair of equations τ0 = τ1 and τ1 = τ2 in E, add the equation τ0 = τ2 to E (by transitivity).

3. Repeat (1) and (2) until no more equations can be added to E.
Note that we will implicitly use the symmetric property on these equations, and
so there is no need to add τ1 = τ0 for every equation τ0 = τ1 .
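Here is a minimal OCaml sketch of this closure computation and the consistency check. The type representation (TVar, TInt, TBool, TArrow) is an assumption of the sketch, not an FbDK definition, and the symmetric variants that the text keeps implicit are materialized explicitly.

type ty = TVar of string | TInt | TBool | TArrow of ty * ty

module EqSet = Set.Make (struct
  type t = ty * ty
  let compare = compare
end)

(* one pass of rules (1) and (2) over a symmetrized equation set *)
let step (e : EqSet.t) : EqSet.t =
  let e = EqSet.fold (fun (a, b) acc -> EqSet.add (b, a) acc) e e in
  EqSet.fold
    (fun (t1, t2) acc ->
      let acc =
        match (t1, t2) with
        | TArrow (a, b), TArrow (a', b') ->
            EqSet.add (a, a') (EqSet.add (b, b') acc) (* rule (1) *)
        | _ -> acc
      in
      (* rule (2): t1 = t2 and t2 = u2 give t1 = u2 *)
      EqSet.fold
        (fun (u1, u2) acc -> if t2 = u1 then EqSet.add (t1, u2) acc else acc)
        e acc)
    e e

let rec closure (e : EqSet.t) : EqSet.t =
  let e' = step e in
  if EqSet.equal e e' then e else closure e'

(* inconsistency: the closure equates distinct type constructors *)
let inconsistent (e : EqSet.t) : bool =
  EqSet.exists
    (fun pair ->
      match pair with
      | (TInt, TBool) | (TBool, TInt)
      | (TArrow _, (TInt | TBool)) | ((TInt | TBool), TArrow _) -> true
      | _ -> false)
    (closure e)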
The closure serves to uncover inconsistencies. For instance, the closure of {'a = Int, 'a = Bool} contains Int = Bool (by symmetry and transitivity), an immediate inconsistency.
If the equations are consistent, the next step is to solve the equational constraints. We do this by substituting type variables with actual types. The algorithm is as follows. Given τ\E,

1. Pick any equation in E of the form 'a = τ' or τ' = 'a. Remove it from E, and replace every occurrence of 'a in τ and in the remaining equations of E with τ'.

2. Repeat (1) until no equations involving type variables remain; the resulting τ is the equation-free type.
Notice that step (1) considers the symmetric equivalent of each equation,
which is why we didn’t include them in the closure. The algorithm has a flaw
though: the replacements may continue forever. This happens when E contains
a circular type. Recall the example of a self-referential type such as 'a\{'a = 'a -> 'b}: replacing 'a re-introduces 'a, so the replacements continue forever. The solution is to check for such cycles before trying to solve the equations. The best way to do this is to phrase the problem in a graph-theoretic context. Specifically, we define a directed graph G in which the nodes are the type variables in E. There is a directed edge from 'a to 'b if 'a = τ is an equation in E and 'b occurs in τ.
We raise a typeError if there is a cycle in G for which there is at least one
edge representing a constraint that isn’t just between type variables (’a = ’b).
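A matching OCaml sketch of the solution phase, reusing the ty type from the previous sketch; in place of the graph-based cycle test just described, it performs a roughly equivalent occurs check at substitution time.

let rec occurs (v : string) (t : ty) : bool =
  match t with
  | TVar w -> v = w
  | TArrow (a, b) -> occurs v a || occurs v b
  | TInt | TBool -> false

let rec subst (v : string) (t : ty) (u : ty) : ty =
  match u with
  | TVar w when w = v -> t
  | TArrow (a, b) -> TArrow (subst v t a, subst v t b)
  | _ -> u

let rec solve (t : ty) (eqs : (ty * ty) list) : ty =
  match eqs with
  | [] -> t
  | (TVar v, u) :: rest | (u, TVar v) :: rest ->
      if u = TVar v then solve t rest (* drop trivial 'a = 'a *)
      else if occurs v u then failwith "typeError: circular type"
      else
        solve (subst v u t)
          (List.map (fun (a, b) -> (subst v u a, subst v u b)) rest)
  | _ :: rest -> solve t rest (* non-variable equation; closure checked it *)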
In summary, our entire EF[ type inference algorithm is as follows. For expression e,

1. Produce a proof of Γ ⊢ e : τ\E using the above type rules.

2. Compute the closure of E.

3. If the closure contains an immediate inconsistency such as Int = Bool, or the graph G has a cycle with at least one edge that isn't just between type variables, raise a typeError.

4. Otherwise, solve the equations as described above, and output the resulting equation-free type for e.
Theorem 6.2. The typings produced by the above algorithm are always princi-
pal.
As an example of the algorithm at work, suppose inference for some expression has produced the type 'c with the closed equation set

{'a = Bool, Int = 'b, 'a -> 'b = Bool -> 'c, 'b = 'c, Int = 'c}

The set is not immediately inconsistent, and does not contain any cycles. Therefore, we solve the equations. In this case, τ contains only 'c, and we can replace 'c with Int. We output Int as the type of the expression, which is clearly correct.
Consider now a program that uses a function polymorphically, something along the lines of

Let x = Function y -> y In (If (x True) Then (x 0) Else (x 0))

When we compute the closure of its equational type, we get the equation Int = Bool! What went wrong? The problem in this case is that each use of x in the body used the same type variable 'a. In fact, when we type Function y -> y, we know that 'a can be anything, so for different uses, 'a can be different things. We need to build this intuition into our type system to correctly handle cases like this. We define such a type system in PEF[, which is EF[ with Let and Let-polymorphism.
PEF[ has a special Let typing rule, in which we allow a new kind of type
in Γ: ∀’a1 . . . ’an . τ . This is called a type schema, and may only appear in Γ.
An example of a type schema is ∀’a. ’a -> ’a. Note that the type variables
’a1 . . . ’an are considered to be bound by this type expression.
The new Let typing first types the definition, solves its equations, and generalizes the type variables remaining in the solution into a schema ∀'a1 . . . 'an. τ' under which the body is typed. Each use of the Let-bound variable then instantiates the schema:

(Let Inst.)
Γ, x : ∀'a1 . . . 'an. τ' ⊢ x : R(τ')\∅

where R(τ') is a renaming of the variables 'a1 . . . 'an to fresh names. Since
these names are fresh each time x is used, the different uses won’t conflict like
above.
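A sketch of the renaming R, reusing ty and subst from the sketches above; Forall, counter, and fresh are illustrative names, not fixed syntax.

type schema = Forall of string list * ty

let counter = ref 0
let fresh () = incr counter; "'a" ^ string_of_int !counter

(* instantiate ∀'a1...'an. τ by renaming each bound variable freshly *)
let instantiate (Forall (vars, t)) : ty =
  List.fold_left (fun u v -> subst v (TVar (fresh ())) u) t vars

Two calls to instantiate on the same schema yield, say, 'a1 -> 'a1 and 'a2 -> 'a2, so different uses can never interfere.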
It will help to see an example of this type system in action. Let's type the example program from above. For the definition we have

Γ ⊢ (Function y -> y) : 'a -> 'a\∅

This constraint set trivially has the solution type 'a -> 'a. Thus, we then typecheck the Let body under the assumption that x has type ∀'a.'a -> 'a. Similarly, each use of x instantiates this schema afresh: the first use may receive type 'b -> 'b, while a later use receives 'd -> 'd. The important point here is that this later use of x gets a different type variable, 'd, by the Let-Inst rule. Putting the two together, the first use generates an equation such as 'b = Bool and the second one such as 'd = Int. Since 'b and 'd are different variables, we don't get the conflict we got previously.
F[ is not the best system to show off the power of replacing equality with
subtyping. Since the language does not have records, there is not any interesting
subtyping that could happen. To show the usefulness of subtyping, we thus
define the constraints in an environment where we have records, F[R. F[R plus
constraints is CF[R. We can contrast CF[R with the EF[R language, which we did not study but which is simply EF[ with support for records. Instead of types τ\E for a set of equations E, CF[R has types

τ\C

where C is a set of subtyping constraints, each of the form τ <: τ'.
CF[R has the following set of type rules. These are direct generalizations of the EF[ rules, replacing = by <:; the <: is always in the direction of information flow.
(Hypothesis)
Γ ⊢ x : τ\C for Γ(x) = τ

(Int)
Γ ⊢ n : Int\∅ for n an integer

(Bool)
Γ ⊢ b : Bool\∅ for b a boolean

Γ ⊢ e : τ\C, Γ ⊢ e' : τ'\C'
(+)
Γ ⊢ e + e' : Int\C ∪ C' ∪ {τ <: Int, τ' <: Int}

Γ ⊢ e : τ\C, Γ ⊢ e' : τ'\C'
(-)
Γ ⊢ e - e' : Int\C ∪ C' ∪ {τ <: Int, τ' <: Int}

Γ ⊢ e : τ\C, Γ ⊢ e' : τ'\C'
(=)
Γ ⊢ e = e' : Bool\C ∪ C' ∪ {τ <: Int, τ' <: Int}
Γ ⊢ e : τ\C, Γ ⊢ e' : τ'\C', Γ ⊢ e'' : τ''\C''
(If)
Γ ⊢ (If e Then e' Else e'') : 'd\C ∪ C' ∪ C'' ∪ {τ <: Bool, τ' <: 'd, τ'' <: 'd}
Γ, x : 'a ⊢ e : τ\C
(Function)
Γ ⊢ (Function x -> e) : 'a -> τ\C

Γ ⊢ e : τ\C, Γ ⊢ e' : τ'\C'
(Application)
Γ ⊢ e e' : 'a\C ∪ C' ∪ {τ <: τ' -> 'a}
The two rules we have not seen in EF[ are the Record and Projection rules.
There is nothing particularly special about these rules, however.
Γ ⊢ e1 : τ1\C1, . . . , Γ ⊢ en : τn\Cn
(Record)
Γ ⊢ {l1=e1; . . . ; ln=en} : {l1 : τ1; . . . ; ln : τn}\C1 ∪ . . . ∪ Cn

Γ ⊢ e : τ\C
(Projection)
Γ ⊢ e.l : 'a\{τ <: {l : 'a}} ∪ C
As with EF[, these rules almost directly define the type inference procedure
and the proof can pretty much be built from the bottom up.
The complete type inference algorithm is as follows. Given an expression e,

1. Produce a proof of Γ ⊢ e : τ\C using the above type rules.

2. Compute the closure of C, and check it for immediate inconsistencies and bad cycles, raising a typeError if any are found.

3. Otherwise, output τ\C as the (constrained) type of e.
The algorithms for computing the closure of C and doing cycle detection are
fairly obvious generalizations of the EF[ algorithms. Closure is computed as
follows.

1. For each pair of constraints τ0 <: τ1 and τ1 <: τ2 in C, add τ0 <: τ2 to C (by transitivity).

2. For each constraint τ0 -> τ0' <: τ1 -> τ1' in C, add τ1 <: τ0 and τ0' <: τ1' to C; note how the argument positions flip.

3. For each constraint {l1 : τ1; . . . ; ln : τn} <: {l : τ} in C with l = li for some i, add τi <: τ to C.

4. Repeat the above until no more constraints can be added to C.
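One pass of this closure can be sketched in OCaml as follows, again over the ty type from the earlier sketches (record constraints would need a record constructor in ty, omitted here); the pass is iterated to a fixed point as in the equational case.

let step (cs : (ty * ty) list) : (ty * ty) list =
  let decomposed =
    List.concat_map
      (function
        | (TArrow (t0, t0'), TArrow (t1, t1')) ->
            (* argument positions flip (contravariance); results do not *)
            [ (TArrow (t0, t0'), TArrow (t1, t1')); (t1, t0); (t0', t1') ]
        | c -> [ c ])
      cs
  in
  let transitive =
    List.concat_map
      (fun (a, b) ->
        List.filter_map
          (fun (b', c) -> if b = b' then Some (a, c) else None)
          cs)
      cs
  in
  List.sort_uniq compare (decomposed @ transitive)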
Unlike the equational case, however, we do not want to solve these constraints: any substitution proceeds with a possible loss of generality. Consider, for example, a constraint 'a <: τ, and
the possibility of substituting ’a with τ . This precludes the possibility that the
’a position be a subtype of τ , as the substitution in effect asserts the equality
of ’a and τ . In simpler terms, we need to keep the constraints around as part
of the type. This is the main weakness of constrained type systems; the types
include the constraints, and are therefore difficult to read and understand.
We have the same shortcomings as in the equational case at this point: there
is as of yet no polymorphism. The solution used in the equational case won’t
work here, as it required the constraints to be solved.
The solution is to create constrained polymorphic types

∀'a1 . . . 'an. τ\C
Concurrency
7.1 Overview
Concurrent execution can be loosely grouped into three implementation cat-
egories, in order of loosest to tightest coupling. Distributed computation is computation on multiple computers which share no memory and send messages to each other to communicate data. Even within this category there is a wide range of how tightly coupled the computers are. Grid computing is distributed computing where the Internet is the communication medium. Cluster computing is distributed computing over a fast LAN network. Massively Parallel Processing (MPP) involves specialized communication hardware for very high communication bandwidth.
Distributed shared memory is the case where different processes are running on
different processors but sharing some special memory via a memory bus. Lastly,
multithreaded computation is the case where multiple threads of execution share
a single memory which is local. Multithreaded computations may run on a
single (core) CPU, meaning the concurrent execution is an illusion achieved by
interleaving the execution steps of two threads, or if the computer has multiple
cores there can be true concurrent execution of a multithreaded program. The
Threads Wikipedia article clarifies how threads and processes differ.
All of the above models still support independent foci of control. That is the primary focus of our study in this chapter: it captures a wide range of models, and such models are the most elegant forms of concurrent architecture. There are several other models that we are not addressing. Vector processors are computers that can do an operation on a whole array in one step. These architectures used to be called SIMD (Single Instruction Multiple Data). Stream processors are the modern version of vector processors, which can work on more than just
arrays in parallel, for example sparse arrays. The GPGPU (General-Purpose Graphics Processing Unit) is a recent version of the stream processor which arose as a generalization of specialized graphics processors. Lastly, FPGAs (Field-Programmable Gate Arrays) let you create your own (parallel) logic circuitry on the fly.
The standard concurrency model these days for Java, C, etc. is multithreading. Multithreaded programming is coming on in a big way: newer CPUs have multiple cores and can run multiple threads simultaneously. Multithreaded programming unfortunately is also a disaster waiting to happen; the programs are just too hard to debug. The root of the problem is that there are too many possible interleavings of the different threads' accesses to the shared memory. Some of them may show up in testing, but many of them will only show up after deployment.
Race conditions may be avoided via the use of locks of various kinds. Monitors are regions of mutual exclusion in the source program: only one thread can execute a monitor block at any one time. They are similar to the synchronized keyword of Java. A semaphore is a low-level locking mechanism: grab a lock before a critical operation; block if someone else has the lock; once you have the lock you know you are the only one accessing; free it when you are done. General semaphores allow n threads to simultaneously access a critical region, not just one.
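For concreteness, here is what a simple critical region looks like with OCaml's own threads library (a sketch; it assumes the threads library is linked, and the names lock, counter, and increment are illustrative):

let lock = Mutex.create ()
let counter = ref 0

let increment () =
  Mutex.lock lock;           (* grab the lock; blocks if another thread holds it *)
  counter := !counter + 1;   (* the critical operation: at most one thread here *)
  Mutex.unlock lock          (* free it when done *)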
7.2.2 An Example
Before getting into the operational semantics let's do a simple example. Here is an actor that gets a start message and then counts down from its initial value to 0. Here, we use the term Y to represent the Y-combinator and the term _ to represent a value of no importance (as with the empty record {} before).
Function myaddr ->
  Y (Function this -> Function localdata -> Function msg ->
    Match msg With
      `main(n) -> myaddr <- `count(n); this(_)
    | `count(n) ->
        If n = 0
        Then this(_)
        Else
          myaddr <- `count(n-1);
          this(_) /* set to respond to next message */
  )
Here is a code fragment that another actor could use to fire up a new actor with the above behavior and get it started. Suppose the above code is abbreviated CountTenBeh.

Let x = Create(CountTenBeh, _) /* _ is the localdata - unused */
In x <- `main(10)
Here is an alternative way to count down, where the localdata field holds the value and it's not in the message.

Function myaddr ->
  Y (Function this -> Function localdata -> Function msg ->
    Match msg With
      `count(_) ->
        If localdata = 0
        Then _
        Else
          myaddr <- `count(_);
          this(localdata - 1) /* set to respond to next msg */
  )
Suppose the above code is abbreviated CountTenBeh2; using it is then

Let x = Create(CountTenBeh2, 10) /* 10 is the localdata */
In x <- `count(_)
The latter example is the correct way to give actors local data – in the former
example the counter value had to be forwarded along every message.
Here is another usage fragment for the first example:

Let x = Create(CountTenBeh, _)
In x <- `main(10); x <- `main(5)
In this case the actor x will, in parallel and independently, count down from 10 to 0 and from 5 to 0; these counts may also interleave in random ways. For the second example an analogue might be:
Let x = Create(CountTenBeh2, 10)
In x <- `count(_); x <- `count(_)
This does nothing but queue up one more count message: since the actor sends itself a new count message each time it receives one, until reaching 0, the effect will be to have one leftover message at the end.
Actor systems can run forever; there is no notion of a final value. So, the system takes small steps of computation:

G1 → G2 → G3 → . . .

Each → indicates one step of computation: in a single step, one actor in the soup completely processes one message from the soup. We will define this → relation below. →* is the reflexive, transitive closure of →, i.e., many steps of actor computation. In general this continues infinitely since actor systems may not terminate. The final meaning of a run of an actor system is this infinite stream of states.
Now let us look at the operational semantics rules for the local evaluation relation =⇒. We write e =⇒^S v to mean that e locally evaluates to v while producing the set of side effects S (the messages sent and actors created along the way). Most of the rules are very minor changes to the corresponding F[V rule; we just give the + rule to show how the existing F[V rules must be tweaked to add the potential for side effects S:
e1 =⇒^S v1, e2 =⇒^S' v2 where v1, v2 ∈ Z
(+ Rule)
e1 + e2 =⇒^(S ∪ S') the integer sum of v1 and v2
Since e1 or e2 above could in theory have created actors or sent messages, we need to append their effects to the final result. These effects are like state; they are on the side. A major difference with F[S is that the effects here are "write only": they don't change the direction of local computation in any way, they are only spit out. In that sense local actor computation stays functional.
Here is the send rule, where [a v] denotes the message v addressed to actor a:

e1 =⇒^S a, e2 =⇒^S' v
(Send Rule)
e1 <- e2 =⇒^(S ∪ S' ∪ {[a v]}) v
e1 =⇒^S v1, e2 =⇒^S' v2, v1 a v2 =⇒^S'' v3
(Create Rule)
Create(e1, e2) =⇒^(S ∪ S' ∪ S'' ∪ {⟨a, v3⟩}) a, for a a fresh actor name
This time the return result matters - it is the name of the new actor. The
running of v1 a v2 passes the actor its own name and its initial values to initialize
it, and so v1 will need to be a curried function of two arguments to accept these
parameters a and v2 (in fact it needs to be a curried function of three arguments,
because later the message will also be passed as a parameter; more on that very
soon).
(G ∪ {[a v']} ∪ {⟨a, v⟩}) → (G ∪ {⟨a, v''⟩} ∪ S) if v v' =⇒^S v''
This is the only global rule. It matches an actor with a message in the global
soup that is destined for it, uses the local semantics to run that actor (in isola-
tion), and throws back into the soup all of the S, which contains all the messages
sent by this one actor run as well as any new actors created by this one actor
run. A global actor run is just the repeated application of this rule. Notice how
the actor behavior, which was v, is changed to v'', the result of this run.
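Here is a small OCaml sketch of this global rule over a list-based soup. The local semantics is passed in as a parameter local_eval (run behavior v on message v', returning the new behavior and the effects S); all names here are illustrative, not part of the FbDK.

type name = string
type 'v elt = Actor of name * 'v | Message of name * 'v

(* one global step: deliver some message to its actor, if possible *)
let global_step
    (local_eval : 'v -> 'v -> 'v * 'v elt list)
    (soup : 'v elt list) : 'v elt list option =
  let rec find seen = function
    | Message (a, v') :: rest ->
        (match
           List.partition
             (function Actor (b, _) -> b = a | Message _ -> false)
             (List.rev_append seen rest)
         with
         | Actor (_, v) :: _, others ->
             let v'', effects = local_eval v v' in
             Some ((Actor (a, v'') :: effects) @ others)
         | _ -> find (Message (a, v') :: seen) rest)
    | e :: rest -> find (e :: seen) rest
    | [] -> None (* no deliverable message *)
  in
  find [] soup

Iterating global_step from an initial soup yields the stream G1 → G2 → . . . described above.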
To test these rules you can run the example programs above.
Compilation by Program
Transformation
The goal of this chapter is to understand the core concepts behind compilation
by writing a F[SR compiler. Compilers are an important technology because
code produced by a compiler is faster than interpreted code by several orders
of magnitude. At least 95% of the production software running is compiled
code. Compilation today is a very complex process: compilers make multiple
passes on a program to get source code to target code, and perform many
complex optimizing transformations. Our goals in this chapter are to understand
the most basic concepts behind compilation: how a high-level program can be
mapped to machine code.
We will outline a compiler of F[SR to a very limited subset of C (“pseudo-
assembly”). The reader should be able to implement this compiler in Caml by
filling in the holes we have left out. The compiler uses a series of program trans-
formations to express the compilation process. These program transformations
map F[SR programs to equivalent F[SR programs, removing high-level fea-
tures one at a time. In particular the following transformations are performed
in turn on a F[SR program by our compiler:
1. Closure conversion

2. A-translation

3. Function hoisting

4. Translation to C
• Function y -> 3 + y seems like the right thing, but it amounts to code
substitution, something a compiler can’t do since compiled code must be
immutable.
Since neither of these ideas work, something new is needed. The solution is to
return a closure, a pair consisting of the function and an environment which
remembers the values of any nonlocal variables for later use:
Function definitions are now closure definitions; to invoke such a function a new
process is needed. Closure conversion is a global program transformation
that explicitly performs this operation in the language itself. Function values
are defined to be closures, i.e. tuples of the function and an environment re-
membering the values of nonlocal variables. When invoking a function which is
defined as a closure, we must explicitly pass it the nonlocals environment which
is in the closure so it can be used to find the values of the nonlocals.
The translation is introduced by way of example: the inner Function y -> x + y in add above translates to the closure

{
  fn = Function yy -> (yy.envt.x) + (yy.arg);
  envt = { x = xx.arg }
}

Let us look at the details of the translation. Closures are defined as tuples in the form of records,

{ fn = ...; envt = ... }
consisting of the original function (the fn field) and the nonlocals environment
(the envt field), which is itself a record. In the nonlocals environment { x =
xx.arg }, x was a nonlocal variable in the original function, and its value is
remembered in this record using a label of the same name, x. All such nonlocal
variables are placed in the environment; in this example x is the only nonlocal
variable.
Functions that used to take an argument y are modified to take an argument
named yy (the original variable name, doubled up). We don’t really have to
change the name but it helps in understanding because the role of the variable
has changed: the new argument yy is expected to be a record of the form { envt
= ..; arg = ..}, passing both the environment and the original argument to
the function.
If yy is indeed such a record at function invocation, then within the body
we can use yy.envt.x to access what was a nonlocal variable x in the original
function body, and yy.arg to access what was the argument y to the function.
add’ = {
fn = Function xx -> {
fn = Function yy -> (yy.envt.x) + (yy.arg);
envt = { x = xx.arg }
};
envt = {}
}
The translation of the application add 3 is then

(add'.fn) { envt = add'.envt; arg = 3 }

So, we first pull out the function part of the closure, (add'.fn), and then pass it a record consisting of the environment add'.envt, also pulled from the closure, and the argument, 3. Translation of add 3 4 takes the result of the above, which should evaluate to a function closure { fn = ...; envt = ...}, and does the same trick to apply 4 to it:

Let add3' = (add'.fn) { envt = add'.envt; arg = 3 } In
  (add3'.fn) { envt = add3'.envt; arg = 4 }

and the result would be 7, the same as the original result, confirming the soundness of the translation in this case. In general, an application e1 e2 is converted along the lines of

Let c = e1' In (c.fn) { envt = c.envt; arg = e2' }

where e1' and e2' are the translated subexpressions and c is a fresh variable: at function call time, the remembered environment in the closure is passed to the function in the closure. Thus, for the add' 3 closure above, add3', when it is applied later to e.g. 7, the envt will know it is 3 that is to be added to 7.
One more level of nesting Closure conversion is even slightly more compli-
cated if we consider one more level of nesting of function definitions, for example

triadd = Function x -> Function y -> Function z -> x + y + z

The Function z needs to get x, and since that Function z is defined inside Function y, Function y has to be an intermediary, passing x along from the outermost function. Here is the translation.
triadd’ = {
fn = Function xx -> {
fn = Function yy -> {
fn = Function zz ->
(zz.envt.x) + (zz.envt.y) + (zz.arg);
envt = { x = yy.envt.x; y = yy.arg }
};
envt = { x = xx.arg }
};
envt = {}
}
Some observations can be made. The inner z function has nonlocals x and y, so both of them need to be in its environment. The y function doesn't directly use nonlocals, but it has nonlocal x because the function inside it, Function
z, needs x. So its nonlocals envt has x in it. Function z can get x into its
environment from y’s environment, as yy.envt.x. Thus, Function y serves as
middleman to get x to Function z.
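To summarize the transformation as a whole, here is a rough OCaml sketch of closure conversion over a toy AST. It is only a sketch under simplifying assumptions: variable shadowing is ignored, Record and Select stand in for the F[SR record forms, and the duplicated closure expression in the application case would be Let-bound in a real implementation.

type expr =
  | Var of string
  | Int of int
  | Plus of expr * expr
  | Function of string * expr
  | Appl of expr * expr
  | Record of (string * expr) list
  | Select of expr * string

(* free variables of an expression *)
let rec fv = function
  | Var x -> [ x ]
  | Int _ -> []
  | Plus (e1, e2) | Appl (e1, e2) -> fv e1 @ fv e2
  | Function (x, e) -> List.filter (fun y -> y <> x) (fv e)
  | Record fs -> List.concat_map (fun (_, e) -> fv e) fs
  | Select (e, _) -> fv e

(* rewrite a body: parameter x becomes p.arg, each nonlocal y becomes p.envt.y *)
let rec rewrite x nonlocals p e =
  let rw = rewrite x nonlocals p in
  match e with
  | Var y when y = x -> Select (Var p, "arg")
  | Var y when List.mem y nonlocals -> Select (Select (Var p, "envt"), y)
  | Var y -> Var y
  | Int n -> Int n
  | Plus (e1, e2) -> Plus (rw e1, rw e2)
  | Appl (e1, e2) -> Appl (rw e1, rw e2)
  | Function (q, b) -> Function (q, rw b)
  | Record fs -> Record (List.map (fun (l, e) -> (l, rw e)) fs)
  | Select (e, l) -> Select (rw e, l)

let rec clconv = function
  | Function (x, e) ->
      let nonlocals =
        List.sort_uniq compare (List.filter (fun y -> y <> x) (fv e))
      in
      let p = x ^ x in (* the doubled-up name, e.g. yy *)
      Record
        [ ("fn", Function (p, rewrite x nonlocals p (clconv e)));
          ("envt", Record (List.map (fun y -> (y, Var y)) nonlocals)) ]
  | Appl (e1, e2) ->
      let c = clconv e1 in
      Appl (Select (c, "fn"),
            Record [ ("envt", Select (c, "envt")); ("arg", clconv e2) ])
  | Plus (e1, e2) -> Plus (clconv e1, clconv e2)
  | Record fs -> Record (List.map (fun (l, e) -> (l, clconv e)) fs)
  | Select (e, l) -> Select (clconv e, l)
  | (Var _ | Int _) as e -> e

Applied to the triadd source above, this produces exactly the triadd' closure shown, with Function y acting as the middleman.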
8.2 A-Translation
Machine language programs are linear sequences of atomic instructions; at
most one arithmetic operation is possible per instruction, so many instructions
are needed to evaluate complex arithmetic (and other) expressions. The A-
translation closes the gap between expression-based programs and linear, atomic
instructions, by rephrasing expression-based programs as a sequence of atomic
operations. We represent this as a sequence of Let statements, each of which
performs one atomic operation.
The idea should be self-evident from the case of arithmetic expressions. Consider for instance

4 + (2 * (3 + 2))

The A-translation rephrases it as

Let v1 = 3 + 2 In
Let v2 = 2 * v1 In
Let v3 = 4 + v2 In
v3
This program should give the same result as the original since all we did was
to make the computation sequence more self-evident. Notice how similar this is
to 3-address machine code: it is a linear sequence of atomic operations directly
applied to variables or constants. The v1 etc variables are temporaries; in ma-
chine code they generally end up being assigned to registers. These temporaries
are not re-used (re-assigned to) above. Register-like programming is not possi-
ble in F[SR but it is how real 3-address intermediate language works. In the
final machine code generation temporaries are re-used (via a register allocation
strategy).
We are in fact going to use a more naive (but uniform) translation, one that also assigns constants and variables to temporaries:
Let v1 = 4 In
Let v2 = 2 In
Let v3 = 3 In
Let v4 = 2 In
Let v5 = v3 + v4 In
Let v6 = v2 * v5 In
Let v7 = v1 + v6 In
v7
The full A-translation will, as with the arithmetic example, do a full linearization of all operations. Consider, for example, (Function x -> Function y -> y) 4 2, which translates to:
Let v1 =
(Function x ->
Let v1’ = (Function y -> Let v1’’ = y in v1’’) In v1’)
In
Let v2 = 4 In
Let v3 = v1 v2 In
Let v4 = 2 In
Let v5 = v3 v4 In
v5
All forms of F[SR expression can be linearized in similar fashion, except If:

If (3 = x + 2) Then 3 Else 2 * x

is translated to

Let v1 = x + 2 In
Let v2 = (3 = v1) In
If v2 Then 3 Else Let v1 = 2 * x In v1
Compare this with the 3-address machine code it could then be compiled to:

v1 := x + 2
v2 := 3 = v1
BRANCH v2, L2
L1: v3 := 3
GOTO L3
L2: v4 := 2 * x
L3:
Rather than producing nested Lets directly, our translation will internally represent the linearized code as a list of (variable, expression) tuples,

[(v1,e1); ...; (vn,en)]

This is not quite F[SR syntax, but is a form easier to manipulate in Caml since lists of declarations will be appended together at translation time. When writing a compiler, the programmer may or may not want to use this intermediate form. It is not much harder to write the functions to work directly on the Let representation.
We now sketch the translation for the core primitives. Assume the following
auxiliary functions have been defined:
• The function letize which converts from the list-of-tuples form to the
actual Let form, and
• resultId, which for list [(v1,e1); ...; (vn,en)] returns result iden-
tifier vn.
(* ... *)
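The translation code itself is elided above; as a concrete illustration, here is a minimal OCaml sketch of letize, resultId, and the atrans clauses for a few core cases, over a small hypothetical expression type (the real F[SR AST has many more cases):

type ident = string
type expr =
  | Var of ident
  | Int of int
  | Plus of expr * expr
  | Let of ident * expr * expr

let counter = ref 0
let fresh () = incr counter; "v" ^ string_of_int !counter

(* resultId [(v1,e1); ...; (vn,en)] returns vn *)
let resultId tuples = fst (List.hd (List.rev tuples))

(* letize converts the list-of-tuples form to the actual Let form *)
let rec letize = function
  | [] -> failwith "empty tuple list"
  | [ (v, e) ] -> Let (v, e, Var v)
  | (v, e) :: rest -> Let (v, e, letize rest)

(* the naive translation: even constants and variables get tuples *)
let rec atrans (e : expr) : (ident * expr) list =
  match e with
  | Var _ | Int _ -> [ (fresh (), e) ]
  | Plus (e1, e2) ->
      let t1 = atrans e1 and t2 = atrans e2 in
      t1 @ t2 @ [ (fresh (), Plus (Var (resultId t1), Var (resultId t2))) ]
  | Let (x, e1, e2) ->
      let t1 = atrans e1 in
      t1 @ [ (x, Var (resultId t1)) ] @ atrans e2

let atranslate e = letize (atrans e)

Running atranslate on Plus (Int 4, Plus (Int 2, Plus (Int 3, Int 2))) yields the naive nested-Let form shown earlier, up to temporary names (and with * replaced by +, since this sketch only includes Plus).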
At the end of the A-translation, the code is all “linear” in the way it runs
in the interpreter, not as a tree. Machine code is also linearly ordered; we are
getting much closer to machine code.
Theorem 8.2. A-translation is sound, i.e. e and atrans(e) both either com-
pute to values or both diverge.
Although we have only partially defined A-translation, the extra syntax of
F[SR (records, reference cells) does not provide any major complication.
8.3 Function Hoisting

After these two phases, functions will have no nonlocal variables. Thus, we
can hoist all functions in the program body to the start of the program. This
brings the program structure more in line with C (and machine) code. Since our
final target is C, the leftover code from which all functions were hoisted then
is made the main function. A function hoist carries out this transformation.
Informally, the operation is quite simple: take e.g.

4 + ((Function x -> x + 1) 3)

and replace it by

Let f = (Function x -> x + 1) In 4 + (f 3)
In general, we hoist all functions to the front of the code and give them a
name via Let. The transformation is always sound if there are no free variables
in the function body, a property guaranteed by closure conversion. We will
define this process in a simple iterative (but inefficient) manner:
Definition 8.3 (Function Hoisting).
let hoist e =
if e = e1[(Function ea -> e’)/f] for some e1 with f free,
and e’ itself contains no functions
(i.e. Function ea -> e’ is an innermost function)
then
Let f = (Function ea -> e’) In hoist(e1)
else e
This function hoists out innermost functions first. If functions are not hoisted
out innermost-first, there will still be some nested functions in the hoisted defi-
nitions. So, the order of hoisting is important.
The definition of hoisting given above is concise, but it is too inefficient. A
one-pass implementation can be used that recursively replaces functions with
variables and accumulates them in a list. This implementation is left as an
exercise. Resulting programs will be of the form

Let f1 = Function x1 -> e1 In
Let f2 = Function x2 -> e2 In
. . .
Let fn = Function xn -> en In
e

where each of e, e1, . . . , en contains no function definitions.

Lemma 8.1. If all functions occurring in e are closed (as closure conversion guarantees), then hoist(e) is operationally equivalent to e.
So, the program is almost nothing but a collection of functions, with a body
that just invokes main. This brings the program closer to C programs, which are
nothing but a collection of functions and main is implicitly invoked at program
start.
Let Rec definitions also need to be hoisted to the top level; their treatment
is similar and will be left as an exercise.
8.4 Translation to C
We are now ready to translate into C. To summarize up to now, we have closure-converted, A-translated, and hoisted the program. We have done about all the translation that is possible within F[SR. Programs
are indeed looking a lot more like machine code: all functions are declared at
the top level, and each function body consists of a linear sequence of atomic
instructions (with exception of If which is a branch). There still are a few
things that are more complex than machine code: records are still implicitly
allocated, and function call is atomic, no pushing of parameters is needed. Since
C has function call built in, records are the only significant gap that needs to
be closed.
The translation involves two main operations.

1. Map each hoisted, closed function to a top-level C function declaration.

2. For each function body, map each atomic tuple to a primitive C statement.
Atomic Tuples Before giving the translation, we enumerate all possible right-
hand sides of Let variable assignments that come out of the A-translation (in
the following vi, vj, vk, and f are variables).These are called the atomic
tuples.
Fact 8.1 (Atomic Tuples). F[SR programs that have passed through the first
three phases have function bodies consisting of tuple lists where each tuple is of
one of the following forms only:
1. x for variable x
2. n for number n
3. b for boolean b
4. vi vj (application)
5. vj + vk
6. vj - vk
7. vj And vk
8. vj Or vk
9. Not vj
10. vj = vk
11. Ref vj
12. vj := vk
13. !vj
14. {l1=v1 ; . . . ; ln=vn }

15. vi.l
16. If vi Then tuples1 Else tuples2 where tuples1 and tuples2 are the
lists of variable assignments for the Then and Else bodies.
Functions should have all been hoisted to the top so there will be none
of those in the tuples. Observe that some of the record usages are from the
original program, and others were added by the closure conversion process. We
can view all of them as regular records. All we need to do now is generate code
for each of the above tuples.
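To give the flavor, here is a hedged OCaml sketch of the statement emission for a few of these tuples; the tuple constructors are hypothetical names for the forms above, and the casts follow the Word convention explained below:

(* hypothetical constructors for a few atomic tuple forms *)
type tuple =
  | TupVar of string            (* x *)
  | TupInt of int               (* n *)
  | TupPlus of string * string  (* vj + vk *)
  | TupAppl of string * string  (* vi vj *)

(* one atomic tuple becomes one C statement over Word variables *)
let toCTuple (v : string) (t : tuple) : string =
  match t with
  | TupVar x -> Printf.sprintf "%s = %s;" v x
  | TupInt n -> Printf.sprintf "%s = (Word) %d;" v n
  | TupPlus (vj, vk) ->
      Printf.sprintf "%s = (Word) ((int) %s + (int) %s);" v vj vk
  | TupAppl (f, a) ->
      (* f holds a function pointer; see the cast discussion below *)
      Printf.sprintf "%s = (*((FPtr) %s))(%s);" v f a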
Figure 8.1 illustrates the overall model of memory we are dealing with.
To elaborate a bit more on stack storage of variables: when a function is entered, its local variables are allocated at offsets from the stack pointer sp, and sp is bumped past them; when the function returns, sp moves back, reclaiming the space. Stack-stored entities are thus temporary in that they will be junk when the function/method returns.
Another important issue is whether to box or unbox values. A boxed value is held indirectly, as a pointer to heap memory containing the data; an unboxed value is held directly in the variable. Figure 8.2 illustrates the difference between boxed and unboxed values.
For multi-word entities such as arrays, storing them unboxed means variables
directly hold a pointer to the first word of the sequence of space. To clarify the
above concepts we review C’s memory layout convention. Variables may be
declared either as globals, register (the register directive is a request to put in
a register only), or on the call stack; all variables declared inside a function
are kept on the stack. Variables directly holding ints, floats, structs, and
arrays are all unboxed. (Examples: int x; float x; int arr[10]; snork x
for snork a struct.) There is no such thing as a variable directly holding a
function; variables in C may only hold pointers to functions. It is possible to
write “v = f” in C where f is a previously declared function and not “v = &f”,
but that is because the former is really syntactic sugar for the latter. A pointer
to a function is in fact a pointer to the start of the code of the function. Boxed
variables in C are declared explicitly, as pointer variables. (Examples: int
*x; float *x; int *arr[10]; snork *x for snork a struct.) All malloc’ed
Figure 8.2: Boxed vs. unboxed values. The integer value 123 is stored as an
unboxed value, while the record {x=5; y=10} is stored as a boxed value.
structures must be stored in a pointer variable because they are boxed: variables
can’t directly be heap entities. Variables are static and the heap is dynamic.
Here is an example of a simple C program and the Sun SPARC assembly
output which gives some impressionistic idea of these concepts:
int glob;
main()
{
int x;
register int reg;
int* mall;
int arr[10];
x = glob + 1;
reg = x;
mall = (int *) malloc(1);
x = *mall;
arr[2] = 4;
/* arr = arr2; --illegal: arrays are not boxed */
}
main:
sethi %hi(glob), %o1
or %o1, %lo(glob), %o0 /* load global address glob into %o0 */
ld [%o0], %o1 /* dereference */
add %o1, 1, %o0 /* increment */
st %o0, [%fp-20] /* store in [%fp-20], 20 back from fp -- x */
/* x directly contains a number,
 * not a pointer */
ld [%fp-20], %l0 /* %l0 IS reg (its in a register directly) */
mov 1, %o0
call malloc, 0 /* call malloc. resulting address to %o0 */
nop
st %o0, [%fp-24] /* put newspace location in mall [%fp-24] */
ld [%fp-24], %o0 /* load mall into %o0 */
ld [%o0], %o1 /* this is a malloced structure -- unbox. */
st %o1, [%fp-20] /* store into x */
mov 4, %o0
st %o0, [%fp-56] /* array is a sequence of memory on stack */
.LL2:
ret
restore
Memory layout for our F[SR compiler F[SR is (mostly) a Caml subset
and so its memory is also managed more implicitly than C memory. We will
use a simple, uniform scheme in our compilers which is close in spirit to Java’s:
Box Refs and records and function values, but keep boolean values and integers
unboxed. Also, as in C (and Java), all local function variables will be allocated
on the stack and accessed as offsets from the stack pointer. We will achieve the
latter by implementing F[SR local variables as C local variables, which will be
stack allocated by the C compiler.
Since a Ref is nothing but a mutable location, there may not seem to be any
reason to box it. However, if a function returns a Ref as result, and it were not
boxed, it would have been allocated on the stack and thus would be deallocated.
Here is an example that reflects this problem:
Let f = (Function x -> Ref 5) In !f( ) + 1
If Ref 5 were stored on the stack, after the return it could be wiped out.
All of the Let-defined entities in our tuples (the vi variables) can be either in
registers or on the call stack: none of those variables are directly used outside
the function due to lexical scoping, and they don’t directly contain values that
should stay alive after the function returns. For efficiency, they can all be
declared as register Word variables:
register Word v1, v2, v3, v4, ...;
One other advantage of this simple scheme is that every variable holds one word
of data, and thus we don’t need to keep track of how much data a variable is
holding. This scheme is not very efficient, and real compilers optimize signif-
icantly. One example: Refs which are known not to escape a function can be unboxed and stack-allocated.
All that remains is to come up with a scheme to compile each of the above
atomic tuples and we are done. Records are the most difficult so we will consider
them before writing out the full translation.
Compiling untyped records Recall from when we covered records that the
fields present in a record cannot be known in advance if there is no type system.
So, we won't know exactly where the field that we need is. Consider, for example,

If b Then { l = 0 } Else { x = False; l = 0 }

Field l will be in two different positions in these records, so the selection will not have a sole place where it can find the field. Thus we will need to use a hashtable for
record lookup. In a typed language such as Caml this problem is avoided: the
above code is not well-typed in Caml because the if-then can’t be typed. Note
that the problems with records are closely related to problems with objects,
since objects are simply records with Refs.
This memory layout difficulty with records illustrates an important rela-
tionship between typing and compilation. Type systems impose constraints on
program structure that can make compilers easier to implement. Additionally,
typecheckers will obviate the need to deal with certain run-time errors. Our sim-
ple F[SR compilers are going to core dump on e.g. 4 (5); in Lisp, Smalltalk,
or Scheme these errors would be caught at run-time but would slow down exe-
cution. In a typed language, the compiler would reject the program since it will
not typecheck. Thus typed languages are both faster and safer.
Our method for compilation of records proceeds as follows. We must give
records a heavy implementation, as hash tables (i.e., a set of key-value pairs,
where the keys are label names). In order to make the implementation simple,
records are boxed so they take one word of memory, as mentioned above when
we covered boxing. A record selection operation vk .l is implemented by hashing
on key l in the hash table pointed to by vk at runtime. This is more or less how
Smalltalk message sends are implemented, since records are similar to objects
(and Smalltalk is untyped).
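A sketch of this representation, written in OCaml for brevity (the emitted C would use an analogous hash library; make_record and select are illustrative names):

(* a record becomes a string-keyed hash table, built at creation time *)
let make_record (fields : (string * 'a) list) : (string, 'a) Hashtbl.t =
  let h = Hashtbl.create (List.length fields) in
  List.iter (fun (l, v) -> Hashtbl.replace h l v) fields;
  h

(* selection vk.l is a run-time hash lookup on the label *)
let select (r : (string, 'a) Hashtbl.t) (l : string) : 'a =
  try Hashtbl.find r l
  with Not_found -> failwith "run-time error: no such field"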
The above is less than optimal because space will be needed for the hashtable,
and record field accessing will be much slower than, for example, struct access
in C. Since closures are records, this will also significantly slow down function
call. A simple optimization would be to treat closure records specially since the
field positions will always be fixed, and use a struct implementation of closure
(create a different struct type for each function).
For instance, the list of hoisted functions can be translated one at a time:

toCFunctions([]) = ""
toCFunctions(Functiontuple::Functiontuples) =
  toCFunction(Functiontuple) ^ toCFunctions(Functiontuples)
One subtle issue is the Ref case: code like vi = &vj, for example, would definitely not work, since vj may go out of scope; Ref cells must instead be malloc'ed, as discussed above.
The translation sketch above leaves out many details. Here is some elabo-
ration.
Typing issues We designed our memory layout so that every entity takes up one word. So, every variable is of some type that is one word in size. Type all variables as Words, where Word is a 1-word type (defined as e.g. typedef void *Word;). Many type casts need to be inserted; we are basically turning off the type-checking of C, but there is no "switch" that can be flicked. So, for instance, vi = vj + vk will really be vi = (Word) ((int) vj + (int) vk): cast the words to ints, do the addition, and cast back to a word. To cast to a function pointer is a tongue-twister: in C you can use (*((Word (*)()) f))(arg). The simplest way to avoid confusion when actually writing a compiler is to include the following typedefs in the resulting C code:
/*
* Define the type ‘Word’ to be a generic one-word value.
*/
typedef void *Word;
/*
* Define the type ‘FPtr’ to be a pointer to a function that
* consumes Word and returns a Word. All translated
* functions should be of this form.
*/
typedef Word (*FPtr)(Word);
/*
* Here is an example of how to use these typedefs in code.
*/
Word my_function(Word w) {
return w;
}
int main(int argc, char **argv) {
Word f1 = (Word) my_function;
Word w1 = (Word) 123;
Word w2 = ( *((FPtr) f1) )(w1); /* Computes f1(123) */
printf("%d\n", (int) w2); /* output is "123\n". */
return 0;
}
Global Issues Some global issues you will need to deal with include the fol-
lowing. You will need to print out the result returned by the main function
(so, you probably want the FbSR main function to be called something like FbSRmain, and then write your own main() by hand which will call FbSRmain).
The C functions need to declare all the temporary variables they use. One
solution is to declare in the function header a C array
Word v[22];
8.5 Summary

The complete compiler is the composition of the transformation phases:

let frontend e = hoist(atrans(clconv(e)));;
let translator e = toC(frontend(e));;
8.6 Optimization
Optimization can be done at all phases of the translation process. The above
translation is simple, but inefficient. There is always a tradeoff between sim-
plicity and efficiency in compiler designs, both the efficiency of compilation
itself, and efficiency of code produced. In the phases before C code is produced,
optimizations consist of replacing chunks of the program with operationally
equivalent chunks.
Some simple optimizations include the following. Implementing the special closure records {fn = .., envt = .. } as a pointer to a C struct with fn and envt fields, instead of using the very slow hash method,1 will significantly speed up the code produced. Records which do not have field names overlapping with other records can also be implemented in this manner
(there can be two different records with the same fields, but not two different
records with some fields the same and some different). Another optimization is
to modify the A-translation to avoid making tuples for variables and constants.
Constant expressions such as 3 + 4 can be folded to 7.
More fancy optimizations require a global flow analysis be performed. Sim-
ply put, a flow analysis finds all possible uses of a particular definition, and all
possible definitions corresponding to a particular use.
A definition is a record, a Function, or a number or boolean, and a use is a
record field selection, function application, or numerical or boolean operator.
1 Although hash lookups are O(1), there is still a large amount of constant overhead per lookup.
FbDK: The F[
Development Kit
The F[ Development Kit, or FbDK, is a small set of utilities for working
with our example languages, F[ and F[SR. FbDK is useful in two ways. First,
it provides the binaries Fb and FbSR for toplevel and file-based interpretation. In
addition, all of the source code for FbDK is available, except for the interpreters.
The reader may write his own F[ or F[SR interpreter, and plug it into the
FbDK source to make a toplevel and file interpreter for testing purposes. This
appendix explains how to use the FbDK binaries, and gives an overview of the
source code.
Before we begin, let’s establish a notation for the examples in this appendix.
Shell commands and expressions within the toplevel are typeset in typewriter
font. Lines beginning with “$” are shell commands, while lines beginning with
“#” are expressions typed within the toplevel. Lastly, lines beginning with “==>”
are the resulting values computed by the toplevel.
Now there will be a subdirectory with the same name as the tar archive (without the .tar.gz extension). In this directory are two bytecode files: fb.byte and dsr.byte. Now, simply execute the shell command ocamlrun fb.byte to run an Fb top loop, and similarly for F[SR.
The FbDK source is available at
http://www.cs.jhu.edu/~scott/pl/book/dist/FbDK/downloads/source
The source requires no installation. Simply unpack the source archive in the
directory of your choice. We will discuss how to work with the source code in
Section A.3.7.
$ FbSR --help
$ FbSR
# 3 + 4;;
==> 7
# (Function x -> x + 1) 5;;
==> 6
# True Or False;;
==> True
The toplevel behaves much like the Caml toplevel, but has a few differences. (Fb and FbSR are built out of the same functor, parameterized by language.) The main difference is that Let statements must always have an explicit In. This means that F[SR programs must be input as one continuous string expression,
and evaluation does not begin until the ;; is input. For this reason, larger
programs should be written as files, and directly interpreted instead of input at
the toplevel.
The second difference is more of an advantage. Unlike the Caml toplevel, the F[SR toplevel always outputs a full representation of its result values. This is particularly useful for looking at expressions that evaluate to functions. While Caml just reports the value as <fun>, F[SR gives a full description. For example,

# (Function x -> Function y -> x + y) 2;;
==> Function y -> 2 + y

Notice how we can see the full result of the currying. This is useful for debugging
large functions.
$ FbSR myprogram.dsr
The value result of the program will be displayed on standard output. For
example, interpreting the merge sort example in Section 4.3.2 would look like
this.
$ FbSR mergesort.dsr
{l=1; r={l=2; r={l=3; r={l=4; r={l=5; r={l=6; r=
{l=7; r={l=8; r={l=9; r={l=10; r=-1}}}}}}}}}}
The module name is obtained "by taking the capitalized base name of the source file (.ml or .mli file). That is, it strips the leading directory name, if any, as well as the .ml or .mli suffix; then, it [sets] the first letter to uppercase, in order to comply with the requirement that module names must be capitalized." [16]
module type S =
sig
val main: unit -> unit
end
let toplevel_loop () =
Printf.printf "\t%s version %s\n\n"
Lang.name Version.version;
flush stdout;
while true do
Printf.printf "# ";
flush stdout;
let lexbuf = Lexing.from_channel stdin in
let ast = Lang.Parser.main Lang.Lexer.token lexbuf in
let result = Lang.Interpreter.eval ast in
Printf.printf "==> %s\n" (Lang.Pp.pp result " ");
flush stdout
done
let print_version () =
Printf.printf "%s version %s\nBuild Date: %s\n"
Lang.name Version.version Version.date
let main () =
let filename = ref "" in
let toplevel = ref true in
let version = ref false in
Arg.parse
[("--version",
Arg.Set(version),
"show version information")]
(function fname ->
filename := fname;
version := false;
toplevel := false)
("Usage: " ^
Lang.name ^
" [ options ] [ filename ]\noptions:");
if !version then
print_version ()
else if !toplevel then
toplevel_loop ()
else
run_file !filename
end
end;;
Application.main ();;
The F[ abstract syntax is the following Caml type, defined in fbast.ml:

type expr =
Var of ident | Function of ident * expr | Appl of expr * expr |
Letrec of ident * ident * expr * expr |
Plus of expr * expr | Minus of expr * expr | Equal of expr * expr |
And of expr * expr| Or of expr * expr | Not of expr |
If of expr * expr * expr | Int of int | Bool of bool
open Fbast;;
Building FbDK
The final point we need to discuss is how to actually build FbDK. Luckily, doing
so is very straightforward. Before building for the first time, you will need to
configure the build process for your platform. This is done by going to the
$FbDK_SRC directory and typing
$ ./configure
The configure script checks for the OCaml tools that are needed to build
the source, and also determines whether to use a native compiler, or whether
to compile to bytecode. Note that the directory containing ocaml, ocamlc,
ocamlyacc, etc. should be in your path before running the configure script.
Once the configure script is finished, you should see a Makefile in the
$FbDK_SRC directory. Once the Makefile is there, configure does not need to
be run again, unless you move the source to a different platform.
Now, FbDK can be built at any time by going to the $FbDK_SRC directory
and typing
$ make
make will build the binaries Fb, FbSR, and BOOL and copy them to the $FbDK_SRC directory, from where they can be run, for example, as
$ ./FbSR
If your platform does not support sh and GNU Make, you can still build FbDK. Windows users should use win32Build.bat to build FbDK; it builds all three interpreters, and you can comment out the lines of the file that you don't need.
$ ./Fb
Try entering an expression into the toplevel. The dummy interpreter simply
spits the expression back without evaluating it, so your session will look like
# 3+4;;
==> 3 + 4
Press CTRL+C to exit the toplevel. Now let’s write the match case for addition.
Open the file $FbDK_SRC/src/Fb/fbinterp.ml in your favorite text editor. The
file should initially look as follows.
open Fbast;;
(*
* Replace this with your interpreter code.
*)
let rec eval e = e
Modify it to handle integers and addition (a minimal version; the complete interpreter must of course cover every case of expr):

open Fbast;;

exception TypeMismatch;;
exception NotImplemented;;

let rec eval e =
  match e with
    Int x -> Int x
  | Plus(e1, e2) ->
      (match (eval e1, eval e2) with
         (Int x, Int y) -> Int(x + y)
       | _ -> raise TypeMismatch)
  | _ -> raise NotImplemented
Now, rebuild FbDK by typing make in the $FbDK_SRC directory again. Run
the F[ toplevel. Now, we can evaluate addition expressions:
# 3+4;;
==> 7
Proceed in this manner, adding one or two match cases, and then testing,
then adding some more match cases. This way, if an error occurs, it will be easy
to figure out which changes caused it.
Bibliography
[14] Paul Hudak, John Peterson, and Joseph Fasel. A gentle introduction to
Haskell, version 98, June 2000. http://www.haskell.org/tutorial/.
[15] Andrew D. Irvine. Russell’s paradox. Stanford Encyclopedia of Philosophy,
June 2001. http://plato.stanford.edu/entries/russell-paradox/.
[16] Xavier Leroy. The Objective Caml system release 3.11, documentation
and user’s manual, November 2008. http://caml.inria.fr/pub/docs/
manual-ocaml/index.html.
[17] Ian A. Mason, Scott F. Smith, and Carolyn L. Talcott. From operational
semantics to domain theory. Information and Computation, 128(1):26–47,
1996.
[18] Greg Morrisett, Karl Crary, Neal Glew, Dan Grossman, Richard Samuels,
Frederick Smith, David Walker, Stephanie Weirich, and Steve Zdancewic.
Talx86: A realistic typed assembly language. In 1999 ACM SIGPLAN
Workshop on Compiler Support for System Software, pages 25–35, Atlanta,
GA, USA, May 1999.
[20] Randall B. Smith and David Ungar. Self: The power of simplicity. In
Conference proceedings on Object-oriented programming systems, languages
and applications, pages 227–242. ACM Press, 1987.
[21] Randall B. Smith and David Ungar. Programming as an experience: The
inspiration for Self. Lecture Notes in Computer Science, 952:303–??, 1995.