0% found this document useful (0 votes)

65 views39 pages

Typeds

The document discusses type inference in programming languages. It provides examples of type inference in ML by examining code without type information and inferring the most general types. Type inference reduces syntactic overhead and guarantees the most general type is produced. The algorithm was originally developed in the 1950s and is now used in many languages to allow static typing without requiring type annotations.

Uploaded by

Gan Tian

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

65 views39 pages

Typeds

Uploaded by

Gan Tian

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 39

cs242

Kathleen Fisher

Reading: “Concepts in Programming Languages”, Chapter 6

 We are looking for homework graders.
 If you are interested, send mail to
[email protected]
 Need to be available approximately 5-9pm on
Thursdays.

 You’ll be paid Stanford’s hourly rate.

 We’ll provide food “of your choice.”
 Previous graders have really enjoyed it.
 Great way to really learn the material.
 General discussion of types
 What is a type?
 Compile-time vs run-time checking
 Conservative program analysis
 Type inference
 Good example of static analysis algorithm
 Will study algorithm and examples
 Polymorphism
 Uniform vs non-uniform impl of polymorphism
 Polymorphism vs overloading
 Thoughts to keep in mind Architect Programmer
 What features are
convenient for programmer?
 What other features do they Programming
prevent? Language Compiler,
Tester Runtime
 What are design tradeoffs? environ-
 Easy to write but harder to Diagnostic
ment
read? Tools
 Easy to write but poorer error
messages?
 What are the implementation
costs?
A type is a collection of computable values
that share some structural property.

 Examples  Non-examples
Integer 3, True, \x->x
String Even integers
Int  Bool f:Int  Int | if x>3
(Int  Int)  Bool then f(x) > x *(x+1)

Distinction between sets that are types and sets

that are not types is language dependent.
 Program organization and documentation
 Separate types for separate concepts
 Represent concepts from problem domain
 Indicate intended use of declared identifiers
 Types can be checked, unlike program comments

 Identify and prevent errors

 Compile-time or run-time checking can prevent
meaningless computations such as 3 + true – “Bill”

 Support optimization
 Example: short integers require fewer bits
 Access record component by known offset
 JavaScript and Lisp use run-time type checking
f(x) Make sure f is a function before calling f.

 ML and Haskell use compile-time type checking

f(x) Must have f : A  B and x : A

 Basic tradeoff
 Both kinds of checking prevent type errors.
 Run-time checking slows down execution.
 Compile-time checking restricts program flexibility.
JavaScript array: elements can have different types
Haskell list: all elements must have same type
 Which gives better programmer diagnostics?
 In JavaScript, we can write a function like
function f(x) { return x < 10 ? x : x(); }
Some uses will produce type error, some will not.

 Static typing always conservative

if (big-hairy-boolean-expression)
then f(5);
else f(15);
Cannot decide at compile time if run-time error will
occur!
 Not safe: BCPL family, including C and C++
 Casts, pointer arithmetic

 Almost safe: Algol family, Pascal, Ada.

 Dangling pointers.
 Allocate a pointer p to an integer, deallocate the memory
referenced by p, then later use the value pointed to by p.
 No language with explicit deallocation of memory is fully
type-safe.

 Safe: Lisp, ML, Haskell, Smalltalk, JavaScript, Java

 Dynamically typed: Lisp, Smalltalk, JavaScript
 Statically typed: ML, Haskell, Java
 Standard type checking:
int f(int x) { return x+1; };
int g(int y) { return f(y+1)*2; };
 Examine body of each function. Use declared
types of identifiers to check agreement.

 Type inference:
int f(int x) { return x+1; };
int g(int y) { return f(y+1)*2;};
 Examine code without type information. Infer
the most general types that could have been
declared.
ML and Haskell are designed to make type inference feasible.
 Types and type checking
 Improved steadily since Algol 60
 Eliminated sources of unsoundness.
 Become substantially more expressive.
 Important for modularity, reliability and compilation

 Type inference
 Reduces syntactic overhead of expressive types
 Guaranteed to produce most general type.
 Widely regarded as important language innovation
 Illustrative example of a flow-insensitive static
analysis algorithm
 Original type inference algorithm was invented by Haskell
Curry and Robert Feys for the simply typed lambda calculus in
1958.

 In 1969, Hindley extended the algorithm and proved it always

produced the most general type.

 In 1978, Milner independently developed equivalent algorithm,

called algorithm W, during his work designing ML.

 In 1982, Damas proved the algorithm was complete.

 Already used in many languages: ML, Ada, C# 3.0, F#, Haskell,

Visual Basic .Net 9.0, and soon in: Fortress, Perl 6, C++0x

 We’ll use ML to explain the algorithm because it is the original

language to use the feature and is the simplest place to start.
 Example
- fun f(x) = 2 + x;
> val it = fn : int  int

 What is the type of f?

 + has two types: int  int  int,
real  real  real
 2 has only one type: int
 This implies + : int  int  int
 From context, we need x:int
 Therefore f(x:int) = 2+x has type int  int
 Example
-fun f(x) = 2+x; Graph for \x ->((plus 2) x)

>val it = fn:int  int tint = intint


 What is the type of f?
int (t = int)
Assign types to leaves @

Propagate to internal nodes and intint

@ x :t
generate constraints
+ 2 : int
int  int  int
real  real  real
Solve by substitution
:r (s = domain  range)
@
(domain = d)
f :s x:d (range = r)

 Apply function f to argument x: f(x)

 Because f is being applied, its type (s in figure) must be a
function type: domain  range.
 Domain of f must be type of argument x (d in figure).
 Range of f must be result type of expression (r in figure).
 Solving, we get: s = d  r.
:s (s = domain  range)

(domain = d)
x:d e :r
(range = r)

 Function expression: \x -> e

 Type of lambda abstraction (s in figure) must be a
function type: domain  range.
 Domain is type of abstracted variable x (d in figure).
 Range is type of function body e (r in figure).
 Solving, we get : s = d  r.
 Example
-fun f(g) = g(2);
>val it = fn : (int  t)  t

 What is the type of f?

Assign types to leaves Graph for \g  (g 2)

Propagate to internal nodes and  st = (intt)t

generate constraints

@ t (s = intt)

Solve by substitution g :s 2 : int

 Function
-fun f(g) = g(2);
>val it = fn:(int  t)  t

 Possible applications
-fun add(x) = 2+x; -fun isEven(x) = ...;
>val it = fn:int  int >val it = fn:int  bool
-f(add); -f(isEven);
>val it = 4 : int >val it = true : bool
 Function
-fun f(g) = g(2);
>val it = fn:(int  t)  t

 Incorrect use
-fun not(x) = if x then false else true;
>val it = fn : bool  bool
-f(not);
Error: operator and operand don't agree
operator domain: int -> 'Z
operand: bool -> bool

Type error: cannot make bool  bool = int  t

 Function Definition
-fun f(g,x) = g(g(x));
>val it = fn:(t  t)*t  t
Graph for g,x. g(g x)
 Type Inference
s*tv = (vv)*vv

Assign types to leaves
@ v (s = uv)
Propagate to internal nodes and
generate constraints g: s u (s = tu)
@

g :s x :t
Solve by substitution
 Datatype with type variable
- datatype ‘a list = nil | cons of ‘a *(‘a list)
> nil : ‘a list
> cons : ‘a *(‘a list)  ‘a list

’a is syntax for “type variable a”

 Polymorphic function
- fun length nil = 0
| length (cons(x,rest)) = 1 + length(rest)
> length : ‘a list  int

 Type inference
 Infer separate type for each clause
 Combine by making two types equal (if necessary)
 length(cons(x,rest)) = 1 + length(rest)

 : p (p = v  w, p = t)
(‘a* ‘a list  ‘a list = : w (int int = r  w)
s * u  v) @
@ :v
@ : r (t = u  r)
:s*u @ : int  int
cons
: ‘a * ‘a list rest : u
length
 ‘a list + 1 :t
: int
x:s
 length(cons(x,rest)) = 1 + length(rest)

Collected Constraints:
 :p
p=t
:w p=vw
@
:v
@
:r int int = r  w
@
:s*u @ : int  int t=ur
cons
: ‘a * ‘a list rest : u ‘a* ‘a list  ‘a list =
length
 ‘a list s*uv
+ 1 :t
: int
x:s
 length(cons(x,rest)) = 1 + length(rest)

Collected Constraints:
 :p
p=t
:w p=vw
@
:v
@
:r int int = r  w
@
:s*u @ : int  int t=ur
cons
: ‘a * ‘a list rest : u ‘a = s
length
 ‘a list + 1 :t ‘a list = u
: int
x:s ‘a list = v
 length(cons(x,rest)) = 1 + length(rest)

Collected Constraints:
 :p
p=t
:w p = ‘a list  w
@
:v
@
:r int int = r  w
@
:s*u @ : int  int t = ‘a list  r
cons
: ‘a * ‘a list rest : u
length
 ‘a list + 1 :t
: int
x:s
 length(cons(x,rest)) = 1 + length(rest)

Collected Constraints:
 :p
p=t
:w p = ‘a list  int
@
:v
@
:r t = ‘a list  int
@
:s*u @ : int  int
cons
: ‘a * ‘a list rest : u
length
 ‘a list + 1 :t
: int
x:s Result:
p = ‘a list  int
 Function with multiple clauses
- fun append(nil,l) = l
| append(x::xs,l) = x :: append(xs,l)
> append: ‘a list * ‘a list  int

 Infer type of each branch

 First branch:
append :‘a list *‘b  ‘b
 First branch:
append :‘a list *‘b  ‘a list

 Combine by equating types of two branches:

append :‘a list *‘a list  ‘a list
 Type inference is guaranteed to produce the
most general type:
- fun map(f,nil) = nil
| map(f, x::xs) = f(x) :: (map(f,xs))
> map:('a  'b) * 'a list  'b list

 Function has many other, less general types:

 map:('a  int) * 'a list  int list
 map:(bool  'b) * bool list  'b list
 map:(char  int) * char list  int list

 Less general types are all instances of most

general type, also called the principal type.
 When the Hindley/Milner type inference
algorithm was developed, its complexity was
unknown.
 In 1989, Kanellakis, Mairson, and Mitchell
proved that the problem was exponential-
time complete.
 Linear in practice though…
 Consider this function…
fun reverse (nil) = nil
| reverse (x::xs) = reverse(xs);

 … and its most general type:

reverse : ‘a list  ‘b list

 What does this type mean?

Reversing a list does not change its type, so
there must be an error in the definition of
reverse!

See Koenig paper on “Reading” page of CS242 site

 Type inference computes the types of expressions
 Does not require type declarations for variables
 Finds the most general type by solving constraints
 Leads to polymorphism

 Sometimes better error detection than type checking

 Type may indicate a programming error even if no type error.

 Some costs
 More difficult to identify program line that causes error
 ML requires different syntax for integer 3, real 3.0.
 Natural implementation requires uniform representation sizes.
 Complications regarding assignment took years to work out.

 Idea can be applied to other program properties

 Discover properties of program using same kind of analysis
 Haskell also uses Hindley Milner type inference.
 Haskell uses type classes to support user-defined
overloading, so the inference algorithm is more
complicated.
 ML restricts the language to ensure that no
annotations are required, ever.
 Haskell provides various features like polymorphic
recursion for which types cannot be inferred and so
the user must provide annotations.
 ML polymorphic function
 Declarations require no type information.
 Type inference uses type variables to type expressions.
 Type inference substitutes for variables as needed to
instantiate polymorphic code.

 C++ function template

 Programmer must declare the argument and result types of
functions.
 Programmers must use explicit type parameters to express
polymorphism.
 Function application: type checker does instantiation.

ML also has module system with explicit type parameters

 ML
- fun swap(x,y) =
let val z = !x in x := !y; y := z end;
val swap = fn : 'a ref * 'a ref -> unit

 C++
template <typename T>
void swap(T& x, T& y){
T tmp = x; x=y; y=tmp;
}
Declarations look similar, but compiled very differently
 ML
 Swap is compiled into one function
 Typechecker determines how function can be used

 C++
 Swap is compiled into linkable format
 Linker duplicates code for each type of use

 Why the difference?

 ML ref cell is passed by pointer. The local x is a pointer to
value on heap, so its size is constant.
 C++ arguments passed by reference (pointer), but local x is
on the stack, so its size depends on the type.
 C++ polymorphic sort function
template <typename T>
void sort( int count, T * A[count ] ) {
for (int i=0; i<count-1; i++)
for (int j=i+1; j<count-1; j++)
if (A[j] < A[i]) swap(A[i],A[j]);
}

 What parts of code depend on the type?

 Indexing into array
 Meaning and implementation of <
 Parametric polymorphism
 Single algorithm may be given many types
 Type variable may be replaced by any type
 if f:tt then f:intint, f:boolbool, ...

 Overloading
 A single symbol may refer to more than one
algorithm
 Each algorithm may have different type
 Choice of algorithm determined by type context
 Types of symbol may be arbitrarily different
 + has types int*intint, real*realreal, no
others
 Some predefined operators are overloaded
 User-defined functions must have unique type
- fun plus(x,y) = x+y;
This is compiled to int or real function, not both

 Why is a unique type needed?

 Need to compile code, so need to know which +
 Efficiency of type inference
 Aside: General overloading is NP-complete
Two types, true and false
Overloaded functions
and : {true*truetrue, false*truefalse, …}
 Types are important in modern languages
 Program organization and documentation
 Prevent program errors
 Provide important information to compiler
 Type inference
 Determine best type for an expression, based on
known information about symbols in the expression
 Polymorphism
 Single algorithm (function) can have many types
 Overloading
 One symbol with multiple meanings, resolved at
compile time

Tutorial
100% (1)
Tutorial
223 pages
Ackermann's Set Theory
No ratings yet
Ackermann's Set Theory
61 pages
Tandoori Compositional Typeclass
100% (1)
Tandoori Compositional Typeclass
59 pages
Bottom-Up Parsing Including LR (0), SLR
No ratings yet
Bottom-Up Parsing Including LR (0), SLR
55 pages
DAA - Unit IV - Space and Time Tradeoffs - Lecture Slides
No ratings yet
DAA - Unit IV - Space and Time Tradeoffs - Lecture Slides
41 pages
FORMAL LANGUAGES-Ktunotes - in
No ratings yet
FORMAL LANGUAGES-Ktunotes - in
12 pages
Hindley Milner - Slides
100% (1)
Hindley Milner - Slides
28 pages
(Imre Ruzsa) Introduction To Metalogic With An AP
100% (3)
(Imre Ruzsa) Introduction To Metalogic With An AP
189 pages
C Programming Language
From Everand
C Programming Language
Younish Pathan
No ratings yet
Data Types and Representation
No ratings yet
Data Types and Representation
88 pages
Unit 4
No ratings yet
Unit 4
57 pages
12 Type System
No ratings yet
12 Type System
44 pages
Lex and Yacc
No ratings yet
Lex and Yacc
70 pages
Tese Polymorphic Type Inference
No ratings yet
Tese Polymorphic Type Inference
52 pages
Theory of Computation - 1
No ratings yet
Theory of Computation - 1
46 pages
Unit 3 Compiler Design
No ratings yet
Unit 3 Compiler Design
45 pages
11 TypeInfer
No ratings yet
11 TypeInfer
46 pages
Thinking With Types Type Level Programming in Haskell Sandy Maguire
No ratings yet
Thinking With Types Type Level Programming in Haskell Sandy Maguire
59 pages
10 TypeCheck
No ratings yet
10 TypeCheck
43 pages
PLDI Week 10 More Typing
No ratings yet
PLDI Week 10 More Typing
55 pages
CS571 sp24 Lecture16
No ratings yet
CS571 sp24 Lecture16
71 pages
8-Type Checking (Updated)
No ratings yet
8-Type Checking (Updated)
35 pages
20180723220018D2749 - Comp6062-Pert18-19 - 2018
No ratings yet
20180723220018D2749 - Comp6062-Pert18-19 - 2018
28 pages
Type Checking
No ratings yet
Type Checking
24 pages
Oopsla - Introduction
No ratings yet
Oopsla - Introduction
54 pages
Context Free Grammars: Unit - Iii
No ratings yet
Context Free Grammars: Unit - Iii
17 pages
Types, Polymorphism and Overloading: Department of Informatics - University of Oslo
No ratings yet
Types, Polymorphism and Overloading: Department of Informatics - University of Oslo
48 pages
Type Checking
No ratings yet
Type Checking
21 pages
4 1 CustomType
No ratings yet
4 1 CustomType
32 pages
Christiansen - Implementing Dependent Type Checker
No ratings yet
Christiansen - Implementing Dependent Type Checker
49 pages
Lec14 Type Checking
No ratings yet
Lec14 Type Checking
39 pages
FP c03 NoAnim
No ratings yet
FP c03 NoAnim
70 pages
SML Tutorial PDF
No ratings yet
SML Tutorial PDF
34 pages
Lecture 15
No ratings yet
Lecture 15
28 pages
Programming Language
No ratings yet
Programming Language
23 pages
Chapter 4
No ratings yet
Chapter 4
43 pages
Programming Language
No ratings yet
Programming Language
20 pages
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
Lab 1
No ratings yet
Lab 1
40 pages
4 Basic Data Types
No ratings yet
4 Basic Data Types
27 pages
Automata Theory-Notes
No ratings yet
Automata Theory-Notes
92 pages
Turing
No ratings yet
Turing
70 pages
Answers: The University of Nottingham
No ratings yet
Answers: The University of Nottingham
20 pages
Brief History of "Programming"
No ratings yet
Brief History of "Programming"
50 pages
AT&CD Unit 3 - 2 Part
No ratings yet
AT&CD Unit 3 - 2 Part
8 pages
Programming Languages: Type Systems: Onur Tolga S Ehito Glu
No ratings yet
Programming Languages: Type Systems: Onur Tolga S Ehito Glu
18 pages
Project
No ratings yet
Project
12 pages
Begin at The Beginning : Exec-Time "Dynamic"
No ratings yet
Begin at The Beginning : Exec-Time "Dynamic"
60 pages
Lecture 5 - Type Systems
No ratings yet
Lecture 5 - Type Systems
13 pages
ACD Assignment-1
No ratings yet
ACD Assignment-1
5 pages
Program-21: To Implement A Stack and Manage Data
No ratings yet
Program-21: To Implement A Stack and Manage Data
16 pages
20 2 Final
No ratings yet
20 2 Final
10 pages
Chap. 6, Intermediate Code Generation: J. H. Wang Dec. 17, 2008
No ratings yet
Chap. 6, Intermediate Code Generation: J. H. Wang Dec. 17, 2008
55 pages
Compiler Lab Manual
No ratings yet
Compiler Lab Manual
82 pages
Type Checking
No ratings yet
Type Checking
28 pages
COL226 10 Conditionals and Tuples
No ratings yet
COL226 10 Conditionals and Tuples
9 pages
2005 MT Key
No ratings yet
2005 MT Key
8 pages
Slides8 Polymorphism PDF
No ratings yet
Slides8 Polymorphism PDF
31 pages
Algorithm W
No ratings yet
Algorithm W
7 pages
CS51 Midterm Review Lecture Notes: A Brief Introdcution
No ratings yet
CS51 Midterm Review Lecture Notes: A Brief Introdcution
9 pages
Compiler Design - CSE 504: Type Checking
No ratings yet
Compiler Design - CSE 504: Type Checking
28 pages
Computational Linguistics With The Zen Attitude: Department of Sanskrit Studies
No ratings yet
Computational Linguistics With The Zen Attitude: Department of Sanskrit Studies
27 pages
Prelim Lab Quiz 2 - Attempt Review
No ratings yet
Prelim Lab Quiz 2 - Attempt Review
6 pages
ML 3 - A Unification-Based Type Inferencer: CS 421 - Fall 2015
No ratings yet
ML 3 - A Unification-Based Type Inferencer: CS 421 - Fall 2015
15 pages
Tiger Language
No ratings yet
Tiger Language
52 pages
Types S
No ratings yet
Types S
27 pages
Unit Iv Syntax Directed Translation & Run Time Environment
No ratings yet
Unit Iv Syntax Directed Translation & Run Time Environment
12 pages
CSC 407 Tutorial
No ratings yet
CSC 407 Tutorial
6 pages
M6 Guide
No ratings yet
M6 Guide
10 pages
HW 2
No ratings yet
HW 2
5 pages
Lisp Interpreter in Rust
From Everand
Lisp Interpreter in Rust
Vishal Patil
1/5 (1)
Practice Problems: Topic Name: Variable Name, Operators
No ratings yet
Practice Problems: Topic Name: Variable Name, Operators
9 pages
Cbse Python Language Basics - Unlocked
No ratings yet
Cbse Python Language Basics - Unlocked
11 pages
PHP - Arrays: Numeric Array
No ratings yet
PHP - Arrays: Numeric Array
4 pages
CS3110 Final Cheat Sheet
No ratings yet
CS3110 Final Cheat Sheet
7 pages
Functional Programming
No ratings yet
Functional Programming
7 pages
Algorithm W Step by Step: Martin Grabm Uller
No ratings yet
Algorithm W Step by Step: Martin Grabm Uller
7 pages
Real World OCaml
No ratings yet
Real World OCaml
13 pages
Type Checking
No ratings yet
Type Checking
18 pages
TOC Syllabus
No ratings yet
TOC Syllabus
2 pages
Compiler Ch6 (Part 1)
No ratings yet
Compiler Ch6 (Part 1)
9 pages
Btech Cs 6 Sem Compiler Design f1031 May 2019
No ratings yet
Btech Cs 6 Sem Compiler Design f1031 May 2019
2 pages
Assignment 1
No ratings yet
Assignment 1
3 pages
HW Cis 262
No ratings yet
HW Cis 262
2 pages
A Short Course in Discrete Mathematics
From Everand
A Short Course in Discrete Mathematics
Edward A. Bender
3/5 (1)
Parsing in C - Sam King PDF
No ratings yet
Parsing in C - Sam King PDF
1 page
Mid Exam
No ratings yet
Mid Exam
2 pages
Basic Set Theory Terminology
No ratings yet
Basic Set Theory Terminology
2 pages
Computer Applications (ICSE) Sample Paper 8
77% (13)
Computer Applications (ICSE) Sample Paper 8
4 pages
Syllabus
No ratings yet
Syllabus
2 pages

Typeds

Uploaded by

Typeds

Uploaded by

cs242

Reading: “Concepts in Programming Languages”, Chapter 6

 You’ll be paid Stanford’s hourly rate.

Distinction between sets that are types and sets

 Identify and prevent errors

 ML and Haskell use compile-time type checking

 Static typing always conservative

 Almost safe: Algol family, Pascal, Ada.

 Safe: Lisp, ML, Haskell, Smalltalk, JavaScript, Java

 In 1969, Hindley extended the algorithm and proved it always

 In 1978, Milner independently developed equivalent algorithm,

 In 1982, Damas proved the algorithm was complete.

 Already used in many languages: ML, Ada, C# 3.0, F#, Haskell,

 We’ll use ML to explain the algorithm because it is the original

 What is the type of f?

>val it = fn:int  int tint = intint

Propagate to internal nodes and intint

 Apply function f to argument x: f(x)

 Function expression: \x -> e

 What is the type of f?

Propagate to internal nodes and  st = (intt)t

Solve by substitution g :s 2 : int

Type error: cannot make bool  bool = int  t

’a is syntax for “type variable a”

 Infer type of each branch

 Combine by equating types of two branches:

 Function has many other, less general types:

 Less general types are all instances of most

 … and its most general type:

 What does this type mean?

See Koenig paper on “Reading” page of CS242 site

 Sometimes better error detection than type checking

 Idea can be applied to other program properties

 C++ function template

ML also has module system with explicit type parameters

 Why the difference?

 What parts of code depend on the type?

 Why is a unique type needed?

You might also like