Lecture 5

Type checking is a major component of semantic analysis in programming languages. Different languages have different type systems - some like C have weak type systems while others like Ada have very strong type systems. Before type checking, name resolution determines the type of each identifier by adding definitions to a symbol table. This table is then referenced during semantic analysis to check types and evaluate code correctness. Semantic analysis also checks other aspects like array bounds and control flow.

Uploaded by

mohamed samy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views10 pages

Lecture 5

Uploaded by

mohamed samy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Chapter Four

Semantic Analysis

Type checking is a major component of semantic analysis.

Different programming languages have different approaches to
type checking. Some languages (like C) have a rather weak type
system, so it is possible to make serious errors if you are not
careful. Other languages (like Ada) have very strong type
systems, but this makes it more difficult to write a program that
will compile at all!
Before we can perform type checking, we must determine the
type of each identifier used in an expression.
A variable x in an expression could refer to a local variable, a
function parameter, a global variable, or something else entirely.
We solve this problem by performing name resolution, in
which each definition of a variable is entered into a symbol
table. This table is referenced throughout the semantic analysis
stage whenever we need to evaluate the correctness of some
code.
Once name resolution is completed, we have all the information
necessary to check types. In this stage, we compute the type of
complex expressions by combining the basic types of each value
according to standard conversion rules. Semantic analysis also
includes other forms of checking the correctness of a program,
such as examining the limits of arrays, avoiding bad pointer

1
traversals, and examining control flow. Depending on the design
of the language, some of these problems can be detected at
compile time, while others may need to wait until runtime.

4.1 Overview of Type Systems

Most programming languages assign to every value (whether a
literal, constant, or variable) a type, which describes the
interpretation of the data in that variable.
The type system of a language serves several purposes:
• Correctness. A compiler uses type information provided by
the programmer to raise warnings or errors if a program
attempts to do something improper.
• Performance. A compiler can use type information to find the
most efficient implementation of a piece of code.
• Expressiveness. A program can be made more compact and
expressive if the language allows the programmer to leave out
facts that can be inferred from the type system.
A programming language (and its type system) are commonly
classified on the following axes:
• safe or unsafe
• static or dynamic
• explicit or implicit
In an unsafe programming language, it is possible to write
valid programs that have wildly undefined behavior that violates
the basic structure of the program. For example, the following

2
code in C is syntactically legal and will compile, but is unsafe
because it writes data outside the bounds of the array a[].
/* This is C code */
int i;
int a[10];
for(i=0;i<100;i++) a[i] = i;
In a safe programming language, it is not possible to write a
program that violates the basic structures of the language.
A safe programming language enforces the boundaries of arrays,
the use of pointers, and the assignment of types to prevent
undefined behavior.
Most interpreted languages, like Perl, Python, and Java, are safe
languages.
For example, in C#, the boundaries of arrays are checked at
runtime, so that running off the end of an array has the
predictable effect of throwing
an IndexOutOfRangeException:
/* This is C-sharp code */
a = new int[10];
for(int i=0;i<100;i++) a[i] = i;

In a statically typed language, all type checking is performed

at compile time, long before the program runs.
Static typing is often used to distinguish between integer and
floating point operations. While operations like addition and
multiplication are usually represented by the same symbols in
3
the source language, they are implemented with fundamentally
different machine code. For example, in the C language on X86
machines, (a+b) would be translated to an ADDL instruction for
integers, but an FADD instruction for floating point values.
To know which instruction to apply, we must first determine the
type of a and b and deduce the intended meaning of +.
In a dynamically typed language, type information is available
at runtime and stored in memory a long side the data that it
describes.
In an explicitly typed language, the programmer is responsible
for indicating the types of variables and other items in the code
explicitly.
Explicit typing can also be used to prevent assignment between
variables that have the same underlying representation, but
different meanings.

In an implicitly typed language, the compiler will infer the

type of variables and expressions (to the degree possible)
without explicit input from the programmer.
4.2 Designing a Type System
To describe the type system of a language, we must explain its
atomic types, its compound types, and the rules for assigning
and converting between types.
The atomic types of a language are the simple types used to
describe individual variables: integers, floating point numbers,

4
Boolean values, and so forth. For each atomic type, it is
necessary to clearly define the range that is supported.
The compound types of a language combine together existing
types into more complex aggregations.
Suppose that an integer i is assigned to a floating point f. A
similar situation arises when an integer is passed to a function
expecting a floating point as an argument. There are several
possibilities for what a language may do in this case:
• Disallow the assignment. A very strict language (like B-
Minor) could simply emit an error and prevent the program from
compiling!
• Perform a bitwise copy. If the two variables have the same
underlying storage size, the unlike assignment could be
accomplished by just copying the bits in one variable to the
location of the other.
• Convert to an equivalent value. For certain types, the
compiler may have built-in conversions that change the value to
the desired type implicitly.
• Interpret the value in a different way. In some cases, it may
be desirable to convert the value into some other value that is
not equivalent but still useful for the programmer.
4.3 The B-Minor Type System
The B-Minor type system is safe, static, and explicit.
B-Minor has the following atomic types:
• integer - A 64 bit signed integer.

5
• boolean - Limited to symbols true or false.
• char - Limited to ASCII values.
• string - ASCII values, null terminated.
• void - Only used for a function that returns no value.
And the following compound types:
• array [size] type
• function type ( a: type, b: type, ... )
And here are the type rules that must be enforced:
• A value may only be assigned to a variable of the same type.
• A function parameter may only accept a value of the same
type.
• The type of a return statement must match the function return
type.
• All binary operators must have the same type on the left and
right hand sides.
• The equality operators != and = = may be applied to any type
except void, array, or function and always return boolean.
The comparison operators < <= >= > may only be applied to
integer values and always return boolean.
• The boolean operators ! && || may only be applied to boolean
values and always return boolean.
• The arithmetic operators + - * / % ˆ ++ -- may only be applied
to integer values and always return integer.

4.4 The Symbol Table

6
The symbol table records all of the information that we need to
know about every declared variable (and other named items, like
functions) in the program. Each entry in the table is a struct
symbol which is shown in Figure 4.1.
struct symbol {
symbol_t kind;
struct type *type;
char *name;
int which;
};

The kind field indicates whether the symbol is a local variable, a

global variable, or a function parameter. The type field points to
a type structure indicating the type of the variable. The name
field gives the name (obviously), and the which field gives the
ordinal position of local variables and parameters.
To begin semantic analysis, we must create a suitable symbol
structure for each variable declaration and enter it into the
symbol table.
Conceptually, the symbol table is just a map between the name
of each variable, and the symbol structure that describes it:

7
However, it’s not quite that simple, because most programming
languages allow the same variable name to be used multiple
times, as long as each definition is in a distinct scope. For
example, the following B-Minor program defines the symbol x
three times, each with a different type and storage class. When
run, the program should print 10 hello false.
x: integer = 10;
f: function void ( x: string ) =
{ print x, "\n";
{
x: boolean = false;
print x, "\n";}}
main: function void () =
{
print x, "\n";
f("hello"); }

To accommodate these multiple definitions, we will structure

our symbol table as a stack of hash tables. Each hash table maps
the names in a given scope to their corresponding symbols. This
allows a symbol (like x) to exist in multiple scopes without
conflict. As we proceed through the program, we will push a
new table every time a scope is entered, and pop a table every
time a scope is left.

8
Figure 4.2: A Nested Symbol Table

void scope_enter();
void scope_exit();
int scope_level();
void scope_bind( const char *name, struct symbol *sym );
struct symbol *scope_lookup( const char *name );
struct symbol *scope_lookup_current( const char *name );
Figure 4.3: Symbol Table API

To manipulate the symbol table, we define six operations given

in Figure 4.3. They have the following meanings:
• scope enter() causes a new hash table to be pushed on the top
of the stack, representing a new scope.

9
• scope exit() causes the topmost hash table to be removed.
• scope level() returns the number of hash tables in the current
stack. (This is helpful to know whether we are at the global
scope or not.)
• scope bind(name,sym) adds an entry to the topmost hash table
of the stack, mapping name to the symbol structure sym.
• scope lookup(name) searches the stack of hash tables from top
to bottom, looking for the first entry that matches name exactly.
If no match is found, it returns null.
• scope lookup current(name) works like scope lookup except
that it only searches the topmost table. This is used to determine
whether a symbol has already been defined in the current scope.

4.5 Name Resolution

With the symbol table in place, we are now ready to match each
use of a variable name to its matching definition. This process is
known as name resolution.
Wherever a variable is declared, it must be entered into the
symbol table and the symbol structure linked into the abstract
syntax tree (AST). Wherever a variable is used, it must be
looked up in the symbol table, and the symbol structure linked
into the AST. Of course, if a symbol is declared twice in the
same scope, or used without declaration, then an appropriate
error message must be emitted.

PCS 7 - Programming Instructions For Blocks
50% (4)
PCS 7 - Programming Instructions For Blocks
220 pages
Python Book
100% (3)
Python Book
445 pages
PL 04TypesAndPolymorphism
100% (1)
PL 04TypesAndPolymorphism
59 pages
Edit Package
No ratings yet
Edit Package
10 pages
M6 Main
100% (1)
M6 Main
46 pages
Of Programming Languages by Ravi Sethi
No ratings yet
Of Programming Languages by Ravi Sethi
22 pages
MIS - Project Title Proposal
100% (1)
MIS - Project Title Proposal
14 pages
Chapter 4
No ratings yet
Chapter 4
43 pages
Unit V Functional and Logic Programs: Contents:-Language Specific Compilation: Object Oriented
No ratings yet
Unit V Functional and Logic Programs: Contents:-Language Specific Compilation: Object Oriented
91 pages
CH 6
No ratings yet
CH 6
88 pages
CSC305 CHAPTER 2b
No ratings yet
CSC305 CHAPTER 2b
54 pages
Lecture 03
No ratings yet
Lecture 03
44 pages
PPL Unit-II - Datatypes Final
No ratings yet
PPL Unit-II - Datatypes Final
154 pages
Computer Science Notes Year 3
No ratings yet
Computer Science Notes Year 3
130 pages
PL 10 CH 6
No ratings yet
PL 10 CH 6
92 pages
Data Types and Representation
No ratings yet
Data Types and Representation
88 pages
Programming Language
No ratings yet
Programming Language
33 pages
Chapter 7:: Data Types: Programming Language Pragmatics
No ratings yet
Chapter 7:: Data Types: Programming Language Pragmatics
23 pages
Chap 5 Semantic Analysis and Type Cheking N07 G02
No ratings yet
Chap 5 Semantic Analysis and Type Cheking N07 G02
43 pages
PPL (Unit2 Data Types)
75% (4)
PPL (Unit2 Data Types)
43 pages
PPL-Unit 2
No ratings yet
PPL-Unit 2
45 pages
UNIT-II - Structuring The Data, Computations and Program
No ratings yet
UNIT-II - Structuring The Data, Computations and Program
105 pages
Lecture 4 - The Importance of Data Types
No ratings yet
Lecture 4 - The Importance of Data Types
13 pages
03 Types
No ratings yet
03 Types
30 pages
12 Type System
No ratings yet
12 Type System
44 pages
8 Type
No ratings yet
8 Type
81 pages
Ch07 Type Systems 4e
No ratings yet
Ch07 Type Systems 4e
23 pages
Intermediate Code Generator 1
No ratings yet
Intermediate Code Generator 1
48 pages
Model Question Papers of COPA Exam 01
No ratings yet
Model Question Papers of COPA Exam 01
7 pages
Compiler Note Book
No ratings yet
Compiler Note Book
41 pages
PL Data Types
No ratings yet
PL Data Types
41 pages
20180723220018D2749 - Comp6062-Pert18-19 - 2018
No ratings yet
20180723220018D2749 - Comp6062-Pert18-19 - 2018
28 pages
Lecture 5 - Type Systems
No ratings yet
Lecture 5 - Type Systems
13 pages
Introduction To Compilers and Language Design - Chapter7
No ratings yet
Introduction To Compilers and Language Design - Chapter7
21 pages
PPL Unit 2
No ratings yet
PPL Unit 2
18 pages
Types
No ratings yet
Types
62 pages
4 Abstract Syntax
No ratings yet
4 Abstract Syntax
17 pages
Topicwise Lecture Notes of Compiler Design (CS - 603 (C) ) As On 8.4.2024
No ratings yet
Topicwise Lecture Notes of Compiler Design (CS - 603 (C) ) As On 8.4.2024
23 pages
"Animated Rainbow": A Micro Project Report On
100% (1)
"Animated Rainbow": A Micro Project Report On
13 pages
Chapter 7:: Data Types: Programming Language Pragmatics, Fourth Edition
No ratings yet
Chapter 7:: Data Types: Programming Language Pragmatics, Fourth Edition
21 pages
Chapter 7 Symbol Tables and Error Handler
No ratings yet
Chapter 7 Symbol Tables and Error Handler
34 pages
Unit V Symantic Analysis
No ratings yet
Unit V Symantic Analysis
45 pages
CSC 204 Data Structure-1
No ratings yet
CSC 204 Data Structure-1
31 pages
Sem-2 BCA-CPP Notes
No ratings yet
Sem-2 BCA-CPP Notes
64 pages
Unit II
No ratings yet
Unit II
23 pages
Type Checking
No ratings yet
Type Checking
24 pages
Type Checking
No ratings yet
Type Checking
21 pages
04 Semantics
No ratings yet
04 Semantics
17 pages
Chapter4 Type Systems Full Guide
No ratings yet
Chapter4 Type Systems Full Guide
4 pages
Advanced Compiler Design and Implementation: Introduction To Advanced Topics
No ratings yet
Advanced Compiler Design and Implementation: Introduction To Advanced Topics
16 pages
Lec6 - SemanticAnalysis 3
No ratings yet
Lec6 - SemanticAnalysis 3
38 pages
M6 Guide
No ratings yet
M6 Guide
10 pages
Compiler Ch6 (Part 1)
No ratings yet
Compiler Ch6 (Part 1)
9 pages
PPL-Unit 2 Part 5
No ratings yet
PPL-Unit 2 Part 5
20 pages
Type Checking and Type Equality: Type Systems Are The Biggest Point of Variation Across Programming Languages. Even
No ratings yet
Type Checking and Type Equality: Type Systems Are The Biggest Point of Variation Across Programming Languages. Even
10 pages
Chapter Five: Type Checking
100% (1)
Chapter Five: Type Checking
48 pages
SE Compiler Chapter 5-Type Checking and Symbol Table
No ratings yet
SE Compiler Chapter 5-Type Checking and Symbol Table
8 pages
Type Checking
No ratings yet
Type Checking
18 pages
ISC 2024 Class 12 Computer Science Project Documentation Rules and Guidelines
0% (1)
ISC 2024 Class 12 Computer Science Project Documentation Rules and Guidelines
3 pages
UNIT-2 Short Answers
No ratings yet
UNIT-2 Short Answers
12 pages
Assignment 1 Computer Skills
No ratings yet
Assignment 1 Computer Skills
9 pages
005chapter 5 - Symbol Table and Type Checking
No ratings yet
005chapter 5 - Symbol Table and Type Checking
31 pages
AT&CD Unit 3 - 2 Part
No ratings yet
AT&CD Unit 3 - 2 Part
8 pages
Intermediate Code Generation
No ratings yet
Intermediate Code Generation
9 pages
CS-602 - PPL - Unit-2
No ratings yet
CS-602 - PPL - Unit-2
31 pages
Hospital Management Class Xii Kashis and Riya
No ratings yet
Hospital Management Class Xii Kashis and Riya
33 pages
Object Oriented Programming - 2 PDF
No ratings yet
Object Oriented Programming - 2 PDF
94 pages
NDJ OilPlam Eng Booklet 130311F
No ratings yet
NDJ OilPlam Eng Booklet 130311F
165 pages
Distributed-Memory Parallel Programming With MPI: Supervised By: Dr. Shaima Hagras
No ratings yet
Distributed-Memory Parallel Programming With MPI: Supervised By: Dr. Shaima Hagras
20 pages
Type Systems Summary
No ratings yet
Type Systems Summary
2 pages
Unit 3 - Compiler Design - WWW - Rgpvnotes.in
No ratings yet
Unit 3 - Compiler Design - WWW - Rgpvnotes.in
19 pages
Computer Programming-Unit 4
No ratings yet
Computer Programming-Unit 4
4 pages
DUMP
No ratings yet
DUMP
131 pages
Nachos Study Book
No ratings yet
Nachos Study Book
109 pages
Q. No Sub Q.No Answer: (Autonomous)
No ratings yet
Q. No Sub Q.No Answer: (Autonomous)
23 pages
ABB ICSTT-SDS-8110 - en Plantguard TMR Processor P8110
No ratings yet
ABB ICSTT-SDS-8110 - en Plantguard TMR Processor P8110
2 pages
The Hacker Test (Version 1.0)
No ratings yet
The Hacker Test (Version 1.0)
17 pages
B.tech 2 1 Computer Science Engineering R20 Course Structure Syllabi
No ratings yet
B.tech 2 1 Computer Science Engineering R20 Course Structure Syllabi
22 pages
Chapter 01 See Program Running
No ratings yet
Chapter 01 See Program Running
63 pages
Software Engineering
No ratings yet
Software Engineering
3 pages
Sambalpur University Institute of Information Technology
No ratings yet
Sambalpur University Institute of Information Technology
32 pages
JasonDsouza - 9537 - Batch A
No ratings yet
JasonDsouza - 9537 - Batch A
114 pages
Nsa Lab Full
No ratings yet
Nsa Lab Full
82 pages
CUDA Programming Within Mathematica
No ratings yet
CUDA Programming Within Mathematica
17 pages
Arbitrum
No ratings yet
Arbitrum
18 pages
What Is Prolog?
No ratings yet
What Is Prolog?
5 pages
O Level Project Sample Documentation Ashley
No ratings yet
O Level Project Sample Documentation Ashley
39 pages
Security Lec8 Slides
No ratings yet
Security Lec8 Slides
18 pages
Lect 9
No ratings yet
Lect 9
17 pages
922 Final Paper v4
No ratings yet
922 Final Paper v4
11 pages
ICDL Programmes and Resources
No ratings yet
ICDL Programmes and Resources
3 pages
Compter Studies Book 3
No ratings yet
Compter Studies Book 3
4 pages
C Programming Language
From Everand
C Programming Language
Younish Pathan
No ratings yet
Coding for beginners The basic syntax and structure of coding
From Everand
Coding for beginners The basic syntax and structure of coding
Diamond Moore
No ratings yet