CS 3723 - Programming Language: 1. Introductory Stuff

This document provides an overview of key concepts in programming languages including:

● The differences between syntax, semantics, runtime, and compile-time.
● What constitutes a formal language and how finite state machines like DFAs and NFAs are used to recognize languages.
● Common tokens like identifiers, keywords, constants, and operators.
● The components of a context-free grammar including terminal symbols, non-terminal symbols, replacement rules, and a start symbol.
● How to derive sentences and construct parse trees from grammars to determine if a grammar is ambiguous.
● How reverse Polish notation writes expressions without parentheses by placing operators after their operands.

Uploaded by

Lara Jane Lopega

CS 3723 – Programming Language

1. Introductory Stuff.

The three contrasts:

● Syntax versus semantics.
● Run-time versus compile-time.
● Translation versus interpretation.

What a language is.

A language is a set of sentences, where each sentence is a finite string of symbols.

Question: Given a simple DFA or NFA, describe the language recognized by it.

NFA

[Figure: NFA for language L]

This automaton is called non-deterministic because there is a case with two
possible choices for the arrow to take.

DFA

[Figure: DFA for language L]

This automaton is called deterministic because in each case there is exactly one
choice for which arrow to follow. The process is uniquely determined. If you end
up in the terminal state 3, and have reached the end of the string, then you
accept the input string as belonging to L and otherwise you reject it.

2. Finite State Machines.

Question: Given a simple DFA, say how to simulate it.

Simulating a DFA: In a language with labels and gotos, this is easy.

Each state becomes a labeled location in a simulation program, and each arrow
becomes a goto between the two labeled locations that correspond to the two
states. You start at the location corresponding to the start state. You accept if you
are in an accepting state (or terminal state) at the end of the string being
processed. All this is illustrated with programs to recognize C-style
comments (see the linked page "Comments"). That page also shows the use of a
while-switch in case gotos are not available or not allowed (say, by an
instructor in a course).
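
The while-switch idea can be sketched as a loop over the input with one branch per state. The Python sketch below is an illustration only: the state numbers 0 to 5 and the function name `is_c_comment` are assumptions made here, not taken from the course's linked programs. It accepts a string that consists of exactly one C-style comment.

```python
def is_c_comment(text):
    """Simulate a DFA that accepts exactly one C-style comment: /* ... */"""
    # Hypothetical state numbering:
    # 0 = start, 1 = saw '/', 2 = inside the comment,
    # 3 = inside the comment and just saw '*', 4 = accepting, 5 = dead state
    state = 0
    for ch in text:
        if state == 0:
            state = 1 if ch == "/" else 5
        elif state == 1:
            state = 2 if ch == "*" else 5
        elif state == 2:
            state = 3 if ch == "*" else 2
        elif state == 3:
            if ch == "/":
                state = 4
            elif ch == "*":
                state = 3
            else:
                state = 2
        else:
            # Anything after the closing '/' (state 4), or in the dead state, is rejected.
            state = 5
    # Accept only if the input ends while the DFA is in the accepting state.
    return state == 4


print(is_c_comment("/* hello */"))   # True
print(is_c_comment("/* hello *"))    # False
```

Each `if`/`elif` branch plays the role of one labeled location, and each assignment to `state` plays the role of a goto along one arrow.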

Question: Given a simple NFA, say how to simulate it.

Simulating an NFA: As you process each input character, your simulating program
should keep track of the set of all possible states that you might be in. You start
out with the singleton set {start state}. At the end of the input string, if your set of
states includes a terminal state, then you accept. Otherwise you reject. This same
process is essentially the "subset algorithm" that lets one take an NFA and
construct a DFA that accepts the same language as the NFA.
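
The set-of-states idea can be sketched in a few lines of Python. Since the NFA figure is not reproduced here, the example automaton below is a hypothetical one (it accepts strings over {a, b} that end in "ab"), and the names `nfa_accepts` and `DELTA` are assumptions for illustration. The sketch does not handle empty (epsilon) moves.

```python
def nfa_accepts(delta, start, accepting, text):
    """Track the set of all states the NFA might be in after each character."""
    current = {start}                       # singleton set {start state}
    for ch in text:
        # Follow every possible arrow from every possible current state.
        current = {t for s in current for t in delta.get((s, ch), set())}
    # Accept if at least one possible state is an accepting (terminal) state.
    return bool(current & accepting)


# Hypothetical NFA: state 0 loops on a and b, and may "guess" that the
# final "ab" starts here by also moving to state 1 on a.
DELTA = {
    (0, "a"): {0, 1},
    (0, "b"): {0},
    (1, "b"): {2},
}

print(nfa_accepts(DELTA, 0, {2}, "aab"))   # True
print(nfa_accepts(DELTA, 0, {2}, "aba"))   # False
```

If each distinct value of `current` is treated as a single state of a new machine, the result is the DFA that the subset algorithm would build.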

3. Lexical Analysis.

What tokens are.

A token is a sequence of basic units (characters) that the scanner treats as a
single item.

Question: Give examples of typical tokens.

● Identifier: This is usually defined as an initial letter, followed by any
sequence of letters or digits. The formal definition often includes other
possibilities, such as an underscore character ( _ ) in the same places where
a letter can occur.
● Keyword or reserved word: Certain strings of letters would look just like an
identifier, but are actually used as a keyword in the language and cannot be
used as an identifier.
● Constant: Depending on the language, there are a number of different
types of constants: integer, floating point, boolean, string, character, and
others.
● Operator: Operators are often one or two special characters, though they
can be more complicated. They are usually infix, prefix, or postfix. The parts of
more complex operators, such as the infamous ternary operator ? : (present in
C, C++, Java, and even Ruby), are treated by the scanner as separate tokens.
● Special character: A number of special characters are used to separate
other tokens from one another, such as ( ) [ ] { } : ; ,
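
As a rough illustration of these token classes, here is a small scanner sketch in Python using the standard `re` module. The keyword set, the operator list, and the names `KEYWORDS`, `TOKEN_RE`, and `tokenize` are assumptions chosen for the example and do not describe any particular language.

```python
import re

# Hypothetical keyword set for the example.
KEYWORDS = {"if", "else", "while", "return"}

# One named group per token class; the first alternative that matches wins.
TOKEN_RE = re.compile(r"""
    (?P<constant>\d+(?:\.\d+)?)              # integer or floating point
  | (?P<identifier>[A-Za-z_][A-Za-z0-9_]*)   # letter or _, then letters/digits/_
  | (?P<operator>==|<=|>=|!=|[-+*/=<>?])     # one- or two-character operators
  | (?P<special>[()\[\]{}:;,])               # separators
  | (?P<space>\s+)                           # white space, skipped below
""", re.VERBOSE)


def tokenize(text):
    """Return a list of (kind, lexeme) pairs."""
    tokens = []
    pos = 0
    while pos < len(text):
        match = TOKEN_RE.match(text, pos)
        if match is None:
            raise ValueError(f"unexpected character {text[pos]!r} at position {pos}")
        kind, lexeme = match.lastgroup, match.group()
        # A keyword looks just like an identifier, so check after matching.
        if kind == "identifier" and lexeme in KEYWORDS:
            kind = "keyword"
        if kind != "space":
            tokens.append((kind, lexeme))
        pos = match.end()
    return tokens


for token in tokenize("if (x1 >= 10) y = x1 * 2;"):
    print(token)
```

Note that `?` and `:` come out as two separate tokens here, in line with the remark about the ternary operator.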

4. Context-free Grammars.

What a CF grammar looks like. The names of its parts.

A context-free grammar (CFG) is a set of recursive replacement rules
(or rewriting rules, or productions, or just rules) that are used to generate
patterns of strings.

More formally, a CFG consists of the following components:

● A set of terminal symbols, which are the characters of the alphabet that appear
in the strings generated by the grammar.
● A set of non-terminal symbols, which are placeholders for patterns of terminal
symbols that can be generated by the non-terminal symbols.
● A set of replacement rules, which are rules for replacing (or rewriting)
non-terminal symbols (on the left side of the production) in a string with other
non-terminal or terminal symbols (on the right side of the production).
● A start symbol, which is a special non-terminal symbol that appears in the initial
string generated by the grammar.

Question: Given a CF grammar and a sentence, produce a derivation sequence
deriving the sentence.

Grammar: Arith. Exp.

E ----> E + E
E ----> E * E
E ----> ( E )
E ----> a | b | c | ...

Sentence: (a+b)*c

Leftmost derivation sequence:

E ===> E * E ===> ( E ) * E ===> ( E + E ) * E ===> ( a + E ) * E
  ===> ( a + b ) * E ===> ( a + b ) * c

Rightmost derivation sequence:

E ===> E * E ===> E * c ===> ( E ) * c ===> ( E + E ) * c ===> ( E + b ) * c
  ===> ( a + b ) * c
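
The two derivation sequences can be replayed mechanically. The Python sketch below is only an illustration: the rule names used as dictionary keys and the function name `derive` are assumptions made here. Each step rewrites the leftmost (or rightmost) non-terminal in the current string using the named rule.

```python
# Hypothetical encoding of the Arith. Exp. grammar: E is the only
# non-terminal (and the start symbol); each rule is looked up by a short name.
NONTERMINALS = {"E"}
RULES = {
    "E+E": ["E", "+", "E"],
    "E*E": ["E", "*", "E"],
    "(E)": ["(", "E", ")"],
    "a": ["a"],
    "b": ["b"],
    "c": ["c"],
}


def derive(start, rule_names, leftmost=True):
    """Apply the named rules in order and return every intermediate string."""
    form = [start]
    steps = [" ".join(form)]
    for name in rule_names:
        positions = [i for i, sym in enumerate(form) if sym in NONTERMINALS]
        if not positions:
            raise ValueError("no non-terminal left to rewrite")
        i = positions[0] if leftmost else positions[-1]
        form[i:i + 1] = RULES[name]        # replace E by the rule's right side
        steps.append(" ".join(form))
    return steps


print(" ===> ".join(derive("E", ["E*E", "(E)", "E+E", "a", "b", "c"])))
print(" ===> ".join(derive("E", ["E*E", "c", "(E)", "E+E", "b", "a"], leftmost=False)))
```

The first call reproduces the leftmost derivation of ( a + b ) * c and the second reproduces the rightmost one.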

5. Ambiguous Grammars.

Recognize ambiguity when you see it.

Question: Given a grammar, show that it is ambiguous. (Show the two distinct
parse trees)

Ambiguity: There are other sentences derived from E above that have more than
one parse tree, and correspondingly more than one leftmost and rightmost
derivation. An example is the very simple sentence a + b * c. The table below
shows its two leftmost derivations and parse trees:

1st Leftmost Der.        2nd Leftmost Der.

E ===> E + E             E ===> E * E
  ===> a + E               ===> E + E * E
  ===> a + E * E           ===> a + E * E
  ===> a + b * E           ===> a + b * E
  ===> a + b * c           ===> a + b * c

1st Parse Tree           2nd Parse Tree

      E                        E
    / | \                    / | \
   E  +  E                  E  *  E
   |   / | \              / | \   |
   a  E  *  E            E  +  E  c
      |     |            |     |
      b     c            a     b

Grammar: Arith. Exp.

E ----> E + E
E ----> E * E
E ----> ( E )
E ----> a | b | c

Even if some sentences have a unique parse tree, the grammar is called ambiguous
if at least one sentence has more than one parse tree. In a programming language
it is not acceptable to have more than one possible reading of a construct. We
can't flip a coin to decide which parse tree to use. There are several ways around
this problem:
1. Rewrite the grammar so that it is no longer ambiguous yet still generates
exactly the same language. This is not always possible.
2. Introduce extra rules that allow the program to decide which of multiple
parse trees to use. These are called disambiguating rules. (Ah, yes,
"disambiguating", one of my favorite words.)
3. An ambiguous grammar may signal problems with language design, and the
programming language itself might be changed.
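
One way to check a particular sentence for ambiguity is to count its parse trees. The Python sketch below does this for the Arith. Exp. grammar only; the function name `count_parse_trees` and the single-character tokenization are assumptions made for the example. A count greater than 1 for any sentence shows that the grammar is ambiguous.

```python
from functools import lru_cache


def count_parse_trees(sentence):
    """Count parse trees of `sentence` under E -> E+E | E*E | (E) | a | b | c."""
    tokens = tuple(ch for ch in sentence if not ch.isspace())

    @lru_cache(maxsize=None)
    def count(i, j):
        # Number of ways E can derive tokens[i:j].
        if j - i == 1:
            return 1 if tokens[i] in "abc" else 0
        total = 0
        # Root rule E -> ( E )
        if j - i >= 3 and tokens[i] == "(" and tokens[j - 1] == ")":
            total += count(i + 1, j - 1)
        # Root rule E -> E + E or E -> E * E, with the operator at position k
        for k in range(i + 1, j - 1):
            if tokens[k] in "+*":
                total += count(i, k) * count(k + 1, j)
        return total

    return count(0, len(tokens)) if tokens else 0


print(count_parse_trees("a + b * c"))      # 2
print(count_parse_trees("( a + b ) * c"))  # 1
```

The two trees counted for a + b * c correspond to choosing + or * as the operator at the root.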
6. Unambiguous CF Grammars.

Question: Given a sentence, construct the leftmost derivation for it and the parse
tree (both unique).

Grammar: Arith. Exp.

E ----> E + E
E ----> E * E
E ----> ( E )
E ----> a | b | c | ...

Leftmost derivation sequence:

E ===> E * E ===> ( E ) * E ===> ( E + E ) * E ===> ( a + E ) * E
  ===> ( a + b ) * E ===> ( a + b ) * c

Parse Tree:

The sentence ( a + b ) * c has a unique leftmost derivation, a unique (different)
rightmost derivation and the unique parse tree shown below:

Parse Tree: ( a + b ) * c

         E
       / | \
      E  *  E
    / | \   |
   (  E  )  c
    / | \
   E  +  E
   |     |
   a     b

7. Reverse Polish Notation.

What it is.

RPN is a parenthesis-free notation for expressions, in which each operator comes
after (to the right of) its operands.

Question: Give the value of an RPN expression involving integer constants.

We might use arithmetic expressions with operators + - * / ^, with parentheses
( ), and with integers, floats (or even identifiers) as operands. Suppose one
wants to evaluate (find the value of) such an expression, say 3+4*5, or 3*4+5.
This topic is often covered in beginning programming or in data structures. The
method used is a combination of two algorithms:
1. Translate the arithmetic expression to RPN. The examples above become
3 4 5 * + and 3 4 * 5 +, respectively.
2. Evaluate the RPN. In the first example, push the first three operands on a
stack, use * on the top two operands to get 4*5=20, and push this. Then
use + on the remaining top two operands (there are only two) to
get 3+20=23, the final result. In the second example, push the
first two operands and apply * to the top two operands (there are only two),
getting 12 on the stack. Then push the remaining operand 5 on the stack,
and apply + to both stack elements to get 12+5=17.
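
Both algorithms can be sketched in Python. The notes do not say which translation method is meant, so the sketch below uses the shunting-yard method as one common choice. The names `to_rpn` and `eval_rpn`, the precedence table, and the treatment of ^ as right-associative are assumptions for the example, which handles non-negative integer constants only (no unary minus).

```python
import re

# Assumed operator precedences; ^ is treated as right-associative.
PRECEDENCE = {"+": 1, "-": 1, "*": 2, "/": 2, "^": 3}
RIGHT_ASSOC = {"^"}


def to_rpn(expression):
    """Translate an infix expression to a list of RPN tokens (shunting-yard)."""
    output, ops = [], []
    for tok in re.findall(r"\d+|[-+*/^()]", expression):
        if tok.isdigit():
            output.append(tok)
        elif tok == "(":
            ops.append(tok)
        elif tok == ")":
            while ops[-1] != "(":
                output.append(ops.pop())
            ops.pop()                        # discard the "("
        else:
            # Pop operators that must be applied before the new one.
            while ops and ops[-1] != "(" and (
                PRECEDENCE[ops[-1]] > PRECEDENCE[tok]
                or (PRECEDENCE[ops[-1]] == PRECEDENCE[tok] and tok not in RIGHT_ASSOC)
            ):
                output.append(ops.pop())
            ops.append(tok)
    while ops:
        output.append(ops.pop())
    return output


def eval_rpn(tokens):
    """Evaluate RPN tokens with a stack of operands."""
    apply = {
        "+": lambda x, y: x + y,
        "-": lambda x, y: x - y,
        "*": lambda x, y: x * y,
        "/": lambda x, y: x / y,
        "^": lambda x, y: x ** y,
    }
    stack = []
    for tok in tokens:
        if tok.isdigit():
            stack.append(int(tok))
        else:
            right = stack.pop()              # top of stack is the right operand
            left = stack.pop()
            stack.append(apply[tok](left, right))
    return stack.pop()


print(" ".join(to_rpn("3+4*5")), "=", eval_rpn(to_rpn("3+4*5")))   # 3 4 5 * + = 23
print(" ".join(to_rpn("3*4+5")), "=", eval_rpn(to_rpn("3*4+5")))   # 3 4 * 5 + = 17
```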

8. Shift-Reduce Parsers

Question: Given a grammar, an S-R table, and a sentence in the language
generated by the grammar, use the grammar and the table to construct a
rightmost derivation backwards.

Consider the following grammar:

Grammar: Arithmetic Expressions

P ---> E      (P is the start symbol)

E ---> E+T | T
T ---> T*F | F
F ---> ( E ) | id

Use the following table for this grammar:

Parser: Shift-Reduce Table

Rows give the symbol on top of the stack; columns give the current input symbol.

     | id  |  *  |  +  |  (  |  )  |  $  |
-----+-----+-----+-----+-----+-----+-----+
P    |     |     |     |     |     | acc |   (s = "shift")
E    |     |     |  s  |     |  s  |  r  |
T    |     |  s  |  r  |     |  r  |  r  |   (r = "reduce")
F    |     |  r  |  r  |     |  r  |  r  |
id   |     |  r  |  r  |     |  r  |  r  |   (acc = "accept")
*    |  s  |     |     |  s  |     |     |
+    |  s  |     |     |  s  |     |     |
(    |  s  |     |     |  s  |     |     |
)    |     |  r  |  r  |     |  r  |  r  |
$    |  s  |     |     |  s  |     |     |
-----+-----+-----+-----+-----+-----+-----+

The table below shows the shift-reduce parse of the following sentence, showing the
stack, current symbol, remaining symbols, and next action to take at each stage. (This
sentence has the extra artificial symbol $ stuck in at the beginning and the end.)
Input Sentence

$ ( id + id ) * id $

(You should initially shift the starting $.)

Shift-Reduce Actions

Stack           Curr   Rest of Input        Action
(top at right)  Sym
--------------------------------------------------------------------------
$               (      id + id ) * id $     shift
$ (             id     + id ) * id $        shift
$ ( id          +      id ) * id $          reduce: F ---> id
$ ( F           +      id ) * id $          reduce: T ---> F
$ ( T           +      id ) * id $          reduce: E ---> T
$ ( E           +      id ) * id $          shift
$ ( E +         id     ) * id $             shift
$ ( E + id      )      * id $               reduce: F ---> id
$ ( E + F       )      * id $               reduce: T ---> F
$ ( E + T       )      * id $               reduce: E ---> E + T
$ ( E           )      * id $               shift
$ ( E )         *      id $                 reduce: F ---> ( E )
$ F             *      id $                 reduce: T ---> F
$ T             *      id $                 shift
$ T *           id     $                    shift
$ T * id        $                           reduce: F ---> id
$ T * F         $                           reduce: T ---> T * F
$ T             $                           reduce: E ---> T
$ E             $                           reduce: P ---> E
$ P             $                           accept

Notice that the sequence of reductions gives the following rightmost derivation in
reverse:

Rightmost Derivation: ( id + id ) * id

P ===> E
  ===> T
  ===> T * F
  ===> T * id
  ===> F * id
  ===> ( E ) * id
  ===> ( E + T ) * id
  ===> ( E + F ) * id
  ===> ( E + id ) * id
  ===> ( T + id ) * id
  ===> ( F + id ) * id
  ===> ( id + id ) * id

9. Semantic Actions

The table below shows the shift-reduce parse of the same sentence, showing the
stack, current symbol, remaining symbols, and next action to take at each stage.
The semantic tags (shown in red in the original) appear on the line below the id
items and the stack items.

Shift-Reduce Actions (tag field on the line below the stack)

Stack           Curr   Rest of Input        Action
(top at right)  Sym
--------------------------------------------------------------------------
$               (      id + id ) * id $     shift
$ (             id     + id ) * id $        shift
$ ( id          +      id ) * id $          reduce: F ---> id
    a
$ ( F           +      id ) * id $          reduce: T ---> F
    a
$ ( T           +      id ) * id $          reduce: E ---> T
    a
$ ( E           +      id ) * id $          shift
    a
$ ( E +         id     ) * id $             shift
    a
$ ( E + id      )      * id $               reduce: F ---> id
    a   b
$ ( E + F       )      * id $               reduce: T ---> F
    a   b
$ ( E + T       )      * id $               reduce: E ---> E + T
    a   b
                                            [output("t1 = a + b;");]
$ ( E           )      * id $               shift
    t1
$ ( E )         *      id $                 reduce: F ---> ( E )
    t1
$ F             *      id $                 reduce: T ---> F
  t1
$ T             *      id $                 shift
  t1
$ T *           id     $                    shift
  t1
$ T * id        $                           reduce: F ---> id
  t1  c
$ T * F         $                           reduce: T ---> T * F
  t1  c
                                            [output("t2 = t1 * c;");]
$ T             $                           reduce: E ---> T
  t2
$ E             $                           reduce: P ---> E
  t2
$ P             $                           accept
  t2
                                            [output("print(t2);");]

Arithmetic expression: $ ( a + b ) * c $
Translation to intermediate code:
t1 = a + b;
t2 = t1 * c;
print(t2);
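
The tag handling can be sketched separately from the parser. The Python sketch below takes the sequence of reductions from the parse of ( id + id ) * id as given and keeps a stack of tags; the names `REDUCTIONS` and `translate`, and the way temporaries t1, t2, ... are numbered, are assumptions made for the example.

```python
# The reductions from the shift-reduce parse, in the order they are performed.
REDUCTIONS = [
    ("F", ["id"]), ("T", ["F"]), ("E", ["T"]),
    ("F", ["id"]), ("T", ["F"]), ("E", ["E", "+", "T"]),
    ("F", ["(", "E", ")"]), ("T", ["F"]),
    ("F", ["id"]), ("T", ["T", "*", "F"]),
    ("E", ["T"]), ("P", ["E"]),
]


def translate(reductions, id_names):
    """Run one semantic action per reduction and return the generated code lines."""
    names = iter(id_names)     # tags of the id tokens, in input order
    tags = []                  # tag stack for the tagged symbols on the parse stack
    code = []
    temp_count = 0
    for lhs, rhs in reductions:
        if rhs == ["id"]:
            # F ---> id: the new F carries the tag of the id.
            tags.append(next(names))
        elif len(rhs) == 3 and rhs[1] in ("+", "*"):
            # E ---> E + T or T ---> T * F: emit code into a fresh temporary.
            right = tags.pop()
            left = tags.pop()
            temp_count += 1
            temp = f"t{temp_count}"
            code.append(f"{temp} = {left} {rhs[1]} {right};")
            tags.append(temp)
        # Otherwise (T ---> F, E ---> T, F ---> ( E ), P ---> E) the tag is passed along.
    # On accept, print the value carried by the start symbol.
    code.append(f"print({tags.pop()});")
    return code


print("\n".join(translate(REDUCTIONS, ["a", "b", "c"])))
```

With the tags a, b, c this produces the same three lines of intermediate code as the table.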

10. Recursive-Descent Parsers

A recursive-descent parser is a top-down parser, so called because it builds a parse
tree from the top (the start symbol) down, and from left to right, using an input
sentence as a target as it is scanned from left to right. The actual tree is not
constructed but is implicit in a sequence of function calls.
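
A minimal recognizer in this style can be sketched in Python, with one function per non-terminal. A recursive-descent parser cannot use left-recursive rules such as E ---> E+T directly, so this sketch assumes the usual rewriting with repetition, E ---> T { + T } and T ---> F { * F }; that rewriting and the names `Parser` and `recognizes` are assumptions, not taken from the notes.

```python
class Parser:
    """Recognizer for: P -> E $,  E -> T { + T },  T -> F { * F },  F -> ( E ) | id"""

    def __init__(self, tokens):
        self.tokens = list(tokens) + ["$"]   # $ marks the end of the input
        self.pos = 0

    def peek(self):
        return self.tokens[self.pos]

    def expect(self, symbol):
        if self.peek() != symbol:
            raise SyntaxError(f"expected {symbol!r} but found {self.peek()!r}")
        self.pos += 1

    def parse_P(self):
        self.parse_E()
        self.expect("$")

    def parse_E(self):
        self.parse_T()
        while self.peek() == "+":
            self.expect("+")
            self.parse_T()

    def parse_T(self):
        self.parse_F()
        while self.peek() == "*":
            self.expect("*")
            self.parse_F()

    def parse_F(self):
        if self.peek() == "(":
            self.expect("(")
            self.parse_E()
            self.expect(")")
        else:
            self.expect("id")


def recognizes(tokens):
    """Return True if the token list is a sentence of the grammar."""
    try:
        Parser(tokens).parse_P()
        return True
    except SyntaxError:
        return False


print(recognizes(["(", "id", "+", "id", ")", "*", "id"]))   # True
print(recognizes(["id", "+"]))                               # False
```

No tree is built here: the nesting of the calls to `parse_E`, `parse_T`, and `parse_F` mirrors the shape of the parse tree.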
