Python Lex-Yacc: Language Tool For Python CS 550 Programming Languages

PLY (Python Lex-Yacc) is a tool that allows users to define their own programming languages by specifying token definitions using regular expressions and grammars. It generates a lexer and parser from these specifications to act as a compiler for the language. The document provides an example of defining a simple calculator language with PLY and running it on the tux server, which already has PLY installed. It describes writing token and grammar rules, generating parsing tables, and testing the language implementation.

Uploaded by

Saif Ullah

Python Lex-Yacc

Language Tool for Python

CS 550 Programming Languages

Alexander Gutierrez
May 12, 2016

Python Lex-Yacc

● Python Lex-Yacc (PLY) is a version of lex and yacc written in the Python
interpreted programming language

● Attempts to be a faithful recreation of lex and yacc

● It reads regular expressions to define tokens in order to create a lexer, as
lex does

● It reads an LALR(1) grammar and associated rule actions to create a parser

● Uses the lexer to generate tokens to feed to the parser, thereby acting as a
compiler

2
Where to use?

● Download PLY from its website:

○ http://www.dabeaz.com/ply/
● The latest version at the time of this presentation (ply-3.8) works best on
Python 2.6+ or Python 3.0+

● Since it is a tool that uses Python, you will need to install Python if you don’t
have it in your environment

● Versions of PLY at 3.0 or above (ply-3.0+) support both Python 2 and Python 3
(both are versions of the Python programming language, with some differences
between them)

● If you don’t want to bother with installing Python, tux already has it and PLY!

3
Python on tux.cs.drexel.edu

● Both Python 2.7.6 and Python 3.4.3 are available on tux

● Invoking the Python 2.7.6 interpreter:
○ Command name: python (or python2)
○ Both of these are symlinks. The interpreter lives at /usr/bin/python2.7

● Invoking the Python 3.4.3 interpreter:

○ Command name: python3
○ Also a symlink. This interpreter lives at /usr/bin/python3.4
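
To confirm which interpreter a given command actually runs, a short script can report the version and resolve the symlink. This is a minimal sketch using only the standard library; the function name describe_interpreter is made up for illustration.

```python
import os
import sys


def describe_interpreter():
    """Return the (major, minor, micro) version and the symlink-resolved interpreter path."""
    version = tuple(sys.version_info[:3])
    # realpath follows symlinks, e.g. a python3 command that points at python3.4
    real_path = os.path.realpath(sys.executable) if sys.executable else ""
    return version, real_path


if __name__ == "__main__":
    version, real_path = describe_interpreter()
    print("Python %d.%d.%d at %s" % (version + (real_path,)))
```

Running the same file once with python and once with python3 shows the two interpreters side by side.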

4
Using PLY on tux.cs.drexel.edu

● tux already has PLY configured! I will cover the setup anyway.

● Download the latest version of PLY
○ http://www.dabeaz.com/ply/
● Extract the archive and you will get a directory called ply-3.8; put this
wherever you want
● In this directory, the lex and yacc modules live at
○ ply-3.8/ply/lex.py
○ ply-3.8/ply/yacc.py
● We will be importing these as Python modules
● As for your token and grammar file(s), I suggest simply placing them in the
same directory that contains ply-3.8
● My working directory looks like this:

$ ls
calc.py ply-3.8
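
When PLY is not installed system-wide (unlike on tux), "import ply.lex" only works if the directory that contains the ply package is on the module search path. With the layout above, that directory is ply-3.8. The following is a minimal sketch under that assumption; the helper name add_ply_to_path is hypothetical and is not part of PLY.

```python
import os
import sys


def add_ply_to_path(base_dir, ply_dirname="ply-3.8"):
    """Put base_dir/ply-3.8 on sys.path so `import ply.lex` can find ply-3.8/ply/lex.py.

    Returns True if the directory exists (and is now on sys.path), else False.
    """
    ply_root = os.path.join(os.path.abspath(base_dir), ply_dirname)
    if not os.path.isdir(ply_root):
        return False
    if ply_root not in sys.path:
        sys.path.insert(0, ply_root)
    return True


if __name__ == "__main__":
    # Assumes this script sits next to the extracted ply-3.8 directory.
    here = os.path.dirname(os.path.abspath(sys.argv[0]))
    if add_ply_to_path(here):
        print("ply-3.8 added to sys.path")
    else:
        print("ply-3.8 not found; relying on an installed PLY")
```

Calling the helper at the top of calc.py, before the "import ply.lex as lex" line, would let the same file run both on tux and on a machine with only the extracted archive.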

5
The Bigger Picture

● Just like Flex/Bison, we can use PLY to (relatively) easily implement our own
programming language

● To do this, we need to write a Python file that contains the definitions PLY
reads: token rules for lex.py and grammar rules for yacc.py

● For lex.py, we need to determine what tokens our language consists of and
how each token can be described using a regular expression

● For yacc.py, we need to create an LALR(1) grammar over these tokens, with a
Python action attached to each rule that runs when the rule is reduced

● PLY will create both a lexer object and a parser object at run-time which we
can use as our compiler

6
Calculator Example

● The code for this example can be found included with PLY:
○ ply-3.8/example/calc/calc.py

● Yes, we can have both our lex and yacc definitions in the same file (though this
is not required)
● This example looks at a simple arithmetic calculator
● First, we will look at the regular expressions we give to lex.py
● Next, we will look at the grammar we give to yacc.py
● Finally, we will run the code and test on input

7
calc.py > Part 1/2 of lex definitions

tokens = (
    'NAME', 'NUMBER',
)

literals = ['=', '+', '-', '*', '/', '(', ')']

-- ALTERNATIVE -- (note: literals are checked last when matching; with this
form the grammar rules refer to PLUS, MINUS, ... instead of '+', '-', ...)

tokens = (
    'NAME', 'NUMBER',
    'PLUS', 'MINUS', 'TIMES', 'DIVIDE', 'EQUALS',
    'LPAREN', 'RPAREN',
)

# Tokens
t_PLUS = r'\+'
t_MINUS = r'-'
t_TIMES = r'\*'
t_DIVIDE = r'/'
t_EQUALS = r'='
t_LPAREN = r'\('
t_RPAREN = r'\)'
8
calc.py > Part 2/2 of lex definitions

# Tokens
t_NAME = r'[a-zA-Z_][a-zA-Z0-9_]*'

def t_NUMBER(t):
    r'\d+'
    t.value = int(t.value)
    return t

t_ignore = " \t"

def t_newline(t):
    r'\n+'
    t.lexer.lineno += t.value.count("\n")

def t_error(t):
    print("Illegal character '%s'" % t.value[0])
    t.lexer.skip(1)

# Build the lexer

import ply.lex as lex
lex.lex()
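
To see what these rules amount to, the same token definitions can be mimicked with the standard re module. This is a stand-in for illustration only and says nothing about how lex.py is implemented; the function name tokenize and the (type, value) tuple format are assumptions made for the sketch.

```python
import re

# Same patterns as t_NAME and t_NUMBER, plus the single-character literals.
TOKEN_SPEC = [
    ("NUMBER", r"\d+"),
    ("NAME", r"[a-zA-Z_][a-zA-Z0-9_]*"),
    ("LITERAL", r"[=+\-*/()]"),
    ("SKIP", r"[ \t]+"),    # plays the role of t_ignore
    ("NEWLINE", r"\n+"),    # plays the role of t_newline
    ("ERROR", r"."),        # plays the role of t_error
]
MASTER = re.compile("|".join("(?P<%s>%s)" % pair for pair in TOKEN_SPEC))


def tokenize(text):
    """Yield (type, value) pairs; a literal uses the character itself as its type."""
    for match in MASTER.finditer(text):
        kind, value = match.lastgroup, match.group()
        if kind in ("SKIP", "NEWLINE"):
            continue
        if kind == "ERROR":
            print("Illegal character '%s'" % value)
            continue
        if kind == "NUMBER":
            yield kind, int(value)  # like t_NUMBER converting t.value to int
        elif kind == "LITERAL":
            yield value, value
        else:
            yield kind, value
```

For example, list(tokenize("x = 3")) gives [('NAME', 'x'), ('=', '='), ('NUMBER', 3)], which is the kind of token stream the parser consumes.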

9
calc.py > Part 1/4 of yacc definitions
precedence = (
    ('left', '+', '-'),
    ('left', '*', '/'),
    ('right', 'UMINUS'),
)

# dictionary of names
names = { }
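
The effect of this table can be checked by hand with a small evaluator. The sketch below uses precedence climbing, which is a different technique from the LALR(1) tables PLY builds, purely to show what the three levels mean: '+' and '-' bind loosest and group to the left, '*' and '/' bind tighter, and unary minus binds tightest. UMINUS is not a token in the tokens list; it is only a name that the unary-minus rule refers to through %prec. All names in the code (evaluate, BINARY, UMINUS_LEVEL) are hypothetical, and malformed input is not handled.

```python
import re

# Level and associativity, mirroring the precedence tuple (later entries bind tighter).
BINARY = {"+": (1, "left"), "-": (1, "left"), "*": (2, "left"), "/": (2, "left")}
UMINUS_LEVEL = 3  # corresponds to ('right', 'UMINUS')


def evaluate(text):
    """Evaluate an integer arithmetic expression with the calculator's precedence."""
    tokens = re.findall(r"\d+|[+\-*/()]", text)
    pos = [0]  # mutable index shared by the nested helpers

    def peek():
        return tokens[pos[0]] if pos[0] < len(tokens) else None

    def advance():
        tok = peek()
        pos[0] += 1
        return tok

    def operand():
        tok = advance()
        if tok == "-":  # unary minus binds tighter than any binary operator
            return -expression(UMINUS_LEVEL)
        if tok == "(":
            value = expression(1)
            advance()  # consume ')'
            return value
        return int(tok)

    def expression(min_level):
        left = operand()
        while peek() in BINARY and BINARY[peek()][0] >= min_level:
            op = advance()
            level, assoc = BINARY[op]
            # Left-associative: the right operand may only contain tighter operators.
            right = expression(level + 1 if assoc == "left" else level)
            if op == "+":
                left = left + right
            elif op == "-":
                left = left - right
            elif op == "*":
                left = left * right
            else:
                left = left / right
        return left

    return expression(1)
```

With these levels, evaluate("2 - 3 - 4") groups as (2 - 3) - 4 and evaluate("-2 * 3") groups as (-2) * 3, which is the behavior the precedence tuple asks yacc.py to produce.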

10
calc.py > Part 2/4 of yacc definitions
def p_statement_assign(p):
    'statement : NAME "=" expression'
    names[p[1]] = p[3]

def p_statement_expr(p):
    'statement : expression'
    print(p[1])

def p_expression_binop(p):
    '''expression : expression '+' expression
                  | expression '-' expression
                  | expression '*' expression
                  | expression '/' expression'''
    if p[2] == '+': p[0] = p[1] + p[3]
    elif p[2] == '-': p[0] = p[1] - p[3]
    elif p[2] == '*': p[0] = p[1] * p[3]
    # '/' is integer division for ints on Python 2 and true division on Python 3
    elif p[2] == '/': p[0] = p[1] / p[3]

def p_expression_uminus(p):
    "expression : '-' expression %prec UMINUS"
    p[0] = -p[2]

11
calc.py > Part 3/4 of yacc definitions
def p_expression_group(p):
    "expression : '(' expression ')'"
    p[0] = p[2]

def p_expression_number(p):
    "expression : NUMBER"
    p[0] = p[1]

def p_expression_name(p):
    "expression : NAME"
    try:
        p[0] = names[p[1]]
    except LookupError:
        print("Undefined name '%s'" % p[1])
        p[0] = 0

12
calc.py > Part 4/4 of yacc definitions
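def p_error(p):
    if p:
        print("Syntax error at '%s'" % p.value)
    else:
        print("Syntax error at EOF")

import ply.yacc as yacc

yacc.yacc()

# raw_input exists only on Python 2; on Python 3 the equivalent is input
try:
    read_line = raw_input
except NameError:
    read_line = input

while True:
    try:
        s = read_line('calc > ')
    except EOFError:
        break
    if not s: continue
    yacc.parse(s)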

13
Multiple lexers/parsers
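# lex.lex() and yacc.yacc() return objects; keep the references when a program
# needs more than one lexer or parser
lexer = lex.lex()
parser = yacc.yacc()

# raw_input exists only on Python 2; on Python 3 the equivalent is input
try:
    read_line = raw_input
except NameError:
    read_line = input

while True:
    try:
        s = read_line('calc > ')
    except EOFError:
        break
    if not s: continue
    parser.parse(s, lexer=lexer)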

14
Running on tux

● My working directory looks like this:

$ ls
calc.py ply-3.8

● We can create and run our lexer and parser by simply invoking python on our
definitions file:

$ python calc.py
Generating LALR tables
calc >

● Since we have code that executes to take input, we are given the prompt that
we specified. Another thing to notice is that it created other files:

$ ls
calc.py parser.out parsetab.py ply-3.8

15
parser.out

● This is a helpful file we can use in debugging.

● It is generated when we create our parser, but does not contain any code

● It is simply a debug output that expresses the grammar that yacc.py
understood

● This can be useful if you have shift/reduce and reduce/reduce conflicts

● The file contains a pretty-printed grammar (your grammar, hopefully), terminals
and nonterminals, and the states that the machine enters

● Debugging these conflicts is out of the scope of this presentation, but they can
generally be resolved with the understanding of LR parsing gained in this course

16
parsetab.py

● This file contains the parsing table used by your parser

● This is also generated when we create our parser
● Do not edit this file

● Mostly useful to prevent rerunning the entire construction process each time
we want to use our new language (remember Python is interpreted, so without
this it would have to do compiler-compiling on every run)

● It stores a signature of the parsing definitions in _lr_signature so that yacc.py
can detect whether they have changed enough to warrant reconstructing the tables

● Most of the time this will just be read directly the next time you run your parser
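
The caching idea can be sketched without PLY. The code below illustrates signature-based cache invalidation in general and is not PLY's actual implementation: the function names, the JSON file format, and the choice of SHA-256 are all assumptions made for the example.

```python
import hashlib
import json
import os


def grammar_signature(rules):
    """Fingerprint a list of grammar rule strings."""
    digest = hashlib.sha256()
    for rule in rules:
        digest.update(rule.encode("utf-8"))
        digest.update(b"\0")  # separator so ["ab", "c"] differs from ["a", "bc"]
    return digest.hexdigest()


def load_or_build(rules, build, cache_path):
    """Return (table, rebuilt); reuse the cached table only if the signature matches."""
    signature = grammar_signature(rules)
    if os.path.exists(cache_path):
        with open(cache_path) as handle:
            cached = json.load(handle)
        if cached.get("signature") == signature:
            return cached["table"], False
    table = build(rules)  # the expensive step, standing in for table construction
    with open(cache_path, "w") as handle:
        json.dump({"signature": signature, "table": table}, handle)
    return table, True
```

The first call builds and stores the table, a second call with the same rules reads it back, and changing any rule changes the signature and forces a rebuild.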

17
Using Our New Language

● We can test to make sure it works by running our definitions file and giving it
input:
$ python calc.py
calc > 3 * 5
15
calc > x=2-1
calc > x
1
calc > x+9
10
calc > 3 - + 2
Syntax error at '+'
2
calc >

18
Summary

● Use PLY on tux (already installed and configured)

● Design your own language by creating tokenization instructions via regular
expressions and a grammar

● Implement the language by giving PLY these instructions to generate a lexical
analyzer and parser respectively through the use of Python

19
Reference

PLY (Python Lex-Yacc)

● http://www.dabeaz.com/ply/
