
HW02 CSC-333 Code Report

1. Skip Initial White Spaces:


• Pseudocode:

skip any initial white space (spaces, tabs, and newlines)


• Implementation: The lexer uses a token specification for white spaces and
skips over them:

('SKIP', r'[ \t]+'),   # Skip spaces and tabs
('NEWLINE', r'\n'),    # Skip newlines

These tokens are checked using regular expressions, and no tokens are generated for
spaces, tabs, or newlines:

elif kind == 'SKIP':
    continue  # Ignore white space
elif kind == 'NEWLINE':
    continue  # Ignore newlines
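The skipping step above can be sketched as a small self-contained example. The token specification here is a hypothetical reduced one for illustration; the full report uses more patterns:

```python
import re

# Reduced token specification for illustration; the report's full spec
# has more patterns. SKIP and NEWLINE are matched but produce no tokens.
TOKEN_SPEC = [
    ('INTEGER',  r'\d+'),     # Integer numbers
    ('SKIP',     r'[ \t]+'),  # Spaces and tabs
    ('NEWLINE',  r'\n'),      # Newlines
    ('MISMATCH', r'.'),       # Any other character
]
TOKEN_RE = re.compile('|'.join(f'(?P<{name}>{pat})' for name, pat in TOKEN_SPEC))

def tokenize(text):
    tokens = []
    for m in TOKEN_RE.finditer(text):
        kind, value = m.lastgroup, m.group()
        if kind in ('SKIP', 'NEWLINE'):
            continue  # white space generates no token
        tokens.append((kind, value))
    return tokens

print(tokenize("  12\t34\n5"))
# → [('INTEGER', '12'), ('INTEGER', '34'), ('INTEGER', '5')]
```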

2. Single Character Tokens:


• Pseudocode:

if cur.char ∈ ('(', ')', '+', '-', '*')


return the corresponding single-character token
• Implementation: Single-character tokens (such as parentheses and arithmetic
operators) are matched with specific regex patterns and returned as tokens:

('OP', r'\*\*|[+\-*/%]'),  # Numeric operators; '\*\*' listed first so '**' is one token
('LPAR', r'\('),           # Left parenthesis
('RPAR', r'\)'),           # Right parenthesis
('LBRAC', r'{'),           # Left brace
('RBRAC', r'}'),           # Right brace
('SEMICOLON', r';'),       # Semicolon
('COMMA', r','),           # Comma

Example return for single characters:

elif kind == 'LPAR':
    token = f"Token: LPAR({value})"
elif kind == 'RPAR':
    token = f"Token: RPAR({value})"
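Pattern order matters for the operators: Python's re tries alternatives left to right, so the '\*\*' branch must precede the single-character class or '**' would split into two '*' tokens. A minimal sketch with a reduced, hypothetical spec:

```python
import re

# Reduced spec for illustration. The '\*\*' branch comes first: regex
# alternation tries branches left to right, so '[+\-*/%]|\*\*' would
# split '**' into two separate '*' tokens.
SPEC = [
    ('OP',      r'\*\*|[+\-*/%]'),
    ('LPAR',    r'\('),
    ('RPAR',    r'\)'),
    ('INTEGER', r'\d+'),
    ('SKIP',    r'[ \t]+'),
]
PAT = re.compile('|'.join(f'(?P<{n}>{p})' for n, p in SPEC))

def tokens(text):
    return [(m.lastgroup, m.group())
            for m in PAT.finditer(text) if m.lastgroup != 'SKIP']

print(tokens("(2 ** 3)"))
# → [('LPAR', '('), ('INTEGER', '2'), ('OP', '**'), ('INTEGER', '3'), ('RPAR', ')')]
```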

3. Handling Assignment (=):


• Pseudocode:

if cur.char = '='
read the next character
if it is '=' return the relational operator token
else return assign
• Implementation: The code uses regular expressions to detect both the
assignment operator (=) and the relational operator (==). REL_OP is listed
before ASSIGN in the specification so that '==' is matched as a single
relational operator rather than as two assignments:

('REL_OP', r'[<>]=?|!=|=='),  # Relational operators
('ASSIGN', r'='),             # Assignment operator

For =, the lexer therefore returns either a relational-operator token (==) or a
simple assignment:

elif kind == 'ASSIGN':
    token = f"Token: ASSIGN({value})"
elif kind == 'REL_OP':
    token = f"Token: REL_OP({value})"
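The pseudocode's one-character lookahead falls out of pattern ordering in the regex approach. A small sketch (reduced, hypothetical spec) showing '=' versus '==':

```python
import re

# REL_OP is listed before ASSIGN; with ASSIGN first, '==' would match
# as two separate ASSIGN tokens instead of one relational operator.
SPEC = [
    ('REL_OP',  r'[<>]=?|!=|=='),
    ('ASSIGN',  r'='),
    ('ID',      r'[A-Za-z_][A-Za-z_0-9]*'),
    ('INTEGER', r'\d+'),
    ('SKIP',    r'[ \t]+'),
]
PAT = re.compile('|'.join(f'(?P<{n}>{p})' for n, p in SPEC))

def tokens(text):
    return [(m.lastgroup, m.group())
            for m in PAT.finditer(text) if m.lastgroup != 'SKIP']

print(tokens("x = 1"))   # → [('ID', 'x'), ('ASSIGN', '='), ('INTEGER', '1')]
print(tokens("x == 1"))  # → [('ID', 'x'), ('REL_OP', '=='), ('INTEGER', '1')]
```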

4. Handling Division and Comments (/, //):


• Pseudocode:

if cur.char = '/'
peek at the next character
if it is '*' or '/'
read additional characters until "*/" or newline is seen, respectively
• Implementation: The lexer recognizes the division operator (/) as part of the
OP pattern. Unlike the pseudocode's C-style /* ... */ and // comments, the
implemented language uses comments that start with # and run to the end of the
line:

('COMMENT', r'#.*'),       # Comments starting with '#', up to end of line
('OP', r'\*\*|[+\-*/%]'),  # Numeric operators, including division '/'

Comments are ignored:

elif kind == 'COMMENT':
    continue  # Ignore comments
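Because '.' in a Python regex does not match a newline by default, r'#.*' consumes exactly the rest of the line and nothing more. A reduced sketch of the comment handling:

```python
import re

# '#.*' stops at the newline because '.' does not match '\n' by
# default, so a comment consumes only the remainder of its own line.
SPEC = [
    ('COMMENT', r'#.*'),
    ('OP',      r'\*\*|[+\-*/%]'),
    ('ID',      r'[A-Za-z_][A-Za-z_0-9]*'),
    ('SKIP',    r'[ \t]+'),
    ('NEWLINE', r'\n'),
]
PAT = re.compile('|'.join(f'(?P<{n}>{p})' for n, p in SPEC))

def tokens(text):
    return [(m.lastgroup, m.group()) for m in PAT.finditer(text)
            if m.lastgroup not in ('COMMENT', 'SKIP', 'NEWLINE')]

print(tokens("a / b  # divide\nc"))
# → [('ID', 'a'), ('OP', '/'), ('ID', 'b'), ('ID', 'c')]
```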

5. Numbers (Integer and Floating-point):


• Pseudocode:

if cur.char is a digit
read any additional digits and at most one decimal point
return number
• Implementation: The lexer checks for integers and floating-point numbers
using two separate regular expressions. FLOAT is listed before INTEGER so that
a number such as 3.14 is not split into an integer, a stray '.', and another
integer:

('FLOAT', r'\d+\.\d{1,3}'),  # Floating-point numbers
('INTEGER', r'\d+'),         # Integer numbers

The floating-point regex requires one to three digits after the decimal point.
Tokens are returned as either FLOAT or INTEGER:

elif kind == 'FLOAT':
    token = f"Token: FLOAT({value})"
elif kind == 'INTEGER':
    token = f"Token: INTEGER({value})"
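Ordering matters here too: since every FLOAT begins with digits, INTEGER would otherwise claim the integer part first. A reduced sketch:

```python
import re

# FLOAT is listed before INTEGER; with INTEGER first, '3.14' would
# tokenize as INTEGER '3', a MISMATCH '.', and INTEGER '14'.
SPEC = [
    ('FLOAT',    r'\d+\.\d{1,3}'),
    ('INTEGER',  r'\d+'),
    ('SKIP',     r'[ \t]+'),
    ('MISMATCH', r'.'),
]
PAT = re.compile('|'.join(f'(?P<{n}>{p})' for n, p in SPEC))

def tokens(text):
    return [(m.lastgroup, m.group())
            for m in PAT.finditer(text) if m.lastgroup != 'SKIP']

print(tokens("3.14 42"))
# → [('FLOAT', '3.14'), ('INTEGER', '42')]
```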
6. Identifiers and Keywords:

• Pseudocode:

if cur.char is a letter
read any additional letters and digits
check to see whether the resulting string is a keyword
if so, return the corresponding token
else return id
• Implementation: Identifiers and keywords are handled by a regular expression
that matches a letter or underscore followed by letters, digits, or
underscores. KEYWORD is listed before ID, since otherwise every keyword would
match the more general ID pattern first (the inner group is non-capturing so it
does not interfere with named-group dispatch):

('KEYWORD', r'\b(?:if|else|while|break|read|write|function|return)\b'),  # Keywords
('ID', r'[A-Za-z_][A-Za-z_0-9]*'),  # Identifiers

If the token is a keyword, a KEYWORD token is returned; otherwise an ID:

elif kind == 'ID':
    token = f"Token: ID({value})"
elif kind == 'KEYWORD':
    token = f"Token: KEYWORD({value})"
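An alternative that reads more like the pseudocode's "check whether the resulting string is a keyword" is to match every name as an ID and then reclassify with a set lookup. A hypothetical sketch of that variant:

```python
import re

# Match every name as ID first, then reclassify reserved words with a
# set lookup; this mirrors the pseudocode's post-scan keyword check.
KEYWORDS = {'if', 'else', 'while', 'break', 'read', 'write', 'function', 'return'}
SPEC = [
    ('ID',   r'[A-Za-z_][A-Za-z_0-9]*'),
    ('SKIP', r'[ \t]+'),
]
PAT = re.compile('|'.join(f'(?P<{n}>{p})' for n, p in SPEC))

def tokens(text):
    result = []
    for m in PAT.finditer(text):
        kind, value = m.lastgroup, m.group()
        if kind == 'SKIP':
            continue
        if kind == 'ID' and value in KEYWORDS:
            kind = 'KEYWORD'  # reclassify reserved words
        result.append((kind, value))
    return result

print(tokens("while done"))
# → [('KEYWORD', 'while'), ('ID', 'done')]
```

With this variant the ordering of KEYWORD and ID in the spec no longer matters, at the cost of one extra set lookup per identifier.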

7. Error Handling (Unknown Characters):


• Pseudocode:
else announce an error
• Implementation: The lexer handles any character that does not match a valid
token by returning an error message for unknown tokens. MISMATCH is listed
last in the specification, so it only matches when no other pattern does:

('MISMATCH', r'.'),  # Any other character

If an unknown character is found, the lexer generates an error token:

elif kind == 'MISMATCH':
    token = f"@ unknown({value})"
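The catch-all behaviour depends entirely on MISMATCH being the final alternative, since earlier patterns are tried first at every position. A reduced sketch of the error path:

```python
import re

# MISMATCH is the last alternative, so it only matches characters that
# no earlier pattern claimed.
SPEC = [
    ('ID',       r'[A-Za-z_][A-Za-z_0-9]*'),
    ('SKIP',     r'[ \t]+'),
    ('MISMATCH', r'.'),
]
PAT = re.compile('|'.join(f'(?P<{n}>{p})' for n, p in SPEC))

def tokens(text):
    out = []
    for m in PAT.finditer(text):
        kind, value = m.lastgroup, m.group()
        if kind == 'SKIP':
            continue
        if kind == 'MISMATCH':
            out.append(f"@ unknown({value})")  # error token, as in the report
        else:
            out.append((kind, value))
    return out

print(tokens("a $ b"))
# → [('ID', 'a'), '@ unknown($)', ('ID', 'b')]
```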

The lexical analyzer follows the logic described in the pseudocode. It scans
the input from left to right, using regular expressions to detect tokens and
handle errors. Each token is processed according to its type, and the results
are printed to the console and saved to a file.
The lexer can easily be extended with additional language features by adding
more token patterns to the specification.
