Project CC

Uploaded by

usairashahbaz152


For your project, the grammar defines the syntax of your calculator language. Below is a detailed
breakdown of the key components (keywords, operators, identifiers, numbers, strings, constants,
reserved words, etc.), including each token type and its regular expression (RE).

1. Keywords
Keywords are reserved words that have a specific meaning in the language. They cannot be used as
identifiers.
Keywords:
• func (used to define a function)
• return (used to return a value from a function)
• print (used to print a value)

Token Type: KEYWORD

Regular Expression (RE):
func|return|print
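The alternation above matches a keyword anywhere in the input, including inside a longer word such as returns. The sketch below contrasts it with a word-boundary variant; the names LOOSE, STRICT and is_keyword are illustrative assumptions, not part of the project code.

```python
import re

# Unanchored alternation, exactly as written in the section above.
LOOSE = re.compile(r'func|return|print')

# Word-boundary variant (an assumption about how a lexer would apply it).
STRICT = re.compile(r'\b(?:func|return|print)\b')


def is_keyword(word):
    """Return True only if the whole word is exactly one of the keywords."""
    return STRICT.fullmatch(word) is not None


# The loose pattern accepts the prefix "return" of "returns".
print(LOOSE.match("returns") is not None)
# The strict check rejects "returns" but accepts "return".
print(is_keyword("returns"), is_keyword("return"))
```

Keywords are case-sensitive under this pattern, so Print would be treated as an ordinary identifier.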

2. Operators
Operators are symbols used for arithmetic and assignment operations.
Operators:
• Arithmetic: +, -, *, /
• Assignment: =
• Parentheses: (, )

Token Type: OPERATOR

Regular Expressions (RE):
• Addition: \+
• Subtraction: \-
• Multiplication: \*
• Division: /
• Assignment: =
• Left Parenthesis: \(
• Right Parenthesis: \)

3. Identifiers
Identifiers are names for variables, functions, or parameters. They must start with a letter or _ and
can be followed by letters, digits, or _.

Examples:
• x, myVar, _total, addNumbers
Token Type: IDENTIFIER
Regular Expression (RE):
[a-zA-Z_][a-zA-Z0-9_]*
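As a quick check of the identifier rule, the following sketch tests a few candidate names against the pattern with re.fullmatch. The helper name is_identifier is hypothetical and only serves the illustration.

```python
import re

# Pattern copied from the section above.
IDENTIFIER_RE = re.compile(r'[a-zA-Z_][a-zA-Z0-9_]*')


def is_identifier(text):
    """Return True if the entire string is a valid identifier."""
    return IDENTIFIER_RE.fullmatch(text) is not None


# Valid names from the examples, plus two invalid ones for contrast.
for name in ["x", "myVar", "_total", "addNumbers", "3abc", "my-var"]:
    print(f"{name!r}: {is_identifier(name)}")
```

Using fullmatch rather than match matters here: match would accept the leading part of my-var and ignore the rest.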

4. Numbers
Numbers are numeric literals. They can be integers or floating-point numbers.
Examples:
• Integer: 42, 0
• Floating-point: 3.14, 0.001

Token Type: NUMBER

Regular Expression (RE):
\d+(\.\d+)? # Matches integers and floating-point numbers
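To see which literals the number pattern accepts, a minimal sketch such as the one below can help. The helper name is_number is an assumption made for this example.

```python
import re

# Pattern copied from the section above: digits, optionally followed by
# a dot and more digits.
NUMBER_RE = re.compile(r'\d+(\.\d+)?')


def is_number(text):
    """Return True if the entire string is an integer or float literal."""
    return NUMBER_RE.fullmatch(text) is not None


# Accepted: 42, 0, 3.14, 0.001. Rejected: a leading dot, a trailing dot,
# and a sign (the minus is a separate OPERATOR token in this language).
for literal in ["42", "0", "3.14", "0.001", ".5", "3.", "-1"]:
    print(f"{literal!r}: {is_number(literal)}")
```

In Python 3, \d in a str pattern also matches non-ASCII decimal digits; writing [0-9] or passing the re.ASCII flag restricts the pattern to ASCII digits.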

5. Strings
Strings are sequences of characters enclosed in quotes. In your calculator language, you might not
have strings, but they can be added for functions like print().

Examples:
• "Hello"
• 'World'
Token Type: STRING
Regular Expression (RE):
"[^"]*"|'[^']*'

6. Constants
Constants are fixed values in the language, such as PI or E. These can be treated as predefined
identifiers.
Examples:
• PI = 3.14159
• E = 2.71828
Token Type: CONSTANT
Regular Expression (RE):
[a-zA-Z_][a-zA-Z0-9_]* # Same as identifiers, with a predefined list
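Because constants share the identifier pattern, one way to tell them apart is a lookup against the predefined list after the match. The sketch below assumes a dictionary named CONSTANTS and a helper classify_name, neither of which appears in the project code; the values are taken from Python's math module rather than the rounded figures listed above.

```python
import math
import re

IDENTIFIER_RE = re.compile(r'[a-zA-Z_][a-zA-Z0-9_]*')

# Hypothetical table of predefined constants.
CONSTANTS = {"PI": math.pi, "E": math.e}


def classify_name(text):
    """Classify a name as CONSTANT or IDENTIFIER; reject anything else."""
    if IDENTIFIER_RE.fullmatch(text) is None:
        raise ValueError(f"not a valid name: {text!r}")
    return "CONSTANT" if text in CONSTANTS else "IDENTIFIER"


for name in ["PI", "E", "pi", "radius"]:
    print(name, classify_name(name))
```

The lookup is case-sensitive, so pi is an ordinary identifier while PI is a constant.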

7. Reserved Words
Reserved words are special terms in the language that have specific roles. Reserved words in your
project may include keywords or predefined constants.
Reserved Words:
• func, return, print

Token Type: RESERVED_WORD

Regular Expression (RE):
func|return|print

8. Delimiters
Delimiters are symbols used to separate components of the language.
Examples:
• , (comma to separate parameters in function definitions)
• {, } (braces to define blocks of code)

Token Type: DELIMITER

Regular Expressions (RE):
• Comma: ,
• Left Brace: {
• Right Brace: }

Summary of Tokens and Their Regular Expressions

Token Type       Description
KEYWORD          func, return, print
OPERATOR         +, -, *, /, =, (, )
IDENTIFIER       Variable/function names
NUMBER           Integer or floating-point numbers
STRING           Quoted string literals
CONSTANT         Predefined values like PI, E
RESERVED_WORD    Reserved words (same as keywords)
DELIMITER        Commas, braces

Examples of Tokens in Code

Input Code:
func add(a, b) {
result = a + b
return result
}

Tokenized Output:

Token     Type
func      KEYWORD
add       IDENTIFIER
(         OPERATOR
a         IDENTIFIER
,         DELIMITER
b         IDENTIFIER
)         OPERATOR
{         DELIMITER
result    IDENTIFIER
=         OPERATOR
a         IDENTIFIER
+         OPERATOR
b         IDENTIFIER
return    KEYWORD
result    IDENTIFIER
}         DELIMITER

Here’s the comprehensive output, BNF forms, and related documentation for your project:

BNF (Backus-Naur Form) Grammar for the Calculator Language

<program> ::= <function>+

<function> ::= "func" <identifier> "(" <parameters> ")" "{" <statements> "}"

<parameters> ::= <parameter> ("," <parameter>)*

<parameter> ::= <identifier>

<statements> ::= <statement>*

<statement> ::= <assignment> | <return_statement> | <print_statement>

<assignment> ::= <identifier> "=" <expression>

<return_statement> ::= "return" <expression>

<print_statement> ::= "print" "(" <expression> ")"

<expression> ::= <term> (("+" | "-") <term>)*

<term> ::= <factor> (("*" | "/") <factor>)*

<factor> ::= <identifier> | <number> | "(" <expression> ")"

<identifier> ::= [a-zA-Z_][a-zA-Z0-9_]*

<number> ::= [0-9]+(\.[0-9]+)?
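The expression rules map naturally onto a recursive-descent evaluator, with one method per nonterminal. The sketch below covers only the expression, term and factor rules; it is a different technique from a shift-reduce parser, and the names tokenize, ExpressionParser and evaluate are assumptions made for this example.

```python
import re

# One token: a number, a name, or a single operator/parenthesis.
TOKEN_RE = re.compile(r'\s*(\d+(?:\.\d+)?|[A-Za-z_][A-Za-z0-9_]*|[-+*/()])')


def tokenize(text):
    """Split an expression into tokens, rejecting unknown characters."""
    tokens, pos = [], 0
    text = text.rstrip()
    while pos < len(text):
        match = TOKEN_RE.match(text, pos)
        if match is None:
            raise SyntaxError(f"unexpected input at position {pos}")
        tokens.append(match.group(1))
        pos = match.end()
    return tokens


class ExpressionParser:
    def __init__(self, tokens, variables=None):
        self.tokens = tokens
        self.pos = 0
        self.variables = variables or {}

    def peek(self):
        return self.tokens[self.pos] if self.pos < len(self.tokens) else None

    def advance(self):
        token = self.peek()
        self.pos += 1
        return token

    def expression(self):
        # <expression> ::= <term> (("+" | "-") <term>)*
        value = self.term()
        while self.peek() in ("+", "-"):
            if self.advance() == "+":
                value += self.term()
            else:
                value -= self.term()
        return value

    def term(self):
        # <term> ::= <factor> (("*" | "/") <factor>)*
        value = self.factor()
        while self.peek() in ("*", "/"):
            if self.advance() == "*":
                value *= self.factor()
            else:
                value /= self.factor()
        return value

    def factor(self):
        # <factor> ::= <identifier> | <number> | "(" <expression> ")"
        token = self.advance()
        if token is None:
            raise SyntaxError("unexpected end of input")
        if token == "(":
            value = self.expression()
            if self.advance() != ")":
                raise SyntaxError("expected ')'")
            return value
        if token[0].isdigit():
            return float(token)
        if token[0].isalpha() or token[0] == "_":
            if token not in self.variables:
                raise NameError(f"undefined variable {token!r}")
            return self.variables[token]
        raise SyntaxError(f"unexpected token {token!r}")


def evaluate(text, variables=None):
    """Evaluate an arithmetic expression, requiring all input to be consumed."""
    parser = ExpressionParser(tokenize(text), variables)
    value = parser.expression()
    if parser.peek() is not None:
        raise SyntaxError(f"unexpected token {parser.peek()!r}")
    return value


print(evaluate("2 + 3 * 4"))
print(evaluate("a + b", {"a": 1, "b": 2}))
```

Because each loop folds operands from left to right, 10 - 4 - 3 is grouped as (10 - 4) - 3. A right-recursive formulation such as expression → term "-" expression would group it as 10 - (4 - 3) instead, which gives a different result.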

Tokenized Output with Error Detection

Input Code with Error:
func add(a, b) {
result = a + b
returns result
}

Output:
Token      Type          Error
func       KEYWORD
add        IDENTIFIER
(          OPERATOR
a          IDENTIFIER
,          DELIMITER
b          IDENTIFIER
)          OPERATOR
{          DELIMITER
result     IDENTIFIER
=          OPERATOR
a          IDENTIFIER
+          OPERATOR
b          IDENTIFIER
returns    IDENTIFIER    Invalid keyword detected: returns
result     IDENTIFIER
}          DELIMITER

Extracted Regular Expressions (REs)

Token Type    Regular Expression            Purpose
KEYWORD       \b(func|return|print)\b       Matches the keywords func, return, and print.
OPERATOR      [\+\-\*/=()]                  Matches arithmetic operators (+, -, *, /) and assignment (=), as well as ( and ).
IDENTIFIER    \b[a-zA-Z_][a-zA-Z0-9_]*\b    Matches variable, function, and parameter names.
NUMBER        \d+(\.\d+)?                   Matches integers and floating-point numbers.
STRING        "[^"]*"|'[^']*'               Matches double- or single-quoted string literals.
CONSTANT      \b(PI|E)\b                    Matches the predefined constants PI and E.
DELIMITER     [{},]                         Matches delimiters {, }, and ,.
SKIP          [ \t]+                        Matches and skips spaces and tabs.
NEWLINE       \n                            Matches newline characters to track line numbers.
MISMATCH      .                             Matches any single character not matching other patterns (error handling).

Error Handling Rules

1. Invalid Identifiers:
• Identifiers that contain a reserved keyword are flagged as errors.
• Example: returnValue is flagged because it contains the keyword return.
2. Unknown Tokens:
• Any character or sequence that matches no defined token type is flagged as an error.
• Example: @ is not a valid token and is flagged.
3. Unmatched Delimiters:
• Parentheses or braces without a matching counterpart are flagged.
• Example: func add(a, b { is flagged for the missing ).
Examples of Errors
Code               Error Description
returns result     returns is not a valid keyword.
func add(a, b {    Missing closing parenthesis ).
result = a ++ b    Invalid operator ++ in the current grammar.
3abc = 5           Invalid identifier: 3abc starts with a digit.


import re

# Define token types and their corresponding regular expressions.
# The alternatives are tried in order, so CONSTANT must come before
# IDENTIFIER; otherwise PI and E would always be tokenized as identifiers.
TOKEN_SPECIFICATIONS = [
    ('KEYWORD', r'\b(func|return|print)\b'),        # Keywords
    ('CONSTANT', r'\b(PI|E)\b'),                    # Constants (predefined identifiers)
    ('OPERATOR', r'[\+\-\*/=()]'),                  # Operators and parentheses
    ('IDENTIFIER', r'\b[a-zA-Z_][a-zA-Z0-9_]*\b'),  # Identifiers
    ('NUMBER', r'\d+(\.\d+)?'),                     # Numbers (integers and floats)
    ('STRING', r'"[^"]*"|\'[^\']*\''),              # Strings
    ('DELIMITER', r'[{},]'),                        # Delimiters
    ('SKIP', r'[ \t]+'),                            # Skip spaces and tabs
    ('NEWLINE', r'\n'),                             # Newlines (to track line numbers)
    ('MISMATCH', r'.'),                             # Any other character (error handling)
]

# Combine the patterns into one regular expression with a named group per token type
TOKEN_REGEX = '|'.join(f'(?P<{name}>{pattern})' for name, pattern in TOKEN_SPECIFICATIONS)

RESERVED_KEYWORDS = ('func', 'return', 'print')


# Define the lexical analyzer function with enhanced error handling
def tokenize_with_keyword_check(code):
    tokens = []
    errors = []
    line_number = 1  # Start at the first line

    for match in re.finditer(TOKEN_REGEX, code):
        token_type = match.lastgroup
        value = match.group(token_type)
        if token_type == 'NEWLINE':  # Increment line number for newlines
            line_number += 1
        elif token_type == 'SKIP':  # Ignore spaces and tabs
            continue
        elif token_type == 'MISMATCH':  # Handle invalid characters
            errors.append(f"Error: Unexpected character '{value}' at line {line_number}")
        elif token_type == 'IDENTIFIER' and any(keyword in value for keyword in RESERVED_KEYWORDS):
            # Identifiers that contain a reserved keyword are reported, not tokenized
            errors.append(
                f"Error: Invalid identifier '{value}' containing a reserved keyword "
                f"at line {line_number}"
            )
        else:
            tokens.append((token_type, value, line_number))

    return tokens, errors


# Example input code (the backslash keeps "func" on line 1)
input_code = """\
func add(a, b) {
    result = a + b
    returns result
}
"""

# Tokenize the input code
tokenized_output, error_list = tokenize_with_keyword_check(input_code)

# Print tokenized output
print("Tokenized Output:")
for token_type, value, line_number in tokenized_output:
    print(f"{value} -> {token_type} (Line {line_number})")

# Print errors if any
if error_list:
    print("\nErrors:")
    for error in error_list:
        print(error)

class ShiftReduceParser:
    def __init__(self, grammar, start_symbol="<program>"):
        self.grammar = grammar            # Grammar rules
        self.start_symbol = start_symbol  # Start symbol of the grammar
        self.stack = []                   # Stack to hold symbols
        self.input = []                   # Tokens from input code
        self.parse_tree = []              # List of the reductions performed
        self.table = []                   # Stack, Input, and Action for each step

    def parse(self, input_tokens):
        """
        Parses the input tokens using the Shift-Reduce technique and records
        each step in the table. Returns True on accept, False on error.
        """
        self.input = input_tokens + ["$"]  # Add the end of input symbol ($)
        self.stack = []                    # Reset the stack
        self.parse_tree = []               # Reset the parse tree
        self.table = []                    # Reset the table
        while self.input:
            # Try to reduce with the first rule whose RHS matches the stack top
            reduced = False
            for rule in self.grammar:
                rhs = rule[1]
                rhs_len = len(rhs)
                if len(self.stack) >= rhs_len and self.stack[-rhs_len:] == rhs:
                    self.stack = self.stack[:-rhs_len]  # Pop RHS from stack
                    self.stack.append(rule[0])          # Push LHS to stack
                    self.parse_tree.append(f"Reduced {rhs} to {rule[0]}")
                    self.table.append((list(self.stack), list(self.input), f"Reduced {rhs} to {rule[0]}"))
                    reduced = True
                    break
            # If no reduction is possible, accept or shift
            if not reduced:
                if self.input[0] == "$" and len(self.stack) == 1 and self.stack[0] == self.start_symbol:
                    self.table.append((list(self.stack), list(self.input), "Accept"))
                    print("Parsing completed successfully.")
                    return True
                else:
                    # Shift the first input symbol to the stack
                    self.stack.append(self.input.pop(0))
                    self.table.append((list(self.stack), list(self.input), f"Shifted {self.stack[-1]}"))
        print("Error: Unable to parse input.")
        return False

    def print_parsing_table(self):
        """
        Prints the parsing table.
        """
        print(f"{'Step':<5}{'Stack':<40}{'Input':<40}{'Action':<40}")
        for step, (stack, input_buffer, action) in enumerate(self.table):
            print(f"{step+1:<5}{' '.join(stack):<40}{' '.join(input_buffer):<40}{action:<40}")


# Define the CFG rules as a list of tuples (LHS, RHS)
grammar = [
    ("<program>", ["<function>"]),
    ("<function>", ["func", "<identifier>", "(", "<parameters>", ")", "{", "<statements>", "}"]),
    ("<parameters>", ["<parameter>"]),
    ("<parameters>", ["<parameter>", ",", "<parameters>"]),
    ("<parameter>", ["<identifier>"]),
    ("<statements>", ["<statement>"]),
    ("<statements>", ["<statement>", "<statements>"]),
    ("<statement>", ["<assignment>"]),
    ("<statement>", ["<return_statement>"]),
    ("<statement>", ["<print_statement>"]),
    ("<assignment>", ["<identifier>", "=", "<expression>"]),
    ("<return_statement>", ["return", "<expression>"]),
    ("<print_statement>", ["print", "(", "<expression>", ")"]),
    ("<expression>", ["<term>"]),
    ("<expression>", ["<term>", "+", "<expression>"]),
    ("<expression>", ["<term>", "-", "<expression>"]),
    ("<term>", ["<factor>"]),
    ("<term>", ["<factor>", "*", "<term>"]),
    ("<term>", ["<factor>", "/", "<term>"]),
    ("<factor>", ["<identifier>"]),
    ("<factor>", ["<number>"]),
    ("<factor>", ["(", "<expression>", ")"]),
    ("<identifier>", ["ID"]),  # Token representation of identifier
    ("<number>", ["NUM"]),     # Token representation of number
]

# Sample input for testing the parser (tokens for a function definition).
# Every name, including the function name, is represented by the ID token.
input_tokens = [
    "func", "ID", "(", "ID", ",", "ID", ")", "{", "ID", "=", "ID", "+", "ID", "return", "ID", "}"
]

# Create a ShiftReduceParser object
parser = ShiftReduceParser(grammar)

# Run the parser on the sample input.
# Note: the parser always applies the first matching rule and uses no
# lookahead. <identifier> is reduced to <parameter> as soon as it appears,
# so the function name never stays on the stack as <identifier> and this
# sample is reported as a parse error. Resolving such conflicts needs
# lookahead, for example an LR parsing table.
parser.parse(input_tokens)

# Print the parsing table
parser.print_parsing_table()
