0% found this document useful (0 votes)

3 views

Lex_Program

The document outlines an assignment to create a lexical analyzer for a simplified C-like programming language using Lex. It details the types of tokens the lexer should identify, including keywords, operators, identifiers, literals, comments, and punctuation marks. The provided Lex code implements the lexer functionality and includes a summary of the token counts after processing input source code.

Uploaded by

yogasimmanravisagar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views

Lex_Program

Uploaded by

yogasimmanravisagar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

COMPILER ENGINEERING

ASSIGNMENT: LEX PROGRAM

BY Yogasimman.R
IT
2022115125
Problem Statement: Lexical Analyzer for C-like Language
Objective:
Create a lexical analyzer (lexer) for a simplified C-like programming language. The lexer will read
source code and break it into its constituent tokens, classifying them based on their types. The
program should be able to identify keywords, operators, identifiers, literals (integer, float, string,
character), comments, punctuation marks, and handle invalid tokens.
Input:
The program takes as input a text file or standard input that contains source code written in a
simplified C-like language. The input can contain the following:
1. Keywords: int, float, double, char, void, if, else, for, while, return,
break, continue, struct, union, typedef, enum, switch, case, default,
const, static, extern.
2. Identifiers: Any valid variable or function names (e.g., main, sum, x, temp_var).
3. Operators: +, -, *, /, ++, --, ==, !=, <=, >=, &&, ||, &, |, ^, ~, <<, >>, =, +=, -=, *=,
/=.
4. Literals: Integer literals (e.g., 123, 0b1010, 0x1A), float literals (e.g., 3.14, 2.71e-3),
character literals (e.g., 'a'), and string literals (e.g., "hello").
5. Comments: Single-line comments starting with // and multi-line comments enclosed by
/* ... */.
6. Punctuation Marks: Parentheses (), braces {}, brackets [], semicolons ;, and commas
,.

Lex Code:
%{
#include <stdio.h>
#include <string.h>
#include <ctype.h>

int line_number = 1;
int keyword_count = 0;
int identifier_count = 0;
int operator_count = 0;
int compound_operator_count = 0;
int ternary_operator_count = 0;
int literal_count = 0;
int complex_literal_count = 0;
int comment_count = 0;
int punctuation_count = 0;
int nested_structure_count = 0;
int invalid_token_count = 0;

char *keywords[] = {
"int", "float", "double", "char", "void", "if", "else", "for", "while", "return",
"break", "continue", "struct", "union", "typedef", "enum", "switch", "case",
"default", "const", "static", "extern", NULL
};

int is_keyword(char *str) {

for (int i = 0; keywords[i] != NULL; i++) {
if (strcmp(str, keywords[i]) == 0)
return 1;
}
return 0;
}

void print_token(char type, char value) {

printf("%s (%s) at line %d\n", type, value, line_number);
}

%option noyywrap

\n { line_number++; }

[ \t\r]+ {}
"/\\*"([^*]|[*]+[^/])*"\\*/" {
print_token("Multi-line Comment", yytext);
comment_count++;
}
"//".* { print_token("Single-line Comment", yytext); comment_count++; }

[a-zA-Z_][a-zA-Z0-9_]* {
if (is_keyword(yytext)) {
print_token("Keyword", yytext);
keyword_count++;
} else {
print_token("Identifier", yytext);
identifier_count++;
}
}

"\\+\\+|--" { print_token("Increment/Decrement Operator", yytext);

compound_operator_count++; }
"==|!=|<=|>=|&&|\\|\\|" { print_token("Logical/Relational Operator", yytext); operator_count++; }
"\\+=|-=|\\*=|/=" { print_token("Compound Assignment Operator", yytext);
compound_operator_count++; }
"\\?|:" { print_token("Ternary Operator", yytext); ternary_operator_count++; }
"&|\\||\\^|~|<<|>>" { print_token("Bitwise Operator", yytext); operator_count++; }
"=" { print_token("Assignment Operator", yytext); operator_count++; }

"0b[01]+" { print_token("Binary Literal", yytext); complex_literal_count++; }

"0x[0-9a-fA-F]+" { print_token("Hexadecimal Literal", yytext); complex_literal_count++; }
"0[0-7]+" { print_token("Octal Literal", yytext); complex_literal_count++; }
"[0-9]+\\.[0-9]+([eE][-+]?[0-9]+)?" {
print_token("Float Literal", yytext);
complex_literal_count++;
}
"[0-9]+" { print_token("Integer Literal", yytext); literal_count++; }

"([^\\\"]|\\.)*" { print_token("String Literal", yytext); literal_count++; }

"'(\\\\.|[^\\\\])'" { print_token("Character Literal", yytext); literal_count++; }

"\\{|\\}" { print_token("Brace", yytext); nested_structure_count++; }

"\\(|\\)" { print_token("Parenthesis", yytext); punctuation_count++; }
"\\[|\\]" { print_token("Bracket", yytext); punctuation_count++; }
";" { print_token("Semicolon", yytext); punctuation_count++; }
"," { print_token("Comma", yytext); punctuation_count++; }

"sin|cos|log|sqrt" { print_token("Function", yytext); identifier_count++; }

int main(void) {
yylex();
printf("\n--- Summary ---\n");
printf("Keywords: %d\n", keyword_count);
printf("Identifiers: %d\n", identifier_count);
printf("Operators: %d\n", operator_count);
printf("Compound Operators: %d\n", compound_operator_count);
printf("Ternary Operators: %d\n", ternary_operator_count);
printf("Literals: %d\n", literal_count);
printf("Complex Literals: %d\n", complex_literal_count);
printf("Comments: %d\n", comment_count);
printf("Punctuation: %d\n", punctuation_count);
printf("Nested Structures: %d\n", nested_structure_count);
printf("Invalid Tokens: %d\n", invalid_token_count);
return 0;
}

Test_Input.txt:
x = sin(45) + cos(30) * y / log(2) + z++;
result = a * b + c / d - e % f;
if (sqrt(25) >= 5) { x++; }

Output:

Compiler - Design - Lab Final 2024
No ratings yet
Compiler - Design - Lab Final 2024
45 pages
Program 3
No ratings yet
Program 3
2 pages
SPCC EXP7
No ratings yet
SPCC EXP7
8 pages
CD Lab Manual File
No ratings yet
CD Lab Manual File
27 pages
CD Lab
No ratings yet
CD Lab
34 pages
CD Lab-1
No ratings yet
CD Lab-1
34 pages
CD 1
No ratings yet
CD 1
31 pages
Compiler Design Lab
No ratings yet
Compiler Design Lab
49 pages
Quantitative Aptitude
No ratings yet
Quantitative Aptitude
33 pages
Compiler Design Lab Manual
No ratings yet
Compiler Design Lab Manual
36 pages
Final Practical File 21058570040
No ratings yet
Final Practical File 21058570040
19 pages
1.write A Program To Check Whether A String Belongs To The Grammar or Not
0% (1)
1.write A Program To Check Whether A String Belongs To The Grammar or Not
18 pages
character, Word and Line Count
No ratings yet
character, Word and Line Count
2 pages
SP File
No ratings yet
SP File
38 pages
CD LexProgram
No ratings yet
CD LexProgram
11 pages
Adhiparasakthi College of Engineering: G. B. Nagar, Kalavai - 632 506, Ranipet District, Tamil Nadu
No ratings yet
Adhiparasakthi College of Engineering: G. B. Nagar, Kalavai - 632 506, Ranipet District, Tamil Nadu
38 pages
Flex Tool Presentation - DVK
No ratings yet
Flex Tool Presentation - DVK
17 pages
Lex and Yacc Programs
No ratings yet
Lex and Yacc Programs
8 pages
Cs6109 - Compiler Design: Lab Assignment
No ratings yet
Cs6109 - Compiler Design: Lab Assignment
8 pages
b Tech 1006322 CD Lab
No ratings yet
b Tech 1006322 CD Lab
35 pages
Compiler_Lab_Experiments[1]
No ratings yet
Compiler_Lab_Experiments[1]
24 pages
Compiler Design Record Old
No ratings yet
Compiler Design Record Old
43 pages
CD Lab 1
No ratings yet
CD Lab 1
23 pages
Practicalcode 08 10 Session2023 24
No ratings yet
Practicalcode 08 10 Session2023 24
8 pages
CS3501_LABMANUAL
No ratings yet
CS3501_LABMANUAL
23 pages
lab manual2021 regulation
No ratings yet
lab manual2021 regulation
28 pages
Lex Yacc Program Practice
No ratings yet
Lex Yacc Program Practice
21 pages
Led 3
No ratings yet
Led 3
7 pages
Aktu CD Lab File 2
No ratings yet
Aktu CD Lab File 2
51 pages
Compiler Project Abstract
No ratings yet
Compiler Project Abstract
12 pages
CC lab 1-2
No ratings yet
CC lab 1-2
6 pages
Lecture (Lab)
No ratings yet
Lecture (Lab)
20 pages
CD
No ratings yet
CD
18 pages
Compiler Design LAB: Submitted by
No ratings yet
Compiler Design LAB: Submitted by
12 pages
Compiler File
No ratings yet
Compiler File
47 pages
Compiler Design Lab
No ratings yet
Compiler Design Lab
11 pages
Compiler Design Practical File PDF
No ratings yet
Compiler Design Practical File PDF
33 pages
2021UCS1618 Compiler
No ratings yet
2021UCS1618 Compiler
31 pages
NEW CD ANS
No ratings yet
NEW CD ANS
11 pages
CS3501 Compiler Design Lab
No ratings yet
CS3501 Compiler Design Lab
35 pages
Write A Lex Program To Count No of Identifiers, Keywords, Digits
100% (2)
Write A Lex Program To Count No of Identifiers, Keywords, Digits
13 pages
System Software Programs
No ratings yet
System Software Programs
8 pages
Lexical Analyzer Generator Lex (Flex in Recent Implementation)
No ratings yet
Lexical Analyzer Generator Lex (Flex in Recent Implementation)
14 pages
Compiler Lab Manual
No ratings yet
Compiler Lab Manual
32 pages
Lex 1
No ratings yet
Lex 1
12 pages
Sample Lex and YAcc Programs PDF
0% (1)
Sample Lex and YAcc Programs PDF
22 pages
Sample Lex and YAcc Programs
No ratings yet
Sample Lex and YAcc Programs
22 pages
System Software Lab Manual: (Lex Programs)
No ratings yet
System Software Lab Manual: (Lex Programs)
22 pages
Sample Lex and YAcc Programs
No ratings yet
Sample Lex and YAcc Programs
22 pages
Sample Lex and YAcc Programs
No ratings yet
Sample Lex and YAcc Programs
22 pages
Sample Lex and YAcc Programs
No ratings yet
Sample Lex and YAcc Programs
22 pages
Wa0030.
No ratings yet
Wa0030.
14 pages
COMPILER DES. LAB MANUAL
No ratings yet
COMPILER DES. LAB MANUAL
7 pages
PCC lab file
No ratings yet
PCC lab file
27 pages
cd_week3
No ratings yet
cd_week3
6 pages
Lexer
No ratings yet
Lexer
6 pages
CD Lab Manual
No ratings yet
CD Lab Manual
48 pages
7) Write A Program To Design Lexical Analyzer
No ratings yet
7) Write A Program To Design Lexical Analyzer
25 pages
Computer Engineering Laboratory Solution Primer
From Everand
Computer Engineering Laboratory Solution Primer
Karan Bhandari
No ratings yet
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
Dicas de Packet Trcer
No ratings yet
Dicas de Packet Trcer
5 pages
Automated Tape Library
No ratings yet
Automated Tape Library
155 pages
Aplio 500 Platinum Ultrasound System
No ratings yet
Aplio 500 Platinum Ultrasound System
11 pages
840Dsl_TCU30_3_equip_man_0323_en-US
No ratings yet
840Dsl_TCU30_3_equip_man_0323_en-US
92 pages
Lackovic E04-050 Harmonics and Inverters
No ratings yet
Lackovic E04-050 Harmonics and Inverters
40 pages
Banking Interview Question Bank - CA Monk
100% (1)
Banking Interview Question Bank - CA Monk
33 pages
ATP - NETgear Process
No ratings yet
ATP - NETgear Process
8 pages
Input - Output Inc - 12-Volt - Batteries.and - Charge
No ratings yet
Input - Output Inc - 12-Volt - Batteries.and - Charge
2 pages
Huawei - Telecom - 6 - Months - Training - Proposal PDF
100% (2)
Huawei - Telecom - 6 - Months - Training - Proposal PDF
11 pages
10 Operating Instruction 247 PDF
No ratings yet
10 Operating Instruction 247 PDF
56 pages
Reliability Analyses of Electrical Distribution System: A Case Study
No ratings yet
Reliability Analyses of Electrical Distribution System: A Case Study
9 pages
BEE(Micro project - Half Wave Rectifier)
No ratings yet
BEE(Micro project - Half Wave Rectifier)
9 pages
Grade 11 NOtes 2
No ratings yet
Grade 11 NOtes 2
3 pages
NSR Form - Manoj M
No ratings yet
NSR Form - Manoj M
1 page
Appendix 1 Sample Questionnaire
No ratings yet
Appendix 1 Sample Questionnaire
5 pages
Washington DC Fiber Network Map
No ratings yet
Washington DC Fiber Network Map
3 pages
POWER ELECTRONICS Tutorial HH
No ratings yet
POWER ELECTRONICS Tutorial HH
6 pages
MPHW F Result
No ratings yet
MPHW F Result
34 pages
Assignment 1-Introduction of Information Technology
No ratings yet
Assignment 1-Introduction of Information Technology
6 pages
Presentation, Analysis, and Interpretation of Data
No ratings yet
Presentation, Analysis, and Interpretation of Data
10 pages
Bca 03
No ratings yet
Bca 03
2 pages
Types of Cursors
No ratings yet
Types of Cursors
1 page
Lossy and Lossless Compression Techniques
100% (1)
Lossy and Lossless Compression Techniques
18 pages
Docdownloadv2 Women in World Music PR
No ratings yet
Docdownloadv2 Women in World Music PR
9 pages
Present Perfect Passive: Exercise 1
No ratings yet
Present Perfect Passive: Exercise 1
4 pages
Cisco Unified Wireless Network Base Security Feaures
No ratings yet
Cisco Unified Wireless Network Base Security Feaures
70 pages
Affective Computing Report
No ratings yet
Affective Computing Report
19 pages
Extremewireless Wing 7532 802.11ac Access Point: Maximum Speed. Minimum Cost
No ratings yet
Extremewireless Wing 7532 802.11ac Access Point: Maximum Speed. Minimum Cost
7 pages
Question Bank For 1st UNIT - MHB4117 - BMT
No ratings yet
Question Bank For 1st UNIT - MHB4117 - BMT
9 pages
DataSheet
No ratings yet
DataSheet
3 pages

Lex_Program

Uploaded by

Lex_Program

Uploaded by

COMPILER ENGINEERING

ASSIGNMENT: LEX PROGRAM

int is_keyword(char *str) {

void print_token(char *type, char *value) {

"\\+\\+|--" { print_token("Increment/Decrement Operator", yytext);

"0b[01]+" { print_token("Binary Literal", yytext); complex_literal_count++; }

"([^\\\"]|\\.)*" { print_token("String Literal", yytext); literal_count++; }

"'(\\\\.|[^\\\\])'" { print_token("Character Literal", yytext); literal_count++; }

"\\{|\\}" { print_token("Brace", yytext); nested_structure_count++; }

"sin|cos|log|sqrt" { print_token("Function", yytext); identifier_count++; }

You might also like

void print_token(char type, char value) {