0% found this document useful (0 votes)

27 views6 pages

C2ex Java

Uploaded by

ashrafmuzammil.26csa

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

27 views6 pages

C2ex Java

Uploaded by

ashrafmuzammil.26csa

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 6

Date: Ex – 1(b)Lexical analyser

Aim:
To develop a Lexical Analyzer that processes C code to identify and classify keywords,
identifiers, operators, punctuation, constants, and lexemes from a source file.

Algorithm:

1. Initialize Data Structures:

● Use LinkedHashSet to store identifiers, preserving insertion order.

● Define lists for keywords, operators, punctuation, constants, and lexemes.
● Initialize predefined sets of keywords, operators, and punctuation symbols.

2. Read File Line by Line:

● Open and read the file using BufferedReader.

● Process each line by splitting it into tokens based on whitespace and non-word characters.

3. Handle Special Tokens:

● Skip preprocessor directives and headers (e.g., #include <stdio.h>).

● Process string literals ("..."), character literals ('A'), and function calls (func()).

4. Classify Tokens:

● Add tokens to the respective lists (keywords, operators, punctuation, constants) based on
their type.
● Add single alphabetical characters as identifiers.

5. Store Lexemes:

● Store any token that doesn't fit into keywords, operators, punctuation, constants, or
identifiers into the lexemes list.

6. Display Symbol Table:

● After processing all lines, print the contents of the symbol table including keywords,
identifiers, operators, punctuation, constants, and lexemes.

Code:

import java.io.BufferedReader;
import java.io.FileReader;
import java.io.IOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashSet;
import java.util.Set;

public class LexicalAnalyzer2 {

static Set<String> identifiers = new LinkedHashSet<>(); // Use LinkedHashSet to maintain
insertion order
static ArrayList<String> keywordsList = new ArrayList<>();
static ArrayList<String> operatorsList = new ArrayList<>();
static ArrayList<Character> punctuationList = new ArrayList<>();
static ArrayList<String> constantsList = new ArrayList<>();
static ArrayList<String> lexemes = new ArrayList<>(); // New array for function names and
others

// Define initial keywords and operators

static Set<String> keywords = new HashSet<>(Arrays.asList(
"int", "float", "char", "void", "if", "else", "while", "return",
"for", "do", "switch", "case", "include", "stdio", "main"
));
static Set<String> operators = new HashSet<>(Arrays.asList(
"+", "-", "*", "/", "=", "++", "--", "==", "!=", ">", "<", ">=", "<=", "&&", "||"
));
static Set<Character> punctuations = new HashSet<>(Arrays.asList(
';', ',', '(', ')', '{', '}', '[', ']'
));

static void processLine(String line) {

// Handle multi-character tokens like strings and function calls
String[] tokens = line.split("(?=\\W)|(?<=\\W)");

for (String token : tokens) {

token = token.trim();

if (token.isEmpty()) {
continue; // Skip empty tokens
}

// Skip preprocessor directives

if (token.startsWith("#")) {
continue;
}

// Skip header files or anything in angle brackets (e.g., <stdio.h>)

if (token.startsWith("<") && token.endsWith(">")) {
continue;
}

// Handle string literals (e.g., "Hello, World!\n")

if (token.startsWith("\"") && token.endsWith("\"")) {
lexemes.add(token);
continue;
}

// Handle character literals (e.g., 'A')

if (token.startsWith("'") && token.endsWith("'") && token.length() == 3) {
lexemes.add(token);
continue;
}

// Handle function calls

if (token.contains("(") && token.contains(")")) {
lexemes.add(token);
continue;
}

// Process other tokens

if (keywords.contains(token)) {
if (!keywordsList.contains(token)) {
keywordsList.add(token);
}
} else if (operators.contains(token)) {
operatorsList.add(token);
} else if (punctuations.contains(token.charAt(0))) {
punctuationList.add(token.charAt(0));
} else if (Character.isDigit(token.charAt(0))) {
constantsList.add(token);
} else if (isSingleAlphabetic(token)) {
// Ensure only single alphabetical tokens are added as identifiers
identifiers.add(token);
} else {
// Tokens that are not identifiers or constants might be part of lexemes
lexemes.add(token);
}
}
}

// Helper method to check if a token is a single alphabetic character

private static boolean isSingleAlphabetic(String token) {
return token.length() == 1 && Character.isLetter(token.charAt(0));
}

public static void main(String[] args) {

// Hardcoded file path
String filePath = "C:\\4025 CSA\\dio2.c";

try (BufferedReader br = new BufferedReader(new FileReader(filePath))) {

String line;
while ((line = br.readLine()) != null) {
processLine(line);
}
} catch (IOException e) {
System.out.println("An error occurred while reading the file.");
e.printStackTrace();
}

// Display the symbol table after processing the entire file

System.out.println("Symbol Table:");
System.out.println("Keywords: " + String.join(", ", keywordsList));
System.out.println("Identifiers: " + String.join(", ", identifiers));
System.out.println("Operators: " + String.join(", ", operatorsList));
System.out.println("Punctuations: " + punctuationList.toString());
System.out.println("Constants: " + String.join(", ", constantsList));
System.out.println("Lexemes: " + String.join(", ", lexemes)); // New output for lexemes
}
}

Dio2.c
#include <stdio.h>
int main() {
int a = 10;
float b = 20.5;
char c = 'A';

a = a + 1;
b = b * 2;
printf("Hello, World!\n");

return 0;
}

Output

Symbol Table:
Keywords: include, stdio, int, main, float, char, return
Identifiers: a, b, c
Operators: <, >, =, =, =, =, +, =, *
Punctuations: [(, ), {, ;, ;, ;, ;, ;, (, ,, ), ;, ;, }]
Constants: 10, 20, 5, 1, 2, 0
Lexemes: ., ., ', ', printf, ", Hello, World, !, \, "

Result:
Hence a Lexical Analyzer that processes C code to identify and classify keywords has been
successfully written, executed and its output verified successfully.

Nexus Book PDF
100% (2)
Nexus Book PDF
446 pages
Oracle CRM Service Contracts Queries
No ratings yet
Oracle CRM Service Contracts Queries
55 pages
Dacs-Wn Series PDF
No ratings yet
Dacs-Wn Series PDF
6 pages
ADM960 Flashcards
100% (1)
ADM960 Flashcards
30 pages
من المفترض ان ده حل الكويز بس بيقع في كذا تيست
No ratings yet
من المفترض ان ده حل الكويز بس بيقع في كذا تيست
4 pages
Concepts_Assignment (Technical Report Template)[1]
No ratings yet
Concepts_Assignment (Technical Report Template)[1]
14 pages
Lexical Analyzer
No ratings yet
Lexical Analyzer
4 pages
CC Assignment # 1
No ratings yet
CC Assignment # 1
6 pages
JasonDsouza - 9537 - Batch A
No ratings yet
JasonDsouza - 9537 - Batch A
114 pages
Mid Term Project
No ratings yet
Mid Term Project
4 pages
Ornek Scanner Parser
No ratings yet
Ornek Scanner Parser
44 pages
cdjavacodes (1) (1)
No ratings yet
cdjavacodes (1) (1)
23 pages
Sec B
No ratings yet
Sec B
43 pages
LA Using Transition Diagram
No ratings yet
LA Using Transition Diagram
2 pages
CD Lab Manual File
No ratings yet
CD Lab Manual File
27 pages
Program No. - 3: Write A Program To Find Different Tokens in A Program
No ratings yet
Program No. - 3: Write A Program To Find Different Tokens in A Program
3 pages
Week 2a &2B
No ratings yet
Week 2a &2B
6 pages
21bai1724 Lab-01
No ratings yet
21bai1724 Lab-01
11 pages
System Software and Compiler Lab: Token Separation
No ratings yet
System Software and Compiler Lab: Token Separation
5 pages
22bce2509 VL2024250102410 Ast01
No ratings yet
22bce2509 VL2024250102410 Ast01
12 pages
sslab2
No ratings yet
sslab2
6 pages
21BCE3008
No ratings yet
21BCE3008
7 pages
Cs-603 Activity: Abca-1 (Coding/Debugging) Compiler: Name - Divyansh Sharma Roll No. - 0905cs211055
No ratings yet
Cs-603 Activity: Abca-1 (Coding/Debugging) Compiler: Name - Divyansh Sharma Roll No. - 0905cs211055
6 pages
Experiment No 3 PDF
No ratings yet
Experiment No 3 PDF
4 pages
Compiler Record
No ratings yet
Compiler Record
42 pages
Lexer
No ratings yet
Lexer
6 pages
compiler practical file (1)
No ratings yet
compiler practical file (1)
33 pages
compiler lab2
No ratings yet
compiler lab2
17 pages
Compiler Design Lab
No ratings yet
Compiler Design Lab
68 pages
Assignment No- 01
No ratings yet
Assignment No- 01
4 pages
Compiler Design Lab Manual
No ratings yet
Compiler Design Lab Manual
33 pages
Name:atif Ali Enrollment: (01-134191-008)
No ratings yet
Name:atif Ali Enrollment: (01-134191-008)
15 pages
CD Lab Manual
No ratings yet
CD Lab Manual
43 pages
01 134201 011 9556776808 12042022 111907pm
No ratings yet
01 134201 011 9556776808 12042022 111907pm
14 pages
2775
No ratings yet
2775
65 pages
Lab 2: Lexer Implementation: Preparation
No ratings yet
Lab 2: Lexer Implementation: Preparation
6 pages
CD Lab Manual
No ratings yet
CD Lab Manual
48 pages
Compiler Design Record (21072)
No ratings yet
Compiler Design Record (21072)
48 pages
1PrCD
No ratings yet
1PrCD
6 pages
assignment 2
No ratings yet
assignment 2
4 pages
Lab 5 (Latest-ByAman)(For Students)
No ratings yet
Lab 5 (Latest-ByAman)(For Students)
5 pages
Compiler Design Lab Manual
82% (11)
Compiler Design Lab Manual
47 pages
Cse420 Lab 1
No ratings yet
Cse420 Lab 1
4 pages
CD Lab Manual
No ratings yet
CD Lab Manual
48 pages
Compiler Design (CS-701) : Develop A Lexical Analyzer To Recognize A Few Patterns in C
No ratings yet
Compiler Design (CS-701) : Develop A Lexical Analyzer To Recognize A Few Patterns in C
17 pages
lab2_cd_22BLC1161
No ratings yet
lab2_cd_22BLC1161
9 pages
a
No ratings yet
a
4 pages
compiler .cppp
No ratings yet
compiler .cppp
4 pages
Program Scanner Untuk Melakukan Analisis Leksikal Dengan C
No ratings yet
Program Scanner Untuk Melakukan Analisis Leksikal Dengan C
4 pages
21BAI1213 - Abhinav V - Experiment-2
No ratings yet
21BAI1213 - Abhinav V - Experiment-2
11 pages
Compiler Design Lab Work
No ratings yet
Compiler Design Lab Work
43 pages
Lab 3
No ratings yet
Lab 3
8 pages
CD File - Merged
No ratings yet
CD File - Merged
52 pages
Rajalakshmi Institute of Technology Chennai: Department of Computer Science and Engineering
No ratings yet
Rajalakshmi Institute of Technology Chennai: Department of Computer Science and Engineering
20 pages
CS3501-Compiler Lab-2021R-Updated-19-7-2023
No ratings yet
CS3501-Compiler Lab-2021R-Updated-19-7-2023
44 pages
Lecture 2.76
No ratings yet
Lecture 2.76
31 pages
Implementation of Lexical Analyser Using C
No ratings yet
Implementation of Lexical Analyser Using C
11 pages
CompilerConsLab Pranjal
No ratings yet
CompilerConsLab Pranjal
11 pages
Compiler Design & Networks Lab Manual
No ratings yet
Compiler Design & Networks Lab Manual
69 pages
C# Package Mastery: 100 Essentials in 1 Hour - 2024 Edition
From Everand
C# Package Mastery: 100 Essentials in 1 Hour - 2024 Edition
Tenko
No ratings yet
Rust Package 100 Knocks: One-Hour Mastery Series 2024 Edition
From Everand
Rust Package 100 Knocks: One-Hour Mastery Series 2024 Edition
Kanto
No ratings yet
Oracle Certified Professional Java Programmer OCPJP 1Z0 809
From Everand
Oracle Certified Professional Java Programmer OCPJP 1Z0 809
Manish Soni
No ratings yet
Python Reference: An Alphabetical Guide
From Everand
Python Reference: An Alphabetical Guide
Jo Foster
No ratings yet
Computer Engineering Laboratory Solution Primer
From Everand
Computer Engineering Laboratory Solution Primer
Karan Bhandari
No ratings yet
Gandingan
No ratings yet
Gandingan
4 pages
PG AI Principles v1.0
No ratings yet
PG AI Principles v1.0
9 pages
New Performance Ventilation Controls
No ratings yet
New Performance Ventilation Controls
2 pages
COS3721-101 2018 3 B PDF
No ratings yet
COS3721-101 2018 3 B PDF
18 pages
Only And: For Regular Serving Railway Employees of SWR Rwfiynk
No ratings yet
Only And: For Regular Serving Railway Employees of SWR Rwfiynk
5 pages
Top 5 attack combinations for Town Hall 10 in Clash of Clans (2024)
No ratings yet
Top 5 attack combinations for Town Hall 10 in Clash of Clans (2024)
1 page
ET01 User Manual
No ratings yet
ET01 User Manual
3 pages
MQ Iib Crash
No ratings yet
MQ Iib Crash
8 pages
GA4 Ecommerce Tracking - Part 2 - Ecommerce Events PDF
No ratings yet
GA4 Ecommerce Tracking - Part 2 - Ecommerce Events PDF
18 pages
Instructions for Term End Examination-June 2025
No ratings yet
Instructions for Term End Examination-June 2025
3 pages
hologic 2015 PPT
No ratings yet
hologic 2015 PPT
38 pages
Digital Systems Design Using VHDL 3rd Edition Roth Solutions Manual 1
100% (61)
Digital Systems Design Using VHDL 3rd Edition Roth Solutions Manual 1
36 pages
Capstone Project m&A
No ratings yet
Capstone Project m&A
12 pages
Operational Readiness Assessment ORA Template With Instructions
100% (3)
Operational Readiness Assessment ORA Template With Instructions
13 pages
Sequential Practice
No ratings yet
Sequential Practice
6 pages
Brother DR2300 Drum Unit Reset Instructions
No ratings yet
Brother DR2300 Drum Unit Reset Instructions
4 pages
Effect of Mobile Marketing On Youngsters: Padmashree Institute of Management and Sciences
No ratings yet
Effect of Mobile Marketing On Youngsters: Padmashree Institute of Management and Sciences
36 pages
FPS Shooter Games Presentation
No ratings yet
FPS Shooter Games Presentation
10 pages
Automatic V Oltage Regulator
No ratings yet
Automatic V Oltage Regulator
2 pages
SSL Handshake With Two-Way Authentication With Certificates
No ratings yet
SSL Handshake With Two-Way Authentication With Certificates
1 page
Governing Body College Presentation
No ratings yet
Governing Body College Presentation
31 pages
CA SPOM Set-D Paper-4 Concept Compilation
No ratings yet
CA SPOM Set-D Paper-4 Concept Compilation
61 pages
Chap 2 (Linear Programing by Simplex)
No ratings yet
Chap 2 (Linear Programing by Simplex)
55 pages
Computational Fluid Dynamics Assignment 2
No ratings yet
Computational Fluid Dynamics Assignment 2
20 pages
Adam Thierer (PFF) Remarks at FCC Hearing On Public Interest in Digital Era (3!4!10)
No ratings yet
Adam Thierer (PFF) Remarks at FCC Hearing On Public Interest in Digital Era (3!4!10)
14 pages
Psycopg2 Tutorial
No ratings yet
Psycopg2 Tutorial
6 pages

C2ex Java

Uploaded by

C2ex Java

Uploaded by

Date: Ex – 1(b)Lexical analyser

1. Initialize Data Structures:

● Use LinkedHashSet to store identifiers, preserving insertion order.

2. Read File Line by Line:

● Open and read the file using BufferedReader.

3. Handle Special Tokens:

● Skip preprocessor directives and headers (e.g., #include <stdio.h>).

6. Display Symbol Table:

public class LexicalAnalyzer2 {

// Define initial keywords and operators

static void processLine(String line) {

for (String token : tokens) {

// Skip preprocessor directives

// Skip header files or anything in angle brackets (e.g., <stdio.h>)

// Handle string literals (e.g., "Hello, World!\n")

// Handle character literals (e.g., 'A')

// Handle function calls

// Process other tokens

// Helper method to check if a token is a single alphabetic character

public static void main(String[] args) {

try (BufferedReader br = new BufferedReader(new FileReader(filePath))) {

// Display the symbol table after processing the entire file

You might also like