BACHELOR OF TECHNOLOGY
IN
COMPUTER SCIENCE & ENGINEERING (AI & ML)
Submitted By
Mr. Chundi Kousik (20KT1A4215)
Ms. Chipilla Kavya Sravani (20KT1A4214)
Mr. Pagadala Tharaka Subbareddy (20KT1A4237)
2020-2024
POTTI SRIRAMULU CHALAVADI MALLIKHARJUNARAO
COLLEGE OF ENGINEERING & TECHNOLOGY KOTHAPET,
VIJAYAWADA-520001.
DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING (AI & ML)
CERTIFICATE
This is to certify that the project work entitled “Student Answer Evaluation Using LLMs”
is a bonafide work carried out by Chundi Kousik (20KT1A4215), Chipilla Kavya
Sravani (20KT1A4214), and Pagadala Tharaka Subbareddy (20KT1A4237) in partial fulfillment of the
requirements for the award of the degree of Bachelor of Technology in COMPUTER SCIENCE &
ENGINEERING (AI & ML) of Jawaharlal Nehru Technological University, Kakinada, during
the year 2023-2024. It is certified that all corrections/suggestions indicated for internal
assessment have been incorporated in the report. The project report has been approved as it
satisfies the academic requirements in respect of project work prescribed for the above degree.
External Examiner
ACKNOWLEDGEMENT
We owe our thanks to the many people who helped, supported, and guided us at every step.
We are grateful for the support of our principal, Dr. J. Lakshmi Narayana, who inspired us
with his words of dedication and discipline towards work. We express our gratitude to
Mrs. N. V. Maha Lakshmi, HOD of AIML, for extending her support through technical and
motivational classes, which were a major source of help in carrying out our project. We are
very thankful to Mrs. N. V. Maha Lakshmi, Associate Professor and guide of our project, for
guiding us and correcting our documents with attention and care. She took the pains to go
through the project and make the necessary corrections as and when needed. Finally, we thank
one and all who directly or indirectly helped us to complete our project successfully.
Project Associates
This is to declare that the project entitled “Student Answer Evaluation Using LLMs”
submitted by us in partial fulfillment of the requirements for the award of the degree of Bachelor
of Technology in COMPUTER SCIENCE & ENGINEERING (AI & ML) at Potti
Sriramulu Chalavadi Mallikharjuna Rao College of Engineering and Technology, is a
bonafide record of the project work carried out by us under the guidance of Mrs. N. V. Maha
Lakshmi, Associate Professor. To the best of our knowledge, this work has not been submitted to any
other institute or university for any other degree.
Project Associates
In educational assessment, the need for accurate and insightful evaluation of student responses
is paramount. This project introduces a novel approach by leveraging Large Language Models
(LLMs) to enhance the assessment process. Unlike conventional methods that rely on
predefined criteria or human judgment, this system harnesses the power of LLMs to compare
student answers with model-generated ideal responses. At the heart of this methodology lies the
ability of LLMs to understand language semantics deeply, enabling them to generate coherent
and contextually appropriate responses. By employing this capability, the system facilitates a
dynamic evaluation framework that transcends the limitations of traditional grading
approaches.
Central to this paradigm shift is the emphasis on semantic similarity between student responses
and ideal answers. Through sophisticated computational analysis, the system provides objective
and adaptive assessment, accommodating diverse responses and educational contexts.
Moreover, it offers granular feedback to students, pinpointing specific areas for improvement
in their responses. This innovative approach not only promises to elevate the standard of
educational assessment but also fosters a more equitable and insightful evaluation of student
learning, paving the way for enhanced pedagogical practices.
1. Introduction
1.1.1 Scope
1.1.2 Purpose
2. System Analysis
2.4 Methodologies
3. System Design
3.4 Dataset
4. System Implementation
4.2 Code
4.3 Results
5. Testing
6. Conclusion
7. Future Work
8. References
9. Bibliography
10. Appendix
10.4 NLP
10.5 AI & ML
10.6 LLMs
LIST OF FIGURES
Figure 3.7: Level-0 Data Flow Diagram
Figure 3.8: Level-1 Data Flow Diagram
1. INTRODUCTION
1.1.1 Scope
1.1.2 Purpose
The purpose of this project is to enhance the precision, speed, and scalability of student answer
evaluation processes in educational settings. By leveraging advancements in natural language
processing and information retrieval, the methodology aims to provide more efficient and
reliable means of assessing student performance. The project seeks to offer educators actionable
insights into student performance while also contributing to the broader transformation of
automated assessment practices within educational contexts.
The primary objective of this study is to develop and implement an innovative methodology for
evaluating student responses using advanced natural language processing techniques and
information retrieval systems. The study aims to analyze the effectiveness of this approach in
enhancing the precision, speed, and scalability of assessment processes compared to traditional
methods. Additionally, the study explores the potential implications of integrating technology,
particularly LLMs, in educational assessment practices, with a focus on its applicability across
various subjects and question formats.
In [2], Hossam Magdy Balaha, a researcher at Mansoura University in Egypt, led a study that
introduced an Automatic Exam Correction Framework (AECF) designed for a variety of
question forms, such as equations, essays, and multiple-choice questions (MCQs). The system
responds to the growing need for automated grading, which is particularly relevant in the
context of online learning. At the heart of their project is a five-layered technique aimed at
simplifying the grading process. 'HMB-MMS-EMA', a novel equation similarity checker
algorithm, is a key component of their methodology. A specific dataset called 'HMB-EMD-v1'
was created to support this method and make expression matching tasks easier. The paper uses
Python tools such as Gensim, SpaCy, and NLTK to analyze various approaches for translating
textual input into numerical representations.
In [3], Vedant Bahel describes an automated evaluation system that is intended to evaluate
descriptive responses on test questions. Their suggested method relies on automating the
assessment process through the use of Natural Language Processing (NLP) and Data Mining
techniques, providing a solution to the time-consuming and labor-intensive operation of grading
such answers. The core of their research involves the application of Siamese Manhattan LSTM
(MaLSTM) for text similarity analysis, taking into account variables such as response length,
syntax, language proficiency, and correctness of answers. The study highlights the effectiveness
and institutional sustainability of their method by drawing contrasts with assignments that are
manually assessed. It does, however, recognize its limitations, especially when assessing
responses that include figures, diagrams, equations, or numerical data, indicating potential
directions for future work.
In [4], Neslihan Suzen, Alexander N. Gorban, Jeremy Levesley, and Evgeny M. Mirkes, writing
for the University of Leicester in the United Kingdom and Lobachevsky University in Russia,
explore automatic grading and feedback mechanisms for short answer questions, with a focus
on the UK GCSE system. Using data from a University of North Texas introductory computer
science course, the study applies conventional data mining techniques to compare student
responses with model answers, concentrating on commonly used words. Furthermore, the study
investigates clustering techniques for assigning grades and providing feedback to students
effectively. Most importantly, the research promotes computational methods to improve scoring
reliability rather than to replace human scoring, placing the work at the cutting edge of
instructional technology.
In [5], Dr. A. Mercy Rani, an assistant professor at Sri S. Ramasamy Naidu Memorial College
in Sattur, India, presents "Automated Explanatory Answer Evaluation Using Machine Learning
Approach," which describes a novel method for assessing explanatory answers using a machine
learning paradigm. The report, published in July 2021 in the Design Engineering journal,
addresses the urgent need for effective online assessment techniques, made more apparent by
the pandemic-related shift to digital schooling. The suggested approach extracts keywords from
student responses using Natural Language Processing (NLP) techniques, compares them with
an answer key, and uses Cosine Similarity as a grading metric. It highlights the benefits of
online assessments and the necessity of automated assessment systems in the digital
environment.
In [6], Steven Burrows, in "The Eras and Trends of Automatic Short Answer Grading," provides
a detailed analysis of Automatic Short Answer Grading (ASAG). The research examines the
complex procedure of assessing succinct natural language responses using computer-based
methodologies, and it was published in the International Journal of Artificial Intelligence in
Education in 2015. The authors find five temporal patterns that signify important
methodological advances through a historical investigation of 35 ASAG systems. They also
examine six common dimensions, providing an extensive synopsis of the ASAG environment.
In its conclusion, the study marks a moment of consolidation in the field by identifying an era
of evaluation as the most recent trend in ASAG research.
In [7], Jinzhu Luo examines Automatic Short Answer Grading (ASAG) using deep learning
methods, with a particular emphasis on the Sentence-BERT model. Against the backdrop of
online learning, the work tackles the ongoing difficulties in grading short-answer questions
and proposes a model that outperforms conventional techniques in terms of accuracy and
efficiency. The thesis, submitted in fulfillment of the requirements of a Master of Science
degree, analyzes and contrasts the Sentence-BERT model's performance with that of the
original BERT model, examining different task functions and evaluating the impact of answer
length on grading efficacy. Notable advances in accuracy measures such as the Macro F1 score
and the Weighted F1 score are explained, with a focus on the benefits of shorter replies.
In [8], Md. Motiur Rahman explores an NLP-based Automatic Answer Script Evaluation
system. This approach uses a multidimensional technique with the goal of accelerating the
evaluation process while minimizing problems such as evaluator bias and the time-consuming
nature of manual grading. Text is extracted from answer scripts and summarized, a variety of
similarity metrics are used to determine how closely student responses match the correct
answers, and finally points are assigned. The study investigates the effectiveness of the
suggested evaluation framework by utilizing four distinct similarity metrics—Cosine, Jaccard,
Bigram, and Synonym—as well as keyword-based summarization. Promising experimental
results demonstrate the effectiveness of the automated evaluation system.
In [9], Rick Somers, Samuel Cunningham-Nelson, and Wageeh Boles apply natural language
processing to automatically assess student conceptual understanding from textual responses,
supporting educators in evaluating open-ended student answers.
In [10], an open-access article titled "Applying large language models and chain-of-thought
for automatic scoring" by Gyeong-Geon Lee and colleagues explores the use of GPT-3.5 and
GPT-4 in conjunction with Chain-of-Thought (CoT) prompting to automatically score
student responses in science assessments. The study, which was published in Computers and
Education: Artificial Intelligence, aims to address issues with accessibility, technical
complexity, and the lack of explainability that arises with AI-based scoring systems. Using six
prompt engineering strategies to experiment on a dataset of 1,650 student responses, the study
highlights the advantages of few-shot learning over zero-shot learning and the significant
improvement in scoring accuracy when CoT is combined with item stems and scoring rubrics.
Additionally, the study explores how Large Language Models (LLMs) can provide explainable scoring decisions.
The evaluation of student answers in educational settings presents a significant challenge, often
requiring manual assessment by instructors. This process can be time-consuming and
subjective, leading to inconsistencies in grading. Automated methods for evaluating student
answers are desirable to streamline the assessment process and provide more objective
feedback. However, existing automated systems often lack the ability to accurately assess
student responses in diverse contexts and subject areas. This project aims to address these
limitations by developing a system that leverages question-answer retrieval and chunking
techniques to generate actual answers for comparison with student responses, enabling more
efficient and consistent evaluation of student performance.
• Original Answer Generation:
The system retrieves relevant content chunks and uses LLMs to ensure accuracy
and relevance in delivering original answers based on the content extracted from the chunks,
enhancing the learning and retrieval process in ML and AI studies.
• Student Answer Input:
To facilitate comprehensive evaluation, the system incorporates a module for receiving student
answers as input. This functionality allows seamless interaction with learners, enabling them to
provide their responses to questions related to machine learning (ML) topics. Upon submission
of a student's answer, the system initiates a robust evaluation process to assess its accuracy and
relevance.
• Comparison and Evaluation:
During the comparison and evaluation process, the system meticulously assesses the student's
answer against the actual response, prioritizing semantic coherence over mere word matching.
It employs objective criteria, penalizing factual inaccuracies with a score of 0% while
maintaining fairness and consistency. Furthermore, the system is adept at handling edge cases
like students repeating questions as answers. Ultimately, a percentage score is calculated,
reflecting the degree of alignment between the student's response and the expected answer, thus
providing insightful feedback on the student's comprehension and performance.
• Percentage Calculation:
Calculate a percentage score representing the similarity or correctness of the student answer
compared to the actual answer. The percentage score can be based on various factors, such as
the number of matching words, semantic similarity, or syntactic structure similarity.
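As an illustrative sketch of the semantic-similarity factor (not the exact implementation used in this project; the embedding model name is an assumption), a percentage can be derived from sentence embeddings and cosine similarity as follows:

from sentence_transformers import SentenceTransformer, util

def similarity_percentage(student_answer, actual_answer):
    # Encode both answers into dense sentence embeddings
    model = SentenceTransformer('sentence-transformers/all-MiniLM-L6-v2')
    emb = model.encode([student_answer, actual_answer])
    # Cosine similarity lies in [-1, 1]; clip negatives and scale to 0-100%
    score = float(util.cos_sim(emb[0], emb[1]))
    return round(max(score, 0.0) * 100, 2)

# A paraphrased answer scores high; an unrelated answer scores low
print(similarity_percentage("Overfitting means the model memorises the training data",
                            "Overfitting occurs when a model fits the training data too closely"))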
• Output:
The system analyzes the student's answer alongside the original answer using Large Language
Models (LLMs). These LLMs go beyond simple keyword matching and consider the meaning
behind the words. The result is a percentage score (0% to 100%) reflecting how closely the
student's answer aligns with the expected response. This score takes into account both semantic similarity
and factual accuracy.
2.SYSTEM ANALYSIS
The system analysis for the Answer Evaluating project involves a deep dive into the manual
evaluation processes currently in place, aiming to understand instructor workflows and identify
inefficiencies. This analysis assesses the feasibility of integrating Large Language Models
(LLMs) like Llama and Mistral into the evaluation process, considering operational, technical, and
behavioral factors. Additionally, stakeholder analysis gauges the needs of instructors, students,
and administrators. Ultimately, this phase aims to inform the development of an automated
evaluation system that utilizes LLMs to enhance accuracy, efficiency, and effectiveness in
assessing student answers.
The feasibility study investigates the viability of integrating Large Language Models (LLMs)
like Llama and Mistral into the student answer evaluation process. It assesses scalability, resource
requirements, integration complexity, and ethical considerations to determine practicality and
effectiveness. Through rigorous analysis, the study aims to provide insights into the feasibility
of leveraging LLMs for enhanced student assessment methods. Its findings will inform
decision-making regarding the implementation and integration of LLMs in educational settings.
Operational feasibility assesses the practicality of integrating Large Language Models (LLMs)
like Llama and Mistral into existing educational workflows. It examines whether the system can
seamlessly fit into teachers' and administrators' daily tasks without significant disruption. This
evaluation considers factors such as user training requirements, workflow adjustments, and the
overall impact on productivity and efficiency. Ultimately, operational feasibility aims to
determine whether implementing LLMs for student answer evaluation is operationally viable
within the educational context.
Technical feasibility scrutinizes the integration of Large Language Models (LLMs) such as
Llama and Mistral within the technological infrastructure of educational institutions. It evaluates
the compatibility of LLMs with existing systems and software, ensuring seamless integration
without compromising performance. Assessing hardware and software requirements, this
analysis delves into server capabilities, computational resources, and any necessary software
updates or modifications. Moreover, it investigates the scalability of the system to
accommodate varying workloads and user demands. By addressing these technical aspects
comprehensively, the feasibility study aims to determine the readiness and viability of
incorporating LLMs into the student answer evaluation process.
Behavioral feasibility examines the user acceptance and interaction dynamics of the Smart
Evaluator project. It focuses on creating a user-friendly interface that facilitates intuitive
interaction for both teachers and students. Through features like presenting original answers
alongside student responses and providing percentage scores, the system aims to enhance
engagement and comprehension. By prioritizing clarity and ease of use, the project seeks to
promote user satisfaction and adoption. This evaluation ensures that the system aligns with user
expectations and effectively supports the student evaluation process.
Utilizing entirely open-source resources such as Google Colab and LLMs like Llama and
Mistral ensures a cost-effective approach to implementing the project. Leveraging these free
and accessible tools minimizes initial investment and ongoing maintenance costs, enhancing
long-term sustainability. Integrating Streamlit for the web interface further contributes to
affordability, as it offers a user-friendly platform for development without additional licensing
fees. By embracing open-source solutions, the project not only maximizes cost-efficiency but
also promotes transparency, collaboration, and community-driven innovation.
The System Requirements Specification (SRS) outlines the essential criteria for developing the
automated student answer evaluation system. It delineates the functionalities necessary to
retrieve original answers, compare them with student responses, and report similarity scores,
aiming to provide valuable insights into student performance. Approval of this document
signifies acknowledgment and agreement that the resultant system, fulfilling these stipulated
requirements, will be deemed acceptable for implementation.
This document serves as a guiding framework to ensure the development of a robust and
effective solution aligned with the project's objectives and user needs.
The functional requirements of the system delineate its core capabilities, focused on
streamlining the evaluation process within educational settings. Firstly, it encompasses
retrieving answers from educational materials based on provided questions, ensuring relevance
and accuracy. Following this, the system segments responses into meaningful units, facilitating
comprehensive analysis. Utilizing Large Language Models (LLMs) like Llama and Mistral,
it generates actual answers, providing a benchmark for comparison. Through robust algorithms,
student responses are evaluated for correctness, and similarity percentages are calculated,
offering quantitative feedback. Furthermore, the system automates report generation for both
instructors and students, expediting the feedback loop and enhancing educational outcomes.
These features collectively ensure efficient assessment and feedback delivery, enriching the
teaching and learning experience.
• Supportability: By prioritizing supportability, the system aims to minimize compatibility
issues and facilitate smooth deployment and maintenance processes.
• Flexibility: Providing the flexibility to adapt and extend functionality is crucial for
accommodating evolving requirements and user needs over time. This includes designing
the system with modular architecture and well-defined APIs, enabling the integration of
new features or modules without disrupting existing components. By prioritizing flexibility,
the system aims to future-proof itself and remain adaptable to changing educational trends
and technologies.
The System Requirements Specification (SRS) document for the Answer Evaluation project
serves as a comprehensive blueprint outlining the essential features and characteristics of the
automated student answer evaluation system. It meticulously defines the functionality, usability,
reliability, performance, supportability, and flexibility required to achieve project objectives
effectively. Approval of the SRS signifies a consensus that the developed system must strictly
adhere to these specified requirements to attain acceptance. As a guiding framework for
development, this document ensures alignment with project goals and stakeholder expectations,
fostering clarity and accountability throughout the development lifecycle. By delineating clear
parameters and standards, the SRS lays the foundation for the creation of a robust and efficient
solution tailored to the needs of educators and students.
2.4 METHODOLOGIES
Employing advanced large language models like BERT, GPT, LLaMA, and Mistral, we seek to
revolutionize student answer evaluation by focusing on semantic comprehension over simple
word or sentence matching. We begin with data collection and preprocessing of student answers
across various subjects. Contextual embeddings are then extracted using pre-trained language
models to capture nuanced semantic meaning. Semantic similarity is calculated through
methods like cosine similarity or Euclidean distance for more nuanced evaluation. A scoring
mechanism and grading rubric are devised based on similarity scores to categorize answers by
proficiency. Through iterative refinement and optimization, including model training and fine-
tuning, we aim to create a robust evaluation system. Integration into an automated platform
with an educator-friendly interface ensures practicality. Our methodology will undergo rigorous
evaluation and comparison against traditional approaches, demonstrating its efficacy in
assessing student answers based on meaning.
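As a minimal sketch of the scoring mechanism described above (the band boundaries are assumptions for illustration, not the project's actual rubric), similarity scores can be mapped to proficiency categories as follows:

def grade_from_similarity(similarity_pct):
    # Map a similarity percentage to an assumed proficiency band
    bands = [(85, "Excellent"), (70, "Good"), (50, "Satisfactory"), (0, "Needs improvement")]
    for threshold, label in bands:
        if similarity_pct >= threshold:
            return label

print(grade_from_similarity(78))   # -> "Good"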
• Rubrics: Rubrics serve as invaluable scoring guides, delineating specific criteria for
evaluating student responses across various assignments. By offering a structured
framework, they facilitate consistent and fair assessment practices, aligning with
educational objectives. Traditionally paper-based, these rubrics require instructors to
manually apply scores to each criterion, which can be time-consuming and prone to
subjectivity. However, they remain essential tools for providing transparent feedback and
guiding students towards achieving learning outcomes. As technology evolves, digital
rubric platforms emerge, offering efficiencies in grading and enhancing collaboration
among educators. Transitioning to digital formats promises to streamline assessment
processes while maintaining the integrity and effectiveness of rubric-based evaluation.
3.SYSTEM DESIGN
Elements of a System:
• Data - An ML textbook forms the core training material, enriching the model's
understanding of machine learning concepts. Supplementing this are question and answer
datasets specific to machine learning, further enhancing the system's knowledge base. By
combining these datasets, the system can generate accurate responses and evaluate student
answers effectively. This diversity ensures the system's proficiency in handling a broad
spectrum of queries and tasks related to student answer evaluation.
In initializing the design definition, the plan outlines the development of an automated
educational assessment system leveraging AI technology. This system aims to streamline the
evaluation of student responses, offering immediate and insightful feedback. The process
encompasses thorough analysis of requirements, meticulous system design, integration of AI
algorithms, rigorous reliability testing, and deployment on scalable infrastructure. Key
technologies include the Transformers library for implementing Llama and Mistral models,
Streamlit for web application development, and the Hugging Face Hub for model versioning
and sharing. By leveraging these cutting-edge tools and methodologies, the project aims to
revolutionize the student assessment process, enhancing efficiency and effectiveness in
educational contexts.
Establishing design characteristics for the automated educational assessment system involves
defining clear attributes for architecture, interfaces, and system elements. The focus lies on
achieving real-time performance, scalability, and accuracy, particularly in the implementation
of the LLM-based answer evaluation pipeline. Interfaces are refined to optimize user interaction and
accommodate external service integration, ensuring a seamless and intuitive experience for both
educators and students. By prioritizing these design characteristics, the system aims to deliver
efficient assessment processes, reliable performance, and enhanced usability within educational
environments.
The system architecture of the project revolves around utilizing PDF documents, chunking them
into sections, and employing Large Language Models (LLMs) such as Llama and Mistral to aid
in question generation and answer evaluation. By segmenting the PDFs, the system extracts
relevant portions and generates questions using LLMs, considering each chunk as an original
answer. When students submit their answers, the system prompts LLMs with specific rules to
compare these responses against the original chunks, calculating the similarity percentage as an
output. This process ensures a streamlined approach to assessing student answers, leveraging
AI capabilities to enhance accuracy and efficiency.
Through the systematic integration of LLMs and PDF chunking, the system orchestrates a
seamless evaluation process, enabling educators to efficiently analyze student responses and
provide timely feedback. By harnessing the power of AI algorithms and prompt-guided
interactions, the system facilitates precise comparisons between student answers and original
sources. Ultimately, educators receive comprehensive insights into the similarity percentage,
along with the source of the original answer, empowering them to make informed decisions and
support student learning effectively.
In the Data Flow Diagram (DFD) depicting the process of PDF chunking, database
creation, loading LLMs, question generation, and similarity assessment, the flow of data
begins with the input of PDF documents. These documents are then segmented into
manageable chunks, which are stored in the database for later retrieval. Upon receiving a
request for question generation, the system extracts chunks from the database and prompts
the LLMs to generate questions based on this content. The questions are then presented to
the users, initiating the process of answer submission by students.
Following student submissions, the system retrieves the corresponding original chunks
from the database and passes both the student's answer and the original chunk to the LLMs
for comparison. Utilizing predefined rules and algorithms, the LLMs analyze the similarity
between the two responses, generating a similarity percentage as an output. This output,
along with the source of the original answer, is then provided to educators for assessment
and feedback. Throughout this process, the DFD illustrates the flow of data, ensuring
transparency and understanding of each step involved in the evaluation process.
Components Of DFDs:
The data flow diagram has four components. They are:
• External Entity
• Process
• Data Flow
• Warehouse
External Entity:
An outside process or system that sends data to, or receives data from, the diagrammed
system. External entities are also known as sources, terminators, sinks, or actors, and are
represented by squares.
Process:
Input-to-output transformation in a system takes place because of a process function. A
process is drawn as a rectangle with rounded corners, an oval, or a circle. The process is
named with a short phrase, a single word, or a brief sentence that expresses its essence.
Data Flow:
Data flow describes the information transferred between different parts of the system. The
arrow is the symbol of data flow. A relatable name should be given to the flow to indicate
the information being moved. Data flow can also represent material, along with information,
that is being moved; material shifts are modeled in systems that are not merely informative.
A given flow should only transfer a single type of information. The direction of flow is
represented by the arrow, which can also be bi-directional.
Warehouse:
The data is stored in the warehouse for later use. Two horizontal lines represent the symbol
of the store. The warehouse is not restricted to being a data file; it can be anything, such as
a folder of documents, an optical disc, or a filing cabinet. The data warehouse can be viewed
independently of its implementation. When data flows from the warehouse it is considered a
data read, and when data flows to the warehouse it is called a data entry or data update.
In software engineering, DFDs can be drawn to represent a system at different levels of
abstraction. Higher-level DFDs are partitioned into lower levels that expose more information
and functional detail. Levels in a DFD are numbered 0, 1, 2, and beyond; in this project we
mainly use two levels of the data flow diagram: the 0-level DFD and the 1-level DFD.
Level-0:
The Level 0 Data Flow Diagram (DFD) outlines the core process of the automated student answer
evaluation system. It begins with the input of student responses, followed by the selection of the
appropriate Large Language Model (LLM) to process the query. The system then passes the query to the
selected LLM, which analyzes the student's response and the original answer, ultimately generating the
similarity percentage as an output. This simplified depiction provides a high-level overview of the
fundamental steps involved in the evaluation process, highlighting the flow of data from input to output
through the interaction with the LLM.
Level-1:
In the Level 1 Data Flow Diagram (DFD), the system's steps are expanded to provide a more
detailed understanding of the student answer evaluation process. It begins with the input of
student responses, which are then passed to the LLM selection module. Here, the system
identifies the most suitable LLM based on predefined criteria and passes the selected model to
the query processing module. The query processing module extracts the relevant information
from both the student's answer and the original source, preparing the data for comparison.
Subsequently, the system passes the processed data to the similarity assessment module, where
the LLM evaluates the similarity between the student's response and the original answer. After
analysis, the module generates the similarity percentage, which is then presented as the output
of the system. This detailed breakdown in the Level 1 DFD allows for a clear visualization of
the sequential steps involved in the student answer evaluation process, from input to output,
and highlights the role of each module in facilitating efficient assessment and feedback.
3.4 DATASETS:
•Hands-on Machine Learning with Scikit-Learn, Keras & TensorFlow – Aurélien Géron
http://14.139.161.31/OddSem-0822-1122/Hands-On_Machine_Learning_with_Scikit-Learn-
Keras-and-TensorFlow-2nd-Edition-Aurelien-Geron.pdf
• We have compiled a comprehensive PDF of AI and ML questions and answers, suited to
database storage and retrieval. It is designed to facilitate easy chunking for efficient data
management and future reference. This resource streamlines knowledge acquisition and
enhances the retrieval process, ensuring seamless access to key insights across AI and ML
topics.
4.SYSTEM IMPLEMENTATION
Streamlit is a Python library for building interactive web applications. It offers a simple syntax and a
variety of widgets for data visualization and user interaction. Streamlit apps update in real-time as users
interact with them, providing a seamless experience. It integrates well with popular Python libraries like
Pandas and Matplotlib. Deployment to platforms like Streamlit Sharing and Heroku is straightforward.
Installation:
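Streamlit is distributed on PyPI; assuming a standard Python environment, it can be installed with:

pip install streamlit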
Google Colab:
Colab refers to Google Colab, short for Google Colaboratory, which is a free cloud-
based platform provided by Google that allows users to write and execute Python code in a
Jupyter Notebook environment. It provides access to GPU and TPU resources, making it
particularly useful for machine learning tasks. Users can also collaborate in real-time on Colab
notebooks, making it a popular choice for collaborative coding and sharing code with others.
# Imports assume the classic LangChain package layout used by this project
from langchain.document_loaders import DirectoryLoader, PyPDFLoader
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain.embeddings import HuggingFaceEmbeddings
from langchain.vectorstores import FAISS

DATA_PATH = 'sourcedocs/'
DB_FAISS_PATH = 'vectorstore/db_faiss'
Creating Database:
def create_vector_db():
    # Load every PDF from the source directory
    loader = DirectoryLoader(DATA_PATH,
                             glob='*.pdf',
                             loader_cls=PyPDFLoader)
    documents = loader.load()
    # Split documents into overlapping chunks
    text_splitter = RecursiveCharacterTextSplitter(chunk_size=500,
                                                   chunk_overlap=50)
    texts = text_splitter.split_documents(documents)
    # Embed the chunks and store them in a local FAISS index
    embeddings = HuggingFaceEmbeddings(model_name='sentence-transformers/all-MiniLM-L6-v2',
                                       model_kwargs={'device': 'cpu'})
    db = FAISS.from_documents(texts, embeddings)
    db.save_local(DB_FAISS_PATH)

if __name__ == "__main__":
    create_vector_db()
from google.colab import drive
from huggingface_hub import login
drive.mount("/content/drive")
login("hf_xxxxxxxxxxxxxxxxxxxx")  # Hugging Face access token (placeholder)
%%writefile qa_interface.py
import streamlit as st
import google.generativeai as genai
from langchain.embeddings import HuggingFaceEmbeddings
from langchain.vectorstores import FAISS
from langchain.llms import CTransformers
from langchain.chains import RetrievalQA
from langchain.prompts import PromptTemplate

DB_FAISS_PATH = '/content/drive/MyDrive/vectorstore/db_faiss/'

def qa_bot():
    # Load the FAISS index built earlier with the same embedding model
    embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2",
                                       model_kwargs={'device': 'cpu'})
    db = FAISS.load_local(DB_FAISS_PATH, embeddings,
                          allow_dangerous_deserialization=True)
    # Quantised Mistral model served locally through CTransformers
    llm = CTransformers(
        model="TheBloke/Mistral-7B-Instruct-v0.1-GGUF",
        model_type="llama",
        max_new_tokens=1000,
        temperature=0.4)
    # Prompt object reconstructed from the template fragment shown in the report
    qa_template = """Context: {context}
Question: {question}
Helpful answer:"""
    prompt = PromptTemplate(template=qa_template,
                            input_variables=['context', 'question'])
    qa_chain = RetrievalQA.from_chain_type(llm=llm,
                                           chain_type='stuff',
                                           retriever=db.as_retriever(search_kwargs={'k': 2}),
                                           return_source_documents=True,
                                           chain_type_kwargs={'prompt': prompt})
    return qa_chain
def generate_response_from_gemini(input_text):
    genai.configure(api_key="…..")  # Gemini API key omitted in the report
    generation_config = {
        "temperature": 0.5,
        "top_p": 1,
        "top_k": 32,
        "max_output_tokens": 4096,
    }
    safety_settings = []  # safety settings omitted in the report
    llm = genai.GenerativeModel(
        model_name="gemini-pro",
        generation_config=generation_config,
        safety_settings=safety_settings,
    )
    output = llm.generate_content(input_text)
    return output.text
# Streamlit app
def main():
    st.title("Student Answer Evaluation")
    # UI inputs reconstructed from the report's description of the interface
    question = st.text_input("Enter the question")
    student_answer = st.text_area("Enter the student's answer")

    if question and student_answer:
        # Call QA function to fetch the original (actual) answer from the vector store
        qa_result = qa_bot()({'query': question})
        qa_answer = qa_result['result']

        input_prompt_template = """
You are an interviewer who interviews the students. Your job is to give the marks in the
form of percentages for the answers given by the students to the actual answer. Before you give
the marks to the students take the time to give marks, and there are some most important rules
to be followed and edge cases to be handled. The input is given in the form of
studentAnswer: {studentAnswer}, actualAnswer: {actualAnswer}
Set of rules:-
1. Make sure that you don't give marks based on the words matched for the studentAnswer
and the actualAnswer. Give marks based on comparing the whole meaning of the
studentAnswer and the actualAnswer.
2. Make sure that you don't give any explanation; your work is only to give marks for the
studentAnswer in the form of a percentage.
Edge cases:-
1. If the student repeats the question as the answer give him 0 marks.
2. Even if the words in the studentAnswer match with the words in the actualAnswer don't
give marks by just considering word matches; give marks based on comparing the meaning of
the studentAnswer with the actualAnswer."""

        response_text = generate_response_from_gemini(
            input_prompt_template.format(actualAnswer=qa_answer, studentAnswer=student_answer))

        # Display results
        st.subheader("Results:")
        st.write("Original answer:", qa_answer)
        st.write("Similarity percentage:", response_text)

if __name__ == "__main__":
    main()
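After qa_interface.py is written, the app is launched with the Streamlit CLI. In a hosted notebook such as Colab this typically requires exposing the port through a tunnel; the localtunnel step below is one assumed option rather than the project's documented setup:

!streamlit run qa_interface.py --server.port 8501 &>/dev/null &
!npx localtunnel --port 8501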
4.3 RESULTS
The above Figure 4.4 shows the percentage score for a student answer; because the answer is
wrong, the percentage is zero.
Figure 4.6 shows the evaluation of the student answer against the original answer. The
percentage represents the similarity between the student answer and the original answer.
5.TESTING
Perplexity: Perplexity is a metric in language modeling that measures how well a model
predicts a sequence of words. Lower perplexity values indicate better predictive performance,
suggesting that the model is less surprised by the actual sequence of words. It is calculated as
the exponentiation of the average negative log likelihood of the test data, normalized by the
number of words.
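As a minimal sketch of this definition (the token log-probabilities below are made-up numbers, not outputs of the project's models), perplexity can be computed from per-token log-likelihoods as follows:

import math

def perplexity(log_probs):
    # log_probs: natural-log probabilities the model assigns to each observed token
    avg_neg_log_likelihood = -sum(log_probs) / len(log_probs)
    return math.exp(avg_neg_log_likelihood)

# Illustrative values only: higher probabilities (less negative logs) give lower perplexity
print(perplexity([-1.2, -0.7, -2.3, -0.9]))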
6.CONCLUSION
The utilization of Large Language Models (LLMs) holds immense promise for revolutionizing
education through automated student response evaluation. With a focus on continuous study,
collaborative efforts, and innovative approaches, LLM-based evaluation models can undergo
further refinement to enhance accuracy and efficiency. Future endeavors should prioritize
iterative improvements in algorithms and methodologies, aiming to optimize the performance
of these models and meet the evolving needs of educational settings. Additionally, exploring a
diverse range of free-source LLMs and assessing their effectiveness presents valuable
opportunities for advancing the evaluation process, paving the way for innovative practices and
improved learning outcomes.
7.FUTURE WORK
In future endeavors, advancing the automated student answer evaluation system entails a
multifaceted exploration of various avenues for enhancement. Experimentation with a diverse
array of large language models (LLMs), ranging from GPT-3 to BERT, RoBERTa, and XLNet,
offers a rich opportunity to evaluate their efficacy in generating accurate answers and assessing
student responses. By fine-tuning selected LLMs on domain-specific datasets, we can bolster
their understanding and adaptability to the intricacies of educational contexts, fostering more
precise evaluations.
Furthermore, adopting ensemble methods that amalgamate predictions from multiple LLMs
holds promise for augmenting overall performance by capitalizing on the strengths of individual
models. Supplementing the dataset through techniques like paraphrasing and synonym
substitution can enhance model generalization and fortify robustness. Integrating advanced
evaluation metrics, such as semantic similarity and coherence, empowers deeper insights into
the quality of student responses, facilitating more nuanced assessments. Moreover, exploring
the development of domain-specific LLMs tailored explicitly to educational domains and
integrating user feedback mechanisms for continuous refinement represent pivotal steps
towards achieving heightened efficacy and relevance in educational assessment. Concurrently,
optimizing scalability and efficiency ensures the system's adeptness in managing larger datasets
and heightened demand, paving the way for broader adoption and impact across educational
landscapes.
8.REFERENCES
[2] H. M. Balaha and M. M. Saafan, "Automatic Exam Correction Framework (AECF) for the
MCQs, Essays, and Equations Matching," in IEEE Access, vol. 9, pp. 32368-32389, 2021, doi:
10.1109/ACCESS.2021.3060940.
[3] Vedant Bahel, Achamma Thomas, "Text similarity analysis for evaluation of descriptive
answers," https://doi.org/10.48550/arXiv.2105.02935
[4] N. Suzen, A. N. Gorban, J. Levesley, E. M. Mirkes, "Automatic Short Answer Grading and
Feedback Using Text Mining Methods," Procedia Computer Science, ISSN 1877-0509,
https://doi.org/10.1016/j.procs.2020.02.171.
[5] Rani, A.M. Automated Explanatory Answer Evaluation Using Machine Learning Approach.
Design Engineering, pp.1181-1190, 2021.
[6] Burrows, S., Gurevych, I. & Stein, B. The Eras and Trends of Automatic Short Answer
Grading. Int J Artif Intell Educ 25, 60–117 (2015). https://doi.org/10.1007/s40593-014-0026-8
[7] Bonthu, S., Rama Sree, S., Krishna Prasad, M.H.M. (2021). Automated Short Answer
Grading Using Deep Learning: A Survey. In: Holzinger, A., Kieseberg, P., Tjoa, A.M., Weippl,
E. (eds) Machine Learning and Knowledge Extraction. CD-MAKE 2021. Lecture Notes in
Computer Science(), vol 12844. Springer, Cham. https://doi.org/10.1007/978-3-030-84060-0_5
[8] S. K. Sinha, S. Yadav and B. Verma, "NLP-based Automatic Answer Evaluation," 2022 6th
International Conference on Computing Methodologies and Communication (ICCMC), Erode,
India, 2022, pp. 807-811, doi: 10.1109/ICCMC53470.2022.9754052.
[9] Rick Somers, Samuel Cunningham-Nelson, Wageeh Boles, Applying natural language
processing to automatically assess student conceptual understanding from textual responses,
https://doi.org/10.14742/ajet.7121
[10] Gyeong-Geon Lee, Ehsan Latif, Xuansheng Wu, Ninghao Liu, Xiaoming Zhai, "Applying
large language models and chain-of-thought for automatic scoring," Computers and Education:
Artificial Intelligence, Volume 6, 2024, 100213, ISSN 2666-920X,
https://doi.org/10.1016/j.caeai.2024.100213
9.BIBLIOGRAPHY
1) https://huggingface.co/
2) https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GGUF
3) https://huggingface.co/search/full-text?q=TheBloke%2FLlama-2-7b-Chat-GGUF
4) https://www.langchain.com
5) https://research.ibm.com/blog/retrieval-augmented-generation-RAG
10.APPENDIX
10.1 Python Introduction
Python is a high-level, interpreted programming language known for its simplicity, readability,
and versatility. Developed by Guido van Rossum and first released in 1991, Python has gained
widespread popularity and has become one of the most widely used programming languages in
the world. Its syntax emphasizes code readability and simplicity, making it an ideal language
for both beginners and experienced developers alike.
Python's versatility is evident in its broad range of applications across various domains. In web
development, frameworks like Django and Flask are popular choices for building robust and
scalable web applications. In data science, Python's rich ecosystem of libraries such as NumPy,
Pandas, and Matplotlib makes it a preferred language for data analysis, visualization, and
machine learning. Moreover, Python is extensively used in artificial intelligence and scientific
computing, with libraries like TensorFlow, PyTorch, and SciPy powering advanced research
and applications in these fields.
Due to its open-source nature and active community support, Python continues to evolve
rapidly, with frequent updates and new features being added to the language. Its ease of learning
and powerful capabilities have contributed to its widespread adoption across industries and
domains, solidifying its position as one of the most popular programming languages in the
world.
Python's origins can be traced back to the late 1980s when Guido van Rossum, a Dutch
programmer, began working on the language as a side project. His goal was to create a language
that prioritized simplicity and readability while still being powerful and versatile. In February
1991, Python's first version, Python 0.9.0, was released.
Over the years, Python has undergone several major releases, each introducing new features,
improvements, and optimizations. Python 2.x series, released in 2000, became widely popular
and remained in use for many years. However, with the introduction of Python 3.x series in
2008, the language underwent significant changes and improvements, leading to better
performance, enhanced features, and improved syntax.
Python's community-driven development model has played a crucial role in its success. The
Python Software Foundation (PSF), established in 2001, oversees the development and
maintenance of the language, ensuring its continued growth and evolution. Today, Python
enjoys widespread adoption and usage across industries, with millions of developers worldwide
contributing to its ecosystem through libraries, frameworks, and open-source projects.
Python is renowned for its rich set of features and characteristics that make it an attractive
choice for developers. Some of the key features of Python include:
Simplicity: Python's syntax is designed to be clear and concise, making it easy to read and write
code. This simplicity allows developers to focus on solving problems rather than worrying
about complex syntax.
Readability: Python emphasizes readability, with code that closely resembles English-like
syntax. This readability reduces the time and effort required to understand and maintain code,
especially in collaborative projects.
Interpreted: Python is an interpreted language, meaning that code is executed line by line by
an interpreter at runtime. This allows for rapid development and testing, as changes to code can
be immediately evaluated without the need for compilation.
Dynamic Typing: Python uses dynamic typing, allowing variables to be assigned without
specifying their data types explicitly. This flexibility simplifies code development and enhances
code readability.
Rich Standard Library: Python comes with a comprehensive standard library that provides
built-in support for a wide range of tasks and functionalities, including file I/O, networking,
data manipulation, and more. This extensive library reduces the need for external dependencies
and simplifies development.
Community Support: Python boasts a large and active community of developers who
contribute to its ecosystem by creating libraries, frameworks, and tools. This vibrant community
ensures continuous improvement and innovation within the Python ecosystem.
10.4 NLP
Natural Language Processing (NLP) is a branch of artificial intelligence that focuses on the
interaction between computers and human languages. NLP enables computers to understand,
interpret, and generate human language in a way that is both meaningful and useful.
Python plays a significant role in NLP due to its simplicity, readability, and rich ecosystem of
libraries and tools. Some of the key aspects of Python's involvement in NLP include:
NLTK (Natural Language Toolkit): NLTK is a popular Python library for NLP tasks such as
tokenization, stemming, tagging, parsing, and semantic reasoning. It provides a wide range of
tools and resources for building NLP applications and conducting research in the field.
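As a brief illustration of these NLTK tasks (a generic sketch, not code from this project), tokenization, stemming, and part-of-speech tagging can be performed as follows; the resource downloads are assumed to be available:

import nltk
from nltk.stem import PorterStemmer

nltk.download('punkt')
nltk.download('averaged_perceptron_tagger')

text = "Students submitted their answers for evaluation."
tokens = nltk.word_tokenize(text)                    # tokenization
stems = [PorterStemmer().stem(t) for t in tokens]    # stemming
tags = nltk.pos_tag(tokens)                          # part-of-speech tagging
print(tokens, stems, tags, sep="\n")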
TensorFlow and PyTorch: TensorFlow and PyTorch are popular deep learning frameworks
that can be used for NLP tasks such as text classification, sentiment analysis, machine
translation, and text generation. These frameworks provide tools and APIs for building and
training deep learning models for NLP applications.
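For instance (a generic sketch rather than code from this project), a pretrained PyTorch-backed model from the Hugging Face Transformers library can perform sentiment analysis in a few lines; the default model downloaded by the pipeline is an assumption of this example:

from transformers import pipeline

# Downloads a default pretrained sentiment model (PyTorch backend) on first use
classifier = pipeline("sentiment-analysis")
print(classifier("The automated evaluation gave clear and helpful feedback."))
# -> [{'label': 'POSITIVE', 'score': ...}]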
Word Embeddings: Word embeddings such as Word2Vec, GloVe, and FastText are
techniques used to represent words as dense vectors in a high-dimensional space. Python
libraries like Gensim and TensorFlow provide implementations of these techniques, making it
straightforward to use word embeddings in Python-based NLP applications.
10.5 AI & ML
Example: Virtual assistants like Siri, Alexa, and Google Assistant use AI algorithms to
understand and respond to user commands. They can perform tasks such as setting reminders,
answering questions, and playing music based on user preferences. Another example of AI is
autonomous vehicles, which use sensors, cameras, and AI algorithms to perceive their
environment, navigate roads, and make real-time driving decisions.
Large Language Models (LLMs) represent a diverse array of advanced systems designed to
understand and generate human language. Built upon complex neural network architectures like
Transformer models, such as GPT-3, BERT, and XLNet, LLMs are capable of comprehending
context and producing coherent text-based outputs. They undergo extensive training on massive
datasets, allowing them to learn general language patterns and semantics.
Moreover, LLMs can be fine-tuned on task-specific datasets, enhancing their effectiveness for
specialized tasks like sentiment analysis or question answering. This versatility and adaptability
make LLMs invaluable tools for a wide range of natural language processing (NLP)
applications, driving advancements in language understanding and generation technology.
The key distinction between pretrained and fine-tuned Large Language Models (LLMs) lies in
their training stages and objectives. Pretrained LLMs undergo initial training on vast amounts
of unlabeled text data without specific task supervision. This phase enables them to learn
general language patterns and semantics, forming a foundational understanding of human
language. In contrast, fine-tuned LLMs undergo additional training on task-specific datasets
after the initial pretrained phase.
This fine-tuning process tailors the model's knowledge and performance to specific tasks or
domains, such as sentiment analysis or question answering, enhancing its effectiveness and
accuracy for targeted applications. Overall, pretrained LLMs establish a broad linguistic
understanding, while fine-tuned LLMs refine their capabilities for specialized tasks through
additional training.
• XLNet:
XLNet is a transformer-based language model that captures bidirectional context from
surrounding words. It is used as a base model for various natural language processing
applications.
• Llama:
Llama is one of the prominent Large Language Models (LLMs) and has been compared
to other models like ChatGPT-4 and Mistral in terms of performance and capabilities. It
can perform tasks such as question answering, text generation and summarization, and
translation, and is suitable for research in AI ethics, educational platforms, and language
analysis tools.
• Mistral Models:
Mistral AI’s models, including Mistral 7B and Mixtral 8x7B, outperform competitors
like Llama 2 and GPT-3.5. These versatile models excel in question answering, text
generation, summarization, and translation, making them ideal for industrial
automation, energy-efficient AI deployments, and mobile applications.