Requirement Document for the Implementation of Bug Report-Based Test Input Extraction and Test Case Generation
Introduction
This document provides a detailed plan for implementing a novel approach to test
input extraction and test case generation. Inspired by the BRMINER technique, the goal is to
leverage Large Language Models (LLMs) for the same task. The workflow first replicates
BRMINER's results and then transitions to an LLM-based approach.
Existing Approach
BRMINER Overview
1. Purpose: BRMINER extracts relevant test inputs from bug reports and utilizes them to
generate automated test cases using EvoSuite.
2. Methodology:
○ Parses bug reports for structured and unstructured data.
○ Uses regular expressions and Java code parsing to extract literals and potential
test inputs.
○ Extracted inputs are fed into EvoSuite to generate test cases, which are
evaluated on the Defects4J dataset.
3. Performance: Achieves 68.68% relevant input extraction using regular expressions and
detects 45 previously undetected bugs.
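To make the regex-based extraction step concrete, the following is a minimal, hypothetical sketch of how literals might be pulled from bug report text. The pattern names and the sample report are illustrative assumptions, not BRMINER's actual implementation.

```python
import re

# Illustrative patterns for two common literal kinds found in bug reports.
# These are assumptions for demonstration, not BRMINER's real pattern set.
LITERAL_PATTERNS = {
    "string": re.compile(r'"([^"\\]*(?:\\.[^"\\]*)*)"'),          # double-quoted strings
    "number": re.compile(r'(?<![\w.])-?\d+(?:\.\d+)?(?![\w.])'),  # ints and floats
}

def extract_literals(report_text):
    """Return candidate test inputs (literals) found in a bug report."""
    found = {}
    for kind, pattern in LITERAL_PATTERNS.items():
        found[kind] = pattern.findall(report_text)
    return found

report = 'Calling parse("2020-13-01") with limit -1 throws IndexOutOfBoundsException'
print(extract_literals(report))
```

In a full pipeline, the extracted literals would then be seeded into EvoSuite's input pool rather than printed.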
Limitations
New Approach
Objective
Develop a more adaptable and context-aware system by leveraging a fine-tuned LLM to:
1. Extract test inputs directly from structured and unstructured data in bug reports.
2. Generate precise and relevant test cases using the same dataset.
3. Enhance coverage and bug detection capabilities compared to BRMINER.
Implementation Plan
1. Collect Data:
○ Bug reports (structured and unstructured formats).
○ Source code with and without bugs (Defects4J dataset).
○ Test cases and corresponding bug fixes.
2. Preprocess Data:
○ Normalize bug reports into a uniform format (remove noise, segment text).
○ Annotate inputs and outputs to train the LLM on extraction tasks.
3. Output:
○ A clean, structured dataset ready for LLM training.
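The preprocessing and annotation steps above can be sketched as follows. The field names, the noise filter, and the sample report ID are assumptions made for illustration; the real schema would be fixed during dataset design.

```python
import json
import re

def normalize_report(raw):
    """Normalize a bug report: strip markup noise and collapse whitespace."""
    text = re.sub(r"<[^>]+>", " ", raw)       # drop HTML-like tags (noise removal)
    text = re.sub(r"\s+", " ", text).strip()  # collapse runs of whitespace
    return text

def to_training_record(report_id, raw_report, inputs, expected_tests):
    """Build one annotated example (a JSONL row) pairing a report with its labels."""
    return json.dumps({
        "id": report_id,            # hypothetical issue key, e.g. from a tracker
        "report": normalize_report(raw_report),
        "inputs": inputs,           # annotated test inputs (extraction labels)
        "tests": expected_tests,    # reference test cases for supervision
    })

row = to_training_record("LANG-123", "<p>parse( -1 )  fails</p>", ["-1"], ["testParseNegative"])
print(row)
```

Emitting one such JSON object per line yields the clean, structured dataset the plan calls for.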
Deliverables
1. For BRMINER:
○ EvoSuite for test case generation.
○ Java parsers and regex for data extraction.
2. For LLM Development:
○ Open-source LLMs (e.g., LLaMA).
○ Machine learning frameworks (PyTorch, TensorFlow).
○ Tools for fine-tuning and evaluation.
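As one way to connect the annotated dataset to the fine-tuning tooling listed above, bug reports can be framed as supervised prompt/completion pairs. The prompt template below is an assumption for illustration, not a prescribed format.

```python
# Hypothetical instruction-tuning template for the extraction task; the exact
# wording and structure would be settled during fine-tuning experiments.
PROMPT_TEMPLATE = (
    "### Instruction:\n"
    "Extract all candidate test inputs from the bug report below.\n\n"
    "### Bug report:\n{report}\n\n"
    "### Response:\n"
)

def make_example(report, extracted_inputs):
    """Pair a prompt with its target completion for supervised fine-tuning."""
    prompt = PROMPT_TEMPLATE.format(report=report)
    completion = ", ".join(extracted_inputs)  # target the model should learn to emit
    return {"prompt": prompt, "completion": completion}

ex = make_example("divide(10, 0) raises ArithmeticException", ["10", "0"])
print(ex["completion"])
```

Records in this shape can be fed directly to standard fine-tuning loops in PyTorch- or TensorFlow-based frameworks.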
Timeline
Evaluation Metrics