Mining Software Engineering Data
Harshad Panwar
Govt. Engg. College, Ajmer
Rahul
Govt. Engg. College, Ajmer
Abstract
This article presents recent research in mining software engineering (SE) data, discusses the challenges associated with mining SE data, highlights SE data mining success stories, and outlines future research directions.
Introduction
Software engineering data (such as code bases, execution traces, historical code changes, mailing lists, and bug databases) contains a wealth of information about a software project's status, progress, and evolution. A recent issue of IEEE Transactions on Software Engineering (TSE) on the Mining Software Repositories (MSR) topic received over 15% of all submissions to TSE in 2005. This article is organized around the following questions.
1. Software Engineering:
(a) What types of SE data are available to be mined? (c) How are data mining techniques used in SE?
2. Data Mining:
(a) What are the challenges in applying data mining techniques to SE data? (b) Which data mining techniques are most suitable for specific types of SE data? (c) Which data mining and analysis tools are freely available (e.g., R [1] and WEKA [2])?
3. Future Directions:
What are the challenges and opportunities for the data mining and software engineering communities?
Detailed Overview
Based on this categorization, we intend to investigate the following four issues.
First, we intend to identify the inherent challenges of mining software engineering data. Second, we intend to understand the current research and development frontier of data mining practice in software engineering. Third, we intend to analyze successful cases of mining SE data; we shall briefly review and demonstrate several research prototypes of data mining systems for software engineering. Finally, we intend to give an overview of commonly used data mining tools, which should help readers gain a better understanding of the tools available.
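As a concrete illustration of what mining SE data can look like, the following minimal sketch counts which files tend to change together in a project's history, a simple form of frequent-pattern mining over historical code changes. The commit list and the function name `co_change_pairs` are hypothetical and invented for this example; a real study would extract the change sets from a version control system and would typically use a dedicated tool such as R [1] or WEKA [2].

```python
from collections import Counter
from itertools import combinations

# Hypothetical commit history: each entry is the set of files changed together.
commits = [
    {"parser.c", "parser.h", "lexer.c"},
    {"parser.c", "parser.h"},
    {"lexer.c", "README"},
    {"parser.c", "parser.h", "README"},
]


def co_change_pairs(commits, min_support=2):
    """Count how often each pair of files changes in the same commit and
    keep only the pairs seen in at least `min_support` commits."""
    counts = Counter()
    for files in commits:
        # Sort so that each pair is always recorded in the same order.
        for pair in combinations(sorted(files), 2):
            counts[pair] += 1
    return {pair: n for pair, n in counts.items() if n >= min_support}


frequent = co_change_pairs(commits)
print(frequent)
```

Pairs with high support suggest files that are logically coupled, which is the kind of information a developer could use when deciding what else to inspect after changing one of them. The threshold `min_support` is an assumption of this sketch and would need tuning on real data.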
References
[1] The R Project for Statistical Computing. Available online at http://www.r-project.org/.
[2] Weka 3: Data Mining Software in Java. Available online at http://www.cs.waikato.ac.nz/ml/.