Data Mining
Data Mining
net/publication/300014089
CITATIONS READS
7 4,493
3 authors, including:
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Adebowale E. Shadare on 08 April 2016.
Matthew N. O. Sadiku,
Adebowale E. Shadare
Sarhan M. Musa
Roy G. Perry College of Engineering, Prairie View A&M University
Prairie View, USA
Abstract
Data mining may be regarded as the process of discovering insightful
and predictive models from massive data. It is the art of extracting useful
information from large amounts of data. It combines traditional data analysis
with sophisticated algorithms for processing large amount of data. It is an
interdisciplinary field merging concepts from database systems, statistics,
machine learning, computing, information theory, and pattern recognition.
It has the real potential of becoming part of electrical engineering education.
The main objective of this paper is to provide a brief introduction to data
mining.
Keywords:
Introduction
Technology now enables us to capture and store massive amount of
data. It has been estimated that more than 15 exabytes of new data is
generated each year. With the popular use of the World Wide Web and its
associated information services, such as Google, Yahoo, Excite, InfoSeek,
and American Online, our capacity to generate and collect data has increased
tremendously. This explosive growth has caused the need for techniques that
will assist us in transforming the data into useful information and
knowledge.1
Data are numbers, text or facts that can be processed by a computer.
They may take the form of transactional data such as sales, prices, payroll,
and accounting. For example, you generate data when you make a purchase
using your credit card or when you surf the Web. With the trends in
centralization of an organization’s data in large databases, the process of
extracting useful information is now known as data mining.
The concept of data mining has attracted a lot of attention in recent
years. It was started in the early 1990s by Tom Khabaza. Data mining, also
509
European Scientific Journal July 2015 edition vol.11, No.21 ISSN: 1857 – 7881 (Print) e - ISSN 1857- 7431
510
European Scientific Journal July 2015 edition vol.11, No.21 ISSN: 1857 – 7881 (Print) e - ISSN 1857- 7431
511
European Scientific Journal July 2015 edition vol.11, No.21 ISSN: 1857 – 7881 (Print) e - ISSN 1857- 7431
Conclusion
Data mining is the task of discovering useful patterns from a large
amount of data. The field of data mining is relatively new; it is recognized
as a rapidly emerging research area. Although it would not hurt to have
some exposure to statistical analysis, one does not need to be an expert in
statistics or a computer programmer to be a data miner.
Data has spread its wings in almost all areas.8-10 It has the real
potential of becoming part of electrical engineering education. The Society
of Data Miners (www.socdm.org) founded in 2013 is helpful for advancing
knowledge and career.
References:
I.H. Witten and E. Frank, Data Mining (Moran Kaufmann Publishers,
Amsterdam, 2005, 2nd ed.).
J. Ledolter, Data Mining and Business Analytics with R (John Wiley &
Sons, Hoboken, NJ, 2013).
P. N. Tan, M. Steinbach, and V. Kumar, Introduction to Data Mining
(Addison-Wesley, Boston, 2006).
M. North, Data Mining for the Masses (Global Text, Lexington, KY, 2012).
M. S. Brown, Data Mining for Dummies (John Wiley & Sons, Hoboken,
NJ, 2014).
C. McCue, Data Mining and Predictive Analysis (Butterworth-Heineman,
Burlington, MA, 2007).
J. Han and M. Kamber, Data Mining: Concepts and Techniques (Morgan
Kaufmann, San Francisco, CA, 3rd ed., 2011).
V.K. Deepa and J. R. Geetha, “Rapid development of applications in data
mining,” Proceedings of 2013 International Conference on Green High
Performance Computing, Mar. 2013.
D. Braha and A. Shmilovici, “Data mining for improving a cleaning process
in the semiconductor industry,” IEEE Transactions on Semiconductor
Manufacturing, vol. 15, no. 1, Feb, 2002, pp. 91-101.
M. S. Chen, J. Han, and P. S. Yu, “Data mining: an overview from a
database perspective,”
IEEE Transactions on Knowledge and Data Engineering, vol. 8, no. 6, Dec.
1996, pp. 866-883.
512
European Scientific Journal July 2015 edition vol.11, No.21 ISSN: 1857 – 7881 (Print) e - ISSN 1857- 7431
Figure 1 Data mining as an interdisciplinary field (Adapted from Han and Kamber, 2006).
513