0% found this document useful (0 votes)
43 views

TPW Data Mining

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
43 views

TPW Data Mining

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 4

A REVIEW ON DATA MINING TECHNIQUES AND

THEIR APPLICATIONS

SAI NARENDRA VARMA SURYA VENKATA PRAVEEN MUSTI NAGA SUBA AMRUTHA PAVAN KUMAR POLA
UPPALAPATI
COMPUTER SCIENCE AND COMPUTER SCIENCE AND COMPUTER SCIENCE AND
COMPUTER SCIENCE AND ENGINEERING ENGINEERING ENGINEERING
ENGINEERING
[email protected] [email protected]
[email protected]
m
[email protected]
om

MENTOR: Mrs. JAYA PRADHA

ABSTRACT INTRODUCTION

Data mining is a process which finds useful The process of extraction of useful information
patterns from large amount of past and present and patterns from huge amount of data from
data. It is used in different fields like science, various domains. The research in databases and
Engineering, Health, Business etc... information technology has given rise to an
approach to store and analyze this precious data
The paper discusses few of the data mining
for further decision making. It is also called as
techniques and some of the organizations which
knowledge discovery process (KDP), knowledge
have adapted data mining technology to improve
mining from selected data to find relationships or
and enhance their businesses and organizations
patterns.
to found excellent results and shows the Data
mining scope in future.

Keywords

Datamining, Knowledge data discovery,


techniques, Applications.

1
LITERATURE SURVEY

METHODOLOGY

A study of Datamining techniques and their In the process of data mining, Choosing a dataset
applications from a huge repository is the primary thing.

By this process, many companies got profits on Datasets are divided into three types:
their respective domains. it Increases efficiency of
marketing campaigns and also increases the 1) Record data

cross-selling to existing customers 2) Graph-based data

[1] Soft map Company Ltd. Tokyo, 3) Sequential data

 Page views increased 67% per month after the


recommendation engine went live.
DATA MINING TECHNIQUES:
Profits tripled in 2001, as sales increased 18
percent versus the same period in the previous Based on the type of the task the Datamining

year. techniques are applied.

[2] Standard Life Mutual Financial Services Predictive tasks provide the results of future

Companies queries based on past data.

Achieved, with the model, a nine times greater Classification, Regression and Outer detection

response than that achieved by the control group. are predictive data mining techniques.

Secured $47 million worth of mortgage Association rules, Sequential patterns and

application revenue. prediction are few most commonly used data


mining techniques.
[3] Shenandoah Life insurance company United
States. Classification:

Reduced the time required to issue certain Classification is the most commonly applied data

policies by 20 %. mining technique, which invokes a set of pre-


classified examples to develop a model that can
Improved underwriting and employee classify the population of records at large.
performance review processes.
We use Decision Trees, Bayesian Classifiers,
Neural Networks, K-Nearest Neighbors, Support
Vector Machines, Linear Regression, Logistic
[4] FBTO Dutch Insurance Company
Regression, as classifiers in this technique.
Decreased mailing costs by 35 %.
Clustering:
Increased conversion rates by 40 %.

2
By using clustering techniques, we can further Alignment Search Tool), FASTA, CS-BLAST for
identify dense and sparse regions in object space finding sequence alignment, Gen-Scan, Gene-
and can discover overall distribution Mark for gene finding, P-fam, BLOCKS, Pro-
Dom for protein analysis.
pattern and correlations among data attributes.
We use different clustering methods for different
applications. Some methods are Partitioning
[2] Manufacturing-Engineering:
Method, Grid-Based Method, Density-based
Method, Model-Based Method, Hierarchical Manufacturing enterprise contains data related to
Method, Constraint-based Method. its company's products. Techniques like
Classification, Association Rule mining,
Outlier Detection:
Regression in data mining is used to predict
Outlier detection detects and excludes outliers product development time and cost, the
from the data set. Some outlier detection methods relationship between product architecture,
are Z-Score, DBSCAN, Isolation Forest, Linear customer needs, dependencies among tasks etc.
Regression Models. Fraud detection, Intrusion Data mining tools used in this field are Rapid
detection, Medical and health outlier detection, miner, Data melt, Board, Weka.
Fraud detection of Insurance claim are the
applications of outlier detection.

[3] Criminal Investigation:


APPLICATIONS OF DATA MINING:

Criminal analysis includes detecting crimes and


Data mining is applied vastly in many
criminal’s relationships with these crimes. From
organizations.
different crimes like cyber-crimes, violent crimes,
[1] Bioinformatics: fraud detection, drug offences, we get high
volumes of criminal datasets. Data mining is
Bioinformatics is the collection of various
utilized in this field for applications like counter-
methods to manage, store and study biological
terrorism activities, crime matching, crime trends,
data using computers. the data mining tools used
etc. Data mining tools used in this field are Weka,
in bioinformatics are BLAST (Basic Local
H2o, Orange etc. are field.

CONCLUSION etc., helps in finding the patterns to decide upon


the future trends in businesses to grow.
Data mining has importance regarding finding
the patterns, forecasting, discovery of knowledge Now a days almost every field is digitalized these

in different business domains. Data mining days, and because of this, a

techniques and algorithms such as classification,


large volume of data is generated every day. Data
clustering
mining plays a vital role in future-prediction.

3
plays a vital role in managing, analysing and [6]. http://www.kdnuggets.com/.Pu
extracting the
[7] Dr. M. Dhanabhakyam , Dr. M. Netravali ,
required information from these large databases. ―A Survey on Data Mining Algorithm for
Market Basket Analysis‖ in Global Journal of
REFERENCES
Computer Science and Technology Volume 11

[1]. Jiawei Han and Micheline Kamber (2006), Issue 11 Version 1.0 July 2011, Publisher: Global

Data Mining Concepts and Techniques, published Journals Inc. (USA) Online ISSN: 0975-4172 &

by Morgan Kauffman, 2nd ed. Print ISSN: 0975-4350.

[2]. Dr. Gary Parker, vol 7, 2004, Data Mining: [8] Stefano Lunardi, Jake Chen, ―Data Mining

Modules in emerging fields, CD-ROM. in Bioinformatics: Selected Papers from


BIOKDD‖ in IEEE/ACM Transactions on
[3]. Crisp-DM 1.0 Step by step Data Mining guide Computational Biology and Bioinformatics, Vol.
from http://www.crisp-dm.org/CRISPWP- 7, no. 2, April-June 2010
0800.pdf.
[9] V.K. Jha, R.K. Singh ―Application of Data
[4]. Customer Successes in your industry from Mining in Manufacturing Industry‖ in
http://www.spss.com/success/? International Journal of Information Sciences
source=homepage&hpzone=nav_bar. and Application. ISSN 0974- 2255 Volume 3,

[5]. https://www.allbusiness.com/Technology Number 2 (2011), pp. 59-64.

/computer-software-data-management/ 633425- [10] Brijendra Singh, Hemant Kumar Singh,


1.html, last retrieved on 15th Aug 2010. ―Web Data Mining Research: A Survey‖ in

IEEE International INTERNATIONAL


JOURNAL OF SCIENTIFIC &
TECHNOLOGY RESEARCH VOLUME 9,
ISSUE 02, FEBRUARY 2020 ISSN 2277-8616
3388 IJSTR©2020 www.ijstr.org Conference on
Computational Intelligence and Computing
Research, 2010.

You might also like