Unit 5 Introduction To Data Mining: Prashasti Kanikar 9/26/2020
Unit 5 Introduction To Data Mining: Prashasti Kanikar 9/26/2020
Object-relational databases
Multimedia database
Text databases
Outlier analysis
Outlier: A data object that does not comply with the general behavior of the data
Noise or exception? ― One person’s garbage could be another person’s treasure
Methods: by product of clustering or regression analysis, …
Useful in fraud detection, rare events analysis
Customer Profiling
• data mining can tell you what types of customers
buy what products
Identifying Customer Requirements
• identify the best products for different customers
• use prediction to find what factors will attract new
customers
13 PRASHASTI KANIKAR 9/26/2020
Data Mining Application:
Fraud Detection
• Association Rule Mining can detect a group of people who
stage accidents to collect on insurance
Biological and medical data analysis: classification, cluster analysis (microarray data
analysis), biological sequence analysis, biological network analysis
Data mining and software engineering (e.g., IEEE Computer, Aug. 2009 issue)
From major dedicated data mining systems/tools (e.g., SAS, MS SQL-Server Analysis
Manager, Oracle Data Mining Tools) to invisible data mining
Mining Methodology
Mining various and new kinds of knowledge
Mining knowledge in multi-dimensional space
Data mining: An interdisciplinary effort
Boosting the power of discovery in a networked environment
Handling noise, uncertainty, and incompleteness of data
Pattern evaluation and pattern- or constraint-guided mining
User Interaction
Interactive mining
Incorporation of background knowledge
Presentation and visualization of data mining results