Group 4
Group 4
&
Data Warehousing
Presented
By:
Group 4
Kirk Bishop Joe Draskovich Amber Hottenroth
Brandon Lee Stephen Pesavento
Introduction
What is Data Mining?
Data Mining is the process of
collecting large amounts of raw data
and transforming that data into useful
information.
Data Warehousing?
Banking
Detect Fraudulent Activity
Insurance
Risk Assessment
Medicine/Healthcare
Enhance Research
Retail
Track consumer buying trends
Data Mining Software Applications
Advantages
• Access to information
• Data Inconsistency
• Decrease Computing Cost
• Productivity Increase
• Increase company profits
Data Warehousing
Disadvantages
• Data must be cleaned, loaded, and
extracted
• 80% of the overall process
• User Variability
• Proper Training
• Difficult to Maintain
• Incongruence among systems
Data Mining
Advantages
• Improves Customer Satisfaction/service
• Increases profitability
Data Mining
Disadvantages
• Require skilled technical users to
interpret and analyze data from
warehouse
• Validity of the patterns
• Related to real world circumstances
• Unable to Identify Casual Relationships
• Reserved for the few instead of the many
Current Issues
Data Quality
• Duplicated records
• Lack of Data Standards
• Human Error
Inoperability
• Lack of communications among existing
systems
Mission Creep
Trends & Current Issues
•4 Major Trends
•Data – growing amount collected to be sifted
•Hardware – growing performance & storage
•Scientific Computing – theory, experiment,
simulation
•Business – Meet higher standard in order to
foresee risks, opportunities, and benefits
for the company
•The government is closely reviewing the uses of data mining, due to the possibilities
both good and bad
•Counterterrorism data mining has been done, but in some instances has been
deemed a violation of privacy
Future Research Possibilities
• Questions
• Questions
• Questions
• Questions
• Questions
• Questions