Module 1-1basic Concepts
Module 1-1basic Concepts
21AD402 3/0/0/3
MINING
Jiawei Han, MichelineKamber and Jian Pei, “Data Mining Concepts and
Techniques”, Third Edition, Elsevier, 2012.
“DATA WAREHOUSING
CONCEPTS”
Data
DATA
What is a Data???
● A collection of facts in a raw or unorganized forms like alphabets, numbers,
symbols etc…
13
So, Datawarehouse (vs)
ing (vs) Database
● Database is dump of data .
● Data warehousing is a ● A data warehouse is a federated
methodology to extract the repository for all the data that an
significant data that helps in enterprise's various business
1. Amazon Redshift
2. Teradata
3. Oracle 12c
4. Informatica
5. IBM Infosphere
Data Warehouse
Applications
● Retail Industry
✔ Forecasting, Market research, Merchandising etc.
● Manufacturing and distribution
✔ Sales history/trends, Market demand projects etc.
● Banks
✔ Spot market trends, Marketing, Credit cards etc.
● Insurance Companies
✔ Property and casualty fraud etc.
● Health Care Providers
✔ Fraud detection, Patient matching etc.
DW Applications [cont…]
● Government Agencies
✔ Auditing tax records, information sharing across
different agencies etc.
● Internet Companies
✔ Analyzing shopping behavior, CRM etc.
● Telecommunications
✔ Telemarketing, Product development etc.
● Sports
✔ Analyzing strategies, Winning player combinations etc.
Datawarehouse Sizes
● Terabyte (10^12) - Walmart (24 TB)