Business Intelligence Concepts: Rajashmita Kar
RAJASHMITA KAR
AGENDA
What is BI?
BI Lifecycle
Challenges of Building BI
Data Warehouse Concepts
Key Performance Indicator
Normalization and its types
Data Cleansing
Slowly Changing Dimension
Dec16,2013
April 13, 2011
What is BI?
Business Intelligence (BI) refers to the tools and systems that play a key
role in a corporation's strategic planning process.
BI LIFECYCLE
[Figure: BI lifecycle architecture, from data sources to users]
Data Sources: Mkt, HR, Fin, Acctg, Web Data, Clickstream
ETL Software: Extract, Clean/Scrub, Transform, Load
Intermediate layers: Staging Area, Operational Data Store
Data Stores: Data Warehouse, Data Marts (Finance, Marketing, Sales), Meta Data
Data Analysis Tools and Applications: SQL, Queries, Reporting, DSS/EIS, Data Mining, Web Browser
Users: Analysts, Managers, Executives, Operational Personnel, Customers/Suppliers
Vendors and products named in the diagram: IBM, IMS, VSAM, Oracle, Sybase, SAP, Informix, Ascential, Informatica, SAS, Sagent, First Logic, Harte-Hanks, Teradata, Essbase, Cognos, MicroStrategy, Microsoft, Siebel, Business Objects
Challenges of Building BI
Solution
Data Warehouse
Cubes
Measures
Key Performance Indicator
Dimensions
---Attributes
---Hierarchy
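The terms above can be made concrete with a small sketch. It assumes a hypothetical sales dataset (the rows, the column names, and the revenue target of 1000 are illustrative and do not come from the slides). It sums one measure along a region-to-city hierarchy and derives a simple KPI from the result.

```python
from collections import defaultdict

# Hypothetical fact rows: dimension attributes (region, city, product)
# and one measure (revenue). All values are made up for illustration.
SALES = [
    {"region": "East", "city": "Boston", "product": "Laptop", "revenue": 700},
    {"region": "East", "city": "Boston", "product": "Phone", "revenue": 300},
    {"region": "East", "city": "New York", "product": "Laptop", "revenue": 500},
    {"region": "West", "city": "Seattle", "product": "Phone", "revenue": 400},
]


def roll_up(rows, levels, measure):
    """Sum a measure for every combination of the given dimension attributes."""
    totals = defaultdict(int)
    for row in rows:
        key = tuple(row[level] for level in levels)
        totals[key] += row[measure]
    return dict(totals)


def kpi_attainment(actual, target):
    """KPI sketch: fraction of an (assumed) target that was achieved."""
    return actual / target


# Top of the hierarchy: revenue per region.
by_region = roll_up(SALES, ["region"], "revenue")
# One level down the hierarchy: revenue per region and city.
by_city = roll_up(SALES, ["region", "city"], "revenue")
# KPI per region against the assumed target of 1000.
attainment = {key: kpi_attainment(value, 1000) for key, value in by_region.items()}
```

A real cube engine precomputes such aggregates for many dimension combinations; the dictionary here only stands in for that idea.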
DATA MODELING
Data modeling is the process that produces abstract data models for one
or more database components of the data warehouse.
Types of Data Modeling
Conceptual Data Model
Logical Data Model
Physical Data Model
DIMENSIONAL MODELING
Fact Table
Dimension Table
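A minimal sketch of how a fact table and a dimension table fit together, using Python's built-in sqlite3 module. The table names (`fact_sales`, `dim_product`), their columns, and the rows are assumptions made for illustration only.

```python
import sqlite3

# In-memory database holding one dimension table and one fact table.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Dimension table: descriptive attributes, one row per product.
    CREATE TABLE dim_product (
        product_key  INTEGER PRIMARY KEY,
        product_name TEXT,
        category     TEXT
    );
    -- Fact table: numeric measures plus a foreign key to the dimension.
    CREATE TABLE fact_sales (
        product_key INTEGER REFERENCES dim_product(product_key),
        sale_date   TEXT,
        quantity    INTEGER,
        amount      REAL
    );
    INSERT INTO dim_product VALUES (1, 'Laptop', 'Computers'), (2, 'Phone', 'Mobile');
    INSERT INTO fact_sales VALUES
        (1, '2013-12-01', 2, 2000.0),
        (1, '2013-12-02', 1, 1000.0),
        (2, '2013-12-01', 3, 1500.0);
""")


def sales_by_category(connection):
    """Join facts to the dimension and total the amount measure per category."""
    query = """
        SELECT d.category, SUM(f.amount)
        FROM fact_sales AS f
        JOIN dim_product AS d ON d.product_key = f.product_key
        GROUP BY d.category
        ORDER BY d.category
    """
    return connection.execute(query).fetchall()


totals = sales_by_category(conn)
```

The fact table stays narrow (keys and measures), while the descriptive text used for grouping and filtering lives in the dimension table.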
NORMALIZATION
Database normalization is the process of removing
redundant data from your tables in order to improve storage
efficiency, data integrity, and scalability.
Normal Forms:
3NF
If I know # of Pages, can I find out the Author's Name? No. Can I find out
the Author's Pseudonym? No.
If I know the Author's Name, can I find out # of Pages? No. Can I find
out the Author's Pseudonym? Yes.
Therefore, Author's Pseudonym is functionally dependent on Author's Name,
a non-key attribute, rather than on the primary key (PK). This transitive
dependency violates 3NF, so the pseudonym has to move to a separate table
keyed by Author's Name.
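The questions above amount to testing functional dependencies. The sketch below runs that test on a hypothetical book table (the rows, the `book_id` key, and the helper names `determines` and `split_transitive` are all assumptions for illustration) and then moves the pseudonym into its own lookup. Note that sample data can only fail to contradict a dependency; it cannot prove one.

```python
# Hypothetical rows; book_id plays the role of the primary key.
BOOKS = [
    {"book_id": 1, "pages": 300, "author_name": "A. Writer", "author_pseudonym": "Pen One"},
    {"book_id": 2, "pages": 300, "author_name": "B. Scribe", "author_pseudonym": "Pen Two"},
    {"book_id": 3, "pages": 150, "author_name": "A. Writer", "author_pseudonym": "Pen One"},
]


def determines(rows, lhs, rhs):
    """Return True if each value of `lhs` maps to exactly one value of `rhs`."""
    seen = {}
    for row in rows:
        # setdefault stores the first rhs value seen for this lhs value.
        if seen.setdefault(row[lhs], row[rhs]) != row[rhs]:
            return False
    return True


def split_transitive(rows, determinant, dependent):
    """Drop `dependent` from the main table and key it by `determinant` instead."""
    main = [{k: v for k, v in row.items() if k != dependent} for row in rows]
    lookup = {row[determinant]: row[dependent] for row in rows}
    return main, lookup


pages_to_author = determines(BOOKS, "pages", "author_name")
author_to_pages = determines(BOOKS, "author_name", "pages")
author_to_pseudonym = determines(BOOKS, "author_name", "author_pseudonym")

# 3NF-style decomposition: the pseudonym is stored once per author.
books, authors = split_transitive(BOOKS, "author_name", "author_pseudonym")
```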
BCNF
Client Interview
clientNo
(Candidate key)
roomNo
- As a consequence, the ClientInterview relation may suffer from update anomalies.
- For example, two tuples have to be updated if the roomNo needs to be changed for
staffNo SG5 on 13-May-02.
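The update anomaly can be sketched as follows. The rows, the `interviewDate` and `interviewTime` columns, and the dependency (staffNo, interviewDate) -> roomNo are assumptions chosen to match the example in the text; they are not taken from the slide itself.

```python
# Illustrative rows: (clientNo, interviewDate, interviewTime, staffNo, roomNo).
CLIENT_INTERVIEW = [
    ("CR76", "13-May-02", "10.30", "SG5", "G101"),
    ("CR56", "13-May-02", "12.00", "SG5", "G101"),
    ("CR74", "13-May-02", "12.00", "SG37", "G102"),
    ("CR56", "1-Jul-02", "10.30", "SG5", "G102"),
]


def rows_to_update(rows, staff_no, interview_date):
    """Count the tuples that repeat the room for one staff member on one date."""
    return sum(1 for r in rows if r[3] == staff_no and r[1] == interview_date)


def decompose(rows):
    """Split on the assumed dependency (staffNo, interviewDate) -> roomNo."""
    # Interview keeps who meets whom and when, without the room.
    interview = [(client, day, time, staff) for client, day, time, staff, _ in rows]
    # StaffRoom stores each (staffNo, interviewDate) room exactly once.
    staff_room = {(staff, day): room for _, day, _, staff, room in rows}
    return interview, staff_room


# Before decomposition: changing the room touches every matching tuple.
anomaly_count = rows_to_update(CLIENT_INTERVIEW, "SG5", "13-May-02")

# After decomposition: the same change is a single assignment.
interview, staff_room = decompose(CLIENT_INTERVIEW)
staff_room[("SG5", "13-May-02")] = "G103"
```

After the split, the room for a staff member on a given date lives in one place, which is what removes the anomaly.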
DATA CLEANSING
WHEN
WHY
WHAT
HOW
SCD (IMPLEMENTATION)
SCD is implemented on dimension tables only, not on fact tables.
DIMENSIONS
CATEGORY (SCD I)
BRAND (SCD I)
MODEL (SCD II by date)
STORE (SCD II by flag)
VENDOR (SCD I)
FACTS
INVENTORY_TRANSACTION_FILE
GOODS_RECEIVING
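The three variants named above can be sketched in a few lines. This is a simplified illustration, not the deck's implementation: the function names, column names (`start_date`, `end_date`, `is_current`), and sample rows are assumptions, and a production design would normally add surrogate keys.

```python
from datetime import date


def apply_scd1(dimension, key, changes):
    """Type I: overwrite the attribute in place; no history is kept."""
    dimension[key].update(changes)


def apply_scd2_by_date(versions, key, changes, effective):
    """Type II by date: close the open version and append a new open one."""
    current = next(v for v in versions if v["key"] == key and v["end_date"] is None)
    current["end_date"] = effective
    versions.append({**current, **changes, "start_date": effective, "end_date": None})


def apply_scd2_by_flag(versions, key, changes):
    """Type II by flag: unset the current flag and append a new current row."""
    current = next(v for v in versions if v["key"] == key and v["is_current"])
    current["is_current"] = False
    versions.append({**current, **changes, "is_current": True})


# BRAND (SCD I): the old name is simply replaced.
brand = {10: {"name": "Acme"}}
apply_scd1(brand, 10, {"name": "Acme Corp"})

# MODEL (SCD II by date): history is kept through validity dates.
model = [{"key": 1, "name": "M100", "start_date": date(2013, 1, 1), "end_date": None}]
apply_scd2_by_date(model, 1, {"name": "M100 Plus"}, date(2013, 12, 16))

# STORE (SCD II by flag): history is kept through a current-row flag.
store = [{"key": 7, "city": "Pune", "is_current": True}]
apply_scd2_by_flag(store, 7, {"city": "Mumbai"})
```

Type I suits attributes where only the latest value matters, while the two Type II variants keep every earlier version so that facts can still be reported against the values that were valid at the time.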
THANK YOU