Data Dictionary
Data Dictionary
Data Dictionary
Subject Oriented
Data warehouses are designed to help you analyze data. For example, to learn more
about your company's sales data, you can build a warehouse that concentrates on
sales. Using this warehouse, you can answer questions like "Who was our best
customer for this item last year?" This ability to define a data warehouse by
subject matter, sales in this case makes the data warehouse subject oriented.
Integrated
Integration is closely related to subject orientation. Data warehouses must put
data from disparate sources into a consistent format. They must resolve such
problems as naming conflicts and inconsistencies among units of measure. When they
achieve this, they are said to be integrated.
Nonvolatile
Nonvolatile means that, once entered into the warehouse, data should not change.
This is logical because the purpose of a warehouse is to enable you to analyze what
has occurred.
Time Variant
In order to discover trends in business, analysts need large amounts of data. This
is very much in contrast to online transaction processing (OLTP) systems, where
performance requirements demand that historical data be moved to an archive. A data
warehouse's focus on change over time is what is meant by the term time variant.
2. How many stages in Datawarehousing?
Data warehouse generally includes two stages
ETL
Report Generation
ETL
Short for extract, transform, load, three database functions that are combined into
one tool
• Extract -- the process of reading data from a source database.
• Transform -- the process of converting the extracted data from its previous
form into required form
• Load -- the process of writing the data into the target database.
ETL is used to migrate data from one database to another, to form data marts
anddata warehouses and also to convert databases from one format to another format.
It is used to retrieve the data from various operational databases and is
transformed into useful information and finally loaded into Datawarehousing system.
1 INFORMATICA
2 ABINITO
3 DATASTAGE
4. BODI
5 ORACLE WAREHOUSE BUILDERS
Report generation
In report generation, OLAP is used (i.e.) online analytical processing.
It is a set of specification which allows the client applications in retrieving the
data for analytical processing.
It is a specialized tool that sits between a database and user in order to provide
various analyses of the data stored in the database.
OLAP Tool is a reporting tool which generates the reports that are useful for
Decision support for top level management.
1. Business Objects
2. Cognos
3. Micro strategy
4. Hyperion
5. Oracle Express
6. Microsoft Analysis Services
In Snowflake schema, the example diagram shown below has 4 dimension tables, 4
lookup tables and 1 fact table. The reason is that hierarchies (category, branch,
state, and month) are being broken out of the dimension tables (PRODUCT,
ORGANIZATION, LOCATION, and TIME) respectively and separately.
It increases the number of joins and poor performance in retrieval of data.
In few organizations, they try to normalize the dimension tables to save space.
Since dimension tables hold less space snow flake schema approach may be
avoided.
Bit map indexes cannot be effectively utilized
Subject Oriented
Data warehouses are designed to help you analyze data. For example, to learn more
about your company's sales data, you can build a warehouse that concentrates on
sales. Using this warehouse, you can answer questions like "Who was our best
customer for this item last year?" This ability to define a data warehouse by
subject matter, sales in this case makes the data warehouse subject oriented.
Integrated
Integration is closely related to subject orientation. Data warehouses must put
data from disparate sources into a consistent format. They must resolve such
problems as naming conflicts and inconsistencies among units of measure. When they
achieve this, they are said to be integrated.
Nonvolatile
Nonvolatile means that, once entered into the warehouse, data should not change.
This is logical because the purpose of a warehouse is to enable you to analyze what
has occurred.
Time Variant
In order to discover trends in business, analysts need large amounts of data. This
is very much in contrast to online transaction processing (OLTP) systems, where
performance requirements demand that historical data be moved to an archive. A data
warehouse's focus on change over time is what is meant by the term time variant.
2. How many stages in Datawarehousing?
Data warehouse generally includes two stages
ETL
Report Generation
ETL
Short for extract, transform, load, three database functions that are combined into
one tool
• Extract -- the process of reading data from a source database.
• Transform -- the process of converting the extracted data from its previous
form into required form
• Load -- the process of writing the data into the target database.
ETL is used to migrate data from one database to another, to form data marts
anddata warehouses and also to convert databases from one format to another format.
It is used to retrieve the data from various operational databases and is
transformed into useful information and finally loaded into Datawarehousing system.
1 INFORMATICA
2 ABINITO
3 DATASTAGE
4. BODI
5 ORACLE WAREHOUSE BUILDERS
Report generation
In report generation, OLAP is used (i.e.) online analytical processing.
It is a set of specification which allows the client applications in retrieving the
data for analytical processing.
It is a specialized tool that sits between a database and user in order to provide
various analyses of the data stored in the database.
OLAP Tool is a reporting tool which generates the reports that are useful for
Decision support for top level management.
1. Business Objects
2. Cognos
3. Micro strategy
4. Hyperion
5. Oracle Express
6. Microsoft Analysis Services
In Snowflake schema, the example diagram shown below has 4 dimension tables, 4
lookup tables and 1 fact table. The reason is that hierarchies (category, branch,
state, and month) are being broken out of the dimension tables (PRODUCT,
ORGANIZATION, LOCATION, and TIME) respectively and separately.
It increases the number of joins and poor performance in retrieval of data.
In few organizations, they try to normalize the dimension tables to save space.
Since dimension tables hold less space snow flake schema approach may be
avoided.
Bit map indexes cannot be effectively utilized
ETL is used to migrate data from one database to another, to form data marts
anddata warehouses and also to convert databases from one format to another format.
It is used to retrieve the data from various operational databases and is
transformed into useful information and finally loaded into Datawarehousing system.
1 INFORMATICA
2 ABINITO
3 DATASTAGE
4. BODI
5 ORACLE WAREHOUSE BUILDERS
Report generation
In report generation, OLAP is used (i.e.) online analytical processing.
It is a set of specification which allows the client applications in retrieving the
data for analytical processing.
It is a specialized tool that sits between a database and user in order to provide
various analyses of the data stored in the database.
OLAP Tool is a reporting tool which generates the reports that are useful for
Decision support for top level management.
1. Business Objects
2. Cognos
3. Micro strategy
4. Hyperion
5. Oracle Express
6. Microsoft Analysis Services
In Snowflake schema, the example diagram shown below has 4 dimension tables, 4
lookup tables and 1 fact table. The reason is that hierarchies (category, branch,
state, and month) are being broken out of the dimension tables (PRODUCT,
ORGANIZATION, LOCATION, and TIME) respectively and separately.
It increases the number of joins and poor performance in retrieval of data.
In few organizations, they try to normalize the dimension tables to save space.
Since dimension tables hold less space snow flake schema approach may be
avoided.
Bit map indexes cannot be effectively utilized