E 17638
E 17638
E 17638
March 2013
Oracle OLAP User's Guide, 12c Release 1 (12.1) E17638-05 Copyright 2003, 2013, Oracle and/or its affiliates. All rights reserved. Primary Author: David McDermid
Contributor: The Oracle Database 12c documentation is dedicated to Mark Townsend, who was an inspiration to all who worked on this release. Contributors: David Bardwell, Donna Carver, Ken Chen, Sandeep Desai, Dave DeDonato, Bud Endress, Scott Feinstein, David Greenfield, Marty Gubar, AA Hopeman, Christopher Kearney, Pam Montalto, Anne Murphy, Zhiqi Qiu, Moh Rangwala, Ray Roccaforte, Marty Roth, Andrew Wesley This software and related documentation are provided under a license agreement containing restrictions on use and disclosure and are protected by intellectual property laws. Except as expressly permitted in your license agreement or allowed by law, you may not use, copy, reproduce, translate, broadcast, modify, license, transmit, distribute, exhibit, perform, publish, or display any part, in any form, or by any means. Reverse engineering, disassembly, or decompilation of this software, unless required by law for interoperability, is prohibited. The information contained herein is subject to change without notice and is not warranted to be error-free. If you find any errors, please report them to us in writing. If this is software or related documentation that is delivered to the U.S. Government or anyone licensing it on behalf of the U.S. Government, the following notice is applicable: U.S. GOVERNMENT END USERS: Oracle programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation, delivered to U.S. Government end users are "commercial computer software" pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental regulations. As such, use, duplication, disclosure, modification, and adaptation of the programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation, shall be subject to license terms and license restrictions applicable to the programs. No other rights are granted to the U.S. Government. This software or hardware is developed for general use in a variety of information management applications. It is not developed or intended for use in any inherently dangerous applications, including applications that may create a risk of personal injury. If you use this software or hardware in dangerous applications, then you shall be responsible to take all appropriate fail-safe, backup, redundancy, and other measures to ensure its safe use. Oracle Corporation and its affiliates disclaim any liability for any damages caused by use of this software or hardware in dangerous applications. Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners. Intel and Intel Xeon are trademarks or registered trademarks of Intel Corporation. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. AMD, Opteron, the AMD logo, and the AMD Opteron logo are trademarks or registered trademarks of Advanced Micro Devices. UNIX is a registered trademark of The Open Group. This software or hardware and documentation may provide access to or information on content, products, and services from third parties. Oracle Corporation and its affiliates are not responsible for and expressly disclaim all warranties of any kind with respect to third-party content, products, and services. Oracle Corporation and its affiliates will not be responsible for any loss, costs, or damages incurred due to your access to or use of third-party content, products, or services.
Contents
Preface ................................................................................................................................................................. xi
Audience....................................................................................................................................................... xi Documentation Accessibility ..................................................................................................................... xi Related Documents ..................................................................................................................................... xi Conventions ................................................................................................................................................ xii
Changes in This Release for Oracle OLAP User's Guide .................................................. xiii
Changes in Oracle Database 12c Release 1 (12.1).................................................................................. xiii
Overview
OLAP Technology in the Oracle Database.......................................................................................... Full Integration of Multidimensional Technology ........................................................................ Ease of Application Development ................................................................................................... Ease of Administration...................................................................................................................... Security ................................................................................................................................................ Unmatched Performance and Scalability ....................................................................................... Reduced Costs .................................................................................................................................... Developing Reports and Dashboards Using SQL Tools and Application Builders ................... Overview of the Dimensional Data Model ......................................................................................... Cubes.................................................................................................................................................... Measures.............................................................................................................................................. Dimensions.......................................................................................................................................... Hierarchies and Levels ...................................................................................................................... Level-Based Hierarchies ............................................................................................................ Value-Based Hierarchies............................................................................................................ Attributes............................................................................................................................................. 1-1 1-1 1-2 1-2 1-2 1-2 1-3 1-3 1-5 1-6 1-6 1-7 1-7 1-8 1-8 1-8
Opening a Database Connection...................................................................................................... Showing the Analytic Workspace Attachment Modes................................................................. Installing Plug-ins .............................................................................................................................. Upgrading Metadata From Oracle OLAP 10g .....................................................................................
iv
Defining Measure Folders ................................................................................................................... Saving and Re-Creating Dimensional Objects with Object Definitions ................................... Creating Dimensional Objects From XML Templates ............................................................... Saving Object Definitions to XML Templates............................................................................. Creating Analytic Workspaces from EIF Files ............................................................................ Saving Analytic Workspaces to EIF Files .................................................................................... Copying and Pasting Dimensional Objects .....................................................................................
Modifying a Template ....................................................................................................................... 5-5 Choosing a Range of Time Periods.................................................................................................. 5-6 Using Calculation Templates ................................................................................................................. 5-6 Arithmetic Calculations .................................................................................................................... 5-6 Index..................................................................................................................................................... 5-7 Prior and Future Periods................................................................................................................... 5-8 Period to Date ..................................................................................................................................... 5-8 Share..................................................................................................................................................... 5-9 Rank .................................................................................................................................................. 5-10 Parallel Period.................................................................................................................................. 5-11 Moving Calculations....................................................................................................................... 5-11 Cumulative Calculations................................................................................................................ 5-12 Nested Calculations ........................................................................................................................ 5-13 Creating User-Defined Expressions .................................................................................................. 5-14 Using the OLAP Expression Syntax ............................................................................................. 5-14 Expression Syntax Example Using an Arithmetic Operator..................................................... 5-14 Free-Form Calculation Example Using an Analytic Function.................................................. 5-15 Expression Syntax Analytic Functions......................................................................................... 5-15 Creating Calculated Measures Using the OLAP DML .................................................................. 5-17 Selecting an OLAP DML Calculation Type................................................................................. 5-17 OLAP DML Expression Examples................................................................................................ 5-18 OLAP DML Function Example ..................................................................................................... 5-19
vi
Storage Management ............................................................................................................................... 7-2 Creating an Undo Tablespace .......................................................................................................... 7-3 Creating Permanent Tablespaces for OLAP Use........................................................................... 7-3 Creating Temporary Tablespaces for OLAP Use .......................................................................... 7-3 Spreading Data Across Storage Resources ..................................................................................... 7-3 Dictionary Views and System Tables .................................................................................................. 7-4 Static Data Dictionary Views............................................................................................................ 7-4 System Tables ..................................................................................................................................... 7-4 Analytic Workspace Tables .............................................................................................................. 7-5 Maintenance Logs .............................................................................................................................. 7-5 Partitioned Cubes and Parallelism ....................................................................................................... 7-6 Querying Metadata for Cube Partitioning ..................................................................................... 7-6 Creating and Dropping Partitions ................................................................................................... 7-6 Parallelism........................................................................................................................................... 7-7 Analyzing Cubes and Dimensions .................................................................................................... 7-10 Monitoring Analytic Workspaces ...................................................................................................... 7-10 Dynamic Performance Views ........................................................................................................ 7-11 Basic Queries for Monitoring the OLAP Option ........................................................................ 7-11 Is the OLAP Option Installed in the Database?................................................................... 7-12 What Analytic Workspaces Are in the Database?............................................................... 7-12 How Big Is the Analytic Workspace?.................................................................................... 7-12 When Were the Analytic Workspaces Created?.................................................................. 7-13 OLAP DBA Scripts .......................................................................................................................... 7-13 Scripts for Monitoring Performance............................................................................................. 7-14 Monitoring Disk Space ................................................................................................................... 7-14 About Backing Up and Recovering Analytic Workspaces ............................................................ 7-15 About Copying Analytic Workspaces ............................................................................................... 7-15 About Saving Dimensional Object Definitions.............................................................................. 7-15 About XML Templates ................................................................................................................... 7-16 About EIF Files ................................................................................................................................ 7-16 Cube Materialized Views .................................................................................................................... 7-16 Acquiring Information From the Data Dictionary ..................................................................... 7-17 Identifying Cube Materialized Views................................................................................... 7-17 Identifying the Refresh Logs .................................................................................................. 7-17 Initiating a Data Refresh................................................................................................................. 7-17 Using DBMS_CUBE................................................................................................................. 7-18 Using DBMS_MVIEW ............................................................................................................. 7-18 Refresh Methods.............................................................................................................................. 7-18 Refresh Method Descriptions................................................................................................. 7-18 Fast Solve Refreshes................................................................................................................. 7-19 Using Query Rewrite ...................................................................................................................... 7-20 Acquiring Additional Information About Cube Materialized Views ..................................... 7-21
Security
Security of Multidimensional Data in Oracle Database .................................................................. 8-1 Security Management........................................................................................................................ 8-1 Types of Security ................................................................................................................................ 8-2
vii
About the Privileges .......................................................................................................................... 8-2 Layered Security................................................................................................................................. 8-2 Setting Object Security ........................................................................................................................... 8-3 Using SQL to Set Object Security ..................................................................................................... 8-3 Setting Object Security on an Analytic Workspace................................................................ 8-3 Setting Object Security on Dimensions.................................................................................... 8-3 Setting Object Security on Cubes.............................................................................................. 8-3 Using Analytic Workspace Manager to Set Object Security ........................................................ 8-5 Setting Object Security on an Analytic Workspace................................................................ 8-5 Setting Object Security on Dimensions ................................................................................... 8-6 Setting Object Security on Cubes ............................................................................................. 8-7 Creating Data Security Policies on Dimensions and Cubes ............................................................ 8-7 Creating OLAP Data Security Roles .................................................................................................. 8-10
Advanced Aggregations
What Is Aggregation? .............................................................................................................................. Aggregation Operators ............................................................................................................................ Basic Operators................................................................................................................................... Scaled and Weighted Operators ...................................................................................................... Hierarchical Operators ...................................................................................................................... When Does Aggregation Order Matter? .............................................................................................. Using the Same Operator for All Dimensions of a Cube ............................................................. Order Has No Effect ................................................................................................................... Order Changes the Aggregation Results................................................................................. Order May Be Important ........................................................................................................... Example: Mixing Aggregation Operators ...................................................................................... Example: Aggregating the Units Cube ................................................................................................. Selecting the Aggregation Operators and Hierarchies ................................................................. Choosing the Percentage of Precomputed Values ........................................................................ 9-1 9-3 9-3 9-3 9-4 9-4 9-5 9-5 9-5 9-5 9-5 9-6 9-6 9-7
B Keyboard Shortcuts
Menu Bar ................................................................................................................................................... B-1
viii
Navigation Tree ....................................................................................................................................... Property Sheets ........................................................................................................................................ Shuttle Keys ............................................................................................................................................. Mapping Canvas ......................................................................................................................................
Glossary Index
ix
Preface
Oracle OLAP User's Guide explains how SQL applications can extend their analytic processing capabilities and manage summary data by using the OLAP option of Oracle Database. It also provides information about managing resources for OLAP. The preface contains these topics:
Audience
This manual is intended for DBAs who perform these tasks:
Develop and manage a data warehouse Create and maintain dimensional data objects Administer Oracle Database with the OLAP option
Documentation Accessibility
For information about Oracle's commitment to accessibility, visit the Oracle Accessibility Program website at http://www.oracle.com/pls/topic/lookup?ctx=acc&id=docacc. Access to Oracle Support Oracle customers have access to electronic support through My Oracle Support. For information, visit http://www.oracle.com/pls/topic/lookup?ctx=acc&id=info or visit http://www.oracle.com/pls/topic/lookup?ctx=acc&id=trs if you are hearing impaired.
Related Documents
For more information about the OLAP option, see the following manuals in the Oracle Database 12c documentation set:
Oracle Database SQL Language Reference Contains complete syntax descriptions of the SQL CREATE, ALTER, and DELETE syntax for managing cubes, cube dimensions, and other dimensional database objects.
xi
Oracle Database Reference Contains full descriptions of the data dictionary views for cubes, cube dimensions, and other dimensional database objects.
Oracle Database PL/SQL Packages and Types Reference Contains full descriptions of DBMS_CUBE and several other PL/SQL packages for managing cubes.
Oracle OLAP DML Reference Contains a complete description of the OLAP Data Manipulation Language (OLAP DML).
Conventions
The following text conventions are used in this document:
Convention boldface italic monospace Meaning Boldface type indicates graphical user interface elements associated with an action, or terms defined in text or the glossary. Italic type indicates book titles, emphasis, or placeholder variables for which you supply particular values. Monospace type indicates commands within a paragraph, URLs, code in examples, text that appears on the screen, or text that you enter.
xii
New Features
The following Analytic Workspace Manager features are new in this release:
Connecting to a database using a service name. See "Defining a Database Connection" on page 2-4.
Specifying an attachment mode when connecting an analytic workspace to an Oracle Database. See "Showing the Analytic Workspace Attachment Modes" on page 2-4.
Creating calculated measures that are based on OBIEE-compatible SQL function expressions. If this feature is enabled, then Analytic Workspace Manager automatically creates additional calculated measures as needed for processing a calculated measure that you create. The cube that contains the calculated measures is suitable for exporting to Oracle Business Intelligence Enterprise Edition (OBIEE). You can export a cube to OBIEE by using the Export to OBIEE Administrator plug-in. See "Creating Cubes" on page 3-16 and "Creating Calculated Measures" on page 5-3.
Creating a fact view with a measure dimension table. The fact view pivots a fact table so that the measures identify rows instead of columns. See "Creating Measure Dimensions" on page 3-10.
Creating a measure dimension. You can now create a measure dimension, which has measures as dimension members. Dimensions now have a Class type, which is either None or Measure. See "Creating Measure Dimensions" on page 3-10.
xiii
Saving the SQL of a mapping to the clipboard or to a file. See "Mapping Cubes" on page 3-18
Specifying the aggregate functions SUM, MAX, MIN, AVG, and COUNT when mapping a cube to data sources. See "Aggregate Functions" on page 3-20.
Partitioning a cube on multiple levels of a dimension hierarchy. See "Partitioning a Cube" on page 3-22.
Saving an object definition to, or create an object from, a table. Previously, you could save an object definition as an XML template in a file. You can now save an XML template in a database table. You can use the XML template to re-create the dimensional objects. See "Saving and Re-Creating Dimensional Objects with Object Definitions" on page 3-37.
Copying and pasting objects. You can now copy objects such as an analytic workspace, a cube, a dimension, or a measure and paste that object in an appropriate location. See "Copying and Pasting Dimensional Objects" on page 3-39.
Joining cubes to tables and views. Oracle Database 12c introduces the CUBE JOIN operation. This operation improves the performance of joining a cube to a table or a view. See "Joining Cubes to Tables and Views" on page 4-16.
Desupported Features
The following features are no longer supported by Oracle. See Oracle Database Upgrade Guide for a complete list of desupported features in this release.
Other Changes
The following are additional changes in the release:
Data security policies use Oracle Real Application Security. When you create data security policies for cubes or dimensions, Oracle OLAP uses Oracle Real Application Security (ORAS) instead of Extensible Data Security (XDS), which it used in Oracle Database 11g releases. When you upgrade Oracle Database from an 11g release to 12c, then your XDS data security policies are automatically converted to ORAS.
Note: Data security roles defined in an 11g Oracle Database instance are not automatically converted to ORAS. Before you upgrade an 11g database to 12c, you must delete any data security roles that are defined in the 11g database. After the upgrade, you may use Analytic Workspace Manager 12c to define the data security roles again.
If you upgrade an 11g database to 12c without deleting the 11g data security roles, then any data security policies that include a data security role are invalid in the 12c database.
xiv
Some differences in the data security implementation in 12c are the following: Only the owner of an object can create a data security policy or OLAP data security role. In 11g, the owner and anyone who had write privileges for the object could create a data security policy or OLAP data security role. You can disable the data security policy for a dimension or a cube and then enable it again. You can have zero or one OLAP data security role. In 11g, you could have more than one.
See Also:
"Creating Data Security Policies on Dimensions and Cubes" on page 8-7 for information on creating data security policies in Analytic Workspace Manager "Disabling and Enabling Data Security" on page 8-9 Oracle Database Real Application Security Administrator's and Developer's Guide for information on configuring data security in Oracle Database
Viewing information about the partitions of a cube. The Partition Member Analysis subtab of the Partitioning properties of a cube displays information about the partitioning of the cube. This information replaces the Cube Partitioning Advisor of previous releases.
See Also:
Oracle Enterprise Manager Cloud Control (Cloud Control) In previous releases of Oracle Database, you used Oracle Enterprise Manager Database Control (Database Control) to manage database performance tuning from a graphical user interface. In this release, you can use the Cloud Control graphical user interface. You must install Cloud Control separately from Oracle Database.
See Also:
Oracle Enterprise Manager Cloud Control Basic Installation Guide for information on installing Cloud Control "Monitoring Analytic Workspaces" on page 7-10
xv
xvi
1
1
Overview
This chapter introduces the powerful analytic resources available in the Oracle Database with the OLAP option. It consists of the following topics:
OLAP Technology in the Oracle Database Developing Reports and Dashboards Using SQL Tools and Application Builders Overview of the Dimensional Data Model
Cubes and other dimensional objects are first class data objects represented in the Oracle data dictionary. Cubes and other dimensional objects are supported by standard SQL syntax in the CREATE, ALTER, DROP, and SELECT statements. The OLAP engine runs within the kernel of Oracle Database. Dimensional objects are stored in Oracle Database in their native multidimensional format. Data security is administered in the standard way, by granting and revoking privileges to Oracle Database users and roles.
The benefits to your organization are significant. Oracle OLAP offers the power of simplicity: One database, standard administration and security, standard interfaces and development tools.
Overview 1-1
Ease of Administration
Because Oracle OLAP is completely embedded in the Oracle database, there is no administration learning curve as is typically associated with standalone OLAP servers. You can leverage your existing DBA staff, rather than invest in specialized administration skills. A major administrative advantage of Oracle's embedded OLAP technology is automated cube maintenance. With standalone OLAP servers, the burden of refreshing the cube is entirely the responsibility of the administrator. This can be a complex and potentially error-prone job. You must create procedures to extract the changed data from the relational source, move the data from the source system to the system running the standalone OLAP server, load and rebuild the cube. You must take responsibility for the security of the deltas (changed values) during this process as well. With Oracle OLAP, in contrast, cube refresh is handled entirely by the Oracle database. The database tracks the staleness of the dimensional objects, automatically keeps track of the deltas in the source tables, and automatically applies only the changed values during the refresh process. You simply schedule the refresh at appropriate intervals, and Oracle Database takes care of everything else.
Security
With Oracle OLAP, standard Oracle Database security features are used to secure your multidimensional data. In contrast, with a standalone OLAP server, administrators must manage security twice: once on the relational source system and again on the OLAP server system. Additionally, they must manage the security of data in transit from the relational system to the standalone OLAP system.
Developing Reports and Dashboards Using SQL Tools and Application Builders
When Oracle Database is installed with Real Application Clusters (Oracle RAC), OLAP applications receive the same benefits in performance, scalability, fail over, and load balancing as any other application.
Reduced Costs
All these features add up to reduced costs. Administrative costs are reduced because existing personnel skills can be leveraged. Moreover, the Oracle database can manage the refresh of dimensional objects, a complex task left to administrators in other systems. Standard security reduces administration costs as well. Application development costs are reduced because the availability of a large pool of application developers who are SQL knowledgeable, and a large collection of SQL-based development tools means applications can be developed and deployed more quickly. Any SQL-based development tool can take advantage of Oracle OLAP. Hardware costs are reduced by Oracle OLAP's efficient management of aggregations, use of shared cursors, and Oracle RAC, which enables highly scalable systems to be built from low-cost commodity components.
Developing Reports and Dashboards Using SQL Tools and Application Builders
Analysts can choose any SQL query and analysis tool for selecting, viewing, and analyzing the data. You can use your favorite tool or application, or use a tool supplied with Oracle Database. Figure 11 displays a portion of a dashboard created in Oracle Application Express, which is distributed with Oracle Database. Application Express generates HTML reports that display the results of SQL queries. It only understands SQL; it has no special knowledge of dimensional objects. This dashboard demonstrates information-rich calculations such as ratio, share, prior period, and cumulative total. Separate tabs on the dashboard present Profitability Analysis, Sales Analysis, and Product Analysis. Each tab presents the data in dials, bar charts, horizontal bar charts, pie charts, and cross-tabular reports. A drop-down list in the upper left corner provides a choice of Customers. The dial displays the quarterly profit margin. To the right is a bar chart that compares current profits with year-ago profits.
Overview 1-3
Developing Reports and Dashboards Using SQL Tools and Application Builders
The pie chart in Figure 12 displays the percent share that each product family contributed to the total profits in the last quarter.
Figure 12 Contributions of Product Families to Total Profits
The horizontal bar chart in Figure 13 displays ranked results for locations with the largest gains in profitability from a year ago. Decision makers can see at a glance how each location improved by the last quarter.
Figure 14 compares current profits with year-to-date, year-to-date year ago, the change between year-to-date and year-to-date year ago, and percent change between year-to-date and year-to-date year-ago profits. The cross-tabular report features interactive drilling, so that decision makers can easily see the detailed data that contributed to a parent value of interest.
Figure 14 Year-to-Date Profits Compared to Year Ago
are interested in examining, which dimensions and attributes make the data meaningful, and how the dimensions of their business are organized into levels and hierarchies. Figure 15 shows the general relationships among dimensional objects.
Figure 15 Diagram of the OLAP Dimensional Model
The dimensional data model is highly structured. Structure implies rules that govern the relationships among the data and control how the data can be queried. Cubes are the physical implementation of the dimensional model, and thus are highly optimized for dimensional queries. The OLAP engine leverages this innate dimensionality in performing highly efficient cross-cube joins for inter-row calculations, outer joins for time series analysis, and indexing. Dimensions are pre-joined to the measures. The technology that underlies cubes is based on an indexed multidimensional array model, which provides direct cell access. The OLAP engine manipulates dimensional objects in the same way that the SQL engine manipulates relational objects. However, because the OLAP engine is optimized to calculate analytic functions, and dimensional objects are optimized for analysis, analytic and row functions can be calculated much faster in OLAP than in SQL. The dimensional model enables Oracle OLAP to support high-end business intelligence tools and applications such as OracleBI Discoverer Plus OLAP, OracleBI Spreadsheet Add-In, OracleBI Suite Enterprise Edition, BusinessObjects Enterprise, and Cognos ReportNet.
Cubes
Cubes provide a means of organizing measures that have the same shape, that is, they have the exact same dimensions. Measures in the same cube can easily be analyzed and displayed together. A cube usually corresponds to a single fact table or view.
Measures
Measures populate the cells of a cube with the facts collected about business operations. Measures are organized by dimensions, which typically include a Time dimension. An analytic database contains snapshots of historical data, derived from data in a transactional database, legacy system, syndicated sources, or other data sources. Three
1-6 Oracle OLAP User's Guide
years of historical data is generally considered to be appropriate for analytic applications. Measures are static and consistent while analysts are using them to inform their decisions. They are updated in a batch window at regular intervals: weekly, daily, or periodically throughout the day. Some administrators refresh their data by adding periods to the time dimension of a measure, and may also roll off an equal number of the oldest time periods. Each update provides a fixed historical record of a particular business activity for that interval. Other administrators do a full rebuild of their data rather than performing incremental updates. A critical decision in defining a measure is the lowest level of detail. Users may never view this detail data, but it determines the types of analysis that can be performed. For example, market analysts (unlike order entry personnel) do not need to know that Beth Miller in Ann Arbor, Michigan, placed an order for a size 10 blue polka-dot dress on July 6, 2006, at 2:34 p.m. But they might want to find out which color of dress was most popular in the summer of 2006 in the Midwestern United States. The base level determines whether analysts can get an answer to this question. For this particular question, Time could be rolled up into months, the Customer dimension could be rolled up into regions, and the Product dimension could be rolled up into items (such as dresses) with an attribute of color. However, this level of aggregate data could not answer the question: At what time of day are women most likely to place an order? An important decision is the extent to which the data has been aggregated before being loaded into a data warehouse. Calculated measures return values that are computed at run time from data stored in one or more measures. Like relational views, calculated measures store queries against data stored in other objects. Because calculated measures do not store data, you can create dozens of them without increasing the size of the database. You can use them as the basis for defining other calculated measures, which adds depth to the types of calculations you can create.
Dimensions
Dimensions contain a set of unique values that identify and categorize data. They form the edges of a cube, and thus of the measures within the cube. Because measures are typically multidimensional, a single value in a measure must be qualified by a member of each dimension to be meaningful. For example, the Sales measure has four dimensions: Time, Customer, Product, and Channel. A particular Sales value (43,613.50) only has meaning when it is qualified by a specific time period (Feb-06), a customer (Warren Systems), a product (Portable PCs), and a channel (Catalog). Base-level dimension values correspond to the unique keys of a fact table. A measure dimension is a dimension that has measures as dimension members. With a measure dimension, you can generate calculated measures for all of the measures in the cube simultaneously. Also, you do not have to create a new set of calculated measures for each measure that you add to the cube. The existing calculated measures apply to the new measure in the measure dimension. This is especially useful if you create new measures frequently.
Overview 1-7
Level-Based Hierarchies
Each level represents a position in the hierarchy. Each level above the base (or most detailed) level contains aggregate values for the levels below it. The members at different levels have a one-to-many parent-child relation. For example, Q1-05 and Q2-05 are the children of 2005, thus 2005 is the parent of Q1-05 and Q2-05. Suppose a data warehouse contains snapshots of data taken three times a day, that is, every 8 hours. Analysts might normally prefer to view the data that has been aggregated into days, weeks, quarters, or years. Thus, the Time dimension needs a hierarchy with at least five levels. Similarly, a sales manager with a particular target for the upcoming year might want to allocate that target amount among the sales representatives in his territory; the allocation requires a dimension hierarchy in which individual sales representatives are the child values of a particular territory. Hierarchies and levels have a many-to-many relationship. A hierarchy typically contains several levels, and a single level can be included in multiple hierarchies. Each level typically corresponds to a column in a dimension table or view. The base level is the primary key.
Value-Based Hierarchies
Although hierarchies are typically composed of named levels, they do not have to be. The parent-child relations among dimension members may not define meaningful levels. For example, in an employee dimension, each manager has one or more reports, which forms a parent-child relation. Creating levels based on these relations (such as individual contributors, first-level managers, second-level managers, and so forth) may not be meaningful for analysis. Likewise, the line item dimension of financial data does not have levels. This type of hierarchy is called a value-based hierarchy.
Attributes
An attribute provides additional information about the data. Some attributes are used for display. For example, you might have a product dimension that uses Stock Keeping Units (SKUs) for dimension members. The SKUs are an excellent way of uniquely identifying thousands of products, but are meaningless to most people if they are used to label the data in a report or a graph. You would define attributes for the descriptive labels. You might also have attributes like colors, flavors, or sizes. This type of attribute can be used for data selection and answering questions such as: Which colors were the most popular in women's dresses in the summer of 2005? How does this compare with the previous summer? Time attributes can provide information about the Time dimension that may be useful in some types of analysis, such as identifying the last day or the number of days in each time period. Each attribute typically corresponds to a column in dimension table or view.
2
2
This chapter describes the preliminary steps you should take to use Oracle OLAP. It assumes that you have installed Oracle Database 12c Enterprise Edition. The OLAP option is installed automatically as part of a Basic installation of Oracle Database. To start querying dimensional objects immediately, install the Global analytic workspace, as described in "Installing the Sample Schema". Then follow the instructions in Chapter 4.
Note:
Installing the Sample Schema Database Management Tasks Granting Privileges to DBAs and Application Developers Getting Started with Analytic Workspace Manager Upgrading Metadata From Oracle OLAP 10g
2-1
OLAP_XS_ADMIN role
CREATE ANY MATERIALIZED VIEW privilege CREATE ANY DIMENSION privilege ADVISOR privilege
Users also need an unlimited quota on the tablespace in which the dimensional objects are stored. The tablespaces should be defined specifically for OLAP use, as described in Chapter 7. If the source tables are in a different schema, then the owner of the dimensional objects needs SELECT object privileges on those tables. Example 21 shows the SQL statements for creating the GLOBAL user.
Example 21 SQL Statements for Creating the GLOBAL User CREATE USER "GLOBAL" IDENTIFIED BY password DEFAULT TABLESPACE glo TEMPORARY TABLESPACE glotmp QUOTA UNLIMITED ON glo PASSWORD EXPIRE; GRANT OLAP_USER TO GLOBAL; GRANT CREATE SESSION TO GLOBAL; GRANT OLAP_XS_ADMIN TO GLOBAL;
If you are installing on a remote system, then select either an Administrator or a Custom installation. The Administrator choice automatically installs Analytic Workspace Manager on the client.
See Also:
From the Start menu, select Oracle - Oracle_home, then Integrated Management Tools, and then OLAP Analytic Workspace Manager and Worksheet.
If Analytic Workspace Manager does not have access to the Internet, the property viewer shows links to several useful sites. It also shows an exception, because Analytic Workspace Manager cannot display the OLAP home page. To connect to the Internet, you typically need to identify the proxy server. To identify the proxy server:
1. 2. 3.
From the Tools menu, select Configuration to display the Configuration dialog box. Under OLAP Home Page Settings, enter the address of the proxy server. Enter the port number for the proxy server, if it is not default port 80.
2-3
4.
Click OK to save these settings. The OLAP Home page appears the next time you start Analytic Workspace Manager.
Right-click the top Databases folder in the navigation tree, then select New Database Connection from the shortcut menu. Complete the New Database Connection dialog box.
Figure 22 shows the connection information on the General tab of the New Database Connection dialog box.
Figure 22 Defining a Database Connection
Click the plus icon (+) next to a database connection in the navigation tree. Supply your database user name and password in the Connect to Database dialog box.
Read only In this mode a user can view the analytic workspace objects and data but cannot create or change objects. The user can export an object by copying it or saving it as a template. Any number of users can open an analytic workspace in Read Only mode.
Read Write In this mode a user can view the analytic workspace objects and data and create or change objects. The user can export or import an object. Only one user can open an analytic workspace in Read Write mode but any number of other users can open it in Read Only mode. This is the default mode.
Read Write Exclusive In this mode a user has the same access rights as in Read Write mode but no one else can open the analytic workspace. This mode is not available if another user has the analytic workspace open.
From the Tools menu, select Configuration. The Configuration dialog box opens.
2.
Installing Plug-ins
Plug-ins extend the functionality of Analytic Workspace Manager. Plug-ins are distributed as JAR files. Any Java developer can create a plug-in. The developer should provide information about what the plug-in does and how to use it. If you have one or more plug-ins, then you must identify their location to Analytic Workspace Manager. To use plug-ins:
1. 2. 3. 4.
Create a local directory for storing the plug-ins. Copy the JAR files to that directory. Open Analytic Workspace Manager. Select Configuration from the Tools menu. The Configuration dialog box opens.
5. 6.
Select Enable Plugins and identify the plug-in directory. Click OK. Close and reopen Analytic Workspace Manager. The functionality provided by the plug-ins is available in the navigator.
Some Analytic Workspace Manager plug-ins are available for download from the Oracle Technology Network (OTN). To download plug-ins from OTN:
The OLAP 10g analytic workspace can use OLAP standard form metadata. The original relational source data must be available to load into the new analytic workspace. If the data is in a different schema or the table names are different,
2-5
then you must remap the dimensional objects to the new relational sources after the upgrade.
You can create the OLAP 12c analytic workspace in the same schema as the OLAP 10g analytic workspace. However, if you choose to create the OLAP 12c analytic workspace in a different schema, you must grant the new user the appropriate privileges as described in "Granting Privileges to DBAs and Application Developers" on page 2-1.
Open Analytic Workspace Manager for Oracle Database 12c Release 1. If necessary, create a new database connection to the database instance with the analytic workspace. See "Defining a Database Connection" on page 2-4. Open the database connection. On the Connect to Database dialog box, select OLAP 11g for the Cube Type. See "Opening a Database Connection" on page 2-4. Expand the navigation tree until the name of the analytic workspace appears. Right-click the analytic workspace and select Create 12c Upgrade Template for 11g Analytic Workspace. Save the XML template to a file. The Create 12c Upgrade Template for 12c Analytic Workspace dialog box appears if any subobjects, such as a level and a hierarchy, have the same name. Duplicate object names are changed automatically for the upgrade. You cannot edit the names now, but you can change them later.
6. 7. 8. 9.
Click Close to close the dialog box. Right-click the connection in the tree and select Disconnect Database. Right-click the connection again and select Connect Database. On the Connect to Database dialog box, log in with the new user name and select OLAP 11g for the Cube Type. select Create Analytic Workspace From Template.
10. Expand the tree, right-click Analytic Workspaces under the new schema, and 11. Open the upgrade template that you created previously.
The Correct Duplicate Names From Analytic Workspace Template Import dialog box appears if any objects, such as a cube, dimensions, or the analytic workspace, duplicate object names that already exist in the schema.
12. Enter new names to resolve any conflicts, then click OK. 13. Before loading the data, you may want to browse the dimensional objects and
make any changes to the object names, cube partitioning, or aggregation strategy.
14. Load data into the new analytic workspace as described in "Loading Data Into
3
3
This chapter explains how to design a data model and create dimensions and cubes using Analytic Workspace Manager. It contains the following topics:
Designing a Dimensional Model for Your Data Introduction to Analytic Workspace Manager Creating a Dimensional Data Store Using Analytic Workspace Manager Creating Dimensions Creating Cubes Choosing a Data Maintenance Method Supporting Multiple Languages Defining Measure Folders Saving and Re-Creating Dimensional Objects with Object Definitions Copying and Pasting Dimensional Objects
Fact tables correspond to cubes. Data columns in the fact tables correspond to measures. Foreign key constraints in the fact tables identify the dimension tables. Dimension tables identify the dimensions. Primary keys in the dimension tables identify the base-level dimension members. Parent columns in the dimension tables identify the higher level dimension members.
3-1
Columns in the dimension tables containing descriptions and characteristics of the dimension members identify the attributes.
You can also get insights into the dimensional model by looking at the reports currently being generated from the source data. The reports identify the levels of aggregation that interest the report consumers and the attributes used to qualify the data. While investigating your source data, you may decide to create relational views that more closely match the dimensional model that you plan to create.
See Also:
"Overview of the Dimensional Data Model" on page 1-5 for an introduction to dimensional objects Appendix A, "Designing a Dimensional Model" for a case study of developing a dimensional model for the Global analytic workspace
Develop a dimensional model of your data. Instantiate that model as dimensional objects. Load data from relational tables into those objects. Define information-rich calculations. Create materialized views that can be used by the database refresh system. Automatically generate relational views of the dimensional objects.
You must have SELECT privileges on the relational data sources so you can load the data into the dimensions and cubes. This chapter assumes that you have a star, snowflake, or other relational schema that supports dimensional objects. Figure 31 shows the main window of Analytic Workspace Manager. It contains menus, a toolbar, a navigation tree, and property sheets. When you select an object in the navigation tree, the property sheet to the right provides detailed information about that object. When you right-click an object, you get a choice of menu items with appropriate actions for that object.
Analytic Workspace Manager has a full online Help system, which includes context-sensitive Help.
Figure 31 Analytic Workspace Manager Main Window
Open Analytic Workspace Manager and connect to your database instance as the user defined for this purpose. Create an analytic workspace in the database:
a. b. c.
In the navigation tree, expand the folders until you see the schema where you want to create the analytic workspace. Right-click Analytic Workspaces, then click Create Analytic Workspace. Complete the Create Analytic Workspace dialog box, then select Create. If the Attach Workspace dialog box appears, select the Read Write or Read Write Exclusive attachment mode. The analytic workspace appears in the Analytic Workspaces folder for the schema.
3.
Define the dimensions for the data. See "Creating Dimensions" on page 3-4.
4.
Define the cubes for the data. See "Creating Cubes" on page 3-16.
5.
Load data into the cubes and dimensions. See "Loading Data Into Cubes" on page 3-26.
3-3
Creating Dimensions
When you have finished, you have an analytic workspace populated with the detail data fetched from relational tables or views. You may also have summarized data and calculated measures.
Develop custom cube scripts to customize the builds. See "Creating and Executing Custom Cube Scripts" on page 3-31.
Generate materialized views that support automatic refresh and query rewrite. See "Adding Materialized View Capability to a Cube" on page 3-34.
Support multiple languages by adding translations of metadata and attribute values. See "Supporting Multiple Languages" on page 3-35.
Define measure folders to simplify access for end users. See "Defining Measure Folders" on page 3-36.
Click OK or the equivalent button in a dialog box. For example, when you click Create in the Create Dimension dialog box, the dimension is committed to the database.
Click Apply in a property sheet. For example, when you change the labels on the General property page for an object, the change takes effect when you click Apply.
Creating Dimensions
Dimensions are lists of unique values that identify and categorize data. They form the edges of a cube, and thus of the measures within the cube. In a report, the dimension values (or their descriptive attributes) provide labels for the rows and columns. You can define dimensions that have any of these common forms:
Level-based dimensions that use parent-child relationships to group members into levels. Most dimensions are level-based. Value-based dimensions that have parent-child relationships among their members, but these relationships do not form meaningful levels. List or flat dimensions that have no levels or hierarchies.
You define a dimension as a User, Time, or Measure dimension. Detail-level dimension values typically correspond to the unique keys of a fact table. A measure dimension has measures as dimension members. This section has the following topics:
Requirements of a Dimension
Creating Dimensions
Creating a Dimension Creating Levels Creating Hierarchies Creating Attributes Creating Measure Dimensions Mapping Dimensions Loading Data Into Dimensions Displaying the Dimension View Displaying the Default Hierarchy
Requirements of a Dimension
Dimensions must meet the following requirements:
Source keys are read from the relational sources without modification. To use the same exact keys as the source data, the values must be unique across levels. Because each level may be mapped to a different relational column, this uniqueness may not be enforced in the source data. For example, a dimension table might have a Day column with values of 1 to 366 and a Week column with values of 1 to 52. Unless you take steps to assure uniqueness, the values from the Week column overwrite the first 52 Day values. Surrogate keys ensure uniqueness by adding a level prefix to the members while loading them into the analytic workspace. For the previous example, surrogate keys create two dimension members named DAY_1 and WEEK_1, instead of a single member named 1. A dimension that has surrogate keys must be defined with at least one level-based hierarchy.
Analytic Workspace Manager creates surrogate keys unless you specify otherwise.
3-5
Creating Dimensions
Creating a Dimension
This section describes how to create a standard User or Time dimension. See "Creating Measure Dimensions" on page 3-10 for information on creating a measure dimension. To create a dimension:
1. 2.
Expand the folder for the analytic workspace. Right-click Dimensions, then select Create Dimension. The Create Dimension dialog box appears.
3. 4. 5.
Complete the General tab. If the keys in the source table are unique across levels, you can change the default setting on the Implementation Details tab. Click Create. The dimension appears as a subfolder under Dimensions.
Creating Levels
For business analysis, data is typically summarized by level. For example, your database may contain daily snapshots of a transactional database. Days are the base level. You might summarize this data at the weekly, quarterly, and yearly levels. Levels have parent-child or one-to-many relationships, which form a level-based hierarchy. For example, each week summarizes seven days, each quarter summarizes 13 weeks, and each year summarizes four quarters. This hierarchical structure enables analysts to detect trends at the higher levels, then drill down to the lower levels to identify factors that contributed to a trend. For each level that you define, you must identify a data source for dimension members at that level. Members at all levels are stored in the same dimension. In the previous example, the Time dimension contains members for weeks, quarters, and years. To create a level:
1. 2.
Expand the folder for the dimension. Right-click Levels, then select Create Level. The Create Level dialog box appears.
3.
Creating Dimensions
4.
Figure 33 shows the creation of the Class level for the Product dimension.
Figure 33 Creation of the Class Level
Creating Hierarchies
Dimensions can have one or more hierarchies. They can be level based or value based. Level-Based Hierarchies Most hierarchies are level based. Analytic Workspace Manager supports these common types of level-based hierarchies:
Normal hierarchies consist of one or more levels of aggregation. Members roll up into the next higher level in a many-to-one relationship, and these members roll up into the next higher level, and so forth to the top level. Ragged hierarchies contain at least one member with a different base, creating a "ragged" base level for the hierarchy. Ragged hierarchies are not supported for cube materialized views. Skip-level hierarchies contain at least one member whose parents are multiple levels above it, creating a hole in the hierarchy. An example of a skip-level hierarchy is City-State-Country, where at least one city has a country as its parent (for example, Washington D.C. in the United States). In relational source tables, a skip-level hierarchy may contain nulls in the level columns. Skip-level hierarchies are not supported for cube materialized views.
Multiple hierarchies for a dimension typically share the base-level dimension members and then branch into separate hierarchies. They can share the top level if they use all the same base members and use the same aggregation operators. Otherwise, they need different top levels to store different aggregate values. For example, a Customer dimension may have multiple hierarchies that include all base-level customers and are summed to a shared top level. However, a Time dimension with calendar and fiscal hierarchies must aggregate to separate Calendar Year (January to December) and Fiscal Year (July to June) levels, because they use different selections of base-level members.
3-7
Creating Dimensions
Value-Based Hierarchies You may also have dimensions with parent-child relations that do not support levels. For example, an employee dimension might have a parent-child relation that identifies each employee's supervisor. However, levels that group first-, second-, and third-level supervisors and so forth may not be meaningful for analysis. Similarly, you might have a line-item dimension with members that cannot be grouped into meaningful levels. In this situation, you can create a value-based hierarchy defined by the parent-child relations, which does not have named levels. You can create value-based hierarchies only for dimensions that use the source keys, because surrogate keys are formed with the names of the levels. To create a hierarchy:
1. 2.
Expand the folder for the dimension. Right-click Hierarchies, then select Create Hierarchy. The Create Hierarchy dialog box appears.
3.
Complete the General tab of the Create Hierarchy dialog box. Click Help for information about these choices.
4.
Figure 34 shows the creation of the Primary hierarchy for the Product dimension.
Figure 34 Creation of the Product Primary Hierarchy
Creating Attributes
Attributes provide information about the individual members of a dimension. They are used for labeling crosstabular and graphical data displays, selecting data, organizing dimension members, and so forth.
Creating Dimensions
All dimensions can be created with long and short description attributes. If your source tables include long and short descriptions, then you can map the attributes to the appropriate columns. However, if your source tables include only one set of descriptions, then you can create and map just one description attribute. If you map both the long and short description attributes to the same column, the data is loaded twice. Time dimensions are created with time-span and end-date attributes. This information must be provided for all Time dimension members.
User-Defined Attributes
You can create additional "User" attributes that provide supplementary information about the dimension members, such as the addresses and telephone numbers of customers, or the color and sizes of products. To create an attribute:
1. 2.
Expand the folder for the dimension. Right-click Attributes, then select Create Attribute. The Create Attribute dialog box appears.
3.
Complete the General tab of the Create Attribute dialog box. Some attributes apply to all dimension members, and others apply to only one level. Your selection in the Apply Attributes To box controls the mapping of the attribute to one column or to multiple columns. Click Help for information about these choices.
4. 5.
To change the data type from the default choice of VARCHAR2, complete the Implementation Details tab. Click Create. The attribute appears as an item in the Attributes folder.
Figure 35 shows the creation of the Marketing Manager attribute for the Product dimension. Notice that this attribute applies only to the Item level.
Figure 35 Creation of the Product Marketing Manager Attribute
3-9
Creating Dimensions
From the Tools menu, select Create Fact View with Measure Dimension. The Create Fact View with Measure Dimension dialog box appears.
2.
Complete the Create Fact View with Measure Dimension dialog box.
a. b. c. d. e. f.
From the Schema list, select a schema. From the Object list, select a fact table. In the Fact View Name field, keep the default name or enter a different name. In the table of the columns of the fact table, select the columns for the measures that you want the measure dimension to have. Optional: To automatically create a table for the measure dimension, select the Create Measure Dimension Table option. Click Create.
3. 4.
Expand the folder for the analytic workspace. Right-click Dimensions, then select Create Dimension. The Create Dimension dialog box appears.
5.
Complete the General tab. For the Dimension Class Type, be sure to select Measure Dimension. A measure dimension is a flat dimension, with no levels or hierarchies.
6.
After creating the measure dimension, create a cube and add the dimension to it. If you create a new column in the fact table and you want to add it to the measure dimension, then must create the fact view for the fact table again and maintain the measure dimension and the cube. To add a measure to the measure dimension:
1.
From the Tools menu, select Create Fact View with Measure Dimension. The Create Fact View with Measure Dimension dialog box appears.
2.
Complete the Create Fact View with Measure Dimension dialog box.
Creating Dimensions
a. b. c. d. e. f. 3. 4.
From the Schema list, select a schema. From the Object list, select the fact table that you used to create the measure dimension. In the Fact View Name field, keep the default name or enter a different name. In the table of the columns of the fact table, select the columns for the measures that you want the measure dimension to have. Optional: To automatically create a table for the measure dimension, select the Create Measure Dimension Table option. Click Create.
Right-click the measure dimension and then select Maintain Dimension. Right-click the cube that has the measure dimension and then select Maintain Cube.
See Also:
Mapping Dimensions
Mapping identifies the relational data source for each dimensional object. After mapping a dimension to a column of a relational table or view, you can load the data. You can create, map, and load each dimension individually, or perform each step for all dimensions before proceeding to the next step. SQL Data Types for Dimensions You can map dimensions and levels to columns having these SQL data types, which are converted to text during a data load:
VARCHAR2 NVARCHAR2 NUMBER INTEGER DECIMAL CHAR NCHAR DATE TIMESTAMP TIMESTAMP WITH TIMEZONE TIMESTAMP WITH LOCAL TIMEZONE
You can map attributes to the same data types as cubes and measures, as described in "Data Types" on page 3-18.
Creating Dimensions
Tabular view: Drag-and-drop the names of individual columns from the schema navigation tree to the rows for the dimensional objects. Graphical view: Drag-and-drop icons, which represent tables and views, from the schema navigation tree onto the mapping canvas. Then draw lines from the columns to the dimensional objects.
You can use the OLAP expression syntax when mapping dimensions in the tabular view. This capability enables you to create the top level of a dimension without having a source column in the dimension table. You can also map attributes from different tables. OLAP automatically joins the tables on columns with the same name. Click Help on the Mapping window for more information. To map a dimension:
1.
In the navigation tree, expand the dimension folder and click Mappings. The Mapping window contains a schema navigation tree on the left and a mapping table for the dimension with rows for the levels and their attributes. This is the tabular view.
2. 3. 4. 5.
For normalized dimension tables, select Snowflake Schema for the Type of Dimension Table. To enlarge the Mapping Window, drag the divider to the left. In the schema tree, expand the tables, views, or synonyms that contain the dimension members and attributes. Drag-and-drop the source columns onto the appropriate cells in the mapping table for the dimension. Map a measure dimension to the measure dimension table. Specify measure_id as the member value.
6. 7.
After you have mapped all levels and attributes, click Apply. Drag the divider back to the right to reveal the navigation tree.
Figure 36 shows the Product dimension mapped in the tabular view. The arrow highlights how the PRODUCT_DIM.ITEM_BUYER column maps to the PRODUCT.ITEM.BUYER attribute.
Creating Dimensions
Create the dimension and its levels (including the top level), hierarchies, and attributes. Map the dimension as described previously for all but the top level. Enter an expression in the OLAP expression syntax for the top level.
This example shows a top level for all years in the Time dimension. The mapping expressions used for a Total level (that is, all years) in the Time dimension might look like this:
Member: 'TOTAL' LONG_DESCRIPTION: 'Total' SHORT_DESCRIPTION: 'Total' END_DATE: TO_DATE('31-Dec-2007', 'dd-mon-yyyy') TIME_SPAN: 3646
Member, LONG_DESCRIPTION, and SHORT_DESCRIPTION are set to literal strings, END_DATE uses the TO_DATE function, and TIME_SPAN is set to a number.
Right-click the source object in either the schema tree or the graphical view of the mapping canvas. Select View Data from the shortcut menu.
Creating Dimensions
In the navigation tree, right-click the Dimensions folder or the folder for a particular dimension. Select Maintain Dimension. The Maintenance Wizard opens on the Select Objects page.
3. 4.
Select one or more dimensions from Available Target Objects and use the shuttle buttons to move them to Selected Target Objects. Click Finish to load the dimension values immediately. The additional pages of the wizard enable you to create a SQL script or submit the load to the Oracle job queue. To use these options, click Next instead.
5.
Review the build log, which appears when the build is complete. If the log shows that errors occurred, then fix them and run the Maintenance Wizard again. Errors are typically caused by problems in the mapping. Check for incomplete mappings or changes to the source objects.
Figure 38 shows the first page of the Maintenance Wizard. Only the Product dimension has been selected for maintenance. All the Product dimension members and attributes are fetched from the mapped relational sources.
Creating Dimensions
Figure 39 shows the Maintenance log for a dimension displayed by Analytic Workspace Manager. It refreshes throughout the build to provide you with the most up-to-date information.
Figure 39 Maintenance Log for the Product Dimension
Creating Cubes
Figure 310
In the navigation tree, right-click the name of a dimension. Select View Data.
Creating Cubes
Cubes are informational objects that identify measures with the exact same dimensions and thus are candidates for being processed together at all stages: data loading, aggregation, storage, and querying. Cubes define the shape of your business measures. They are defined by a set of ordered dimensions. The dimensions form the edges of a cube, and the measures are the cells in the body of the cube. To create a cube:
Creating Cubes
1. 2.
Expand the folder for the analytic workspace. Right-click Cubes, then select Create Cube. The Create Cube dialog box appears.
3.
On the General tab, enter a name for the cube and select its dimensions. Select Enable SQL Expressions to allow Analytic Workspace Manager to create additional calculated measures as needed in processing a calculated measure. Enabling SQL expressions is especially useful if you are using the Oracle Business Intelligence Enterprise Edition (OBIEE) Plug-in for Analytic Workspace Manager to export the cube to OBIEE.
4.
On the Aggregation tab, click the Rules subtab and select an aggregation method for each dimension. If the cube uses multiple methods, then you may need to specify the order in which the dimensions are aggregated to get the desired results. You can ignore the bottom of the tab, unless you want to exclude a hierarchy from the aggregation. For a measure dimension, the aggregation operator is non-additive.
5.
If you run the advisors after mapping the cube, Oracle OLAP can determine the best partitioning and storage options. Alternatively, to define these options yourself, complete the Partitioning and Storage tabs before creating the cube. Click Create. The cube appears as a subfolder under Cubes.
6.
Figure 312 shows the Rules subtab for the Units cube with the list of operators displayed. "Aggregation Operators" on page 9-3 for descriptions of the aggregation operators.
See Also:
Figure 312 Selecting an Aggregation Operator
Creating Measures
Measures store the facts collected about your business. Each measure belongs to a particular cube, and thus shares particular characteristics with other measures in the cube, such as the same dimensions. The default characteristics of a measure are inherited from the cube.
Creating Cubes
Note:
The cube for a measure dimension has only one measure, which Analytic Workspace Manager creates automatically.
To create a measure:
1. 2.
Expand the folder for the cube that has the dimensions of the measure. Right-click Measures, then select Create Measure. The Create Measure dialog box appears.
3. 4.
On the General tab, enter a name for the measure. Click Create. The measure appears in the navigation tree as an item in the Measures folder.
Figure 313 shows the General tab of the Create Measure dialog box.
Figure 313 Creating the Sales Measure
Mapping Cubes
You use the same interface to map cubes as you did to map dimensions, as described in "Mapping Dimensions" on page 3-11. You can map a cube directly to a single fact table, or you can create more complex mappings using the OLAP expression syntax, which supports expressions, join conditions, and filters. Although the dimension columns in a fact table typically contain only key values at the detail level, you can also map cubes to summary tables that contain the values from multiple levels. For example, a Time column might contain days, months, quarters, and years; a Geography column might contain cities, states, and countries. When a build rolls up the data in the cube from the detail level, the calculated values overwrite the loaded summary values, thereby correcting any inconsistencies. Data Types You can map cubes and measures to columns having these SQL data types:
Creating Cubes
NVARCHAR2 CHAR NCHAR DATE TIMESTAMP TIMESTAMP WITH TIMEZONE TIMESTAMP WITH LOCAL TIMEZONE INTERVAL YEAR TO MONTH INTERVAL DAY TO SECOND
Expressions You can use the OLAP expression syntax when mapping cubes in the tabular view. This capability enables you to perform tasks like these as part of data maintenance, without any intermediate staging of the data:
Perform calculations on the relational data using any combination of functions and operators available in the OLAP expression syntax. Create measures that are more aggregate than their relational sources. For example, suppose the Time dimension has columns for Day, Month, Quarter, and Year, and the fact table for Sales is related to Time by the Day foreign key column. In a basic mapping, you would store data in the cube at the Day level. However, you could aggregate it to the Month level during the data refresh. Using a technique called one-up mapping, you would map the cube to the Month column for Time, and specify a join between the dimension table and the fact table on the Day columns.
Note:
You cannot map a measure dimension to an expression. You must map it to a column.
Join Conditions In the tabular view, the mapping for each dimension includes a join condition. In the basic case where you are mapping the foreign keys in a fact table to the primary keys in the related dimension tables, you can leave the join condition blank. Analytic Workspace Manager derives this information from the relational source tables when you save the mapping. For example, Analytic Workspace Manager provides this join condition for the TIME dimension in the UNITS_CUBE mapping:
GLOBAL.TIME_DIM.MONTH_ID = GLOBAL.UNITS_FACT.MONTH_ID
Note:
Filters A filter applies a WHERE clause to the query that loads data from the relational source into the cube. You can use a filter to limit the rows to those matching a certain condition. This filter restricts the data to the year 2007:
Creating Cubes
You can also use a filter to join two or more tables containing the measures. This filter joins the UNITS_FACT and PRICE_FACT tables in the Global schema on the Time (MONTH_ ID) and Product (ITEM_ID) dimensions:
GLOBAL.PRICE_FACT.MONTH_ID=GLOBAL.UNITS_FACT.MONTH_ID AND GLOBAL.PRICE_FACT.ITEM_ ID=GLOBAL.UNITS_FACT.ITEM_ID
Aggregate Functions The aggregate function specifies how the fact table data is loaded into the cube. You select an aggregate function from the Group By list. The aggregate functions are the following:
To map a cube:
1.
In the navigation tree, expand the cube folder and click Mappings. The Mapping window contains a schema navigation tree on the left and a mapping table for the cube and its dimensions. This is the tabular view. The level of a dimension from which values are aggregated is indicated by the symbol . You specify the level in the Aggregate From Level column on the Rules subtab of the Aggregation tab of the property sheet of a cube.
2. 3. 4.
To enlarge the Mapping window, drag the divider to the left. In the schema tree, expand the tables, views, or synonyms that contain the data for the measures. Drag-and-drop the source columns onto the appropriate cells in the mapping table for the cube. Map a measure dimension to the measure dimension fact view. See "Creating Measure Dimensions" on page 3-10 for information on creating the measure dimension fact view. From the measure dimension fact view columns, specify MEASURE_VALUE as the source column for the measure of the cube and specify MEASURE_DIM as the source column for the measure dimension of the cube.
5. 6. 7.
Optional: To see the SQL statements for the mapping, click Show SQL. You can save the SQL to a file or to the clipboard. After you have mapped all dimensions and measures, click Apply. Drag the divider back to the right to reduce the size of the Mapping window.
Figure 314 shows the mapping canvas with the Units cube mapped to columns in the UNITS_FACT table. After you save the mappings, Analytic Workspace Manager provides the join conditions for base-level mappings such as the ones shown here.
Creating Cubes
Figure 314
Create the cube. Map all dimensions and measures to the source tables. Edit the mapping of the measure to include a calculation in the OLAP expression syntax. For example, you might change UNITS_FACT.SALES to UNITS_FACT.SALES*1.06. You can use row expressions, column expressions, and conditions, but not nested SQL queries.
Create the cube dimensions with the desired levels and map them to the source dimension table. Create the cube and its measures. Map each measure to its source column in the fact table. For dimensions that are not being consolidated, map the detail level to its source column in the fact table, the same as you would in a basic cube mapping. For dimensions being consolidated:
a.
Map the dimension to the appropriate column in the dimension table, not to the fact table. In the previous scenario, you would map the Month level of the Time dimension to the Month column of the Time dimension table. For example, you would map Month to time_dim.month_column.
Creating Cubes
b.
Enter a join condition between the fact table and the dimension table at the detail level. For example, time_dim.day_key = fact_tbl.day_foreign_key.
Create the cube dimensions with the desired levels and map them to the source dimension table. Create the cube and its measures. Map each measure to its source column in the appropriate table. Map the detail level of each of the dimensions to its source column in each of the tables. When you drop the additional source column names, you are asked whether to add or replace the existing mapping. Select Add.
This example maps the two measures of a cube to columns in two different fact tables. The data for UNIT_PRICE is in the UNITS_FACT table, and the data for UNITS_SOLD is in the PRICE_FACT table. The following mapping identifies the dimension keys in both tables for MONTH and PRODUCT.
UNIT_PRICE: GLOBAL.PRICE_FACT.UNIT_PRICE UNITS_SOLD: GLOBAL.UNITS_FACT.UNITS MONTH: GLOBAL.PRICE_FACT.MONTH_ID GLOBAL.UNITS_FACT.MONTH_ID PRODUCT: GLOBAL.PRICE_FACT.ITEM_ID GLOBAL.UNITS_FACT.ITEM_ID
The next example maps one measure of a cube to columns in two different fact tables. The data for North America is in the AMERICA table, and the data for Europe is in the EMEA table. The following mapping for the UNITS_SOLD measure of UNION_CUBE creates a union of the two fact columns.
UNITS_SOLD: GLOBAL.AMERICA.UNITS GLOBAL.EMEA.UNITS TIME: GLOBAL.AMERICA.MONTH_ID GLOBAL.EMEA.MONTH_ID CHANNEL: GLOBAL.AMERICA.CHANNEL_ID GLOBAL.EMEA.CHANNEL_ID CUSTOMER: GLOBAL.AMERICA.SHIP_TO_ID GLOBAL.EMEA.SHIP_TO_ID PRODUCT: GLOBAL.AMERICA.ITEM_ID GLOBAL.EMEA.ITEM_ID
Partitioning a Cube
Partitioning is a method of physically storing the measures in a cube. It improves the performance of large measures in the following ways:
Improves scalability by keeping data structures small. Each partition functions like a smaller measure. Keeps the working set of data smaller both for queries and maintenance, since the relevant data is stored together. Enables parallel aggregation during data maintenance. Each partition can be aggregated by a separate process. Simplifies removal of old data from storage. Old partitions can be dropped, and new partitions can be added.
Creating Cubes
The number of partitions affects the database resources that can be allocated to loading and aggregating the data in a cube. Partitions can be aggregated simultaneously when sufficient resources have been allocated. You can select multiple levels of a hierarchy for partitioning. You select partitions and specify properties of them on the Partitioning tab of the property sheet for a cube. You can also view information about the partitions to help you decide on a partitioning strategy.
Note:
To select partitions:
1. 2.
In the navigation tree, select a cube. In the property sheet, select the Partitioning tab. The Partitioning tab appears, as shown in Figure 315.
3. 4. 5. 6.
Select Partition Cube and the Select Partitions subtab. Complete the Select Partitions subtab. Optional: To view information about the partitions, select the Partition Member Analysis subtab. To apply the partitioning to the cube, click Apply.
Selecting Partitions
Figure 315
See Also:
Creating Cubes
Selecting Partitions
You select the dimension and levels to be used for partitioning on the Select Partitions subtab. This section describes the following choices you can make on the subtab. Dimension A dimension for partitioning the cube. The dimension must have at least one level-based hierarchy. In developing a partitioning strategy, you typically want the members to be distributed evenly, such that each partition has about the same amount of data as the others, to support the best performance. You can switch among dimensions without losing your selections in Aggregation Hierarchies, and so you can freely explore your data. By default, partitions are created on a time dimension. Aggregation Hierarchies From the hierarchies and their levels for the selected dimension, you select the levels for partitioning. If the dimension has multiple hierarchies and you are partitioning on only one of them, choose the one that has the most members; it should be defined as the default hierarchy. After you make a selection, brackets enclose the levels that will be stored in the same partition. Each dimension member at the selected level is stored in a separate partition, along with its descendants. Any dimension members that are at higher levels or are not in the hierarchy are stored together, unless you select multiple levels for partitioning. Choose the levels with care to distribute the data evenly across the partitions. For example, if the time dimension has 10 years of data at the year, quarter, month, and day levels, then you might partition at the quarter level. This choice creates 40 partitions, one for each quarter and its descendants (months and days). The 10 members at the year level are stored together in a separate partition. If the data is very sparse, then you might partition by year instead of quarter. Another example is a time dimension with two hierarchies, calendar and fiscal, with month and day levels in both hierarchies. In this scenario, you might partition on the month, calendar year, and fiscal year levels. The goal is to create partitions that fit in memory, which optimizes performance. The more memory your computer has, the larger the partitions can be and still achieve this goal. Order Hierarchies You can change the aggregation order of the hierarchies for the selected dimension. Clear Selections You can delete all hierarchy selections from the current display. Any selected hierarchies in other dimensions are unaffected. Edit the Precompute Values You can edit the percentage of values that are calculated and stored during data maintenance. The remaining members are calculated on demand in response to a query. In general, you should precompute the values that are queried most frequently. A value of 0 does not create any aggregate values; they are calculated at run-time to provide the answer sets to queries. The result of 0% pre-aggregation is the fastest maintenance, the least storage space, but the slowest query response time. A value of 100 creates all of the aggregate values, which are simply fetched in response to queries. The result of 100% pre-aggregation is the longest maintenance, the most storage space, but the fastest query response time. Most DBAs choose values between these two
3-24 Oracle OLAP User's Guide
Creating Cubes
extremes to balance the performance requirements for queries with the limitations of a data maintenance window. A value of 1 only creates 1% of the aggregate values, but also creates the data structures for storing and tracking the aggregates. Thus, the amount of time to calculate this small percentage is correspondingly longer. You may want to adjust the percentages over time to balance runtime performance with maintenance restrictions on time and disk space.
Partition Order: The order in which the partitions are aggregated. Partition Name: Name assigned to the partition. Partition Includes: Levels included in the partition. Precompute: The percentage of precomputed values in this partition. You can edit this value unless Disable Editing of Cube Precompute Values is selected in the Configuration dialog box.
Automatically Manage Partition Order You can enable Oracle OLAP to determine the optimal aggregation order. Do not select this option when the aggregation order changes the results. Order is important for some aggregation operators, such as Average, and when a cube uses multiple aggregation methods, such as Hierarchical Last Member for Time and Sum for all other dimensions. This option appears only when the Show Automatic Partitioning Order Check Box is selected in the Analytic Workspace Manager Configuration dialog box.
See Also:
Partition Name: Name of the partition, as shown in the Select Partitions subtab. Number Partitions: Number of partitions created by partitioning on the selected level. Total Members: Total number of dimension members being distributed across the partitions. This number includes the members at the level selected for partitioning and their children at levels included in the partition. Minimum Members: Minimum number of dimension members assigned to a partition. Maximum Members: Maximum number of dimension members assigned to a partition. Average Members: Average number of dimension members assigned to a partition.
Creating Dimensions and Cubes 3-25
Creating Cubes
Standard Deviation: Amount of variation among the partitions from the average. A lower standard deviation is better than a high standard deviation.
Graph The graph illustrates the partition selected in the table. It provides a visual representation of the number of members in each partition and their level in the dimension hierarchy. A tool bar enables you to make temporary changes to the graph. The text tools are disabled. You can use these tools:
Fill Color: Changes the background color surrounding the graph. Graph Type: Provides a variety of standard graph types, as described in Table 31. Legend: Controls whether the legend is displayed. Grid Lines: Controls whether horizontal grid lines are displayed on graphs with an X/Y axis. Gradient Effect: Controls whether colored areas appear solid or with a slight variation in color. 3-D Effect: Controls whether the graph appears flat or three-dimensional.
Partitioning Graph Types Usage Comparisons (default) Comparisons Percentage or comparisons of percentages; relationship between the parts and the whole Trends over time; rate of data change Trends over time; rate of data change Trends over time; effect of one variable on another Correlations of two or three measures Stock prices over time Cyclical or directional patterns Highest and lowest contributors to a total; ranking Three-dimensional comparison
Horizontal Bar Pie Line Area Combination Scatter Stock Circular Pareto 3-D
In the navigation tree, right-click the Cubes folder or the name of a particular cube. Select Maintain Cube. The Maintenance Wizard opens on the Select Objects page.
Creating Cubes
3.
Select one or more cubes from Available Target Objects and use the shuttle buttons to move them to Selected Target Objects. If the dimensions are loaded, you can omit them from Selected Target Objects. If you click Next, the Data Refresh Methods page appears.
4.
The Data Refresh Methods page identifies the cubes and dimensions included in the build, the load options, sort order, refresh methods, and the cube script that defines the steps of the build. Click Help for information about these choices. Figure 316 shows the Data Refresh Methods page.
Figure 316
On the Processing Options page, you can keep the default values. If you click Next, the Scheduling page appears.
6.
On the Scheduling page, you can specify task processing options. You can submit the build to the Oracle job queue or create a SQL script that you can run outside of Analytic Workspace Manager. You can also select the number of processes to dedicate to this build. The number of parallel processes is limited by the smallest of these numbers: the number of partitions in the cube, the number of processes dedicated to the build, and the setting of the JOB_QUEUE_PROCESSES initialization parameter. Click Help for information about these choices.
7.
Click Finish.
Figure 317 shows the build submitted immediately to the Oracle job queue.
Creating Cubes
Figure 317
Example 318, "Maintenance Log for the Units Cube" shows the maintenance log displayed by Analytic Workspace Manager for a cube. The log refreshes throughout the build to provide you with the most up-to-date information. The maintenance log appears automatically for maintenance tasks that run immediately in the session. When you submit a job to the Oracle job queue, you can track its progress through the various reports in the Maintenance Reports folder: Jobs Scheduled, Jobs Running, and Jobs History. The reports in Jobs Running and Jobs History are the same as the one shown in Example 318.
Figure 318 Maintenance Log for the Units Cube
Creating Cubes
In the navigation tree, right-click the cube. Select View Data from the shortcut menu.
The Measure Data Viewer displays the selected measure in a crosstab at the top of the page and a graph at the bottom of the page. On the crosstab, you can expand and collapse the dimension hierarchies that label the rows and columns. You can also change the location of a dimension by pivoting or swapping it. If you want, you can use multiple dimensions to label the columns and rows, by nesting one dimension under another. To change the default display:
To pivot, drag a dimension from one location and drop it at another location, usually above or below another dimension. To swap dimensions, drag and drop one dimension directly over another dimension, so they exchange locations.
To make extensive changes to the selection of data, select Query Builder from the File menu. Figure 319 shows the Units cube in the Measure Viewer.
Figure 319 Displaying the Units Cube
Figure 320
If you are defining cubes to replace existing materialized views, then you use the materialized views as an integral part of data maintenance. Materialized view capabilities restrict the types of analytics that can be performed by a custom cube script.
See Also:
Clear Data: Clears the data from the entire cube, from selected measures, or from selected portions of the cube. You can clear just the detail data (called leaves) for a fast refresh, just the aggregate data, or both for a complete refresh. Clearing old data values is typically done before loading new values. Load: Loads the data from the source tables into the cube. You can load all measures in the cube or just selected measures. Aggregation: Generates aggregate values using the rules defined for the cube. You can aggregate the entire cube, selected measures, or selected portions of the cube. Analyze: Generates optimizer statistics, which can improve the performance of some types of queries. For more information, see "Analyzing Cubes and Dimensions" on page 7-10. Generating statistics is typically done immediately after data maintenance. OLAP DML: Executes a command or program in the OLAP DML. PL/SQL: Executes a PL/SQL command or script. You can run a PL/SQL script, for example, at the beginning of data maintenance to initiate a refresh of the relational source tables.
If a cube is used to support advanced analytics in a cube script, then it cannot be enhanced as a cube materialized view, as described in "Adding Materialized View Capability to a Cube" on page 3-34. In this case, you are responsible for detecting when the data in the cube is stale and must be refreshed.
Expand the folder for a cube that is not defined as a cube materialized view. Right-click Cube Scripts, then select Create Cube Script. The Create Cube Script dialog box appears.
3. 4. 5.
On the General tab, enter a name for the cube script. To create a step, click New Step. Select the type of step. The New Step dialog box appears for that type of step.
6.
Complete the tabs, then click OK. The step is listed on the Cube Script General tab.
7.
Click Create. The cube script appears as an item in the Cube Script folder.
8.
Right-click the cube script on the navigation tree, and select Run Cube Script. The Maintenance Wizard opens.
b.
c.
To view the results, right-click the cube and select View Data.
Figure 321 shows the Create Cube Script dialog box, in which several steps have been defined.
Figure 321 Creating a Cube Script
Expand the Cube Scripts folder for the cube. Right-click the cube script and select Run Cube Script to open the Maintenance Wizard. Follow the steps of the Maintenance Wizard.
Expand the Cube Scripts folder for the cube. Select the cube script so the General tab appears. Select Default Script For This Cube and click Apply. Open the Maintenance Wizard anywhere on the navigation tree and select the cube. Follow the steps of the Maintenance Wizard.
Create a maintenance script. Add the cube script as a step. Run the maintenance script.
These are the same steps described in "Creating and Executing Custom Cube Scripts" on page 3-31.
In the navigation tree, right-click Maintenance Scripts, then select Create Maintenance Script to display the Create Maintenance Script dialog box. Enter the name, labels, and description on the General tab. To create a new step, click Add, then select the type of step from the list. Create additional steps as desired. You can edit, delete, or re-order the steps at any time. Click Create. The new maintenance script appears as an object in the Maintenance Scripts folder.
Figure 322 shows the General tab of the Create Maintenance Script dialog box.
Figure 322 Creating a Maintenance Script
Expand the Maintenance Scripts folder. Right-click the script, then select Run Maintenance Script. The Maintenance Wizard opens. Follow the steps of the Maintenance Wizard.
All dimensions of the cube have at least one level and one level-based hierarchy. Ragged and skip-level hierarchies are not supported. The dimensions must be mapped. All dimensions of the cube use the same aggregation operator, which is either SUM, MIN, or MAX. The cube has one or more dimensions and one or more measures. The cube is fully defined and mapped. For example, if the cube has five measures, then all five are mapped to the source tables. The data type of the cube is NUMBER, VARCHAR2, NVARCHAR2, or DATE. The source detail tables support dimension and rely constraints. If they have not been defined, then use the Relational Schema Advisor on the Materialized Views tab of the cube property sheet to generate a script that defines them on the detail tables. The cube is compressed. The cube can be enriched with calculated measures, but it cannot support more advanced analytics in a cube script.
See Also:
In the navigation tree, select a cube. The property sheets for the cube are displayed.
2. 3.
Select the Materialized Views tab. Review the checklist and, if some tests failed, fix the cause of the problem. You cannot define a cube materialized view until the cube is valid.
4.
For automatic refresh, complete just the top half page. For query rewrite, complete the entire page.
Click Apply.
The cube materialized views appear in the same schema as the analytic workspace. A materialized view is created for the cube and each of its dimensions. Unlike traditional materialized views, cube materialized views do not use relational tables to store data; the data is stored in the backing cube. A CB$ prefix identifies the tables as cube materialized views. The initial state of a new materialized view is invalid, so it does not support query rewrite until after it is refreshed. You can specify the first refresh time on the Materialized View tab of the cube, or you can run the Maintenance Wizard. Figure 323 shows the Materialized View tab of the Units Cube.
Figure 323 Defining a Materialized View
In the navigation tree, expand the folder for the analytic workspace. Select Languages to display its property page. On the General tab, click Modify Languages.
4.
On the Modify Languages dialog box, select the languages that the analytic workspace must support. Use the shuttle keys to move them to the Selected Languages box. Click OK to return to the Languages property page. Enter the translations of the various labels and descriptions. Each language has a column where you can enter this information. For each dimension, open the Mappings window. Map the attributes to the source columns for each language.
5. 6. 7.
Expand the folder for the analytic workspace. Right-click Measure Folders, then select Create Measure Folder from the shortcut menu. Complete the General tab of the Create Measure Folder dialog box. Click Help for specific information about these choices.
The measure folder appears in the navigation tree under Measure Folders. You can also create subfolders.
Creating Dimensional Objects From XML Templates Saving Object Definitions to XML Templates Creating Analytic Workspaces from EIF Files Saving Analytic Workspaces to EIF Files
See Also:
"Mapping Cubes" on page 3-18 for information on saving the SQL statements for a mapping.
1. 2. 3. 4.
In the navigation tree, right-click Analytic Workspaces, Dimensions, Cubes, or Measure Folders. Select Create Object from Template to display the Create Object from Template dialog box. Select the schema in which to create the objects and click OK. Complete the Create Object from Template dialog box. To overwrite the metadata for an existing object select Modify Existing Objects on the Options tab.
See Also:
In the navigation tree, right-click an analytic workspace, dimension, cube, or measure folder. Select Save Object to Template to display the Save Object to Template dialog box. Select Save to File or Save to Table. Verify the selection of objects in the Object Selection tab. To modify the use of the schema name in the template, use the Options tab. Complete the remaining fields to identify the name and location of the saved template. You can overwrite an existing template.
See Also:
In the navigation tree, right-click Analytic Workspaces and select Create Analytic Workspace From EIF File. The Create Analytic Workspace From EIF File dialog box appears.
2.
Specify the directory that contains the EIF file and the name of the file, a name for the new analytic workspace and the tablespace for it, and then click OK.
See Also:
1. 2. 3.
In the navigation tree, right-click the analytic workspace. Select Export Analytic Workspace ObjectTo EIF File to display the Export Analytic Workspace Object to EIF File dialog box. Specify the directory and file name for the EIF file, then click OK.
See Also:
In the navigation tree, right-click an analytic workspace, dimension, cubes, measure, or calculated measure. Select Copy.
In the navigation tree, right-click Analytic Workspaces, Dimensions, Cubes, Measures, or Calculated Measures. Select Paste.
4
4
Oracle OLAP adds power to your SQL applications by providing extensive analytic content and fast query response times. A SQL query interface enables any application to query cubes and dimensions without any knowledge of OLAP. The OLAP option automatically generates a set of relational views on cubes, dimensions, and hierarchies. SQL applications query these views to display the information-rich contents of these objects to analysts and decision makers. You can also create custom views that follow the structure expected by your applications, using the system-generated views like base tables. In this chapter, you learn the basic methods for querying dimensional objects in SQL. It contains the following topics:
Exploring the OLAP Views Creating Basic Queries Creating Hierarchical Queries Using Calculations in Queries Using Attributes for Aggregation Joining Cubes to Tables and Views Viewing Execution Plans Querying the Data Dictionary
See Also:
"Developing Reports and Dashboards Using SQL Tools and Application Builders" on page 1-3 for a sample dashboard created using Oracle Application Express "Overview of the Dimensional Data Model" on page 1-5 for a discussion of cubes, dimensions, and hierarchies
These views are related in the same way as fact and dimension tables are in a star schema. Cube views serve the same function as fact tables, and hierarchy views and dimension views serve the same function as dimension tables. Typical queries join a cube view with either a hierarchy view or a dimension view.
Cube Views
Each cube has a cube view that presents the data for all the measures and calculated measures in the cube. You can use a cube view like a fact table in a star or snowflake schema. However, the cube view contains all the summary data in addition to the detail level data.
The next query returns the names of all the cube views in your schema from USER_ CUBE_VIEWS:
SELECT view_name FROM user_cube_views; VIEW_NAME -----------------------------UNITS_CUBE_VIEW PRICE_CUBE_VIEW
The USER_CUBE_VIEW_COLUMNS data dictionary view describes the columns of a cube view, as shown by the following query.
SELECT column_name, column_type FROM user_cube_view_columns 4-2 Oracle OLAP User's Guide
WHERE view_name = 'UNITS_CUBE_VIEW'; COLUMN_NAME -----------------------------SALES UNITS COST SALES_PP SALES_CHG_PP SALES_PCTCHG_PP SALES_PROD_SHARE_PARENT SALES_PROD_SHARE_TOTAL SALES_PROD_RANK_PARENT_PP TIME CUSTOMER PRODUCT CHANNEL 13 rows selected. COLUMN_TYPE -------------MEASURE MEASURE MEASURE MEASURE MEASURE MEASURE MEASURE MEASURE MEASURE KEY KEY KEY KEY
You can use dimension views and hierarchy views like dimension tables in a star schema.
USER_CUBE_HIER_VIEWS identifies the hierarchy views for all the dimensions. For a hierarchy view, the default name is dimension_hierarchy_VIEW. The following query returns the dimension, hierarchy, and view names.
SELECT * FROM user_cube_hier_views ORDER BY dimension_name; DIMENSION_NAME --------------CHANNEL CUSTOMER CUSTOMER PRODUCT TIME TIME HIERARCHY_NAME --------------PRIMARY MARKET SHIPMENTS PRIMARY FISCAL CALENDAR VIEW_NAME -----------------------------CHANNEL_PRIMARY_VIEW CUSTOMER_MARKET_VIEW CUSTOMER_SHIPMENTS_VIEW PRODUCT_PRIMARY_VIEW TIME_FISCAL_VIEW TIME_CALENDAR_VIEW
22 rows selected.
SELECT dim_key, long_description description, parent, calendar_year year, calendar_quarter quarter, month FROM time_calendar_view WHERE calendar_year='CY2005' ORDER BY level_name, end_date; DIM_KEY -----------CY2005.Q1 CY2005.Q2 CY2005.Q3 CY2005.Q4 CY2005 2005.01 2005.02 2005.03 2005.04 2005.05 2005.06 2005.07 2005.08 2005.09 2005.10 2005.11 2005.12 DESCRIPTION -----------Q1.05 Q2.05 Q3.05 Q4.05 2005 JAN-05 FEB-05 MAR-05 APR-05 MAY-05 JUN-05 JUL-05 AUG-05 SEP-05 OCT-05 NOV-05 DEC-05 PARENT -----------CY2005 CY2005 CY2005 CY2005 TOTAL CY2005.Q1 CY2005.Q1 CY2005.Q1 CY2005.Q2 CY2005.Q2 CY2005.Q2 CY2005.Q3 CY2005.Q3 CY2005.Q3 CY2005.Q4 CY2005.Q4 CY2005.Q4 YEAR -----------CY2005 CY2005 CY2005 CY2005 CY2005 CY2005 CY2005 CY2005 CY2005 CY2005 CY2005 CY2005 CY2005 CY2005 CY2005 CY2005 CY2005 QUARTER MONTH ------------ -----------CY2005.Q1 CY2005.Q2 CY2005.Q3 CY2005.Q4 CY2005.Q1 CY2005.Q1 CY2005.Q1 CY2005.Q2 CY2005.Q2 CY2005.Q2 CY2005.Q3 CY2005.Q3 CY2005.Q3 CY2005.Q4 CY2005.Q4 CY2005.Q4 2005.01 2005.02 2005.03 2005.04 2005.05 2005.06 2005.07 2005.08 2005.09 2005.10 2005.11 2005.12
17 rows selected.
Apply a filter to every dimension. The cube contains both detail level and aggregated data. A query with an unfiltered dimension typically returns more data than users need, which negatively impacts performance.
Let the cube aggregate the data. Because the aggregations are calculated in the cube, a typical query does not need a GROUP BY clause. Simply select the aggregations you want by using the appropriate filters on the dimension keys or attributes.
Several data dictionary views list the names of the levels. The following example queries USER_CUBE_HIER_LEVELS.
SELECT level_name FROM user_cube_hier_levels WHERE dimension_name = 'TIME' AND hierarchy_name ='CALENDAR'; LEVEL_NAME -------------------TOTAL CALENDAR_YEAR CALENDAR_QUARTER MONTH
To see the importance of applying a filter to every dimension, consider the query in Example 41, which has no filter on the time dimension.
Example 41 Displaying Aggregates at All Levels of Time /* Select key descriptions and facts */ SELECT t.long_description time, ROUND(f.sales) sales /* From dimension views and cube view */ FROM time_calendar_view t, product_primary_view p, customer_shipments_view cu, channel_primary_view ch, units_cube_view f /* No filter on Time */ WHERE p.level_name = 'TOTAL' AND cu.level_name = 'TOTAL' AND ch.level_name = 'TOTAL' /* Join dimension views to cube view */ AND t.dim_key = f.time AND p.dim_key = f.product AND cu.dim_key = f.customer AND ch.dim_key = f.channel ORDER BY t.end_date;
Without a filter on the Time dimension, the query returns values for every level of time. This is more data than users typically want to see, and the volume of data returned can negatively impact performance.
TIME SALES ---------- ---------JAN-98 8338545 FEB-98 7972132 Q1.98 24538588 MAR-98 8227911 APR-98 8470315 MAY-98 8160573 JUN-98 8362386 Q2.98 24993273 JUL-98 8296226 AUG-98 8377587 SEP-98 8406728
Now consider the results when a filter restricts Time to yearly data. Example 42 shows a basic query. It selects the Sales measure from UNITS_CUBE_VIEW, and joins the keys from the cube view to the hierarchy views to get descriptions of the keys.
Example 42 Basic Cube View Query /* Select key descriptions and facts */ SELECT t.long_description time, ROUND(f.sales) sales /* From dimension views and cube view */ FROM time_calendar_view t, product_primary_view p, customer_shipments_view cu, channel_primary_view ch, units_cube_view f /* Create level filters */ WHERE t.level_name = 'CALENDAR_YEAR' AND p.level_name = 'TOTAL' AND cu.level_name = 'TOTAL' AND ch.level_name = 'TOTAL' /* Join dimension views to cube view */ AND t.dim_key = f.time AND p.dim_key = f.product AND cu.dim_key = f.customer AND ch.dim_key = f.channel ORDER BY t.end_date;
Example 42 selects the following rows. For CUSTOMER, PRODUCT, and CHANNEL, only one value is at the top level. TIME has a value for each calendar year.
TIME SALES -------- ---------1998 100870877 1999 134109248 2000 124173522 2001 116931722 2002 92515295 2003 130276514 2004 144290686 2005 136986572 2006 140138317
Dimension attributes also provide a useful way to select the data for a query. The WHERE clause in Example 43 uses attributes values to filter all of the dimensions.
Example 43 Selecting Data with Attribute Filters /* Select key descriptions and facts */ SELECT t.long_description time,
p.long_description product, cu.long_description customer, ch.long_description channel, ROUND(f.sales) sales /* From dimension views and cube view */ FROM time_calendar_view t, product_primary_view p, customer_shipments_view cu, channel_primary_view ch, units_cube_view f /* Create attribute filters */ WHERE t.long_description in ('2005', '2006') AND p.package = 'Laptop Value Pack' AND cu.long_description LIKE '%Boston%' AND ch.long_description = 'Internet' /* Join dimension views to cube view */ AND t.dim_key = f.time AND p.dim_key = f.product AND cu.dim_key = f.customer AND ch.dim_key = f.channel ORDER BY time, customer;
The query selects two calendar years, the products in the Laptop Value Pack, the customers in Boston, and the Internet channel.
TIME -----2005 2005 2005 2005 2005 2005 2005 2006 2006 2006 2006 2006 2006 2006 PRODUCT -----------------------------Laptop carrying case 56Kbps V.92 Type II Fax/Modem Internal 48X CD-ROM Standard Mouse Envoy Standard Laptop carrying case Standard Mouse Standard Mouse Laptop carrying case 56Kbps V.92 Type II Fax/Modem Internal 48X CD-ROM Envoy Standard Laptop carrying case Standard Mouse CUSTOMER --------------------KOSH Entrpr Boston KOSH Entrpr Boston KOSH Entrpr Boston KOSH Entrpr Boston Warren Systems Boston Warren Systems Boston Warren Systems Boston KOSH Entrpr Boston KOSH Entrpr Boston KOSH Entrpr Boston KOSH Entrpr Boston Warren Systems Boston Warren Systems Boston Warren Systems Boston CHANNEL SALES -------- ---------Internet 5936 Internet 45285 Internet 2828 Internet 638 Internet 19359 Internet 13434 Internet 130 Internet 555 Internet 6357 Internet 38042 Internet 3343 Internet 24198 Internet 13153 Internet 83
14 rows selected.
The next query fetches the exact same results from a cube using filters:
/* Querying a cube */ SELECT t.long_description time, f.sales sales FROM time_calendar_view t, product_primary_view p, customer_shipments_view cu, channel_primary_view ch, units_cube_view f /* Apply filters to every dimension */ WHERE t.long_description IN ('2005', '2006') AND p.level_name = 'TOTAL' AND cu.level_name = 'TOTAL' AND ch.level_name = 'TOTAL' /* Join dimension views to cube view */ AND t.dim_key = f.TIME AND p.dim_key = f.product AND cu.dim_key = f.customer AND ch.dim_key = f.channel ORDER BY time;
The query against the cube does not compute the aggregate values with a SUM operator and GROUP BY clause. Because the aggregates exist in the cube, this would re-aggregate previously aggregated data. Instead, the query selects the aggregates directly from the cube and specifies the desired aggregates by applying the appropriate filter to each dimension.
Query Processing
The most efficient queries allow the OLAP engine to filter the data, so that the minimum number of rows required by the query are returned to SQL. The following are among the WHERE clause operations that are pushed into the OLAP engine for processing:
The OLAP engine also processes nested character functions, including INSTR, LENGTH, NVL, LOWER, UPPER, LTRIM, RTRIM, TRIM, LPAD, RPAD, and SUBSTR.
SQL processes other operations and functions in the WHERE clause, and all operations in other parts of the SELECT syntax.
The query drills down from Year to Quarter. The four quarters Q1-05 to Q4-05 are the children of year CY2005 in the Calendar hierarchy.
TIME SALES -------- ---------Q1.05 31381338 Q2.05 37642741 Q3.05 32617249 Q4.05 35345244
Drilling Up to Parents
The PARENT column of a hierarchy view identifies the parent of each dimension key. Columns of level keys identify the full heritage. The following WHERE clause selects the parent of a Time key based on its LONG_DESCRIPTION attribute.
/* Select the parent of a Time key*/ WHERE t.dim_key = (SELECT DISTINCT parent FROM time_calendar_view WHERE long_description='JAN-05') AND p.dim_key= 'TOTAL'
The query drills up from Month to Quarter. The parent of month JAN-05 is the quarter Q1-05 in the Calendar hierarchy.
TIME SALES -------- ---------Q1.05 31381338
The query drills down two levels, from year to quarter to month. The 12 months Jan-05 to Dec-05 are the descendants of year 2005 in the Calendar hierarchy.
TIME SALES -------- ---------JAN-05 12093518 FEB-05 10103162 MAR-05 9184658 APR-05 9185964 MAY-05 11640216 JUN-05 16816561 JUL-05 11110903 AUG-05 9475807 SEP-05 12030538 OCT-05 11135032 NOV-05 11067754 DEC-05 13142459
Drilling Up to Ancestors
The hierarchy views provide the full ancestry of each dimension key, as shown in "Displaying the Contents of a Hierarchy View" on page 4-5. The following WHERE clause uses the CALENDAR_YEAR level key column to identify the ancestor of a MONTH dimension key.
/* Select the ancestor of a Time key based on its Long Description attribute */ WHERE t.dim_key = (SELECT calendar_year FROM time_calendar_view WHERE long_description = 'JAN-05') AND p.dim_key = 'TOTAL' AND cu.dim_key = 'TOTAL' AND ch.dim_key = 'TOTAL'
The query drills up two levels from month to quarter to year. The ancestor of month Jan-05 is the year 2005 in the Calendar hierarchy.
TIME SALES -------- ----------
2005
136986572
This is the data extracted from the cube by the inner query:
CUSTOMER ---------------------------------------Dept. of Labor Dept. of Labor Ministry of Intl Trade Ministry of Intl Trade Ministry of Intl Trade Royal Air Force Royal Air Force UK Environmental Department TIME SALES -------- ---------JAN-05 1553.26 MAR-05 1555.6 JAN-05 1553.26 FEB-05 1554.56 MAR-05 1555.6 JAN-05 1553.26 FEB-05 6218.23 JAN-05 4659.78 Querying Dimensional Objects 4-13
UK Environmental Department
FEB-05
3109.12
The outer query calculates the minimum, maximum, and average sales for each customer:
CUSTOMER MINIMUM MAXIMUM AVERAGE ------------------------------ ---------- ---------- ---------Dept. of Labor 1553 1556 1554 Ministry of Intl Trade 1553 1556 1554 Royal Air Force 1553 6218 3886 UK Environmental Department 3109 4660 3884
Units Cube uses the SUM operator for all dimensions, and the query uses the SUM operator to aggregate over Sales. The Package attribute applies only to the Item level of the Product dimension, so the query selects the Item level of Product. It also eliminates nulls for Package, so that only products that belong to a package are included in the calculation. The GROUP BY clause breaks out Total Sales by Time and Package.
Example 45 Aggregating Over an Attribute SELECT t.long_description time, p.package package,
SUM(f.sales) sales FROM time_calendar_view t, product_primary_view p, customer_shipments_view cu, channel_primary_view ch, units_cube_view f /* Select Product by level and attribute */ WHERE p.level_name = 'ITEM' AND p.package IS NOT NULL AND t.long_description = '2005' AND cu.level_name = 'TOTAL' AND ch.level_name = 'TOTAL' /* Join dimensions and cube */ AND t.dim_key = f.time AND p.dim_key = f.product AND cu.dim_key = f.customer AND ch.dim_key = f.channel GROUP BY t.long_description, p.package;
Example 46 shows a query that aggregates over the Package attribute and calculates Percent Change From Prior Period. The inner query aggregates Sales and Sales Prior Period over the attributes, and the outer query uses the results to compute the percent change. These are the results of the query, which show the expected results for PCT_ CHG_PP:
TIME -----2005 2006 2005 2006 2005 2006 2005 2006 PACKAGE -----------------All All Executive Executive Laptop Value Pack Laptop Value Pack Multimedia Multimedia SALES PRIOR_PERIOD PCT_CHG_PP ---------- ------------ ---------1809157.64 1853928.06 -.02414895 1720399.03 1809157.64 -.04906074 19836977 20603879.8 -.03722128 19580638.4 19836977 -.01292226 9547494.81 10047298.6 -.04974509 9091450.58 9547494.81 -.04776585 18083256.3 19607675.5 -.07774604 18328678.7 18083256.3 .013571806
8 rows selected. Example 46 Querying Over Attributes Using Calculated Measures /* Calculate Percent Change */
SELECT TIME, package, sales, prior_period, ((sales - prior_period) / prior_period) pct_chg_pp FROM /* Fetch data from the cube and aggregate over Package */ (SELECT t.long_description time, p.package package, SUM(f.sales) sales, SUM(f.sales_pp) prior_period FROM time_calendar_view t, product_primary_view p, customer_shipments_view cu, channel_primary_view ch, units_cube_view f /* Create filters */ WHERE p.level_name = 'ITEM' AND p.package IS NOT NULL AND t.long_description IN ('2005', '2006') AND cu.level_name = 'TOTAL' AND ch.level_name = 'TOTAL' /* Join dimension views to cube view */ AND t.dim_key = f.time AND p.dim_key = f.product AND cu.dim_key = f.customer AND ch.dim_key = f.channel GROUP BY t.long_description, p.package ORDER BY p.package);
Tables Views including external tables and PL/SQL table functions Other row source types, like other joins
Typically, you do not need a fully aggregated cube when joining it to a table or view, and a CUBE JOIN operation limits the number of fetched values to improve performance automatically. The cube must be on the right side of the equation. If the query does not support CUBE JOIN, then the more expensive HASH JOIN, MERGE JOIN, or NESTED LOOPS are commonly used. You can use hints in the query to influence the use of CUBE JOIN:
See "Viewing Execution Plans" on page 4-17 for more information about CUBE JOIN. Example 47 joins a table that contains French descriptions of the Customer dimension to a cube that supports only English. The query returns these results:
CUSTOMER SALES ---------------------------------------- -----------La Marine des USA Washington 600.34 Monolith Motor Co. Chattanooga 17946.51 Piedmont, Inc. San Jose 24874.41 Ministere du Commerce Int. Nagano 27595.97 Depart. des commun. - Stuttgart 30706.10 Min. Env. Brit. Londres 38125.77 Departement de travail Nouvelle-Orleans 42507.50
Ministere des Finances Sorbonne Monolith Motor Co. Knoxville Serv. des USA de recherche Wyo Depart. des commun. - Bonn . . . Example 47 Joining a Cube and a Table SELECT cu.ship_to_dsc_french customer, f.sales sales FROM time_calendar_view t, product_primary_view p, customer_dim cu, channel_primary_view ch, units_cube_view f WHERE t.dim_key = 'CY2006' AND p.level_name = 'TOTAL' AND ch.level_name = 'TOTAL' AND t.dim_key = f.TIME AND p.dim_key = f.product AND cu.ship_to_id = f.customer AND ch.dim_key = f.channel ORDER BY f.sales;
ORDER BY t.end_date;
The DISPLAY table function of the DBMS_XPLAN PL/SQL package formats and displays information from an execution plan, as shown in Example 48.
Example 48 Execution Plan for a Cube Query
SQL> SELECT plan_table_output FROM TABLE(dbms_xplan.display()); PLAN_TABLE_OUTPUT -----------------------------------------------------------------------------------------Plan hash value: 1667678335 ---------------------------------------------------------------------------------------------| Id | Operation | Name | Rows | Bytes | Cost (%CPU)| Time | ---------------------------------------------------------------------------------------------| 0 | SELECT STATEMENT | | 1 | 100 | 104 (3)| 00:00:02 | | 1 | SORT ORDER BY | | 1 | 100 | 104 (3)| 00:00:02 | | 2 | JOINED CUBE SCAN PARTIAL OUTER| | | | | | | 3 | CUBE ACCESS | UNITS_CUBE | | | | | | 4 | CUBE ACCESS | CHANNEL | | | | | | 5 | CUBE ACCESS | CUSTOMER | | | | | | 6 | CUBE ACCESS | PRODUCT | | | | | |* 7 | CUBE ACCESS | TIME | 1 | 100 | 103 (2)| 00:00:02 | ---------------------------------------------------------------------------------------------Predicate Information (identified by operation id): --------------------------------------------------7 - filter(SYS_OP_ATG(VALUE(KOKBF$),12,13,2)='CALENDAR_YEAR' AND SYS_OP_ATG(VALUE(KOKBF$),43,44,2)='TOTAL' AND SYS_OP_ATG(VALUE(KOKBF$),33,34,2)='TOTAL' AND SYS_OP_ATG(VALUE(KOKBF$),23,24,2)='TOTAL') 22 rows selected.
Example 49 shows an execution plan for a query that joins a cube and a table. See "Joining Cubes to Tables and Views" on page 4-16 for the query.
Example 49 Execution Plan for a Cube Join
PLAN_TABLE_OUTPUT ------------------------------------------------------------------------------------------------Plan hash value: 3634608218 ------------------------------------------------------------------------------------------------| Id | Operation |Name |Rows | Bytes |TempSpc|Cost (%CPU)| Time | -------------------------------------------------------------------------------------------------| 0 | SELECT STATEMENT | |1464 | 128K| | 1524 (94)| 00:00:19| | 1 | SORT ORDER BY | |1464 | 128K| 152K| 1524 (94)| 00:00:19| |* 2 | CUBE JOIN | |1464 | 128K| | 1422 (100)| 00:00:18| | 3 | TABLE ACCESS FULL |CUSTOMER_DIM| 61 | 2379 | | 4 (0)| 00:00:01| | 4 | JOINED CUBE SCAN PARTIAL OUTER| | | | | | | | 5 | CUBE ACCESS |UNITS_CUBE | | | | | | | 6 | CUBE ACCESS |CHANNEL | | | | | | | 7 | CUBE ACCESS |PRODUCT | | | | | | |* 8 | CUBE ACCESS |TIME |2520 | 125K| | 1417 (100)| 00:00:18| -------------------------------------------------------------------------------------------------Predicate Information (identified by operation id): ---------------------------------------------------
See Also: Oracle Database SQL Language Reference for descriptions of these join types.
Table 42 View
Static Data Dictionary Views for OLAP Description Describes the visibility of the attributes for cube dimensions. Describes the attributes for cube dimensions. Describes the cube build processes and maintenance scripts. Describes the calculated members (keys) for cube dimensions. Describes the cube dimension levels. Describes the models for cube dimensions. Describes the columns of the system-generated relational views of cube dimensions. Describes the system-generated relational views of OLAP dimensions. Describes the dimension order of the OLAP cubes. Describes the cube dimensions. Describes the hierarchy levels for cube dimensions. Describes the columns of relational hierarchy views of cube dimensions. Describes the hierarchies for cube dimensions. Describes the OLAP dimension hierarchies. Describes the measures in the OLAP cubes. Describes the columns of the relational views of OLAP cubes. Describes the system-generated relational views of OLAP cubes. Describes the OLAP cubes. Describes the contents of OLAP measure folders. Describes the OLAP measure folders.
ALL_CUBE_ATTR_VISIBILITY ALL_CUBE_ATTRIBUTES ALL_CUBE_BUILD_PROCESSES ALL_CUBE_CALCULATED_MEMBERS ALL_CUBE_DIM_LEVELS ALL_CUBE_DIM_MODELS ALL_CUBE_DIM_VIEW_COLUMNS ALL_CUBE_DIM_VIEWS ALL_CUBE_DIMENSIONALITY ALL_CUBE_DIMENSIONS ALL_CUBE_HIER_LEVELS ALL_CUBE_HIER_VIEW_COLUMNS ALL_CUBE_HIER_VIEWS ALL_CUBE_HIERARCHIES ALL_CUBE_MEASURES ALL_CUBE_VIEW_COLUMNS ALL_CUBE_VIEWS ALL_CUBES ALL_MEASURE_FOLDER_CONTENTS ALL_MEASURE_FOLDERS
See Also:
5
5
Oracle OLAP provides an extensive set of analytic functions for enhancing your database with information-rich content. This chapter explains how you can use Analytic Workspace Manager to create calculated measures using templates and free-form calculations. This chapter contains the following topics:
What Is a Calculated Measure? Functions for Defining Calculations Creating Calculated Measures Using Calculation Templates Creating User-Defined Expressions Creating Calculated Measures Using the OLAP DML
Arithmetic Operators: Perform calculations on the values of two measures. Analytic Functions: Perform calculations on an ordered series or a range of values in a single measure or column. Single-Row Functions: Perform calculations on a value in a single row.
5-1
Arithmetic Operators
You can perform the following arithmetic operations using two measures. The calculations in the cube are performed on a cell-by-cell basis at all levels of the dimension hierarchies.
Addition: Adds the values of two measures. Subtraction: Subtracts the values of one measure from the values of another measure. Multiplication: Multiplies the values of two measures. Division or Ratio: Divides the values of one measure by the values of another measure. Percent Difference: Calculates the percent difference between the values of two measures.
The arithmetic operations are available in Analytic Workspace Manager as templates. as described in "Using Calculation Templates" on page 5-6.
Analytic Functions
The analytic functions provide the most powerful computations and fuel the most useful queries for business intelligence and similar applications. They include a variety of rank, share, time series, and other single-column functions. The analytic functions enable analysts and decision makers to make comparisons and identify trends. Analytic functions provided by Oracle OLAP leverage the knowledge associated with the dimensions about levels and family relationships. Time dimensions have additional information that enables them to support time series methods such as lags, leads, moving and cumulative calculations. Because the knowledge is stored with the dimension, you do not need to specify these relationships when creating a calculated measure. The analytic functions are available in Analytic Workspace Manager as templates. They are described in "Using Calculation Templates" on page 5-6.
Single-Row Functions
Oracle OLAP supports most of the SQL single-row functions including:
Numeric functions such as ABS, CEIL, FLOOR, POWER, ROUND, and TRUNC. Character functions such as CONCAT, LPAD, RPAD, LTRIM, RTRIM, REPLACE, and SUBSTR. Datetime functions such as CURRENT_DAY, MONTHS_BETWEEN, NEXT_DAY, and SYSTIMESTAMP. Comparison functions GREATEST and LEAST. Conversion functions such as TO_CHAR, TO_DATE, TO_NUMBER, and TO_TIMESTAMP.
You can use these functions to manipulate the data values in a measure, typically as part of a more complex calculation. These functions are not available as templates, but you can use them in free-form calculations, as described in "Creating User-Defined Expressions" on page 5-14.
In the navigation tree, right-click a cube and select Generate Calculated Measures. On the Calculations tab, select the measures on which to base the calculated measures. Scroll down the Calculation Details and select each type of calculated measure you want to create for this selection of measures. Modify the calculations as desired by altering the templates. Select the Time dimension to use for time series calculations. Review the list of calculated measures. You can change the generated names by using the Naming Rules tab. Click Generate Calculations to create the calculated measures. Repeat this procedure if you want to generate variations of the same basic types of calculations, such as another Share calculation for the same measure but on a different dimension. Change the naming rules to generate new, unique names.
4. 5. 6. 7.
5-3
In the navigation tree, expand a cube folder. Right-click Calculated Measures, then select Create Calculated Measure from the context menu. The Create Calculated Measure dialog box appears.
3. 4.
Enter a descriptive name. Select a calculation type. Your choice of an arithmetic or analytic function dynamically changes the Calculation template.
5. 6.
Modify the calculation template. Click Create. The calculated measure appears in the navigation tree in the Calculated Measures folder.
7.
Select the Advanced option to display the References, Dependencies, and Expressions tabs. The tabs have the following information:
The References tab has a table that lists the measures that Analytic Workspace Manager references as it performs the calculations specified by this calculated measure. If the Enable SQL Expressions option is selected for the cube, then the table has a check mark in the Create column for any additional calculated measure that Analytic Workspace automatically creates. The Dependencies tab has a table that lists the other calculated measures that depend on this calculated measure. Analytic Workspace Manager uses this calculated measure as it performs the calculations for the measures in this table.
The Expressions tab has a table that lists the expressions used by the calculated measure. This tab appears only if the Enable SQL Expressions option is selected for the cube.
Modifying a Template
The calculation that you selected is presented as template, which is a description of the calculation with variable parts that enable you to customize it. Figure 53 shows the template for calculating the prior period. You can view the choice lists by clicking the links.
Figure 53 Changing the Variable Parts of a Calculation
You can include all values of a measure in a calculation, or, for some types of calculations, you can filter the measure to include only a selection of values. To limit one or more dimensions to a single dimension member, click the ellipses (...) next to the measure. The Qualify Measure dialog box appears, as shown in Figure 54.
5-5
Level: Calculates all time periods at the same level, so that all months in the cube are included in one calculation, all quarters are included in another calculation, and so forth. Parent: Calculates all time periods with the same parent, so that all months in Q1-07 are included in one calculation, all months in Q2-07 are included in another calculation, and so forth. Ancestor at level: Calculates all time periods with the same ancestor at a specified level. For example, if the specified level is Year in a Year-Quarter-Month hierarchy, then Q1-06 to Q4-06 are included in one calculation, Q1-07 to Q4-07 are included in another calculation, Jan-06 to Dec-06 are included in a third calculation, and so forth. Any levels higher in the hierarchy are not calculated. Gregorian periods: The Gregorian periods -- Year, Quarter, Month, and Week -impose the Gregorian calendar onto the selected hierarchy. This can be useful for analyzing data that uses nonstandard calendar hierarchies. For example, if you use Gregorian Year on a fiscal hierarchy that begins July 1 and ends June 30, then the last half of one fiscal year and the first half of the next fiscal year are calculated together. Time periods higher in the hierarchy than the specified Gregorian period are not calculated.
Arithmetic Calculations
Basic mathematical operations enable you to perform cell-by-cell calculations on two measures, as described in "Arithmetic Operators" on page 5-2.
Arithmetic Example This template defines a calculated measure for the Global Price Cube using Percent Difference:
Percent difference between measure UNIT_PRICE and measure UNIT_COST.
A query against this calculated measure returns results like these. The PCT_CHG column shows the percent change between PRICE and COST, which is calculated as PRICE-COST/COST.
PRODUCT PRICE COST PCT_DIFF -------------------- ---------- ---------- ---------Envoy Ambassador 2892 2664 .09 Envoy Executive 2803 2644 .06 Envoy Standard 1662 1737 -.04 Sentinel Financial 1755 1658 .06 Sentinel Multimedia 1770 1813 -.02 Sentinel Standard 1552 1410 .1
Index
An index is a mathematical operation calculated on a single measure. An index calculates the percentage difference between the values of a measure and a selected value that serves as a base number. An index does not use a calculation template. Instead, it provides a list of dimension members for each dimension of the cube, from which you can choose one to use as an index, as shown in Figure 55.
Figure 55 Calculating a Product Index
Index Example This example creates an index on the Product dimension using Desktop PCs as the index.
PRODUCT SALES PROD_INDEX -------------------- ---------- ---------Desktop PCs 76682955 100 Portable PCs 18072328 24 CD/DVD 17302122 23 Modems/Fax 5565552 7 Memory 5347292 7 Monitors 3926632 5
5-7
Prior Period: Returns the value of a measure at an earlier time period. Difference From Prior Period: Calculates the difference between values for the current time period and an earlier period. Percent Difference From Prior Period: Calculates the percent difference between the values for the current time period and an earlier period. Future Period: Returns the value of a measure at a later time period. Difference From Future Period: Calculates the difference between the values for the current time period and a later period. Percent Difference From Future Period: Calculates the percent difference between the values for the current time period and a later period.
When creating a calculation for prior or future time periods, you choose the measure, the time dimension, the hierarchy, and the number of periods from the current period. Prior Period Example This template defines a calculated measure using Prior Period:
Prior period for measure SALES in TIME dimension and TIME.CALENDAR hierarchy 1 period ago.
These are the results of a query against the calculated measure. The PRIOR_PERIOD column shows the value of Sales for the preceding period at the same level in the Calendar hierarchy.
TIME -------2005 2006 Q1.05 Q2.05 Q3.05 Q4.05 Q1.06 Q2.06 Q3.06 Q4.06 TIME_LEVEL SALES PRIOR_PERIOD -------------------- ---------- -----------CALENDAR_YEAR 136986572 144290686 CALENDAR_YEAR 140138317 136986572 CALENDAR_QUARTER 31381338 41988687 CALENDAR_QUARTER 37642741 31381338 CALENDAR_QUARTER 32617249 37642741 CALENDAR_QUARTER 35345244 32617249 CALENDAR_QUARTER 36154815 35345244 CALENDAR_QUARTER 36815657 36154815 CALENDAR_QUARTER 32318935 36815657 CALENDAR_QUARTER 34848911 32318935
Period to Date
Period-to-date functions perform a calculation over time periods with the same parent up to the current period. These functions calculate period-to-date:
Period to Date: Calculates the values up to the current time period. Period to Date Period Ago: Calculates the data values up to a prior time period. Difference From Period to Date Period Ago: Calculates the difference in data values up to the current time period compared to the same calculation up to a prior period. Percent Difference From Period To Date Period Ago: Calculates the percent difference in data values up to the current time period compared to the same calculation up to a prior period.
When creating a period-to-date calculation, you can choose from these aggregation methods:
You also choose the measure, the time dimension, and the hierarchy. Period to Date Example This template defines a calculated measure using Period to Date.
Gregorian Year to date for SALES in the TIME dimension and TIME.CALENDAR hierarchy. Aggregate using MINIMUM from the beginning of the period.
These are the results of a query against the calculated measure. The MIN_TO_DATE column displays the current minimum SALES value within the current level and year.
TIME -------Q1.06 Q2.06 Q3.06 Q4.06 JAN-06 FEB-06 MAR-06 APR-06 MAY-06 JUN-06 JUL-06 AUG-06 SEP-06 OCT-06 NOV-06 DEC-06 TIME_LEVEL SALES MIN_TO_DATE -------------------- ---------- ----------CALENDAR_QUARTER 36154815 36154815 CALENDAR_QUARTER 36815657 36154815 CALENDAR_QUARTER 32318935 32318935 CALENDAR_QUARTER 34848911 32318935 MONTH 13119235 13119235 MONTH 11441738 11441738 MONTH 11593842 11441738 MONTH 11356940 11356940 MONTH 13820218 11356940 MONTH 11638499 11356940 MONTH 9417316 9417316 MONTH 11596052 9417316 MONTH 11305567 9417316 MONTH 11780401 9417316 MONTH 10653184 9417316 MONTH 12415325 9417316
Share
Share calculates the ratio of a measure's value for the current dimension member to the value for a related member of the same dimension. You can choose whether the related member is:
Top of hierarchy: Calculates the ratio of each member to the total. Member's parent: Calculates the ratio of each member to its parent. Member's ancestor at level: Calculates the ratio of each member to its ancestor, that is, a member at a specified level higher in the hierarchy.
When creating a share calculation, you can choose the measure, dimension, and hierarchy. You also have the option of multiplying the results by 100 to get percentages instead of fractions. Share Example This template defines a calculated measure using SHARE:
Share of measure SALES in PRODUCT.PRIMARY hierarchy of the PRODUCT dimension as a
5-9
These are the results of a query against the calculated measure. The TOTAL_SHARE column displays the percent share of the total for the selected products.
PRODUCT -------------------Total Product Hardware Desktop PCs Portable PCs CD/DVD Software/Other Accessories Operating Systems Memory Modems/Fax Monitors Documentation PROD_LEVEL SALES TOTAL_SHARE --------------- ---------- ----------TOTAL 144290686 100 CLASS 130145388 90 FAMILY 78770152 55 FAMILY 19066575 13 FAMILY 16559860 11 CLASS 14145298 10 FAMILY 6475353 4 FAMILY 5738775 4 FAMILY 5430466 4 FAMILY 5844185 4 FAMILY 4474150 3 FAMILY 1931170 1
Rank
Rank orders the values of a dimension based on the values of the selected measure. When defining a rank calculation, you choose the dimension, a hierarchy, and the measure. You can choose a method for handling identical values:
Rank: Assigns the same rank to identical values, so there may be fewer ranks than there are members. For example, it may return 1, 2, 3, 3, 4 for a series of five dimension members. Dense Rank: Assigns the same minimum rank to identical values. For example, it may return 1, 2, 3, 3, 5 for a series of five dimension members. Average Rank: Assigns the same average rank to identical values. For example, it may return 1, 2, 3.5, 3.5, 5 for a series of five dimension members.
You can also choose the group in which the dimension members are ranked:
Member's level: Ranks members at the same level. Member's parent: Ranks members with the same parent. Member's ancestor at level: Ranks members with the same ancestor at a specified level higher in the hierarchy.
These are the results of a query against the calculated measure in which the products are ordered by RANK:
PRODUCT SALES RANK -------------------- ---------- ---------Monitors 4474150 1 Memory 5430466 2 Modems/Fax 5844185 3 CD/DVD 16559860 4 5-10 Oracle OLAP User's Guide
19066575 78770152
5 6
Parallel Period
Parallel periods are at the same level as the current time period, but have different parents in an earlier period. For example, you may want to compare current sales with sales for the prior year at the quarter and month levels. Oracle OLAP provides several functions for parallel periods:
Parallel Period: Calculates the value of the parallel period. Difference From Parallel Period: Calculates the difference in values between the current period and the parallel period. Percent Difference From Parallel Period: Calculates the percent difference in values between the current period and the parallel period.
To identify the parallel period, you specify a level and the number of periods before the current period. You can also decide what happens when two periods do not exactly match, such as comparing daily sales for February (28 days) with January (31 days). You also choose the measure, the time dimension, and the hierarchy. Parallel Period Example This template defines a calculated measure using Parallel Period.
Parallel period for SALES in the TIME dimension and TIME.CALENDAR hierarchy 1 TIME.CALENDAR.QUARTER ago based on position from beginning to ending of period.
These are the results of a query against the calculated measure, which lists the months for two calendar quarters. The parallel month has the same position within the previous quarter. The prior period for JUL-06 is APR-06, for AUG-06 is MAY-06, and for SEP-06 is JUN-06.
TIME -------APR-06 MAY-06 JUN-06 JUL-06 AUG-06 SEP-06 PARENT SALES LAST_QTR ---------- ---------- ---------CY2006.Q2 11356940 13119235 CY2006.Q2 13820218 11441738 CY2006.Q2 11638499 11593842 CY2006.Q3 9417316 11356940 CY2006.Q3 11596052 13820218 CY2006.Q3 11305567 11638499
Moving Calculations
Moving calculations are performed over the time periods surrounding the current period. Oracle OLAP provides several aggregation methods for moving calculations:
Moving Average: Calculates the average value for a measure over a fixed number of time periods. Moving Maximum: Calculates the maximum value for a measure over a fixed number of time periods. Moving Minimum: Calculates the minimum value for a measure over a fixed number of time periods. Moving Total: Returns the total value for a measure over a fixed number of time periods.
You can choose the measure, the time dimension, and the hierarchy. You can also select the range, as described in "Choosing a Range of Time Periods" on page 5-6, and the number of time periods before and after the current period to include in the calculation. Moving Calculation Example This template defines a calculated measure using Moving Minimum.
Moving minimum of SALES in the TIME dimension and TIME.CALENDAR hierarchy. Include 1 preceding and 1 following members within level.
These are the results of a query against the calculated measure, which displays values for the descendants of calendar year 2004. Each value of Minimum Sales is the smallest among the current value and the values immediately before and after it. The calculation is performed over all members of a level in the cube.
TIME -------Q1.04 Q2.04 Q3.04 Q4.04 JAN-04 FEB-04 MAR-04 APR-04 MAY-04 JUN-04 JUL-04 AUG-04 SEP-04 OCT-04 NOV-04 DEC-04 TIME_LEVEL SALES MIN_SALES -------------------- ---------- ---------CALENDAR_QUARTER 32977874 32977874 CALENDAR_QUARTER 35797921 32977874 CALENDAR_QUARTER 33526203 33526203 CALENDAR_QUARTER 41988687 31381338 MONTH 11477898 10982016 MONTH 10982016 10517960 MONTH 10517960 10517960 MONTH 11032057 10517960 MONTH 11432616 11032057 MONTH 13333248 11432616 MONTH 12070352 11108893 MONTH 11108893 10346958 MONTH 10346958 10346958 MONTH 14358605 10346958 MONTH 12757560 12757560 MONTH 14872522 12093518
Cumulative Calculations
Cumulative calculations start with the first time period and calculate up to the current member, or start with the last time period and calculate back to the current member. Oracle OLAP provides several aggregation methods for cumulative calculations:
Cumulative Average: Calculates a running average across time periods. Cumulative Maximum: Calculates the maximum value across time periods. Cumulative Minimum: Calculates the minimum value across time periods. Cumulative Total: Calculates a running total across time periods.
You can choose the measure, the time dimension, and the hierarchy. You can also select the range, as described in "Choosing a Range of Time Periods" on page 5-6, and whether you want to start the calculation with the first period and calculate forward, or start with the last period and calculate back. Cumulative Calculation Example This template defines a calculated measure using Cumulative Minimum.
Cumulative minimum of SALES in the TIME dimension and TIME.CALENDAR hierarchy within ancestor at level TIME.CALENDAR_YEAR. Total from beginning to current member.
These are the results of a query against the calculated measure, which displays values for the descendants of calendar year 2004. The minimum value for quarters begins with Q1-04 and ends with Q4-04, and for months begins with Jan-04 and ends with Dec-04.
TIME -------Q1.04 Q2.04 Q3.04 Q4.04 JAN-04 FEB-04 MAR-04 APR-04 MAY-04 JUN-04 JUL-04 AUG-04 SEP-04 OCT-04 NOV-04 DEC-04 TIME_LEVEL SALES MIN_SALES -------------------- ---------- ---------CALENDAR_QUARTER 32977874 32977874 CALENDAR_QUARTER 35797921 32977874 CALENDAR_QUARTER 33526203 32977874 CALENDAR_QUARTER 41988687 32977874 MONTH 11477898 11477898 MONTH 10982016 10982016 MONTH 10517960 10517960 MONTH 11032057 10517960 MONTH 11432616 10517960 MONTH 13333248 10517960 MONTH 12070352 10517960 MONTH 11108893 10517960 MONTH 10346958 10346958 MONTH 14358605 10346958 MONTH 12757560 10346958 MONTH 14872522 10346958
Nested Calculations
You can extend the variety of functions available through the templates by using a calculated measure as the basis for another calculated measure. For example, Analytic Workspace Manager has templates for Moving Average and for Difference From Prior Period. You can create a calculated measure that calculates a moving average, then calculate the difference between the current and the previous moving averages. Nested Calculations Example This template creates a moving average for Units named UNITS_MOVING_AVG:
Moving average of UNITS in the TIME dimension and TIME.CALENDAR hierarchy. Include 1 preceding and 1 following members within level.
The next template creates a Difference From Prior Period calculation from UNITS_ MOVING_AVG.
Difference from prior period for UNITS_MOVING_AVG in TIME dimension and TIME.CALENDAR hierarchy 1 period ago.
These are the results of a query against the Units measure and the two calculated measures. The MOVING_AVG column shows the moving average, and the DIFF column shows the difference between the current moving average and the prior period's.
TIME -------JAN-06 FEB-06 MAR-06 APR-06 MAY-06 JUN-06 JUL-06 AUG-06 TIME_LEVEL UNITS MOVING_AVG DIFF -------------------- ---------- ---------- ---------MONTH 47776 48520 66 MONTH 47695 48940 419 MONTH 51348 48683 -257 MONTH 47005 50387 1705 MONTH 52809 48411 -1976 MONTH 45419 48872 461 MONTH 48388 47546 -1326 MONTH 48830 47857 312
Open the Create Calculated Measure dialog box. Select the calculation type that most closely matches the one you want to define. Modify the template as desired. Cut-and-paste the calculation from the Calculation box into a text editor. Repeat these steps if your calculation uses two or more functions. Modify the calculation as desired in the text editor. You can combine numeric operators, analytic functions, and single-row functions in a single calculation. From the Calculation Types list, select OLAP Expression Syntax. Cut-and-paste the calculation from the text editor into the Calculation box. Click Create.
See Also: Analytic Workspace Manager Help for detailed information about the OLAP expression syntax.
The template generates this calculation using the multiplication operator (*). It appears in the Calculation box. Notice that UNITS is in the Units Cube and UNIT_COST is in the Price Cube.
UNITS_CUBE.UNITS * PRICE_CUBE.UNIT_COST 5-14 Oracle OLAP User's Guide
The syntax of this calculation is so simple that you only need the template to obtain the qualified name of the measure. Following is a free-form calculation that calculates a 2% increase in units sold:
UNITS_CUBE.UNITS * 1.02
Following is a free-form calculation that computes the percent difference between current units sold and the cumulative average. It uses the AVG function and the subtraction (-), division (/) and multiplication (*) operators.
((UNITS_CUBE.UNITS - AVG(UNITS_CUBE.UNITS) OVER HIERARCHY (TIME.CALENDAR BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING WITHIN LEVEL)) / AVG(UNITS_CUBE.UNITS) OVER HIERARCHY (TIME.CALENDAR BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING WITHIN LEVEL)) * 100
Calculate the cumulative average of UNITS with the Cumulative Average template. Calculate the percent difference between current UNITS and the cumulative average with the Percent Difference template.
Table 51 Function
OLAP Expression Syntax Analytic Functions Description Orders the members of a dimension based on the values of an expression. The function returns the sequence numbers of the dimension members, and assigns the same average rank to identical values. Returns the average of a selection of values calculated over time. Tallies the number of data values identified by a selection of dimension members. Orders dimension members based on the values of an expression. The function returns the sequence numbers of the dimension members, and assigns the same minimum rank to identical values. Returns an ancestor at a particular level of a hierarchy for either all members in the hierarchy or a particular member. Returns the number of children of either all dimension members in a hierarchy or a particular member. Returns a number representing the level depth of either all members of a hierarchy or a particular member, where 0 is the top level. Returns the level of either all members of a hierarchy or a particular member. Returns the parent of either all dimension members in a hierarchy or a particular member. Returns the topmost ancestor of either all members of a hierarchy or a particular member. Returns the value of an expression at a specified number of time periods before the current period. Returns the difference between values for the current time period and a prior period. Returns the percent different between values for the current time period and a prior period. Returns the value of an expression at a specified number of time periods after the current period. Returns the difference between values for the current time period and a future period. Returns the percent different between values for the current time period and a future period. Returns the largest of a selection of data values calculated over a particular dimension. Returns the smallest of a selection of data values calculated over a particular dimension. Executes an expression in the OLAP DML language. Orders the members of a dimension based on the values of an expression. The function returns the sequence numbers of the dimension members, and assigns the same rank to identical values.
AVERAGE_RANK
HIER_LEVEL HIER_PARENT HIER_TOP LAG LAG_VARIANCE LAG_VARIANCE_PERCENT LEAD LEAD_VARIANCE LEAD_VARIANCE_PERCENT MAX MIN OLAP_DML_EXPRESSION RANK
Table 51 (Cont.) OLAP Expression Syntax Analytic Functions Function ROW_NUMBER Description Orders the members of a dimension based on the values of an expression. The function returns the sequence numbers of the dimension members, and assigns a unique and arbitrary rank to identical values. Calculates the ratio of an expression's value for the current dimension member to the value for a related member of the same dimension. Returns the total of a selection of values calculated over a particular dimension.
SHARE
SUM
The OLAP DML is a mature language that was developed specifically for creating and managing dimensional objects and for manipulating dimensional data. Although programming in the OLAP DML requires significant skill, the language offers more power and flexibility than any other language.
OLAP DML Expression: Calculates an OLAP DML expression. Choose this calculation type to execute an existing program, a built-in function, or a single expression. The expression is stored as the EQ statement of a formula in the analytic workspace. OLAP DML Function: Executes an OLAP DML program entered in the Program Body field that returns values. Choose this calculation type to develop a new program in the OLAP DML. The name of the program is stored in the EQ statement of a formula in the analytic workspace.
Open the Create Calculated Measure dialog box. From the Calculation Types list, select OLAP DML Expression. For Data Type, select the data type of the return value. Enter the expression in the OLAP DML field.
5. 6.
Click Compile Expression to check for syntax errors and to save a compiled version of the expression. Click Create to create the calculated measure.
Open the Create Calculated Measure dialog box. From the Calculation Types list, select OLAP DML Function. For Data Type, select the data type of the return value. Enter a name for the function. Enter the program in the Program Body field. Omit the DEFINE, PROGRAM, and END commands, because they are generated automatically. Click Compile Expression to check for syntax errors and to save a compiled version of the program. Click Create to create the calculated measure.
The next example uses an arithmetic operator to calculate a 2% increase in units sold. This example of the OLAP DML is identical to the example in "Expression Syntax Example Using an Arithmetic Operator" on page 5-14. However, note the difference in naming convention for the measure.
units_cube_units * 1.02
These are the results of a query against the two calculated measures created as OLAP DML expressions:
PRODUCT UNITS TARGET RANDOM -------------------- ---------- ---------- ---------Envoy Ambassador 2116 2158 .6467 Envoy Executive 2481 2531 .0773 Envoy Standard 3300 3366 .2349 Sentinel Financial 30513 31123 .6027 Sentinel Multimedia 7948 8107 .6494 Sentinel Standard 7302 7448 .5912
Catalog Catalog Catalog Direct Sales Direct Sales Direct Sales Direct Sales Internet Internet Internet Internet
Q2.06 Q3.06 Q4.06 Q1.06 Q2.06 Q3.06 Q4.06 Q1.06 Q2.06 Q3.06 Q4.06
-1 -3 -7 -3 -1 10 -4 29 3 0 16
ALERT ALERT ALERT ALERT ALERT OKAY ALERT OKAY ALERT ALERT OKAY
6
6
You can use any SQL development tool or application to create reports and dashboards populated with data from OLAP cubes. This chapter shows the basic steps for working with the tools provided with Oracle Database: Oracle Business Intelligence Publisher (BI Publisher) and Oracle Application Express. You can try these tools, or you can apply the methods shown here to your favorite SQL tool. This chapter contains the following topics:
Developing OLAP Applications Developing a Report Using BI Publisher Developing a Dashboard Using Application Express
See Also:
Choice Lists: You can create a drop-down list for each dimension to drill on the dimensions in a report or dashboard. Linked Dimension Columns: In Application Express, you can add links to the dimension columns of a crosstab to drill down to the bottom of a hierarchy, and use a Reset button to return to the top level.
These user interfaces set the values of bind variables in the WHERE clause of the source query. When a user changes the current selection in a choice list or clicks a link in a crosstab, that action dynamically changes the value of the variable. When the variable changes, so does the condition of the query and the contents of the report or dashboard. When the variable sets the value of the PARENT column of the hierarchy views, users can drill on a parent to view its children.
6-1
Example 61 shows a basic SQL query against UNITS_CUBE_VIEW in the Global sample schema. The query selects the SALES measure and three calculated measures that use SALES as the basis for the calculations:
SALES_PP: Sales from the prior period. SALES_CHG_PP: Difference in sales between the current period and the prior period. SALES_PCTCHG_PP: Percent difference in sales between the current period and the prior period.
This query is used in the sample applications developed in this chapter. The PARENT columns for the Product, Customer, and Time dimensions support drilling in these applications. The Channel dimension remains anchored at the Total level.
Example 61 SQL Query Against the Sales Cube
SELECT p.long_description "Product", cu.long_description "Customer", t.long_description "Time", ROUND(f.sales) "Sales", ROUND(f.sales_pp) "Prior Period", ROUND(f.sales_chg_pp) "Change", ROUND(f.sales_pctchg_pp * 100) "Percent Change" /* From dimension views and cube view */ FROM product_primary_view p, customer_shipments_view cu, time_calendar_view t, channel_primary_view ch, units_cube_view f /* Use parent columns to implement drilling */ WHERE p.parent = 'TOTAL' AND cu.parent = 'TOTAL' AND t.parent = 'CY2006' AND ch.level_name = 'TOTAL' /* Join dimension views to cube view */ AND p.dim_key = f.product AND cu.dim_key = f.customer AND t.dim_key = f.time AND ch.dim_key = f.channel ORDER BY product, customer, t.end_date; Product --------------Hardware Hardware Hardware Hardware Hardware Hardware Customer --------------North America North America North America North America Asia Pacific Asia Pacific . . . Asia Pacific Europe Europe Europe Europe Time Sales Prior Period Change Percent Change ---------- ---------- ------------ ---------- -------------Q1.06 16002175 14493426 1508749 10 Q2.06 16032643 16002175 30469 0 Q3.06 15698208 16032643 -334436 -2 Q4.06 15958791 15698208 260583 2 Q1.06 13416447 14273900 -857453 -6 Q2.06 14306431 13416447 889984 7
1 16 -8 -26 42
24 rows selected.
6-3
2. 3.
Click Create a New Folder. Enter a name for the folder in the text box, such as OLAP Reports. Click Create.
4. 5.
Click Create a New Report. Enter a report name in the text box. This example creates a report named Global Sales.
c.
Click Create.
To define the contents of the report, click Edit. The Report Editor opens.
2.
For General Settings, enter a description and select a default data source. If the list does not include a connection to the database and schema containing your cubes, contact your BI Publisher administrator.
3.
Select Data Model, then click New. The Data Set page opens.
4.
Enter a name for the data set and enter a SQL query like the one shown in Example 61. Do not use a semicolon.
5. 6.
Click Save. Click View. BI Publisher checks the report definition for errors. If there are none, then it generates the XML for the report.
Figure 63 shows the Report Editor with the Data Set page displayed.
Figure 63 Creating a Data Model in the BI Publisher Report Editor
Static text and graphics that you enter like any other Word document. Dynamic fields such as the date and time or page numbers, which are processed by Word. Codes that identify the XML tags for your data, which are processed by BI Publisher. When BI Publisher generates a report, it replaces the codes with the data identified by these tags.
You can format all parts of the report template in Word, selecting the fonts, text and background colors, table design, and so forth. Example 62 shows the XML for a row of data returned by the sample query. The tags match the column names in the select list, except that underscores replace the spaces. The tags are Product, Customer, Time, Sales, Prior_Period, Change, and Percent_ Change. XML tags are case-sensitive. You use the HTML tag names as the codes in the Word document.
6-5
Example 62 XML for a SQL Query <ROW> <Product>Hardware</Product> <Customer>North America</Customer> <Time>Q1.06</Time> <Sales>16002175</Sales> <Prior_Period>14493426</Prior_Period> <Change>1508749</Change> <Percent_Change>10</Percent_Change> </ROW>
Figure 64 shows the Word document that is used as the template for the sample report. It contains these elements:
A table used to format the banner, which consists of a graphic, the company name, and a horizontal line. (Static) The name of the report. (Static) A table for the query results that contains two rows: A heading row. (Static) A body row containing text form fields, which identify the XML tags and the appropriate formatting for the data. BI Publisher replaces these fields with data from the query. The first and last columns contain two fields. The first and last fields identify the range of repeating columns. (Dynamic)
A date field. Word updates this field with the current date. (Dynamic)
This example uses a blank Word template, but you could use a template with, for example, the banner already defined.
Figure 64 Sample Report Template Created in Word for BI Publisher
The following procedure defines the template manually. Alternatively, you can use a Word plug-in called Oracle BI Publisher Desktop. On the BI Publisher My Folders page, click Template Builder to download the plug-in. To create a BI Publisher template in Word:
1. 2. 3.
Open a new document in Word. Compose the page according to your preferences. For the query results, create a table. The table shown in Figure 64 is very simple. You can use much more elaborate formatting if you want, including nested columns and tables.
4.
From the View menu, select Toolbars, then Forms. The Forms toolbar opens.
5.
Position the cursor in the appropriate cell. On the Forms toolbar, click the Text Form Field icon. The Text Form Field Options dialog box opens.
c. d. e.
Select an appropriate Type, generally Regular Text for dimension labels and Number for measures. Enter a default value and a format. Click Add Help Text. The Form Field Help Text dialog box opens.
f.
Type the appropriate XML tag in the Type Your Own box, using the format <?tag?>. Enter the tag name exactly as it appears in the XML report. For example, enter <?Product?> for the XML tag <Product>.
g. h. 6.
Click OK to close the Form Field Help dialog box. Click OK to close the Text Form Field Options dialog box.
In the Text Form Field Options dialog box, enter any default value, such as For-Each. In the Form Field Help Text dialog box, enter this text:
<?for-each:ROW?>
7.
In the Text Form Field Options dialog box, enter any default value, such as End. In the Form Field Help Text dialog box, enter this text:
<?end for-each?>
8. 9.
Make any additional formatting changes in Word, such as the appropriate justification of the table headings and data columns. Save the document as an RTF file.
Open the report editor in BI Publisher. Select Layouts. The Create Layouts page opens.
3.
Click New.
Developing Reports and Dashboards 6-7
Enter a name and select RTF for the template type. Select Layouts again, and select the layout as the default template for this report. Under Manage Template Files, click Browse. Select the RTF file you created. Click Upload. The uploaded file is listed under Manage Template Files. Whenever you change the file in Word, upload it again. Otherwise, BI Publisher continues to use its copy of the previous version.
8. 9.
10. To change the format, select a format from the list and click View.
To see the XML, select Data. Figure 65 shows the report in HTML format.
Figure 65 BI Publisher Report Displayed in HTML Format
Create one or more Lists of Values (LOV) to be displayed in the menu. Create menus for displaying the LOVs. Edit the query to use the bind variables created for the menus.
Open the Report Editor in BI Publisher. Select List of Values, then click New. The List of Values page opens.
3.
Enter a name for the list, such as Product_LOV. For the type, select SQL Query. Enter a query against the dimension hierarchy view, as shown previously.
4.
Click Save.
Repeat these steps for the other dimensions. This example uses lists for Product, Customer, and Time.
Creating a Menu
In BI Publisher, a menu is a type of parameter. Creating a parameter automatically creates a bind variable that you can use in the query for the report. To create a menu:
1.
2.
For the Identifier, enter a name such as product. This is the case-sensitive name of the bind variable that you will use in the query.
b. c.
Select an appropriate data type, typically String. For the Default Value, enter the dimension key used in the WHERE clause of the LOV query. The menu initially displays this key.
d. e.
For the Parameter Type, select Menu. Select the appropriate List of Values.
6-9
f. 3.
Click Save.
Repeat these steps for the other dimensions. This example creates menus for Product, Customer, and Time.
In this example, the WHERE clause uses the bind variables for Time, Product, and Customer:
WHERE AND AND AND p.parent = :product cu.parent = :customer t.parent = :time ch.level_name = 'TOTAL'
Under Data Model, select the data set you defined for this report. The Data Set page opens.
2. 3.
In the SQL Query box, edit the WHERE clause to use the bind variables created by the parameter definitions. Click Save.
Figure 66 shows a report in HTML format displayed in BI Publisher. The choice lists for Product, Customer, and Time appear across the top. The crosstab lists the months in Q3.06, the Hardware products, and the countries in Europe. To see a different selection of data, you choose a Time Period, Product, and Customer from the menus, then click View. This report was generated by the same report entry, using the same query, as the one shown in Figure 61. You can continue working on this report, adding charts and other tables.
Open a browser to the Application Express home page and log in. Click the Application Builder icon. The Application Builder opens.
3.
4. 5. 6.
Select Create Application, then Next. On the Name page, enter a title for the application such as Global Dashboard and select From Scratch. On the Pages page, select the Report page type, then define the page:
a. b.
For Page Source, select SQL Query. For Title, enter a name such as Sales Analysis. This title appears on the page.
c.
For Query, enter a SQL SELECT statement for your cube, like the one shown in Example 61. Do not include an ORDER BY clause or a semicolon.
d.
Click Add Page. The page definition appears in the Create Application Box.
7.
Click Next, then complete the Create Application wizard according to your own preferences. This example was created with no tabs, no shared components, no authentication, and Theme 15 (Light Blue).
8. 9.
On the Confirm page, click Create. On the Application Builder home page, click the Run Application icon.
Tip: To continue working on this page, click the Edit Page 1 link at the bottom of the display.
Figure 68 shows the results of the query displayed in Application Express. Several items are automatically added to the page: breadcrumbs, Search box, Display list, Go button, Reset button, and Spread Sheet link. This application only needs the Reset button, so you can delete the other items if you want.
Figure 68 Basic Sales Report in Application Express
Create a region on the page to display the list. Create a list of values (LOV). Create a list item with a bind variable to display the LOV. Create an unconditional branch for the list. Edit the query to use the bind variable.
The Page Definition is where you can create and edit pages, including adding and modifying graphical items. The items are organized in three columns: Page Rendering, Page Processing, and Shared Components. To open the Page Definition:
After running the application, click the Edit Page link at the bottom of the page. or
On the Application home page, click the icon for the page where the report is defined.
Creating a Region
You can create the choice list in a plain HTML area at the top of the page. To create an empty HTML region:
1.
On the Page Definition under Regions, click the Create icon. The Create Region wizard opens.
2. 3.
On the Region pages, select HTML, click Next, then select HTML again. On the Display Attributes page, enter a descriptive title and select an appropriate template and location on the page for the lists. For this example, the name is lov_region, the template is No Template, and the location is Page Template Body (1 items below template content). The name can be displayed on the rendered page, but it is hidden in this example.
4.
Click Create Region. The region appears on the Page Definition under Regions.
On the Page Definition under List of Values, click the Create icon. The Create List of Values wizard opens.
2. 3.
On the Source page, select From Scratch. On the Name and Type page, enter a descriptive name and select Dynamic. This example uses the name CUSTOMER_LOV.
4. 5.
On the Query page, enter a query like the one shown previously. Do not use a semicolon. Click Create List of Values. The LOV appears in the Page Definition under List of Values.
For additional LOVs, repeat these steps. This example creates LOVs for the Product and Customer dimensions.
On the Page Definition under Items, click the Create icon. The Create Item wizard opens.
2. 3. 4.
On the Item Type page, select Select List. For Control Type, select Select List with Submit. On the Display Position and Name page:
Enter a name that identifies the dimension, such as P1_CUSTOMER for the name of the Customer bind variable. P1 is the page number, and CUSTOMER identifies the Customer dimension. Select the new HTML region for the location of the list.
5.
Named LOV to the List Of Values created for this dimension, such as CUSTOMER_LOV. Display Null Option to No.
6. 7.
Select the Item attributes according to your own preferences. On the Source page, enter the name of the top dimension key for the default value. For the Global Customer dimension, the value is TOTAL.
8.
Repeat these steps for other lists. This example creates lists for the Product and Customer dimensions. To activate the list item:
1.
On the Page Definition under Branches, click the Create icon. The Edit Branch wizard opens.
2. 3.
On the Point and Type page, accept the default settings. On the Target page:
Set Target to Page in This Application. Set Page to the page with the list item, which is 1 in this example. Select Reset Pagination For This Page.
4. 5.
On the Branch Conditions page, accept the default settings to create an unconditional branch. Click Apply Changes. The Edit Branch page closes, and you return to the Page Definition. The unconditional branch is listed under Branches.
The NVL function substitutes the name of the top dimension key in the hierarchy for null values. The dimension keys at the top have no parent key. To edit the query:
1. 2.
Open the Page Definition. Under Regions, click the Edit Region link. In this example, the region is named Sales Report. The Edit Region page opens.
3.
Change the WHERE clause to use the bind variables. Delete the outer SELECT added by Application Express.
4.
For this example, the WHERE clause now looks like this:
WHERE p.parent = NVL(:P1_PRODUCT, 'TOTAL') AND cu.parent = NVL(:P1_CUSTOMER, 'TOTAL') AND t.parent = 'CY2006' AND ch.level_name = 'TOTAL'
Figure 610 shows the modified page with choice lists for Product and Customer.
Figure 610 Dashboard With Choice Lists for Drilling
Create hidden items with bind variables. Edit the query to use the bind variables. Add links to the dimension columns of the crosstab.
Open the Page Definition. Under Items, click the Create icon. The Create Item wizard opens.
3. 4.
On the Item Type page, select Hidden. On the Display Position and Name page:
Developing Reports and Dashboards 6-17
Enter a name that identifies the dimension, such as P1_TIME for the name of the Time bind variable. Select the region where the report is defined.
5.
On the Source page, enter the dimension key at the top of the hierarchy. TOTAL is the top of all hierarchies in the Global schema. For this example, Time is set to CY2006 to restrict the selection to one year.
6. 7.
Click Create Item. Repeat these steps for any other dimensions that support drilling only on the column links. For this example, a hidden item is defined for Time.
Select list: Application Express manages only those columns that appear in the select list. You can choose to display or hide the columns. For defining the column links, add the DIM_KEY and PARENT columns in the hierarchy views to the query select list. WHERE clause: Add the bind variables for the hidden items like you did for the choice lists in "Editing the Query" on page 6-10.
Open the Page Definition. Under Regions, click the Report link. The Report Attributes page opens.
3.
Clear the Show check boxes for columns to hide, such as the DIM_KEY and PARENT columns. Set the Sort and Sort Sequence check boxes for appropriate sorting for the report. In this example, the sort order is Product (1), Customer (2), and Time (3).
4.
Click the Edit icon for a dimension column. The Column Attributes page opens.
5.
Link Text: Select the dimension name. Page: Enter the page number. Name: List the dimensions in the order they appear in the report. Item is the name of the bind variable. Value is the DIM_KEY column for the dimension being defined or the PARENT column for the other dimensions.
Figure 611 shows the link definition for the Time dimension.
6.
Click Apply Changes. The Column Attributes page closes, and you return to the Report Attributes page.
7. 8.
Define links on the other dimension columns. Click Apply Changes. The Report Attributes page closes, and you return to the Page Definition.
Figure 611
Open the Page Definition. Under Branches, click the Go to Page conditional link. The Reset button was created on the page automatically along with its conditional branch. The Edit Branch page opens.
3. 4. 5.
Under Action, set Clear Cache to the page number (in this example, 1). Under Conditions, set When Button Pressed to RESET. Click Apply Changes. The Edit Branch page closes, and you return to the Page Definition.
6.
Figure 612 shows the finished page displaying months in Q3.06. You can continue working on this application, adding more reports and charts to the page. For the SQL queries providing data to those reports and charts, you can reuse the same bind variables for the dimensions.
Figure 612 Sales Analysis Report With Column Links in Application Express
7
7
Because Oracle OLAP is contained in the database and its resources are managed using the same tools, the management tasks of Oracle OLAP and the database converge. Nonetheless, you should address tasks such as database tuning in the specific context of data warehousing. This chapter contains the following topics:
Setting Database Initialization Parameters Storage Management Dictionary Views and System Tables Partitioned Cubes and Parallelism Analyzing Cubes and Dimensions Monitoring Analytic Workspaces About Backing Up and Recovering Analytic Workspaces About Copying Analytic Workspaces Cube Materialized Views
Oracle Database Performance Tuning Guide for information about tuning parameter settings Oracle Database Reference for descriptions of individual parameters
7-1
Storage Management
Initial Settings for Database Parameters Default Value 1000 Recommended Setting If you reduce this value to limit the maximum number of job slaves running on an instance, then calculate the following number of processes for use by OLAP: Number of CPUs, plus one additional process for every three CPUs; in a multi-core CPU, each core counts as a CPU For example, JOB_QUEUE_ PROCESSES=5 for a four-processor computer Description Controls the degree of parallelism in OLAP builds, as described in "Parallelism" on page 7-7
PARALLEL_ DEGREE_POLICY
MANUAL
AUTO or LIMITED
Controls how the degree of parallelism is determined When set to AUTO or LIMITED, Oracle determines whether a SQL statement executes in parallel and, if so, the degree of parallelism used
Provides sufficient background processes for each user Specifies use of an undo tablespace
Identifies the undo tablespace defined for OLAP use, as shown in "Creating an Undo Tablespace" on page 7-3
Open the init.ora initialization file in a text editor. Add or change the settings in the file, as described in Table 71. Stop and restart the database. On Windows, use the Services utility to stop and restart OracleService. On Linux, use commands like the following. Be sure to identify the initialization file in the STARTUP command.
SQLPLUS '/ AS SYSDBA' SHUTDOWN IMMEDIATE STARTUP pfile=$ORACLE_BASE/admin/orcl/pfile/init.ora.724200516420
Storage Management
Analytic workspaces are stored in the owner's default tablespace, unless the owner specifies otherwise. All tablespaces for OLAP use should specify EXTENT MANAGEMENT LOCAL. Tablespaces created using default parameters may use resources inefficiently. You should create undo, permanent, and temporary tablespaces that are appropriate for storing analytic workspaces.
Storage Management
After creating the undo tablespace, change your system parameter file to include the following settings, then restart the database as described in "Setting Database Initialization Parameters" on page 7-1.
UNDO_TABLESPACE=tablespace UNDO_MANAGEMENT=AUTO
7-3
performance of analytic workspaces both on systems with Oracle RAC and those without Oracle RAC. However, you do not need ASM to use Oracle OLAP. You can still spread your data across multiple disks, just by defining the tablespaces like in this example:
CREATE TABLESPACE glo DATAFILE 'disk1/oradata/glo1.dbf' SIZE 64M REUSE AUTOEXTEND ON NEXT 8M MAXSIZE 1024M EXTENT MANAGEMENT LOCAL SEGMENT SPACE MANAGEMENT AUTO; ALTER TABLESPACE glo ADD DATAFILE 'disk2/oradata/glo2.dbf' SIZE 64M REUSE AUTOEXTEND ON NEXT 8M MAXSIZE 1024M, 'disk3/oradata/glo3.dbf' SIZE 64M REUSE AUTOEXTEND ON NEXT 8M MAXSIZE UNLIMITED;
See Also:
"Querying the Data Dictionary" on page 4-19 for a list of data dictionary views that describe OLAP dimensional objects Oracle Database Reference for full descriptions of all data dictionary views
System Tables
The SYS user owns several tables associated with analytic workspaces. Table 73 provides brief descriptions.
Important:
These tables are vital for the operation of Oracle OLAP. Do not delete them or attempt to modify them directly without being fully aware of the consequences.
OLAP Tables Owned By SYS Description Maintains a record of all analytic workspaces in the database, recording its name, owner, and other information. Stores the AWCREATE analytic workspace, which contains programs for using OLAP Catalog metadata in Oracle Database 10g Release 10.1.0.2 and earlier releases. It exists only for backward compatibility. Stores the AWCREATE10G analytic workspace, which contains programs for using OLAP Catalog metadata in Oracle Database 10g Release 10.1.0.3. The OLAP Catalog is not used by later releases. It exists only for backward compatibility. Stores the AWMD analytic workspace, which contains programs for creating metadata catalogs. Stores the AWREPORT analytic workspace, which contains a program named AWREPORT for generating a summary space report. Stores the AWXML analytic workspace, which contains programs for creating and managing analytic workspaces for Oracle Database 10g Release 10.1.0.4 and later. Stores the EXPRESS analytic workspace. It contains objects and programs that support basic operations. EXPRESS is used any time a session is open. Describes the objects stored in analytic workspaces. Stores program data. Not currently used. Stores analytic workspace object properties. Stores tracking data about access to aggregate cells. Not currently used. Maintains a history of all page spaces. A page space is an ordered series of bytes equivalent to a file. Oracle OLAP manages a cache of workspace pages. Pages are read from storage in a table and written into the cache in response to a query. The same page can be accessed by several sessions. The information stored in PS$ enables Oracle OLAP to discard pages that are no longer in use, and to maintain a consistent view of the data for all users, even when the workspace is being modified during their sessions. When changes to a workspace are saved, unused pages are purged and the corresponding rows are deleted from PS$.
AW$AWCREATE10G
Maintenance Logs
The first time you load data into a cube or dimension using Analytic Workspace Manager, it creates several logs. These logs are stored in tables in the same schema as the analytic workspace:
7-5
Cube Build Log: Contains information about what happened during a build. Use this log to determine whether the build produced the results you were expecting, and if not, why not. The log is continually updated whenever a cube or dimension is refreshed, whether by Analytic Workspace Manager, the database materialized view refresh subsystem, or a PL/SQL procedure. You can query the log at any time to evaluate the progress of the build and to estimate the time to completion. The default table name is CUBE_BUILD_LOG. Cube Dimension Compile Log: Contains errors that occur during the validation of the dimension hierarchies when OLAP is aggregating a cube. The default table name is CUBE_DIMENSION_COMPILE. Cube Operations Log: Contains messages and debugging information for all OLAP engine events. The default table name is CUBE_OPERATIONS_LOG. Cube Rejected Records Log: Identifies any records that were rejected because they did not meet the expected format. The default table name is CUBE_REJECTED_ RECORDS.
These logs enable you to track the progress of long running processes, then use the results to profile performance characteristics. They provide information to help you diagnose and remedy problems that may occur during development and maintenance of a cube. They also help diagnose performance problems in querying cubes. You can also run the $ORACLE_HOME/olap/admin/utlolaplog.sql script to create the build log with some useful views. The Maintenance Wizard in Analytic Workspace Manager displays the relevant rows from these tables during every build on the Maintenance Log page. You can query the tables directly in any SQL interface.
See Also: DBMS_CUBE_LOG entry in Oracle Database PL/SQL Packages and Types Reference.
For example, assume that in the sample Global analytic workspace, the Units cube is partitioned on the Time dimension, using the Calendar hierarchy, and at the Calendar Quarter level. The OLAP engine creates a partition for each Calendar Quarter and its children. The default top partition contains Calendar Years and all members of the Fiscal hierarchy. If Global has three years of data, then the Units cube has 13 partitions: Four bottom partitions for each Calendar Year, plus the top partition. A data refresh typically creates new time periods and deletes old ones. Whenever a Calendar Quarter value is loaded into the Time dimension, a corresponding partition is added to the cube. Whenever a Calendar Quarter value is deleted from the Time dimension, the corresponding empty partition is deleted from the cube.
Parallelism
You can improve the performance of data maintenance by enabling parallel processing. There are two levels of parallelism:
Parallel job execution: Loading and aggregating the data using multiple processes. Parallel update: Moving the data from temporary to permanent tablespaces using multiple processes.
The number of objects that can be aggregated in parallel. Each cube and each partition (including the top partition) can use a separate process. You can control the number of partitions in a cube on the Partitioning tab of the cube property sheet in Analytic Workspace Manager.
The number of simultaneous database processes the user is authorized to run. This number is controlled by the JOB_QUEUE_PROCESSES parameter. If you have SYS privileges, you can obtain the current parameter setting with the following SQL command:
SHOW PARAMETER JOB_QUEUE_PROCESSES
For parallel update, the number of processes you allocate to the job. You can specify the number of processes in the Maintenance Wizard of Analytic Workspace Manager when specifying the task processing options, or on the Materialized View tab of the cube. The number of processes allocated to SQL to fetch rows from the relational source tables. When PARALLEL_DEGREE_POLICY is set to AUTO or LIMITED, the database can allocate additional processes for executing SQL statements.
Suppose that a cube is partitioned on the Quarter level of Time, and the cube contains three years of data. The cube has 3*4=12 bottom partitions, JOB_QUEUE_PROCESSES is set to 8, and you set the parallelism option to 4 for the build. Oracle Database processes the cube in this way when PARALLEL_DEGREE_POLICY is set to its default value of MANUAL:
1. 2.
Load and build the dimensions of the cube serially using a single process. Load and build the 12 bottom partitions in parallel using 4 processes. As soon as one process finishes, another begins until all 12 are complete. This cube could use the 8 processes allowed by JOB_QUEUE_PROCESSES, but it is limited to 4 by the build setting.
3.
7-7
When PARALLEL_DEGREE_POLICY is set to AUTO or LIMITED, Oracle Database may allocate more than the designated processes. Example 71 shows excerpts from CUBE_BUILD_LOG for a build of the Units cube and its dimensions. Partitioning on the Calendar Year level of the Time dimension created 10 bottom partitions for 1998 to 2007. JOB_QUEUE_PROCESSES is set to 2 and the parallelism option is set to 2 for the build also. The log shows that Oracle Database processed the Global in this way:
1. 2.
Processed the four dimensions serially. Processed each partition of the Units cube
Example 71 Build Log for Global Units Cube SLAVE_NUMBER -----------0 0 0 0 0 0 0 STATUS ---------STARTED STARTED COMPLETED STARTED COMPLETED STARTED SQL . . . SQL SQL COMPLETED STARTED COMPLETED STARTED COMPLETED STARTED COMPLETED STARTED COMPLETED STARTED COMPLETED STARTED COMMAND -------------------BUILD ATTACH AW RW WAIT ATTACH AW RW WAIT FREEZE FREEZE LOAD NO SYNCH LOAD NO SYNCH BUILD_OBJECT PARTITION --------------- ---------------
TIME TIME
0 0 0 0 0 0 0 0 0 0 0 0 0 0
LOAD NO SYNCH LOAD NO SYNCH LOAD NO SYNCH COMPILE COMPILE COMPILE AGGMAP COMPILE AGGMAP COMPILE AGGMAP COMPILE AGGMAP UPDATE/COMMIT UPDATE/COMMIT UPDATE/COMMIT UPDATE/COMMIT REATTACH AW MULTI TH AW REATTACH AW MULTI TH AW
PRODUCT PRODUCT PRODUCT PRODUCT PRODUCT UNITS_CUBE UNITS_CUBE PRICE_CUBE PRICE_CUBE PRODUCT PRODUCT
0 COMPLETED
0 0 1 1 1 1 1 1 1 1 1 1
STARTED STARTED STARTED STARTED COMPLETED STARTED COMPLETED STARTED SQL COMPLETED STARTED COMPLETED . . .
SLAVE UNITS_CUBE SLAVE UNITS_CUBE BUILD ATTACH AW MULTI THAW UNITS_CUBE ATTACH AW MULTI THAW UNITS_CUBE ACQUIRE UNITS_CUBE ACQUIRE UNITS_CUBE LOAD UNITS_CUBE LOAD UNITS_CUBE LOAD UNITS_CUBE UPDATE/COMMIT UNITS_CUBE UPDATE/COMMIT UNITS_CUBE
P10:CY2007 P9:CY2006 P10:CY2007 P10:CY2007 P10:CY2007 P10:CY2007 P10:CY2007 P10:CY2007 P10:CY2007 P10:CY2007 P10:CY2007 P10:CY2007
10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 0 0
STARTED STARTED COMPLETED STARTED COMPLETED STARTED SQL COMPLETED STARTED COMPLETED STARTED COMPLETED STARTED COMPLETED COMPLETED COMPLETED STARTED
BUILD ATTACH AW MULTI THAW ATTACH AW MULTI THAW ACQUIRE ACQUIRE LOAD LOAD LOAD SOLVE SOLVE UPDATE/COMMIT UPDATE/COMMIT DETACH AW DETACH AW BUILD SLAVE REATTACH AW MULTI TH AW REATTACH AW MULTI TH AW SLAVE BUILD ATTACH AW MULTI THAW ATTACH AW MULTI THAW ACQUIRE ACQUIRE LOAD LOAD SOLVE SOLVE UPDATE/COMMIT UPDATE/COMMIT DETACH AW DETACH AW BUILD SLAVE REATTACH AW RW WAIT REATTACH AW RW WAIT ANALYZE ANALYZE THAW THAW DETACH AW DETACH AW BUILD
UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE
P1:CY1998 P1:CY1998 P1:CY1998 P1:CY1998 P1:CY1998 P1:CY1998 P1:CY1998 P1:CY1998 P1:CY1998 P1:CY1998 P1:CY1998 P1:CY1998 P1:CY1998 P1:CY1998 P1:CY1998 P1:CY1998
0 COMPLETED
0 11 11 11 11 11 11 11 11 11 11 11 11 11 11 0 0 0 0 0 0 0 0 0 0
STARTED STARTED STARTED COMPLETED STARTED COMPLETED STARTED COMPLETED STARTED COMPLETED STARTED COMPLETED STARTED COMPLETED COMPLETED COMPLETED STARTED COMPLETED STARTED COMPLETED STARTED COMPLETED STARTED COMPLETED COMPLETED
UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE
P0 P0 P0 P0 P0 P0 P0 P0 P0 P0 P0 P0 P0 P0 P0 P0
UNITS_CUBE UNITS_CUBE
Oracle Database allocates the specified number of processes regardless of whether all of them can be used simultaneously at any point in the job. For example, if your job can use up to three processes, but you specify five, then two of the processes allocated to your job cannot be used by it or by any other job. If Oracle Database is installed with Real Application Clusters (Oracle RAC), then a script submitted to the job queue is distributed across all nodes in the cluster. The performance gains can be significant. For example, a job running on four nodes in a cluster may run up to four times faster than the same job running on a single computer.
Administering Oracle OLAP 7-9
The argument can be either a cube or a dimension. Example 72 shows a sample script for generating statistics on the Units cube and its dimensions.
Example 72 Generating Statistics for the Units Cube BEGIN DBMS_AW_STATS.ANALYZE('units_cube'); DBMS_AW_STATS.ANALYZE('time'); DBMS_AW_STATS.ANALYZE('customer'); DBMS_AW_STATS.ANALYZE('product'); DBMS_AW_STATS.ANALYZE('channel'); END; /
Although you cannot view the statistics directly, you can examine the execution plans, as described in "Viewing Execution Plans" on page 4-17.
See Also:
Oracle Enterprise Manager Cloud Control (Cloud Control) is a general database management and administration tool. In addition to facilitating basic tasks like adding users and modifying datafiles, Cloud Control presents a graphic overview of a database's current status. It also provides an interface to troubleshooting and performance tuning utilities. Automatic Workload Repository collects database performance statistics and metrics for analysis and tuning, shows the exact time spent in the database, and saves session information. Automatic Database Diagnostic Monitor watches database performance statistics to identify bottlenecks, analyze SQL statements, and offer suggestions to improve performance.
Oracle Database also provides system views to help you diagnose performance problems. The following topics identify views that are either specific to OLAP or provide database information that is pertinent to OLAP.
Oracle Database Reference for full descriptions of the OLAP dynamic performance views.
OLAP Dynamic Performance Views Description Lists the aggregation operators available in analytic workspaces. Lists the allocation operators available in analytic workspaces. Collects information about the use of cache space and the status of dynamic aggregation. Collects status information about SQL fetches. Collects information about each active session. Collects information about the status of active analytic workspaces.
Table 74 View
Table 75 describes some other dynamic performance views that are not specific to OLAP, but which you may want to use when tuning your database for OLAP.
Table 75 View V$LOG V$LOGFILE V$PGASTAT Selected Database Performance Views Description Displays log file information from the control file. Contains information about redo log files. Provides PGA memory usage statistics and statistics about the automatic PGA memory manager when PGA_AGGREGATE_TARGET is set. Displays statistics for data dictionary activity. Each row contains statistics for one data dictionary cache. Lists system statistics.
V$ROWCACHE V$SYSSTAT
"System Tables" on page 7-4 for descriptions of the analytic workspaces owned by SYS.
See Also:
To see the size of the LOB table containing an analytic workspace, use a SQL command like the following, replacing GLOBAL.AW$GLOBAL with the qualified name of your analytic workspace.
SELECT ROUND(SUM(dbms_lob.getlength(awlob))/1024,0) kb FROM global.aw$global; KB ---------187282
7 rows selected.
Table 76 (Cont.) OLAP DBA Scripts SQL Script session_resources shared_pool_hits Description Identifies the use of cursors, PGA, and UGA for each open session. Calculates the shared pool hit ratio.
7 rows selected.
Data Pump. Analytic workspaces are copied with the other objects in a schema or database export. Use the expdp/impdp database utilities.
Tip: Verify that the target schema of an import has the OLAP_XS_ ADMIN privilege. Otherwise, the analytic workspace will not be created with the necessary permissions.
Transportable Tablespaces. Analytic workspaces are copied with the other objects to a transportable tablespace. However, you can only transport the tablespace to the same platform (for example, from Linux to Linux, Solaris to Solaris, or Windows to Windows) because the OLAP DECIMAL data type is hardware dependent. Use the expdp/impdp database utilities. Transportable tablespaces are much faster than dump files.
The owner of an analytic workspace can export the schema to a dump file. Only users with the EXP_FULL_DATABASE privilege or a privileged user (such as SYS or a user with the DBA role) can export the full database or create a transportable tablespace.
See Also:
"Saving and Re-Creating Dimensional Objects with Object Definitions" on page 3-37 for information about XML templates Oracle Database Utilities for information about Oracle Data Pump and the expdp/impdp commands
Analytic workspace: Saves all dimensional objects and all user-defined OLAP DML programs and objects. Dimension: Saves the dimension and its levels, hierarchies, attributes, and mappings. Cube: Saves the cube and its measures, calculated measures, dimensions, mappings, and all user-defined OLAP DML programs and objects associated with the cube. Measure Folder: Saves a list of the measures in the measure folder. It does not save the objects.
Templates store metadata, not data. You can store templates in a small text file or in a database table. When re-creating objects from a template, you must have access to the source data.
See Also:
"Saving and Re-Creating Dimensional Objects with Object Definitions" on page 3-37
EIF files are upwardly compatible among releases of Oracle Database. An EIF file saves the definitions of OLAP DML objects and optionally saves the data also. When you create an EIF file, you can save only the data that you have permission to access. EIF files do not save object security rules. You can export and import EIF files for analytic workspaces. You can use EIF files at a more granular level, such as saving just your custom programs, using the OLAP DML.
See Also:
"Saving and Re-Creating Dimensional Objects with Object Definitions" on page 3-37
page 3-34
Oracle Database Reference for complete descriptions of the data dictionary views
The example shows the cube materialized views defined by Analytic Workspace Manager: One for each dimension hierarchy and one for each cube.
Automatic Refresh: On the Materialized View tab for a cube, you can create a regular schedule for the materialized view refresh subsystem, as described in "Adding Materialized View Capability to a Cube" on page 3-34.
Maintenance Wizard: The Maintenance Wizard is available for refreshing all cubes and dimensions, including cube materialized views. DBMS_CUBE: The DBMS_CUBE PL/SQL package is available for refreshing all cubes, cube dimensions, and cube materialized views. DBMS_MVIEW: The DBMS_MVIEW PL/SQL package contains several procedures for use with cube materialized views.
Using DBMS_CUBE
You can use DBMS_CUBE to create and populate an analytic workspace or to maintain any cube, including cube materialized views. The following command initiates a complete refresh of UNITS_CUBE, which is enabled as a cube materialized view. It automatically refreshes any stale dimensions before refreshing the cube.
EXECUTE dbms_cube.build('GLOBAL.UNITS_CUBE');
You can determine the refresh method from USER_MVIEWS, as shown in "Identifying Cube Materialized Views" on page 7-17.
Using DBMS_MVIEW
You can use DBMS_MVIEW to refresh all types of materialized views. These refresh procedures can be used with cube materialized views:
REFRESH refreshes a list of one or more materialized views. REFRESH_ALL_MVIEWS refreshes all materialized views that meet certain criteria. REFRESH_DEPENDENT refreshes all materialized views that depend on a particular master table and meet certain criteria.
Dimensions must be refreshed before the cube. An error is raised during refresh of a cube materialized view if any of its associated dimension materialized views are stale. The procedures in DBMS_MVIEW can refresh multiple materialized views in one call, but they do not guarantee the refresh order. To control the refresh order, call DBMS_ MVIEW.REFRESH for the cube materialized view separately from its dimension materialized views. The following command initiates a refresh of the materialized view for the CHANNEL_ PRIMARY hierarchy.
EXECUTE dbms_mview.refresh('CB$CHANNEL_PRIMARY', 'C');
Refresh Methods
In Analytic Workspace Manager, you can specify the COMPLETE, FAST, or FORCE methods for refreshing a cube. Two additional methods, FAST_PCT and FAST_SOLVE, are invoked by the materialized view subsystem. They are not separate choices.
Table 77
Refresh Methods For Cube Materialized Views Description Deletes and recreates the cube. This option supports arbitrarily complex mappings from the source tables to the cube.
FAST
Loads and re-aggregates only changed values, based on the materialized view logs or, after direct path loading, on the ALL_ SUMDELTA data dictionary view. The source for the refresh is the incremental differences that have been captured in the materialized view logs, rather than the original mapped sources. These differences are used to incrementally rebuild the cube. Only cells that are affected by the changed values are re-aggregated. This option supports only simple mappings for cube materialized views, that is, where no expressions (other than table.column), views, or aggregations occur in the query defining the mapping. The materialized view subsystem determines whether to perform a FAST or a FAST_PCT refresh. See Oracle Database Data Warehousing Guide for information about the methodology.
FAST_PCT
Loads and re-aggregates data only from changed partitions. This method works best when the source table and the cube are partitioned on the same dimension. FAST_PCT does not use change logs. The materialized view subsystem determines whether to perform a FAST or a FAST_PCT refresh. See Oracle Database Data Warehousing Guide for information about the methodology.
FAST_SOLVE
Loads and re-aggregates only changed values, based on the original mapped data source. FAST_SOLVE is a type of refresh only for cube materialized views. It incrementally re-aggregates the cube even when the refresh source is the original mapped source instead of the materialized view logs. The aggregation subsystem identifies the differences and then incrementally re-aggregates the cube. This option is supported for arbitrarily complex mappings from the source tables to the cube. To discover whether a FAST_SOLVE refresh has occurred, review the CUBE_BUILD_LOG table as shown in "Fast Solve Refreshes". Or review the LAST_REFRESH_TYPE column of ALL_MVIEWS; a FAST_SOLVE refresh appears as FAST_CS.
FORCE
Loads and re-aggregates values using the best method possible. When a COMPLETE refresh is not necessary, the materialized view system first attempts a FAST refresh. If it cannot FAST refresh a cube materialized view, it performs a FAST_SOLVE refresh.
Example 75 Identifying a FAST SOLVE Refresh SELECT build_object, status, command FROM cube_build_log WHERE build_object='UNITS_CUBE'
AND build_id=8; BUILD_OBJECT -----------UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE UNITS_CUBE STATUS ---------STARTED COMPLETED STARTED COMPLETED STARTED COMPLETED STARTED COMPLETED STARTED COMPLETED STARTED COMPLETED STARTED COMPLETED COMMAND ------------------------COMPILE AGGMAP COMPILE AGGMAP UPDATE UPDATE CLEAR LEAVES CLEAR LEAVES LOAD LOAD SOLVE SOLVE UPDATE UPDATE ANALYZE ANALYZE
14 rows selected.
QUERY_REWRITE_ENABLED: Enables or disables query rewrite globally for the database. QUERY_REWRITE_INTEGRITY: Determines the degree to which query rewrite monitors the consistency of materialized views with the source data. The trusted or stale tolerated settings are recommended when using rewrite to cube materialized views.
Administration of cube materialized views is the same as any other materialized view except that the cube materialized views must be in the same schema as the analytic workspace. Users require the GLOBAL QUERY REWRITE privilege to have rewrite to materialized views that are in schemas other than their own. However, the owner can access the materialized views from any schema without additional privileges.
See Also:
"Analyzing Cubes and Dimensions" on page 7-10 for information about optimizer statistics Viewing Execution Plans on page 4-17 for information about execution plans Oracle Database Reference for complete descriptions of the initialization parameters
8
8
Security
Oracle OLAP secures your data using the standard security mechanisms of Oracle Database. This chapter contains the following topics:
Security of Multidimensional Data in Oracle Database Setting Object Security Creating Data Security Policies on Dimensions and Cubes Creating OLAP Data Security Roles
Security Management
Because you have just one system to administer, you do not have to replicate basic security tasks such as these:
Creating user accounts Creating and administering rules for password protection Securing network connections Detecting and eliminating security vulnerabilities Safeguarding the system from intruders
The cornerstone of data security is the administration of user accounts and roles. Users open a connection with Oracle Database with a user name and password, and they have access to both dimensional and relational objects in the same session.
Security 8-1
Types of Security
Users by default have no access rights to an analytic workspace or any other data type in another user's schema. The owner or an administrator must grant them, or a role to which they belong, any access privileges. Oracle OLAP provides two types of security: Object security and data security.
Object security provides access to dimensional objects. You must set object security before other users can access them. Object security is implemented using SQL GRANT and REVOKE. Data security provides fine-grained control of the data on a cellular level. This type of security is optional. You must define data security policies only when you want to restrict access to specific areas of a cube. Data security is implemented using Oracle Real Application Security.
Note:
Only the owner of a schema can create data security policies and OLAP data security roles. The data security policies and OLAP data security roles apply only to objects in the schema.
You can administer both data security and object security in Analytic Workspace Manager. For object security, you also have the option of using SQL GRANT and REVOKE.
Alter: Change the definition of a cube or dimension. Users need this privilege to create and modify a dimensional model. Delete: Remove old dimension members. Users need this privilege to refresh a dimension. Insert: Add new dimension members. Users need this privilege to refresh a dimension. Select: Query the cube or dimension. Users need this privilege to query a view of the cube or dimension or to use the CUBE_TABLE function. CUBE_TABLE is a SQL function that returns the values of a dimensional object. Update: Change the data values of a cube or the name of a dimension member. Users need this privilege to refresh a dimension or cube.
Users exercise these privileges either by using Analytic Workspace Manager to create and administer dimensional objects, or by using SQL to query them. They do not issue commands such as SQL INSERT and UPDATE directly on the cubes and dimensions.
Layered Security
For dimensional objects, you can manage security at these levels:
The privileges are layered so that, for example, a user with SELECT data security on Software products must also have SELECT object security on the PRODUCT dimension and the Global analytic workspace. Users also need SELECT privileges on the views of the dimensional objects. You administer security on views and materialized views for dimensional objects the same way as for any other views and materialized views in the database.
Security 8-3
The privileges apply to the entire cube. However, you can create a data security policy on the cube or on its dimensions to restrict the privileges, as described in "Creating Data Security Policies on Dimensions and Cubes" on page 8-7. Example 82 shows the SQL commands that enable Scott to query the Units cube. They give Scott SELECT privileges on the Global analytic workspace, the cube, and all of its dimensions. Scott also gets privileges on the dimension views so that he can query the dimension attributes for formatted reports.
Example 82 Privileges to Query the Units Cube /* Grant privileges on the analytic workspace */ GRANT SELECT ON global.aw$global TO scott; /* Grant privileges on the cube */ GRANT SELECT ON global.units_cube TO scott; /* Grant privileges on the dimensions */ GRANT SELECT ON global.channel TO scott; GRANT SELECT ON global.customer TO scott; GRANT SELECT ON global.product TO scott; GRANT SELECT ON global.time TO scott; /* Grant privileges on the cube, dimension, and hierarchy views */ GRANT SELECT ON global.units_cube_view TO scott; GRANT SELECT ON global.channel_view TO scott; GRANT SELECT ON global.channel_primary_view TO scott; GRANT SELECT ON global.customer_view TO scott; GRANT SELECT ON global.customer_shipments_view TO scott; GRANT SELECT ON global.customer_segments_view TO scott; GRANT SELECT ON global.product_view TO scott; GRANT SELECT ON global.product_primary_view TO scott; GRANT SELECT ON global.time_view TO scott; GRANT SELECT ON global.time_calendar_view TO scott; GRANT SELECT ON global.time_fiscal_view TO scott;
Example 83 shows the SQL commands that give SCOTT the privileges to query the relational tables for the detail level data and to use query rewrite to obtain summary data from the Units cube.
Example 83 Privileges to Use Cube Materialized Views for Query Rewrite /* Grant privileges on materialized views using query rewrite */ GRANT GLOBAL QUERY REWRITE TO scott; /* Grant privileges on the relational source tables */ GRANT SELECT ON global.channel_dim TO scott; GRANT SELECT ON global.customer_dim TO scott; GRANT SELECT ON global.product_dim TO scott; GRANT SELECT ON global.time_dim TO scott; GRANT SELECT ON global.units_fact TO scott; /* Grant privileges on the analytic workspace */ GRANT SELECT ON global.aw$global TO scott; /* Grant privileges on the cube */ GRANT SELECT ON global.units_cube TO scott; /* Grant privileges on the dimensions */ GRANT SELECT ON global.channel TO scott;
GRANT SELECT ON global.customer TO scott; GRANT SELECT ON global.product TO scott; GRANT SELECT ON global.time TO scott;
Example 84 shows the SQL commands that give SCOTT the privileges to modify and update all dimensional objects in GLOBAL using Analytic Workspace Manager.
Note:
The GRANT ALL commands encompass more privileges than those discussed in this chapter. Be sure to review the list of privileges before using GRANT ALL.
Example 84 Privileges to Modify and Refresh GLOBAL /* Grant privilege to use Analytic Workspace Manager */ GRANT OLAP_USER TO scott; /* Grant privileges on the analytic workspace */ GRANT ALL ON global.aw$global TO scott; /* Grant privileges on the cubes */ GRANT ALL ON global.units_cube TO scott; GRANT ALL ON global.price_cost_cube TO scott; /* Grant privileges on the dimensions */ GRANT ALL ON global.channel TO scott; GRANT ALL ON global.customer TO scott; GRANT ALL ON global.product TO scott; GRANT ALL ON global.time TO scott;
In the navigation tree, right-click the analytic workspace and select Set Analytic Workspace Object Security. The Set Analytic Workspace Object Security dialog box appears.
2.
Complete the dialog box, then click OK. Click Help for specific information about the choices.
3.
Grant privileges on one or more cubes and their dimensions. Privileges on the analytic workspace do not automatically extend to the cubes and dimensions contained in the analytic workspace.
Security 8-5
In the navigation tree, right-click any dimension and select Set Dimension Object Security. The Set Dimension Object Security dialog box appears.
2.
Complete the dialog box, then click OK. You can set privileges on all of the dimensions simultaneously. You can extend the privileges to the dimension and hierarchy views and to the analytic workspace. Click Help for specific information about the choices.
In the navigation tree, right-click any cube and select Set Cube Object Security. The Set Cube Object Security dialog box appears.
2.
Complete the dialog box, then click OK. You can set privileges on all of the cubes simultaneously. You can extend the privileges to the cube views and to the analytic workspace. Click Help for specific information about the choices.
Only the owner of a schema can create data security policies for dimensions and cubes in the schema. When you create a data security policy on a dimension, the policy extends to all cubes with that dimension. You do not need to re-create the policy for each cube. When you create a data security policy on a cube, you select the members for each dimension of the cube. The policy only applies to that cube. When you create data security policies on both dimensions and cubes, users have privileges on the most narrowly defined portion of the data, where the policies overlap.
Security 8-7
Granting Data Privileges You can apply a policy to one or more database users and roles. You can also apply a policy to an OLAP data security role. An OLAP data security role is a group of database users and roles that you can manage in Analytic Workspace Manager just for use in security policies. You create OLAP data security roles and data security policies in Analytic Workspace Manager. Selecting Data By Criteria When defining a data security policy, you can select specific dimension members or those that meet certain criteria based on the dimension hierarchy. By using criteria instead of hard-coding specific dimension members, the selection remains valid after a data refresh. You do not need to modify the selection after adding members. For example, a security policy that grants SELECT privileges to all Hardware products remains valid when old products are rolled off and new products are added to the PRODUCT dimension.
Note: You must have the OLAP_XS_ADMIN role to manage data security policies in Analytic Workspace Manager.
Expand the folder for a dimension or a cube. Right-click Data Security and select Create Data Security Policy. The Create Data Security Policy dialog box appears.
3. 4. 5.
On the General tab, enter a descriptive name in the Data Security Policy Name field. Optional: Enter a description in the Description field. For a dimension, select the method you want to use to select the viewable dimension members, either Member Selection or OLAP DML Expression. The related tab becomes active. For a cube, the method is Member Selection.
6.
Click Add Users or Roles. The Add Users or Roles dialog box appears.
7.
Select the database users and roles and the OLAP data security role to use this policy. Then click OK to close the dialog box. The selected database users and roles and OLAP data security role are now listed in the table on the General tab.
8.
Select the permissions you want to grant to each user or role. You cannot assign permissions to the OLAP data security role because the permissions are part of its definition. Complete the Member Selection tab or the OLAP DML Expression tab, depending on the previously selected method.
9.
The data security policy appears in the navigation tree in the Data Security folder for the dimension or cube.
11. Grant these users and roles object privileges on the dimension or cube, and on the
analytic workspace.
See Also:
"Setting Object Security on an Analytic Workspace" on page 8-5 "Setting Object Security on Dimensions" on page 8-6 "Setting Object Security on Cubes" on page 8-7 "Creating OLAP Data Security Roles" on page 8-10
Figure 83 shows the Member Selection tab of the data security policy for PRODUCT. Users who have privileges on the PRODUCT dimension based on this policy have access to all Hardware products. They do not have access to Software products or Total Product.
Figure 83 Restricting Product to Hardware and Descendants
Disabling and Enabling Data Security When you create a data security policy, Oracle OLAP enables data security for the dimension or cube. You can disable data security for a dimension or a cube. You can then enable data security for the object again. To disable or enable data security:
1. 2.
Expand the folder for a dimension or a cube. Right-click Data Security and select Disable Data for Object or Enable Data for Object. The Disable Confirmation dialog box or the Enable Confirmation dialog box appears.
3.
Click Yes.
Security 8-9
In the navigation tree, right-click Data Security Roles and then select Create Data Security Role. The Create Data Security Role dialog box appears.
2. 3. 4.
On the General tab, enter a descriptive name in the Data Security Role Name field. Optional: Enter a description in the Description field. Click Add Users or Roles. The Add Users or Roles dialog box appears.
5.
Select the users and roles that you want to include in this OLAP data security role. Then click OK to close this dialog box. The selected users and roles are now listed in the table on the General tab.
6. 7.
Select the permissions you want to grant to each user or role. Click Create to save the OLAP data security role. The new OLAP data security role appears in the navigation tree in the Data Security Roles folder.
See Also:
9
9
Advanced Aggregations
A cube always returns summary data to a query as needed. While the cube may store data at the day level, for example, it can return a result at the quarter or year level without requiring a calculation in the query. This chapter explains how to optimize the unique aggregation subsystem of Oracle OLAP to provide the best performance for both data maintenance and querying. This chapter contains the following topics:
What Is Aggregation? Aggregation Operators When Does Aggregation Order Matter? Example: Aggregating the Units Cube
What Is Aggregation?
Aggregation is the process of consolidating multiple values into a single value. For example, data can be collected on a daily basis and aggregated into a value for the week, the weekly data can be aggregated into a value for the month, and so on. Aggregation allows patterns in the data to emerge, and these patterns are the basis for analysis and decision making. When you define a data model with hierarchical dimensions, you are providing the framework in which aggregate data can be calculated. Aggregation is frequently called summarization, and aggregate data is called summary data. While the most frequently used aggregation operator is Sum, there are many other operators, such as Average, First, Last, Minimum, and Maximum. Oracle OLAP also supports weighted and hierarchical methods. Following are some simple diagrams showing how the basic types of operators work. For descriptions of all the operators, refer to "Aggregation Operators" on page 9-3. Figure 91 shows a simple hierarchy with four children and one parent value. Three of the children have values, while the fourth is empty. This empty cell has a null or NA value. The Sum operator calculates a value of (2 + 4 + 6)=12 for the parent value.
What Is Aggregation?
The Average operator calculates the average of all real data, producing an aggregate value of ((2 + 4 + 6)/3)=4, as shown in Figure 92.
Figure 92 Average Aggregation in a Simple Hierarchy
The hierarchical operators include null values in the count of cells. In Figure 93, the Hierarchical Average operator produces an aggregate value of ((2 + 4 + 6 +NA)/4)=3.
Figure 93 Hierarchical Average Aggregation in a Simple Hierarchy
The weighted operators use the values in another measure to generate weighted values before performing the aggregation. Figure 94 shows how the simple sum of 12 in Figure 91 changes to 20 by using weights ((3*2) + (2*4) + (NA*6) +(4*NA)).
Aggregation Operators
Aggregation Operators
Analytic workspaces provide an extensive list of aggregation methods, including weighted, hierarchical, and weighted hierarchical methods.
Basic Operators
The following are descriptions of the basic aggregation operators:
Average: Adds non-null data values, then divides the sum by the number of data values. First Non-NA Data Value: Returns the first real data value. Last Non-NA Data Value: Returns the last real data value. Maximum: Returns the largest data value among the children of each parent. Minimum: Returns the smallest non-null data value among the children of each parent. Nonadditive: Does not aggregate the data. Sum: Adds data values.
Scaled Sum: Adds the value of a weight object to each data value, then adds the data values. Weighted Average: Multiplies each data value by a weight factor, adds the data values, and then divides that result by the sum of the weight factors.
Weighted First: Multiplies the first non-null data value by its corresponding weight value. Weighted Last: Multiplies the last non-null data value by its corresponding weight value. Weighted Sum: Multiplies each data value by a weight factor, then adds the data values.
Hierarchical Operators
The following are descriptions of the hierarchical operators. They include all cells identified by the hierarchy in the calculations, whether or not the cells contain data. Hierarchical Average and the Hierarchical Weighted operators use outer joins.
Hierarchical Average: Adds data values, then divides the sum by the number of the children in the dimension hierarchy. Unlike Average, which counts only non-null children, hierarchical average counts all of the children of a parent, regardless of whether each child does or does not have a value. Hierarchical First Member: Returns the first data value in the hierarchy, even when that value is null. Hierarchical Last Member: Returns the last data value in the hierarchy, even when that value is null. Hierarchical Weighted Average: Multiplies non-null child data values by their corresponding weight values, then divides the result by the sum of the weight values. Unlike Weighted Average, Hierarchical Weighted Average includes weight values in the denominator sum even when the corresponding child values are null. Hierarchical Weighted First: Multiplies the first data value in the hierarchy by its corresponding weight value, even when that value is null. Hierarchical Weighted Last: Multiplies the last data value in the hierarchy by its corresponding weight value, even when that value is null.
Maximum Minimum Sum Hierarchical First Member Hierarchical Last Member Hierarchical Average
Average First Non-NA Data Value Last Non-NA Data Value Weighted First Weighted Last Hierarchical Weighted First Hierarchical Weighted Last Scaled Sum
Figure 96 shows the same cube, except Maximum is calculated first down one dimension of the cube, then Sum is calculated across the other dimension. The maximum value of the sums in Figure 95 is 15, while the sum of the maximum values in Figure 96 is 19.
Figure 96 Max Method Followed by Sum Method
A
A
This guide uses the Global schema for its examples. This appendix explores the business requirements of the fictitious Global Computing Company and discusses how the design of a data model emerges from these requirements. This appendix contains the following topics:
Case Study Scenario Identifying Required Business Facts Designing a Dimensional Model for Global Computing
Traditionally, Global Computing experiences low third-quarter sales (July through September). However, recent sales in other quarters have also been lower than expected. The company has experienced bursts of growth but, for no apparent reason, has had lower first-quarter sales during the last two years as compared with prior years. Global has been successful with its newest sales channel, the Internet. Although sales within this channel are growing, overall profits are declining. Perhaps the most significant factor is that margins on personal computers previously the source of most of Global Computing's profits - are declining rapidly.
Global Computing must understand how each of these factors is affecting its business. Current reporting is done by the IT department, which produces certain standard reports on a monthly basis. Any ad hoc reports are handled on an as-needed basis and are subject to the time constraints of the limited IT staff. Complaints have been widespread within the Sales and Marketing department regarding the delayed
Designing a Dimensional Model A-1
response to report requests. Complaints have also been numerous in the IT department regarding analysts who change their minds frequently or ask for further information. The Sales and Marketing department has been struggling with a lack of timely information about what it is selling, who is buying, and how they are buying. In a meeting with the CIO, the VP of Sales and Marketing states, "By the time I get the information, it is no longer useful. I am only able to get information at the end of each month, and it does not have the details I need to do my job."
Reporting Requirements
When asked to be more specific about what she needs, the Vice President of Sales and Marketing identifies the following requirements:
Trended sales data for specific customers, regions, and segments. The ability to provide information and some analysis capabilities to the field sales force. A web interface would be preferred, since the sales force is distributed throughout the world. Detail regarding mail-order, phone, and email sales on a monthly and quarterly basis, and a comparison to past time periods. Information must identify when, how, and what is being sold by each channel. Margin information on products to understand the dollar contribution for each sale. Knowledge of percent change versus the prior and year-ago period for sales, units, and margin. The ability to perform analysis of the data by ad hoc groupings.
The CIO has discussed these requirements with his team and has concluded that a standard reporting solution against the production order entry system would not be flexible enough to provide the required analysis capabilities. The reporting requirements for business analysis are so diverse that the projected cost of development, along with the expected turnaround time for requests, would make this solution unacceptable. The CIO's team recommends using an analytic workspace to support analysis. The team suggests that the Sales and Marketing department's IT group work with Corporate IT to build an analytic workspace that meets their needs for information analysis.
Business Goals
The development team identifies the following high-level business goals that the project must meet:
Global Computing's strategic goal is to increase company profits by increasing sales of higher margin products and by increasing sales volume overall. The Sales and Marketing department objectives are to: Analyze industry trends and target specific market segments. Analyze sales channels and increase profits. Identify product trends and create a strategy for developing the appropriate channels.
Information Requirements
Once you have established business goals, you can determine the type of information that helps achieve these goals. To understand how end users examine the data in the analytic workspace, it is important to conduct extensive interviews. From interviews with key end users, you can determine how they look at the business, and what types of business analysis questions they want to answer.
What products are profitable? Who are our customers, and what and how are they buying? What accounts are most profitable? What is the performance of each distribution channel? Is there still a seasonal variance to the business?
We can examine each of these business analysis questions in detail. What products are profitable? This business analysis question consists of the following questions:
What is the percent of total sales for any item, product family, or product class in any month, quarter or year, and in any distribution channel? How does this percent of sales differ from a year ago? What is the unit price, unit cost, and margin for each unit for any item in any particular month? What are the price, cost, and margin trends for any item in any month? What items were most profitable in any month, quarter, or year, in any distribution channel, and in any geographic area or market segment? How did profitability change from the prior period? What was the percent change in profitability from the prior period? What items experienced the greatest change in profitability from the prior period? What items contributed the most to total profitability in any month, quarter, or year, in any distribution channel, and in any geographic area or market segment? What items have the highest per unit margin for any particular month? In summary, what are the trends?
Who are our customers, and what and how are they buying? This business analysis question consists of the following questions:
What were sales for any item, product family, or product class in any month, quarter, or year? What were sales for any item, product family, or product class in any distribution channel, geographic area, or market segment? How did sales change from the prior period? What was the percent change in sales from the prior period? How did sales change from a year ago? What was the percent change in sales from a year ago?
Which accounts are most profitable? This business analysis question consists of the following questions:
Which accounts are most profitable in any month, quarter, or year, in any distribution channel, by any item, product family, or product class? What were sales and extended margin (gross profit) by account for any month, quarter, or year, for any distribution channel, and for any product? How does account profitability compare to the prior time period? Which accounts experienced the greatest increase in sales as compared to the prior period? What is the percent change in sales from the prior period? Did the percent change in profitability increase at the same rate as the percent change in sales? In summary, what are the trends?
What is the performance of each distribution channel? This business analysis question consists of the following questions:
What is the percent of sales to total sales for each distribution channel for any item, product family, or product class, or for any geographic area or market segment? What is the profitability of each distribution channel: direct sales, catalog sales, and the Internet? Is the newest distribution channel, the Internet, "cannibalizing" catalog sales? Are customers simply switching ordering methods, or is the Internet distribution channel reaching additional customers? In summary, what are the trends?
Is there still a seasonal variance to the business? This business analysis question consists of the following questions:
Are there identifiable seasonal sales patterns for particular items or product families? How do seasonal sales patterns vary by geographic location? How do seasonal sales patterns vary by market segment? Are there differences in seasonal sales patterns as compared to last year?
Global Computing has a strong need for profitability analysis. The company must understand profitability by product, account, market segment, and distribution channel. It also must understand profitability trends. Global Computing must understand how sales vary by time of year. The company must understand these seasonal trends by product, geographic area, market segment, and distribution channel.
Global Computing has a need for ad hoc sales analysis. Analysis must identify what products are sold to whom, when these products are sold, and how customers buy these products. The ability to perform trend analysis is important to Global Computing.
Identifying Dimensions
Four dimensions are used to organize the facts in the database:
Product shows how data varies by product. Customer shows how data varies by customer or geographic area. Channel shows how data varies according to each distribution channel. Time shows how data varies over time.
Identifying Levels
Now that we have identified dimensions, we can identify the levels of summarization within each dimension. Analysis requirements at Global Computing reveal that:
There are three distribution channels: Sales, Catalog, and Internet. These three values are the lowest level of detail in the data warehouse and are grouped in the Channel level. From the order of highest level of summarization to the lowest level of detail, the levels are Total and Channel. Global performs customer and geographic analysis along the line of shipments to customers and by market segmentation. Shipments and Market are two hierarchies in the Customer dimension. In each case, the lowest level of detail in the data model is the Ship To location. When analyzing along the line of customer shipments, the levels of summarization are (highest to lowest): Total, Region, Warehouse, and Ship To. When analyzing by market segmentation, the levels of summarization are (highest to lowest): Total, Market Segment, Account, and Ship To.
The Product dimension has four levels (highest to lowest): Total, Class, Family, and Item. The Time dimension has four levels (highest to lowest): Total, Year, Quarter, and Month. The dimension has two hierarchies: Calendar and Fiscal.
All dimensions have a Total level as the highest level of summarization. Adding this highest level provides additional flexibility as application users analyze data.
Identifying Hierarchies
We can identify the hierarchies that organize the levels within each dimension. To identify hierarchies, we group the levels in the correct order of summarization and in a way that supports the identified types of analysis. For the Channel and Product dimensions, Global Computing requires only one hierarchy for each dimension. For the Customer dimension, Global Computing requires two hierarchies. Analysis within the Customer dimension tends to be either by geographic area or market segment. Therefore, we organize levels into two hierarchies, Shipments and Segment. Analysis over time also requires two hierarchies, a Calendar hierarchy and a Fiscal hierarchy.
All of the other facts can be derived from these basic facts. The derived facts can be calculated in the analytic workspace on demand. If experience shows that some of these derived facts are being used heavily and the calculations are putting a noticeable load on the system, then some of these facts can be calculated and stored in the analytic workspace as a data maintenance procedure.
B
B
Keyboard Shortcuts
Keyboard shortcuts support accessibility in Analytic Workspace Manager. Most shortcuts work on all platforms, but Windows provides the most reliable results for all of them. If you use keyboard shortcuts for accessibility, then install Analytic Workspace Manager on a Windows platform. The keyboard shortcuts are active within particular areas of the user interface:
Menu Bar Navigation Tree Property Sheets Shuttle Keys Mapping Canvas
Menu Bar
File menu: Alt+F Tools menu: Alt+T Help menu: Alt+H
Navigation Tree
To display a menu for the selected object, press Shift+F10. This is equivalent to clicking the right mouse button. To close the menu for a selected object, press Esc. To expand a folder, press the Right Arrow key. To collapse a folder, press the Left Arrow key. To move the cursor down the tree, press the Down Arrow key. To move the cursor up the tree, press the Up Arrow key. To move the cursor from the navigation tree to a property sheet, press Tab. To move the cursor from a property sheet to the navigation tree, press Shift+Tab.
Property Sheets
To move the cursor from the navigation tree to a property sheet, press Tab. To move the cursor to the next tab, press the Right Arrow key.
Keyboard Shortcuts B-1
Shuttle Keys
To move the cursor to the previous tab, press the Left Arrow key. To move the cursor from a property sheet to the navigation tree, press Shift+Tab. To move the splitter between the navigation tree and a property sheet, press F8 Right Arrow or Left Arrow. To change a menu choice in a table, press F2 Down Arrow.
Shuttle Keys
Move all: Alt+L Move selected: Alt+D Remove selected: Alt+R Remove all: Alt+O To select multiple items, press Ctrl+Arrow, then press the spacebar.
Mapping Canvas
Table mapping view: Ctrl+T Graphical mapping view: Ctrl+G Automatically arrange mappings: Ctrl+Alt+K Automatically map star schema: Ctrl+M Remove all mappings: Ctrl+D Schema Viewer Navigator: All keyboard shortcuts for the navigation tree are available, plus the following additions for the table view: To copy the name of the selected column from the tree: Ctrl+C. To paste a column name into the selected field: Ctrl+V. To expand the width of a column: Select the header and press Alt+Right Arrow. To reduce the width of a column: Select the header and press Alt+Left Arrow.
Glossary
additive Describes a measure or fact that can be summarized through addition, such as a SUM function. An additive measure is the most common type. Examples include sales, cost, and profit. Contrast with nonadditive. aggregation The process of consolidating data values into a single value. For example, sales data could be collected on a daily basis and then be aggregated to the week level, the week data could be aggregated to the month level, and so on. The data can then be referred to as aggregate data. The term aggregation is often used interchangeably with summarization, and aggregate data is used interchangeably with summary data. However, there are a wide range of aggregation methods available in addition to SUM. analytic workspace A container for storing related dimensional objects, such as dimensions and cubes. An analytic workspace is stored in a relational table. See also cube, cube dimension. ancestor A dimension member at a higher level of aggregation than a particular member. For example, in a Time dimension, the year 2007 is the ancestor of the day 06-July-07. The member immediately above is the parent. In a dimension hierarchy, the data value of the ancestor is the aggregated value of the data values of its descendants. Contrast with descendant. See also hierarchy, level, parent. attribute A database object related to an OLAP cube dimension. An attribute stores descriptive characteristics for all dimension members, or members of a particular hierarchy, or only members at a particular level of a hierarchy. When the values of an attribute are unique, they provide supplementary information that can be used for display (such as a descriptive name) or in analysis (such as the number of days in a time period). When the values of an attribute apply to a group of dimension members, they enable users to select data based on like characteristics. For example, in a database representing footwear, you might use a color attribute to select all boots, sneakers, and slippers of the same color. See also cube dimension.
Glossary-1
base level data See detail data. base measure See measure. calculated measure A stored expression that executes in response to a query. For example, a calculated measure might generate the difference in costs from the prior period by using the LAG_ VARIANCE function on the COSTS measure. Another calculated measure might calculate profits by subtracting the COSTS measure from the SALES measure. The expression resolves only the values requested by the query. See also expression, measure. cell A single data value of an expression. In a dimensioned expression, a cell is identified by one value from each of the dimensions of the expression. For example, if you have a measure with the dimensions MONTH and CUSTOMER, then each combination of a month and a customer identifies a separate cell of that measure. See also cube dimension. child A dimension member that is part of a more aggregate member in a hierarchy. For example, in a Time dimension, the month Jan-06 might be the child of the quarter Q1-2006. A dimension member can be the child of a different parent in each hierarchy. Contrast with parent. See also descendant, hierarchy. composite A compact format for storing sparse multidimensional data. Oracle OLAP provides two types of composites: a compressed composite for extremely sparse data, and a regular composite for moderately sparse data. See also dimension, sparsity. compressed cube A cube with very sparse data that is stored in a compressed composite. See also composite. compression See compressed cube. consistent solve specification See solve specification. cube An organization of measures with identical dimensions and other shared characteristics. The edges of the cube contain the dimension members, and the body of the cube contains the data values. For example, sales data can be organized into a cube whose edges contain values from the Time, Product, and Customer dimensions and whose body contains Volume Sales and Dollar Sales data.
Glossary-2
derived measure
cube dimension A cube dimension is a dimensional object that stores a list of values. It is an index for identifying the values of a measure. For example, if Sales data has a separate sales figure for each month, then the data has a Time dimension that contains month values, which organize the data by month. In the context of multidimensional analysis, a cube dimension is called a dimension. See also dimension. cube materialized view A cube that has been enhanced with materialized view capabilities. A cube materialized view can be incrementally refreshed through the Oracle Database materialized view subsystem, and it can serve as a target for transparent rewrite of queries against the source tables. Also called a cube-organized materialized view. cube script A sequence of steps that prepare the data for querying, such as loading and aggregating data. cube view A relational view of the data stored in a cube, which can be queried by SQL. It contains columns for the dimensions, measures, and calculated measures of the cube. custom measure See calculated measure. custom member A dimension member whose data is calculated from the values of other members of the same dimension using the rules defined in a model. See model. data security role A group of users and database roles that is defined just for use in managing OLAP security policies. data source A relational table, view, synonym, or other database object that provides detail data for cubes and cube dimensions. data warehouse A database designed for query and analysis rather than transaction processing. A data warehouse usually contains historical data that is derived from transaction data, but it can include data from other sources. It separates analysis workload from transaction workload and enables a business to consolidate data from several sources. denormalized Permit redundancy in a table. Contrast with normalize. derived measure See calculated measure.
Glossary-3
descendant
descendant A dimension member at a lower level of aggregation than a particular member. For example, in a Time dimension, the day 06-July-07 is the descendant of year 2007. The member immediately below is the child. In a dimension hierarchy, the data values of the descendants roll up into the data values of the ancestors. Contrast with ancestor. See also aggregation, child, hierarchy, level. detail data Data at the lowest level, which is acquired from another source. Contrast with aggregation. dimension A structure that categorizes data. Among the most common dimensions for sales-oriented data are Time, Geography, and Product. Most dimensions have hierarchies and levels. In a cube, a dimension is a list of values at all levels of aggregation. In a relational table, a dimension is a type of object that defines hierarchical (parent-child) relationships between pairs of column sets. See also cube dimension, hierarchy, measure dimension. dimension key See dimension member. dimension member One element in the list that composes a cube dimension. For example, a Time dimension might have dimension members for days, months, quarters, and years. dimension table A relational table that stores all or part of the values for a dimension in a star or snowflake schema. Dimension tables typically contain columns for the dimension keys, levels, and attributes. dimension value See dimension member. dimension view A relational view of a cube dimension that provides information about all members of all hierarchies. It includes columns for the dimension keys, level, and attributes. See also cube dimension, hierarchy view. drill To navigate from one item to a set of related items. Drilling typically involves navigating up and down through the levels in a hierarchy. Drilling down expands the view to include child values that are associated with parent values in the hierarchy. Drilling up collapses the list of descendant values that are associated with a parent value in the hierarchy.
Glossary-4
key
EIF file A specially formatted file for transferring data between analytic workspaces, or for storing versions of an analytic workspace (all of it or selected objects) outside the database. embedded total A list of dimension members at all levels of a hierarchy, such that the aggregate members (totals and subtotals) are interspersed with the detail members. For example, a Time dimension might contain dimension members for days, months, quarters, and years. expression A combination of one or more values (typically provided by a measure or a calculated measure), operators, and functions that evaluates to a value. An expression generally assumes the data type of its components. The following are examples of expressions, where SALES is a measure: SALES, SALES*1.05, TRUNC(SALES). fact See measure. fact table A table in a star schema that contains factual data. A fact table typically has two types of columns: those that contain facts and those that are foreign keys to dimension tables. The primary key of a fact table is usually a composite key that is made up of all of its foreign keys. A fact table might contain either detail facts or aggregated facts. Fact tables that contain aggregated facts are typically called summary tables or materialized views. A fact table usually contains facts with the same level of aggregation. See also materialized view. hierarchy A way to organize data at different levels of aggregation. Hierarchies are used to define data aggregation; for example, in a Time dimension, a hierarchy might be used to aggregate data from days to months to quarters to years. Hierarchies are also used to define a navigational drill path. In a relational table, hierarchies can be defined as part of a dimension object. See also level-based hierarchy, ragged hierarchy, skip-level hierarchy, value-based hierarchy. hierarchy view A relational view of a cube dimension that provides information about the members that belong to a particular hierarchy. It includes columns for the dimension keys, parents, levels of the hierarchy, and attributes. See also cube dimension, dimension view. key A column or set of columns included in the definition of certain types of integrity constraints. Keys describe the relationships between the different tables and columns of a relational database.
Glossary-5
leaf data
See also dimension member. leaf data See detail data. level A named position in a hierarchy. For example, a Time dimension might have a hierarchy that represents data at the month, quarter, and year levels. The levels might be named Month, Quarter, and Year. The names provide an easy way to reference a group of dimension members at the same distance from the base. level-based hierarchy A hierarchy composed of levels. For example, Time is always level based with levels such as Month, Quarter, and Year. Most hierarchies are level based. See also value-based hierarchy. mapping The definition of the relationship and data flow between source and target objects. For example, the metadata for a cube includes the mappings between each measure and the columns of a fact table or view. materialized view A database object that provides access to aggregate data and can be recognized by the automatic refresh and the query rewrite subsystems. See also cube materialized view. measure Data that represents a business measure, such as sales or cost data. You can select, display, and analyze the data in a measure. The terms measure and fact are synonymous; measure is more commonly used in a multidimensional environment and fact is more commonly used in a relational environment. Measures are dimensional objects that store data, such as Volume Sales and Dollar Sales. Measures belong to a cube. See also calculated measure, fact, cube. measure dimension A dimension that has measures as dimension members. measure folder A database object that organizes and label groups of measures. Users may have access to several schemas with measures named Sales or Costs, and measure folders provide a way to differentiate among them. model A set of interrelated equations specified using the members of a particular dimension. Line item dimensions often use models to calculate the values of dimension members. See also custom member. Contrast with calculated measure.
Glossary-6
NA value A special data value that indicates that data is "not available" (NA) or null. It is the value of any cell to which a specific data value has not been assigned or for which data cannot be calculated. See also cell, sparsity. nonadditive Describes a measure or fact that cannot be summarized through addition, such as Unit Price. Maximum is an example of a nonadditive aggregation method. Contrast with additive. normalize In a relational database, the process of removing redundancy in data by separating the data into multiple tables. Contrast with denormalized. OLAP Online Analytical Processing. OLAP functionality is characterized by dynamic, dimensional analysis of historical data, which supports activities such as the following:
Calculating across dimensions and through hierarchies Analyzing trends Drilling up and down through hierarchies Rotating to change the dimensional orientation
Contrast with OLTP. OLAP DML A set of commands, functions, and options used to manage dimensional data stored in analytic workspaces within Oracle Database. Analytic Workspace Manager, the OLAP expression syntax, the OLAP Java API, and various applications and PL/SQL packages enable users to access dimensional data without using the OLAP DML directly, but those tools use the OLAP DML to accomplish the desired tasks. The OLAP Data Manipulation Language (DML) operates exclusively within analytic workspaces, whose primary data structures are dimensions, variables, formulas, relations, and valuesets. These dimensional objects in analytic workspaces support the high-level dimensional objects in the database, such as cubes, cube dimensions, measures, attributes, and hierarchies. Contrast with OLAP expression syntax. OLAP expression syntax An extension of the SQL syntax that is used to manipulate the data stored in dimensional database objects such as cubes, cube dimensions, attributes, and measures. Contrast with OLAP DML.
Glossary-7
OLTP
OLTP Online Transaction Processing. OLTP systems are optimized for fast and reliable transaction handling. Compared to data analysis systems, most OLTP interactions involve a relatively small number of rows, but a larger group of tables. Contrast with OLAP. on the fly Calculated at run time as needed in response to a specific query. In a cube, calculated measures and custom members are typically calculated as needed. Aggregate data can be precomputed, calculated as needed, or a combination of the two methods. Contrast with precompute. override solve specification See solve specification. page A unit for swapping data in and out of memory. Also called a block. page space A grouping of related data pages. parent A dimension member immediately above a particular member in a hierarchy. In a dimension hierarchy, the data value of the parent is the aggregated total of the data values of its children. Contrast with child. See also hierarchy, level. parent-child relation A one-to-many relationship between one parent and one or more children in a hierarchical dimension. For example, New York (at the state level) might be the parent of Albany, Buffalo, Poughkeepsie, and Rochester (at the city level). See also child, parent. precalculate See precompute. precompute Calculate and store as a data maintenance procedure. In a cube, aggregate data can be precomputed, calculated as needed, or a combination of the two methods. Contrast with on the fly. ragged hierarchy A hierarchy that contains at least one member with a different base level, creating a "ragged" base level for the hierarchy. Organization dimensions are frequently ragged. refresh Load new and changed values from the source tables and recompute the aggregate values.
Glossary-8
star query
security role See data security role. skip-level hierarchy A hierarchy that contains at least one member whose parents are multiple levels above it, creating a hole in the hierarchy. For example, in a Geography dimension with levels for City, State, and Country, Washington D.C. is a city that does not have a State value; its parent is United States at the Country level. snowflake schema A type of star schema in which the dimension tables are partly or fully normalized. See also normalize, star schema. solve specification The aggregation method for each dimension of the cube. solved data A result set in which all derived data has been calculated. Data fetched from a cube is always fully solved, because all of the data in the result set is calculated before it is returned to the SQL-based application. The result set from the cube is the same whether the data was precomputed or calculated as needed. See also on the fly, precompute. source See data source. sparsity A concept that refers to multidimensional data in which a relatively high percentage of the combinations of dimension values do not contain actual data. There are two types of sparsity:
Controlled sparsity occurs when a range of values of one or more dimensions has no data; for example, a new measure dimensioned by Month for which you do not have data for past months. The cells exist because you have past months in the Month dimension, but the cells are empty. Random sparsity occurs when nulls are scattered throughout a measure, usually because some combinations of dimension members never have any data. For example, a district might only sell certain products and never have sales data for the other products.
Some dimensions may be sparse while others are dense. For example, every time period may have at least one data value across the other dimensions, making Time a dense dimension. However, some products may not be sold in some cities, and may not be available anywhere for some time periods; both Product and Geography may be sparse dimensions. See also composite. star query A join between a fact table and several dimension tables. Each dimension table is joined to the fact table using a primary key to foreign key join, but the dimension tables are not joined to each other.
Glossary-9
star schema
star schema A relational schema whose design represents a dimensional data model. The star schema consists of one or more fact tables and one or more dimension tables that are related through foreign keys. See also snowflake schema. status The list of currently accessible values for a given dimension. The status of a dimension persists within a particular session, and does not change until it is changed deliberately. When an analytic workspace is first attached to a session, all members are in status. See also cube dimension, dimension member. summary See aggregation. update window The length of time available for loading data into a database. value-based hierarchy A hierarchy defined only by the parent-child relationships among dimension members. The dimension members at a particular distance from the base level do not form a meaningful group for analysis, so the levels are not named. For example, an employee dimension might have a parent-child relation that identifies each employee's supervisor. However, levels that group first-, second-, and third-level supervisors and so forth may not be meaningful for analysis. See also hierarchy, level-based hierarchy.
Glossary-10
Index
A
ADVISOR privilege, 2-2 aggregate functions, 3-20 aggregation average operator, 9-2 calculated measures, 4-15 definition, 9-1 hierarchical average operator, 9-2 over attributes, 4-14 sum operator, 9-2 weighted operators, 9-2 aggregation operators, 3-17, 4-14, 9-3 aggregation order, 9-4 aggregation percentages, 9-7 aggregation step (cube scripts), 3-31 ALL_AW_OBJ view, 7-4 ALL_AW_PROP view, 7-4 ALL_AW_PS view, 7-4 ALL_AWS view, 7-4 ALL_CUBES view, 7-6 analysis tools, 1-3 analytic functions, 5-2, 5-15 Analytic Workspace Manager configuring, 2-3, 2-5, 3-25 installing, 2-2 opening, 2-3 using, 3-2 to 3-39 analytic workspace security, 8-3, 8-5 analytic workspaces backing up and recovering, 7-15 creating, 3-3 database storage, 7-5 disk space consumption, 7-14 enhancing functionality, 3-4 identifying owners, 7-12 listing, 7-12 saving and re-creating, 3-37 size, 7-12 analyze step (cube scripts), 3-31 Application Express, 1-3, 6-11 arithmetic operations, 5-2 attachment modes configuring, 2-5 selecting, 3-3 showing, 2-4 attribute aggregation, 4-14 attributes creating, 3-9 defined, 1-8, 3-8 authentication, 2-1 Automatic Database Diagnostic Monitor, 7-10 Automatic Storage Management, 7-3 Automatic Workload Repository, 7-10 average cumulative, 5-12 moving, 5-11 average operator (aggregation), 9-2 average rank, 5-10 AVERAGE_RANK function, 5-16 AVG function, 3-20, 5-16 AW$ tables, 7-5
B
backup and recovery, 7-15 backup options, 7-15 batch processing, 7-7 BI Publisher, 6-3 BI Suite, 1-6 bind variables, 6-1, 6-10, 6-16, 6-18 branches (Application Express), 6-16 build logs, 3-14 BusinessObjects Enterprise, 1-6
C
calculated measures and measure dimensions, 1-7 copying and pasting, 3-39 creating, 5-3 defined, 5-1 generator, 5-3 calculation templates, 5-5, 5-6 calculations free-form, 5-14 in queries, 4-13 nested, 5-13 time ranges, 5-6 changes, saving, 3-4 character functions, 4-10 clear data step (cube scripts), 3-31
Index-1
CLEAR LEAVES command, 7-19 Cloud Control, 7-10 Cognos ReportNet, 1-6 column links, 6-19 configuring partitioning options, 3-25 configuring Analytic Workspace Manager for a proxy server, 2-3 for partitioning options, 3-25 for plug-ins, 2-5 for showing attachment modes, 2-5 connect string, for Analytic Workspace Manager, 2-4 connections, defining, 2-4 COUNT function, 3-20 CREATE ANY DIMENSION privilege, 2-2 CREATE ANY MATERIALIZED VIEW privilege, 2-2 CREATE DIMENSION privilege, 2-2 CREATE MATERIALIZED VIEW privilege, 2-2 CREATE SESSION privilege, 2-2 creation dates of analytic workspaces, 7-13 CUBE JOIN, 4-16 cube materialized views, 3-34, 7-16 CUBE SCAN operation, 4-19 cube scripts, 3-31 cube security, 8-3 cube views, 3-29, 4-2 CUBE_TEMPLATES table, 3-37 cubes copying and pasting, 3-39 creating, 3-16 defined, 1-6, 3-16 joining, 4-16 mapping, 3-20 requirements for materialized views, 3-34 saving and re-creating, 3-37 cumulative calculations, 5-12 cursors, 1-2
database integration, 1-1 database security, 2-1 DBA scripts download, 7-13 DBA_AW_OBJ view, 7-4 DBA_AW_PROP view, 7-4 DBA_AW_PS view, 7-4 DBA_AWS view, 7-4, 7-12 DBA_OBJECTS view, 7-13 DBA_REGISTRY view, 7-12 DBMS_AW_STATS PL/SQL package, 7-10 DBMS_CUBE PL/SQL package, 3-30, 7-18 DBMS_LOB PL/SQL package, 7-12 DBMS_METADATA PL/SQL package, 7-21 DBMS_MVIEW PL/SQL package, 7-18, 7-21 DBMS_SCHEDULER PL/SQL package., 3-30 DBMS_XPLAN PL/SQL package, 7-21 dense rank, 5-10 DENSE_RANK function, 5-16 dimension hierarchies See hierarchies dimension object security, 8-6 dimension order, affecting aggregation, 9-5 dimension security, 8-3 dimension views, 4-3 dimensions copying and pasting, 3-39 creating, 3-6 defined, 1-7, 3-4 saving and re-creating, 3-37 viewing members, 3-16 Discoverer Plus OLAP, 1-6 disk space consumption, 7-14 disks, spreading data across, 7-3 displaying data, 3-29 drillable reports, 6-3 drilling, 4-10, 6-19 drilling (Application Express), 6-17 dump files, 7-15 dynamic performance tables, 7-11
D
dashboard, 1-3 data dictionary views, 4-19, 7-4 data display, 3-16, 3-29 data loads, 3-14, 3-26 data maintenance, 3-30 data model description of dimensional, 1-6 designing, 3-1 saving, 3-37 Data Pump, 7-15 data security, 8-2 disabling and enabling, 8-9 implementation, xiv policies, 8-7 roles, 8-10 data sources database objects, 3-2 mapping, 3-11, 3-18 database connections, defining, 2-4
E
edits, saving, 3-4 EIF files about, 7-16 creating analytic workspaces from, 3-38 saving analytic workspaces to, 3-38 end date attributes, 3-9 Enterprise Manager Cloud Control, 7-10 execution plans, 4-17 EXP_FULL_DATABASE privilege, 7-15 EXPLAIN PLAN command, 4-17 extensibility using plug-ins, 2-5 EXTENT MANAGEMENT LOCAL, 7-2
F
FAST SOLVE method, 7-19 filtering queries, 4-6 free-form calculations, 5-14
Index-2
G
generator, calculated measures, 5-3 Global Computing Company data requirements, A-2 to A-6 GLOBAL QUERY REWRITE privilege, 7-20 Global schema download, 2-1 Gregorian calendar, 5-6
6-8, 6-15
M
maintenance alternatives, 3-30 maintenance scripts, 3-33 Maintenance Wizard, 3-14, 3-26 mappings cube, 3-18 dimension, 3-11 materialized views access privileges, 7-20 creating cube, 3-34 refresh logs, 7-17 MAX function, 3-20, 5-16 maximum cumulative, 5-12 moving, 5-11 measure dimension table mapping dimension to, 3-12 measure dimensions aggregation method of cube, 3-17 and calculated measures, 1-7 mapping, 3-12 measure folders creating, 3-36 saving and re-creating, 3-37 measures copying and pasting, 3-39 creating, 3-17 defined, 1-6 MIN function, 3-20, 5-16 minimum cumulative, 5-12 moving, 5-11 moving calculations, 5-11
H
hidden items (Application Express), 6-17 HIER_ANCESTOR function, 5-16 HIER_CHILD_COUNT function, 5-16 HIER_DEPTH function, 5-16 HIER_LEVEL function, 5-16 HIER_PARENT function, 5-16 HIER_TOP function, 5-16 hierarchical average operator (aggregation), hierarchical operators, 9-4 hierarchical queries, 4-10 hierarchies creating, 3-8 defined, 1-7, 3-7 level-based, 3-6 supported types, 3-7 hierarchy views, 4-3
9-2
I
index, 5-7 initialization parameters, 7-1 init.ora file, 7-1 installing Analytic Workspace Manager, 2-2 installing OLAP option, validation, 7-12 integration in database, 1-1
J
JOB_QUEUE_PROCESSES parameter, joining cubes, 4-16 7-2, 7-7
N
natural keys, 3-5 nested calculations, 5-13 NO_USE_CUBE hint, 4-16 normal hierarchies, 3-7
L
LAG function, 5-8, 5-16 LAG_VARIANCE function, 5-16 LAG_VARIANCE_PERCENT function, 5-16 language support, 3-35 layout template (BI Publisher), 6-3 LEAD function, 5-8, 5-16 LEAD_VARIANCE function, 5-16 LEAD_VARIANCE_PERCENT function, 5-16 level-based dimensions, 3-4 level-based hierarchy, 3-6 levels creating, 3-6 defined, 1-8 load step (cube scripts), 3-31 loading data, 3-14, 3-26
O
object security, 8-2, 8-3, 8-5 objects copying and pasting, 3-39 mapping, 3-11, 3-18 saving and re-creating, 3-37 OLAP data security roles, 8-10 OLAP DML calculated measures, 5-17 expressions for data security policies, 8-8 OLAP DML step (cube scripts), 3-31 OLAP option, verifying installation, 7-12 OLAP_DBA role, 2-2 OLAP_USER role, 2-2 Index-3
OLAP_XS_ADMIN role, 2-2, 8-8 optimizer statistics, 7-10 Oracle Application Express, 1-3 Oracle Business Intelligence, 1-6 Oracle Real Application Clusters, 1-3, 7-9 Oracle Real Application Security, xiv, 8-2 Oracle Recovery Manager, 7-15 OracleBI Discoverer Plus OLAP, 1-6 OracleBI Spreadsheet Add-In, 1-6 OracleBI Suite Enterprise Edition, 1-6 OUTER plan option, 4-19 owners of analytic workspaces, identifying, 7-12
report entry (BI Publisher), 6-3 report layout (BI Publisher), 6-7 reports, 6-3 RMAN, 7-15 ROW_NUMBER function, 5-17
S
sample schema download, 2-1 saving analytic workspaces to EIF files, 3-38 objects to XML Templates, 3-38 scaled operators, 9-3 scheduling maintenance, 7-7 security about, 8-1 data, 8-2 materialized views, 7-20 object, 8-5 See also data security server parameter file, 7-2 SESSIONS parameter, 7-2 share, 5-9 SHARE function, 5-17 single-row functions, 5-2 size of analytic workspace, 7-12 skip-level hierarchies, 3-7 source data, 3-2 Spreadsheet Add-In, 1-6 static data dictionary views, 4-19, 7-4 step types, 3-31 SUM function, 3-20, 5-17 sum operator (aggregation), 9-2 surrogate keys, 3-5 system tables, 7-4
P
page definition (Application Express), 6-14 parallel periods, 5-11 parallel processing, 7-7 parameter file, 7-2 parent-child relations, 1-8 PARTIAL OUTER plan option, 4-19 partitioning analyzing partition members, 3-25 benefits, 3-22 discussed, 7-6 selecting partitions, 3-23 performance counters, 7-11 period to date, 5-8 pfile settings, 7-2 PLAN_TABLE table, 4-17 PL/SQL step (cube scripts), 3-31 plug-ins configuring, 2-5 installing, 2-5 prior periods, 5-8 privileges, 8-2 proxy server configuring, 2-3 PS$ tables, 7-5
T
tablespaces, 7-2 templates BI Publisher, 6-5 calculation, 5-5 creating XML, 3-37 saving object definitions to, 3-38 time dimensions, 3-5 time ranges in calculations, 5-6 time span attributes, 3-9 total cumulative, 5-12 moving, 5-11 transportable tablespaces, 7-15
Q
queries, filtering, 4-6 query rewrite, 7-20 query tools, 1-3 QUERY_REWRITE_ENABLED parameter, 7-20 QUERY_REWRITE_INTEGRITY parameter, 7-20 querying dimensions and cubes, 4-1
R
RAC See Oracle Real Application Clusters ragged hierarchies, 3-7 rank, 5-10 RANK function, 5-16 Real Application Clusters See Oracle Real Application Clusters refresh logs, 7-17 refresh methods, 7-17, 7-18 Relational Schema Advisor, 3-34, 7-17 Index-4
U
UNDO_MANAGEMENT parameter, 7-2 UNDO_TABLESPACE parameter, 7-2 unique key attributes, 3-10 upgrading metadata, 2-5 USE_CUBE hint, 4-16 user names, 2-1
USER_AW_OBJ view, 7-4 USER_AW_PROP view, 7-4 USER_AW_PS view, 7-4 USER_AWS view, 7-4 USER_CUBE_DIM_VIEW_COLUMNS view, USER_CUBE_DIM_VIEWS view, 4-4 USER_CUBE_HIER_LEVELS view, 4-7 USER_CUBE_HIER_VIEWS view, 4-4 USER_CUBE_VIEW_COLUMNS view, 4-2 USER_MVIEWS view, 7-17
4-4
V
V$AW_AGGREGATE_OP view, 7-11 V$AW_ALLOCATE_OP view, 7-11 V$AW_CALC view, 7-11 V$AW_LONGOPS view, 7-11 V$AW_OLAP view, 7-11 V$AW_SESSION_INFO view, 7-11 value-based dimensions, 3-4 value-based hierarchies, 3-8
W
weighted operators, 9-3 weighted sum (aggregation), 9-2 WHERE clause operations, 4-10
X
XML templates about, 7-16 creating objects from, 3-37 saving object definitions to, 3-38
Index-5
Index-6