Memory Analysis
Memory Analysis
Analysis
Delivering Insights
at the Speed of Thought
BY WAYNE ECKERSON
Director of Research, Business Applications and Architecture Group, TechTarget, December 2011
Research Background
RESEARCH
BACKGROUND
RECOMMENDATIONS
Figure 1:
Respondent Profile
Based on 198
respondents,
BI Leadership Forum,
October 2011
(www.bileadership.com).
Figure 2:
Company Size of Survey Respondents
Based on 198 respondents, BI Leadership Forum, October 2011 (www.bileadership.com).
SMALL
(< $50m)
RESEARCH
BACKGROUND 23% LARGE
($1b)
EXECUTIVE MEDIUM
47%
SUMMARY ($50m to $1b)
30%
BI MEGA TRENDS
NEXT-GENERATION
BI CAPABILITIES
RECOMMENDATIONS
Executive Summary
RESEARCH
BACKGROUND
making it easy for users to meet their own information needs once an IT
person, power user or superuser has done the initial setup. Top-down tools
expose semantic layers and widget libraries built by IT professionals and allow
superusers to build ad hoc reports and dashboards (i.e., mashboards). Con-
versely, bottom-up tools, such as popular visual
analysis tools, let power users and superusers
RESEARCH explore data culled from a variety of back-end
BACKGROUND
systems and build fast, highly interactive dash- You cant discuss
boards for their departmental colleagues. next-generation BI
Finally, you cant discuss next-generation BI
EXECUTIVE
SUMMARY tools without examining their back-end data
tools without examin-
architectures. To deliver the highest level of ing their back-end
BI MEGA TRENDS
query performance possible, many new BI tools data architectures.
store data locally in an in-memory database or
intelligent mid-tier cache. Others query back-
NEXT-GENERATION end databases directly, relying on ROLAP (rela-
BI CAPABILITIES tional online analytical processing) SQL generation capabilities or the power
of analytical platforms to handle complex queries and deliver super fast per-
formance. And some tools give users the flexibility of caching data locally or
SELF-SERVICE BI
querying back-end databases, depending on business requirements and sys-
tems availability. n
ARCHITECTING
FOR BI
USER INPUT ON
NEXT-GENERATION
BI
RECOMMENDATIONS
BI Mega Trends
RESEARCH
BACKGROUND
BI FRAMEWORK 2020
EXECUTIVE At a macro level, next-generation BI capabilities are depicted in my BI Frame-
SUMMARY work 2020, which I introduced in my 2011 report titled Analytic Architectures:
Approaches to Supporting Analytics Users and Workloads. Ive updated the
framework since its first publication, but the contents remain much the same.
BI MEGA TRENDS
The framework depicts four intelligences for delivering reporting and analysis
applications: business intelligence, continuous intelligence, analytics intelli-
NEXT-GENERATION gence and content intelligence, each of which brings different types of BI tools
BI CAPABILITIES to the table (see Figure 3).
SELF-SERVICE BI
End-User Tools
FOR BI
Content Intelligence
MAD Dashboards
Architecture
MapReduce, XML, Key-value
Xquery, Hive, Java, etc.
Data Ware-
Pairs, Graph Notation, etc.
NEXT-GENERATION
Figure 3:
Dashboards
Dashboard Alerts
Hadoop/NoSQL
Search,Boxes
Event Detection
Alerts and
and Correlation
CEP, Streams
BI
Event-driven
Reporting BI Framework
& 2020
Analysis
RECOMMENDATIONS
Analytic
Analytic Sandboxes
Sandboxes
Ad hoc SQL
Analytics Intelligence
NEXT-GENERATION Figure 4:
BI CAPABILITIES
Reporting Versus Analysis: Distinct Workloads, Users and Architectures
SELF-SERVICE BI
Pros:
- Alignment TOP DOWN- Business Intelligence
- Consistency
Corporate Objectives and Strategy
ARCHITECTING
Cons:
FOR BI - Hard to build Reporting & Monitoring (Casual Users)
- Politically
charged
Predefined Non-volatile
- Hard to change DW
Metrics
- Expensive Architecture Data
USER INPUT ON - Schema
NEXT-GENERATION Heavy
BI
Reports Analysis
Beget Begets
Analysis Reports
RECOMMENDATIONS
Pros:
- Quick to build Analytics Ad hoc Volatile
- Politically queries Data
Architecture
uncharged
- Easy to change
- Low cost
Analysis and Prediction (Power Users)
Cons: Processes and Projects
- Alignment
- Consistency
- Schema Light
its business operates. Thus, reports and dashboards built on a data ware-
house take a lot of time and money to create and are hard to change, but they
ensure a consistent mapping of enterprise information, which is a key busi-
ness requirement among executives who want to run their businesses on a
single set of numbers.
The solution to this dynamic is to recognize that you need both top-down
and bottom-up BI tools, and that these tools need to work together, not
against each other. Buying reporting and analysis tools from a single enter-
prise BI vendor does not guarantee the tools will interoperate, but it is a
step in the right direction. Also, it is important to establish BI tool standards
for each type of user in your organization. At the highest level, casual users
RESEARCH should get one type of tool and power users another, although casual users
BACKGROUND
may end up using the Web-based published output of the power-user tool.
At a finer-grained level, there are many different types of casual and power
EXECUTIVE users, and you may need a tool standard for each type.
SUMMARY
Multiple modalities. Muddying up the waters even more, some users play
multiple roles and thus should have different types of tools for each role.
BI MEGA TRENDS
Unfortunately, few BI sponsors want to purchase multiple BI tools, and most
casual users dont want to have to learn to use multiple tools. So were back to
NEXT-GENERATION the quest for a single BI tool that offers multiple modalities geared to different
BI CAPABILITIES types of BI activity. We are starting to see tools that offer such functionality.
For example, a prototypical casual user spends 80% of his time viewing,
navigating and interacting with data but then adds a new metric, dimension,
SELF-SERVICE BI
attribute or prompt that is not in the data set. The user crosses the 80/20
line and becomes a power user who wants to create new data. Most BI prod-
ARCHITECTING ucts dont handle this pivot point gracefully, but some are starting to provide
FOR BI optional do more modalities that expose new functionality on the fly, when
users need it and are ready for it.
USER INPUT ON
NEXT-GENERATION Summary. The BI Framework 2020 provides a macro view of the BI environ-
BI ment of the future. It shows that there is no one BI environment that can ade-
quately support the various BI activities involved in building and using report-
ing and analysis applications. Rather than search for a single BI architecture,
RECOMMENDATIONS
tool set and design environment, BI professionals need to think about creating
a BI ecosystem in which multiple reporting and analysis environments inter-
operate to help businesspeople use information to make smarter decisions.
The BI team may not oversee all these distinct environments, but they need to
be aware that they exist. They must work to propagate standards for shared
information entities used across these environments and eliminate overlaps
and inefficiencies in the information delivery workflow. n
Next-Generation
BI Capabilities
RESEARCH
BACKGROUND
END-USER CHARACTERISTICS
NEXT-GENERATION If you are looking for a next-generation BI tool, here are some of the key end-
BI CAPABILITIES user characteristics the tool should support:
Interactive. An interactive BI tool lets users interact with the display rather
than just view the data. There are many ways to make a BI tool interactive.
RECOMMENDATIONS
Users may want to personalize the look and feel of the display or select the
metrics, charts and tables to display from a library of such objects. Or they
may want to navigate the data by drilling down along predefined paths or
interact with the objects themselves, applying filters, toggling between chart
types or printing, exporting or snapshotting views of the data. A menu bar
of icons or right-click should expose a list of context-sensitive actions that
administrators should be able to hide or expose, depending on a users role,
task or experience.
Visual. Visualizing data makes it easier for users to quickly see patterns,
trends and anomalies in the data. Next-generation BI tools support a rich array
of interactive visualizations and suggest appropriate visualizations based on
the type of data users want to display. Rather than render static images, BI
tools should let users interact with the graphical displays. For instance, a user
should be able to mouse over a chart element to view its underlying data prop-
RESEARCH erties or click on the chart to drill down and view more detail. On a scatter plot,
BACKGROUND
a user should be able to lasso data points and use them to create a new group
or as a filter for another chart or table. And on a heat map, a user should be
EXECUTIVE able to right-click to display a time-series chart, among other things.
SUMMARY
Collaborative. Few decisions are made in a vacuum, but most BI tools today
look like they were designed to be used by an isolated employee in a cubicle.
New BI tools let power users publish live, interactive reports and dashboards
to internal and external Web pages where others with permission can access
the information. Beyond publishing, casual users need BI tools that enable
them to associate comments with an entire
RESEARCH dashboard, individual charts or tables or even
BACKGROUND
cells within a chart or table. Ideally, the com-
Collaboration is
ments form an ongoing discussion among team
members about whats happened and what to perhaps even more
EXECUTIVE
SUMMARY do about it. The discussion culminates when important for power
team members evaluate the effectiveness of the users.
decision or action taken by the team.
BI MEGA TRENDS
But collaboration is perhaps even more
important for power users, who always work in
NEXT-GENERATION isolation and often re-create the work of other analysts without knowing it. A
BI CAPABILITIES BI tool should allow power users to share their analyses with others and track
whos doing what. Power users should be given the ability to rate analyses,
follow other power users and add comments and links to any published analy-
SELF-SERVICE BI
sis. This type of power user collaboration can greatly enhance productivity
through reuse and make work more interesting and dynamic.
ARCHITECTING
FOR BI Mobile. Users should be able to access interactive reports and dashboards
through any device. These reports and dashboards should look, feel and act
like the normal applications that users access through the Web or a desktop
USER INPUT ON
NEXT-GENERATION tool, yet they should also exploit the unique characteristics of the underlying
BI device, giving users the best of both worlds. The applications should also work
offline.
RECOMMENDATIONS
IT CHARACTERISTICS
The above list focused on end-user characteristics of a next-generation BI tool
set. The following list focuses on IT characteristics.
Fast. Because of the Google effect, users now expect subsecond response
times for complex queries. After all, if Google can locate highly relevant docu-
ments out of billions on the Internet, why cant a BI tool locate and display the
right data from a corporate data warehouse and other systems?
enterprise-caliber database servers that can hold and process large volumes.
These servers include massively parallel processing (MPP) databases, data-
base appliances and NoSQL systems that typically run on a grid of commodity
servers.
Comprehensive. Since its unlikely that one BI tool can support the needs of
all users, the best BI tools offer tools or modules to support the complete BI
stack, from pixel-perfect reporting and online analytical processing (OLAP)
to visual analysis and data mining. Some even go beyond the BI stack and
offer data integration and data quality tools as well as databases and even the
hardware that everything runs on. Some of the bigger enterprise BI vendors
RESEARCH will soon package all these components into a single BI appliance or online
BACKGROUND
service (i.e., cloud offering).
SUMMARY
Next-generation BI tools merge top-down and bottom-up capabilities into
SELF-SERVICE BI
a single-user environment. Top-down BI vendors are working to acceler-
ate the time required to install, configure and model a BI semantic layer and
ARCHITECTING data model and are adding new modules to compete with in-memory visual
FOR BI analysis tools. Bottom-up BI vendors are working to increase the scalability,
reliability, reusability and manageability of their BI tool environments to avoid
creating departmental silos.
USER INPUT ON
NEXT-GENERATION Casual users benefit because they will get more visual, interactive and
BI easier-to-use tools while superusers will get tools that make it easier to build
ad hoc, interactive reports and dashboards for casual users. Power users
also benefit because they get more flexible BI tools that attach effortlessly to
RECOMMENDATIONS
almost any data source and provide the analytical flexibility they need to carry
out their analyses without having to dump data into Excel.
However, until vendors ship an uber BI tool, many organizations standardize
on top-down BI tools for metrics-driven reports and bottom-up BI tools for ad
hoc analysis. n
Self-Service BI
RESEARCH
BACKGROUND
The Holy Grail of BI. For years, self-service BI has been the Holy Grail for BI
EXECUTIVE professionals and a key characteristic of next-generation BI tools. Self-service
SUMMARY BI promises to provide business users with easy-to-use tools that enable them
to get the information they want, when and how they want it without IT or
power-user intervention. And self-service BI liberates the BI team from hav-
BI MEGA TRENDS
ing to fulfill endless requests for custom reports, each of which takes weeks
or months to deliver and rarely contains all the data users want in the form
NEXT-GENERATION or shape they desire. Thus, self-service BI provides a win-win situation that
BI CAPABILITIES makes everyone more productive.
person who has query and design knowledge when creating ad hoc reports
and dashboards. And bottom-up tools require business analysts to dumb
down their workeliminating filters, simplifying parameters, cleaning up
screensbefore publishing them for casual users. Nonetheless, both types of
tools are closing the gap between top-down and bottom-up BI and delivering
true self-service capabilities.
RESEARCH
BACKGROUND
Top-Down Approaches
Semantic layers. The traditional top-down approach to self-service BI
EXECUTIVE involves creating a semantic layer that defines back-end data entities as a set
SUMMARY of business-oriented objects. Users then drag and drop these objects from
a folder structure into a query panel and submit the query. They then use a
point-and-click design tool to format the output into tables or charts with
BI MEGA TRENDS
appropriate fonts, colors and other design elements. This is a lot of work for a
casual user, who can barely remember how to log in to the BI tool. However,
NEXT-GENERATION its a perfect environment for superusers, who can now create ad hoc reports
BI CAPABILITIES without writing a line of code or learning SQL.
SELF-SERVICE BI
Figure 5.
Top-Down and Bottom-Up Approaches to Self-Service BI
ARCHITECTING
FOR BI
RECOMMENDATIONS
Self-Service Semantic
Visual Analysis
BI Tools Layer/Mashboards
Ad hoc queries
And data
Bottom-Up Approach
Visual analysis. The primary bottom-up In many respects,
SELF-SERVICE BI
approach to self-service BI uses visual desktop visual analysis tools
analysis tools to publish dashboards to a man-
offer the best of both
ARCHITECTING aged server. Here, power users conduct an anal-
FOR BI ysis and publish their output for casual users worlds.
to use as a live, interactive dashboard. Con-
versely, superusers can use visual analysis tools
USER INPUT ON
NEXT-GENERATION explicitly to create departmental dashboards
BI for their colleagues. In many respects, visual analysis tools offer the best of
both worlds. They are powerful analysis tools that provide easy access to any
structured data source and speed-of-thought analysis, yet their publishing
RECOMMENDATIONS
capabilities turn them into highly interactive departmental dashboards.
Many in-memory visual analysis tools can hold 50 million records or more,
depending on the memory footprint of the server hardware. In effect, these
tools serve as departmental data marts (or small data warehouses) tailored
to a specific audience. Although most of these tools update their in-memory
databases once a day, administrators can schedule the data to refresh on an
incremental basis, as frequently as every five minutes. Some even offer the
option to store data locally in memory or query the data directly in the source
systems. This lets users opt for greater speed (i.e., local in-memory system)
or current data (i.e., direct access), depending on their requirements and the
capabilities of source systems.
FUNCTIONALITY ON DEMAND
So far, weve defined self-service BI as giving users the ability to create new
reports and dashboards without ITs assistance. This is important, but its
only one dimension of self-service BI. A more refined view of self-service BI
Functional Hierarchies
Next, its important to understand that information consumers and producers
USER INPUT ON
NEXT-GENERATION
BI
Figure 6:
Mapping Types of
RECOMMENDATIONS
Users to Self-Service
Hierarchies
Both casual users
and power users both
consume and produce
reports and analyses,
but power users exploit
more advanced
features.
1. See the second edition of my book, Performance Dashboards: Measuring, Monitoring, and Managing Your Business (Hoboken, NJ:
John Wiley & Sons, 2010), available at most online bookstores.
Evolving Requirements
When casual users first use a BI tool to consume information, they may only
view data. After a while, as they become more familiar with the tool, they may
want to navigate to the details. Later, they may want to modify data in the
report or dashboard by adding a calculated column. Conversely, information
producers may first want to assemble data from pre-existing report parts or
RESEARCH craft them using a semantic layer. But after a while they may want to source
BACKGROUND
data independently and mix it with other data.
Next-generation BI tools need to offer the full spectrum of functionality for
EXECUTIVE both information consumers and producers. But, more important, they need
SUMMARY to expose this functionality on demand, as users need them and are capable
of using them. Most tools enable administrators to control what functional-
ity users can access using fine-grained access control lists. But this approach
BI MEGA TRENDS
alone is cumbersome, since its hard to know exactly when users are ready
for more. But exposing all functionality at once can overwhelm even the most
NEXT-GENERATION experienced users, undermining their productivity. (Think of Microsoft Office
BI CAPABILITIES 2007.) So the best BI tools expose functionality discreetly by means of icons
displayed in the menu bar, ribbon or content frame, but only to users who are
most likely to use those functions.
SELF-SERVICE BI
SUMMARY
ARCHITECTING Self-service BI is a key feature of next-generation BI tools. Top-down and
FOR BI bottom-up BI tools both offer self-service capabilities. Top-down tools are
designed to help casual users perform ad hoc tasks, while bottom-up tools
help power users publish interactive dashboards. Today, mashboards and
USER INPUT ON
NEXT-GENERATION visual analysis tools hold the most promise for delivering on the promise of
BI self-service BI.
Besides tools that bridge the gulf between metrics-driven reporting and ad
hoc analysis, we also need BI tools that expose functionality on demand to
RECOMMENDATIONS
information consumers and producers. The two types of users traverse dif-
ferent functional hierarchies. And individual users descend to deeper levels
in the hierarchy as they gain more experience with a tool. Thus, its impera-
tive that tools support the full range of functionality for each type of user but
expose functionality as users need it to maximize adoption and usage. This is
another critical, but often overlooked, dimension of self-service BI. n
Architecting for BI
RESEARCH
BACKGROUND
ITS FUN TO TALK ABOUT BI tool functionality, past, present and future. But, BI
EXECUTIVE tools dont work unless there is a back-end data management system that
SUMMARY they can query. The nature of that systemhow it stores data and runs que-
riesplays a significant role in the success of any BI tool. A well-designed BI
tool running against a slow data management system inevitably fails.
BI MEGA TRENDS
To ensure adequate performance, BI professionals need to think long and
hard about how to architect their BI environ-
NEXT-GENERATION ments. The key decision they need to make is
BI CAPABILITIES where to house data and how to process que-
A well-designed BI tool
ries. There are three basic options to perform
this work: database, application server or client. running against a slow
SELF-SERVICE BI
In more complex scenarios, architects spread data management
data and query processing across two or more system inevitably fails.
ARCHITECTING of these layers to optimize performance.
FOR BI At the same time, many BI tools now come
with in-memory databases, which, in effect, cre-
ate self-contained analytical environments that may or may not interoperate
USER INPUT ON
NEXT-GENERATION much with the larger BI environment. This is a blessing and a curse. On one
BI end, a self-contained BI environment lets business departments and power
users access and analyze data without IT involvement. On the other, it has the
potential to create redundant silos of analytical information, unless the tools are
RECOMMENDATIONS
implemented under the purview of the IT department, which can point them to
existing data warehouses and data marts to preserve unity of information.
DATABASE PROCESSING
The most basic BI architecture is one in which a BI tool queries a remote data-
base. Here, the BI tool generates the queries, displays the results and enables
users to view and manipulate the delivered data set. The database holds all
the data, optimizes and processes the queries and generates the result sets,
which it feeds to the BI tool. With the database providing so much of the pro-
cessing, there are several ways to architect a BI system.
Data warehouse. For simplicity, most BI architects store all data in a data
warehouse, a single repository of integrated data optimized for query process-
ing. Data warehouses work well until the number of concurrent users submit-
RESEARCH ting queries, both easy and complex, starts to bog down performance. At this
BACKGROUND
point, architects can scale up the data warehouse by adding more indexes,
aggregations or hardware; throttle or kill offending queries; or offload data and
EXECUTIVE users to other machines.
SUMMARY
BI PROCESSING
The second most basic architecture puts more of the onus for storing
data and processing queries on the BI tool. There are several options here
as well:
In-memory database. Here, the BI tool comes with its own in-memory data-
RESEARCH base to store and process all data used by the BI tool. Typically, these tools
BACKGROUND
pull data from a data warehouse or directly from transaction systems in a
single nightly batch load. However, most can be updated at any frequency,
EXECUTIVE applying only changes in source systems to the target files. The benefit here
SUMMARY is that queries run extraordinarily fast when data is held in memory (i.e., ran-
dom-access memory [RAM]) versus disk. Plus, an in-memory database func-
tions as a data mart, freeing up processing cycles in the data warehouse to
BI MEGA TRENDS
handle ad hoc queries and other workloads. The downside is that you may not
be able to fit all your data into RAM and support large numbers of concurrent
NEXT-GENERATION users. And although the cost of RAM has dropped considerably, if you have
BI CAPABILITIES lots of data and users, it can still get expensive. Plus, you again replicate data
and systems, which can lead to analytical silos.
Some tools now give users the option to hold data in memory or query it
SELF-SERVICE BI
directly in the data warehouse or other system. Users may decide to down-
load data to a local in-memory database to get consistently fast performance
ARCHITECTING on a frequently used data set or when back-end systems are bogged down.
FOR BI Theyll choose to query data sources directly when they want the most cur-
rent data possible or if they dont own the data and cant download it.
USER INPUT ON
NEXT-GENERATION Mid-tier cache. To gain the benefits of memory without the constraints, some
BI BI vendors create an intelligent, mid-tier cache on the BI server that stores
query result sets temporarily (see Figure 7). Here, BI tools run queries against
the cached data when possible, instead of the remote database. When the
RECOMMENDATIONS
cache fills up, it bumps the least used data back to disk. Intelligent caches
apply permissions to data to keep users from viewing unauthorized data. In
many respects, mid-tier caches deliver the same results as in-memory data-
bases, except the data is not permanently stored in the cache. The problem
with mid-tier caches is that the first user to run a query each day doesnt get
the benefit of in-memory processing. To circumvent this problem, administra-
tors often run certain queries early in the morning to ensure that frequently
used data is already in the cache.
Figure 7:
In-Memory Databases Versus Intelligent Caches
Intelligent caches speed response times for pre-run queries while still giving users access to
data stored in remote databases (of any size). In-memory databases speed response times
for all queries running against a downloaded data set but dont provide direct access to
remote data. Some tools use a hybrid approach users can choose to query remote data or
download data to a local server or desktop to improve query performance.
RESEARCH
BACKGROUND
EXECUTIVE
SUMMARY
BI MEGA TRENDS
NEXT-GENERATION
BI CAPABILITIES
SELF-SERVICE BI
ARCHITECTING
FOR BI
USER INPUT ON
NEXT-GENERATION
BI ROLAP servers. To get the benefits of cubes without replicating data, some BI
tools use a relational OLAP, or ROLAP, architecture to generate dimensional
result sets on the fly. These tools generate complex SQL that runs against
RECOMMENDATIONS
data warehouses or data marts. The downside is performance, since generat-
ing dimensional aggregations on the fly can be slow. To circumvent this prob-
lem, some ROLAP vendors now split query processing between a database
server, which handles basic SQL operations, and the BI server. To help ROLAP
performance, administrators often create aggregate tables, which function
like multidimensional cubes but without the downsides of duplicating or mov-
ing data, although aggregate tables can suck up IT development and adminis-
tration time.
Federated queries. Some BI tools assume responsibility not just for generat-
ing queries but optimizing them as well. BI virtualization toolssuch as Ora-
cles Common Enterprise Information Model, part of Oracle BI Enterprise Edi-
tioncreate a global semantic model that makes multiple remote databases
appear as a single local database. Behind the scenes, the BI virtualization tool,
which understands the profile of each remote database, optimizes queries to
RESEARCH run against those systems, deciding which database to query first, then moves
BACKGROUND
data to the next database or brings all data back to the BI server to finish an
operation.
EXECUTIVE
SUMMARY Client cache. Some BI tools also move data to the client machine to speed
performance. For example, Flash-based dashboards require users to download
data and animations to their client machines before they can view and interact
BI MEGA TRENDS
with the data. Although Flash now can query back-end systems, Flash-based
designers need to be careful to minimize the amount of data displayed on any
NEXT-GENERATION given dashboard page to avoid lengthy downloads that can irritate users. Also,
BI CAPABILITIES some mobile BI applications, namely, those designed for Apples iPad and
iPhone, download data to those devices to enhance performance.
SELF-SERVICE BI
DATABASE OPERATIONS
Database management vendors continually add new features and techniques
ARCHITECTING to their products to improve query performance and keep up with growing
FOR BI workloads from both top-down and bottom-up BI. To display a simple dash-
board may require a database to execute multiple, complex queries and return
results in a split second. Power users can generate hundreds of complex, ad
USER INPUT ON
NEXT-GENERATION hoc queries that may require scanning a huge fact table and joining it with
BI other tables. Here are some of the more salient features that BI architects
need to understand to deliver fast performance:
RECOMMENDATIONS
Parallelization. Databases now try to parallelize all operations to remove
bottlenecks. This includes loading, streaming, aggregating, scanning, joining,
summing, sorting and merging. This parallelization happens at the processor
level to exploit multi-core processors as well as the node level, if the database
runs on a massively parallel processing system.
Storage-level filtering. Many databases now run some or all SQL functions
RESEARCH at the storage layer so data can be processed at the binary level, further mini-
BACKGROUND
mizing the data that must be retrieved from disk.
Native SQL. Similarly, most databases have native SQL dialects that contain
functions which can dramatically speed performance if the BI tools know the
USER INPUT ON
NEXT-GENERATION dialect and can exploit it within SQL queries.
BI
Special data types. Similarly, databases now support unique data types,
such as geospatial data, along with special SQL functions to manipulate this
data. Without built-in support for such functions, users would need to down-
load data to a specialized application server and carry out the processing
there.
Hadoop. Many companies are storing large volumes of clickstream and other
unstructured or semi-structured data in Hadoop, which runs queries against
an open source distributed file system. Specialized analysts, known as data
SUMMARY
RESEARCH There are many options for architecting an analytics environment. BI archi-
BACKGROUND
tects need to select the right option or combination of options to ensure
adequate performance and scalability for their BI applications. If the IT depart-
EXECUTIVE ment controls the data management architecture, it will need to adapt to what
SUMMARY already exists unless it wants to create and manage its own data architecture.
Most vendors are quick to adapt new technologies and often leapfrog each
other in capability. New advances in the use of memory, solid-state disks,
BI MEGA TRENDS
columnar storage, storage-level processing and in-database analytics are rap-
idly advancing the state of the art, giving BI applications a badly needed turbo
NEXT-GENERATION boost. n
BI CAPABILITIES
SELF-SERVICE BI
ARCHITECTING
FOR BI
USER INPUT ON
NEXT-GENERATION
BI
RECOMMENDATIONS
User Input on
Next-Generation BI
RESEARCH
BACKGROUND
THIS FINAL SECTION PRESENTS the results of our user survey. To keep the survey
EXECUTIVE short, I focused the survey on casual users and the BI tool capabilities needed
SUMMARY to support them. I also asked a question on BI architectures.
END-USER CAPABILITIES
BI MEGA TRENDS
To identify the most important BI tool features for this set of users, I asked BI
professionals a delicious question: If next month you had to buy a new BI tool
NEXT-GENERATION for your casual users, what end-user capabilities would be most important?
BI CAPABILITIES Ease of use grabbed the top spot by a wide
margin. Almost all respondents (95%) rated
ease of use as very high or high in impor-
SELF-SERVICE BI
tance. In fact, an astounding 75% rated it very Almost all survey
high in importance, almost double for any other respondents (95%)
feature in our list (see Figure 8).
ARCHITECTING rated ease of use as
FOR BI This raises the questionWhat constitutes
ease of use? Examining responses to the very high or high
surveys only open-ended question shows that in importance.
USER INPUT ON
NEXT-GENERATION ease of use is a catch-all category to address
BI all the problems that casual users experience
with BI tools.
In response to the question What are the biggest drawbacks to your cur-
RECOMMENDATIONS
rent BI tool set for casual users? many respondents said their tools were
too clunky or cumbersome or complicated. Others cited more specific
issues, such as difficulty drilling or inability to insert user calculations
or No Google-like, easy, metadata-driven search. Others said self-service
capabilities of the tools simply expect too much of casual users.
[The tool] assumes users spend most of their work time dinking with data
when they dont.
Figure 8:
End-User Capabilities of BI Tools Desired by BI Professionals
RESEARCH
BACKGROUND
EXECUTIVE
SUMMARY
BI MEGA TRENDS
NEXT-GENERATION
BI CAPABILITIES
SELF-SERVICE BI
ARCHITECTING
FOR BI Not intuitive enough. Users shouldnt have to think, except about the busi-
ness problem they are trying to solve.
USER INPUT ON
NEXT-GENERATION Others shifted the onus from the tool to power users who design interactive
BI reports and dashboards for casual users or provide mashboards for ad hoc
dashboard creation:
RECOMMENDATIONS
Power users have made it too complicated to use for the casual users
because we dont have a varied tool set.
The creation of an ad hoc mashboard is a little too complex for casual
HIERARCHY OF NEEDS
As the results in Figure 8 show, there is a four-tiered hierarchy of end-user
capabilities that BI professionals expect BI tools to offer.
At the top of the heap is ease of use and performance. Unless a tool is
intuitive and fast, users wont bother. Next in importance are visualization and
interactivity, which now seem to go hand in hand: Users want to view data
graphically and change those views with a simple mouse click or two. Third in
the hierarchy is self-service and analytic flexibility, both of which empower
casual users to add, change or delete data in reports or dashboards, although
RESEARCH this is still beyond the capabilities of many casual users today, as evidenced by
BACKGROUND
respondent comments cited above. Last, BI professionals value emerging capa-
bilities, such as mobile BI, collaboration and metadata-driven search.
EXECUTIVE
SUMMARY Satisfaction ratings. Despite the criticism, BI professionals gave decent
scores to the end-user capabilities of their current BI tools for casual users.
More than a third of respondents (39%) rated their satisfaction with their
BI MEGA TRENDS
current BI tools for casual users as very high or high. Another 37% were
moderately satisfied with their BI tools, while 17% gave the tools a fair or
NEXT-GENERATION low rating (see Figure 9).
BI CAPABILITIES Interestingly, small companies (15%) are twice as likely as medium-sized
companies (7%) to give very high satisfaction ratings to their current tools,
and medium-sized companies are twice as likely as large companies (3%) to
SELF-SERVICE BI
ARCHITECTING
FOR BI
Figure 9:
BI Professionals Satisfaction with End-User Capabilities of Current BI Tools
USER INPUT ON
NEXT-GENERATION
BI
RECOMMENDATIONS
give a very high satisfaction rating. This raises the question: What type of
BI system do small companies use that delivers such high degrees of satisfac-
tion? While we didnt ask about specific tools, we can guarantee that small
companies can benefit most from the avalanche of new low-cost, BI tools built
on the latest technologies. Most other rating degrees were comparable across
organizations of different sizes (see Figure 10).
RESEARCH
BACKGROUND
EXECUTIVE
Figure 10:
SUMMARY Satisfaction with Existing BI Tools by Company Size
BI MEGA TRENDS
NEXT-GENERATION
BI CAPABILITIES
SELF-SERVICE BI
ARCHITECTING
FOR BI
USER INPUT ON
NEXT-GENERATION
BI IT CAPABILITIES
Respondents had varied answers when asked to rate the importance of IT
capabilities for casual users in a BI tool they might purchase next month. All
RECOMMENDATIONS
but one of the listed IT capabilities, enterprise pedigree, received a very
high or high importance rating from 50% or more of the respondents.
And five of the nine selections garnered a very high or high rating from
75% or more of the respondents. In other words, BI professionals universally
favor high levels of IT capabilities in their BI tools. No surprise there (see
Figure 11).
Cost to maintain a BI tool was deemed the most important overall, with
81% of respondents citing its importance as very high or high. Obviously,
Figure 11:
IT Capabilities of Casual User BI Tools Desired by BI Professionals
RESEARCH
BACKGROUND
EXECUTIVE
SUMMARY
BI MEGA TRENDS
NEXT-GENERATION
BI CAPABILITIES
SELF-SERVICE BI
ARCHITECTING
FOR BI as IT teams are asked to do more with less, keeping a close watch on main-
tenance budgets is key to delivering more value to the business. BI tools that
require developers to create and modify reports and semantic layers are cum-
USER INPUT ON
NEXT-GENERATION bersome to administer and add to expense. And they carry annual software
BI maintenance fees. Presumably, a tool that offers greater self-service capabili-
ties reduces the tools overall cost of ownership.
In addition, BI professionals who responded to the survey were almost
RECOMMENDATIONS
equally enamored with other IT capabilities, including data scalability
(76%), time to deploy (75%), end-to-end functionality (75%), and user
scalability (74%). There was a slight drop-off to the next set of desired IT
capabilities, including design (68%) and cost to buy (65%), and another
drop-off to administration (58%) and enterprise pedigree (38%).
Its clear that BI professionals want their BI tools to offer strong IT capabili-
ties. And they dont care whether its an enterprise BI tool or not. A closer
examination, however, shows they want enterprise scalable BI tools (both
user and data scalability) that dont cost a lot to buy or maintain. In essence,
they want the best of both worlds.
Figure 12:
BI Professionals Satisfaction with IT Capabilities of Current BI Tools
BI MEGA TRENDS
NEXT-GENERATION
BI CAPABILITIES
SELF-SERVICE BI
ARCHITECTING
FOR BI
USER INPUT ON
NEXT-GENERATION
BI
RECOMMENDATIONS
BI ARCHITECTURES
When it comes to architecting a BI environment, most organizations point
their BI tools at a data warehouse (70%) or data mart (59%) (see Figure
13). After that, there is a significant drop-off, but the next two most popular
data sources are a surprise: local files (36%) and operational data stores, or
ODSes, (35%). The local files represent the unplanned, ad hoc nature of many
RESEARCH BI activities, while the ODS represents operational reporting, still a major part
BACKGROUND
of most BI endeavors.
Interestingly, weve given a lot of attention to in-memory databases, data-
EXECUTIVE base appliances and dynamic data caches in this report, but these sources
SUMMARY only account for 18% of sources among surveyed companies. NoSQL data-
bases barely register at 5%.
BI MEGA TRENDS
Data sources by company size. When we filter the same data by company,
a few interesting things pop out. First, large companies are most likely to
NEXT-GENERATION
BI CAPABILITIES
ARCHITECTING
FOR BI
USER INPUT ON
NEXT-GENERATION
BI
RECOMMENDATIONS
employ data warehouses (78%) or data marts (69%) with their BI tools than
either small or medium-sized companies (Figure 14). This is no surprise, since
large companies have been doing BI longer, and BI has traditionally gone hand
in hand with data warehouse and data mart deployments. In addition, larger
companies largely have a greater need for enterprise views of data, hence
their investment in data warehouses and BI tools that run against them.
RESEARCH Next, small companies are more likely to employ local files (44%), such as
BACKGROUND
Excel spreadsheets, as the source of data for their BI tools than medium-sized
or large companies are. Again, thats no surprise, since small companies large-
EXECUTIVE ly run on spreadsheets and are less advanced in their adoption of BI tools.
SUMMARY In addition, medium-sized companies are more likely to run their BI tools
against transaction systems (29%) or an in-memory database (23%) than
small or large companies are. My speculation is that medium-sized compa-
BI MEGA TRENDS
nies are just beginning to invest in BI and, with limited capital budgets, are
most likely to purchase departmentally based in-memory BI tools, which have
NEXT-GENERATION been popular in recent years.
BI CAPABILITIES
SELF-SERVICE BI
Figure 14:
Data Sources by Company Size
ARCHITECTING
FOR BI
USER INPUT ON
NEXT-GENERATION
BI
RECOMMENDATIONS
Finally, large companies are slightly more likely to purchase analytical appli-
ances (20%) than either small companies (18%) or medium-sized compa-
nies (14%) are. These appliances typically arent cheap. Large companies use
them as analytical sandboxes to complement a data warehouse, while small
and medium-sized companies use them as their
data warehouse platforms.
RESEARCH
BACKGROUND
SUMMARY If BI professionals could
If BI professionals could purchase a new BI tool purchase a new BI tool
today, they would look for ones that are faster,
EXECUTIVE
SUMMARY better and easier to use. They also want tools
today, they would look
that are more visual, interactive and analyti- for ones that are faster,
cal. From an administrative perspective, they better and easier to use.
BI MEGA TRENDS
want tools that are scalable and easier to main-
tain. Despite these and other emphatic wishes,
NEXT-GENERATION about 40% of BI professionals are satisfied with
BI CAPABILITIES their current BI tools. Small companies seem happier with their tools than
large companies, perhaps because they jumped into the BI game later and
purchased more modern, lower-cost, departmentally oriented BI tools. n
SELF-SERVICE BI
ARCHITECTING
FOR BI
USER INPUT ON
NEXT-GENERATION
BI
RECOMMENDATIONS
Recommendations
RESEARCH
BACKGROUND
1.
SUMMARY
2.
Standardize on top-down and bottom-up tools. Today, you need
USER INPUT ON
NEXT-GENERATION both top-down and bottom-up BI tools to meet the needs of casual
BI users and power users. Top-down tools built on data warehouses
deliver enterprise-caliber standard reports and dashboards for casual users,
while bottom-up tools give power users the ability to perform ad hoc analy-
RECOMMENDATIONS
ses against any data source and create highly interactive, departmental
dashboards for their colleagues. In the future, one tool set may be sufficient
to meet the needs of all users. Bottom-up BI vendors, in particular, are work-
ing to deliver the scalability, reliability and maintainability espoused by their
enterprise brethren. In contrast, enterprise BI vendors are delivering in-mem-
ory visual analysis tools to compete with the upstart bottom-up folks. Today,
its unclear which set of vendors will dominate the market or even whether its
possible to meet all needs in a single tool set.
3.
Embrace self-service BI with governance. Self-service BI promises to
liberate end users, giving them the information they want, when and
how they want it. Reporting tools with semantic layers and mashboard
services and visual analysis tools with in-memory databases have made it
easier for power users to create ad hoc reports and analyses. Visual analysis
tools, in particular, have the potential to finally deliver on the promise of self-
RESEARCH service BI. However, self-service BI must be accompanied by a strong gov-
BACKGROUND
ernance program to ensure that superusers and power users dont create a
plethora of conflicting reports that overwhelm and confuse casual users.
4.
EXECUTIVE
SUMMARY Recognize that there are two types of self-service. Casual users
need self-service 20% of the time, while power users need self-
service 80% of the time. Casual users need BI tools that allow them
BI MEGA TRENDS
to change the views in a report or dashboard with simple mouse clicks to
suit their needs. Power users need highly flexible tools that can attach to any
NEXT-GENERATION source. They also need tools that let them model and manipulate the data
BI CAPABILITIES without scripting and collaborate with other power users to reuse data work-
flows and analyses instead of reinventing them each time. Dont give casual
users self-service tools designed for power users or vice versa. This doesnt
SELF-SERVICE BI
work. Casual users will find the tools too hard, while power users will find
them inflexible. n
ARCHITECTING
FOR BI
USER INPUT ON
NEXT-GENERATION ABOUT THE AUTHOR
BI Wayne Eckerson has been a thought leader in the data warehousing, business intelli-
gence (BI) and performance management fields since 1995. He has conducted numer-
ous in-depth research studies and is the author of the best-selling book Performance
RECOMMENDATIONS
Dashboards: Measuring, Monitoring, and Managing Your Business. He is a noted keynote
speaker and blogger and he consults and conducts workshops on business analytics,
performance dashboards and BI, among other topics. For many years, Eckerson served
as director of education and research at The Data Warehousing Institute, where he oversaw the com-
panys content and training programs and chaired its BI Executive Summit.
Eckerson is currently director of research at TechTarget, where he writes a popular weekly blog called
Waynes World, which focuses on industry trends and examines best practices in the application of BI
(www.b-eye-network.com/blogs/eckerson). Eckerson is also president of BI Leader Consulting (www.bi
leader.com) and founder of BI Leadership Forum (www.bileadership.com), a network of BI directors who
meet regularly to exchange ideas about best practices in BI and educating the larger BI community. He
can be reached at [email protected].
About SAS:
SAS is the leader in business analytics software and services, and the largest independent
vendor in the business intelligence market. Through innovative solutions, SAS helps customers
at more than 50,000 sites improve performance and deliver value by making better decisions
faster. Since 1976 SAS has been giving customers around the world The Power to Know .