20PE301 Data Visualization
20PE301 Data Visualization
how it brings actionable insights to the surface. Unlike one-dimensional tables and
charts that can only be viewed, data visualization tools enable users to interact with
data.
Geo-Spatial Visualization
Another scope of data visualization that has emerged in the business world lately
is the geo-spatial visualization. The popularity of geo-spatial visualization has occurred
due to a lot of websites providing web-services, attracting visitors’ interest. This type
of business needs to take advantage of location specific information, which is already
present in the system in the form of customer’s zip code, providing better daily analysis
experience. This type of visualization adds a new dimension to the figures and helps in
better understanding of the matter.
Customized Data-Visualization
A yet another vital advantage of Data-visualization is that it not only provides
graphical representation of data but also allows changing the form, omitting what is
not required, and filter more to get further details. This is a great eye catcher and
attracts business executive’s attention better and provides better communication.
Additionally, it provides a great advantage over traditional methods of presenting
data.
6 Introduction to Data Visualization Tools
Unique Features
• Connectors to Google Data Sources
• Transformation tools for working with raw data
ChAPTer I DATA VISUALIZATION TOOLS 7
Advantages
• Developed as part of the Google Analytics suite, fully integrated with other rele-
vant Google products
• Simple in all major aspects, easy to use
• Great collaboration capabilities
Disadvantages
• Not flexible as its competitors
• Less ability to add custom visuals, only to modify the existing ones to some extent
• Interactivity is not supported
• No functionality for mixing and blending data
• The data must be ready for visualization, only minor changes can be done to it
1.3.2 Tableau
It is a popular and market-leading data visualization tool used to visualize and
analyze the data in an easily digestible format. It is an extremely powerful tool that
focuses on business intelligence and analysis, utilized by thousands of companies
worldwide. It allows working with live data-sets and spends more time on data analysis.
It has a very large customer base across industries due to its simplicity and ability to
produces interactive visualization. It is particularly well suited to handling the huge and
very fast changing data-sets which are used in big data operations, including artificial
intelligence and machine learning applications.
Tableau has different licensing plans similar to other BI solutions. Tableau offers
three distinct products with drastically different prices.The three products are the
Tableau desktop, Tableau online, and Tableau server that come at different price points.
It supports integration with large number of advanced variety of data sources with
many systematic types, such as data systems organized in file formats (CSV, JSON,
8 Introduction to Data Visualization Tools
XML, MS Excel, etc.), relational and non-relational data systems (PostgreSQL, MySQL,
SQL Server, MongoDB, etc.), cloud systems (AWS, Oracle Cloud, Google BigQuery,
Microsoft Azure).
The core distinction from competitors is that Tableau has a special feature of
Data Blending. Another unique feature is the ability for collaboration in real time that
makes it a valuable investment for commercial and non-commercial organizations alike.
There are several ways to share the reports in Tableau, by publishing them to a Tableau
server, via email Tableau Reader capability, by publishing Tableau workbook openly
and giving access to anyone who has a link. This magnitude of options enables great
flexibility and removes many restrictions.
Tableau offers a broad variety of visualization capabilities with distinct features,
enabling smart ways of data discovery and deep insight. The rich library of visualizations
types includes word clouds and bubble charts that provide high levels of comprehension
unique to Tableau. Tree diagram and Treemap provide contextual information to the
visuals. The latter is usually utilized for the depiction of parts categorical data, focusing
attention on the most relevant pieces of the information.
Tableau dashboards are amazingly flexible. Its central features allow the remar-
kable ability to layout the dashboard in the desired way with any overlaps, which comes
really handy in the screen space ergonomics.
Tableau is easy to apprehend as a working tool, its learning curve is pretty gentle,
as it strives to provide all of its powers to any kind of users, even those who haven’t
been previously exposed to technical details of visualization workflows. This objective
is accomplished by using intuitive interface, everything is always no more than two
clicks away, robust filters and drill-downs are easy to find and use, operations are well
documented and labeled.
Unique Features
• Complimentary sharing ability (with certain limitations)
• Support for connection to 30+ data source types
• Mixing data sources
• Support for cubes
Advantages
• Intuitive and attractive user interface
• Seamless integration with big data platforms, from Hadoop to Google BigQuery
• Provides an extensive roaster of native data connections allowing easy integra-
tion with data from many resources
ChAPTer I DATA VISUALIZATION TOOLS 9
Disadvantages
• Initial data preparation is required (structured data)
• Although great for analytical purposes, Tableau and other BI tools cannot replace
financial reporting applications
• No concept of versioning with Tableau server
1.3.3 Qlikview
It is another data visualization tool which is a major player in the market and
Tableau’s biggest competitor. The key advantage of the tool is highly customizable and
wide range of features. In addition to its data visualization capabilities, Qlikview offers
powerful business intelligence analytics and enterprise reporting capabilities and clean
and clutter-free user interface.QlikView is rated as one of the most expensive platforms
in the BI field.
QlikView is a solution that focuses on the user as the receiver of data. It allows
users to explore and discover your data in a workflow similar to the way developers
work when processing data. To sustain flexibility in its approach to data exploration and
visualization, this software strives to maintain the association between data. This faci-
litates the discovery of your data by the end-user seeking a certain piece of data with
awareness of retrieval of any relevant items, in spite of any circumstances, even if the
origins of the items applicable to the search are incredibly disjoint.
QlikView is incredibly flexible. It allows setting and tweaking every little aspect of
each object and customizing the look and feel of any visualizations and dashboards.
With such great deal of flexibility, there also comes an incorporated ETL (Extract, Tran-
sform, Load) Engine that enables you to conduct the ordinary data cleansing opera-
tions. However, it may turn out to be costly.
Unique Features
• Uniqueness and flexibility
• Rich set of features for creating advanced dashboards
• Manipulate data associations automatically
• Allows faster queries and quicker data exploration by keeping data in-memory
10 Introduction to Data Visualization Tools
Advantages
• Attractive user interface
• Easy to set up filtering for any kind of visuals
• Fast rendering of both graphs and tables
• Ability to mail reports in the convenient form of PDF
Disadvantages
• Unintentional combining of some data aspects while filtering
• No ability to union bookmark results together
• Complications in using it as an enterprise tool
1.3.4 Power BI
Power BI is the software solution, developed and supported by Microsoft, for
business intelligence and analytics needs. At the core of Power BI is an online service
with various options for interaction, also featuring several outlets for connection to data
provided by thirdparty software and services.
Power BI provides a simple web-based interface with lotof useful features
varying from customizable visualization to certainly limited controls of data sources.
The desktop application expands the available functionality to an even larger extent
with the addition of tools for data cleansing and normalization.Another way to work
and make data-driven decisions on the go is through the mobile app, which is availa-
ble for multiple platforms. It is also amazingly simple to share insights by publishing
your work to Power BI service and forming lively dashboards from a combination of
reports which makes the data communication centralized and easy to follow up for all
the participants.Power BI is concise and minimalistic, yet powerful and robust. Howe-
ver, like any other software, it also has its ups and downs which must be carefully
considered.
First of all, as it is a Microsoft product, it follows a philosophy, principles, and
architecture similar to other major Microsoft products. It also exposes a familiar inter-
face for the Windows users.Power BI was created and designed with the aim to build
upon the functionalities of MS Excel, upgrade it to the next level, extend its operability
even further to unlock new use cases, cover more platforms, and reach out to the cloud.
As a Microsoft product, Power BI has connections to some other software from
the Microsoft’s toolbelt but goes much farther than that by utilizing a whole suite of novel
business analytics tools. Thus, Power BI is not just related to other products; it is tightly
integrated with the main Microsoft tools including MS Excel, Azure Cloud Service, and
SQL Server.
ChAPTer I DATA VISUALIZATION TOOLS 11
Unique Features
• Power BI has a free basic version, giving users a chance to explore it first
• It supports plenty of ways to incorporate or import your data (streaming data,
cloud services, excel spreadsheets, and third-party connections)
• It has interactive dashboards with real-time feed of data
• Simple API for integrating Power BI with your applications
• Different ways to share reports and dashboards
• Multiplatform support (Web, Desktop, Mobile)
Advantages
• More affordable than other tools and offers a free version
• Integration with other Microsoft products, Azure, Excel, SQL Server
• The built-in library of visuals is impressive and robust
• Ability to connect almost any kind of data source
Disadvantages
• Unnecessary complexity
• Performance issues when streaming and importing big data sets
• Lack of data preparation and cleaning tools
1.4 FEATURES
1.4.1 Common features of Data Visualization Tool
Data Visualization tool helps enterprises, organizations and companies to display
data in the structured and ordered format, which is not only easy to interpret but mea-
ningful and receptive tomaking decisions. It identifies patterns, limits noise and insignifi-
cant values from the data to produce actionable insights.So, in order to make most out
of the Data Visualization, companies need to select the right tool with variety of features
and capabilities. Some of the features that a data visualization tool should have are
discussed below. The following are a variety of features and capabilities that experts
recommend organizations consider when adopting visualization tools for big data:
dashboard should be able to present all the key information in a glance. A sample
dashboard prepared using Microsoft Power BI visualization tool is shown in figure 1.1.
Embeddability
The ability to seamlessly integrate the visual reports into any other applications
in use is important to really utilize the power of data visualization. For the teams to work
efficiently, collaborate better and share across different platforms, the data visualization
software must have the feature to allow using the various media like graphs and charts
into different mobile or web applications. The quality and richness of the visual reports
ChAPTer I DATA VISUALIZATION TOOLS 13
should not diminish when transporting it onto a different application. The reports should
still remain interactive and allow further investigation of the data.
Not all departments need to analyze all the high-level data your tool collects.
Most of them would only want one part of that data to seamlessly integrate with their
specific applications. They need immediate actionable insights that can help them incre-
ase the efficiency of their tasks and campaigns. A good data visualization tool must
allow for easy embeddability.
Performance
If visualization tools for big data distract workers from the flow of their work,
they’re less likely to be used. A few seconds delay may not be significant for some use
cases but may discourage users tasked with evaluating hundreds of decisions throu-
ghout the day. Features that help improve performance include prompts, data optimiza-
tion settings and dynamic loading options.
Another performance-related feature to consider is the tool’s ability to run com-
putations on GPUs. As data sets have grown, rendering large amounts of data with
traditional architectures has become harder. GPUs used with direct memory access
can help crunch large volumes of data faster and more efficiently. This makes it easier
to build high-definition visualizations on the server side that simply get served by the
application via a web application.
Interactive Reporting
The visual reports generated by a data visualization tool must be extremely inte-
ractive, allowing easy investigation into trends and insights. Interactive data visualization
helps identify trends and tell a story through data. This includes capabilities for filtering,
slicing and dicing, and drilling up and down at speeds that make it possible for users to
investigate huge volumes of data and get answers to their questions immediately.
Data analysts and decision makers need to be able to collate data from various
sources and combine datasets to produce insightful reports. The tool should allow for the
reports to be viewed in various different formats and different parts can be highlighted at
different times. Industry specific KPIs need to be customized to provide tailored insights.
To enable all of this, the business intelligence and data visualization tool needs to be
highly interactive.
Also the tool must provide the facility to share the reports to team members and
other stakeholders. The reports must be exportable to other applications.
Collaboration
Real-time collaboration capabilities in visualization tools for big data allow
employees to have more meaningful conversations about their discoveries. This
includes the ability for employees to collaborate in real time on current data, rather than
requiring them to send static files and screenshots to one another.
Qlikview- Qlikview is probably the strongest competitor to Tableau. It was in fact cho-
sen as the Gartner Magic Quadrant Leader 2019 and boasts clients like Conde Nast,
Subaru and Global retail Bank. Some of its best features include,
• Embedded Analytics
• Advanced Analytics Integration with third party engines like Python
• Customizable Dashboard
• Predictive Analysis
• Shared file management
Sisense - Sisense is more than a traditional analytics tool. It is scalable and can handle
all sorts of data. With high profile clients including NASA, NASDAQ, Samsung and
Comcast, Sisense is definitely one of the best. Top futures include,
• Customizable dashboard with sharing, drag and drop and built-in chart widgets
• In-memory columnar database can crunch terabytes of data on a single server
• Extremely fast implementation
• Advanced machine learning and AI
• Instant insights that update in real-time
• Interactive and automatic scheduled reporting
Domo - Domo is not just a data visualization tool but a complete business management
platform that handles your analytics and reporting from just one platform, with clients
ranging from eBay to National Geographic and Sage. Its features include,
• Hundreds of data connectors including Facebook, Salesforce etc.
• Workbench functionality imports on-premise data into Domo easily
• Easily cleans, combines and transforms data in multiple ways
• Easy data sharing with custom tools
• Mobile optimized with automatic alerts
• Automatic schedule reporting and customizable dashboard
Microsoft Power BI -Coming from Microsoft gives Power BI a familiarity that makes it
easy for new entrants to adopt and explore. To add to this ease of adoption, Power BI
offers a free basic version and is open-source. With clients like Adobe, HP and Toshiba,
it offers features like,
• Interactive dashboard with real-time data feed and easy sharing
• Customized reports that can be created from scratch
• Easy data capture and sharing with Datasets
• Explore data by asking questions in natural language
• Cloud based and easy to implement
16 Introduction to Data Visualization Tools
Klipfolio - With over 500 data sources it can connect to including Google Analytics,
Twitter and Moz, Klipfolio is a great choice indeed. Top features include,
• Widespread data sourcing
• Financial forecasting
• Customizable dashboard with built in templates
• Real-time accuracy
Plotly -One of the most colorful, yet elegant BI solutions out there, Plotly helps create
interactive graphs for easy comprehension. Some of its top features are,
• 2D and 3D charts with designer input and customizability
• Integration with analytics oriented languages like Python, R and Matlab
• User friendly with inbuilt APIs
Chartio - Chartio is a BI and data visualization tool for all businesses big and small.
Some features include,
• Real-time analytics with live changes
• Comparative analytics
• Easy set-up
• Multiple chart formats
Geckoboard - With over 80 pre-built services for real-time analysis, Geckoboard make
data visualization easy for anyone. Some of its best features are,
• Custom dashboards with pre-built widgets
• Rich integrations with APIs for Facebook, twitter, Salesforceetc
• Pull and push data integrations
• Customizable style sheets, schema and widgets
Datawrapper - Datawrapper’s simple, clear and easy to use interface has quickly made
it a top choice among non-technical clients like media organizations such as Fortune,
Mother Jones and The Times. Some of its best features are,
• Easy to use with no coding or design skills required
• Fast and interactive charts
• Styled to your branding
To summarize, choosing the right data visualization tool is a big decision not only
because they are fairly expensive, but also because they play a huge role in shaping the
business strategy. A tool that can present the most clear, interactive and accurate visual
reports can help business people to take better decisions, make better plans and track
ChAPTer I DATA VISUALIZATION TOOLS 17
the KPI’s better. So depending on what features matter most to the business, choose a
tool that will give just the representations that are in need.
REVIEW QUESTIONS
1. Write the benefits of Data Visualization tools.
2. What is Geo-Spatial Visualization?
ChAPTer I DATA VISUALIZATION TOOLS 19
also. Hence loading it directly into the data warehouse may damage it and rollback will
be much more difficult. Therefore, this is one of the most important steps of ETL process.
Transformation
The second step of the ETL process is transformation. In this step, a set of rules
or functions are applied on the extracted data to convert it into a single standard format.
It may involve following processes/tasks:
• Filtering – loading only certain attributes into the data warehouse.
• Cleaning – filling up the NULL values with some default values, mapping U.S.A,
United States and America into USA, etc.
• Joining – joining multiple attributes into one.
• Splitting – splitting a single attribute into multiple attributes.
• Sorting – sorting tuples on the basis of some attribute (generally key-attribute).
Loading
The third and final step of the ETL process is loading. In this step, the transfor-
med data is finally loaded into the data warehouse. Sometimes the data is updated by
loading into the data warehouse very frequently and sometimes it is done after longer
but regular intervals. The rate and period of loading solely depends on the requirements
and varies from system to system.
This example is a simple one, but anyone who has worked with a public data set
will understand these issues and the need to preprocess data to make it useful. Data
sets that have such obvious errors make the results of the processed data somewhat
questionable. The observations with errors result in incomplete data or invalid observa-
tions that can lead to incorrect results. Cleansing data is therefore a key step in the data
processing pipeline.
Data may also come from multiple sources. Although each source may be valid
in isolation, bringing the data together may require processing for consistency and uni-
formity. For example, one data set may have a different unit of measure for a given field
than another, requiring that they be normalized.One key factor for data validity, then, is
the format in which the data is represented.
Figure 2.4 Data set errors made visible through data profiling
In line 1, given that the real values represent physical measurements, a zero
value may indicate an issue with this observation. In line 3, you see that the range of
the measurement is obviously not in the same range as other measurements of this
field (and its type differs). Finally, in line 5, notice that the class name is misspelled. In
some cases, these issues can be detected automatically through profiling. We could
indicate that all measurements should be greater than 0 to catch the first issue. Through
26 Introduction to Data Visualization Tools
statistical analysis, we could identify the second outlier measurement. The final issue
could be identified by capturing the unique class names and through their frequencies
understand that this particular class name is an outlier (likely an occurrence of one). We
can validate time-series data in the context of flow to ensure that the data is processed
in the correct order given timestamps.
Figure 2.9 Stacked column bar chart for world population data
• By default, the dimension is “Year”, Data Studio will automatically select a metric
(eg. ‘population’, what’s displayed on the Y axis). Metric and dimension can be
changed. For instance, female%. Internet users% etc.
• In the style tab, how many bars and how many country details can be changed.
• To see the finished product, click “View” in the top corner. This transitions you
from Editor to Viewer mode. To edit the report Click “Edit.”
• To finish up, we need to give the report a name. Double-click the title (right now
it’s “Untitled Report”) to change it as ‘World Population Barchart’
• From the right side pane select data source. (for eg. [Sample] World Population
Data 2..)
• Click “add a chart” in the toolbar. Data Studio makes it easy to compare chart
types with some handy illustrations as shown below.
Figure 2.15 Donut chart for world population data with country as
dimension and population as metric
• By default, the dimension is “Year”, Data Studio will automatically select a metric
(eg. ‘population’, what’s displayed on the Y axis). Metric and dimension can be
changed. For instance, female%. Internet users % etc.
• In the style tab, how many bars and how many country details can be changed.
• To see the finished product, click “View” in the top corner. This transitions you
from Editor to Viewer mode. To edit the report Click “Edit.”
• To finish up, we need to give the report a name. Double-click the title (right now
it’s “Untitled Report”) to change it as ‘World Population pie chart’
Table 2.2 shows just the category dimension and quantity metric for table 2.1. It
has aggregated the quantities sold per category. Since there are only 3 categories in the
data set, the table shows just 3 rows.
Bird 28
Dog 27
Cat 12
Table 2.3 contains 6 rows, 1 for each item. The quantity sold metric is now aggre-
gated per item.
From the figure 2.17, cost per mile ishigher than average when less than about
1,700 miles or more than about 3,300 miles observations can be made.
• From the right side pane select data source. (for eg. [Sample] World Population
Data 2..)
• Click “add a chart” in the toolbar. Data Studio makes it easy to compare chart
types with some handy illustrations as shown below.
Figure 2.22 Bubble chart for world population data with country as
dimension and population as metric
• By default, the dimension is “Year”, Data Studio will automatically select a metric
(eg. ‘population’, what’s displayed on the Y axis). Metric and dimension can be
changed. For instance, female%, Internet users % etc.
• In the style tab, how many bars and how many country details can be changed.
• To see the finished product, click “View” in the top corner. This transitions you
from Editor to Viewer mode. To edit the report Click “Edit.”
• To finish up, we need to give the report a name. Double-click the title (right now
it’s “Untitled Report”) to change it as ‘World Population Scatter Chart”.
REVIEW QUESTIONS
1. Explain Transformation process in ETL.
2. Elaborate messy data
3. What is self –describing format?
4. Describe data blending.
5. Brief the data parsing method in data cleaning
6. Elucidate data profiling with an example.
7. List out the advanced options for Barchart in Google Data Studio.
8. Write the advantages of barchart.
9. Explain Data tables with an example.
10. Elaborate Scatter chat with an eample.
ChAPTer II CONTENTS 41
Step 5: In the worksheet, from Dimension under the Data pane, drag the Order
Date to the Column shelf. (On dragging the Order Date to the columns
shelf, a column for each year of Orders is created in the dataset. An ‘Abc’
indicator is visible under each column which implies that text or numerical
or text data can be dragged here. On the other hand, if the Sales pulled
here, a cross-tab would be created which would show the total Sales for
each year.)
Step 6: Similarly, from the Measures tab, drag the Sales field onto the Rows shelf.
(Tableau populates a chart with sales aggregated as a sum. Total aggregated
sales for each year by order date is displayed. Tableau always populates a
line chart as shown in fig 3.5 for a view that includes time-field which in this
example is Order Date.)
Fig 3.5 Line Chart for Aggregated Sales for Each Year
Step 7: In the chart above, the display is in years. To further drill down to quarter and
month levels, click on the plus icon on the order date in the Columns shelf.
This will generate the following output, which now displays the data broken
down to the month and quarter level as shown in fig 3.6.
ChAPTer III CONTENTS 47
Fig 3.6 Line Chart for Aggregated Sales for Each Month
and Quarter Level
The above chart is useful, but it is displayed in a discrete format. It will be more
beneficial if the data is displayed in continuous form.
Step 8: To convert the chart into a continuous format time series chart, the first step is
to roll up the YEAR (Order Date) back to year level, and then the second step
is to right-click on it and select the Year and Continuous options as shown in
fig 3.7.
Step 9: Drill down to quarter and Month level as in step 7 by changing the Columns shelf
from YEAR (Order Date) to MONTH (Order Date). This will generate a monthly
time series chart. From an analytics perspective, this chart shown in fig 3.8 is
more insightful as it allows us to see the sales fluctuations across months and
years. This is also useful for decomposing the seasonality and trend compo-
nents of the time series data.
Fig 3.8 Time-Series Chart For Sales Analysis for Each Month
Step 10: Change the Path Property by going into the Marks shelf and clicking on
the Path option. There are three options for the type of line graph for the view,
and selecting the second option will produce the chart as shown in fig 3.9.
The output is like the previous chart, but the trend shifts are more pronounced
now.
A time series chart with two variables, Sales and time can be further improved by
adding more variables to a chart. For instance, it could be useful to visualize sales by
segment across time. This can be done easily in two ways. First, simply drag the Seg-
ment field to the Color pane in the Marks shelf. The second method is to move the
category to the Rows shelf to show it separately as shown in fig 3.10.
Fig 3.10 Time Series Chart With Sales with Different Category
3.2 SCORECARDS
3.2.1 Introduction
Scorecards offer organizations a snapshot of their current performance when
compared to their goals. They are useful tools for organizations which need to manage
performance and make strategic decisions better based on the distance between cur-
rent performance and the goal. As such, scorecards present a more static view of an
organization at a point in time rather than a dynamic hub to monitor success.
Scorecards are most commonly used to track KPIs, as they focus on both the cur-
rent status of the metric being tracked and the target value. However, scorecards aren’t
live, so data is not updated in real-time. Instead, scorecards serve to monitor strategic
goals relative to KPIs and to make decisions on a larger scale.
These decisions can include tracking the progress of a set strategy, measuring
the efficiency of particular teams or departments towards meeting goals or even iden-
tifying problems and how they can be resolved. Scorecards are generally periodic mea-
sures, usually updated at set intervals such as weekly or monthly.
50 Introduction to Data Visualization Tools
For example, a scorecard can summarize total sales, average bounce rate, count
of ad impressions, maximum hold time, minimum failure rate, etc. Scorecards in Google
Data Studio appear as numbers, and, optionally, the name of the metric being summari-
zed. The format of the displayed number depends on how the metric is configured in the
data source. For example, the data source for a fictional pet store contains the following
metrics. The Qty Sold metric is simply a number coming from the data set. The Avg Qty
Sold metric is a duplicate of the Qty Sold field, with the Average aggregation type. Total
Items, and Unique Items are calculated fields as given in table below as well as Score-
cards for these metrics are shown in fig 3.11.
Step 8: Add formatting to an area chart (if required), using Format menu. Choose the
part of the view that is to be formatted, such as Font, Borders, or Filters.
Step 9: Add a highlight action using the highlight button in the toolbar if required.
REVIEW QUESTIONS
1. Create the Procedure to Create Time Series Chart in Tableau
2. Differentiate between Scorecard Vs Dashboard.
3. What are the Components of Bullet Chart?
4. Explain the advantages and disadvantages of Bullet Chart.
5. Discuss the uses of area chart.
6. Demonstrate the procedure to create bullet chart in Tableau
7. Compare and contrast Line chart Vs Area Chart.
8. Write the procedure to create an area chart in Tableau
9. Explain the uses of times series chart with suitable example.
10. How to Connect a Data Source in tableau? Explain.
11. Create a table with your own data set and draw the time-series chart for sales
analysis for each Month.
CHAPTER IV
A density heat map is used to analyze the areas in a plot where data points are
dense or scattered. Heat maps are specifically used where there is a huge data set with
overlapping data values. This helps the analyst to see the areas with greater density and
discover data trends. A sample heat map shown in fig 4.2 visualizes the visitor’s behaviour
on a web page and provides clue on where the most important content to be placed.
Step 5: Next Add Measure Sales to Rows Section and again select Average of the
field values. Observe the chart as shown in fig 4.5, an empty plot with two
axes appears on the canvas.
Step 6: Add Dimension Field State into Detail card present in Marks section. This will
add a group circle representing different states on the plot showing average
sales and average profit for each state as shown in fig 4.6.
Step 7: To convert this plot into a density heat map, select the Shape as Density.
This will change the shape of data points from circles to density spots. That
is, the color scheme of data points will follow a density gradient as shown
in fig 4.7.
ChAPTer IV CONTENTS 63
The regions with most data points or dense regions will be in red/orange
whereas, the areas with lesser or scattered data points appear in greenish-
blue shades. The color schemes can be modified for heat maps.
Step 8: Right-click on Color card and set the intensity, opacity and other border effects
for the heat map as given in fig 4.7. Select the color scheme from a long list of
available options.
Step 4: (Formatting) Depending on the type of map to create, assign certain data
types, data roles, and geographic roles to the fields (or columns). On the
Data Source page, click the data type icon (the globe) for Postal Code and
select String as shown in fig 4.11.
Step 10: On the Marks card, click the + icon on the Country field. The State field is
added to Detail on the Marks card and the map updates to include a data point
for every state in the data source as in fig 4.13.
68 Introduction to Data Visualization Tools
Step 3: Open a new worksheet. In the Data pane, under Measures, double-click Lati-
tude and Longitude. Latitude is added to the Rows shelf, and Longitude is
added to the Columns shelf. A map view with one data point is created.
Step 4: From Dimensions, drag ID to Detail on the Marks card. If a warning dialog
appears, click Add all members. A lower level of detail is added to the view as
shown in Fig 4.17.
Step 5: From Measures, drag Magnitude^10 to Size on the Marks card. The Magni-
tude^10 field is used to encode size, instead of the Magnitude field. This is
because Magnitude^10 contains a wider range of values, so the differences
between values can be seen visually. A proportional symbol map appears as in
fig 4.18. The larger data points represent earthquakes with larger magnitudes,
and the smaller data points represent earthquakes with smaller magnitudes.
Fig 4.18 Symbol map appears as small and larger data points as per the
proportion of the Earthquake Magnitude
Step 6: From Measures, drag Magnitude to Color on the Marks card.
Step 7: On the Marks card, click Color > Edit Colors.
Step 8: In the Edit Colors dialog box, do the following:
• Click the color drop-down and select the Orange-Blue Diverging palette from the list.
• Select Stepped Color, and then enter 8. This creates eight colors: four shades of
orange, and four shades of blue.
• Select Reversed. This reverses the palette so that orange represents a higher
magnitude than blue.
• Click Advanced, select Center, and then enter 7. This shifts the color palette and
ensures that any earthquake over 7.0 magnitudes will appear orange in color,
and any earthquake under 7.0 magnitudes will appear blue in color.
• Click OK.
Step 9: On the Marks card, click Color again, and then do the following: For Opacity,
enter 70%. Under Effects, click the Border drop-down menu and select a dark
blue border color. The map view updates with new colors. The dark orange
ChAPTer IV CONTENTS 73
data points represent earthquakes with higher magnitudes, while the dark
blue data points represent earthquakes with lower magnitudes. The opacity of
the marks is at 70% allows to see where the data points overlap as in fig 4.19.
Fig 4.19 Enhance the visualization by using boundary colors and opacity
Step 10: On the Marks card, right-click the ID field and select Sort.
Step 11: In the Sort dialog box, do the following: For Sort Order, select Descending.
For Sort By, select Field, and then click the drop-down and select Magnitude.
Click OK. This sorts the data points in the view so that the larger magnitudes
appear on top. Observe the completed proportional symbol map as in fig 4.20.
or Country, to the view defines the correct location for that data. If there is a hierarchy
in the data pane, Tableau will automatically use the appropriate levels of the hierarchy
to solve location ambiguities.
Summary
In Data Visualization Tools, map charts are very effective in visualizing the geo-
graphical data for providing real world scenario to the business people.
REVIEW QUESTIONS
1. Discuss about the Uses of heat map.
2. How to create tableau heat map?
3. Why we need map visualization?
4. List out the uses of Geo map.
5. When should you use a map to represent your data?
6. What is latitude and longitude on a map?
7. Tableau is revolutionizing data analysis and has truly made geographic analysis
accessible to everyone. Comment your views.
8. Elucidate the salient features of categorical color palette for geographical map.
9. Discuss about the different types of maps in geo map.
ChAPTer IV CONTENTS 79
5.1 DASHBOARD
5.1.1 Introduction
Today, the use of dashboards forms an important part of decision making. In
information technology, a dashboard is an easy to read, often single page, real-time
user interface, showing a graphical presentation of the current status (snapshot) and
historical trends of an organization’s or department’s key performance indicators to ena-
ble instantaneous and informed decisions to be made at a glance.
• Operational Dashboards
• Informational Dashboards
Strategic Dashboards
Strategic dashboards support managers at any level in an organization for deci-
sion making. They provide the snapshot of data, displaying the health and opportunities
of the business, focusing on the high level measures of performance and forecasts.
• Strategic dashboards require to have periodic and static snapshots of data (e.g. daily,
weekly, monthly, quarterly and annually). They need not be constantly changing from
one moment to the next and require an update at the specified intervals of time.
• They portray only the high level data not necessarily giving the details.
• They can be interactive to facilitate comparisons and different views in case of
large data sets at the click of a button. But, it is not necessary to provide more
interactive features in these dashboards.
A screenshot given in fig 5.2 shows an example of an executive dashboard which
displaying goals and progress.
Analytical Dashboards
Analytical dashboards include more context, comparisons and history. They
focus on the various facets of data required for analysis. Analytical dashboards typically
support interactions with the data, such as drilling down into the underlying details and
hence should be interactive. Examples of analytical dashboards include Finance Mana-
gement dashboard and Sales Management dashboard as shown in fig 5.3.
84 Introduction to Data Visualization Tools
Operational Dashboards
Operational dashboards are for constant monitoring of operations. They are often
designed differently from strategic or analytical dashboards and focus on monitoring
of activities and events that are constantly changing and might require attention and
response at a moment’s notice. Thus, operational dashboards require live and up to
date data available at all times and hence should be dynamic. An example of an ope-
ration dashboard could be a support-system dashboard, displaying live data on service
tickets that require an immediate action from the supervisor on high-priority tickets as
given in fig 5.4.
Informational Dashboards
Informational dashboards are just for displaying figures, facts and/or statistics.
They can be either static or dynamic with live data but not interactive. For example,
flights arrival/departure information dashboard in an airport as in fig 5.5.
Business Dashboard
In a way, virtually any dashboard used by a business falls into this category.
The term “business dashboard” specifically refers to reporting tools that fulfill these
purposes:
• Tracking important business metrics
• Monitoring business intelligence initiatives
• Reporting data to stakeholders
An effective business dashboard should focus on top-level data related to the
overall success of the business. In most cases, every metric in the dashboard should
86 Introduction to Data Visualization Tools
support the business’ most important metric: the bottom line. The goal of a business
dashboard is to not only communicate data about the business’ success, though;
it facilitates understanding ad alignment between departments, holds each team
accountable for their goals and progress, and helps users identify areas that need
immediate action.
Executive Dashboard
An executive dashboard gathers and holds information that top-level stakehol-
ders need to run a company, business, or organization. Executive dashboards function
much like business dashboards, except the information in them should cater specifically
to the needs and expectations of executives. Executives only have so much bandwidth
to gather and understand information, which means they need access to the information
they need, when they need it. Some key benefits include:
• Performance Management: An overview of how departments are meeting their
goals
• Scorecards: Insight regarding specific employee performance and goals
• Visibility: Access to high-level goals and metrics related to the overall success
of the organization
• Time Management: Cohesive reports and drilldowns in one place, accessible on
any device
KPI Dashboard
KPIs (Key Performance Indicators) are the heart and soul of the organization’s
performance. They are the stepping stones that will guide your business to long-term
success, so tracking and comparing them in one place is vital. KPIs should be measu-
rable, tangible metrics that let each employee, team, and department understand how
their performance influences the success of the organization – and the KPI dashboard
is where these metrics are stored. A successful KPI dashboard should:
• Set tangible goals and targets for each department
• Facilitate accountability within each department
• Provide real-time updates on goals and progress
Project Dashboard
Much like KPI dashboards, project dashboards track tangible goals; however, the
goal of a project dashboard is not about hitting a sales quota or increasing marketing
revenue by a certain margin. Instead, project dashboards track specific metrics related
to the progress and complete of a project. This means, project dashboards involve more
scheduling metrics as,
ChAPTer V CONTENTS 87
Step 4: To replace a sheet, select it in the dashboard at right. In the Sheets list at
left, hover over the replacement sheet, and click the Swap Sheets button.
Step 5: Adding objects and set their options: The user can add dashboard objects
that add visual appeal and interactivity. The various objects are,
• Horizontal and Vertical objects provide layout containers that allows to
group related objects together and fine-tunes when the dashboard resizes
during user interaction.
• Web Page objects display target pages in the context of the dashboard.
• Blank object helps to adjust spacing between dashboard items.
• Text objects serve can provide headers, explanations, and other information.
• Button objects helps to navigate from one dashboard to another, or to other
sheets or stories. For the button style, choose an image or text to indicate
the button’s destination to the users.
• Image objects add to the visual flavor of a dashboard, and can be linked to
specific target URLs.
• Extension objects helps to add unique features to dashboards or integrate
them with applications outside Tableau.
To add an object in dashboard, Select an item under Objects on the left, and drag
it to the dashboard sheet on the right as given in fig 5.8.
ChAPTer V CONTENTS 89
• From the Navigate to menu, choose a sheet outside the current dashboard.
• Choose image or text for Button Style, specify the image or text you want to
appear, and then set related formatting options.
• For Tooltip text, add explanatory text that appears when viewers hover over the
button. This text is optional and typically best used with image buttons.
To add interactivity in the dashboard,
• In the upper corner of sheet, enable the Use as Filter option to use selected
marks in the sheet as filters for other sheets in the dashboard as shown in fig
5.11.
Actions often have unique behavior when the source or destination is a dashbo-
ard. Because a dashboard can contain multiple views, a single filter or highlight action
can have broad impact. Dashboards can also contain web page objects, which you can
target with interactive URL actions. Use a single view to filter other views in a dashbo-
ard. Imagine the created dashboard that contains three views about profitability: a map,
a bar chart, and a table of customer names. A filter action can be used to make one
of the views in the dashboard, such as the map, the “master.” When the users select
a region in the map, the data in the other views is filtered so that it relates to just that
region.
• On the dashboard, select the view that the user wants to use as a filter.
• On the view’s shortcut menu, choose Use as Filter. Perform the same action by
clicking the Use as Filter icon as shown in fig 5.12.
ChAPTer V CONTENTS 91
range: The dashboard scales between minimum and maximum sizes that you specify.
If the window used to display the dashboard is smaller than the minimum size, scroll
bars are displayed. If it’s larger than the maximum size, white space is displayed. Use
this setting when we are designing for two different display sizes that need the same
content and have similar shapes such as small- and medium-sized browser windows.
Range also works well for mobile dashboards with vertical layouts, where the width may
change to account for different mobile device widths, but the height is fixed to allow for
vertical scrolling.
Automatic: The dashboard automatically resizes to fill the window used to display it.
Use this setting for Tableau to take care of any resizing. For best results, use a tiled
dashboard layout. In Tableau Desktop, we can create dashboard layouts for different
device types to create unique layouts optimized for desktop computers, tablets, and
phones. In addition to adapting to different screen sizes, each device layout can contain
different items.