Module 2 Notes
Module 2 Notes
Module 2
TABLEAU
Contents
Chapter 1:
• Storytelling with Data: Creating a good data set for
analysis.
• Selecting data for your KPIs
• Approaches to storytelling with data
• Dashboards vs. Storyboards vs. Infographics
• The Duell Rules for Actionable Visualizations.
Chapter 2:
• Tableau: What is Tableau? History of Tableau
• Tableau Public and Tableau Desktop
• Workspace
• Connecting to data source, Files and folder.
• Tableau navigation
• Terminologies
• Data types
• Data aggregation
• File types
1
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
Chapter 1:
Storytelling with Data: Creating a good data set for analysis
a) A good data set is one that suits the purpose. A good dataset should meet the
following conditions:
• Contain the elements you need
• Are disaggregated data
• Have at least a couple dimensions and a couple measures
• Have good metadata or a data dictionary
• Are useable (not too messy, or too cumbersome)
• If the data is too aggregated, there isn't much you can do for analysis.
• Due to privacy or practicality, some data sets will never be fully granular.
• For example, it is unlikely to find a data set with case-by-case reporting of malaria by
address, so monthly totals by region might be granular enough.
2
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
3
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
A KPI dashboard provides management a platform for monitoring and analysis. It allows
companies to track the performance of individuals, departments, teams, or the entire
organization.
Dashboards enable management to see trends quickly and can be alerted to KPIs that have
exceeded set thresholds.
KPIs come in three types: strategic, operational, and individual.
(i) Strategic:
• KPIs used at the strategic level would typically reflect the business's health and track
the performance of future business goals, such as growth.
• These KPIs aren't generally measured on a daily basis as they aim to provide an accurate
representation of progress over time.
(ii) Operational:
• Operational KPIs are typically connected to strategic KPIs; however, they focus on core
activities or departments like product development, marketing, or internal projects.
4
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
• Individual KPIs are used to measure how an employee is applying their skills to a task.
• They are also used to provide managers a tool for appraisal, including how
remuneration in departments such as sales can be managed.
Strategic dashboards are predominantly used to view the business in its entirety. They contain
high-level financial and process-based KPIs. These dashboards are a great way to see the
company’s current state and growth. Strategic decisions are made using these dashboards,
highlighting opportunities and weaknesses to key individuals.
Operational dashboards provide real-time data to managers and employees on a regular basis
- vital for making insightful decisions on day-to-day operations. They interpret large amounts
of information from many sources into a single understandable visual. Industries such as
Construction and Manufacturing, and departments such as Sales and Marketing make these
dashboards an indispensable part of their business process.
Analytical dashboards allow for further investigation of any unexpected anomalies in the
data. Sometimes referred to as KPI Reports, they are a tool to investigate any fluctuations to
determine the root cause. They likewise offer a way to compare performance and trends using
historical data.
5
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
Data analysis focuses on creating valuable insights from data to give further context and
understanding to an intended audience. With the rise of digital business and data-driven
decision-management, data storytelling has become a skill often associated with data science
and business analytics.
The idea is to connect the dots between sophisticated data analyses and decision-makers who
might not have the skills to interpret the data. Data storytelling can also be used to convey
interesting usage metrics for customers.
Data storytelling is described as a set of features within visualization tools that enable a more
interactive experience with the data.
With only charts, dashboards and data visualization tools, decision-makers in an organization
might not understand what a specific amount of data means. So instead of looking at it purely
from a data-driven perspective, data storytelling wraps that data in a narrative that's more
understandable.
Data storytelling uses data collected from charts, dashboards and data visualization tools to tell
a story.
For example, if there's an issue with a product, to convey why it should be fixed, data
storytelling should take the collected data and tell a story about how end users will encounter
this issue and how it will affect them.
a) The three components of data storytelling
Data storytelling comprises data, narrative and visualizations.
(i) Data: The data serves as the base of a data story. It is the information from accurate data
gathering and analysis. Data can be gathered from places as charts and dashboards using data
analysis tools.
(ii) Narrative: The narrative is a verbal or written storyline that is used to effectively
communicate insights from the data. The narrative should be within the context of the data and
aim to show a clear reasoning for following actions or decisions. Narratives should be based
on data and present a clear explanation of what the data means and its importance.
(iii)Visualization: Visualizations act as further representations of both the data and narrative
and are used to communicate the story more clearly. Visualizations include graphs, charts,
diagrams and photos.
6
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
A good data story must use data, a narrative and visualizations to be effective.
To make the narrative, a data story must also include the following:
(i) A setting: The setting should be based on the data. If, for example, the data is about internal
systems, then the setting would be inside an organization with the same internal setup.
(ii) Characters: The characters could include customers, the organization, stakeholders or other
key players the data surrounds.
(iii) A conflict: The conflict is any issue and the effects of that issue that the data might present.
The conflict will affect the characters or setting.
(iv) Resolution: The resolution is a proposed solution to any apparent issues or anything that
might help inform the decision-making processes.
7
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
The interactivity of the storyboard is reflected through capabilities for the end user to:
➢ Sort
8
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
c) An Infographic
An infographic is a visual representation of a large collection of information. It tells a more
comprehensive story than a single data visualization. An infographic can contain one or more
data visualizations.
These are often accompanied by short blurbs, quotes, or other pieces of text that elaborate on
the topic. While the infographic will usually offer several pieces of quantifiable and statistical
data, it doesn’t always carry the viewer through to a single conclusion.
Infographics often broadly educate viewers on a topic so they can learn more about it or come
to their own conclusion on the topic. Infographics are comprehensive enough to stand on their
own. Though they’re often accompanied by a short introduction, it’s not always necessary. The
purpose of an infographic is to tell a complete story, so you can share infographics nearly
anywhere.
They’re commonly used in:
Case studies
Brochures
Flyers
Blog posts
Website content
Social media
Posters
9
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
Chapter 2
Tableau
Tableau is a data visualization tool or business intelligence (BI) tool which analyzes raw data
and shows data in a chart or report quickly.
Tableau helps people and organizations be more data-driven.
It is very easy to use, because it does not require any programming skill.
Data analysis is very fast with Tableau, and the visualizations created are in the form of
worksheets and dashboards.
• Data Blending: Data blending is the most important feature in Tableau. It is used when
we combine related data from multiple data sources, which we want to analyze together
in a single view, and represent in the form of a graph.
Example: Assume, there is Sales data in relational database and Sales Target data in an
Excel sheet. Now, we have to compare actual sales with target sales, and blend the data
based on common dimensions to get access. The two sources which are involved in
data blending referred to as primary data and secondary data sources. A left join will
be created between the primary data source and the secondary data source with all the
10
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
data rows from primary and matching data rows from secondary data source to blend
the data.
⚫ Real-time analysis: Real-Time Analysis makes users able to quickly understand and
analyze dynamic data, when the velocity is high, and real-time analysis of data is
complicated. Tableau can help extract valuable information from fast moving data with
interactive analytics.
⚫ Collaboration of data: Data analysis is not isolating task. Team members can share data,
make follow up queries, and forward visualizations to others who could gain value from
the data. Tableau is built for collaboration.
⚫ Visual Discovery − The user explores and analyzes the data by using visual tools like
colors, trend lines, charts, and graphs. There is very little script to be written because
most are done by drag and drop.
⚫ Architecture Agnostic − Tableau works in all kinds of devices where data flows. Hence,
the user need not worry about specific hardware or software requirements to use
Tableau.
⚫ Connectivity with Live and In-Memory Data - Tableau offers in-memory data
connection to both live and external data sources. This allows the user to freely combine
data from several types of data sources. By creating live data connections, you may
consume data straight from the data source or maintain data in memory by extracting
data from a data source as needed
11
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
⚫ Tableau Desktop has a rich feature set and allows us to code and customize reports.
Right from creating the reports, charts to blending them all to form a dashboard, all the
necessary work is created in Tableau Desktop.
⚫ The dashboards and the workbooks created here can be either shared locally or publicly.
⚫ Based on the connectivity to the publishing option and data sources, Tableau Desktop
is also classified into two parts-
⚫ (i) Tableau Desktop Personal: - The personal version of the Tableau desktop keeps the
workbook private, and the access is limited. The workbooks can't be published online.
So, it should be distributed either offline or in Tableau public.
12
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
b) Tableau Public
⚫ This Tableau version is specially built for cost-effective users. The word 'Public' means
that the created workbooks cannot be saved locally. They should be kept on the
Tableau's public cloud, which can be accessed and viewed by anyone.
⚫ There is no privacy of the files saved on the cloud, so anyone can access and download
the same data. This version is the best for them who want to share their data with the
general public and for the individuals who want to learn Tableau.
c) Tableau Online
⚫ Its functionality is similar to the tableau server, but data is stored on the servers that
hosted on the cloud, which is maintained by the Tableau group.
⚫ There is no storage limit on the data which is published in the Tableau Online. Tableau
Online creates a direct link over 40 data sources who are hosted in the cloud such as
the Hive, MySQL, Spark SQL, Amazon Aurora, and many more. To be published, both
Tableau Server and Tableau online require the workbooks that are created by Tableau
Desktop
d) Tableau Server
⚫ The software is used to share the workbooks, visualizations, which is created in the
Tableau Desktop application over the organization.
⚫ To share dashboards in the Tableau Server, you should first publish your workbook in
the Tableau Desktop. Once the workbook has been uploaded to the server, it will be
accessible only to the authorized users.
⚫ The admin of the organization has full control over the server. The organization
maintains the hardware and the software.
e) Tableau Reader
⚫ Tableau Reader is a free tool which allows to view the visualizations and workbooks,
which is created using Tableau Desktop or Tableau Public. The data can be filtered,
but modifications and editing are restricted.
⚫ There is no security in Tableau Reader as anyone can view workbook using Tableau
Reader.
⚫ To share the dashboards which are created, the receiver should have Tableau Reader
to view the document.
13
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
Go to the home page and select the global superstore sales-Excel sheet
Open a connection to a saved data source, you also should have an open blank worksheet.
14
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
15
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
Menu Bar: It consists of menu options like File, Data, Worksheet, Dashboard, Story, Analysis,
Map, Format, Server, Window, and Help. The options in the menu bar, including features like
data source connection, file saving, design, table calculation options, and file export features
for creating a dashboard, worksheet, and storyboard.
a) File Menu: For any Windows program the file menu contains New, Open, Close, Save,
Save As, and Print, functions. The most frequently used feature found in this menu is the Print
to pdf option. This allows to export dashboard or worksheet in pdf form. To change the default
file-save location, use the repository location option. A packaged workbook can be created
from the export packaged workbook option in a fast manner.
b) Data Menu: Used to highlight and copy the data from any website, then use the Paste Data
option to input it into Tableau. Once pasted, then Tableau will copy the data from the Windows
clipboard and add a data source in the data window. The Edit Relationships menu option is
used in data blending. This menu option is needed if the field names are not identical in two
different data sources. It allows you to define the related fields correctly.
c) Worksheet Menu: The Export option allows to export the worksheet as an Excel crosstab,
an image, or in Access database file format. The Duplicate as Crosstab option creates a crosstab
version of the worksheet and places it in a new worksheet.
d) Dashboard Menu: The Action Menu is a useful feature that is reachable from both the
Worksheet Menu and the Dashboard Menu.
e) Analysis Menu: In this menu, one can access the stack marks and aggregate measures
options. These switches allow you to adjust default Tableau behaviours that are useful if it is
required to build non-standard chart types. The Create Edit Calculated Field and Calculated
Field options are used to make measures and new dimensions that don't exist in your data
source.
f) Map Menu: The Map Menu bar is used to alter the base map colour schemes. The other
menu bar are related in the way of replacing Tableau's standard maps with other map sources.
One can also import the geocoding for the custom locations using the geocoding menu.
g) Format Menu: This menu is not used very commonly because pointing at anything, and
right-clicking gets you to a context-specific formatting menu more quickly. If you don't like
the default workbook theme, use the Workbook Theme menu to select one of the other two
options.
⚫ Toolbar Icon: Toolbar icon below the menu bar can be used to edit the workbook using
different features like redo, undo, new data source, save, slideshow, and so on.
⚫ Dimension Shelf: The dimension presents in the data source for example- customer
(customer name, segment), order (order date, order id, ship date, and ship mode), and
location (country, state, and city) - these all type of data source can be viewed in the
dimension shelf.
⚫ Measure Shelf: The measures present in the data source, for example- Discount, Profit,
Profit ratio, Quantity, and Sales- These all types of data source can be viewed in the
measure shelf.
16
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
⚫ Sets and Parameters Shelf: A parameter is a workbook variable such as a number, date,
or string that can replace a constant value in a calculation, filter, or reference line. Sets
are custom fields that define a subset of data based on some conditions.
⚫ Page Shelf: Page shelf is used to view the visualization in video format by keeping the
related filter on the page shelf.
⚫ Filter Shelf: Filter Shelf is used to filter the graphical view by the help of the measures
and dimensions.
⚫ Marks Cards: Marks card is used to design the visualization. The data components of
the visualization like size, color, path, shape, label, and tooltip can be modified in the
marks card.
⚫ Worksheet: The worksheet is the space where the actual visualization, design, and
functionalities are viewed in the workbook.
⚫ Tableau Repository: Tableau repository is used to store all the files related to the
Tableau desktop. It includes various folders like Connectors, Bookmarks, Data sources,
Logs, Extensions, Map sources, Shapes, Services, Tab Online Sync Client, and
Workbooks.
⚫ Data Source: The existing data source can be modified, new data source can be created
or added using the 'Data source' tab, which is present at the bottom of the Tableau
desktop window.
17
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
⚫ Current Sheet: Current Sheet is a sheet of workbook in which we are currently working.
All the dashboards, worksheets, and storyboard present in the workbook, are available
in this tab.
⚫ New Sheet: The new sheet icon present in the tab is used to create a new worksheet in
the Tableau workbook.
⚫ First Sheet: This first sheet icon present in the tab at the bottom of the right-hand side
of Tableau desktop window is used for visiting the first sheet directly.
⚫ Previous Sheet: The previous sheet icon is used to return back to the last worksheet
from the new sheet.
⚫ Next Sheet: The next sheet icon is used to jump to the next worksheet of Tableau
desktop.
⚫ Last Sheet: The last sheet icon is used to visit the final sheet of tableau workbook.
⚫ Show Sheet Sorter: One can view all the created worksheet in tableau desktop by
clicking on the show sheet sorter icon.
18
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
⚫ Show Filmstrip: All the tabs are shown here with their icons by clicking on the show
filmstrip.
⚫ Show Tabs: This tab concludes all tabs such as worksheets, data sources, dashboards,
and storyboard.
19
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
⚫ Workbook: A workbook is a file with .twb extension that holds one or more worksheets
as well as dashboards and stories.
⚫ Worksheet: It's a place where you build views of your data by dragging various fields
onto the shelves.
⚫ Calculated field: Calculated field is a new field that the user creates by using a formula
to modify the existing fields in the data source.
⚫ Data Pane: The data pane on the left side of the workbook displays the fields of the data
sources to which Tableau is connected. The fields are further divided into measures and
dimensions. The data pane also reflects custom fields such as groups, binned fields,
calculations, and many more.
⚫ Data Source Page: Data Source is a page where the data source can be set up. This data
source page generally consists of four main areas - join area, left pane, a preview area,
and metadata area.
⚫ Filters Shelf: Filter shelf is located on the left side of the workbook. Filters shelf is used
to exclude the data from a view by filtering it using both dimensions and measures.
⚫ Marks Card: Marks card is on the left side of the worksheet. The user can drag fields to
the control mark properties such as color, type, shape, size, label, detail, and tooltip.
⚫ Pages Shelf: Page shelf is on the left side of the view. With the help of the page shelf,
one can split a view into a sequence of pages based on the values and members in a
continuous or discrete field.
⚫ Rows shelf: Row shelf is on the top of the workbook. It is used to create the rows of a
data table. The Row shelf provides any numbers of measures and dimensions. When a
dimension is placed on the Rows shelf, then Tableau creates headers for the members
of that dimension. And when a measure is placed on the Rows shelf, Tableau creates
quantitative axes for that particular measure.
⚫ Crosstab: Crosstab is used for a text table view. It uses various text tables to display the
numbers associated with dimension members.
⚫ Format Pane: The Format pane is on the left side of the workbook, and it contains
various formatting settings. It controls the entire view of the worksheet, as well as the
individual fields in the view.
20
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
⚫ Shelves: The shelves are named areas that are located on the top and left of the view.
One can build views by placing fields onto the shelves. Some shelves are only available
when a particular mark type is selected.
2.12. Data Types in Tableau
⚫ If the data source appoints the data type, Tableau will use that data type. If the data
source doesn't individually assign a data type, Tableau will assign one.
⚫ The data type reflects the kind of information stored in that field.
⚫ The data type of a field is identified in the Data pane by one of the icons shown below:
One can change the data type for a field either in the Data pane or in the View.
a) Change the data type for a field in the Data pane:
To change the data type of a field in the Data pane, click the icon to the left of the field name,
and then choose a new data type from the drop-down list.
21
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
To change a field's data type in a view, click the field in the Data pane, choose Change Data
Type, and then select the appropriate data type from the drop-down list
22
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
⚫ The type of aggregation applied varies depending on the context of the view.
a) Change the Aggregation of a Measure in the View:
⮚ When you add a measure to the view, Tableau automatically aggregates its values.
⮚ Tableau provides a set of predefined aggregations that are shown in the table below.
23
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
24
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
The current aggregation appears as part of the measure's name in the view.
For example, Sales becomes SUM(Sales). Every measure has a default aggregation which is
set by Tableau when you connect to a data source.
One can view or change the default aggregation for a measure.
One can change the aggregation for a measure in the view from its context menu:
25
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
b) Aggregating Dimensions:
You can aggregate a dimension in the view as Minimum, Maximum, Count, or Count
(Distinct).
When you aggregate a dimension, you create a new temporary measure column, so the
dimension actually takes on the characteristics of a measure.
26
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
27
AI&DS Dept. Data Visualization Using Tableau (22ADS61) 2024-25
These files are saved in the associated folders in the My Tableau Repository directory,
which is created in My Documents folder by default when Tableau is installed.
Also, the work files can be saved in other locations, such as desktop etc.
***
28