Tableau Training
Tableau Training
Tableau Desktop: Tableau Desktop has a rich feature set and allows you to code
and customize reports. Right from creating the charts, reports, to blending them all
together to form a dashboard, all the necessary work is created in Tableau Desktop.
Tableau Public: It is Tableau version specially build for the cost-effective users. By
the word "Public," it means that the workbooks created cannot be saved locally, in
turn, it should be saved to the Tableau's public cloud which can be viewed and
accessed by anyone.
Tableau Online: As the name suggests, it is an online sharing tool of Tableau. Its
functionalities are similar to Tableau Server
Tableau Reader: Tableau Reader is a free tool which allows you to view the
workbooks and visualizations created using Tableau Desktop or Tableau Public
Connecting to Data Sources
Tableau can connect to various types of data sources. It can connect to text files,
excel files, PDF files, etc. It can also connect to various databases. Tableau has the
capability to connect to servers and web connectors. There are two types of
connections –
Live: The workbook is connected live to the data source . Any update in the data
source will update the workbook with the data automatically.
Data Extract: Data is extracted and saved as a ”.tde” file. The workbook runs on the
“.tde” file. If there is any update in the data, we need to update the “.tde” file and then
run the workbook.
Data Relationship
Joins: It can join up to 32 tables in a data source. The joining condition can be given
based on the primary key. One or more joining conditions can be specified to join the
tables.
Inner Join Left Join Right Join Outer Join
It joins all the A left join is used to A right join is used An Outer join is
common records join all the records to join all the used to join all the
between two tables from a left table and records from a right records from both
common records table and common left and right table.
from the right table. records from left
table.
Union: Union is used to append the data from two or more tables. In most of the
cases, tables with same headers are appending together using union function.
Joining conditions are not required while union two or more tables.
Join Union
Sort data:
Data in a visualization can be sorted by data source order, alphabetically, with
respect to any dimension or measure or manually as well.
Types of Filters
The filters can be applied in a worksheet to restrict the number of records present in a
dataset –
Extract Filters: Extract filters are used to filter the extracted data from data source.
This filter is utilized only if the user extracts the data from data source.
Data Source Filter: A data source filter is used to filter the data in data source level.
It can restrict the records present in the data set. Data source filter works on both live
and extracts connection.
Context Filter: A Context filter is an independent filter that can create a separate
dataset out of the original data set and compute the selections made in the
worksheet.
Other filters that can be applied in the sheet will be dependent on the context filter.
Priority of filters –
Extract Filters > Data Source filters > Context filter > Fixed LOD filter > Dimension
filter > Include & Exclude LOD filter > Measure filter > Table calculation filter
Filter conditions
Set is a custom filed used to hold the subset of data based on some condition. A set
can be created by selecting members from the list or By writing custom Condition or
Selecting Top or Bottom few records based on Measure value so on. It acts as a
separated field or dimension
The Combined Sets in Tableau are handy to compare two existing sets for further
analysis. Comparing is similar to joins. Two sets can be combined only if they are
created on the same dimension or measure.
Generated Fields
Tableau generates some fields which can be visible in the data pane-
Number of Records:
Number of records shows the count of records present in the data set.
Measure Names:
A measure name consists of all the names of the measure present in a data set.
Measure Values:
All the measure values present in a data set are kept together in the field called
measure values.
Measure names and Measure values can be used to see the aggregation of all
measure present in a data set.
File Types
Data Blending is a very powerful feature in Tableau. It is used when there is related
data in multiple data sources, which you want to analyze together in a single view. It
is used to join two data sources having related data with a different level of
granularity. Primary dataset left joins on the secondary dataset.
Sales for Sub categories for Consumer segment versus total sales for sub categories
Calculated Field
Calculated fields can be used to create new dimensions, or new measures. They
can also be used with any data type, a multitude of functions and aggregations, as
well as logical operators, making the calculated results virtually limitless.
Table Calculations
These are the calculations which are applied to the values in the entire table on a
single measure. For example, calculating a running total of sales.
FIXED LOD: This expression computes values using the specified dimensions without
reference to any other dimensions in the view.
{ FIXED [State] : sum(Sales) }
INCLUDE LOD: This level of detail expressions compute values using the
specified dimensions in addition to whatever dimensions are in the view.
AVG{ INCLUDE [City] : sum(Profit) }
Tableau Parameters plays a significant role in creating Dynamic reports. They can be
used on filters, sets, calculate fields.
Bins
Tableau Bins are useful to create a Range of data. They can be created both for
dimensions and measures. Used for histograms
Clusters
Clustering in Tableau is dividing a data set into segments or clusters having relevant
data values. It helps us conduct a comparative analysis of data in Tableau. In
Tableau, K-means clustering algorithm is used. Clusters created by tableau can be
converted into a dimension so that it can be used in other visualizations
Pages
The Pages shelf lets you break a view into a series of pages so you can better
analyze how a specific field affects the rest of the data in a view.
Common Charts
Bar Chart
Cross Tab
Pie Chart
Scatter plot
Area Chart
Bubble Chart
Map
Tree map
Bullet Chart
Box plot
Advanced Charts
Dual Axis Chart
• Create a bin with any of the measure and edit the bin size
• Pull the created bin field as a column and pull count of any related field as a row
• Consider one attribute and one metric. Select the metric twice.
• Choose bar graph for the first metric and circle chart for duplicate metric
• Select dual axis
Bar In Bar Chart
I have compared the total sales and consumer segment sales monthly
Pareto Chart
Here we want to see what percentage of customers give 80% of the sales.
• Consider count distinct of Customer Name in columns and sales in rows.
• Create a table calculation on sales to get the running total and a secondary
calculation to get the percentage of running total. Compute using Customer Name
• Create a table calculation on “distinct count of customer” to get the running total
and a secondary calculation to get the percentage of running total. Compute using
Customer Name.
• Convert graph into dual axis. Consider “Reference line” from analytics pane and
select constant value of “0.8” for “% of total running sales”.
Dumbbell Chart
Improve Performance
Tableau dashboards can get slow when you have multiple data sources and huge
amount of data. Some steps to optimize dashboards -
• Use extract over live connection
• Filter the data at the data source level itself
• Hide unused fields
• Reduce filter usage, use context filters
• Remove unnecessary calculations and sheets
Trend Lines and Forecasting
Trend lines are used to predict the continuation of a certain trend of a variable. It also
helps to identify the correlation between two variables by observing the trend in both
of them simultaneously. Tableau provides Linear, Logarithmic, Exponential, and
Polynomial models.
Forecasting is about predicting the future value of a measure. Tableau uses the
model known as exponential smoothing.
We can show reference lines and bands also in Tableau both constant and dynamic.