Tableau Classes

The document outlines the importance of learning Tableau, a leading data visualization tool, and describes its various product lines and functionalities. It explains key concepts such as dimensions, measures, data connections, and visualization techniques, including worksheets, dashboards, and calculated fields. Additionally, it covers data blending, joins, and different types of charts used for effective data representation.

Reasons to learn Tableau:

• It is one of the most in-demand visualization tools worldwide.
• It is easy to learn.
• It is said to be about 30% faster than other visualization tools on the market, and it gives a rich, smooth experience.
• Tableau skills apply to the Business Intelligence Developer, Data Analyst and Data Scientist roles.

Tableau: Tableau is a data visualization tool that comes under business intelligence.

Product lines:

1. Tableau Desktop: develop the reports
2. Tableau Online/Cloud: manage the published reports (hosted by Tableau)
3. Tableau Server: manage the published reports (self-hosted)
4. Tableau Mobile: access server reports from a mobile device
5. Tableau Reader: open packaged workbooks on a laptop or desktop
6. Tableau Public: www.TableauPublic.com
Tableau workspace:
The place where we hold the data and develop the visualizations is known as the Tableau workspace.

Two parts:
1. Data window (or Data pane): where the fields of the source data are listed. The data window consists of dimensions and measures.
2. Visualization area: where we develop the reports.
The visualization area consists of:

• Columns shelf
• Rows shelf
• Pages shelf
• Filters shelf
• Marks card
Dimension:
• Any textual data that describes a product or the business is known as a dimension.
• It consists of textual data plus primary keys (which do not allow duplicates).
• It behaves as discrete and independent (there is no dependency between one field and another).
• Examples: textual data (customer name, vendor name, segment),
geographical data (country, state, city, etc.),
date fields (year, quarter, month, week, day),
primary keys (customer ID, order ID, vendor ID).
Measures:
• A measure is a number used to analyse the business.
• It consists of numerical data plus foreign keys (which allow duplicates).
• It behaves as continuous and dependent (its value depends on the dimensions it is analysed by).
E.g.: sales, profit, salary, discount, etc.
To build a meaningful visualization we need both dimensions and measures: without a dimension a measure has no context, and without a measure there is no analysis.
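
Outside Tableau, the dimension/measure split behaves much like a group-by: the dimension defines the groups and the measure is aggregated inside each group. The following is a minimal Python sketch of that idea; the rows, the field names "segment" and "sales", and the helper sum_by_dimension are made up for illustration and are not part of Tableau.

```python
from collections import defaultdict

# Hypothetical rows: "segment" plays the role of a dimension,
# "sales" plays the role of a measure.
rows = [
    {"segment": "Consumer", "sales": 100.0},
    {"segment": "Corporate", "sales": 250.0},
    {"segment": "Consumer", "sales": 50.0},
]


def sum_by_dimension(rows, dimension, measure):
    """Group rows by a dimension and sum the measure within each group."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[dimension]] += row[measure]
    return dict(totals)


sales_by_segment = sum_by_dimension(rows, "segment", "sales")
print(sales_by_segment)
```

Dropping the dimension leaves a single grand total, and dropping the measure leaves only a list of labels, which is why both are needed for a meaningful view.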
Shelves: with the help of shelves we develop the reports.

Rows and Columns: we drag the dimensions and measures onto the Rows and Columns shelves to build the reports.

If we drag the dimension onto Rows, we get a horizontal visualization.
If we drag the dimension onto Columns, we get a vertical visualization.

Marks card: with the help of the Marks card we can make reports more meaningful.
Colour
Size
Tooltip
Worksheet:
A worksheet consists of a single visualization where we can see the insights.

Dashboard: a combination of multiple worksheets where we can see multiple visualizations together.

Storyboard: a combination of sheets and dashboards with which we can narrate whatever story we want.
Data connections:
With the help of data connections we can connect to different types of source systems.

Types of connections:
1. Live connection
2. Extract connection (or import connection), of two types:
   1. Extract all data
   2. Extract some data

Live connection:
Live connection path: Data source (file, server) → Tableau Desktop

• With a live connection we get live data from the source level into the Tableau environment. Tableau holds only a reference to the data.
• A live connection always shows with a single-cylinder icon in Tableau.
Advantages of live:
• Whenever the data is updated at the source level, the update reaches the Tableau environment.
• Whenever data is updated, only the updated data is brought in.
Disadvantages:
1. Every time, we need to query the data source to get the results, which impacts the performance of the visualization.
2. If we lose the connection, the reports show no data; only the structure remains.
Extract connection:
Internal mechanism for an extract connection: Data source (files, servers/DB) → snapshot image → data engine → Tableau data extract → Tableau Desktop

• With an extract connection, Tableau connects to the source system and takes the whole data in the form of a snapshot. The snapshot is in an image format that Tableau cannot read directly, so it is sent to one more component known as the data engine. The data engine has in-memory capability to handle the image file, converts it to normal data, and saves it with the .hyper extension (Tableau data extract).
• After selecting Extract, when we go to the sheet level Tableau automatically asks us to save the file in .hyper form.
• Whatever fields we drag, the results come from this saved .hyper file.
• An extract connection always shows with a double-cylinder icon in Tableau.
• It improves the performance of the dashboard.

Advantages:
1. We don't need to query the data source every time to get the results, which improves the performance of the visualization.
2. If we lose the connection, we still have the saved reports.

Disadvantages:
1. We do not get the updated data automatically, but we have an option known as extract refresh. When we perform an extract refresh, it replaces the previous file with a new file.

Extract some data:
Before starting the extract mechanism we apply filters, and then we follow the same process.

1. How to assign a title, caption and summary
2. How to change from one aggregation to another
3. How to change the default aggregation

Basic visualization:
If we build a report using a single dimension and a single measure, we call it a basic visualization.

How to assign a title, caption and summary

Title:
By default the sheet name is assigned as the title. To change it, double-click the title; in the window that opens, remove the sheet name and enter the required title.

Caption:
The caption is the one place that records everything we are using in the visualization area.

Summary:
It gives detailed information about the measure with different aggregations.
It can show the sum, average, minimum, maximum and count.

How to change one aggregation to another?
To change the aggregation at the visualization level, right-click the measure field; there is an option known as Measure, and from its drop-down we select the aggregation required.

How to change the default aggregation?
To change the default aggregation, right-click the measure, choose Default Properties, then Aggregation, and select the aggregation we want as the default.
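
To make the aggregations listed in the summary concrete, here is a small Python sketch that computes the same five figures for one measure. The sample values and the function name summarize are assumptions for illustration; Tableau computes these itself once the aggregation is chosen.

```python
from statistics import mean


def summarize(values):
    """Return the aggregations a summary shows for one measure."""
    return {
        "sum": sum(values),
        "avg": mean(values),
        "min": min(values),
        "max": max(values),
        "count": len(values),
    }


# Hypothetical sales values for four rows.
summary = summarize([200, 400, 300, 600])
print(summary)
```

Changing the aggregation of a measure in the view corresponds to picking a different key from this result: the underlying rows stay the same, only the function applied to them changes.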
Calculated field: By using a calculated field we can create new data from the existing data, that is, create new columns in Tableau.

• LOWER(string)
Converts text to lowercase.
LOWER([Category])

• UPPER(string)
Converts text to uppercase.
UPPER([Category])

• LEFT(string, number)
Extracts a specific number of characters from the start of a string.
LEFT([Order id], 5)

• RIGHT(string, number)
Extracts a specific number of characters from the end of a string.
RIGHT([Customer Name], 4)

• MID(string, start, length)
Extracts a substring starting at a specific position.
MID([order id], 2, 4)

• TRIM
Removes leading and trailing spaces.
TRIM([Category])

• REPLACE
Replaces part of a string with another string.
REPLACE([Category], "Office", "Home")

• SPLIT(string, delimiter, token number)
Splits the string on the delimiter and returns the requested token. The example splits on a space and returns the first word.
SPLIT([Customer Name], " ", 1)

• CONCATENATION
Joins strings with +.
[Category] + " " + [Sub-Category]
[Region] + " " + [State/Province]
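
For readers who know Python, the string functions above can be approximated as follows. This is only a sketch: positions in Tableau start at 1 while Python indexes from 0, the helper names left, right, mid and split_token are made up here, and the sample order ID and customer name are hypothetical values.

```python
def left(text, count):
    """LEFT(string, number): first `count` characters."""
    return text[:count]


def right(text, count):
    """RIGHT(string, number): last `count` characters."""
    return text[-count:] if count > 0 else ""


def mid(text, start, length):
    """MID(string, start, length): `start` is 1-based, as in Tableau."""
    return text[start - 1:start - 1 + length]


def split_token(text, delimiter, token_number):
    """SPLIT(string, delimiter, token number): 1-based token, "" if missing."""
    parts = text.split(delimiter)
    if 1 <= token_number <= len(parts):
        return parts[token_number - 1]
    return ""


order_id = "CA-2016-152156"      # hypothetical order ID
customer_name = "Claire Gute"    # hypothetical customer name

print(left(order_id, 5))
print(mid(order_id, 2, 4))
print(right(customer_name, 4))
print(split_token(customer_name, " ", 1))
print("  Furniture ".strip())                        # like TRIM
print("Office Supplies".replace("Office", "Home"))   # like REPLACE
print("East" + " " + "Ohio")                         # concatenation
```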
IF Statement in Tableau
The IF statement in Tableau is a conditional statement used to evaluate expressions and return different values based on conditions. Conditions are tested from top to bottom and the first one that is true wins, so thresholds must be listed in ascending order.

Example 1

IF SUM([Sales]) <= 20000 THEN "Low Sales"
ELSEIF SUM([Sales]) <= 95000 THEN "Medium Sales"
ELSE "High Sales"
END

Example 2

IF [Profit] > 0 THEN "Positive Profits"
ELSE "Negative Profits"
END
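
The same top-to-bottom evaluation can be sketched in Python. The thresholds 20000 and 95000 are the ones used in Example 1; the function name sales_band is hypothetical. Because the first true branch wins, the smaller threshold has to be tested first, otherwise the middle band can never be reached.

```python
def sales_band(total_sales):
    """Classify a sales total; branches are checked in order, first match wins."""
    if total_sales <= 20000:
        return "Low Sales"
    elif total_sales <= 95000:
        return "Medium Sales"
    else:
        return "High Sales"


for amount in (15000, 60000, 120000):
    print(amount, sales_band(amount))
```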
Practice:
• Display the number of orders for every region and year
• Display the number of customers for every region
• Display the number of customers for 2022 only
• Display the number of customers for 2023 only
• Display the number of customers for the South region only
CASE Statement in Tableau
A CASE statement in Tableau is an alternative to multiple IF-ELSEIF conditions. It is used when you want to compare a
field against multiple values in a cleaner and simpler way.
Example :
CASE [Category]
WHEN "Furniture" THEN "Home Furniture"
WHEN "Technology" THEN "Tech Gadgets"
WHEN "Office Supplies" THEN "Office Products"
END
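
A CASE on a single field is essentially a lookup table. A minimal Python sketch of the example above, assuming that a value with no matching WHEN (and no ELSE) yields an empty result, here None; the names CATEGORY_LABELS and relabel are made up for illustration.

```python
# One entry per WHEN ... THEN ... pair from the CASE example.
CATEGORY_LABELS = {
    "Furniture": "Home Furniture",
    "Technology": "Tech Gadgets",
    "Office Supplies": "Office Products",
}


def relabel(category):
    """Return the mapped label, or None when no WHEN branch matches."""
    return CATEGORY_LABELS.get(category)


print(relabel("Technology"))
print(relabel("Toys"))
```

Compared with a chain of IF/ELSEIF tests, the lookup form makes it obvious that one field is being compared against a fixed list of values.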
• How to get present year sales, previous year sales and % change

Present year sales

SUM(IF YEAR([Order Date]) = YEAR(TODAY()) THEN [Sales] END)

Previous year sales

SUM(IF YEAR([Order Date]) = YEAR(TODAY())-1 THEN [Sales] END)

% Change

(([Present year sales] - [Previous year sales]) / [Previous year sales])

Once the % Change calculation is created, right-click the field, go to Default Properties, select Number Format, and choose Percentage.

% Change (year to date)

(([Present year YTD] - [Previous year YTD]) / [Previous year YTD])
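
The arithmetic behind these three calculations can be checked with a short Python sketch. The order rows, the fixed "today" date and the helper names year_sales and pct_change are assumptions for illustration; the guard against a zero previous year is an addition that the Tableau formula above does not include.

```python
from datetime import date

# Hypothetical (order date, sales) rows.
orders = [
    (date(2022, 5, 5), 999.0),
    (date(2023, 3, 1), 100.0),
    (date(2023, 7, 15), 300.0),
    (date(2024, 2, 10), 500.0),
]


def year_sales(orders, year):
    """Sum sales for orders placed in the given year."""
    return sum(sales for order_date, sales in orders if order_date.year == year)


def pct_change(current, previous):
    """(current - previous) / previous, or None when previous is zero."""
    if previous == 0:
        return None
    return (current - previous) / previous


today = date(2024, 6, 1)  # fixed so the example is repeatable
present = year_sales(orders, today.year)
previous = year_sales(orders, today.year - 1)
change = pct_change(present, previous)
print(present, previous, f"{change:.0%}")
```

With these rows, present year sales are 500 and previous year sales are 400, so the change is 0.25, which the percentage number format displays as 25%.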

• ZN() function
The ZN() function in Tableau converts NULL values to zero (0) and leaves other values unchanged.

ZN(SUM([Sales]))
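
As a one-line Python analogy, treating None as the stand-in for NULL (the function name zn is chosen here only to mirror the Tableau function):

```python
def zn(value):
    """Return 0 for a missing value (None), otherwise the value itself."""
    return 0 if value is None else value


print(zn(None), zn(125.5))
```

This matters mostly in arithmetic: a NULL in one term makes the whole result NULL, while a zero lets the rest of the calculation go through.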

• How to highlight above-average and below-average values

IF SUM([Sales]) >= WINDOW_AVG(SUM([Sales])) THEN
'High Sales'
ELSE 'Low Sales'
END
// Drag this calculation onto Colour on the Marks card

• How to highlight the highest and the lowest sales value

RANK(SUM([Sales]), 'desc') <= 1 OR RANK(SUM([Sales]), 'asc') <= 1

// Drag this calculation onto Colour on the Marks card; it highlights the highest and the lowest sales value
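
Both highlight calculations compare each mark with something computed over all the marks in the view. A minimal Python sketch of that logic, using made-up regional totals and hypothetical helper names:

```python
# Hypothetical SUM([Sales]) per region, i.e. one value per mark in the view.
sales = {"East": 700.0, "West": 900.0, "Central": 500.0, "South": 300.0}


def above_average_labels(totals):
    """Label each mark against the average of all marks (like WINDOW_AVG)."""
    average = sum(totals.values()) / len(totals)
    return {
        key: "High Sales" if value >= average else "Low Sales"
        for key, value in totals.items()
    }


def is_top_or_bottom(totals):
    """True for marks that rank first from the top or first from the bottom."""
    highest = max(totals.values())
    lowest = min(totals.values())
    return {key: value in (highest, lowest) for key, value in totals.items()}


print(above_average_labels(sales))
print(is_top_or_bottom(sales))
```

Placing such a result on Colour gives two colours: one for the marks where the test is true and one for the rest.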
Formatting:
With the help of formatting we can make the visualization more meaningful.

Levels of formatting:
1. Sheet-level formatting: formatting applied at sheet level is replicated to both the dashboard and the storyboard.
2. Dashboard-level formatting: formatting applied at dashboard level is replicated to both the sheet and the storyboard.
3. Storyboard-level formatting: formatting applied at storyboard level is not replicated to the sheet or the dashboard. In a storyboard we cannot format the visualization fields; we can format only the outline.

Table: the overall visualization is called the table.
Pane: a subset (subpart) of the visualization is called a pane.
Cell: each individual value is called a cell.

Types of formatting:
Font: change the font type, font name, font colour, etc.
Alignment: align the headers left, right or centre, and switch their direction from horizontal to vertical or from vertical to horizontal.
Shading: apply colours to the visualization fields.
Borders: when building a text table visualization we can use borders.
Lines
Joins and cross-database joins
========================

Joins:
To combine tables from a single data source we use joins.
Joins combine the tables row by row first, and the combined data is then aggregated.

Rules:
1. A common column between both tables
2. A common data type in that column

Types:
1. INNER JOIN (also known as equi join): only the matching records from both tables.
2. LEFT JOIN (or LEFT OUTER JOIN): all records from the left table plus the matching records from the right table.
3. RIGHT JOIN (or RIGHT OUTER JOIN): all records from the right table plus the matching records from the left table.
4. FULL JOIN (or FULL OUTER JOIN): all matching and unmatched records from both tables.
5. CROSS JOIN: every record of one table paired with every record of the other.

Example:

Table 1    Table 2
Cust ID    Cust ID
1          1
1          1
1          1
4          7

Inner join: 9 rows
Left outer join: 10 rows
Right outer join: 10 rows
Full outer join: 11 rows
Cross join: 16 rows
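
The row counts in this example follow from simple counting: each key contributes (rows on the left) × (rows on the right) to the inner join, and unmatched rows are added for the outer joins. A short Python sketch that reproduces the numbers; the function name join_row_counts is hypothetical.

```python
from collections import Counter


def join_row_counts(left_keys, right_keys):
    """Count result rows of each join type when joining on a single key."""
    left, right = Counter(left_keys), Counter(right_keys)
    # Every left row with key k pairs with every right row with key k.
    inner = sum(left[key] * right[key] for key in left if key in right)
    left_only = sum(n for key, n in left.items() if key not in right)
    right_only = sum(n for key, n in right.items() if key not in left)
    return {
        "inner": inner,
        "left": inner + left_only,
        "right": inner + right_only,
        "full": inner + left_only + right_only,
        "cross": len(left_keys) * len(right_keys),
    }


counts = join_row_counts([1, 1, 1, 4], [1, 1, 1, 7])
print(counts)
```

Cust ID 1 appears three times on each side, so it alone produces 3 × 3 = 9 rows; the unmatched 4 and 7 each add one row to the outer joins. This row multiplication is also why a join on a non-unique key can inflate totals.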

Cross data base joins :


To combine tables that come from multiple data sources (for example an Excel file and a SQL database) into one data source we use cross-database joins. Cross-database joins can only be performed when the tables are at a single granularity level.

Both data sources should contain the same column name to join on.
Relationships or data modelling: creating a link between one table and another table.

Difference between Joins and Relationships


Data blending:
=============
To combine data from different data sources at sheet level we use data blending.

Blending:
Data blending aggregates the data first and then combines the fields.
The default join in data blending is a left outer join.
We choose data blending when the data is at two different granularity levels.

Rules:
• A common column between both data sources
• A common data type in those columns

Ways to perform blending:
1. Automatic: if the rules are satisfied, Tableau performs the data blending by itself.
2. Manual: by renaming the field so that the names match, we can perform the data blending.
3. Edit relationship: with Edit Relationships we can define the relationship between the two data sources on the common field.

Data → Edit Relationships → Add → Products → Product, then Tableau performs the data blending.
Example: the common field has different names in the two sources

Excel (DB1)    SQL (DB2)
Region         Territory
EAST           EAST
WEST           WEST
NORTH          NORTH
SOUTH          SOUTH

We can use data blending when the two databases are at two different granularity levels.

Data source 1          Data source 2
Products  Sales        Products  Profits
A         200          A         20
B         400          B         50
C         300          C         100
D         600          D         130
E         800          E         150
F         700          F         180
Primary data source:
The data source of the first field we drag into the visualization area becomes the primary data source. All the remaining data sources are secondary data sources.
The primary data source is indicated by a blue tick mark and a secondary data source by an orange tick mark.

When data blending is active, the linking field shows an orange link symbol.
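
The "aggregate first, then combine with a left join from the primary source" behaviour can be sketched in Python. The first three products and their values are taken from the products example; product G and the function names aggregate and blend are hypothetical additions so that the unmatched cases are visible.

```python
from collections import defaultdict


def aggregate(rows):
    """Sum the value per key; each source is aggregated separately."""
    totals = defaultdict(float)
    for key, value in rows:
        totals[key] += value
    return dict(totals)


def blend(primary_rows, secondary_rows):
    """Left-join the aggregated secondary source onto the aggregated primary."""
    primary = aggregate(primary_rows)
    secondary = aggregate(secondary_rows)
    # Every primary key is kept; a missing secondary value becomes None.
    return {key: (value, secondary.get(key)) for key, value in primary.items()}


sales_rows = [("A", 200), ("B", 400), ("C", 300), ("G", 50)]   # G is hypothetical
profit_rows = [("A", 20), ("B", 50), ("C", 100), ("D", 130)]

blended = blend(sales_rows, profit_rows)
print(blended)
```

Product G exists only in the primary source, so it is kept with an empty profit, while D exists only in the secondary source and is dropped. Swapping which source is primary swaps that outcome.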

Limitations of data blending

1. If you are working with large data sources, you will get performance issues.
2. Data blending is suitable only for aggregated data.
3. It depends on common data types and on the granularity levels of the sources.
Difference between Joins and Blending
Types of charts:

Bar charts (string-related information):
1. Bar chart
2. Stacked bar
3. Side-by-side bar
4. Histogram
5. Bullet graph
6. Box and whisker plot

Circle-related charts:
1. Circle chart
2. Side-by-side circle
3. Pie chart
4. Bubble chart
5. Scatter plot

Line charts:
1. Line (continuous)
2. Line (discrete)
3. Area (continuous)
4. Area (discrete)
5. Dual lines
6. Dual combination

Other charts:
Tree map
Heat map
Text table
Highlight table
Symbol map
Filled map
Gantt chart
Bar chart:
To compare individual values across categories we use bar charts.

Stacked bar:
With a stacked bar we can compare total values broken down by another dimension field.

Side-by-side bar:
If we are using more than one measure and need to compare them side by side, we use a side-by-side bar.

Histogram (bins):
To see how the values of a measure are distributed across equal-sized ranges (bins) we use a histogram.

Bullet graph:
To compare two measures, such as actual and target, in a single bar, where the second measure is shown as a reference line. We can also use it to compare present year sales with previous year sales.

Box and whisker plot:
It is computed over the whole table and gives the results in the form of the IQR (interquartile range): lower whisker, lower quartile, median, upper quartile and upper whisker.

If we have 1000 or 2000 records, it computes over all the records and gives the results.
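
The five values of a box and whisker plot can be computed by hand for a small data set. The sketch below uses Python's statistics.quantiles with the "inclusive" method and whiskers that reach the furthest data points within 1.5 × IQR of the box; both choices are assumptions, and a particular tool may compute quartiles slightly differently. The sample values are made up.

```python
from statistics import quantiles


def box_plot_summary(values):
    """Return quartiles, IQR, 1.5*IQR whiskers and outliers for the values."""
    data = sorted(values)
    lower_q, median, upper_q = quantiles(data, n=4, method="inclusive")
    iqr = upper_q - lower_q
    low_fence = lower_q - 1.5 * iqr
    high_fence = upper_q + 1.5 * iqr
    inside = [v for v in data if low_fence <= v <= high_fence]
    return {
        "lower_whisker": inside[0],
        "lower_quartile": lower_q,
        "median": median,
        "upper_quartile": upper_q,
        "upper_whisker": inside[-1],
        "iqr": iqr,
        "outliers": [v for v in data if v < low_fence or v > high_fence],
    }


box = box_plot_summary([1, 2, 3, 4, 5, 6, 7, 8, 30])
print(box)
```

Here the box runs from 3 to 7 (IQR 4), the whiskers stop at 1 and 8, and the value 30 lies beyond the upper fence of 13, so it is plotted as an outlier rather than stretching the whisker.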
Continuous vs discrete

Continuous:
There is a dependency between one field and another.

Discrete:
There is no dependency between one field and another.

Pie chart:
In a single circle, the dimension field is divided into slices.

Bubble chart:
Each individual value is shown as one circle, and the circle is sized based on the measure value.

Scatter plot:
To find the relationship between the values on the x and y axes we use a scatter plot.
To build a scatter plot we need at least two measures.

Tree map: shows the results in hierarchical order with drill-down, so we can show results from a higher level to a lower level.

Heat map: always follows the data source order, and the marks are sized based on the measure.
Advanced charts:
==============
In advanced charts we need to apply some logic to get the results.

Word cloud
Bar-in-bar chart
Funnel chart
Donut chart
Waterfall chart
Bump chart
Pareto chart
Lollipop chart, rounded bar chart, butterfly chart

Word cloud:
Here we show the result in the form of text, and we size the text based on the measure value.

Bar-in-bar chart:
To compare two measures in a single bar we use bar-in-bar charts, for example:
Sales vs profit comparison
Actual vs target comparison
To get a bar-in-bar chart we need to use a dual axis.

Donut chart: to see the overall sales value together with the individual sales values we can use a donut chart.

Steps to create a donut chart:
• Create a calculated field and type zero (0). Drag that calculated field onto Rows twice, then select the second copy and make it a dual axis.
• On the All Marks card, change the mark type to Pie. Then drag the required fields onto Colour on the second Marks card.
Maps:

To represent geographical data we use maps.
To get maps we need geographical fields.
To plot the maps we need latitude and longitude.

Types of maps:
Symbol map: shows the result at the geographical location in the form of a symbol, either a circle or a square.
Filled map: fills the entire area of the geographical location.

Ways to work with maps:
1. Background maps (None, Offline, Tableau, WMS server, Mapbox, Set as default)
2. Background images
3. Edit locations
4. Map legends
5. Map layers
6. Map options
7. Custom geocoding

None: whenever we need to add background images, we set the background map to None.
WMS server: with WMS maps we can show satellite maps.
Mapbox: to show maps in different styles we use Mapbox maps.

Edit locations:
Null and unknown:
Null: if we don't have the data, Tableau recognizes it as null.
Unknown: we have the data but Tableau does not recognize it; we call this unknown.
With Edit Locations we can turn unknown values into known values.

Map legends:
Map legends are hidden by default. To show the map legends we need to select one of the layers from Map layers.

Map layers:
Styles
Layer selection

Map options:
Zoom in
Zoom out
Reset
Selection (rectangular, circular, lasso), drag, zoom in on a single field
Rank functions:
===========
Rank unique: even if there are duplicate values, each one is assigned a unique rank.
Rank dense: duplicate values are assigned the same (minimum) rank, and the following ranks are not skipped.
Rank competitive: duplicate values are assigned the same minimum rank, and the ranks after the tie are skipped.
Rank modified: duplicate values are assigned the same maximum rank of the tie, and the ranks before it are skipped.
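
The four behaviours differ only in how ties are handled, which is easiest to see on one small list. The following Python sketch ranks in descending order (highest value gets rank 1); the sample values and helper names are assumptions, and they correspond to Tableau's RANK_UNIQUE, RANK_DENSE, RANK and RANK_MODIFIED respectively.

```python
def rank_unique(values):
    """Ties get distinct ranks, broken by original row order."""
    order = sorted(range(len(values)), key=lambda i: values[i], reverse=True)
    ranks = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        ranks[index] = rank
    return ranks


def rank_dense(values):
    """Ties share a rank and no rank numbers are skipped."""
    position = {v: i for i, v in enumerate(sorted(set(values), reverse=True), 1)}
    return [position[v] for v in values]


def rank_competitive(values):
    """Ties share the lowest rank number; the next ranks are skipped."""
    return [1 + sum(other > v for other in values) for v in values]


def rank_modified(values):
    """Ties share the highest rank number of the tied group."""
    return [sum(other >= v for other in values) for v in values]


sales = [6, 9, 9, 14]
print("unique     ", rank_unique(sales))
print("dense      ", rank_dense(sales))
print("competitive", rank_competitive(sales))
print("modified   ", rank_modified(sales))
```

For the two tied values of 9, unique gives 2 and 3, dense and competitive both give 2, and modified gives 3. The value 6 is ranked 3 by dense (nothing skipped) but 4 by the other three.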
Percentile rank:
Asc: lowest value = 0, highest value = 100
Desc: highest value = 0, lowest value = 100
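
One common way to compute a percentile rank that runs from 0 to 100 is sketched below in Python. It assumes the formula (number of values at or before the current one − 1) / (n − 1), with ties sharing a value; this formula and the function name rank_percentile are assumptions used for illustration. The result is a fraction between 0 and 1, so multiply by 100 for a percentage.

```python
def rank_percentile(values, ascending=True):
    """Percentile rank of each value as a fraction between 0 and 1."""
    n = len(values)
    if n < 2:
        return [0.0] * n

    def at_or_before(value):
        if ascending:
            return sum(1 for other in values if other <= value)
        return sum(1 for other in values if other >= value)

    return [(at_or_before(v) - 1) / (n - 1) for v in values]


sales = [6, 9, 9, 14]
print([round(p, 2) for p in rank_percentile(sales)])
print([round(p, 2) for p in rank_percentile(sales, ascending=False)])
```

In ascending order the lowest value (6) is at 0 and the highest (14) at 1; in descending order the two ends swap.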
LODs (level of detail expressions):
LODs let us compute a value at a granularity that differs from the visualization, for example to add more granularity to it.
• With LODs we can apply aggregations per dimension (including dates).
• We can fix a measure for multiple dimensions by making use of LODs.

There are three types of LODs:

• Fixed
• Include
• Exclude

Fixed:
In a FIXED LOD we fix the dimension(s) at which the result is computed.

Example: { FIXED [Customer Name] : MAX([Order Date]) }

Explanation: Using FIXED LODs we can find the last order of each customer (MAX), or the first order by using MIN instead.

Include:
Without dragging a field into the visualization, we can make it affect the result by using an INCLUDE LOD.

Example: { INCLUDE [Sub-Category], [Region] : AVG([Sales]) }

Explanation: if the view shows sales by sub-category and I also want the average sales computed by sub-category and region, I can use an INCLUDE LOD.

Exclude:
To leave a dimension that is in the visualization out of a calculation, I can use an EXCLUDE LOD.

Example: { EXCLUDE [State] : SUM([Sales]) }

Explanation: if the view shows sales by region and state and I want the sales total without the state level, I can use an EXCLUDE LOD.
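
What FIXED and EXCLUDE compute can be imitated with two small group-by helpers in Python. The order rows, field names and function names below are all made up for illustration; the sketch only mirrors the two examples above and is not how Tableau evaluates LODs internally.

```python
from collections import defaultdict
from datetime import date

# Hypothetical order rows.
orders = [
    {"customer": "Asha", "region": "East", "state": "New York",
     "order_date": date(2023, 1, 5), "sales": 100.0},
    {"customer": "Asha", "region": "East", "state": "Ohio",
     "order_date": date(2023, 6, 9), "sales": 50.0},
    {"customer": "Ravi", "region": "West", "state": "Utah",
     "order_date": date(2022, 11, 2), "sales": 200.0},
]


def fixed_max_order_date(rows):
    """Like { FIXED [Customer Name] : MAX([Order Date]) }: one value per customer."""
    latest = {}
    for row in rows:
        customer = row["customer"]
        if customer not in latest or row["order_date"] > latest[customer]:
            latest[customer] = row["order_date"]
    return latest


def exclude_state_sales(rows):
    """Like { EXCLUDE [State] : SUM([Sales]) } in a region/state view:
    every (region, state) mark carries the total of its whole region."""
    region_totals = defaultdict(float)
    for row in rows:
        region_totals[row["region"]] += row["sales"]
    return {(row["region"], row["state"]): region_totals[row["region"]] for row in rows}


print(fixed_max_order_date(orders))
print(exclude_state_sales(orders))
```

The FIXED result depends only on the customer, whatever else is in the view. The EXCLUDE result repeats the regional total (150 for East) on each state row, which is what makes "share of region" style calculations possible.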
Schemas:
Whenever we load data into the Tableau environment, the tables form a structure; that structure is known as a schema.

Types of schemas:
1. Star schema
2. Snowflake schema
3. Mixed, galaxy or fact constellation schema

Star schema:
If each and every dimension table is directly connected to the fact table, the schema is known as a star schema.

[Diagram: a fact table in the centre with four dimension tables connected directly to it]

Snowflake schema:
If there is a relationship from one dimension table to another dimension table, so that some dimension tables reach the fact table only through another dimension table, the schema is called a snowflake schema.

[Diagram: a fact table in the centre with dimension tables around it, some of which connect to further dimension tables]
How can you improve the performance of the dashboard?
There are several ways to improve the performance of a dashboard:
1. Use an extract connection.
2. Avoid nested calculations; they impact performance.
3. Keep the number of visuals in the dashboard to a minimum.
4. Keep the number of filters low; more filters in the dashboard impact performance.
5. Use parameters in the dashboard.
6. Use context filters.
7. After developing the dashboard, hide all the unused fields.
8. Remove unnecessary containers from the dashboard layout; a lot of Tableau developers leave them in.
9. Do the aggregation at the data source level rather than in Tableau.
10. If there are still performance issues, run the optimizer options.
11. If the issues remain, use performance recording to see whether filters or calculations are taking more time.
Agile terms: Agile methodology, Sprint, Sprint review meeting, Retrospective session

Agile Methodology – A project management and software development approach that emphasizes iterative progress,
collaboration, and flexibility in adapting to changes.

Sprint – A short, time-boxed period in Agile methodology during which a development team works to complete a
set amount of work.

Sprint Review Meeting – A meeting at the end of a sprint where the development team showcases completed work to
stakeholders and gathers feedback for future improvements.

Retrospective Session – A meeting held after a sprint where the team reflects on what went well, what could be
improved, and how to enhance future work processes.
