Tableau Classes
Tableau Classes
PRODUCT LINES :
as tableau workspace
Two parts :
1.Data window: where we are saving the source data is known as data window. Data window consist of
dimensions and Measures
2. Visualization area : where we developing the reports is known as visualization area
Data window
or Data pane
Visualization Area
Visualization area Consist :-
• Column Shelve
• Row Shelve
• Page Shelve
• Filter Shelve
• Marks Card
Dimension:
• Any Textual Data Which describes about any product or any business is known as dimension
• it consist of textual data + primary keys (it will not allow any duplicates )
• It behaves as discrete or independent ( there is no dependency between one field to another field )
• Examples : textual data (customer name, vendor name, segment,)
geographical data (country, state . City , etc)
date Fields ( years, quarter, month, week, day ) ,
Primary keys (customer id, Order id, vendor id,)
Measures :
• measure is a number which analyse about the business is known as measures.
• it consist of numerical data + foreign keys ( it will allow duplicates)
• it behaves as continuous and dependent ( there will be dependency between one filed to another field )
Eg : (sales, profits , salary , Discount etc )
if we need build meaning full Visualization we need both dimension and measures without dimension there is no measure
without measure there is no analysis
Shelves : by the helps of shelves we can develop the reports.
Row we need to drag the dimensions and measures into rows and columns for building the reports.
column
If we are dragging the dimension into rows where we will get horizontal visualization
if we dragging the dimension into columns where we will get the vertical visualization
Marks card : by the help of marks card we can make reports in more meaningful format
Colour
Size
Tool tip
Worksheet :
Worksheet consist of single visualization where we can able to see the insights.
Dashboard : it is the combination of multiple worksheets where we can able to see the multiple
visualizations.
Story board : it is the combination of Sheets and Dashboards where we can able to narrate the story
What ever we want.
Data connections :
By the help data connection we will get to know different Types of source systems
Types of connections:
1. live connections
2. extract connection or import connection : two types
1. extract all data
2. Extract some data
Live connection :
Live connection path
• By the help of live connection we will get the live data from source level to tableau environment
level, here we have only reference of the data at tableau
• Live connection always Shows with single Cylinder in Tableau
Advantages of live :
• When ever the data is updating to source level we will get data into tableau environment.
• When ever data is updated it will updates only updated data.
DISADVANTAGES :
• 1, every time we need to query on the data source to get the results it will impact performance of the visualization.
• 2. IF we lost the connection we don’t have any reports any large level just we have structure only.
Extract connection:
Internal mechanism for extract connection
Tableau data
Snap shot Data engine extract
image
Data source
Files Tableau desktop
Servers(db)
• By help of extract connection we will connect with source system and it will take whole data in the form of snapshot
But it will be in format of image, tableau cannot read the image file it will send the image file to one more server which is
know as data engine, these data engine have some in memory capability to handle the image file as well as where it will
convert image file to normal data and it will save in the extension of (.hyper )(tableau data extract _)
• Extract connection will always shows with double cylinders in Tableau.
Extract Connection :
When ever we are using extract connection it will automatically ask us to save the files in the form of .Hyper.
• What ever the fields that we are dragging it will give results from this saved .hyper file
• it will always shows with double cylinders
• It will improve the performance of the dashboard.
Extract Connection : After selecting Extract connection if your going to sheet level, It will automatically asks us to save the file in
the form .Hyper
Advantage :
1, each and every time we don’t need query on the data source to get the result by the help of that where we can improve
the performance of the visualization.
2, if we lost the connection we do have saved reports.
Disadvantages:
1. We will not get the updated data but we will have the option known as extract refresh, when we are performing the
extract refresh it will replace previous file with new file.
Caption:
What ever we are using at visualization area each and every thing will save as one place is known as
caption.
SUMMARY:
It will going to give detail information about the measure with different aggregation.
Calculated field
Calculated filed
• LEFT (string, number)\
• LOWER() → Converts text to lowercase. • Extracts a specific number of characters from
the start of a string.
LOWER([Category])
LEFT([Order id],5)
• TRIM
Removes leading and trailing spaces.
TRIM([Category])
• REPLACE
Replaces part of a string with another string.
• CONCATENATION
[Region]+“ "+[State/Province]
IF Statement in Tableau
The IF statement in Tableau is a conditional statement used to evaluate expressions and return different values
based on conditions
Example 1
Example 2
% Change
• ZN() Function
The ZN() function in Tableau is used to convert NULL values to zero (0).
Zn (sum(sales))
▲ ▼
IF SUM([Sales])>=WINDOW_AVG(SUM([Sales])) THEN
'High sales'
ELSE 'Low Sales'
END
--DRAG this Calculation into Colours
• How to Highlight Top Highest sales and bottom Least sale Value
RANK(SUM([Sales]),'desc’)<=1 OR RANK(SUM([Sales]),'asc’)<=1
Level of formatting :
1.Sheet level formatting : if we are performing formatting at sheet level it will be replicate to both dashboard and storyboard
2.Dashboard level formatting: if we are performing formatting at dashboard level it will replicate to both sheet and
storyboard
3.Story board level formatting : if we are performing formatting at story board level it will not replicate to sheet or dashboard,
we cannot performing the formatting for visualization fields in storyboard only for outline we can perform formatting.
Types of formatting :
Font: by the help of font formatting where we can change the font type , font name, font colour etc.
Alignment : by the help of alignment where we can make the adjustment of headers either left right or mid and change the
form horizontal vertical and vertical to horizontal.
Shades: applying the colours to visualization fields.
Boarders : if we need to build any of the text table visualization where we can make use boarders in that we will use the lines
Lines
Joins and cross data base joins
========================
Joins:
If we need to combine the tables from single data source where we will make use of joins,
Joins will combines tables and aggregate the data.
Rules :
1. common column between the both the tables
2.Common data type in that column
Types :
6 6 8 8
Table 1 table 2
Cust id cust Id
1 1
1 1
1 1
4 7
Inner join: 9
Left outer join: 10
Right outer join: 10
Full outer join: 11
Cross Join : 16
It should Consist of same column name from both the data sources.
Relationships or data modelling : creating the link between one table
to another table.
Blending :
data blending will aggregate the data first then will combine the fields.
Default join in the data blending will be left outer join.
If we need to choose the data blending if we have the data at two granularity levels,
Rules;
Common column between both the data sources
Data = edit relationship===add ==products ===product then it will perform the data blending
Excel SQL
DB1 db2
REGION Territory
EAST EAST
WEST WEST
NORTH NORTH
SOUNTH SOUTH
In Order to perform data blending if your having two granularity levels in both the
databases we can use the data blending
DATA SOURCE 2
Data source 1
Products , sales
Produts , profits
A. 200
A,20
B. 400
B,50
C. 300
C100
D. 600
D,130
E. 800
E, 150
F. 700
F, 180
Primary data source :
what ever the field first we are dragging in to visualization area that data source becomes as primary data source
remaining all the data sources comes under secondary data sources.
Primary data source will indicate in the form of blue colour tick mark and secondary data source will indicate in the
orange colour tick mark.
If it is performing the data blending it will shows the orange colour link symbol,
Gannt charts
Bar chart :
If we need to compare individual values with other categorical values where we will make use of bar charts .
Stacked bar :
By the help of stacked where we can compare total value segregated by different dimension fields.
Histogram : (BIN)
If we need to compare the results with different size variables where we can make use of histogram charts
Bullet graph:
If we need to compare two measures like actual and target in single bar where it indicating with reference sales
Or if we want to compare present year sales and previous year sales
If we are having 1000 or 2000 records, In that cases it will compute all the records and it will gives resu
Continuous vs discrete
Continuous :
There will be a dependency between one field to another field
Discrete :
there is no dependency between one filed to another field
Pie chart:
In single circle it will segregate the dimension field with different slices.
Bubble charts:
Each and every individual field will shows as one circle and we are sizing that circle based on measure value,
Scatter plot :
If we need to find the relationship between both x and y axis where we make use of scatter plot.
If we need build the scatter plot at least we require two measures
Tree map: it will shows the results in the form of hierarchal order which is drill down, that means where we can show results
from higher level to lower level .
Heat map : always it will follows the data source order and we are sizing the field based on measure.
Advanced charts :
==============
In advanced charts we need get the logic to get the results.
Word cloud
Bar in bar chart
Funnel chart
Donut chart
Waterfall chart
Bump chart
Pareto chart
Lollipop Chart, Rounded Bar Chart, Butterfly Chart
Word cloud:
in these we are showing result in the form of text and we are sizing the text based on the measure value
If we need to perform to get the results of bar in bar we need make use as dial axes and then we will get the results.
Donut Chart : if you want to see the overall sale values with individual sale values we can use donut chart
.
Maps :
If we need to represent the geographical data where we will make use of maps
If we need to get the maps we required geographical fields.
If we need to plot the maps we required latitude and longitude.
Types of maps :
Symbol map: in that geographical location where we can show the results in the form of symbol either circle or square.
Filled map: it will fills the entire portion of the geographical location.
By the help of edit location where we can make unknown values in the form of known values.
Map legends:
Always map legends are in hide mode , if we need to highlight the map legends we need to select any one of the later from
map layers .
Map layers :
Styles
Layer selection
Map option :
Zoom in
zoom out
Reset
Selection ( rectangular, circular , lasso drag zoom in single filed)
Rank function :
===========
Rank unique : IF any one of duplicate are there always it will assign the unique only.
Rank dense : if we have duplicate values are there it will assign min ranking as well as it will not skip the order.
Rank competitive : if we duplicate values are there always it will assign the min ranking but it will skip the order and it will
assign maximum ranking
Rank modified: if we have duplicate values are there always it will assign the maximum ranking but it will skip the order based
n row number
PERCENTILE:
Fixed :-
In fixed Lod's we are fixing the dimension to get the required results.
Explanation : By using fixed Lod's we can able to find out first order and last order of the customers
Include :- Without dragging fields into the visualization. We can able to impact the visualization by using the include lod’s
Explanation : if I have subcategory wise sales if I want to include subcategory wise, region wise average sales I can use
include Lod’s
Exclude :-
If I want to exclude anything from the visualization I can able to exclude by using the exclude Lod's
Explanation : if I have region wise state wise sales if I want to exclude state wise sales I can use the exclude lod.
Schemas:
When ever we are loading the data into tableau environment we will get some structure that structure is known as schema
Types of schema
Types of schemas:
1.Star schema.
2.Snowflake schema.
3.Mixed or galaxy or fact constellation schema.
Star schema:
If each and every dimension table directly connected with fact table then such type of schema is known as star schema.
dim dim
fact
dim dim
Snow flake schema :
If there any interrelationship between dimension table to dimension table or fact table then such type of schema where we
will call as snowflake schema.
dim dim
dim
fact
dim
dim dim
dim
How can you improve the performance of the dashboard
There are some ways we can improve the performance of the dashboard
1.Extract connect
2. If I use nested calculations it will impact performance.
3.In dashboard we need to keep minimum visuals so that it will improve performance of the dashboard.
3. If keep more filter in dashboard it will impact the performance of the dashbaord.
5. I can use parameter in dashboard.
6.Using context filter it will improve the performance.
7. After developing the dashboard hiding all the unused fields will improve the performance.
8. Lot of tableau developers they do not remove unnecessary container in the dashboard layout.
- Doing aggregation on data source level rather than doing it in tableau it improves performance of the dashbaord,
9. If I’m still facing performance issue I can run optimizer options.
10. Still if I’m facing I can use performance records. So that filters are taking more time or calculations are taking more
time.
Agile Sprint review Retrospective
Sprint session
methodology meeting
Agile Methodology – A project management and software development approach that emphasizes iterative progress,
collaboration, and flexibility in adapting to changes.
Sprint : it is a short, time-boxed period in Agile methodology during which a development team works to complete a
set amount of work
Sprint Review Meeting – A meeting at the end of a sprint where the development team showcases completed work to
stakeholders and gathers feedback for future improvements.
Retrospective Session – A meeting held after a sprint where the team reflects on what went well, what could be
improved, and how to enhance future work processes.