
Project Report

Uploaded by

22kq1a4732

DATA ANALYSIS USING TABLEAU

INTERNSHIP PROJECT REPORT

by

GUGGILAM VENKATA SAI RANJITH KUMAR


22KQ1A54B2

Department of Artificial Intelligence and Data Science


PACE Institute of Science and Technology
Vallur
June 2024

BONAFIDE CERTIFICATE

This is to certify that this project report entitled “DATA ANALYSIS USING
TABLEAU” submitted to PACE Institute of Science and Technology, Vallur, is
a bonafide record of work done by “GUGGILAM VENKATA SAI RANJITH
KUMAR” under my supervision from “20-05-2024” to “11-06-2024”.

<<Signature of the Supervisor>>


<<Name and Designation>>

<<Countersignature>>
<<Name>>
<<Designation, e.g. Head of Dept. / Divisional Head>>
<<Name of Department / Division and Centre>>

Declaration

This is to declare that this report has been written by Guggilam Venkata
Sai Ranjith Kumar. No part of the report is plagiarized from other sources.
All information included from other sources has been duly acknowledged. I
aver that if any part of the report is found to be plagiarized, I shall take full
responsibility for it.

GUGGILAM VENKATA SAI RANJITH KUMAR


22KQ1A54B2

ABSTRACT

This project addresses the challenge of effectively analyzing retail data from
the Sample Superstore dataset to gain insights into sales, profit, and quantity metrics
across various dimensions. Traditional data analysis methods often lack the
capability to provide a comprehensive and interactive exploration of these metrics.

The significance of this project lies in its utilization of Tableau to create an
interactive dashboard, enabling stakeholders to dynamically analyze sales, profit,
and quantity across category, sub-category, ship mode, segment, and state
dimensions. This approach is novel in its integration of multiple interactive features,
such as dynamic filters and highlight actions, providing a user-friendly interface for
detailed data analysis.

The approach involved connecting to the Sample Superstore dataset,
preparing the data, and constructing a series of interconnected visualizations. These
visualizations include sales analysis, profit analysis, and quantity analysis across the
specified dimensions. Interactive filters and parameters were incorporated to allow
users to customize their views, enhancing the data exploration experience.

The major results obtained include the identification of sales trends and profit
margins across different categories and sub-categories, insights into the impact of
ship modes and customer segments on sales and profitability, and a detailed analysis
of sales and profit distribution across various states. These insights are crucial for
strategic decision-making and identifying areas for growth and optimization.

In conclusion, the interactive dashboard created in Tableau serves as a
powerful tool for transforming raw retail data into meaningful visual insights. It
facilitates better decision-making and strategic planning, showcasing the value of
advanced data visualization techniques in the retail industry. This project highlights
the importance of interactive and user-friendly data analysis tools in enhancing
business intelligence and operational efficiency.

TABLE OF CONTENTS

CHAPTER NO. TITLE

ABSTRACT
CONTENTS
1. Introduction to Tableau
2. Features of Tableau
3. Tableau Products
4. Advantages and Disadvantages of Tableau Public
5. Installation of Tableau Public
6. Tableau Public Interface
7. Connections
8. Types of Connections
9. How to import data source
10. Data Types
11. Data Interpreter
12. Worksheet
13. Charts
14. Types of Charts
15. Set
16. Combined Set
17. Group
18. Difference between Set and Group
19. Parameters
20. Calculation Field
21. Filters
22. Types of Filters
23. Filtering Order of Operations
24. Operators
25. Types of Operators
26. Functions
27. Types of Functions
28. Sorting
29. Reference Line
30. Forecast
31. Trend Line
32. Calculations
33. Types of Calculations
34. Relationships
35. Joins
36. Types of Joins
37. Cross Database Joins
38. Unions
39. Blending
40. Dashboard
41. Story
42. Mini Projects
a. Netflix Insights Dashboard
b. HR Insights Dashboard
c. Wine Sales Insights Dashboard
d. Super Store Data Insights Dashboard
43. Main Project
a. Super Store Interactive Dashboard

Introduction to Tableau:
Tableau is a data visualization tool that transforms raw data into interactive, shareable
visual insights through a drag-and-drop interface.

Features of Tableau:
Tableau has several features that make it a powerful data visualization tool. Here are
some of its key features:

• Drag and Drop Interface: Tableau offers a user-friendly interface that allows
users to drag and drop fields to create visualizations.
• Data Connection: Tableau supports a wide range of data sources, including Excel,
SQL, and big data sources like Hadoop and Spark.
• Data Blending: Tableau allows users to combine data from multiple sources into
a single visualization.
• Real-Time Analytics: Tableau offers real-time data analytics and visualization.
• Interactive Dashboards: Tableau enables users to create interactive dashboards
that can be shared with others.
• Storytelling: Tableau's storytelling feature allows users to create a narrative
around their data visualizations.
• Collaboration: Tableau offers collaboration features that enable users to share
and collaborate on visualizations.
• Security: Tableau has robust security features, including data encryption and
authentication.
• Advanced Analytics: Tableau offers advanced analytics capabilities, including
predictive analytics and statistical modeling.
• Mobile Support: Tableau offers mobile apps for iOS and Android devices.
• Cloud Support: Tableau offers cloud support, allowing users to access their
visualizations from anywhere.
• Machine Learning: Tableau offers machine learning capabilities, including
predictive modeling and clustering.
• Geospatial Analytics: Tableau offers geospatial analytics capabilities, allowing
users to analyze and visualize geospatial data.
• Big Data Support: Tableau supports big data sources like Hadoop, Spark, and
NoSQL databases.
• Integration: Tableau integrates with other tools and platforms, including R,
Python, and Salesforce.

Products of Tableau:

Tableau offers several products and platforms tailored to different user needs and
deployment scenarios. Here are the main Tableau products:

• Tableau Desktop: This is the desktop application for connecting to data and authoring
visualizations and dashboards.
• Tableau Server: This is a self-hosted platform for publishing, sharing, and governing
visualizations within an organization.
• Tableau Prep Builder: This allows users to clean, combine, and shape data before analysis.
• Tableau Vizable: This is a consumer data visualization app.
• Tableau Public: This is a free data visualization tool.
• Tableau Reader: This is a free tool that allows users to view and interact with
data visualizations.
• Tableau Mobile: This is a mobile app version of Tableau's data visualization
tools.
• Tableau CRM: This is an analytics platform for Salesforce CRM data, formerly known
as Einstein Analytics.
• Tableau Bridge: This connects Tableau Cloud to data that sits behind a firewall, such as
on-premises databases and files, so that published data stays up to date.

Advantages of Tableau:
Tableau offers several advantages that make it a popular choice for data visualization and
analysis:

• Ease of Use: Tableau's intuitive drag-and-drop interface makes it easy for users of
all skill levels to create visualizations and analyze data without requiring extensive
coding or technical expertise.
• Interactive Visualizations: Tableau enables the creation of highly interactive
visualizations, allowing users to explore and interact with data dynamically. This
interactivity enhances understanding and facilitates deeper insights.
• Wide Range of Data Sources: Tableau supports connectivity to numerous data
sources, including databases, spreadsheets, cloud services, and web data connectors,
enabling users to work with diverse data sets from various sources.
• Speed of Analysis: With its powerful in-memory data engine and efficient
processing capabilities, Tableau enables fast data analysis and visualization, even
when dealing with large and complex data sets.
• Scalability: Tableau is scalable, capable of handling increasing volumes of data
and users as organizational needs grow. It can also be deployed on-premises or in
the cloud, providing flexibility to adapt to changing requirements.

Disadvantages of Tableau:
While Tableau offers numerous advantages, there are also some potential disadvantages
to consider:

• Cost: Tableau can be expensive, especially for organizations with large user bases
or complex deployment needs. Licensing costs, implementation fees, and ongoing
maintenance expenses can add up, potentially making it prohibitive for smaller
businesses or budget-constrained organizations.
• Learning Curve: While Tableau's drag-and-drop interface is intuitive, mastering
its full capabilities and advanced features may require time and effort. Users with
limited technical skills or experience may face a steep learning curve, particularly
when trying to create complex visualizations or perform advanced analytics.
• Performance Issues with Large Datasets: While Tableau's in-memory data
engine is optimized for speed, users may encounter performance issues when
working with extremely large datasets or complex queries. In such cases, additional
hardware resources or optimization strategies may be required to maintain
acceptable performance levels.
• Data Security and Governance: Tableau's self-service analytics capabilities
can pose challenges in terms of data security and governance. Without proper
controls in place, there's a risk of sensitive data being exposed or misused,
potentially leading to compliance issues or breaches of confidentiality.
• Limited Data Manipulation Capabilities: While Tableau excels at data
visualization and analysis, its data manipulation capabilities are somewhat limited
compared to dedicated data preparation tools or programming languages like Python.

Installation of Tableau Public:

Installing Tableau Public on Windows is a straightforward process. Here's a step-by-step guide:

1. Download Tableau Public: Visit the Tableau Public website at
https://www.tableau.com/product/public/download and click on the "Download Tableau Public"
button. This will start the download process for the Tableau Public installer.

2. Run the Installer: Once the download is complete, locate the downloaded installer
file (typically in your Downloads folder) and double-click on it to run the installer.

3. Accept the License Agreement: The installer will prompt you to review and accept
the Tableau Public License Agreement. Read through the agreement and click on the "I accept
the terms of this agreement" checkbox if you agree. Then, click on the "Install" button to
proceed.

4. Choose Installation Location: Next, you'll be asked to choose the installation location
for Tableau Public. You can either accept the default location or specify a different folder. Once
you've made your selection, click on the "Install" button to continue.

5. Installation Progress: The installer will now begin installing Tableau Public on your
Windows system. This process may take a few minutes, so be patient.

6. Finish Installation: Once the installation is complete, you'll see a confirmation message
indicating that Tableau Public has been successfully installed. Click on the "Finish" button to
close the installer.

7. Launch Tableau Public: After the installation is complete, you can launch Tableau
Public by finding it in your list of installed programs or by double-clicking on the desktop
shortcut, if one was created during the installation process.

8. Sign In or Create an Account: When you launch Tableau Public for the first time,
you'll be prompted to sign in with your Tableau Public account or create a new one. Follow the
on-screen instructions to sign in or create an account.

Tableau Public Interface:


The Tableau Public interface is designed to be intuitive and user-friendly, featuring
drag-and-drop functionality that allows users to easily create visualizations. The main
workspace includes a central canvas for building and viewing visualizations, surrounded by
various panels such as the Data pane on the left, where users can connect and manage their data
sources, and the Analytics pane, offering tools for enhancing visualizations with trends,
forecasts, and other analytical features. Menus and toolbars at the top provide access to file
operations, visualization types, and customization options. This layout supports an efficient
workflow, enabling users to focus on data exploration.

Connections:

In Tableau, a "connection" refers to the link established between Tableau and a data
source. This connection allows Tableau to access and analyze data from various sources,
including databases, spreadsheets, cloud services, and more. Understanding connections is
fundamental to using Tableau effectively as they determine how data is accessed, processed,
and visualized.

Types of connections:
1. Live Connections:

 With a live connection, Tableau queries the data source directly in real-time whenever
visualizations are interacted with.

 Changes or updates in the underlying data source are immediately reflected in Tableau
visualizations.

 Ideal for analyzing real-time or frequently changing data.

 Offers the most up-to-date view of the data but may be slower in performance compared
to extracts for large datasets or complex queries.

2. Extract Connections:

 With an extract connection, Tableau imports a subset or entire dataset from the data
source into its proprietary data engine (TDE or Hyper).

 Once extracted, Tableau works with the data locally, enabling faster performance and
offline access.

 Extracts are useful for working with large datasets, improving performance, or when a
live connection is not feasible or efficient.

 Users can schedule extract refreshes to keep the data up-to-date or manually refresh as
needed.
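The difference between the two connection types can be sketched in a few lines of Python. This is only an illustration of the idea, not how Tableau is implemented: the in-memory SQLite database, the orders table, and the function names are all assumptions made for the example.

```python
import sqlite3

# Hypothetical in-memory database standing in for the real data source.
source = sqlite3.connect(":memory:")
source.execute("CREATE TABLE orders (id INTEGER, sales REAL)")
source.executemany("INSERT INTO orders VALUES (?, ?)", [(1, 100.0), (2, 250.0)])


def live_total(conn):
    """Live connection: query the source every time the view is refreshed."""
    return conn.execute("SELECT SUM(sales) FROM orders").fetchone()[0]


def take_extract(conn):
    """Extract connection: copy the rows into a local snapshot once."""
    return conn.execute("SELECT id, sales FROM orders").fetchall()


def extract_total(snapshot):
    """Later queries read the local snapshot, not the source."""
    return sum(sales for _, sales in snapshot)


extract = take_extract(source)

# A new order arrives in the source after the extract was taken.
source.execute("INSERT INTO orders VALUES (3, 50.0)")

print(live_total(source))       # the live query sees the new row
print(extract_total(extract))   # the extract still holds the old snapshot

extract = take_extract(source)  # refreshing the extract picks up the change
print(extract_total(extract))
```

The stale total printed in the middle is the trade-off described above: an extract is faster to query but only as fresh as its last refresh.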

How to Import Data Source:

To import a data source into Tableau, open Tableau Desktop and navigate to the
"Connect" pane on the left side of the Start Page. Select the type of data source you want to
connect to, such as a file (e.g., Excel, CSV) or a server-based database (e.g., SQL Server,
MySQL). Provide the necessary connection details, like file path or server credentials, and click
"Connect." Once connected, you'll be taken to the Data Source page, where you can preview the
data, configure joins, apply filters, and perform any necessary data preparations.

Data Types:
In Tableau, data types play a crucial role in defining how data is interpreted, formatted,
and displayed within visualizations. Tableau's field data types are whole numbers, decimal
numbers, strings, dates, dates with times, Booleans, and geographic values. Currency and
percentage are number formats applied to numeric fields, and images are handled through an
image role on string fields that hold URLs, rather than being separate data types. Each data
type serves a specific purpose and comes with its own
formatting and aggregation options. Understanding and correctly assigning data types in
Tableau is essential for accurate analysis and visualization of data. By ensuring that data types
are properly defined, users can effectively leverage Tableau's powerful features to gain insights,
make informed decisions, and communicate findings with clarity and precision.

Here are the common data types used in Tableau along with examples:

1. Integer (Number, whole):
• Represents whole numbers without decimal places.
• Example: 10, 25, -100

2. Decimal/Float (Number, decimal):
• Represents numbers with decimal places.
• Example: 3.14, 0.75, -123.45

3. String/Character:
• Represents text or alphanumeric characters.
• Example: "John Doe", "Sales Department", "ABC123"

4. Date:
• Represents dates without time information.
• Example: 2022-05-20, 10/15/2021, May 5, 2023

5. Datetime/Timestamp (Date & Time):
• Represents both date and time information.
• Example: 2022-05-20 08:30:00, 10/15/2021 14:45:30

6. Boolean:
• Represents true or false values.
• Example: True, False

7. Geographical:
• Represents geographical data such as latitude and longitude coordinates.
• Example: (40.7128° N, 74.0060° W), (34.0522° N, 118.2437° W)

The remaining entries are formats or roles applied on top of the types above:

8. Currency (number format):
• Displays numeric values with a specific currency symbol.
• Example: $100, €50, ¥5000

9. Percentage (number format):
• Displays numeric values as percentages.
• Example: 25%, 50.5%, 100%

10. Image (image role):
• Marks a string field whose values are URLs pointing to image resources.
• Example: [URL to an image]
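To make the idea of assigning a data type concrete, the sketch below guesses a type for a raw text value by trying the narrowest interpretation first. It is a simplified illustration in Python, not Tableau's actual detection logic; the function name infer_type and the two accepted date formats are assumptions chosen for the example.

```python
from datetime import datetime


def infer_type(text):
    """Return (type_name, value) for a raw text cell, trying narrow types first."""
    lowered = text.strip().lower()
    if lowered in ("true", "false"):
        return "Boolean", lowered == "true"
    try:
        return "Number (whole)", int(text)
    except ValueError:
        pass
    try:
        return "Number (decimal)", float(text)
    except ValueError:
        pass
    # Only two ISO-style layouts are recognised in this sketch.
    for fmt, name in (("%Y-%m-%d %H:%M:%S", "Date & Time"), ("%Y-%m-%d", "Date")):
        try:
            parsed = datetime.strptime(text, fmt)
        except ValueError:
            continue
        return name, parsed if name == "Date & Time" else parsed.date()
    # Anything that fits no narrower type stays a string.
    return "String", text


for cell in ["10", "3.14", "2022-05-20", "2022-05-20 08:30:00", "True", "ABC123"]:
    print(cell, "->", infer_type(cell)[0])
```

A value such as "ABC123" falls through every check and stays a string, which is why a column that mixes numbers and text is usually typed as a string.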

Data Interpreter:
Data Interpreter in Tableau is a feature that helps clean and prepare messy or poorly
structured data for analysis. When you import a file, such as an Excel spreadsheet, Data
Interpreter automatically detects and interprets the structure of your data, identifying headers,
sub-tables, merged cells, and other formatting issues. It then suggests a more organized version
of the data, making it easier to work with in Tableau. This feature is particularly useful for
handling data that originates from non-database sources where the layout is not immediately
analysis-friendly. You can enable Data Interpreter from the Data Source page by clicking on the
"Use Data Interpreter" option.

Worksheet:
Tableau, a powerful data visualization tool, offers the functionality to create interactive
dashboards and worksheets. Worksheets are individual visualizations or charts that are used to
present data in Tableau. Here's a breakdown of worksheets in Tableau:

• Creation: Worksheets are created within Tableau's interface. Users can drag and drop
fields from the data pane onto the shelves (rows, columns, marks, filters) to create
various types of visualizations such as bar charts, line charts, scatter plots, maps, and
more.
• Data Visualization: Each worksheet represents a single view of the data. Users can
choose from a variety of visualization types to represent their data in the most effective
way possible.
• Customization: Tableau offers extensive customization options for worksheets. Users
can format colors, fonts, labels, tooltips, axes, and other elements to tailor the
visualization.
• Interactivity: Worksheets in Tableau can be made interactive by adding filters,
parameters, actions, and tooltips. This interactivity allows users to explore the data
dynamically and gain insights from different perspectives.
• Integration: Worksheets can be combined to create dashboards. Dashboards in Tableau
provide a way to present multiple visualizations together, allowing users to see different
aspects of the data in one consolidated view.
• Sharing and Collaboration: Tableau worksheets can be shared with others through
Tableau Server or Tableau Cloud (formerly Tableau Online). Users can collaborate on
worksheets, create subscriptions for automated updates, and control access to the data
based on permissions.
• Performance Optimization: Tableau provides features for optimizing worksheet
performance, such as data extracts, data blending, and efficient use of filters and
aggregations, to ensure that visualizations are rendered quickly even with large datasets.
• Integration with Other Tools: Tableau can integrate with other tools and data sources,
allowing users to combine data from multiple sources in their worksheets and
visualizations.

Charts:
In Tableau, a chart refers to a graphical representation of data that visualizes
relationships, trends, distributions, and comparisons. Tableau provides a wide variety of chart
types that users can create and customize to effectively communicate insights from their data.

Types of Charts:
Tableau Public offers a wide variety of chart types to visualize data, including but not
limited to:

1. Bar Chart: A bar chart is a type of data visualization that represents categorical data with
rectangular bars, where the length or height of each bar corresponds to the value it represents.

2. Line Chart: A line chart is a type of data visualization that displays data points connected
by straight lines.

3. Area Chart: An area chart is similar to a line chart, but the area below the line is filled
with color or shading to represent the cumulative effect of the data points.

4. Scatter Plot: A scatter plot is a two-dimensional data visualization that represents
individual data points as dots on a graph.

5. Pie Chart: A pie chart is a circular data visualization that represents the proportions of
different categories within a dataset as slices of a pie.

6. Histogram: A histogram is a graphical representation of the distribution of numerical
data, where data points are grouped into bins or intervals, and the height of each bar represents
the frequency or count of data points within that bin.

7. Heat Map: A heat map is a graphical representation of data where values are represented
as colors within a matrix.

8. Gantt Chart: A Gantt chart is a type of bar chart that represents the schedule, progress,
and dependencies of tasks or activities over time.

9. Bullet Graph: A bullet graph is a variation of a bar chart that is used to compare the
performance or progress of a single measure against predefined qualitative or quantitative
targets.

10. Waterfall Chart: A waterfall chart is a specialized type of column chart that is used to
illustrate the cumulative effect of positive and negative values on a starting point.
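The "cumulative effect" behind a waterfall chart is a running level that each bar starts from. The short Python sketch below computes where every bar would begin and end; the function name and the sample figures are made up for illustration and do not come from the project data.

```python
def waterfall_steps(start, changes):
    """Return (label, bar_start, bar_end) for each change, plus the final level."""
    steps, level = [], start
    for label, delta in changes:
        # Each bar begins where the previous one ended.
        steps.append((label, level, level + delta))
        level += delta
    return steps, level


steps, final = waterfall_steps(
    1000, [("Sales", 500), ("Returns", -200), ("Costs", -400)]
)
for label, bar_start, bar_end in steps:
    print(f"{label}: {bar_start} -> {bar_end}")
print("Final:", final)
```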

Set:
In Tableau, sets are custom fields that define a subset of data based on certain conditions.
They are useful for creating groups of data points that share common characteristics or for
isolating specific data for analysis. Sets come in two types:

• Static Set: A static set is manually defined by selecting specific members from a
dimension or field. Once created, the members within a static set remain fixed unless
manually modified.
• Dynamic Set: A dynamic set is defined based on conditions or criteria, and it
automatically updates as the underlying data changes. Dynamic sets are useful for
creating subsets of data based on changing criteria.
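The contrast between the two kinds of set can be shown with a small Python sketch. The customer names, the sales figures, and the 500 threshold are hypothetical; the point is only that a static set is a fixed list of members while a dynamic set is recomputed from a condition.

```python
sales_by_customer = {"Alice": 900, "Bob": 400, "Chen": 1200}

# Static set: members were picked by hand and never change on their own.
static_set = {"Alice", "Chen"}


def dynamic_set(sales, threshold=500):
    """Dynamic set: membership is recomputed from a condition on the data."""
    return {name for name, total in sales.items() if total > threshold}


before = dynamic_set(sales_by_customer)
sales_by_customer["Bob"] = 650  # the underlying data changes
after = dynamic_set(sales_by_customer)

print(sorted(static_set))  # unchanged by the update
print(sorted(before))
print(sorted(after))       # Bob now satisfies the condition
```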

Combined Set:
In Tableau Public, a combined set is a new set built by comparing the members of two
existing sets that are based on the same dimension. Sets are subsets of data that meet certain
criteria defined by the user. When two sets are combined, Tableau can keep all members of both
sets, only the members shared by both sets, or the members of one set excluding those shared
with the other. Combined sets are particularly useful for analyzing overlapping segments of
the data, for example customers who fall in the top group for both sales and profit.
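Combining two sets corresponds to ordinary set algebra. The sketch below uses Python sets to mirror the choices offered when two sets on the same dimension are combined (all members, shared members, or one set except the shared members); the two example sets are hypothetical.

```python
# Hypothetical members of two sets built on the same dimension (Customer Name).
top_by_sales = {"Alice", "Bob", "Chen"}
top_by_profit = {"Bob", "Chen", "Dana"}

combined = {
    "all members in both sets": top_by_sales | top_by_profit,
    "shared members in both sets": top_by_sales & top_by_profit,
    "first set except shared members": top_by_sales - top_by_profit,
    "second set except shared members": top_by_profit - top_by_sales,
}

for option, members in combined.items():
    print(f"{option}: {sorted(members)}")
```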

Group:
In Tableau, grouping allows you to combine multiple members of a dimension into a
single group for analysis or visualization purposes. Groups can be used for various
purposes, such as simplifying data analysis, creating hierarchies, or aggregating data
for easier visualization.

Difference between Set and Group:

Set:
• Purpose: to define a custom subset of data based on specific conditions or rules
• Can be created based on computed conditions, formulas, or selected data points
• Can be dynamic (updated based on data changes) or static
• Used for advanced calculations, filters, and complex logic

Group:
• Purpose: to combine multiple dimension members into higher-level categories
• Created manually by selecting dimension members and grouping them together
• Typically static (manual adjustments needed for changes)
• Used for simplifying and categorizing data in visualizations
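The distinction can be seen by applying both ideas to the same rows. In this Python sketch (sub-category names, sales values, and the threshold of 60 are invented for illustration), a group relabels members into a higher-level category, while a set only marks each member as In or Out.

```python
rows = [("Chairs", 300), ("Tables", 150), ("Paper", 40), ("Binders", 60)]

# Group: selected members are relabelled into one higher-level category.
group_of = {"Chairs": "Furniture", "Tables": "Furniture"}

# Set: each member is simply In or Out, here decided by a computed condition.
in_set = {name for name, sales in rows if sales >= 60}

grouped_totals = {}
for name, sales in rows:
    label = group_of.get(name, "Other")
    grouped_totals[label] = grouped_totals.get(label, 0) + sales

set_totals = {"In": 0, "Out": 0}
for name, sales in rows:
    set_totals["In" if name in in_set else "Out"] += sales

print(grouped_totals)
print(set_totals)
```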

Parameter:
In Tableau, parameters are dynamic values that allow users to control certain aspects of
the visualization, such as filters, calculations, or reference lines. They provide interactivity and
flexibility to your dashboards. Here's how you can work with parameters in Tableau.

1. Creating Parameters:

• To create a parameter, click the drop-down arrow at the top of the Data pane and
select "Create Parameter".
• Define the parameter's properties, including its data type (e.g., integer, float,
string, date), allowable values (all, list, or range), and current value.

2. Using Parameters in Calculations:

• Once you've created a parameter, you can use it in calculations to dynamically
change the behavior of your visualizations.
• For example, you can create a calculated field that filters data based on a parameter
value, performs mathematical operations using the parameter, or dynamically adjusts
the size or color of data points.

3. Using Parameters as Filters:

• Parameters can also be used as filters to allow users to interactively select values to
filter the data.
• You can create calculated fields that compare the parameter value to the values in
your data and return a Boolean result (true/false) to determine which data points
should be included in the visualization.

4. Parameter Controls:

• Tableau builds a parameter control from the data type and allowable values you
specify when creating the parameter. To display it, right-click the parameter and
select "Show Parameter".
• Parameter controls can be displayed as dropdown lists, radio-button lists, sliders, or
type-in boxes, depending on the parameter settings.

5. Dynamic Reference Lines and Bands:

• Parameters can be used to dynamically adjust reference lines or bands in your
visualization.
• For example, you can create a parameter that allows users to set a target value, and
then use that parameter to dynamically adjust the position of a reference line on a
chart.

6. Parameter Actions:

• Parameter actions allow you to dynamically change parameter values based on user
interactions with the dashboard.
• For example, you can set up a parameter action that updates the parameter value
when a user clicks on a data point in the visualization.
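The pattern of using a parameter as a filter (a calculated field compares each row with the parameter and returns true or false) can be sketched in Python as follows. The order rows, the function names, and the target values are assumptions made for the example.

```python
orders = [
    {"product": "Chair", "profit": 120.0},
    {"product": "Desk", "profit": -35.0},
    {"product": "Lamp", "profit": 15.0},
]


def profit_flag(order, target):
    """Calculated field: compare each row with the current parameter value."""
    return order["profit"] >= target


def view(rows, target):
    """Keep only the rows where the calculated field is True."""
    return [row["product"] for row in rows if profit_flag(row, target)]


# Changing the parameter changes the view without touching the data.
print(view(orders, 0))
print(view(orders, 100))
```

The data never changes here; only the single value passed as target does, which is what makes a parameter different from an ordinary filter on a field.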

Calculation Field:
A calculated field in Tableau is a new data field created by applying a formula to
existing data fields within your dataset. Calculated fields provide a powerful way to
enrich your data analysis by enabling custom calculations and transformations.
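A common example is a profit ratio. The Python sketch below contrasts a row-level formula, comparable to [Profit] / [Sales], with an aggregate formula, comparable to SUM([Profit]) / SUM([Sales]). The two sample rows are invented, and the sketch only illustrates why the two formulas give different answers.

```python
rows = [
    {"sales": 200.0, "profit": 50.0},
    {"sales": 800.0, "profit": 40.0},
]

# Row-level calculated field, like [Profit] / [Sales]: one value per row.
row_ratio = [r["profit"] / r["sales"] for r in rows]

# Aggregate calculated field, like SUM([Profit]) / SUM([Sales]).
agg_ratio = sum(r["profit"] for r in rows) / sum(r["sales"] for r in rows)

# Averaging the row-level ratios is not the same as the aggregate ratio.
avg_of_rows = sum(row_ratio) / len(row_ratio)

print(row_ratio)
print(round(agg_ratio, 4), round(avg_of_rows, 4))
```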

Filters:
In Tableau, filters are used to limit the data displayed in your visualizations based on
certain criteria. They allow you to focus on specific subsets of your data or exclude irrelevant
information.

Types of Filters:
In Tableau, there are six types of filters. They are as follows:

1. Extract Filters
2. Data Source Filters
3. Context Filters
4. Dimension Filters
5. Measure Filters
6. Table Calculation Filters

1. Extract Filter:

• Extract filters allow you to filter data before extracting it into a Tableau data extract
(hyper file).
• These filters can help optimize performance and reduce the size of the data extract
by excluding unnecessary data.

2. Data Source Filter:

• In Tableau, a data source filter is a filter applied directly at the data source level,
rather than at the visualization level.
• This type of filter is commonly used when connecting to data sources such as
databases or cubes.

3. Context Filter:

• Context filters are applied before other filters in the visualization, allowing you to
control the order of operations for filtering.
• Context filters can improve performance by reducing the amount of data processed
by subsequent filters.

4. Dimension Filter:

• In Tableau, dimension filters allow you to restrict the data displayed in your
visualizations based on specific values or categories within dimension fields.
• Dimensions represent qualitative data, such as names, categories, or geographic
regions.

5. Measure Filter:

• Measure filters are applied to numeric fields (measures) and allow you to define a
range of values to include or exclude in your visualization.
• You can set up sliders or input boxes to specify the range of values, such as filtering
sales data to a specific price range.

6. Table Calculation Filter:

• Table calculations in Tableau allow you to perform calculations on your data that are
based on the data displayed in the visualization.
• A table calculation filter uses such a calculation as its condition. It is applied last
and only hides marks, so it does not change the data on which the other calculations
are computed.

Filtering Order of Operations:


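Tableau performs actions on our view in a specific order, called the Order of
Operations. For the filters listed above, the order is extract, data source, context,
dimension, measure, and then table calculation filters. Knowing this order helps to avoid
filter conflicts and achieve efficiency with the dashboard.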
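The filtering order can be pictured as a pipeline in which each stage only sees the rows that survived the previous one. The Python sketch below is a simplified model under stated assumptions: the sample rows and the conditions are hypothetical, and the measure filter is applied per row here, whereas Tableau applies it to aggregated values.

```python
rows = [
    {"year": 2022, "region": "East", "category": "Furniture", "sales": 500},
    {"year": 2023, "region": "East", "category": "Furniture", "sales": 300},
    {"year": 2023, "region": "East", "category": "Technology", "sales": 900},
    {"year": 2023, "region": "West", "category": "Technology", "sales": 700},
    {"year": 2023, "region": "East", "category": "Office Supplies", "sales": 50},
]

# Hypothetical filters, listed in the order in which they are applied.
pipeline = [
    ("extract", lambda r: r["year"] >= 2023),
    ("data source", lambda r: r["sales"] > 0),
    ("context", lambda r: r["region"] == "East"),
    ("dimension", lambda r: r["category"] != "Office Supplies"),
    ("measure", lambda r: r["sales"] >= 100),
]

remaining = rows
for stage, keep in pipeline:
    remaining = [r for r in remaining if keep(r)]
    print(f"after {stage} filter: {len(remaining)} rows")

# Table calculation filter: the running total is computed over every
# remaining row first, and the filter then only hides marks from the view.
running, visible = 0, []
for r in remaining:
    running += r["sales"]
    if running > 300:
        visible.append((r["category"], running))

print(visible)
```

The last step shows why table calculation filters come last: the hidden Furniture row still contributes to the running total of the Technology row that remains visible.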

Operators:
In Tableau, operators are symbols or keywords used to perform mathematical, logical,
or comparison operations within calculated fields, filters, and other expressions. These
operators allow you to manipulate and analyze your data effectively.

Types of Operators:
Here are some common operators in Tableau:

1. Arithmetic Operators:

• Addition (+): [Sales] + [Profit]
• Subtraction (-): [Revenue] - [Expenses]
• Multiplication (*): [Quantity] * [Unit Price]
• Division (/): [Total Sales] / [Number of Customers]
• Modulus (%): [Number] % 2 (returns the remainder when dividing [Number] by 2)

2. Comparison Operators:

• Equal to (=): [Region] = 'North'
• Not equal to (<>): [Product Category] <> 'Furniture'
• Greater than (>): [Profit] > 10.1
• Less than (<): [Sales] < 500
• Greater than or equal to (>=): [Discount] >= 0.1
• Less than or equal to (<=): [Quantity] <= 10

3. Logical Operators:

• AND: [Category] = 'Office Supplies' AND [Sub-Category] = 'Paper'
• OR: [Segment] = 'Corporate' OR [Segment] = 'Consumer'
• NOT: NOT [Region] = 'South'

4. String Operators and Functions:

• Concatenation (+): [First Name] + ' ' + [Last Name]
• Pattern matching (Tableau has no LIKE operator): REGEXP_MATCH([Product Name], 'chair')
• CONTAINS: CONTAINS([Product Name], 'apple')
• STARTSWITH: STARTSWITH([City], 'New')
• ENDSWITH: ENDSWITH([Name], 'Inc.')

5. Conditional Expressions:

• IF-THEN-ELSE: IF [Profit] > 0 THEN 'Profitable' ELSE 'Unprofitable' END
• CASE: CASE [Segment] WHEN 'Corporate' THEN 'Large Business' WHEN
'Consumer' THEN 'Individual' ELSE 'Other' END

6. Set Operators:

• IN: [Region] IN ('East', 'West')
• Unions, intersections, and differences of sets are not written as operators; they are
built by combining two sets into a combined set.

Function:
In Tableau, functions are predefined operations that allow you to perform various
calculations, manipulations, and transformations on your data. These functions can be used in
calculated fields, filters, parameters, and other expressions to analyze and visualize your data
effectively.

Types of functions:
Certainly! Here's a list of different types of functions in Tableau along with examples:

1. Aggregate Functions:

 Example: Calculating the total sales amount: SUM([Sales])

2. Date Functions:

 Example: Extracting the year from a date field: YEAR ([Order Date])

3. String Functions:

 Example: Converting text to uppercase: UPPER ([Product Name])

4. Logical Functions:

 Example: Evaluating a condition using IF: IF [Profit] > 0 THEN 'Profitable' ELSE 'Unprofitable' END

5. Table Calculation Functions:

 Example: Calculating a running total of sales: RUNNING_SUM(SUM([Sales]))

6. Statistical Functions:

 Example: Calculating the standard deviation of sales: STDEV([Sales])

7. Geospatial Functions:

 Example: Calculating the distance between two points: DISTANCE(MAKEPOINT([Latitude 1], [Longitude 1]), MAKEPOINT([Latitude 2], [Longitude 2]), 'km')

8. User Functions:

 Example: Checking if the user is an administrator: ISMEMBEROF('Administrator')

9. Data Type Conversion Functions:

 Example: Converting a string to a date: DATEPARSE('yyyy-MM-dd', [Date String])

10. Set Functions:

 Example: Testing set membership in a calculation (a set behaves as a Boolean field): IF [Set A] THEN 'In set' ELSE 'Not in set' END

11. Cube Functions:

 Example: Ranking members by sales (in Tableau, RANK is a table calculation applied to an aggregate): RANK(SUM([Sales]))

12. Window Functions:

 Example: Calculating a moving average of sales: WINDOW_AVG(SUM([Sales]), -2, 0)

13. Financial Functions:

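 Example: Calculating the present value of a series of equal payments. Tableau has no built-in PV function, so the annuity formula is written out: [Payment] * (1 - POWER(1 + [Rate], -[Periods])) / [Rate]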

14. Data Source Functions:

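 Example: Refreshing a data source extract. This is not done with a calculation function; the extract is refreshed from the Data menu (Data > [data source name] > Extract > Refresh).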

15. Data Functions:

 Example: Creating a calculated field using a Python script (requires an analytics extension such as TabPy): SCRIPT_REAL('return [x * 2 for x in _arg1]', SUM([Sales]))
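
To make the table calculation, window, and statistical examples above more concrete, here is a minimal Python sketch of what RUNNING_SUM, WINDOW_AVG with offsets -2 to 0, and STDEV compute. The function names `running_sum` and `window_avg` and the sales figures are hypothetical, and the sketch assumes the rows are already in the order in which the calculation is addressed.

```python
from itertools import accumulate
from statistics import stdev

# Hypothetical monthly sales totals, in the order they appear in the view.
monthly_sales = [100.0, 150.0, 120.0, 180.0, 90.0]

def running_sum(values):
    """Cumulative total, comparable to RUNNING_SUM(SUM([Sales]))."""
    return list(accumulate(values))

def window_avg(values, start, end):
    """Average over offsets start..end relative to each row, clipped at the edges.

    window_avg(values, -2, 0) is comparable to WINDOW_AVG(SUM([Sales]), -2, 0).
    """
    result = []
    for i in range(len(values)):
        lo = max(0, i + start)
        hi = min(len(values) - 1, i + end)
        window = values[lo:hi + 1]
        result.append(sum(window) / len(window))
    return result

totals = running_sum(monthly_sales)
moving = window_avg(monthly_sales, -2, 0)
spread = stdev(monthly_sales)  # sample standard deviation, as STDEV returns
```

The first two rows of the moving average use fewer than three values because there are no earlier rows to include.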

Sorting:
Sorting in Tableau Public is the process of arranging data in a specified order to enhance
clarity and facilitate analysis within visualizations. You can sort data in ascending or
descending order based on values in a field, whether it's a dimension (like names or categories)
or a measure (like sales or profit). Sorting can be applied to various chart types, such as bar
charts or tables, to highlight trends, rankings, or to make comparisons more intuitive. Tableau
offers multiple ways to sort, including manual sorting, sorting by specific fields, or using the
default sort order. This functionality helps users quickly uncover insights and present data in a
more organized and interpretable manner.
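
The three ways of sorting mentioned above (by a dimension, by a measure, and manually) can be sketched in Python as follows. The sub-category names and sales values are hypothetical.

```python
# Hypothetical sales by sub-category.
sales_by_subcategory = {"Chairs": 320.0, "Paper": 75.0, "Phones": 410.0, "Binders": 150.0}

# Sort by the dimension (alphabetical, ascending).
by_name = sorted(sales_by_subcategory)

# Sort by the measure (sales, descending) to produce a ranking.
by_sales_desc = sorted(sales_by_subcategory, key=sales_by_subcategory.get, reverse=True)

# Manual sort: an explicit order chosen by the author of the view.
manual_order = ["Phones", "Paper", "Chairs", "Binders"]
by_manual = sorted(sales_by_subcategory, key=manual_order.index)
```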

Reference Line:
In Tableau, reference lines are horizontal, vertical, or trend lines that you can add to
your visualizations to provide context or highlight specific values. These lines help users
understand the data by visually indicating key points or benchmarks.

Here's how you can work with reference lines in Tableau:

1. Adding Reference Lines:

 To add a reference line to your visualization, right-click on an axis or a data point in the view and select "Add Reference Line."

 You can choose from various types of reference lines, including constant, average,
median, percentiles, and more.

2. Customizing Reference Lines:

 Once a reference line is added, you can customize its appearance and behavior by
editing its properties.

 You can adjust the line type, color, label, tooltip, and other settings to match your
visualization requirements.

3. Types of Reference Lines:

 Constant Line: Represents a fixed value on the axis.

 Average Line: Represents the average value of the measure.

 Median Line: Represents the median value of the measure.

 Percentile Line: Represents a specific percentile of the measure (e.g., 90th percentile).

 Minimum/Maximum Line: Represents the minimum or maximum value of the measure.

 Trend Line: Represents the trend or pattern in the data using a mathematical model
(e.g., linear, exponential).

4. Scope and Calculation:

 You can specify the scope of the reference line so that it is computed for the entire table, per pane, or per cell.

 For trend lines, you can choose the type of mathematical model and specify the
order of the polynomial.

5. Interactivity:

 Reference lines can be interactive, allowing users to hover over or click on them to
view additional information.

 You can enable tooltips to display details such as the value represented by the
reference line.

6. Multiple Reference Lines:

 You can add multiple reference lines to a visualization to provide additional context
or compare multiple benchmarks.

 Tableau allows you to add multiple reference lines for different measures or
categories.
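
As a rough illustration of the values behind each type of reference line, the sketch below computes them for one list of numbers. The helper names `percentile` and `reference_values`, the profit figures, and the target of 40 are hypothetical, and the percentile uses linear interpolation between ranks, which is one common convention and may differ from the method Tableau applies.

```python
from statistics import mean, median

def percentile(values, pct):
    """Percentile with linear interpolation between ranks (one common convention)."""
    ordered = sorted(values)
    position = (len(ordered) - 1) * pct / 100
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = position - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction

def reference_values(values, target):
    """Return the value at which each type of reference line would be drawn."""
    return {
        "constant": target,
        "average": mean(values),
        "median": median(values),
        "percentile_90": percentile(values, 90),
        "minimum": min(values),
        "maximum": max(values),
    }

# Hypothetical profit per sub-category and a hypothetical target of 40.
profits = [10, 20, 30, 40, 50]
lines = reference_values(profits, target=40)
```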

Forecast:
In Tableau, you can create forecasts to predict future trends based on historical data in
your visualizations. With just a few clicks, you can add a forecast line to your chart, extending
into the future. Tableau offers customizable options for adjusting the forecast period,
confidence levels, and model types to suit your analysis needs. The forecasted values are
displayed alongside the actual data, allowing you to compare and evaluate the predicted trends.
Forecasting in Tableau empowers users to make data-driven decisions by providing insights
into potential outcomes and trends, helping to inform strategic planning and decision-making
processes.
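
Tableau's forecasting is based on exponential smoothing. The sketch below shows only the simplest member of that family, simple exponential smoothing without trend or seasonality, so it should be read as an illustration of the idea rather than as Tableau's actual model selection. The function name, the sales history, and the smoothing factor alpha = 0.5 are assumptions made for the example.

```python
def exponential_smoothing_forecast(history, alpha, periods):
    """Simple exponential smoothing.

    The level is a weighted blend of the newest observation and the previous
    level; the forecast repeats the final level for every future period.
    """
    if not history:
        raise ValueError("history must not be empty")
    if not 0 < alpha <= 1:
        raise ValueError("alpha must be in (0, 1]")
    level = history[0]
    for value in history[1:]:
        level = alpha * value + (1 - alpha) * level
    return [level] * periods

# Hypothetical monthly sales; alpha = 0.5 is an arbitrary illustrative choice.
history = [100.0, 120.0, 110.0, 130.0]
forecast = exponential_smoothing_forecast(history, alpha=0.5, periods=3)
```

A larger alpha makes the forecast follow recent values more closely, while a smaller alpha gives more weight to older history.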

Trend Line:
In Tableau, trend lines are graphical representations of the direction and pattern of data
points in a visualization. They help users understand the overall trend or pattern in the data by
fitting a mathematical model to the plotted points.
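
For the linear model, fitting a trend line means finding the slope and intercept that minimize the squared vertical distances to the points. A minimal sketch using ordinary least squares is shown below; the function name `linear_trend` and the data are hypothetical.

```python
def linear_trend(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need at least two (x, y) pairs of equal length")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("x values must not all be identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical data: month index versus sales.
months = [1, 2, 3, 4, 5]
sales = [12.0, 14.0, 16.0, 18.0, 20.0]
slope, intercept = linear_trend(months, sales)
```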

Calculations:
A calculation is a custom formula that allows you to create new data fields or transform
existing data to enhance your analysis. Calculations enable you to perform operations such as
arithmetic, logical comparisons, string manipulations, date transformations, and aggregations
directly within Tableau. By using calculated fields, you can derive additional insights, create
dynamic measures and dimensions, and apply complex business logic to your visualizations, all
without needing to alter the underlying data source. This flexibility makes calculations a
powerful feature for tailoring your data analysis to specific needs and uncovering deeper
insights.

Types of Calculations:
There are three main types of calculations in Tableau. They are as follows:

1. Basic Calculations

2. Table Calculations

3. Level of Detail (LOD) Expressions

1. Basic Calculations:

 Row-level Calculations: Operations applied to individual rows of data.

 Example: ‘[Sales] * 1.1’

2. Table Calculations:

 Running totals, moving averages, percent of total, rank, and other calculations computed on the aggregated values shown in the view.

 Example: ‘RUNNING_SUM(SUM([Sales]))’

3. Level of Detail (LOD) Expressions:

 FIXED: Computes values using the specified dimension(s), independent of the view's context.

 Example: ‘{FIXED [Region] : SUM([Sales])}’

 INCLUDE: Computes values including the specified dimension(s) along with the dimensions in the view.

 Example: ‘{INCLUDE [Category] : AVG([Profit])}’

 EXCLUDE: Computes values excluding the specified dimension(s) from the dimensions in the view.

 Example: ‘{EXCLUDE [Year] : SUM([Sales])}’
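
The difference between a row-level calculation and a Level of Detail expression can be sketched in Python. The helper `fixed_sum` and the four order rows are hypothetical; the sketch only imitates what the expressions compute and does not reproduce Tableau's order of operations with filters.

```python
from collections import defaultdict

# Hypothetical order rows.
rows = [
    {"Region": "East", "Category": "Furniture", "Sales": 100.0},
    {"Region": "East", "Category": "Technology", "Sales": 200.0},
    {"Region": "West", "Category": "Furniture", "Sales": 50.0},
    {"Region": "West", "Category": "Technology", "Sales": 150.0},
]

def fixed_sum(rows, dims, measure):
    """Comparable to {FIXED dims : SUM(measure)}: one total per combination of
    dims, regardless of which other dimensions are in the view."""
    totals = defaultdict(float)
    for row in rows:
        key = tuple(row[d] for d in dims)
        totals[key] += row[measure]
    return dict(totals)

# Row-level calculation, like [Sales] * 1.1: evaluated once per row.
uplifted = [row["Sales"] * 1.1 for row in rows]

# {FIXED [Region] : SUM([Sales])} attached back to every row.
region_totals = fixed_sum(rows, ["Region"], "Sales")
share_of_region = [row["Sales"] / region_totals[(row["Region"],)] for row in rows]

# With Region and Category in the view, {EXCLUDE [Category] : SUM([Sales])}
# aggregates over the remaining view dimension, Region.
view_dims = ["Region", "Category"]
exclude_category = fixed_sum(rows, [d for d in view_dims if d != "Category"], "Sales")
```

In this small example the EXCLUDE result equals the FIXED result because Region is the only dimension left in the view once Category is removed.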


Relationship:
In Tableau, establishing relationships between data sources allows you to combine and
analyze data from multiple tables or sheets efficiently. This feature is particularly useful when
working with relational databases or different sheets within the same workbook.
Here's how you can establish relationships in Tableau:

1. Data Source Connections:

 When you connect to multiple data sources in Tableau, you can create
relationships between them based on common fields.

 Tableau automatically detects fields with matching names or similar data types and suggests creating relationships.

2. Defining Relationships:

 To define a relationship manually, drag a second table onto the canvas of the Data Source page and choose the matching fields in the Edit Relationship dialog.

 A relationship does not need a join type (e.g., inner join, left join) to be chosen up front; Tableau decides how to join at query time based on the fields used in the view. Under Performance Options you can optionally set the cardinality (e.g., one-to-one, one-to-many) and referential integrity.

3. Joining Tables:

 When creating relationships, Tableau performs joins between tables based on the defined relationships.

 Joins combine records from two tables into a single dataset based on
matching values in the specified fields.

4. Data Blending:

 In scenarios where direct relationships between tables are not possible, Tableau supports data blending.

 Data blending allows you to combine data from different data sources or
sheets based on common dimensions, even if there is no direct relationship
between them.

5. Working with Multiple Tables:

 After establishing relationships, you can use fields from multiple tables in
the same visualization or calculation.

 Tableau handles the underlying data blending and joins transparently,
allowing you to analyze the combined dataset seamlessly.

6. Optimizing Performance:

 To optimize performance, Tableau automatically generates efficient SQL queries based on the defined relationships and joins.

 You can further optimize performance by indexing fields used in relationships or joins in your database.

Joins:
In Tableau, joins facilitate the combination of data from multiple tables or sheets within
the same data source. By linking related data based on common fields, joins create a unified
dataset for analysis and visualization. Tableau offers various types of joins, including Inner,
Left, Right, and Full Outer joins, each serving different purposes in data integration. Join conditions
specify how fields from different tables are matched, ensuring accuracy in data merging. While
joins merge data at the database level, data blending is used to combine data from different
sources or sheets within the same workbook at the visualization level. Joins are essential for
integrating and analyzing data from relational databases, Excel files, or cloud sources, enabling
users to derive insights and make data-driven decisions effectively.

Types of joins:
The types of joins supported in Tableau are listed below, each with an equivalent SQL example:

1. Inner Join:

 Returns only the rows where there is a match in both tables.

 Example: SELECT * FROM Table1 INNER JOIN Table2 ON Table1.ID =
Table2.ID

2. Left Join (or Left Outer Join):

 Returns all rows from the left table and the matched rows from the right
table. If there is no match, NULL values are returned for the right table.

 Example: SELECT * FROM Table1 LEFT JOIN Table2 ON Table1.ID = Table2.ID

3. Right Join (or Right Outer Join):

 Returns all rows from the right table and the matched rows from the left
table. If there is no match, NULL values are returned for the left table.

 Example: SELECT * FROM Table1 RIGHT JOIN Table2 ON Table1.ID = Table2.ID

4. Full Outer Join:

 Returns all rows when there is a match in either table. If there is no match,
NULL values are returned for the missing side.

 Example: SELECT * FROM Table1 FULL OUTER JOIN Table2 ON Table1.ID = Table2.ID

5. Self Joins:

 Joins a table to itself.

 Example: SELECT * FROM Employees e1 JOIN Employees e2 ON e1.ManagerID = e2.EmployeeID
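
The behaviour of the join types above can be checked with a small Python sketch that joins two lists of dictionaries. The function `join`, the helper `columns`, and all table contents are hypothetical; None stands in for SQL NULL, and when both tables share a non-key column name the right table's value wins, which is a simplification.

```python
def columns(rows):
    """Ordered list of all column names that appear in rows."""
    seen = []
    for row in rows:
        for col in row:
            if col not in seen:
                seen.append(col)
    return seen

def join(left, right, key, how="inner"):
    """Join two lists of dicts on key; how is inner, left, right, or full."""
    if how not in {"inner", "left", "right", "full"}:
        raise ValueError(f"unknown join type: {how}")
    left_cols, right_cols = columns(left), columns(right)
    result = []
    matched_right = set()
    for l_row in left:
        matches = [i for i, r_row in enumerate(right) if r_row[key] == l_row[key]]
        for i in matches:
            matched_right.add(i)
            result.append({**l_row, **right[i]})
        if not matches and how in {"left", "full"}:
            # Keep the unmatched left row and fill the right columns with None.
            result.append({**l_row, **{c: None for c in right_cols if c not in l_row}})
    if how in {"right", "full"}:
        for i, r_row in enumerate(right):
            if i not in matched_right:
                # Keep the unmatched right row and fill the left columns with None.
                result.append({**{c: None for c in left_cols if c not in r_row}, **r_row})
    return result

# Hypothetical tables sharing an ID column.
table1 = [{"ID": 1, "Name": "A"}, {"ID": 2, "Name": "B"}, {"ID": 3, "Name": "C"}]
table2 = [{"ID": 2, "City": "X"}, {"ID": 3, "City": "Y"}, {"ID": 4, "City": "Z"}]

inner_rows = join(table1, table2, "ID", "inner")
left_rows = join(table1, table2, "ID", "left")
right_rows = join(table1, table2, "ID", "right")
full_rows = join(table1, table2, "ID", "full")

# Self join: pair each employee with their manager from the same table.
employees = [
    {"EmployeeID": 1, "Name": "Ana", "ManagerID": None},
    {"EmployeeID": 2, "Name": "Raj", "ManagerID": 1},
]
pairs = [
    (e1["Name"], e2["Name"])
    for e1 in employees
    for e2 in employees
    if e1["ManagerID"] == e2["EmployeeID"]
]
```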

Cross Database Joins:
A cross-database join allows you to combine and analyze data from different databases
or data sources as if they were part of the same database. This feature is particularly useful
when your data is stored in multiple disparate systems. It enables you to join tables from
different types of data sources, such as a combination of a SQL database and a cloud-based data
service, using a common field. This seamless integration expands your ability to analyze and
visualize data from multiple sources in a unified way.

Example: Consider two tables from different databases:

Orders Table (from a SQL Database):

Order ID   Customer ID   Order Amount
1          C001          250
2          C002          150
3          C003          300
4          C004          200

Customer Feedback Table (from a CSV File):

Customer ID   Feedback
C001          Excellent service
C002          Good quality
C003          Quick delivery
C005          Satisfactory

Cross-Database Join on Customer ID:

Performing a cross-database join on the ‘Customer ID’ field will result in:

Order ID   Customer ID   Order Amount   Feedback
1          C001          250            Excellent service
2          C002          150            Good quality
3          C003          300            Quick delivery
4          C004          200            NULL
NULL       C005          NULL           Satisfactory

In this example, data from the Orders table in the SQL database is combined with
the Customer Feedback table from the CSV file using a full outer join. The resulting table
includes all records from both sources, joined on the common ‘Customer ID’ field. Where there
is no match, ‘NULL’ values are shown. This cross-database join enables analysis across these
distinct data sets within a single Tableau visualization.
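
The example above can be reproduced outside Tableau with a short Python sketch. It is only an illustration of the idea: an in-memory SQLite database stands in for the SQL database, an in-memory string stands in for the CSV file, and the table and column names in the SQL statements are assumptions. None plays the role of NULL.

```python
import csv
import io
import sqlite3

# Source 1: an in-memory SQLite database stands in for the SQL database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (order_id INTEGER, customer_id TEXT, order_amount INTEGER)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, "C001", 250), (2, "C002", 150), (3, "C003", 300), (4, "C004", 200)],
)
orders = conn.execute(
    "SELECT order_id, customer_id, order_amount FROM orders ORDER BY order_id"
).fetchall()
conn.close()

# Source 2: an in-memory string stands in for the CSV file.
csv_text = """Customer ID,Feedback
C001,Excellent service
C002,Good quality
C003,Quick delivery
C005,Satisfactory
"""
feedback = {
    row["Customer ID"]: row["Feedback"]
    for row in csv.DictReader(io.StringIO(csv_text))
}

# Full outer join on Customer ID.
joined = []
seen_customers = set()
for order_id, customer_id, amount in orders:
    seen_customers.add(customer_id)
    joined.append((order_id, customer_id, amount, feedback.get(customer_id)))
for customer_id, text in feedback.items():
    if customer_id not in seen_customers:
        joined.append((None, customer_id, None, text))
```

Each source is read with its own tool and the join happens afterwards in memory, which is the same division of work that a cross-database join implies.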

Unions:
In Tableau, unions are a feature that allows you to combine data from multiple tables or
sheets with the same structure into a single dataset. This capability is particularly handy when
dealing with fragmented data across different files or tables that share identical schemas. By
performing a union, Tableau seamlessly merges the rows from each source, ensuring
consistency across columns. Whether you're appending rows from similar Excel files or uniting
data from matching CSV files, unions simplify the process of consolidating data for analysis
and visualization. This straightforward approach streamlines data integration, enabling users to
create comprehensive datasets without the need for complex data manipulation.

Types of Unions:
In Tableau, there are primarily two types of unions: Append and Union. Here's a
breakdown of each type along with examples:

1. Append Union:

 Description: Append union is used to stack the rows from the secondary data source
beneath the rows of the primary data source. Both data sources must have identical
column names and data types.

 Example: Appending data from two Excel files with the same structure

Excel1: [Order ID], [Product], [Quantity]


Excel2: [Order ID], [Product], [Quantity]

2. Union:

 Description: Union is used to combine the rows from the secondary data source
with the rows of the primary data source. It matches columns based on name and
data type, adding new columns if necessary.

 Example: Unioning data from two CSV files with the same structure:

CSV1: [Customer ID], [Name], [Age]


CSV2: [Customer ID], [Name], [Age]
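
A union that matches columns by name can be sketched in Python as follows. The function `union_tables` and the rows inside each table are hypothetical; a column missing from one table is filled with None for that table's rows.

```python
def union_tables(*tables):
    """Stack rows from several tables, matching columns by name."""
    all_columns = []
    for table in tables:
        for row in table:
            for col in row:
                if col not in all_columns:
                    all_columns.append(col)
    return [
        {col: row.get(col) for col in all_columns}
        for table in tables
        for row in table
    ]

# Hypothetical contents for two files with the same structure.
csv1 = [{"Customer ID": "C001", "Name": "Asha", "Age": 31}]
csv2 = [{"Customer ID": "C002", "Name": "Ravi", "Age": 27}]
combined = union_tables(csv1, csv2)

# A table with an extra column widens the result; rows from other tables get None.
csv3 = [{"Customer ID": "C003", "Name": "Meera", "Age": 45, "City": "Vallur"}]
widened = union_tables(csv1, csv3)
```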

Blending:
In Tableau, data blending is a method used to integrate and analyze data from multiple
sources or sheets within the same workbook. It allows users to combine datasets that cannot be
joined directly, for example because they are stored at different levels of detail. The sources
are linked on shared dimensions: Tableau queries each source separately, aggregates the
secondary source to the level of the linking fields, and shows the results from both primary and
secondary sources in a single visualization or dashboard. Data blending facilitates
comprehensive analysis by providing insights from diverse datasets, helping users
make informed decisions and discover meaningful connections within their data.
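
The idea of blending, aggregating each source on its own and then combining the results on a shared dimension, can be sketched in Python. The functions `aggregate` and `blend` and both sources are hypothetical, and the sketch assumes a single linking field.

```python
from collections import defaultdict

def aggregate(rows, dim, measure):
    """Total of measure for each value of dim."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[dim]] += row[measure]
    return dict(totals)

def blend(primary, secondary, link, primary_measure, secondary_measure):
    """Aggregate each source separately on the linking field, then attach the
    secondary totals to the primary ones (a left join after aggregation)."""
    primary_totals = aggregate(primary, link, primary_measure)
    secondary_totals = aggregate(secondary, link, secondary_measure)
    return {
        key: {primary_measure: total, secondary_measure: secondary_totals.get(key)}
        for key, total in primary_totals.items()
    }

# Hypothetical sources: order-level sales (primary) and region-level targets (secondary).
sales_rows = [
    {"Region": "East", "Sales": 100.0},
    {"Region": "East", "Sales": 200.0},
    {"Region": "West", "Sales": 150.0},
]
target_rows = [{"Region": "East", "Target": 250.0}, {"Region": "North", "Target": 90.0}]
blended = blend(sales_rows, target_rows, "Region", "Sales", "Target")
```

Regions that exist only in the secondary source (North here) do not appear in the result, and regions missing from the secondary source (West here) get no target value.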

Dashboard:
In Tableau, a dashboard serves as a comprehensive visual interface that brings together
various visualizations, charts, and elements to provide a holistic view of data insights. It's a
dynamic canvas where users can interactively explore data, analyze trends, and uncover
meaningful patterns. Tableau dashboards allow for the arrangement of multiple visualizations
on a single screen, enabling users to compare different aspects of their data in real-time. With
customizable layouts, filters, and interactivity features, dashboards empower users to tailor their
analytical experience to their specific needs. Whether presenting key performance indicators,
monitoring business metrics, or sharing insights with stakeholders, Tableau dashboards offer a
powerful means to communicate data-driven insights effectively and drive informed decision-
making.

Story:
In Tableau, a story is a feature that allows users to combine multiple dashboards,
sheets, and visualizations into a narrative-driven sequence. It's an interactive way to present
insights and guide viewers through a data-driven story, helping them understand the context,
trends, and conclusions derived from the data. Tableau stories enable users to create a structured
flow of information by arranging individual visualizations in a logical order, adding
annotations, and incorporating text to provide explanations or highlight key points. With the
ability to add captions, annotations, and navigation controls, stories in Tableau offer a
compelling way to communicate analytical findings, share insights, and drive decision-making
processes. They provide a cohesive and engaging experience for both creators and viewers,
fostering a deeper understanding of complex datasets and facilitating data-driven storytelling.

Mini projects
Mini Project 1:

Title: Netflix Insights Dashboard

I have analyzed the Netflix Titles data and built a dashboard to give insights for
the business use cases.

Business Use Cases:

1. How many titles are there for each rating?

2. Number of titles by country (count of titles country-wise)

3. Distribution of titles by rating and year

4. Number of titles added by month and year (e.g., Nov 2015)

5. Percentage of the total count of titles for the top 10 countries

6. Distribution of Movies and TV Shows (percentage of total)

 I have created a dashboard by analyzing the above business use cases.

Dashboard:

Dashboard Link:
https://public.tableau.com/views/NeflixInsightsDasboard_17170462765670/Dashboard1?:language=en-US&:sid=&:display_count=n&:origin=viz_share_link

Mini Project 2:

Title: HR Insights Dashboard

I have analyzed the OMEGA HR data set and built a dashboard to give insights for
the business use cases.

Business Use cases:

1. Header Metric
2. Hiring Trend vs Termination Trend
3. No. of Employees by Department
4. Gender Distribution
5. Age Distribution
6. Top 10 Job Titles

 I have created a dashboard by analyzing the above business use cases.

Dashboard:

Dashboard Link:
https://public.tableau.com/views/HRInsightsDasboard_17170511228530/Dashboard1?:language=en-US&:sid=&:display_count=n&:origin=viz_share_link

Mini Project 3:

Title: Wine Sales Insights Dashboard

I have analyzed the Wine Sales data set and built a dashboard to give insights for the
business use cases.

Business Use Cases:

1. All header metrics in one single chart

2. Revenue by region and subsegment (create a heat map)

3. Find revenue by the first 3 states/provinces combined, the next 3 states/provinces combined, and the next 2 states combined (revenue state/province breakdown)

4. Find the top 1 subsegment by revenue and always show it in a separate color using sets. If subsegment X is the top 1, it should be colored differently; if subsegment Y later becomes the top 1 instead of X, then Y should be colored differently.

5. Find the top 10 customers by revenue (using sets)

6. Find the top 10 customers by order quantity (using sets)

7. Find the customers that are in both sets (the cream or loyal customers) (combined set) (scatter plot)

 I have created a dashboard by analyzing the above business use cases.
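
The logic behind use cases 5 to 7 (two top-N sets and their combined set) can be sketched in Python. The function `top_n` and the four order rows are hypothetical and use a top 2 instead of a top 10 to keep the example short; the actual dashboard is built on the Wine Sales data set with Tableau sets.

```python
from collections import defaultdict

def top_n(rows, dim, measure, n):
    """Members of dim with the n largest totals of measure (a top-N set)."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[dim]] += row[measure]
    # Sort by total descending; ties are broken alphabetically.
    ranked = sorted(totals, key=lambda member: (-totals[member], member))
    return set(ranked[:n])

# Hypothetical order rows.
orders = [
    {"Customer": "A", "Revenue": 500.0, "Quantity": 2},
    {"Customer": "B", "Revenue": 300.0, "Quantity": 9},
    {"Customer": "C", "Revenue": 450.0, "Quantity": 10},
    {"Customer": "D", "Revenue": 100.0, "Quantity": 8},
]
top_by_revenue = top_n(orders, "Customer", "Revenue", 2)
top_by_quantity = top_n(orders, "Customer", "Quantity", 2)

# Combined set: customers that are members of both top-N sets.
loyal_customers = top_by_revenue & top_by_quantity
```

Because the sets are recomputed from the data, the members change automatically when the underlying totals change, which is the behaviour use case 4 asks for.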

Dashboard:

Dashboard Link:
https://public.tableau.com/views/WineSalesInsightsDashboard_17171382867550/Dashboard1?:language=en-US&:sid=&:display_count=n&:origin=viz_share_link
Mini Project 4:

Title: Super Store Insight Dashboard


I have analyzed the Sample Super Store data set and built a dashboard to give
insights for the business use cases.

Business Use Cases:


1. How many sub-categories have not met the sales target of $75,000?
2. In one chart, do the Sales vs Profit analysis by Product Name and by Customer Name
3. Distribution by sales (tree map)
4. See the Sales and Profit for a given month and year in one chart (combined chart) and forecast
5. Dual axis chart

 I have created a dashboard by analyzing the above business use cases.
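
Use case 1 amounts to comparing each sub-category's total sales with the target and counting the ones below it. A minimal Python sketch is given below; the sub-category totals are invented round numbers for illustration and are not the actual Sample Superstore values.

```python
SALES_TARGET = 75_000  # target stated in use case 1

# Hypothetical sub-category totals, not taken from the real data set.
sales_by_subcategory = {
    "Chairs": 90_000.0,
    "Fasteners": 3_000.0,
    "Labels": 12_000.0,
    "Phones": 120_000.0,
}

below_target = sorted(
    name for name, total in sales_by_subcategory.items() if total < SALES_TARGET
)
count_below_target = len(below_target)
```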

Dashboard:

Dashboard Link:
https://public.tableau.com/views/SuperStoreInsightsdasboard/Dashboard1?:language=en-US&:sid=&:display_count=n&:origin=viz_share_link

Main Project

Title: Super Store Interactive Dashboard


I have analyzed the Sample Super Store data set and built an interactive and dynamic
dashboard to give insights for the business use cases.

Business Use Cases:

1. Header Metrics (Profit, Sales, Quantity)


1. Analysis (Profit, Sales, Quantity)
2. Category X Sales, Category X Profit, Category X Quantity
3. Sub Category X Sales, Sub Category X Profit, Sub Category X Quantity
4. Segment X Sales, Segment X Profit, Segment X Quantity
5. Ship Mode X Sales, Ship Mode X Profit, Ship Mode X Quantity
6. State X Sales, State X Profit, State X Quantity

 I have created an interactive and dynamic dashboard by analyzing the above business
use cases.

Dashboard:

Dashboard Link:
https://public.tableau.com/views/SuperStoreInteractiveDsahboard/dasboard?:language=en-US&:sid=&:display_count=n&:origin=viz_share_link
