0% found this document useful (0 votes)
11 views103 pages

Data Visualization New

The document provides an overview of data visualization (DV), including its definition, importance, benefits, and various techniques and tools used in the field. It highlights the roles of DV in different sectors such as education, business, and military, and outlines the steps involved in creating effective visualizations. Additionally, it discusses programming techniques and specific tools like Tableau, Python, and R that facilitate data visualization.

Uploaded by

deepaa
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
11 views103 pages

Data Visualization New

The document provides an overview of data visualization (DV), including its definition, importance, benefits, and various techniques and tools used in the field. It highlights the roles of DV in different sectors such as education, business, and military, and outlines the steps involved in creating effective visualizations. Additionally, it discusses programming techniques and specific tools like Tableau, Python, and R that facilitate data visualization.

Uploaded by

deepaa
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
You are on page 1/ 103

Data visualization

Presented By
Dr.Deepa A
1
2
Presentation Overview

 Data Visualization Introduction – What is DV?

Why DV?

Benefits

Techniques

Who Uses DV
Steps in Data Visualization

Data Visualization Roles

Data Visualization Tools

Techniques in Programming
3
Examples
INTRODUCTION

What is DV?
 Is the practice of translating information into a visual
context such as map or graph.
 Is one of the steps of the data science process, which states
that after data has been collected, processed and modeled, it
must be visualized for conclusions to be made.
 The term is often used interchangeably with others, including
information graphics, information visualization and statistical
graphics. 4
5
Why DV?
 The main goal of data visualization is to make it easier to identify

patterns, trends and outliers in large data sets.

 DV aims to identify, locate, manipulate, format and deliver data

in the most efficient way possible.

6
7
Benefits of DV

• The ability to absorb information quickly, improve insights and


make faster decisions.
• An increased understanding of the next steps that must be taken to
improve the organization.
• An improved ability to maintain the audience's interest with information
they can understand.

8
Benefits of DV

 An easy distribution of information that increases the opportunity to


share insights with everyone involved.

 Eliminate the need for data scientists since data is more accessible
and understandable.

 An increased ability to act on findings quickly and, therefore, achieve


success with greater speed and less mistakes.
9
10
Who uses DV

Data visualization is important for almost every career.


 DV in Education:

 To monitor students learning progress throughout the semester.


 Advisers can take prompt actions to help the students who are failing
 It can be used by teachers to display student test results, by

computer scientists exploring advancements in artificial intelligence


(AI) or by executives looking to share information with stakeholders .

11
 DV in Business

 Provides Business with vastly improved Decision making process because it

enhance the available information and represents it in a pictorial format.


 Visualization charts with actionable data can display and help to communicate

the message effectively.


 Data visualization can support organization leaders in identifying patterns and

gaps and interpreting the information in a meaningful manner.

 DV in Military

 For the military, clear and actionable data is critical


 To quickly share accurate information in the most concise structure.
 Better understanding of past Data can make more accurate. 12
DV Steps
 Develop your Research question
 Get or create your Data
 Clean your Data.
 Choose a Chart Type.
 Choose your Tool.
 Prepare data.
 Create Chart

13
DV Techniques

 Box plots

 Histograms

 Heat Maps

 Charts

 Tree Maps

 Network Diagram
14
Box Plots
When we display the data distribution in a standardized way using 5 summary – minimum,
Q1 (First Quartile), median, Q3(third Quartile), and maximum, it is called a Box plot. It is also
termed as box and whisker plot.
We use these types of graphs or graphical representation to know:
Distribution Shape
Central Value of it
Variability of it

15
Check the image below which shows the minimum, maximum, first quartile, third quartile, median and outliers.

16
Histogram

• Histograms are a type of Bar


Chart.
• The bars on a Histogram
represent ranges along a
continuous quantifiable spectrum.
• A Chart that shows the
frequency distribution of data
points across a continuous range
of Data values.
• A chart that displays numeric
17
Data Visualization
Roles
 Showing change over time

 Showing a part-to-whole composition

 Depicting flows and processes

 Looking at how data is distributed

 Comparing values between groups

 Observing relationships between variables

 Looking at geographical data

18
Change over time

19
Spark Line Chart Examples

20
21
22
Part-to-whole composition

23
24
25
26
Flows and processes

27
28
How data is distributed?

29
30
31
Comparing values between
groups

32
33
34
Relationships between variables

35
36
37
38
39
Geographical Data

40
41
42
Data Visualization Tools

 Tableau

• Infogram
• ChartBlocks
• D3.js
• Google Charts
• Fusion Charts
• Chart.js

43
Tableau

•Tableau has a variety of options available, including a desktop app,


server and hosted online versions, and a free public option.
• There are hundreds of data import options available, from CSV files to
Google Ads and Analytics data to Sales force data.
• Output options include multiple chart formats as well as mapping
capability. That means designers can create color-coded maps that
showcase geographically important data in a format that’s much easier to
digest than a table or chart could ever be.

44
Infogram

• Infogram is a fully-featured drag-and-drop visualization tool that allows


even non-designers to create effective visualizations of data for marketing
reports, infographics, social media posts, maps, dashboards, and more.
• Finished visualizations can be exported into a number of formats: .PNG,
.JPG, .GIF, .PDF, and .HTML. Interactive visualizations are also possible,
perfect for embedding into websites or apps.
• Infogram also offers a WordPress plugin that makes embedding
visualizations even easier for WordPress users.

45
Chart
Blocks
Chart Blocks claims that data can be imported from “anywhere” using
their API, including from live feeds. While they say that importing data
from any source can be done in “just a few clicks,” it’s bound to be more
complex than other apps that have automated modules or extensions for
specific data sources.
• The app allows for extensive customization of the final visualization
created, and the chart building wizard helps users pick exactly the right
data for their charts before importing the data.
• Designers can create virtually any kind of chart, and the output is
responsive—a big advantage for data visualization designers who want
46
D3.js

• D3.js is a JavaScript library for manipulating documents using data.


• D3.js requires at least some JS knowledge, though there are apps out
there that allow non programming users to utilize the library.
• Those apps include NVD3, which offers reusable charts for D3.js;
Plotly’s Chart Studio, which also allows designers to create WebGL and
other charts; and Ember Charts, which also uses the Ember.js
framework.

47
Google Charts

•Google Charts is a powerful, free data visualization tool that is


specifically for creating interactive charts for embedding online.
• It works with dynamic data and the outputs are based purely on
HTML5 and SVG, so they work in browsers without the use of additional
plugins. Data sources include Google Spreadsheets, Google Fusion
Tables, Salesforce, and other SQL databases.
• There are a variety of chart types, including maps, scatter charts,
column and bar charts, histograms, area charts, pie charts, treemaps,
timelines, gauges, and many others. 48
Fusion Charts

•Fusion Charts is another JavaScript-based option for creating web and


mobile dashboards. It includes over 150 chart types and 1,000 map
types.
• It can integrate with popular JS frameworks (including React, jQuery,
React, Ember, and Angular) as well as with server-side programming
languages (including PHP, Java, Django, and Ruby on Rails).
• FusionCharts gives ready-to-use code for all of the chart and map
variations, making it easier to embed in websites even for those
designers with limited programming knowledge. 49
Chart.js

•Chart.js is a simple but flexible JavaScript charting library.


•It’s open source, provides a good variety of chart types (eight total),
and allows for animation and interaction.
• Chart.js uses HTML5 Canvas for output, so it renders charts well
across all modern browsers.
•Charts created are also responsive, so it’s great for creating
visualizations that are mobile-friendly.

50
Visualization using Programming

Python
• Python is considered one of the top-level programming languages
for data visualization because it is known for having many libraries
that allow for greater flexibility and its large and active scientific
computing community.
• It also controls the specific elements of the created graphics and
makes the specifications repeatable through code.

51
• Python is also very good at processing data, it provides open-source communities and
rich third-party libraries that allow continuous optimization for data visualization.
– matplotlib
– seaborn
– plotly
– pylab

52
R
• is an open-source software environment designed for creating
graphics.
• R is designed for data analysis.
•Although Python is becoming more and more popular, especially in
the areas of machine learning and in-depth learning, the R language
still has absolute advantages in data analysis and visualization, with
ggplot2 package and its extension package humanized drawing
grammar favored by users, especially bioinformatics and medical
researchers.
R 53
Power BI

•Power BI is able to extract data from a variety of data sources in


addition to supporting Microsoft's own products.
•The drag-and-drop graphical development model used by Power BI will
free data analysts from the visual chores and put more effort into data
management, algorithm research, and business communication.

54
Features of Tableau

55
Tableau products

56
Tableau Reports

57
Tableau file types…

58
Tableau can connect to:

59
Connecting to Data
Connecting to Data
Sample Data
Importing Data to tableau
Data in Tableau
Tableau Worksheet
Making Charts
Making Charts
Changing the axes
Multiple Dimensions in Charts
Multiple Measures in Charts
Various Types of Charts
Tree Map
Mark Options in Tableau
Color
Size
Label
Detail
Scatter Plot
Sorting
Sorting
Filters
Filters
Types of Aggregation
Count
Results in Percentages
Coloumn wise Percentage
Coloumn wise Percentage
Row wise Percentage
Calculated Fields
Creating Calculated Fields
Viewing Calculated Fields
Viewing Calculated Fields
Using Calculated Fields
Sets
Creating Sets
Using Sets
Hierarchy
Hierarchy
Maps
Creating Maps
Making Dashboards in Tableau
Conclusion
The tutorial covers the basic functionalities in Tableau. Much more
options are available which can be explored by one self after one
gets a feel of the software
THANK YOU

103

You might also like