0% found this document useful (0 votes)

17 views18 pages

Daunit 5

The document outlines various data visualization techniques, including pixel-oriented, geometric projection, icon-based, and hierarchical methods, aimed at effectively representing complex data and relationships. It emphasizes the importance of data visualization in gaining insights, identifying patterns, and enhancing understanding of large datasets. Additionally, it discusses specific visualization types such as scatter plots, bar graphs, and word clouds, detailing their applications and limitations.

Uploaded by

sudharani.am

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views18 pages

Daunit 5

Uploaded by

sudharani.am

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 18

UNIT - V

Syllabus:
Data Visualization: Pixel-Oriented Visualization Techniques, Geometric Projection
Visualization Techniques, Icon-Based Visualization Techniques, Hierarchical
Visualization Techniques, Visualizing Complex Data and Relations.

Data Visualization

 Data visualization is the art and practice of gathering, analyzing, and

graphically representing empirical information.
 They are sometimes called information graphics, or even just charts and
graphs.
 The goal of visualizing data is to tell the story in the data.
 Telling the story is predicated on understanding the data at a very
deep level, and gathering insight from comparisons of data points in
the numbers

Why data visualization?

 Gain insight into an information space by mapping data onto

graphical primitives provide qualitative overview of large data sets
 Search for patterns, trends, structure, irregularities, and
relationships among data.
 Help find interesting regions and suitable parameters for further
quantitative analysis.
 Provide a visual proof of computer representations derived.

Categorization of visualization methods

 Pixel-oriented visualization techniques
 Geometric projection visualization techniques
 Icon-based visualization techniques
 Hierarchical visualization techniques
 Visualizing complex data and relations

Pixel-Oriented Visualization Techniques

 For a data set of m dimensions, create m windows on the screen,
one for each dimension.
 The m dimension values of a record are mapped to m
pixels at the corresponding positions in the windows.
 The colors of the pixels reflect the corresponding values.

 To save space and show the connections among multiple

dimensions, space filling is often done in a circle segment.

Examples of Pixel-Oriented Visualizations

1. Multidimensional Attribute Mapping
In this technique, each dimension of the dataset is displayed in a
separate window. For instance, consider a customer database with
attributes like income, credit limit, transaction volume, and age. By
sorting customers based on income and mapping each attribute to a
pixel in its respective window, one can observe correlations such as
credit limits increasing with income or identify purchasing behaviors
across different age groups.
2. Financial Market Visualization
Each pixel represents a stock's daily closing price, with color gradients
indicating price levels—cooler colors for lower prices and warmer colors
for higher prices. Over time, this visualization can reveal market trends,
fluctuations, and anomalies, aiding in investment analysis and decision-
making.
3. Circle Segments Technique
This method visualizes high-dimensional data by representing the entire
dataset within a circle divided into segments, each corresponding to a
different attribute. Within each segment, attribute values are depicted as
colored pixels, arranged from the center outward. This layout facilitates
the identification of patterns and relationships across multiple
dimensions.
4. Dense Pixel Displays
By assigning one pixel per data value and grouping pixels belonging to
each dimension into adjacent areas, dense pixel displays enable the
visualization of up to a million data values on standard monitors. This
high data density is particularly beneficial for exploring large datasets,
allowing users to discern patterns that might be obscured in sparser
visualizations.
5. Pixel-Based Timelines
In historical data analysis, each pixel can represent an event or data
point in a timeline, with color coding to indicate significance or category.
This approach allows for a compact and detailed overview of events over
time, facilitating the detection of trends and significant periods.

Geometric Projection Visualization Techniques

Visualization of geometric transformations and

projections of the data.Methods
 Direct visualization
 Scatterplot and scatterplot matrices
 Landscapes Projection pursuit technique: Help users find
meaningful projections of multidimensional data
 Prosection views
 Hyperslice
 Parallel coordinates
Line Plot:
 This is the plot that you can see in the nook and corners of any sort
of analysis between 2 variables.

 The line plots are nothing but the values on a series of data
points will be connected with straight lines.
 The plot may seem very simple but it has more applications not only
in machine learning but in many other areas.
 Used to analyze the performance of a model using the ROC- AUC curve.

Bar Plot
 This is one of the widely used plots, that we would have seen multiple
times not just in data analysis, but we use this plot also wherever
there is a trend analysis in many fields.
 We can visualize the data in a cool plot and can convey the details
straight forward to others.
 This plot may be simple and clear but it’s not much frequently used in
Data science applications.
Stacked Bar Graph:

 Unlike a Multi-set Bar Graph which displays their bars side-by-side,

Stacked Bar Graphs segment their bars. Stacked Bar Graphs are used
to show how a larger category is divided into smaller categories and
what the relationship of each part has on the total amount. There are
two types of Stacked Bar Graphs:
 Simple Stacked Bar Graphs place each value for the segment after the
previous one. The total value of the bar is all the segment values
added together. Ideal for
comparing the total amounts across each group/segmented bar.
 100% Stack Bar Graphs show the percentage-of-the-whole of each
group and are plotted by the percentage of each value to the total
amount in each group. This makes it easier to see the relative
differences between quantities in each group.
 One major flaw of Stacked Bar Graphs is that they become harder to
read the more segments each bar has. Also comparing each segment
to each other is difficult, as they're not aligned on a common
baseline.
Scatter Plot

 It is one of the most commonly used plots used for visualizing simple
data in Machine learning and Data Science.
 This plot describes us as a representation, where each point in the
entire dataset is present with respect to any 2 to 3
features(Columns).
 Scatter plots are available in both 2-D as well as in 3-D. The 2-D
scatter plot is the common one, where we will primarily try to find the
patterns, clusters, and separability of the data.
 The colors are assigned to different data points based on how they
were present in the dataset i.e, target column representation.
 We can color the data points as per their class label given in the
dataset.
Box and Whisker Plot
 This plot can be used to obtain more statistical details about the data.
 The straight lines at the maximum and minimum are also called
whiskers.
 Points that lie outside the whiskers
will be considered as an outlier.
 The box plot also gives us a
description of the 25th, 50th,75th
quartiles.
 With the help of a box plot, we can
also determine the
Interquartile range(IQR) where
maximum details of the data will
be present
 These box plots come under
univariate analysis, which means
that we are exploring data only
with one variable.

Pie Chart :
A pie chart shows a static number and how categories represent part of a
whole the composition of something. A pie chart represents numbers in
percentages, and the total sum of all segments needs to equal 100%.
 Extensively used in presentations and offices, Pie Charts help show
proportions and percentages between categories, by dividing a circle into
proportional segments. Each arc length represents a proportion of each
category, while the full circle represents the total sum of all the data, equal
to 100%.
Donut Chart:

 A donut chart is essentially a Pie Chart

with an area of the centre cut out. Pie
Charts are sometimes criticised for
focusing readers on the proportional
areas of the slices to one another and to
the chart as a whole. This makes it
tricky to see the differences between
slices, especially when you try to
compare multiple Pie Charts together.
 A Donut Chart somewhat remedies this problem by de-emphasizing the use
of the area. Instead, readers focus more on reading the length of the arcs,
rather than comparing the proportions between slices.
 Also, Donut Charts are more space-efficient than Pie Charts because the
blank space inside a Donut Chart can be used to display information
inside it.

Marimekko Chart:

Also known as a Mosaic Plot.

 Marimekko Charts are used to visualise categorical data over a pair of
variables. In a Marimekko Chart, both axes are variable with a percentage
scale, that determines both the width and height of each segment. So
Marimekko Charts work as a kind of two- way 100% Stacked Bar Graph. This
makes it possible to detect relationships between categories and their
subcategories via the two axes.
 The main flaws of Marimekko Charts are that they can be hard to read,
especially when there are many segments. Also, it’s hard to accurately make
comparisons between each segment, as they are not all arranged next to
each other along a common baseline. Therefore, Marimekko Charts are
better suited for giving a more general overview of the data.

Icon-Based Visualization Techniques

 It uses small icons to represent multidimensional data values
 Visualization of the data values as features of icons
 Typical visualization methods
o Chernoff Faces
o Stick Figures
Chernoff Faces

Chernoff Faces
A way to display variables on a two-dimensional surface, e.g.,
let x be eyebrow slant, y be eye size, z be nose length, etc.
 The figure shows faces produced using 10 characteristics–head
eccentricity,
eye size, eye spacing, eye eccentricity, pupil size, eyebrow slant, nose
size, mouth shape, mouth size, and mouth opening. Each assigned
one of 10 possible values.

Stick Figure

 A census data figure showing age, income, gender, education

 A 5-piece stick figure (1 body and 4 limbs w. different angle/length)
 Age, income are indicated by position of the figure.
 Gender, education are indicated by angle/length.
 Visualization can show a texture pattern.
 2 dimensions are mapped to the display axes and the remaining
dimensions are mapped to the angle and/ or length of the limbs.
Hierarchical Visualization
Circle Packing

 Circle Packing is a variation of a Treemap that uses circles instead of

rectangles. Containment within each circle represents a level in the
hierarchy: each branch of the tree is represented as a circle and its
sub-branches are represented as circles inside of it. The area of each
circle can also be used to represent an additional arbitrary value,
such as quantity or file size. Colour may also be used to assign
categories or to represent another variable via different shades.
 As beautiful as Circle Packing appears, it's not as space-efficient as a
Treemap, as there's a lot of empty space within the circles. Despite
this, Circle Packing actually reveals hierarchal structure better than
a Treemap.
Sunburst Diagram

 As known as a Sunburst Chart, Ring Chart, Multi-level Pie Chart, Belt Chart,
Radial Treemap.
 This type of visualisation shows hierarchy through a series of rings,
that are sliced for each category node. Each ring corresponds to a
level in the hierarchy, with the central circle representing the root
node and the hierarchy moving outwards from it.
 Rings are sliced up and divided based on their hierarchical
relationship to the parent slice. The angle of each slice is either
divided equally under its parent node or can be made proportional
to a value.
 Colour can be used to highlight hierarchal groupings or specific
categories.

Treemap:

 Treemaps are an alternative way of visualising the hierarchical

structure of
a Tree Diagram while also displaying quantities for each category via
area size. Each category is assigned a rectangle area with their
subcategory rectangles nested inside of it.
 When a quantity is assigned to a category, its area size is displayed in
proportion to that quantity and to the other quantities within the
same parent category in a part-to-whole relationship. Also, the area
size of the parent category is the total of its subcategories. If no
quantity is assigned to a subcategory, then it's area is divided equally
amongst the other subcategories within its parent category.
 The way rectangles are divided and ordered into sub-rectangles is
dependent on the tiling algorithm used. Many tiling algorithms
have been developed, but the "squarified algorithm" which keeps
each rectangle as square as possible is the one commonly used.
 Ben Shneiderman originally developed Treemaps as a way of
visualising a vast file directory on a computer, without taking up too
much space on the screen. This makes Treemaps a more compact
and space-efficient option for displaying hierarchies, that gives a
quick overview of the structure. Treemaps are also great at
comparing the proportions between categories via their area size.
 The downside to a Treemap is that it doesn't show the hierarchal
levels as clearly as other charts that visualise hierarchal data (such
as a Tree Diagram or Sunburst Diagram).
Visualizing Complex Data and Relations

 For a large data set of high dimensionality, it would be difficult to

visualize all dimensions at the same time.
 Hierarchical visualization techniques partition all dimensions into
subsets (i.e., subspaces).
 The subspaces are visualized in a hierarchical manner
 “Worlds-within-Worlds,” also known as n-Vision, is
a representative hierarchical visualization
method.
 To visualize a 6-D data set, where the dimensions are
F,X1,X2,X3,X4,X5.
 We want to observe how F changes w.r.t. other dimensions.
We can fix X3,X4,X5dimensions to selected values and visualize
changes to F w.r.t. X1, X2
 Most visualization techniques were mainly for numeric data.
 Recently, more and more non-numeric data, such as text
and social networks, havebecome available.
 Many people on the Web tag various objects such as pictures,
blog entries, and productreviews.
 A tag cloud is a visualization of statistics of user-generated tags.
 Often, in a tag cloud, tags are listed alphabetically or in a user-
preferred order.
 The importance of a tag is indicated by font size or color.

Word Cloud:

Also known as aTag Cloud.

 A visualisation method that displays how frequently words appear in a given

body of text, by making the size of each word proportional to its frequency.
All the words are then arranged in a cluster or cloud of words. Alternatively,
the words can also be arranged in any format: horizontal lines, columns or
within a shape.
 Word Clouds can also be used to display words that have meta-data
assigned to them. For example, in a Word Cloud with all the World's
country's names, the population could be assigned to each name to
determine its size.
 Colour used on Word Clouds is usually meaningless and is primarily aesthetic,
but it can be used to categorise words or to display another data variable.
 Typically, Word Clouds are used on websites or blogs to depict keyword
or tag usage. Word Clouds can also be used to compare two different
bodies of text together.
 Although being simple and easy to understand, Word Clouds have some major
flaws:
 Long words are emphasised over short words.
 Words whose letters contain many ascenders and descenders may
receive more attention.
 They're not great for analytical accuracy, so used more for aesthetic reasons
instead.

C++ Lab Manual
100% (1)
C++ Lab Manual
88 pages
750-Article Text-3615-1-10-20240613
No ratings yet
750-Article Text-3615-1-10-20240613
16 pages
Data Visualization 21st June
No ratings yet
Data Visualization 21st June
110 pages
03 Temporal, Geospatial Multivariate Data
No ratings yet
03 Temporal, Geospatial Multivariate Data
69 pages
Da Unit 5
No ratings yet
Da Unit 5
61 pages
Data Visualization Notes
No ratings yet
Data Visualization Notes
22 pages
Data Visualization Unit-V 21.11.24
No ratings yet
Data Visualization Unit-V 21.11.24
69 pages
Maths English Medium 11th Model Question Paper WWW tn11th in
No ratings yet
Maths English Medium 11th Model Question Paper WWW tn11th in
5 pages
Unit III
No ratings yet
Unit III
105 pages
5 Knowledge Representation
No ratings yet
5 Knowledge Representation
19 pages
Daunit 3
No ratings yet
Daunit 3
32 pages
DVP 3
No ratings yet
DVP 3
97 pages
An Invariant Approach To Statistical Analysis of Shapes 1st Edition Subhash R. Lele All Chapters Instant Download
100% (14)
An Invariant Approach To Statistical Analysis of Shapes 1st Edition Subhash R. Lele All Chapters Instant Download
85 pages
DAUnit 4
No ratings yet
DAUnit 4
51 pages
Data Preprocessing
No ratings yet
Data Preprocessing
76 pages
Unit 1 Data Objects Attributes Visualization
No ratings yet
Unit 1 Data Objects Attributes Visualization
34 pages
Common Visualization Idioms
0% (1)
Common Visualization Idioms
95 pages
Chapter 4 Common Visualization Idioms
No ratings yet
Chapter 4 Common Visualization Idioms
39 pages
Unit 4
No ratings yet
Unit 4
35 pages
Unit-5 New
No ratings yet
Unit-5 New
31 pages
Principles of Artificial Intelligence
No ratings yet
Principles of Artificial Intelligence
15 pages
All Unit DV Notes
No ratings yet
All Unit DV Notes
31 pages
DA UNIT V Notes
No ratings yet
DA UNIT V Notes
17 pages
Stability & Determinacy of Trusses PDF
No ratings yet
Stability & Determinacy of Trusses PDF
5 pages
Da Unit - V
No ratings yet
Da Unit - V
17 pages
Chapter 3 - Data Visualization Chapter 4 - Summary Statistics
No ratings yet
Chapter 3 - Data Visualization Chapter 4 - Summary Statistics
38 pages
Data Analytics Unit V
No ratings yet
Data Analytics Unit V
18 pages
DAUnit 2
No ratings yet
DAUnit 2
18 pages
9 - Maths - L-3-Coordinate Geometry WS-1
No ratings yet
9 - Maths - L-3-Coordinate Geometry WS-1
6 pages
Chapter 3 Non Spatial Data Visualization
No ratings yet
Chapter 3 Non Spatial Data Visualization
45 pages
DataScience&Analytics DataVisualiztn
No ratings yet
DataScience&Analytics DataVisualiztn
26 pages
Time-Cost Trade-Off Numerical
No ratings yet
Time-Cost Trade-Off Numerical
8 pages
Data Visualization
No ratings yet
Data Visualization
16 pages
Unit 5-Data Visualization
No ratings yet
Unit 5-Data Visualization
22 pages
02 Data
No ratings yet
02 Data
42 pages
Unit 5 Notes
No ratings yet
Unit 5 Notes
12 pages
Data Analytics
No ratings yet
Data Analytics
14 pages
UNIT 5 Data Analytics
No ratings yet
UNIT 5 Data Analytics
20 pages
Data Visualization Guide: 1. Common Types of Data Visualizations
No ratings yet
Data Visualization Guide: 1. Common Types of Data Visualizations
11 pages
L5 Data Visualization
No ratings yet
L5 Data Visualization
33 pages
Astm-D7336 D7336M
No ratings yet
Astm-D7336 D7336M
9 pages
Unit 3 DATA VISUAIZATION
No ratings yet
Unit 3 DATA VISUAIZATION
25 pages
Data Analytics
No ratings yet
Data Analytics
20 pages
Da Unit-5
100% (1)
Da Unit-5
19 pages
Week 02.1 Chaptr002
No ratings yet
Week 02.1 Chaptr002
29 pages
Dsbda Ut6
No ratings yet
Dsbda Ut6
11 pages
Winspire
No ratings yet
Winspire
44 pages
3238-Article Text-5879-1-10-20180104
No ratings yet
3238-Article Text-5879-1-10-20180104
140 pages
Unit 2
No ratings yet
Unit 2
12 pages
Unit 5
No ratings yet
Unit 5
15 pages
Da Unit - V
No ratings yet
Da Unit - V
14 pages
Sds 2205 Data Visualization Assignment 2
No ratings yet
Sds 2205 Data Visualization Assignment 2
3 pages
CTET Paper Complete Analysis by Himanshi Singh
No ratings yet
CTET Paper Complete Analysis by Himanshi Singh
37 pages
YOLO-LLTS: Real-Time Low-Light Traffic Sign Detection Via Prior-Guided Enhancement and Multi-Branch Feature Interaction
No ratings yet
YOLO-LLTS: Real-Time Low-Light Traffic Sign Detection Via Prior-Guided Enhancement and Multi-Branch Feature Interaction
15 pages
Visualization
No ratings yet
Visualization
15 pages
Notes DV 2025
No ratings yet
Notes DV 2025
10 pages
Data Analytics - Unit-V
0% (1)
Data Analytics - Unit-V
9 pages
Course Subjects: List of Subjects According To CFS Courses in UIA
No ratings yet
Course Subjects: List of Subjects According To CFS Courses in UIA
3 pages
Da Sem Unit 5
No ratings yet
Da Sem Unit 5
8 pages
Da Unit 5
No ratings yet
Da Unit 5
11 pages
Data Visulization Techniques
No ratings yet
Data Visulization Techniques
10 pages
Unit 4 Part A
No ratings yet
Unit 4 Part A
51 pages
Lesson 2
No ratings yet
Lesson 2
18 pages
DM14 Visualisation
100% (1)
DM14 Visualisation
67 pages
Functions Unit2
No ratings yet
Functions Unit2
9 pages
DA Unit-5
No ratings yet
DA Unit-5
6 pages
5th Unit Fds
No ratings yet
5th Unit Fds
5 pages
Elkies N.D. Lectures On Analytic Number Theory (Math259, Harvard, 1998) (100s) - MT
No ratings yet
Elkies N.D. Lectures On Analytic Number Theory (Math259, Harvard, 1998) (100s) - MT
100 pages
Lab Report Projectile Motion - Docx 2
No ratings yet
Lab Report Projectile Motion - Docx 2
8 pages
Different Types of Graphs To Present Data
No ratings yet
Different Types of Graphs To Present Data
3 pages
Basics of Data Visualization A Necessity
No ratings yet
Basics of Data Visualization A Necessity
11 pages
Data Visualisation: Why Is Data Visualization Important?
No ratings yet
Data Visualisation: Why Is Data Visualization Important?
19 pages
DV Methods
No ratings yet
DV Methods
6 pages
Unit 5
No ratings yet
Unit 5
6 pages
Week 4 Assignment
No ratings yet
Week 4 Assignment
5 pages
Che Lab Report On Flow Over Weirs
100% (1)
Che Lab Report On Flow Over Weirs
14 pages
Normal Modes - Rigid Element Analysis With RBE2 and CONM2
No ratings yet
Normal Modes - Rigid Element Analysis With RBE2 and CONM2
22 pages
5 Da
No ratings yet
5 Da
6 pages
Scientific Design Choices in Data Visualization
No ratings yet
Scientific Design Choices in Data Visualization
11 pages
Quesioner Design and Analyisis
No ratings yet
Quesioner Design and Analyisis
25 pages
Unit 11 Area and Its Boundary
No ratings yet
Unit 11 Area and Its Boundary
2 pages
Strings Js Notes
No ratings yet
Strings Js Notes
3 pages
Hasselbring 07
No ratings yet
Hasselbring 07
33 pages
Data Mining Notes C3
No ratings yet
Data Mining Notes C3
11 pages
Data Visualization Tech.
No ratings yet
Data Visualization Tech.
6 pages
Answers)
100% (1)
Answers)
12 pages
Thyroid Cancer Letter
No ratings yet
Thyroid Cancer Letter
8 pages
Physics Mechanics Review
No ratings yet
Physics Mechanics Review
17 pages
Ejemplos de Programación de Agentes en JADE
No ratings yet
Ejemplos de Programación de Agentes en JADE
7 pages
OTS Matrices Determinants PDF
No ratings yet
OTS Matrices Determinants PDF
5 pages
Ameer Data Visualization and Techniques
No ratings yet
Ameer Data Visualization and Techniques
4 pages
FM Code To Clear Customer Open Item
No ratings yet
FM Code To Clear Customer Open Item
5 pages
Practical Skills
No ratings yet
Practical Skills
35 pages
Image Histogram: Unveiling Visual Insights, Exploring the Depths of Image Histograms in Computer Vision
From Everand
Image Histogram: Unveiling Visual Insights, Exploring the Depths of Image Histograms in Computer Vision
Fouad Sabry
No ratings yet

Daunit 5

Uploaded by

Daunit 5

Uploaded by

UNIT - V

 Data visualization is the art and practice of gathering, analyzing, and

Why data visualization?

 Gain insight into an information space by mapping data onto

Categorization of visualization methods

Pixel-Oriented Visualization Techniques

 To save space and show the connections among multiple

Examples of Pixel-Oriented Visualizations

Geometric Projection Visualization Techniques

Visualization of geometric transformations and

 Unlike a Multi-set Bar Graph which displays their bars side-by-side,

 A donut chart is essentially a Pie Chart

Also known as a Mosaic Plot.

Icon-Based Visualization Techniques

 A census data figure showing age, income, gender, education

 Circle Packing is a variation of a Treemap that uses circles instead of

 Treemaps are an alternative way of visualising the hierarchical

 For a large data set of high dimensionality, it would be difficult to

Also known as aTag Cloud.

 A visualisation method that displays how frequently words appear in a given

You might also like