0% found this document useful (0 votes)
30 views

Data Analytics-Data Visualization UNIT-V

Uploaded by

Hemanth Kumar
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
30 views

Data Analytics-Data Visualization UNIT-V

Uploaded by

Hemanth Kumar
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 11

JYOTHISHMATHI INSTITUTE OF TECHNOLOGY AND SCIENCE

KARMNAGAR - 505481
(Approved by AICTE, New Delhi and Affiliated to JNTU, Hyderabad)

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

III BTECH I SEMESTER 2021-2022

DATA ANALYTICS UNIT-V

Data visualization
• Data visualization is the practice of translating information into a visual context, such as a map or
graph, to make data easier for the human brain to understand and pull insights from.
• The main goal of data visualization is to make it easier to identify patterns, trends and outliers in
large data sets. The term is often used interchangeably with others, including information graphics,
information visualization and statistical graphics.
• Data visualization is one of the steps of the data science process, which states that after data has been
collected, processed and modeled, it must be visualized for conclusions to be made. Data visualization
is also an element of the broader data presentation architecture (DPA) discipline, which aims to
identify, locate, manipulate, format and deliver data in the most efficient way possible.

Why is data visualization important?


• Data visualization provides a quick and effective way to communicate information in a universal
manner using visual information.
• The practice can also help businesses identify which factors affect customer behavior; pinpoint areas
that need to be improved or need more attention; make data more memorable for stakeholders;
understand when and where to place specific products; and predict sales volumes.

Other benefits of data visualization include the following:


• The ability to absorb information quickly, improve insights and make faster decisions;
• an increased understanding of the next steps that must be taken to improve the organization;
• an improved ability to maintain the audience's interest with information they can understand;
• an easy distribution of information that increases the opportunity to share insights with everyone
involved;
• eliminate the need for data scientists since data is more accessible and understandable; and
• an increased ability to act on findings quickly and, therefore, achieve success with greater speed and
less mistakes.

Categorization of visualization methods

• Pixel-oriented visualization techniques

Prepared by N.Venkateswaran, Associate Professor, CSE Dept. JITS - Karimnagar Page 1


JYOTHISHMATHI INSTITUTE OF TECHNOLOGY AND SCIENCE
KARMNAGAR - 505481
(Approved by AICTE, New Delhi and Affiliated to JNTU, Hyderabad)

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

• Geometric projection visualization techniques


• Icon-based visualization techniques
• Hierarchical visualization techniques
• Visualizing complex data and relations

Pixel-Oriented Visualization Techniques


✓ A simple way to visualize the value of a dimension is to use a pixel where the color of the pixel reflects
the dimension’s value.
✓ For a data set of m dimensions pixel oriented techniques create m windows on the screen, one for
each dimension.
✓ The m dimension values of a record are mapped to m pixels at the corresponding position in the
windows.
✓ The color of the pixel reflects other corresponding values. Inside a window, the data values are
arranged in some global order shared by all windows.
Eg: All Electronics maintains a customer information table, which consists of 4 dimensions:
income, credit_limit, transaction_volume and age. We analyze the correlation between income
and other attributes by visualization.
✓ We sort all customers in income in ascending order and use this order to layout the customer data in
the 4 visualization windows as shown in fig.
✓ The pixel colors are chosen so that the smaller the value, the lighter the shading.
✓ Using pixel based visualization we can easily observe that credit_limit increases as income increases
customer whose income is in the middle range are more likely to purchase more from All Electronics,
these is no clear correlation between income ages.

Prepared by N.Venkateswaran, Associate Professor, CSE Dept. JITS - Karimnagar Page 2


JYOTHISHMATHI INSTITUTE OF TECHNOLOGY AND SCIENCE
KARMNAGAR - 505481
(Approved by AICTE, New Delhi and Affiliated to JNTU, Hyderabad)

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

Laying Out Pixels in Circle Segments


• To save space and show the connections among multiple dimensions, space filling is often done in a
circle segment.

Geometric Projection Visualization Techniques


• In the pixel-oriented visualization technique, the distribution of data in multidimensional space cannot
be specified clearly. A drawback of pixel-oriented visualization techniques is that they cannot help us
much in understanding the distribution of data in a multidimensional space.
• Geometric projection techniques help users find interesting projections of multidimensional data sets.

The various techniques of geometric projection visualization includes the following:

Methods
✓ Direct visualization
✓ Scatter plot and scatter plot matrices
✓ Landscapes Projection pursuit technique: Help users find meaningful projections of multidimensional
data
✓ Prosection views
✓ Hyperslice
✓ Parallel coordinates

Direct visualization
Direct visualizations of image data make use of the images in their original visible format. The first technique,
the slice histogram, arranges slices of images as histograms, organized by both visual and non-visual variables.
Prepared by N.Venkateswaran, Associate Professor, CSE Dept. JITS - Karimnagar Page 3
JYOTHISHMATHI INSTITUTE OF TECHNOLOGY AND SCIENCE
KARMNAGAR - 505481
(Approved by AICTE, New Delhi and Affiliated to JNTU, Hyderabad)

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

Scatter Plots
✓ A scatter plot displays 2-D data points using Cartesian coordinates.
✓ A third dimension can be added using different colors or shapes to represent different data points
✓ Through this visualization, in the adjacent figure, we can see that points of types “+” and “×” tend to be
collocated.
✓ Scatterplots show many points plotted in the Cartesian plane. Each point represents the values of two
variables. One variable is chosen in the horizontal axis and another in the vertical axis.

✓ The technique of scatter plot is inefficient if the number of dimensions in a data set is greater than
four. So enhanced techniques of scatter plot is called scatter-plot matrix.

Scatterplot Matrices

✓ The scatter-plot matrix is an extension to the scatter plot.


✓ In this technique, if data set contains ‘k’ dimensions then ‘k x k grid of 2-D scatter plots represents a
scatter-plot matrix. Thus provides visualization for each dimension with remaining dimensions.
✓ When we have more than two variables and we want to find the correlation between one variable
versus the remaining ones we use scatter plot matrix.
✓ For k-dimensional data a minimum of (k2-k)/2 scatter plots of 2D will be required.
✓ There can be maximum of k2 plots of 2D
✓ In the adjoining figure , there are k2 plots.
✓ Out of these, k are X-X plots, and all X-Y plots (where X, Y are distinct dimensions) are given in 2
orientations (X vs Y and Y vs, X)
Prepared by N.Venkateswaran, Associate Professor, CSE Dept. JITS - Karimnagar Page 4
JYOTHISHMATHI INSTITUTE OF TECHNOLOGY AND SCIENCE
KARMNAGAR - 505481
(Approved by AICTE, New Delhi and Affiliated to JNTU, Hyderabad)

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

Parallel Coordinates
✓ The scatter-plot matrix becomes less effective as the dimensionality increases.
✓ Another technique, called parallel coordinates, can handle higher dimensionality
✓ n equidistant axes which are parallel to one of the screen axes and correspond to the attributes (i.e. n
dimensions)
✓ The axes are scaled to the [minimum, maximum]: range of the corresponding attribute
✓ Every data item corresponds to a polygonal line which intersects each of the axes at the point which
corresponds to the value for the attribute

Prepared by N.Venkateswaran, Associate Professor, CSE Dept. JITS - Karimnagar Page 5


JYOTHISHMATHI INSTITUTE OF TECHNOLOGY AND SCIENCE
KARMNAGAR - 505481
(Approved by AICTE, New Delhi and Affiliated to JNTU, Hyderabad)

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

Icon-Based Visualization Techniques

Icon based visualization techniques makes use of small icons for representing multidimensional data vales.

✓ Visualization of the data values as features of icons


✓ Typical visualization methods
o Chernoff Faces
o Stick Figures
✓ General techniques
o Shape coding: Use shape to represent certain information encoding
o Color icons: Use color icons to encode more information
o Tile bars: Use small icons to represent the relevant feature vectors in document retrieval

Chernoff Faces

✓ It displays multidimensional data in the form of cartoon human face upto 18 dimensions.
✓ It specifies the dimensional value of various components like eyes, ears, mouth and nose by their
shape, position and orientation.

✓ Moveover, it also utilizes the mindset of a human in identifying the differences between facial
features.

✓ A way to display variables on a two-dimensional surface, e.g., let x be eyebrow slant, y be eye size, z be
nose length, etc.
✓ The figure shows faces produced using 10 characteristics–head eccentricity, eye size, eye spacing, eye
eccentricity, pupil size, eyebrow slant, nose size, mouth shape, mouth size, and mouth opening): Each
assigned one of 10 possible values.

Prepared by N.Venkateswaran, Associate Professor, CSE Dept. JITS - Karimnagar Page 6


JYOTHISHMATHI INSTITUTE OF TECHNOLOGY AND SCIENCE
KARMNAGAR - 505481
(Approved by AICTE, New Delhi and Affiliated to JNTU, Hyderabad)

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

Stick Figure
✓ It maps multidimensional data to five –piece stick figure, where each figure has 4 limbs and a body.

✓ 2 dimensions are mapped to the display axes and the remaining dimensions are mapped to the angle
and/ or length of the limbs.

Hierarchical Visualization Techniques


Visualization of the data using a hierarchical partitioning into subspaces.
✓ For a large data set of high dimensionality, it would be difficult to visualize all dimensions at the same
time.
✓ Hierarchical visualization techniques partition all dimensions into subsets (i.e., subspaces).
✓ The subspaces are visualized in a hierarchical manner
✓ “Worlds-within-Worlds,” also known as n-Vision, is a representative hierarchical visualization method.
✓ To visualize a 6-D data set, where the dimensions are F,X1,X2,X3,X4,X5.
✓ We want to observe how F changes w.r.t. other dimensions. We can fix X3,X4,X5 dimensions to
selected values and visualize changes to F w.r.t. X1, X2.

Methods:
• Dimensional Stacking
• Worlds-Within-Worlds
• Tree-Map
• Cone Trees
• InfoCube

Dimensional Stacking:
• Partitioning of the n-dimensional attribute space in 2-D subspaces, which are 'stacked' into each other.

Prepared by N.Venkateswaran, Associate Professor, CSE Dept. JITS - Karimnagar Page 7


JYOTHISHMATHI INSTITUTE OF TECHNOLOGY AND SCIENCE
KARMNAGAR - 505481
(Approved by AICTE, New Delhi and Affiliated to JNTU, Hyderabad)

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

• Partitioning of the attribute value ranges into classes. The important attributes should be used on the
outer levels.
• Adequate for data with ordinal attributes of low cardinality
• But, difficult to display more than nine dimensions
• Important to map dimensions appropriately

Visualization of oil mining data with longitude and Latitude mapped to the outer x-,y-axes and ore grade and
depth mapped to the inner x-,y-axes.

Worlds-within-worlds

• Assign the function and two most important parameters to innermost world.
• Fix all other parameters at constant values-draw other (1 or 2 or 3 dimensional worlds choosing these
as the axes)
• Software that uses this paradigm.

Prepared by N.Venkateswaran, Associate Professor, CSE Dept. JITS - Karimnagar Page 8


JYOTHISHMATHI INSTITUTE OF TECHNOLOGY AND SCIENCE
KARMNAGAR - 505481
(Approved by AICTE, New Delhi and Affiliated to JNTU, Hyderabad)

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

• N-Vision: Dynamic interaction through data glove and stereo displays, including rotation, scaling (inner)
and translation (inner/outer).
• Auto Visual: Static interaction by means of queries.

Tree-Map
• Screen-Filling method which uses a hierarchical partitioning of the screen into regions departing on the
attribute values.
• The X- and Y- dimension of the screen are partitioned alternately according o the attribute values
(classes)

Info Cube
• A 3-D visualization techniques where hierarchical information is displayed as nested semi-transparent
cubes.
• The outermost cubes correspond to the top level data, while the sub nodes or the lower level data are
represented as smaller cubes inside the outermost cubes, and so on.

Prepared by N.Venkateswaran, Associate Professor, CSE Dept. JITS - Karimnagar Page 9


JYOTHISHMATHI INSTITUTE OF TECHNOLOGY AND SCIENCE
KARMNAGAR - 505481
(Approved by AICTE, New Delhi and Affiliated to JNTU, Hyderabad)

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

Three-D Cone Trees


• 3D cone tree visualization techniques works well for up to a thousand nodes or so.
• First build a 2D circle tree that arranges its nodes in concentric circles centered on the root
node.
• Cannot avoid overlaps when projected to 2D.

Visualizing Complex Data and Relations.

• Most visualization techniques were mainly for numeric data.


• Recently, more and more non-numeric data, such as text and social networks, have become available.
• Many people on the Web tag various objects such as pictures, blog entries, and product reviews.
• A tag cloud is a visualization of statistics of user-generated tags.
• Often, in a tag cloud, tags are listed alphabetically or in a user-preferred order.
• The importance of a tag is indicated by font size or color.

❖ Visualizing non-numeric data: Text and social networks


❖ Tag cloud: visualizing user-generated tags.
❖ Importance of tag is represented by font size/color.

Prepared by N.Venkateswaran, Associate Professor, CSE Dept. JITS - Karimnagar Page 10


JYOTHISHMATHI INSTITUTE OF TECHNOLOGY AND SCIENCE
KARMNAGAR - 505481
(Approved by AICTE, New Delhi and Affiliated to JNTU, Hyderabad)

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

❖ Besides text data, there are also methods to visualize relationships, such as visualizing social network.

Prepared by N.Venkateswaran, Associate Professor, CSE Dept. JITS - Karimnagar Page 11

You might also like