DA Unit-5
DA Unit-5
DA Unit-5
Data Visualization
Data visualization is the art and practice of gathering, analyzing, and graphically
representing empirical information.
They are sometimes called information graphics, or even just charts and graphs.
The goal of visualizing data is to tell the story in the data.
Telling the story is predicated on understanding the data at a very deep level, and
gathering insight from comparisons of data points in the numbers
Gain insight into an information space by mapping data onto graphical primitives
Provide qualitative overview of large data sets
Search for patterns, trends, structure, irregularities, and relationships among data.
Help find interesting regions and suitable parameters for further quantitative analysis.
Provide a visual proof of computer representations derived.
For a data set of m dimensions, create m windows on the screen, one for each dimension.
The m dimension values of a record are mapped to m pixels at the corresponding
positions in the windows.
The colors of the pixels reflect the corresponding values.
To save space and show the connections among multiple dimensions, space filling is
often done in a circle segment.
Geometric Projection Visualization Techniques
Methods
Direct visualization
Scatterplot and scatterplot matrices
Landscapes Projection pursuit technique: Help users find meaningful projections of
multidimensional data
Prosection views
Hyperslice
Parallel coordinates
Scatter Plots
Scatterplot Matrices
Parallel Coordinates
Chernoff Faces
Stick Figure
For a large data set of high dimensionality, it would be difficult to visualize all
dimensions at the same time.
Hierarchical visualization techniques partition all dimensions into subsets (i.e.,
subspaces).
The subspaces are visualized in a hierarchical manner
“Worlds-within-Worlds,” also known as n-Vision, is a representative hierarchical
visualization method.
To visualize a 6-D data set, where the dimensions are F,X1,X2,X3,X4,X5.
We want to observe how F changes w.r.t. other dimensions. We can fix X3,X4,X5
dimensions to selected values and visualize changes to F w.r.t. X1, X2
Visualizing Complex Data and Relations