Daunit 5
Daunit 5
Syllabus:
Data Visualization: Pixel-Oriented Visualization Techniques, Geometric Projection
Visualization Techniques, Icon-Based Visualization Techniques, Hierarchical
Visualization Techniques, Visualizing Complex Data and Relations.
Data Visualization
The line plots are nothing but the values on a series of data
points will be connected with straight lines.
The plot may seem very simple but it has more applications not only
in machine learning but in many other areas.
Used to analyze the performance of a model using the ROC- AUC curve.
Bar Plot
This is one of the widely used plots, that we would have seen multiple
times not just in data analysis, but we use this plot also wherever
there is a trend analysis in many fields.
We can visualize the data in a cool plot and can convey the details
straight forward to others.
This plot may be simple and clear but it’s not much frequently used in
Data science applications.
Stacked Bar Graph:
It is one of the most commonly used plots used for visualizing simple
data in Machine learning and Data Science.
This plot describes us as a representation, where each point in the
entire dataset is present with respect to any 2 to 3
features(Columns).
Scatter plots are available in both 2-D as well as in 3-D. The 2-D
scatter plot is the common one, where we will primarily try to find the
patterns, clusters, and separability of the data.
The colors are assigned to different data points based on how they
were present in the dataset i.e, target column representation.
We can color the data points as per their class label given in the
dataset.
Box and Whisker Plot
This plot can be used to obtain more statistical details about the data.
The straight lines at the maximum and minimum are also called
whiskers.
Points that lie outside the whiskers
will be considered as an outlier.
The box plot also gives us a
description of the 25th, 50th,75th
quartiles.
With the help of a box plot, we can
also determine the
Interquartile range(IQR) where
maximum details of the data will
be present
These box plots come under
univariate analysis, which means
that we are exploring data only
with one variable.
Pie Chart :
A pie chart shows a static number and how categories represent part of a
whole the composition of something. A pie chart represents numbers in
percentages, and the total sum of all segments needs to equal 100%.
Extensively used in presentations and offices, Pie Charts help show
proportions and percentages between categories, by dividing a circle into
proportional segments. Each arc length represents a proportion of each
category, while the full circle represents the total sum of all the data, equal
to 100%.
Donut Chart:
Marimekko Chart:
Chernoff Faces
A way to display variables on a two-dimensional surface, e.g.,
let x be eyebrow slant, y be eye size, z be nose length, etc.
The figure shows faces produced using 10 characteristics–head
eccentricity,
eye size, eye spacing, eye eccentricity, pupil size, eyebrow slant, nose
size, mouth shape, mouth size, and mouth opening. Each assigned
one of 10 possible values.
Stick Figure
As known as a Sunburst Chart, Ring Chart, Multi-level Pie Chart, Belt Chart,
Radial Treemap.
This type of visualisation shows hierarchy through a series of rings,
that are sliced for each category node. Each ring corresponds to a
level in the hierarchy, with the central circle representing the root
node and the hierarchy moving outwards from it.
Rings are sliced up and divided based on their hierarchical
relationship to the parent slice. The angle of each slice is either
divided equally under its parent node or can be made proportional
to a value.
Colour can be used to highlight hierarchal groupings or specific
categories.
Treemap:
Word Cloud: