0% found this document useful (0 votes)
7 views

Data VisualizationModule3

Data Visualization

Uploaded by

r8342254
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
7 views

Data VisualizationModule3

Data Visualization

Uploaded by

r8342254
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 53

DAT A VIS U ALIZ AT IO N

CO M M U N ICAT IN G W IT H DAT A

P R AM O D K U M AR N AIK
CH AIR P E R S O N AI AN D R O B O T ICS
DS U B AN G ALO R E
DAT A VIS U ALIZ AT IO N
What are the problems with this graph?
Display the data table

Calories in Common Candies

250
200
150
100
50
0

fy
Taf
i nt Corn m ar s i s t s . . . . l i ces r Balls
nerM andy g Gu y Be e
. .
Tw ocola. cola. cola. ectinS Sou
in C i n m i c o o P
f t erD C hew Gum Licor ilk Ch ilkCh ilkCh
A M M M

CO LU M N
CH AR T
ALT E R N AT E DIS P L AY

Sorting and expanding the scale of the graph allows all labels to be seen
as well as displaying a characteristic of the data.

Calories in Common Candies

250

200
150
100

50
0
y
pop ffee els Taff
olli lls ts To s n s
Gum rscotch L u r Ba i n ars ts Sli c e t or
C Car a m
Be a n rittle
wing Butte
S o
ligh
t M
y Be iceTwisPectin rMiCn andy elly nutB r Bar
Star m r i nn e J Pe a teBa late
Che Gum L i co
Afte r D s o c o l a
aorc o
Chip
s s Ball kCh nlkdCBh ts
late R aisin d Milk Dar A lmMoi anu
C ho c o
v e r e d
Ma l t e
c o l a te
e r e dPe
et o e Cho Cov
iSwe teC olat late
Sem cola hoc Milk oco
Cho ilkC h

CO LU M N
i l k M C
M Milk

CH AR T
O B J E CT IVE S
As you create graphics keep the following in mind.
 Avoid distortion of the true story.
 Induce the viewer to think about the substance, not the graph.
 Reveal the data at several layers of detail.
 Encourage the eye to compare different pieces.
 Support the statistical and verbal descriptions of the data.

STA6166-2-5
DAT A VIS U ALIZ AT IO N Graphical Display
Objectives
The visual portrayal of quantitative information • Tabulation
Are used to: • Description
• Illustration
• Display the actual data table
• Exploration
• Display quantities derived from the
data
• Show what has been learned
about the data from other analyses
• Allow one to see what may be
occurring in the data over and
above what has already been
described

“A picture is w orth a thousand w ord s… ”


DAT A VIS U ALIZ AT IO N
W hat is d ata visualization?
“A picture speaks a thousand w ord s.” S imilarly an
infog raphic/visual can help us analyze d ata and hid d en
patterns in a much easier w ay.
IN T R O DU CT IO N T O DAT A VIS U ALIZ AT IO N
IN T R O DU CT IO N T O DAT A VIS U ALIZ AT IO N
W hy visualize d ata?

Data visualization is a w ay you can create a story throug h your d ata. W hen d ata is complex and und erstand ing
the micro-d etails is essential,the b est w ay is to analyze d ata throug h visuals.

Visuals can b e used for tw o purposes:

1 . E xploratory d ata analysis: T his is used b y d ata analysts,statisticians,and d ata scientists to b etter
und erstand d ata. As it is rig htly called ,it is used to explore the hid d en trend s,patterns in d ata.

2 . E xplanatory d ata analysis: O nce the analysts und erstand the d ata and find their results,the b est w ay to
convey their id eas and find ing s is throug h visuals! T his is used to craft a story that w ill appeal to the view er
offering d eeper insig hts.
Importance of P lotting
The purpose of plotting scientific data is to visualize variation

or show relationships between variables, but not all data sets

require a plot. If there are only one or two points, it is easy to

examine the numbers directly, and little or nothing is gained by

putting them on a graph. Similarly, if there is no variation in

the data, it is easy enough to see or state the fact without using

a graph of any sort.


Line P lot

Line plots is a graph that is used for the representation of

continuous data points on a number line. Line plots are

created by first plotting data points on the Cartesian plane

then joining those points with a number line. Line plots

can help display data points for both single variable

analysis as well as multiple variable analysis.


S catter P lot

A scatter plot (also called a scatterplot, scatter g raph,

scatter chart, scatterg ram, or scatter d iag ram) is a type of

plot or mathematical d iag ram using Cartesian coord inates

to d isplay values for typically tw o variab les for a set of

d ata.
S catter P lot for M achine Learning
T hese are the charts/plots that are used to ob serve
and d isplay relationships b etw een variab les using
Cartesian Coord inates. T he values (x: first variab le ,y:
second variab le) of the variab les are represented b y
d ots. S catter plots are also know n as scatterg rams,
scatter g raphs, scatter charts ,or scatter d iag rams. It
is b est suited for situations w here the d epend ent
variab le can have multiple values for the ind epend ent
variab le.
B ar Chart

B ar charts are b est suited for the visualization of

categ orical d ata b ecause they allow you to easily see the

d ifference b etw een feature values b y measuring the

size(leng th) of the b ars. T here are 2 types of b ar charts

d epend ing upon their orientation (i.e. vertical or

horizontal).
G rouped B ar Chart
B ox P lot

B ox-plots tell us ab out the d istrib ution of d ata and scan


for outliers.

Also notice that even thoug h the numb er of nod es is a


more useful feature,there is some overlap w ith b oth the
classes.
Violin-P lot

A violin plot is a hyb rid of a b ox plot and a kernel d ensity plot,


w hich show s peaks in the d ata. It is used to visualize the
d istrib ution of numerical d ata.

U nlike a b ox plot that can only show summary statistics,


violin plots d epict summary statistics and the d ensity of each
variab le.
A heat map is a tw o-d imensional
representation of d ata in w hich values are
represented b y colors. A simple heat map
provid es an immed iate visual summary of
information. M ore elab orate heat maps
allow the view er to und erstand complex
d ata sets.
H eatmap

H eatmaps are the g raphical representation of


d ata w here each value is represented in a matrix
w ith d ifferent color cod ing . M ostly heatmaps are
used to find correlations b etw een various d ata
columns in a d ataset.
Speaker Details
Email:[email protected]
[email protected]
Mobile No: 8105895179
LinkedIn: https://in.linkedin.com/in/pramod.kn

You might also like