Data Visualization
Data Analysis
The process of inspecting, cleansing, transforming, and modeling data to support decision-making. [Wikipedia]
• Data analysis has the objective of answering a well-defined research statement/problem:
Prediction: given Xi, can we predict Y?
Classification of Data
Data → Qualitative / Quantitative
• Categorical Data
Nominal: gender, blood group, etc.
Ordinal: low, high, severe.
Binary: '1' or '0'.
• Quantitative Data
o discrete:
• no. of computers in each department.
• no. of items you buy at the grocery every month.
o continuous → histograms (single variable), scatter plot (two variables)
eg:
• Age, height, weight.
• Blood pressure, blood flow.
• Temperature, pressure.
NOTE: Choose the statistical analysis based on the type of data.
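As a rough illustration of the note above, the sketch below picks a chart type from a column's data type. The toy DataFrame, its column names, and the helper `suggest_chart` are hypothetical and exist only for this example; the rule it applies (numeric columns get a histogram, everything else a bar chart) is a simplification, not a general recipe.

```python
import pandas as pd

# Hypothetical toy data; the column names are illustrative only.
df = pd.DataFrame({
    "blood_group": ["A", "B", "O", "A"],               # nominal (categorical)
    "severity": ["low", "high", "severe", "low"],      # ordinal (categorical)
    "items_bought": [12, 30, 7, 18],                   # discrete (quantitative)
    "height_cm": [160.5, 172.0, 181.3, 168.9],         # continuous (quantitative)
})

def suggest_chart(series):
    """Suggest a chart for one column based on its data type (simplified rule)."""
    if pd.api.types.is_numeric_dtype(series):
        return "histogram"   # quantitative data
    return "bar chart"       # categorical data

suggestions = {name: suggest_chart(df[name]) for name in df.columns}
print(suggestions)
```

In practice the choice also depends on the question being asked, so this mapping is best treated as a starting point.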
Why visualize data?
→ for easy interpretation.
3. Seaborn
Types of charts
A. Line chart:
• shows values over a period of time.
• each value is plotted on the chart, then the points are connected to display a trend over the compared time span.
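The description above can be sketched with Matplotlib as follows. The monthly figures are made up for illustration, and the axis labels are assumptions rather than anything taken from these notes.

```python
import matplotlib.pyplot as plt

# Hypothetical monthly values, used only for illustration.
months = ["Jan", "Feb", "Mar", "Apr", "May", "Jun"]
sales = [120, 135, 128, 150, 162, 158]

fig, ax = plt.subplots()
# Plot each value as a point and connect the points to show the trend.
ax.plot(months, sales, marker="o")
ax.set_xlabel("Month")
ax.set_ylabel("Sales")
ax.set_title("Sales over time")
```

Calling `plt.show()` afterwards displays the figure in an interactive session.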
B. Histogram:
• graphical representation of data using bars of different heights.
• each bar groups numbers into ranges, and the height of each bar depicts the frequency of each range (or bin).
• When to use?
o to represent the frequency distribution of numerical variables. (movies data.csv)
• Matplotlib implementation:
Better visualization:
• Seaborn implementation:
C. Bar chart
• presents categorical data with rectangular bars with lengths proportional to the counts that they represent.
• Horizontal bar charts are a good option when you have a lot of bars to plot, or the labels on them require additional space to be legible.
• Matplotlib implementation:
• Seaborn implementation:
syntax: sns.countplot(x=<variable>, data=<dataframe>)
o sns.countplot: name of the function
o x: the variable you want to map on the x-axis
o data: name of the dataframe
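The following sketch shows one way to draw the same counts with both libraries. The DataFrame and its `genre` column are hypothetical; the Matplotlib panel uses a horizontal bar chart to illustrate the point about label space.

```python
import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns

# Hypothetical categorical data; the column name "genre" is an assumption.
df = pd.DataFrame(
    {"genre": ["Drama", "Comedy", "Drama", "Action", "Comedy", "Drama"]}
)

fig, (ax_mpl, ax_sns) = plt.subplots(1, 2, figsize=(10, 4))

# Matplotlib: count the categories first, then draw horizontal bars.
genre_counts = df["genre"].value_counts()
ax_mpl.barh(genre_counts.index, genre_counts.values)
ax_mpl.set_xlabel("Count")
ax_mpl.set_title("Matplotlib (horizontal)")

# Seaborn: countplot() does the counting itself.
sns.countplot(x="genre", data=df, ax=ax_sns)
ax_sns.set_title("Seaborn countplot")
```

With Matplotlib the counting is an explicit step, whereas `sns.countplot` takes the raw column and counts internally.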
D. Scatter plot
• displays values of two numeric variables using points positioned on two axes: one for each variable.
• the relationship shown can be positive linear, negative linear, or non-linear.
• helps in identifying outliers.
• Matplotlib implementation:
• Seaborn implementation:
Syntax: sns.regplot(x, y, marker="*", fit_reg=False)
o fit_reg is True by default.
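A short sketch of both scatter plot implementations, assuming synthetic height and weight values with a roughly positive linear relationship. The data, variable names, and axis labels are illustrative only.

```python
import matplotlib.pyplot as plt
import numpy as np
import seaborn as sns

# Synthetic data with a positive linear trend plus noise (assumption).
rng = np.random.default_rng(1)
height = rng.normal(170, 10, size=50)
weight = 0.9 * height - 90 + rng.normal(0, 5, size=50)

fig, (ax_mpl, ax_sns) = plt.subplots(1, 2, figsize=(10, 4))

# Matplotlib: one point per (height, weight) pair.
ax_mpl.scatter(height, weight)
ax_mpl.set_xlabel("Height")
ax_mpl.set_ylabel("Weight")
ax_mpl.set_title("Matplotlib")

# Seaborn: fit_reg=False suppresses the regression line drawn by default.
sns.regplot(x=height, y=weight, marker="*", fit_reg=False, ax=ax_sns)
ax_sns.set_xlabel("Height")
ax_sns.set_ylabel("Weight")
ax_sns.set_title("Seaborn regplot")
```

Leaving `fit_reg` at its default adds a fitted regression line, which can help when judging whether the relationship is linear.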
E. Pie chart
• illustrates proportions.
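For completeness, a minimal Matplotlib sketch of a pie chart. The blood group counts are invented for illustration; `autopct` is used here to print each slice's share as a percentage.

```python
import matplotlib.pyplot as plt

# Hypothetical counts per blood group, for illustration only.
labels = ["A", "B", "AB", "O"]
counts = [30, 25, 10, 35]

fig, ax = plt.subplots()
# Each wedge's angle is proportional to its share of the total.
wedges, texts, autotexts = ax.pie(counts, labels=labels, autopct="%1.0f%%")
ax.set_title("Blood group proportions")
```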
References
1. Wikipedia.
2. "Elementary Statistics: Picturing the World", 5th edition.
3. https://visme.co/blog/types-of-graphs/
4. https://chartio.com/learn/charts/essential-chart-types-for-data-visualization/
5. https://www.youtube.com/watch?v=a9UrKTVEeZA
6. https://www.gapminder.org/fw/world-health-chart/