S09 Notes
Seminar 9:
Descriptive Analysis
Descriptive Statistics
Central
9/21/20
Range
– Difference between the highest and lowest values in the data.
Quartiles
– Three cut points (Q1, Q2, and Q3) that divide the sorted data into four
equal parts; Q2, the 2nd quartile, is the median.
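As a small illustration of range and quartiles, the sketch below uses Python's standard statistics module on a made-up list of values (the numbers are illustrative, not from the seminar data). Several conventions exist for computing quartiles; the "inclusive" method is assumed here, so other tools may return slightly different Q1 and Q3.

```python
import statistics

# Made-up sample values, for illustration only
values = [3, 7, 8, 5, 12, 14, 21, 13, 18]

# Range: highest value minus lowest value
value_range = max(values) - min(values)

# Quartiles: three cut points that split the sorted data into four parts.
# method="inclusive" is one of several conventions (an assumption here).
q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")

print("Range:", value_range)
print("Q1, Q2, Q3:", q1, q2, q3)
print("Median:", statistics.median(values))  # same value as Q2
```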
Variance
– Measures how widely the data spread out from the center.
– Average squared difference between each value of a variable and the mean.
– Denotes the variability.
Standard Deviation
– Square root of variance.
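The two definitions above can be worked through by hand. The sketch below follows the "average squared difference" wording, which divides by the number of values (the population variance). The sample values are made up for illustration. Note that some tools, pandas included, divide by n - 1 by default (the sample variance), so their results can differ from this calculation.

```python
import math
import statistics

# Made-up sample values, for illustration only
values = [2, 4, 4, 4, 5, 5, 7, 9]

# Mean of the values
mean = sum(values) / len(values)

# Variance: average of the squared differences from the mean
squared_diffs = [(x - mean) ** 2 for x in values]
variance = sum(squared_diffs) / len(values)

# Standard deviation: square root of the variance
std_dev = math.sqrt(variance)

print("Mean:", mean)
print("Variance:", variance)
print("Standard deviation:", std_dev)

# The standard library's population versions should agree
print(statistics.pvariance(values), statistics.pstdev(values))
```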
Skewness
variables.
Data Indexing
• Indexing refers to the position of a subset of data within an iterable
structure.
• Iterable means loopable: you can write a for-loop to go from one element
to the next.
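A minimal sketch of both ideas, using a made-up list and string (the names letters and word are illustrative). Python positions start at 0, and a slice stops just before its end position.

```python
# A list and a string are both iterable structures
letters = ["a", "b", "c", "d", "e"]
word = "pandas"

# Indexing: pick one element by position (counting starts at 0)
first = letters[0]      # first element
last = letters[-1]      # negative positions count from the end

# Slicing: pick a subset; the end position is not included
middle = letters[1:4]   # positions 1, 2, and 3
prefix = word[:3]       # first three characters of the string

# Iterating: a for-loop visits the elements one after another
visited = []
for item in letters:
    visited.append(item)

print(first, last, middle, prefix, visited)
```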
String Revisit
import pandas as pd
# Load OnlineRetail.csv into a DataFrame (the file must be in the working directory)
df = pd.read_csv("OnlineRetail.csv")
How can you get data from multiple rows and multiple columns?
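One possible answer, sketched with pandas: .loc selects by row and column labels, and .iloc selects by integer positions. The small DataFrame and its column names below are hypothetical stand-ins built in memory, not the actual contents of OnlineRetail.csv.

```python
import pandas as pd

# Hypothetical stand-in data; column names are assumptions for illustration
df = pd.DataFrame({
    "InvoiceNo": ["A1", "A2", "A3", "A4"],
    "Quantity": [6, 8, 2, 32],
    "UnitPrice": [2.55, 3.39, 7.65, 1.85],
    "Country": ["UK", "UK", "France", "UK"],
})

# Label-based: rows labeled 1 through 2 (end label included),
# columns chosen by name
by_label = df.loc[1:2, ["Quantity", "UnitPrice"]]

# Position-based: rows at positions 1 and 2, columns at positions 1 and 2
# (end position not included)
by_position = df.iloc[1:3, 1:3]

print(by_label)
print(by_position)
```

Both selections return the same two rows and two columns here; the difference is that .loc includes the end label while .iloc excludes the end position.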