Lecture W6 EDA
Lecture W6 EDA
Program: M.C.A.
Course Code: MCAS9220
Course Name: Data Science Fundamentals
Exploratory Data Analysis
Lecture overview
2. Data Screening
– check for statistical hiccups
NameVoyager
NameMapper
NameVoyager
Is there a pattern?
Data reduction – fewer numbers
• Summarise proportion
27 / 48 children in class A are boys
16 / 23 children in class B are boys
Re-presented: 56% of class A, 69% of class B are boys
• Summarise change
Before: 112, 134, 121, 97
After: 116, 132, 140, 108
Re-presented
Change: 4, -2, 19, 11
Simpler descriptions are better
• Hick's Law
• Choice Reaction Time experiment
• RT increases with number of possible response alternatives
Hick's law
Hick's law
Interpreting EDA
Multiplicity
Interpreting EDA
2. Visualisation
(a) NameVoyager
(b) Bullying data