Lecture2_IntroData
Lecture2_IntroData
Data
Summary – last week
• Last week:
– Course Motivation
– Data Mining basics
• This week:
– Data
Objects
variable, field, characteristic,
4 Yes Married 120K No
dimension, or feature
5 No Divorced 95K Yes
• A collection of attributes
6 No Married 60K No
describe an object
• Object is also known as 7 Yes Divorced 220K No
record, point, case, sample, 8 No Single 85K Yes
entity, or instance 9 No Married 75K No
10 No Single 90K Yes
10
timeout
season
coach
game
score
play
team
win
ball
lost
Document 1 3 0 5 0 2 6 0 2 0 2
Document 2 0 7 0 2 1 0 0 3 0 0
Document 3 0 1 0 0 1 2 2 0 3 0
TID Items
1 Bread, Coke, Milk
2 Beer, Bread
3 Beer, Coke, Diaper, Milk
4 Beer, Bread, Diaper, Milk
5 Coke, Diaper, Milk
Acknowledgment - Thanks to Tan, Steinbach, Karpatne, Kumar for the slides 15
Graph Data
• Examples: Generic graph, a molecule, and webpages
2
5 1
2
5
Items/Events
An element of
the sequence
Acknowledgment - Thanks to Tan, Steinbach, Karpatne, Kumar for the slides 17
Ordered Data
• Spatio-Temporal Data
Average Monthly
Temperature of
land and ocean