PPDM - 3 Input (Part1)
3 – Input (part1)
Dr. Kambiz Ghazinour
Fall 2016
Kent State University
© 2015 Department of Computer Science/Kambiz Ghazinour
Input: Concepts, instances, attributes
● What’s a concept?
  Classification, association, clustering, numeric prediction
● What’s in an example?
  Relations, flat files, recursion
● What’s in an attribute?
  Nominal, ordinal, interval, ratio
● Preparing the input
  ARFF, attributes, missing values, getting to know data
● Outcome is called the class of the example
● Measure success on fresh data for which class labels are known (test data)
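A minimal sketch of measuring success on test data, assuming a deliberately simple learner (one rule per value of a single attribute, predicting the majority class). The instances, the attribute choice, and the helper names `fit_one_attribute` and `accuracy` are hypothetical and only illustrate the idea that the class labels of the test instances are known but are not used for learning.

```python
from collections import Counter, defaultdict

# Hypothetical labeled instances: ((outlook, windy), class label).
train = [
    (("sunny", False), "no"),
    (("sunny", True), "no"),
    (("overcast", False), "yes"),
    (("overcast", True), "yes"),
    (("rainy", False), "yes"),
    (("rainy", False), "yes"),
    (("rainy", True), "no"),
]

# Fresh instances whose class labels are known but were not used for learning.
test = [
    (("sunny", False), "no"),
    (("overcast", False), "yes"),
    (("rainy", True), "no"),
    (("sunny", True), "yes"),
]


def fit_one_attribute(instances, attr_index):
    """For each value of one attribute, predict the majority class seen in training."""
    counts = defaultdict(Counter)
    for attrs, label in instances:
        counts[attrs[attr_index]][label] += 1
    return {value: c.most_common(1)[0][0] for value, c in counts.items()}


def accuracy(rule, instances, attr_index, default):
    """Fraction of instances whose predicted class equals the known class."""
    correct = sum(
        rule.get(attrs[attr_index], default) == label
        for attrs, label in instances
    )
    return correct / len(instances)


rule = fit_one_attribute(train, attr_index=0)  # learn from outlook only
test_accuracy = accuracy(rule, test, attr_index=0, default="yes")
print(rule, test_accuracy)
```

The accuracy is computed only on `test`, so it estimates how the rule behaves on data it has not seen.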
rules
● Measure success on test data
Outlook   Temperature  Humidity  Windy  Play-time
Sunny     Hot          High      False  5
Sunny     Hot          High      True   0
Overcast  Hot          High      False  55
Rainy     Mild         Normal    False  40
…         …            …         …      …
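A small sketch of how the rows above can be treated as instances with a numeric class, and how success might be measured on test data. It uses only the four rows shown; holding out the last row as "test data" and using the training mean as the predictor are assumptions made purely for illustration, not a method from the slides.

```python
from statistics import mean

# The four instances shown in the table; the last field is the numeric class.
rows = [
    ("Sunny", "Hot", "High", False, 5),
    ("Sunny", "Hot", "High", True, 0),
    ("Overcast", "Hot", "High", False, 55),
    ("Rainy", "Mild", "Normal", False, 40),
]

# Assumption: hold out the last row as test data.
train, test = rows[:3], rows[3:]

# Baseline predictor: always predict the mean play-time seen in training.
baseline = mean(r[-1] for r in train)


def mean_absolute_error(prediction, instances):
    """Average absolute difference between the prediction and the true values."""
    return mean(abs(prediction - r[-1]) for r in instances)


mae = mean_absolute_error(baseline, test)
print(baseline, mae)
```

Because the class is numeric, success is expressed as an error magnitude rather than as a fraction of correct labels.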
[Figure: Anna (F), Nikki (F)]
● Denormalization may produce spurious regularities that reflect structure of database
  Example: “supplier” predicts “supplier address”
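A minimal sketch of the "supplier" example, assuming two hypothetical relations (`suppliers` and `orders`) that are joined into one flat file. All names and values are invented for illustration. After the join, the rule "supplier predicts supplier address" holds for every row, but only because the address was copied in from the supplier relation.

```python
from collections import Counter, defaultdict

# Hypothetical normalized relations.
suppliers = {"Acme": "12 Main St", "Bolt": "9 Dock Rd"}  # supplier -> address
orders = [("widget", "Acme"), ("gear", "Bolt"), ("nut", "Acme"), ("pin", "Bolt")]

# Denormalize: join the two relations into a single flat file.
flat = [
    {"product": product, "supplier": supplier, "supplier_address": suppliers[supplier]}
    for product, supplier in orders
]


def rule_confidence(rows, lhs, rhs):
    """Fraction of rows whose rhs value is the most common rhs value for their lhs value."""
    by_lhs = defaultdict(Counter)
    for row in rows:
        by_lhs[row[lhs]][row[rhs]] += 1
    hits = sum(c.most_common(1)[0][1] for c in by_lhs.values())
    return hits / len(rows)


confidence = rule_confidence(flat, "supplier", "supplier_address")
print(confidence)
```

A learner run on `flat` would report this rule with perfect confidence, although it says nothing new about the domain.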
● Appropriate techniques are known as “inductive logic programming”