Decision Trees: Assignment
Homework 1
• Apply ID3 to the following data sets from the UCI Machine Learning Repository:
– Car Evaluation
– Census Income
• Apply J48 (a C4.5-like algorithm) to the same
data sets:
– Without pruning
– With rule post-pruning
– With reduced-error pruning
• Perform the following evaluation methods:
– Evaluation on the training data
– Disjoint training-test sets (50%-50%)
– 10-fold cross-validation
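The last two evaluation methods differ only in how the examples are divided into training and test indices. The following is a minimal Python sketch under stated assumptions, not a prescribed solution: the function names `holdout_split` and `k_fold_splits` and the `seed` parameter are hypothetical, and examples are assumed to be addressed by integer index. Evaluation on the training data needs no split at all, since the same examples are used for both training and testing.

```python
import random


def holdout_split(n_examples, train_fraction=0.5, seed=0):
    """Return disjoint (train, test) index lists after shuffling."""
    indices = list(range(n_examples))
    random.Random(seed).shuffle(indices)
    cut = int(round(n_examples * train_fraction))
    return indices[:cut], indices[cut:]


def k_fold_splits(n_examples, k=10, seed=0):
    """Yield (train, test) index lists; each example is tested exactly once."""
    indices = list(range(n_examples))
    random.Random(seed).shuffle(indices)
    # Fold i takes every k-th shuffled index starting at position i.
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        test = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test
```

With cross-validation, the reported accuracy is usually the average over the k test folds, so every example contributes to the estimate exactly once.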
Homework 2
• Implement a decision tree learner based on the ID3
algorithm
– Programming language of your choice
– Interface of your choice
• Implement 3 different splitting functions:
– Random
– Information gain
– Gain ratio
• Your program should be able to:
– read the UCI Car Evaluation data set
– provide the following evaluation methods:
• Holdout (p% for training, (100-p)% for testing)
• 10-fold cross-validation
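The three splitting functions listed above can be sketched as follows. This is an illustrative Python sketch under assumptions, not a reference implementation: all function names are hypothetical, each example is assumed to be a dict mapping attribute names to categorical values, and class labels are kept in a parallel list.

```python
import math
import random
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a list of categorical values."""
    total = len(values)
    return -sum((c / total) * math.log2(c / total)
                for c in Counter(values).values())


def partition(rows, labels, attr):
    """Group the class labels by the value each row takes on attr."""
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(row[attr], []).append(label)
    return groups


def information_gain(rows, labels, attr):
    """Entropy of the labels minus the weighted entropy after the split."""
    total = len(labels)
    remainder = sum(len(g) / total * entropy(g)
                    for g in partition(rows, labels, attr).values())
    return entropy(labels) - remainder


def gain_ratio(rows, labels, attr):
    """Information gain divided by the split information of attr."""
    split_info = entropy([row[attr] for row in rows])
    if split_info == 0:  # attr takes a single value, so it cannot split
        return 0.0
    return information_gain(rows, labels, attr) / split_info


def random_split(attrs, rng=random):
    """Baseline criterion: pick any remaining attribute uniformly."""
    return rng.choice(list(attrs))
```

An ID3-style learner would call one of these at each node and split on the attribute with the highest score, or on the randomly chosen attribute for the baseline. Gain ratio normalizes by split information, which penalizes attributes with many distinct values.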
What to Submit?
• Homework 1:
– A report describing the results and conclusions
• Homework 2:
– The code (both source and executable)
– A report describing what the program does
and the difficulties encountered during implementation