CSC 522 Lecture 3
Aggregation
Sampling
Feature subset selection
Dimensionality reduction
Feature creation
Discretization
Stratified sampling
▶ Sample from each group
▶ Ensures all groups are represented
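A minimal sketch of stratified sampling, assuming records are tuples whose first element is the group label. The function name stratified_sample, the fraction parameter, and the rule of taking at least one record per group are illustrative assumptions, not part of the lecture.

```python
import random
from collections import defaultdict

def stratified_sample(records, key, fraction, seed=0):
    """Draw roughly the same fraction from every group (hypothetical helper)."""
    rng = random.Random(seed)

    # Partition the records by group label.
    groups = defaultdict(list)
    for record in records:
        groups[key(record)].append(record)

    # Sample within each group; max(1, ...) keeps small groups represented.
    sample = []
    for members in groups.values():
        k = max(1, round(fraction * len(members)))
        sample.extend(rng.sample(members, k))
    return sample

# Toy data: group "a" has 90 records, group "b" has only 10.
records = [("a", i) for i in range(90)] + [("b", i) for i in range(10)]
sample = stratified_sample(records, key=lambda r: r[0], fraction=0.1)
```

A simple random sample of 10 records from this data could easily contain no "b" records at all; sampling within each group rules that out.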
Progressive sampling
▶ Start with a small sample
▶ Progressively increase the sample size until it is sufficient
▶ Eliminates the need to determine the sample size
▶ Requires a way to evaluate the sample and judge if it’s large enough
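The steps above can be sketched as a loop. This is one possible reading, not the lecture's own procedure: the doubling schedule, the tolerance tol, and the stopping rule (stop when the evaluation score changes by at most tol between consecutive sizes) are all assumptions, and progressive_sample is a hypothetical name. The evaluate argument stands for whatever measure is used to judge the sample, here simply the sample mean.

```python
import random
import statistics

def progressive_sample(data, evaluate, start=10, growth=2, tol=0.01, seed=0):
    """Grow a sample until evaluate() stabilizes (assumed stopping rule)."""
    rng = random.Random(seed)
    shuffled = data[:]
    rng.shuffle(shuffled)  # a prefix of a shuffled list is a random sample

    size = min(start, len(shuffled))
    previous = evaluate(shuffled[:size])
    while size < len(shuffled):
        size = min(size * growth, len(shuffled))
        current = evaluate(shuffled[:size])
        # Judge the sample large enough once the score stops moving.
        if abs(current - previous) <= tol:
            return shuffled[:size]
        previous = current
    return shuffled  # never stabilized: fall back to the full data set

# Toy data: 10,000 draws from a normal distribution.
rng = random.Random(1)
data = [rng.gauss(50, 10) for _ in range(10_000)]
sample = progressive_sample(data, evaluate=statistics.mean, tol=0.1)
print(len(sample))
```

In practice evaluate would more likely be a model's accuracy on held-out data, with the loop stopping once the learning curve flattens.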
y = f(X)
Data:
1 2 4 4 6 25 30 80 100
Split 1