INT233
INT233
Data Joining
Join
we work according to columns
one column should be common
work on attributes
Union
we work according to rows
no. of columns should be same in both the sheets
Data Cleaning
On Columns
Naming Convention
Dimensionality Reduction [ID, Remove Redundancy {Dependent
& Independent column}, Remove Dependent column, Co-
relation]
Handling Definition
On Rows
Remove NA, Null Value
Outliers
Normalization
Subsetting