Data Privacy
Data Privacy
Data mining
Data mining is the process of analyzing data from different
perspectives and summarizing it into useful information
It allows users to analyze data from different dimensions or
angles
Data mining (also known as Knowledge Discovery in
Databases) is the nontrivial extraction of implicit, previously
unknown, and potentially useful information from database.
Generalization
publish more general values, i.e., given a domain hierarchy, roll-up
Suppression
remove tuples, i.e., do not publish outliers
often the number of suppressed tuples is bounded
MONDRIAN
It is multidimensional ,local recoding technique
Splits d-dimensional space into two partitions
Terminates when group contains <2k records
TOPDOWN
start with the entire data set
iteratively split in two reminiscent of R-tree quadratic split
continue until left with groups which contain <2k-1 tuples
k-anonymity
join
JKA
x3
x3 x3