0% found this document useful (0 votes)
38 views1 page

Data Leakage Detection

The document discusses a method for detecting data leakage without watermarking. It proposes allocating data across agents in ways that improve the ability to identify leaks and leaked data coming from specific agents. This involves not just distributing real data but also injecting fake records to further help identify leaks and the guilty party. Traditionally, leakage is detected through watermarking each distributed copy, but watermarks have limitations and can be removed. The method aims to help data distributors, like hospitals and companies, detect if their sensitive data shared with third parties gets leaked.

Uploaded by

Arjun
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
38 views1 page

Data Leakage Detection

The document discusses a method for detecting data leakage without watermarking. It proposes allocating data across agents in ways that improve the ability to identify leaks and leaked data coming from specific agents. This involves not just distributing real data but also injecting fake records to further help identify leaks and the guilty party. Traditionally, leakage is detected through watermarking each distributed copy, but watermarks have limitations and can be removed. The method aims to help data distributors, like hospitals and companies, detect if their sensitive data shared with third parties gets leaked.

Uploaded by

Arjun
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 1

Data Leakage Detection

Abstract: -
A data distributor has given sensitive data to a set of supposedly trusted agents (third parties).
Some of the data is leaked and found in an unauthorized place (e.g., on the web or laptop). The
distributor must assess the likelihood that the leaked data came from one or more agents, as
opposed to having been independently gathered by other means. We propose data allocation
strategies (across the agents) that improve the probability of identifying leakages. These methods
do not rely on alterations of the released data (e.g., watermarks). In some cases we can also inject
acoerealistic but fake data records to further improve our chances of detecting leakage and
identifying the guilty party.
Traditionally, leakage detection is handled by watermarking, e.g., a unique code is embedded in
each distributed copy. If that copy is later discovered in the hands of an unauthorized party, the
leaker can be identified. Watermarks can be very useful in some cases, but again, involve some
modification of the original data. Furthermore, watermarks can sometimes be destroyed if the data
recipient is malicious. E.g. A hospital may give patient records to researchers who will devise new
treatments. Similarly, a company may have partnerships with other companies that require sharing
customer data. Another enterprise may outsource its data processing, so data must be given to
various other companies. We call the owner of the data the distributor and the supposedly trusted
third parties the agents.
Under The Guidance of

Dr.K Suneetha (Y.ARJUN)

HOD,MCA Department 16121f0046

You might also like