
DATA PRIVACY

Fatih Turkmen, PhD


E-mail : [email protected]

Some slides are based on Dwork&Roth’s book (The Algorithmic Foundations of Differential Privacy),
Machanavajjhala et al.’s SIGMOD’17 tutorial and Takahashi’s Slides (Data Science with Privacy at Scale).
TODAY

• Introduction to Differential Privacy (DP)


• DP Formalization
• Algorithms for DP
• Laplace
• Randomized Response
• Exponential Mechanism
• Compositions
• Shuffle Model
• Privacy-preserving ML with DP

2
DIFFERENTIAL PRIVACY 𝜺

• Lots of publicly available data, many statistical studies…


• SEE THIS: https://nvlpubs.nist.gov/nistpubs/SpecialPublications/NIST.SP.800-226.ipd.pdf

• Concrete Examples:
  • Medical records of a Governor
  • IMDB + Netflix → user identification
  • Individual identification (i.e. privacy violation) from AOL search queries

[Figure: a DB with sensitive data supports statistical inference of useful info (e.g. statistics), but also individual identification.]

• Paradoxical situation: Learning nothing about an individual while


learning useful information about a population
3
DIFFERENTIAL PRIVACY: DEFINITION 𝜺

• Differential privacy aims to solve the problem:


“The risk associated with privacy violation of an individual should not
substantially increase as a result of participating in a statistical database”

• Here we need an algorithm/mechanism K (a differentially private mechanism) that, for all pairs of very similar data sets D and D’, behaves approximately the same on both data sets: if K(D) = X and K(D’) = Y, then X and Y should be indistinguishable.

• Most common method: Add controlled noise to data (e.g. Laplace,


Gaussian..) but there are many others...

4
DIFFERENTIAL PRIVACY FORMALIZATION 𝜺

• A randomized function K gives ε-differential privacy if, for all data sets D and D′ differing in one entry and for all S ⊆ Range(K):

Pr[K(D) ∈ S] ≤ e^ε · Pr[K(D′) ∈ S]

in other words, for any output O ∈ S as above:

Pr[K(D_k) = O] / Pr[K(D_{k∓1}) = O] ≤ e^ε
What is the relation
between ε and privacy?
The larger ε means more or
less privacy?
5
VALUES OF 𝜀 𝜺

Figure borrowed from https://nvlpubs.nist.gov/nistpubs/SpecialPublications/NIST.SP.800-226.ipd.pdf

6
DIFFERENTIAL PRIVACY FORMALIZATION (CONT.) 𝜺

Pr[K(D) = O] ≤ exp(ε) · Pr[K(D’) = O]

Intuition: By looking at the output O, the adversary


should not be able to make a distinction between
different (even if very close!) inputs D and D’.

Intuition: ε gives a means to control the distinction


between D and D’.

7
EXAMPLE

• Let’s start with an example:

Suppose you have access to a database that allows you to


compute the total income of all residents in a certain area. If
you knew that Mr. White was going to move to that area
from his current location, simply querying this database before
and after his move would allow you to deduce his income.

Example by the courtesy of http://research.neustar.biz/2014/09/08/differential-privacy-the-basics/

8
PUBLIC DATABASES

9
PUBLIC DATABASES

10
B ACK TO EXAMPLE

• Assume the table represents the residents of the selected region (where Mr White is going to move).
  → 100 residents (+1 with Mr White)

• The adversary has a query mechanism Q(i) that returns the sum of income up to the given row i. Run Q before and after Mr White moves…

• Q(101) – Q(100) = Mr White’s income (see the sketch below)

  Id  Name             Income
  1   John Malkovich   80K
  2   Jamal Malik      90K
  3   Amelie Steiner   100K
  4   Mirko Stanavic   75K
  5   Mike Stanley     140K
  6   Mehmet Uzun      90K
  7   Stijn Neuer      60K
  ..  ..               ..
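A minimal sketch of this differencing attack in Python (the income values and the list name are illustrative assumptions, not data from the slides):

# Hypothetical data: incomes of the 100 original residents (values are illustrative).
incomes = [80_000, 90_000, 100_000, 75_000, 140_000, 90_000, 60_000] + [90_000] * 93
mr_white_income = 120_000                    # the value the attacker wants to learn

def Q(rows, i):
    # Prefix-sum query: total income of the first i rows.
    return sum(rows[:i])

before = incomes                             # database before the move (100 rows)
after = incomes + [mr_white_income]          # database after the move (101 rows)

# Differencing attack: two exact answers reveal one individual's value.
leaked = Q(after, 101) - Q(before, 100)
print(leaked)                                # 120000, exactly Mr White's income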

11
B ACK TO EXAMPLE

• If K behaves as expected, then we have a guarantee that whether an individual is in a given data set or not will not affect the outcome of a query significantly.

• Q(5) = 485K, Q(6) = 575K

• K(Q(5)) = X, K(Q(6)) = Y

• How do we define X and Y?

  Id  Name             Income
  1   John Malkovich   80K
  2   Jamal Malik      90K
  3   Amelie Steiner   100K
  4   Mirko Stanavic   75K
  5   Mike Stanley     140K
  6   Mehmet Uzun      90K
  7   Stijn Neuer      60K
  ..  ..               ..

12
ALGORITHMS FOR K

• Deterministic algorithms do not guarantee differential privacy.

[Figure: a deterministic K maps input D to output O1 and input D’ to output O2, so Pr[K(D’) = O1] = 0.]

The DP requirement

Pr[K(D) = O1] / Pr[K(D’) = O1] ≤ e^ε,  where D, D’ ∈ {Inputs} and O1, O2 ∈ {Outputs},

cannot hold: whenever the deterministic outputs differ, the denominator is 0 and the ratio is unbounded.
Ref: https://courses.cs.duke.edu/fall12/compsci590.3/slides/lec7.pdf 13
ALGORITHMS FOR K

• Random sampling (i.e., selecting a subset of individuals from a population) does not guarantee differential privacy.

K: perform the “aggregate function” over a random sample from D1 or D2.

Some outputs may have probability 0: if producing O requires sampling elements from the difference (e.g., D1 \ D2), then Pr[K(D2) = O] = 0.

Pr[D2 → O] = 0 means Pr[D1 → O] / Pr[D2 → O] = ∞
Ref: https://courses.cs.duke.edu/fall12/compsci590.3/slides/lec7.pdf 14
DIFFERENTIAL PRIVACY FORMALIZATION 𝜺, 𝜹

• What we have seen so far (i.e., ε-differential privacy) is “pure” privacy: 𝛿 = 0.

• Approximate Differential Privacy: a randomized function K gives (ε, 𝛿)-differential privacy (i.e., epsilon/delta privacy) if, for all data sets D and D′ differing in one entry and for all S ⊆ Range(K):

Pr[K(D) ∈ S] ≤ e^ε · Pr[K(D’) ∈ S] + 𝛿

• Pure DP is a bit rigid: even “unlikely events” with very small probabilities (i.e., small Pr[K(D) ∈ S]) must satisfy the multiplicative bound.

• Approximate Differential Privacy: events with probabilities much smaller than 𝛿 (𝛿 ≫ Pr[K(D’) ∈ S]) are effectively disregarded.

15
OUTPUT RANDOMIZATION

[Figure: Database → Query → true answer O → K adds noise 𝜂 → released answer O’ (= O + 𝜂)]

Adding noise to the query result:
• Results do not leak info about the database.
• O’ is very close to O.
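A minimal sketch of this output-randomization pattern (the function and parameter names are illustrative assumptions; later slides show how to calibrate the noise):

def randomized_release(true_answer, sample_noise):
    # Release a noisy answer O' = O + eta instead of the exact answer O.
    # sample_noise is a zero-mean noise sampler; the following slides show how to
    # calibrate it (Laplace noise scaled to the query's sensitivity).
    eta = sample_noise()
    return true_answer + eta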

Ref: https://courses.cs.duke.edu/fall12/compsci590.3/slides/lec7.pdf 16
NOTE ON “ 𝜂 ” (NOISE) 𝜺

• Probability Mass Function (pmf) where X and Y are discrete random variables, i.e., X, Y ∈ {2.6, 2.8, 3.0, 3.3, …}

• Probability Density Function (pdf) where X and Y are continuous random variables, i.e., X and Y lie in a range such as 2.8 ≤ X ≤ 3.0…

• So K is really characterized by the distribution of values in its range, K(D), for each data set it is applied to, i.e., by the noise it adds.

17
FUNCTION SENSITIVITY 𝜺

Pr[K(D) = O] ≤ e^ε · Pr[K(D’) = O]

• K implements an (aggregate) query Q.

• Function/Query Sensitivity: the largest (possible) distance between the query results on neighboring tables:

S(q) = max over neighboring D, D’ of |Q_D − Q_D’|

• In other words, the smallest number such that for any neighboring tables D and D’:

|Q_D − Q_D’| ≤ S(q)

What is the sensitivity of COUNT?

18
FUNCTION SENSITIVITY (CONT.)

Say Income has the range [50K, 200K].

What is the sensitivity of SUM for the income?

That said, summation queries have unbounded sensitivity when no lower and upper bounds exist on the value of the attribute being summed!!

  Id  Name             Income
  1   John Malkovich   80K
  2   Jamal Malik      90K
  3   Amelie Steiner   100K
  4   Mirko Stanavic   75K
  5   Mike Stanley     140K
  6   Mehmet Uzun      90K
  7   Stijn Neuer      60K
  ..  ..               ..

19
FUNCTION SENSITIVITY (CONT.)

So far: sensitivity is taken over any two neighbouring datasets.

We’re going to run our differentially private mechanisms on an actual dataset: shouldn’t we consider neighbors of that dataset?

Fix one of the two datasets to be the actual dataset being queried, and consider all of its neighbours. Pay attention to the parameter x, the “fixed” dataset.

20
DIFFERENTIAL PRIVACY WITH LAPLACE

Let S(q) denote the sensitivity of a query q
(the maximum difference between the values Q_D and Q_D’, for D and D’, a pair of databases that differ in only one row).

• Laplace Mechanism: add controlled noise drawn from Laplace(μ, b):

Lap(x | μ, b) = (1 / (2b)) · exp(−|x − μ| / b)

where
• b is the scale parameter and is set to S(q) / ε (calibrating the noise to the function’s sensitivity)
• μ is the location parameter; it is the offset from the function’s true value (often set to 0)

Courtesy of https://en.wikipedia.org/wiki/Laplace_distribution
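A minimal sketch of the Laplace mechanism in Python (numpy assumed; the function name is illustrative):

import numpy as np

def laplace_mechanism(true_answer, sensitivity, epsilon):
    # Release true_answer + Lap(mu=0, b) with b = S(q) / epsilon.
    b = sensitivity / epsilon
    return true_answer + np.random.laplace(loc=0.0, scale=b)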
21
DIFFERENTIAL PRIVACY WITH LAPLACE (CONT.)

• Now, given the noise 𝜂 drawn from the Laplace distribution, the result of the function is:

K(D) = Q_D + 𝜂

Example: count the number of people with the disease.

  Disease (Y/N)
  Y
  Y
  N
  Y
  N
  N

Solution: 3 + 𝜂 where 𝜂 is drawn from Lap(1 / ε). Why? Because b = S(COUNT) / ε:
- b = 1 / ε, thus the variance is 2 / ε².
- No shift, so μ (mean) is 0.

Courtesy of https://en.wikipedia.org/wiki/Laplace_distribution
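A short usage sketch of this example (numpy assumed; the numbers match the slide):

import numpy as np

disease = ["Y", "Y", "N", "Y", "N", "N"]              # the table on this slide
true_count = sum(1 for d in disease if d == "Y")      # = 3

eps = 1.0
noisy_count = true_count + np.random.laplace(loc=0.0, scale=1.0 / eps)   # 3 + Lap(1/eps)

# Sanity check of the variance claim: Var[Lap(b)] = 2*b**2 = 2 / eps**2
samples = np.random.laplace(loc=0.0, scale=1.0 / eps, size=200_000)
print(round(samples.var(), 2))                        # close to 2.0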
22
RANDOMIZED RESPONSE

Originally intended for reducing bias in survey responses (“Have you committed a crime?”). Mostly used over “Yes/No” (i.e., binary) data aggregation, but can be generalized.

1. Flip a coin
2. If the coin is heads, answer the question truthfully
3. If the coin is tails, flip another coin
4. If the second coin is heads, answer “yes”; if it is tails, answer “no”
(A short code sketch of this scheme follows below.)

[Figure: decision tree. First coin: heads → respond truthfully; tails → flip again: heads → respond “yes”, tails → respond “no”.]

If a “yes” answer (having the property) is incriminating, randomized response still answers “yes” with probability at least 1/4 whether or not the respondent actually has property P.

→ Provides plausible deniability!!
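A minimal sketch of the two-coin scheme above (assuming only Python's standard random module; the function name is illustrative):

import random

def randomized_response(truth):
    # One respondent's answer under the two-coin scheme (True = "yes").
    if random.random() < 0.5:       # first coin: heads -> answer truthfully
        return truth
    return random.random() < 0.5    # second coin: heads -> "yes", tails -> "no"

# Each respondent runs this locally; only the randomized answer is reported.
reported = [randomized_response(t) for t in [True, False, False, True]]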

27
RANDOMIZED RESPONSE

[Table: each person's true Disease value (Y/N) on the left and the reported value on the right. With probability p, report the true value; with probability 1 − p, report the flipped value.]

(A sketch of how an analyst can de-bias the aggregated answers follows below.)
Ref: https://sigmod2017.org/wp-content/uploads/2017/03/04-Differential-Privacy-in-the-wild-1.pdf
28
RANDOMIZED RESPONSE

The privacy here comes from the randomized response process itself (not from output randomization!).

Randomized response (with fair coins) satisfies ε-differential privacy for ε = ln(3) ≈ 1.1: Pr[answer “yes” | truth is “yes”] = 3/4 and Pr[answer “yes” | truth is “no”] = 1/4, so the ratio of these probabilities is at most 3.

The Chrome Web browser has implemented and deployed RAPPOR to collect data about Chrome clients → based on Randomized Response [RAPPOR14].

Example Implementation: https://blog.openmined.org/randomized-response-in-privacy/

[RAPPOR14] Erlingsson et al.: RAPPOR: Randomized Aggregatable Privacy-Preserving Ordinal Response. CCS 2014: 1054-1067
https://arxiv.org/pdf/1407.6981
29
EXPONENTIAL MECHANISM

Laplace/Gaussian → the utility of the response is directly related to the noise values generated; that is, the quantity of interest (e.g., the popularity of a name or condition) is measured on the same scale and in the same units as the magnitude of the noise.
Ø Focus on numerical answers
Ø Focus on numerical answers
Ø Add noise directly to the answer itself.

Exponential Mechanism:
Ø For aggregates that do not return a (real)
number!
Ø When perturbation leads to invalid outputs.

30
EXPONENTIAL MECHANISM

The analyst defines which element is the “best” by specifying a scoring function that outputs a score for each element in the set, and also defines the set ℛ of things to pick from.

[Formula on slide, with the sensitivity of the scoring function highlighted.]

Note: the output of the exponential mechanism is always a member of the set ℛ.
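The formula itself did not survive this text export. A minimal sketch of the standard exponential mechanism, which selects each candidate r ∈ ℛ with probability proportional to exp(ε · score(D, r) / (2 · Δu)), where Δu is the sensitivity of the scoring function (numpy assumed; names and the example data are illustrative):

import numpy as np

def exponential_mechanism(data, candidates, score, delta_u, epsilon):
    # Pick one candidate with probability proportional to exp(eps * score / (2 * delta_u)).
    scores = np.array([score(data, r) for r in candidates], dtype=float)
    weights = np.exp(epsilon * (scores - scores.max()) / (2 * delta_u))  # max-shift for numerical stability
    probs = weights / weights.sum()
    return candidates[np.random.choice(len(candidates), p=probs)]

# Example: privately pick the most popular option from individuals' choices.
votes = ["a", "b", "a", "c", "a"]
options = ["a", "b", "c"]
winner = exponential_mechanism(votes, options, lambda d, r: d.count(r), delta_u=1.0, epsilon=1.0)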

31
EXPONENTIAL MECHANISM

• The mechanism provides differential privacy by approximately maximizing the


score of the element it returns
• To satisfy differential privacy, the exponential mechanism sometimes returns an
element from the set which does not have the highest score.

32
COMPOSABILITY

1. Aggregate functions are often combined with other aggregate functions!

2. Repeatedly computing the same statistic using a DP mechanism will


degrade the protection provided by 𝜺, 𝜹.

Dinur/Nissim Result: the vast majority of records in a database of size n can be reconstructed when n log²(n) queries are answered too accurately by a statistical database…

3. A statistical database must leak some information about each individual in order to provide utility, after all…

33
COMPOSABILITY (CONT.)

Compositions: It is important to be able to reason about privacy


guarantees when complex functions are built from simple building
blocks!
• If building blocks are proven to be private, it would be
easy to reason about privacy of a complex algorithm built
entirely using these building blocks.

If K1, K2, ..., Kk are algorithms that access a private database D such that each Ki satisfies εi-differential privacy, then running all k algorithms sequentially satisfies ε-differential privacy with ε = ε1 + ... + εk.

This is like going shopping… what limits you in your shopping? (Your budget; here, the privacy budget plays the same role.)

34
COMPOSABILITY (CONT.)

Sequential composition (queries over the same data): ε = ε1 + ... + εk

Parallel composition (queries over disjoint subsets of the data): ε = max{ε1, ..., εk}

Courtesy of: https://programming-dp.com/ch6.html
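A minimal budget-accounting sketch of sequential composition (the numbers are illustrative, not from the slides):

total_budget = 1.0               # overall epsilon we are willing to spend
releases = [0.25, 0.25, 0.5]     # epsilon_i spent by each DP query on the same data

spent = sum(releases)            # sequential composition: the epsilons add up
assert spent <= total_budget, "privacy budget exceeded"

# If the queries instead touched disjoint subsets of the data, parallel
# composition would charge only max(releases) = 0.5 against the budget.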

35
LOCAL DP (LDP)

36
CENTRALIZED DP VS LDP

37
CENTRALIZED DP VS LDP VS SHUFFLE

Courtesy of https://blog.openmined.org/differential-privacy-by-shuffling/

38
SHUFFLE MODEL

39
SHUFFLE MODEL (CONT.)

Courtesy of https://speakerdeck.com/line_developers/differential-privacy-data-science-with-privacy-at-scale?slide=56

40
PRIVACY-PRESERVING ML WITH DP [5]

• Not a collaborative setting, centralized training with DP:


• Suitable for applications of machine learning on mobile phones, tablets,
and other devices.
• Storing models on-device enables power-efficient, low-latency inference,
and may contribute to privacy since inference does not require
communicating user data to a central server

• Recall Differential Privacy

41
PRIVACY-PRESERVING DL WITH DP: DP-SGD

• Uses Gaussian noise
• Defined over the per-example loss function (not the aggregate cost)

1. Compute the per-example gradients.

2. Clip each gradient in l2 norm, i.e., replace g by g / max(1, ||g||2 / C).

• The differential privacy guarantee of Algorithm 1 requires bounding the influence of each individual example on gt.

Abadi et al., Deep learning with differential privacy. In Proceedings of the 2016 ACM SIGSAC Conference on
Computer and Communications Security, pages 308–318, 2016

42
PRIVACY-PRESERVING DL WITH DP: DP-SGD

3. Compute the average, while adding noise.

4. Take a step in the opposite direction of this average noisy gradient.
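A minimal one-step sketch of the procedure on these two slides, using plain numpy instead of a deep learning framework (the clipping norm C, noise multiplier sigma, learning rate lr and the toy gradients are illustrative assumptions; see Abadi et al. 2016 for the full algorithm and its privacy accounting):

import numpy as np

def dp_sgd_step(params, per_example_grads, C, sigma, lr, rng):
    # One DP-SGD update: clip each per-example gradient, average, add Gaussian noise, step.
    clipped = [g / max(1.0, np.linalg.norm(g) / C) for g in per_example_grads]  # step 2
    noise = rng.normal(0.0, sigma * C, size=params.shape)                        # Gaussian noise
    noisy_avg = (np.sum(clipped, axis=0) + noise) / len(clipped)                 # step 3
    return params - lr * noisy_avg                                               # step 4

# Toy usage with stand-in gradients (step 1 would come from a real model and loss).
rng = np.random.default_rng(0)
params = np.zeros(3)
per_example_grads = [rng.normal(size=3) for _ in range(8)]
params = dp_sgd_step(params, per_example_grads, C=1.0, sigma=1.1, lr=0.1, rng=rng)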

43
WHAT DID WE LEARN?

• Motivation for Differential Privacy
• Formal Definition
• Algorithms
  • Output Randomization (Laplace), Randomized Response, Exponential Mechanism
• Composability
• Shuffle Model
• Privacy-preserving ML with DP

[Figure: the privacy-utility trade-off: too little utility gives a bad service experience, too little privacy gives bad privacy protection.]

44
