Asking the Proper Question: Adjusting Queries to Statistical Procedures Under Differential Privacy

  • Tomer Shoham*
  • , Yosef Rinott
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

We consider a dataset S held by an agency, and a vector query of interest, f(S) ∈ Rk, to be posed by an analyst, which contains the information required for some planned statistical inference. The agency will release an answer to the queries with noise that guarantees a given level of Differential Privacy using the well-known Gaussian noise addition mechanism. The analyst can choose to pose the original vector query f(S) or to transform the query and adjust it to improve the quality of inference of the planned statistical procedure, such as the volume of a confidence interval or the power of a given test of hypothesis. Previous transformation mechanisms that were studied focused on minimizing certain distance metrics between the original query and the one released without a specific statistical procedure in mind. Our analysis takes the Gaussian noise distribution into account, and it is non-asymptotic. In most of the literature that takes the noise distribution into account, a given query and a given statistic based on the query are considered and the statistic’s asymptotic distribution is studied. In this paper we consider both non-random and random datasets, that is, samples, and our inference is on f(S) itself, or on parameters of the data generating process when S is a random sample. Our main contribution is in proving that different statistical procedures can be strictly improved by applying different specific transformations to queries, and in providing explicit transformations for different procedures in some natural situations.

Original languageEnglish
Title of host publicationPrivacy in Statistical Databases - International Conference, PSD 2022, Proceedings
EditorsJosep Domingo-Ferrer, Maryline Laurent
PublisherSpringer Science and Business Media Deutschland GmbH
Pages46-61
Number of pages16
ISBN (Print)9783031139444
DOIs
StatePublished - 2022
EventInternational Conference on Privacy in Statistical Databases, PSD 2022 - Paris, France
Duration: 21 Sep 202223 Sep 2022

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13463 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceInternational Conference on Privacy in Statistical Databases, PSD 2022
Country/TerritoryFrance
CityParis
Period21/09/2223/09/22

Bibliographical note

Publisher Copyright:
© 2022, Springer Nature Switzerland AG.

Keywords

  • Confidence region
  • Differential privacy
  • Gaussian mechanism
  • Statistical inference on noisy data
  • Testing hypotheses

Fingerprint

Dive into the research topics of 'Asking the Proper Question: Adjusting Queries to Statistical Procedures Under Differential Privacy'. Together they form a unique fingerprint.

Cite this