Data dissemination and disclosure limitation in a world without microdata: A risk-utility framework for remote access analysis servers

Published

Journal Article

Given the public's ever-increasing concerns about data confidentiality, in the near future statistical agencies may be unable or unwilling, or even may not be legally allowed, to release any genuine microdata, that is, data on individual units such as individuals or establishments. In such a world, an alternative dissemination strategy is remote access analysis servers, to which users submit requests for output from statistical models fit using the data, but are not allowed access to the data themselves. Analysis servers, however, are not free from the risk of disclosure, especially in the face of multiple, interacting queries. We describe these risks and propose quantifiable measures of risk and data utility that can be used to specify which queries can be answered and with what output. The risk-utility framework is illustrated for regression models. © Institute of Mathematical Statistics, 2005.

Cited Authors

  • Gomatam, S; Karr, AF; Reiter, JP; Sanil, AP

Published Date

  • May 1, 2005

Published In

Volume / Issue

  • 20 / 2

Start / End Page

  • 163 - 177

International Standard Serial Number (ISSN)

  • 0883-4237

Digital Object Identifier (DOI)

  • 10.1214/088342305000000043

Citation Source

  • Scopus