Privacy issues in scientific workflow provenance


Other Article

A scientific workflow often deals with proprietary modules as well as private or confidential data, such as health or medical information. Hence providing exact answers to provenance queries over all executions of the workflow may reveal private information. In this paper we first study the potential privacy issues in a scientific workflow - module privacy, data privacy, and provenance privacy, and frame several natural questions: (i) can we formally analyze module, data or provenance privacy giving provable privacy guarantees for an unlimited/bounded number of provenance queries? (ii) how can we answer provenance queries, providing as much information as possible to the user while still guaranteeing the required privacy? Then we look at module privacy in detail and propose a formal model from our recent work in [11]. Finally we point to several directions for future work. © 2010 ACM.

Full Text

Duke Authors

Cited Authors

  • Davidson, SB; Khanna, S; Roy, S; Boulakia, SC

Published Date

  • January 1, 2010

Published In

International Standard Serial Number (ISSN)

  • 0730-8078

International Standard Book Number 13 (ISBN-13)

  • 9781450301886

Digital Object Identifier (DOI)

  • 10.1145/1833398.1833401

Citation Source

  • Scopus