Scholars@Duke publication: A General Framework for Data-Use Auditing of ML Models

A General Framework for Data-Use Auditing of ML Models

Publication , Conference

Huang, Z; Gong, NZ; Reiter, MK

Published in: Ccs 2024 Proceedings of the 2024 ACM Sigsac Conference on Computer and Communications Security

December 9, 2024

Auditing the use of data in training machine-learning (ML) models is an increasingly pressing challenge, as myriad ML practitioners routinely leverage the effort of content creators to train models without their permission. In this paper, we propose a general method to audit an ML model for the use of a data-owner’s data in training, without prior knowledge of the ML task for which the data might be used. Our method leverages any existing black-box membership inference method, together with a sequential hypothesis test of our own design, to detect data use with a quantifiable, tunable false-detection rate. We show the effectiveness of our proposed framework by applying it to audit data use in two types of ML models, namely image classifiers and foundation models.

Duke Scholars

Author Michael Reiter Computer Science

Published In

Ccs 2024 Proceedings of the 2024 ACM Sigsac Conference on Computer and Communications Security

DOI

10.1145/3658644.3690226

Publication Date

December 9, 2024

Start / End Page

1300 / 1314

Citation

APA

Chicago

ICMJE

MLA

NLM

Huang, Z., Gong, N. Z., & Reiter, M. K. (2024). A General Framework for Data-Use Auditing of ML Models. In Ccs 2024 Proceedings of the 2024 ACM Sigsac Conference on Computer and Communications Security (pp. 1300–1314). https://doi.org/10.1145/3658644.3690226

Huang, Z., N. Z. Gong, and M. K. Reiter. “A General Framework for Data-Use Auditing of ML Models.” In Ccs 2024 Proceedings of the 2024 ACM Sigsac Conference on Computer and Communications Security, 1300–1314, 2024. https://doi.org/10.1145/3658644.3690226.

Huang Z, Gong NZ, Reiter MK. A General Framework for Data-Use Auditing of ML Models. In: Ccs 2024 Proceedings of the 2024 ACM Sigsac Conference on Computer and Communications Security. 2024. p. 1300–14.

Huang, Z., et al. “A General Framework for Data-Use Auditing of ML Models.” Ccs 2024 Proceedings of the 2024 ACM Sigsac Conference on Computer and Communications Security, 2024, pp. 1300–14. Scopus, doi:10.1145/3658644.3690226.

Huang Z, Gong NZ, Reiter MK. A General Framework for Data-Use Auditing of ML Models. Ccs 2024 Proceedings of the 2024 ACM Sigsac Conference on Computer and Communications Security. 2024. p. 1300–1314.

Published In

Ccs 2024 Proceedings of the 2024 ACM Sigsac Conference on Computer and Communications Security

DOI

10.1145/3658644.3690226

Publication Date

December 9, 2024

Start / End Page

1300 / 1314