Paper 2024/2040

Verified Foundations for Differential Privacy

Markus de Medeiros, New York University
Muhammad Naveed, Amazon
Tancrède Lepoint, Amazon
Temesghen Kahsai, Amazon
Tristan Ravitch, Amazon
Stefan Zetzsche, Amazon
Anjali Joshi, Amazon
Joseph Tassarotti, New York University
Aws Albarghouthi, Amazon
Jean-Baptiste Tristan, Amazon
Abstract

Differential privacy (DP) has become the gold standard for privacy-preserving data analysis, but implementing it correctly has proven challenging. Prior work has focused on verifying DP at a high level, assuming the foundations are correct and a perfect source of randomness is available. However, the underlying theory of differential privacy can be very complex and subtle. Flaws in basic mechanisms and random number generation have been a critical source of vulnerabilities in real-world DP systems. In this paper, we present SampCert, the first comprehensive, mechanized foundation for differential privacy. SampCert is written in Lean with over 12,000 lines of proof. It offers a generic and extensible notion of DP, a framework for constructing and composing DP mechanisms, and formally verified implementations of Laplace and Gaussian sampling algorithms. SampCert provides (1) a mechanized foundation for developing the next generation of differentially private algorithms, and (2) mechanically verified primitives that can be deployed in production systems. Indeed, SampCert’s verified algorithms power the DP offerings of Amazon Web Services (AWS), demonstrating its real-world impact. SampCert’s key innovations include: (1) A generic DP foundation that can be instantiated for various DP definitions (e.g., pure, concentrated, Rényi DP); (2) formally verified discrete Laplace and Gaussian sampling algorithms that avoid the pitfalls of floating-point implementations; and (3) a simple probability monad and novel proof techniques that streamline the formalization. To enable proving complex correctness properties of DP and random number generation, SampCert makes heavy use of Lean’s extensive Mathlib library, leveraging theorems in Fourier analysis, measure and probability theory, number theory, and topology.

Metadata
Available format(s)
PDF
Category
Implementation
Publication info
Preprint.
Keywords
Differential privacyDPFormal verification
Contact author(s)
mjd9606 @ nyu edu
naveedmd @ amazon com
tlepoint @ amazon com
teme @ amazon com
travitch @ amazon com
stefanze @ amazon co uk
anjalijs @ amazon com
jt4767 @ nyu edu
awsb @ amazon com
trjohnb @ amazon com
History
2024-12-18: approved
2024-12-18: received
See all versions
Short URL
https://ia.cr/2024/2040
License
Creative Commons Attribution
CC BY

BibTeX

@misc{cryptoeprint:2024/2040,
      author = {Markus de Medeiros and Muhammad Naveed and Tancrède Lepoint and Temesghen Kahsai and Tristan Ravitch and Stefan Zetzsche and Anjali Joshi and Joseph Tassarotti and Aws Albarghouthi and Jean-Baptiste Tristan},
      title = {Verified Foundations for Differential Privacy},
      howpublished = {Cryptology {ePrint} Archive, Paper 2024/2040},
      year = {2024},
      url = {https://eprint.iacr.org/2024/2040}
}
Note: In order to protect the privacy of readers, eprint.iacr.org does not use cookies or embedded third party content.