ScrambleDB: Oblivious (Chameleon) Pseudonymization-as-a-Service

Pseudonymization is a widely deployed technique to de-sensitize data sets by consistently replacing identifying attributes with non-sensitive surrogates. However, all existing solutions are impractical to deploy in settings where data is accumulated from distributed sources: they either require shar...

Full description

Bibliographic Details
Main Author: Lehmann Anja
Format: Article
Language:English
Published: Sciendo 2019-07-01
Series:Proceedings on Privacy Enhancing Technologies
Subjects:
Online Access:https://doi.org/10.2478/popets-2019-0048
id doaj-6f3325dea5e44a0588f593e39736231f
record_format Article
spelling doaj-6f3325dea5e44a0588f593e39736231f2021-09-05T14:01:10ZengSciendoProceedings on Privacy Enhancing Technologies2299-09842019-07-012019328930910.2478/popets-2019-0048popets-2019-0048ScrambleDB: Oblivious (Chameleon) Pseudonymization-as-a-ServiceLehmann Anja0IBM Research – ZurichPseudonymization is a widely deployed technique to de-sensitize data sets by consistently replacing identifying attributes with non-sensitive surrogates. However, all existing solutions are impractical to deploy in settings where data is accumulated from distributed sources: they either require sharing the same secret key with all sources, or rely on a fully trusted service to consistently compute these pseudonyms. Further, the consistency of pseudonyms, which is required to maintain the data’s utility, comes with inherent and severe privacy limitations. This paper solves the key management and privacy challenges by introducing oblivious pseudonymization-as-a-service. Therein, the pseudonymization is outsourced to a central, yet fully oblivious entity, i.e., the service neither learns the sensitive information nor the pseudonyms it produces. Further, to obtain better privacy we no longer require pseudonyms to be computed consistently and instead introduce a dedicated join procedure. When data is stored at rest, all data is pseudonymized in a fully unlinkable manner. Only when certain subsets of the data are needed, the linkage is established through a controlled and nontransitive join operation. We formally define the desired security properties in the UC framework and propose a generic protocol that provably satisfies them. The core of our scheme is a 3-party oblivious and convertible PRF, which we believe to be of independent interest.https://doi.org/10.2478/popets-2019-0048pseudonymizationprivacyoprf
collection DOAJ
language English
format Article
sources DOAJ
author Lehmann Anja
spellingShingle Lehmann Anja
ScrambleDB: Oblivious (Chameleon) Pseudonymization-as-a-Service
Proceedings on Privacy Enhancing Technologies
pseudonymization
privacy
oprf
author_facet Lehmann Anja
author_sort Lehmann Anja
title ScrambleDB: Oblivious (Chameleon) Pseudonymization-as-a-Service
title_short ScrambleDB: Oblivious (Chameleon) Pseudonymization-as-a-Service
title_full ScrambleDB: Oblivious (Chameleon) Pseudonymization-as-a-Service
title_fullStr ScrambleDB: Oblivious (Chameleon) Pseudonymization-as-a-Service
title_full_unstemmed ScrambleDB: Oblivious (Chameleon) Pseudonymization-as-a-Service
title_sort scrambledb: oblivious (chameleon) pseudonymization-as-a-service
publisher Sciendo
series Proceedings on Privacy Enhancing Technologies
issn 2299-0984
publishDate 2019-07-01
description Pseudonymization is a widely deployed technique to de-sensitize data sets by consistently replacing identifying attributes with non-sensitive surrogates. However, all existing solutions are impractical to deploy in settings where data is accumulated from distributed sources: they either require sharing the same secret key with all sources, or rely on a fully trusted service to consistently compute these pseudonyms. Further, the consistency of pseudonyms, which is required to maintain the data’s utility, comes with inherent and severe privacy limitations. This paper solves the key management and privacy challenges by introducing oblivious pseudonymization-as-a-service. Therein, the pseudonymization is outsourced to a central, yet fully oblivious entity, i.e., the service neither learns the sensitive information nor the pseudonyms it produces. Further, to obtain better privacy we no longer require pseudonyms to be computed consistently and instead introduce a dedicated join procedure. When data is stored at rest, all data is pseudonymized in a fully unlinkable manner. Only when certain subsets of the data are needed, the linkage is established through a controlled and nontransitive join operation. We formally define the desired security properties in the UC framework and propose a generic protocol that provably satisfies them. The core of our scheme is a 3-party oblivious and convertible PRF, which we believe to be of independent interest.
topic pseudonymization
privacy
oprf
url https://doi.org/10.2478/popets-2019-0048
work_keys_str_mv AT lehmannanja scrambledbobliviouschameleonpseudonymizationasaservice
_version_ 1717810705065836544