TY - JOUR
T1 - A Framework for Privacy-Preserving Data Publishing with Enhanced Utility for Cyber-Physical Systems
AU - Sangogboye, Fisayo Caleb
AU - Jia, Ruoxi
AU - Hong, Tianzhen
AU - Spanos, Costas
AU - Kjærgaard, Mikkel Baun
PY - 2018/12
Y1 - 2018/12
N2 - Cyber-physical systems have enabled the collection of massive amounts of data in an unprecedented level of spatial and temporal granularity. Publishing these data can prosper big data research, which, in turn, helps improve overall system efficiency and resiliency. The main challenge in data publishing is to ensure the usefulness of published data while providing necessary privacy protection. In our previous work (Jia et al. 2017a), we presented a privacy-preserving data publishing framework (referred to as PAD hereinafter), which can guarantee k-anonymity while achieving better data utility than traditional anonymization techniques. PAD learns the information of interest to data users or features from their interactions with the data publishing system and then customizes data publishing processes to the intended use of data. However, our previous work is only applicable to the case where the desired features are linear in the original data record. In this article, we extend PAD to nonlinear features. Our experiments demonstrate that for various data-driven applications, PAD can achieve enhanced utility while remaining highly resilient to privacy threats.
AB - Cyber-physical systems have enabled the collection of massive amounts of data in an unprecedented level of spatial and temporal granularity. Publishing these data can prosper big data research, which, in turn, helps improve overall system efficiency and resiliency. The main challenge in data publishing is to ensure the usefulness of published data while providing necessary privacy protection. In our previous work (Jia et al. 2017a), we presented a privacy-preserving data publishing framework (referred to as PAD hereinafter), which can guarantee k-anonymity while achieving better data utility than traditional anonymization techniques. PAD learns the information of interest to data users or features from their interactions with the data publishing system and then customizes data publishing processes to the intended use of data. However, our previous work is only applicable to the case where the desired features are linear in the original data record. In this article, we extend PAD to nonlinear features. Our experiments demonstrate that for various data-driven applications, PAD can achieve enhanced utility while remaining highly resilient to privacy threats.
KW - Cyber-physical systems
KW - Deep learning
KW - K-anonymity
KW - Privacy preservation
KW - Smart buildings
U2 - 10.1145/3275520
DO - 10.1145/3275520
M3 - Journal article
SN - 1550-4859
VL - 14
JO - ACM Transactions on Sensor Networks
JF - ACM Transactions on Sensor Networks
IS - 3-4
M1 - 30
ER -