Journal of Official Statistics

The Journal of Statistics Sweden

Open Access
A Simple Method for Limiting Disclosure in Continuous Microdata Based on Principal Component Analysis

Aida Calviño
  • Department of Computer Science and Mathematics, Universitat Rovira i Virgili, 43007 Tarragona, Spain
  • Department of Statistics and Operations Research III, Complutense University of Madrid, 28040 Madrid, Spain
Published Online: 2017-02-21 | DOI: https://doi.org/10.1515/jos-2017-0002


In this article we propose a simple and versatile method for limiting disclosure in continuous microdata based on Principal Component Analysis (PCA). Instead of perturbing the original variables, we propose to alter the principal components: they contain the same information but are uncorrelated, so each component can be processed separately, which reduces processing time. The number and weight of the perturbed components determine the level of protection and the distortion of the masked data. The method preserves the mean vector and the variance-covariance matrix. Furthermore, depending on the technique chosen to perturb the principal components, the proposed method can produce masked, hybrid, or fully synthetic data sets. Examples of application and a comparison with previously proposed methods, in terms of disclosure risk and data utility, are also included.
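To make the idea concrete, the following is a minimal sketch of one way to perturb principal components while keeping the sample mean vector and variance-covariance matrix unchanged. It is not taken from the article: the function name `pca_mask`, its parameters, and the choice of replacing the selected components with random orthogonalized scores are all assumptions made for illustration, and the sketch assumes more records than variables and a full-rank data matrix.

```python
import numpy as np

def pca_mask(X, perturb, seed=None):
    """Hypothetical sketch: replace the principal components listed in
    `perturb` with random scores, keeping the sample mean and covariance.
    Assumes X has more rows than columns and full column rank."""
    rng = np.random.default_rng(seed)
    X = np.asarray(X, dtype=float)
    n, p = X.shape
    perturb = sorted(set(perturb))
    keep = [j for j in range(p) if j not in perturb]

    # PCA via SVD of the centered data: X - mean = U @ diag(S) @ Vt.
    # Columns of U are the standardized component scores.
    mean = X.mean(axis=0)
    U, S, Vt = np.linalg.svd(X - mean, full_matrices=False)

    # Draw random columns and project out the constant vector and the
    # retained components, so the new scores are centered and uncorrelated
    # with everything that is kept.
    Z = rng.standard_normal((n, len(perturb)))
    B, _ = np.linalg.qr(np.column_stack([np.ones(n), U[:, keep]]))
    Z -= B @ (B.T @ Z)

    # Orthonormalize the new columns so they are mutually uncorrelated
    # and have the same norm as the columns they replace.
    Q, _ = np.linalg.qr(Z)
    U_new = U.copy()
    U_new[:, perturb] = Q

    # Rescale by the original singular values and rotate back.
    return mean + (U_new * S) @ Vt

# Usage on simulated correlated data (illustrative values only).
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 4)) @ rng.standard_normal((4, 4))
Y = pca_mask(X, perturb=[2, 3], seed=1)
print(np.allclose(Y.mean(axis=0), X.mean(axis=0)))
print(np.allclose(np.cov(Y, rowvar=False), np.cov(X, rowvar=False)))
```

In this sketch, the indices passed in `perturb` play the role of the "number and weight of the perturbed components": replacing only low-variance components changes the records little, while replacing high-variance components changes them more. Other ways of perturbing the components are possible and are not covered here.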

Keywords: Statistical disclosure control; microdata protection; hybrid microdata; masking method; propensity score


About the article

Received: 2015-09-01

Revised: 2016-08-01

Accepted: 2016-08-01

Published Online: 2017-02-21

Published in Print: 2017-03-01

Citation Information: Journal of Official Statistics, ISSN (Online) 2001-7367, DOI: https://doi.org/10.1515/jos-2017-0002.

© by Aida Calviño. This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License (CC BY-NC-ND 4.0).

