Elements of statistical disclosure control
Statistical disclosure control is the discipline that deals with producing statistical data that are safe enough to be released to external researchers. This book concentrates on the methodology of the area. It deals with both microdata (individual data) and tabular (aggregated) data. The book attempts to develop the theory from what can be called the paradigm of statistical confidentiality: to modify unsafe data in such a way that safe (enough) data emerge, with minimum information loss. This book discusses what safe data, are, how information loss can be measured, and how to modify the data in a (near) optimal way. Once it has been decided how to measure safety and information loss, the production of safe data from unsafe data is often a matter of solving an optimization problem. Several such problems are discussed in the book, and most of them turn out to be hard problems that can be solved only approximately. The authors present new results that have not been published before. The book is not a description of an area that is closed, but, on the contrary, one that still has many spots awaiting to be more fully explored. Some of these are indicated in the book. The book will be useful for official, social and medical statisticians and others who are involved in releasing personal or business data for statistical use. Operations researchers may be interested in the optimization problems involved, particularly for the challenges they present. Leon Willenborg has worked at the Department of Statistical Methods at Statistics Netherlands since 1983, first as a researcher and since 1989 as a senior researcher. Since 1989 his main field of research and consultancy has been statistical disclosure control. From 1996-1998 he was the project coordinator of the EU co-funded SDC project.
What people are saying - Write a review
We haven't found any reviews in the usual places.
Disclosure Risks for Microdata
Data Analytic Impact of SDC Techniques on Microdata
7 other sections not shown
Other editions - View all
adding noise algorithm applied approach assume calculate categorical variable cell i,j cell suppression problem cell total chapter combination of values consider constraints controlled rounding corner points corresponding data protector defined denoted Dijkstra's algorithm disclose disclosure control disclosure risk discussed distribution dominance rule entries entropy equal estimator example feasibility interval frequency count table given global recoding heuristic hypercube identifying variables information loss key variables linked tables loss function LP relaxation marginal totals measure method microdata file microdata set MINUCs nonnegative odd cells original table partition perturbative population possible poststrata PRAM matrix primary suppression probability procedure protection interval random record region released response variable rounded value rounding base safe SDC techniques secondary cell secondary suppressions Section sensitive cell simulated annealing solution spanning variables Statistics Netherlands stochastic rounding Suppose suppressed cells table redesign tabular data target unit two-dimensional table unsafe combinations weights zero zero-extended table