Secure computation with horizontally partitioned data using adaptive regressive splines (2006)

Abstract:

When several data owners possess data on different records but the same variables, known as horizontally partitioned data, the owners can improve statistical inferences by sharing their data with each other. Often, however, the owners are unwilling or unable to share because the data are confidential or proprietary. Secure computation protocols enable the owners to compute parameter estimates for some statistical models, including linear regressions, without sharing individual records’ data. A drawback to these techniques is that the model must be specified in advance of initiating the protocol, and the usual exploratory strategies for determining good- fitting models have limited usefulness since the individual records are not shared. In this paper, we present a protocol for secure adaptive regression splines that allows for flexible, semi-automatic regression modeling. This reduces the risk of model misspecification inherent in secure computation settings. We illustrate the protocol with air pollution data.

Keywords:

Confidentiality, Disclosure, Regression, Secure computation, Spline

Author: 
Joyee GhoshJerome P. ReiterAlan F. Karr
Publication Date: 
Thursday, June 1, 2006
File Attachment: 
PDF icon tr160.pdf
Report Number: 
160