'Unmasking' masked address data: A medoid geocoding solution. 2023

Edward Helderop, Jake R Nelson, and Tony H Grubesic
Center for Geospatial Sciences, School of Public Policy, University of California Riverside.

In recent years, there has been a consistent push for more open data initiatives, particularly for datasets collected by public agencies or groups that receive public funding. However, there is a tension between the release of open data and the preservation of individual and household privacy, whose balance shifts due to increased data availability, the sophistication of analysis techniques, and the computational power available to users. As a result, data masking is a standard tool used to preserve privacy. This is a process in which the data publishers obfuscate some identifying features in the dataset while attempting to maintain as much accuracy and precision as possible. For spatial datasets, the geocoding of administratively-masked data has been a consistent problem. Here, we present a medoid-based technique that geocodes masked data while minimizing the spatial uncertainty associated with the masking approach. Unfortunately, many commercial geocoding software packages either fail to geocode administratively-masked data or provide false positives by assigning points to city or street centroids. We demonstrate the results of our medoid-based geocoding approach by comparing it to commercial geocoding software. The results suggest that a medoid geocoding approach is mechanically simple to deploy and maximizes the spatial accuracy of the resulting geocodes.

• Administratively-masked data are difficult to geocode
• A medoid geocoding method maximizes geocoding accuracy
• This method outperforms commercial geocoding software
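
The abstract does not spell out the procedure, so the following is only a minimal sketch of the general idea of a medoid, not the authors' implementation. It assumes that a masked address (for example, a street number with its last digits hidden) can be expanded into a set of candidate address points, and it picks the candidate whose total distance to all other candidates is smallest. The function names `haversine_m` and `medoid` and the coordinates are hypothetical, chosen for illustration.

```python
import math

EARTH_RADIUS_M = 6371000.0  # mean Earth radius in metres


def haversine_m(a, b):
    """Great-circle distance in metres between two (lat, lon) points in degrees."""
    lat1, lon1 = map(math.radians, a)
    lat2, lon2 = map(math.radians, b)
    dlat = lat2 - lat1
    dlon = lon2 - lon1
    h = (math.sin(dlat / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin(dlon / 2) ** 2)
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(h))


def medoid(points, dist=haversine_m):
    """Return the member of `points` with the smallest summed distance to the rest."""
    if not points:
        raise ValueError("need at least one candidate point")
    return min(points, key=lambda p: sum(dist(p, q) for q in points))


# Hypothetical candidate address points on one masked block (lat, lon).
candidates = [
    (33.9750, -117.3280),
    (33.9750, -117.3278),
    (33.9750, -117.3276),
    (33.9750, -117.3274),
    (33.9750, -117.3260),
]

best = medoid(candidates)
print(best)
```

Unlike a centroid, which can fall between or away from any real address, the medoid is always one of the candidate points. In this made-up example the third candidate is selected, while the arithmetic mean of the longitudes would lie between the fourth and fifth candidates.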
