Reverse geocoding and implica1ons for geospa1al privacy. Paul Zandbergen Department of Geography University of New Mexico

Similar documents
A GI Science Perspective on Geocoding:

Geocoding and Address Matching

Geocoding Techniques and Options for US and International Locations

Geocoding An Introduction

Lecture 8 Geocoding. Dr. Zhang Spring, 2017

Geocoding DoubleCheck: A Unique Location Accuracy Assessment Tool for Parcel-level Geocoding

Pitney Bowes Software Geocoding: Capabilities and Roadmap

ArcGIS Geocoding What s New and the Road Ahead. Jeff Rogers Brad Niemand

geocoding crime data in Southern California cities for the project, Crime in Metropolitan

ARCGIS DESKTOP DEMO (GEOCODING, SERVICE AREAS, TABULAR & SPATIAL JOINS)

Influence of street reference data on geocoding quality

Geocoding Techniques and Options for US and International Locations. Brady Hoak, Tosia Shall

Analysis and Geoprocessing Sessions and Demo Theater Presentations

Geocoding with ArcGIS FedGIS 2017 (2/14/2017)

2010 Census Mapping Evolution, Potentialities and Integration to the National Spatial Data Infrastructure

Analysis & Geoprocessing: Case Studies Problem Solving

Version 2.2 April Census Local Update of Census Addresses Operation (LUCA) Frequently Asked Questions

An ESRI White Paper May 2009 ArcGIS 9.3 Geocoding Technology

GIS Data Sources. Thomas Talbot

Working with Geocoding APIs

The role of MNEs for innova1on networks. Petra Ahrweiler Michel Schilperoord UCD Innova1on Research Unit IRU

Public Safety Geocoding Using ArcGIS Online and HERE Data

Na6onal Agricultural Imagery Program (NAIP)

Geocoding Techniques and Options for US and International Locations

The Alamo Campaign Dr. Jeffrey D. McCausland Founder and CEO Diamond6 Leadership and Strategy, LLC

Microscope Imaging. Colin Sheppard Nano- Physics Department Italian Ins:tute of Technology (IIT) Genoa, Italy

Chapter 10. What is geocoding?

Objec&ve % Explain design concepts used to create digital graphics.

Central Cancer Registry Geocoding Needs

2020 CENSUS LOCAL UPDATE OF CENSUS ADDRESSES OPERATION (LUCA) U.S. Census Bureau Geography Division

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display.

Guidance on the anonymisation of clinical reports for the purpose of publication in accordance with policy 0070

Improving the Quality of Geocoded Data

Data Preparation. Warren Vick Europa Technologies Ltd.

Modeling & Simulation Capability for Consequence Management

Overview: Radio Frequency Spectrum

In-Office Address Canvassing for the 2020 Census: an Overview of Operations and Initial Findings

THE LIST USABILITY PUG 2007

Spatial Analyst is an extension in ArcGIS specially designed for working with raster data.

ArcGIS Online: World Geocoding. Bruce Harold, Esri

CRA Wiz & Fair Lending Wiz Geocoding Basics. August 2017

Part 1: Introduc.on to RLS 1.1 Spec. Part 2: Introduc.on to RoIS 1.0 Spec. Part 3: Introduc.on of UNR Pla<orm

6th Grade Key Terms What do the Students Have to Know to Master the Standard? How will you assess mastery of the standard?

GIS Lecture 8: Geocoding

Geocoding regional and remote poor quality address records with confidence

GPS Accuracy in Urban Environments Using Post-Processed CORS Data

Vendor Accuracy Study

MapMarker NZL 4.5 Server Release Notes Data

SPE Copyright 1998, Society of Petroleum Engineers Inc.

Provide input and learn more about the study at

Using the ModelBuilder of ArcGIS 9 for Landscape Modeling

MSc(CompSc) List of courses offered in

National Census Geography Some lessons learned and future challenges in European countries

Poverty in the United Way Service Area

The ARK Iden+fier Scheme at Ten Years Old

Guidance for Calculating Incidence by Census Tract Poverty Level Using 2010 Census and ACS

Systematic Privacy by Design Engineering

Seon Ho Kim, Ph.D. Associate Director, IMSC Viterbi School of Engineering University of Southern California Los Angeles, CA

Microsimulation, Big Data and Predictive Analytics

Geocoding Address Data & Using Geocoded Data

9 LAND SURVEYING. 9.1 General. 9.2 Administrative Requirements Standards Meetings Survey Data Provided to the Design-Builder

Objec&ve % Compare and contrast digital graphics tools and their uses.

Sharing Considerations Between Small Cells and Geostationary Satellite Networks in the Fixed-Satellite Service in the GHz Frequency Band

Computational Methods for Analysis of Footwear Impression Evidence

ArcGIS 9 Using ArcGIS StreetMap

How to put the Image Services in the Living Atlas to Work in Your GIS. Charlie Frye, Chief Cartographer Esri, Redlands

The 2020 Census Geographic Partnership Opportunities

A Probabilistic Geocoding System based on a National Address File

The 2020 Census Geographic Partnership Opportunities. Geography Division U.S. Census Bureau

Lecture 8: GIS Data Error & GPS Technology

Open Standard based Visualization Infrastructure for 3D Geospatial Information

COMPUTER. 1. PURPOSE OF THE COURSE Refer to each sub-course.

Massive Multi-Agent Simulation - Master Seminar

Visualizing a Pixel. Simulate a Sensor s View from Space. In this activity, you will:

UNIGIS University of Salzburg. Module: ArcGIS for Server Lesson: Online Spatial analysis UNIGIS

Before the FEDERAL COMMUNICATIONS COMMISSION Washington, D.C

Satellite Issues. WRC-19 agenda items 1.5, 1.6, 7. Jack Wengryniuk Chairman, WP 4A

2020 Census Local Update of Census Addresses Opera6on (LUCA) Linda Gray US Census Bureau

Geocoding Techniques and Options for US and International Locations. Thomas Oaks Tosia Shall

2020 Census Local Update of Census Addresses Operation (LUCA)

Part 1. Tracing the Dimensions of Some Common Pixel Sizes using a GPS Receiver

The Road to 2020 Census

WIA Repeater and Beacon Licence Recommenda3on Policy. December Background

TOURISM for several country is a primordial matter to

Introducing Foresight Alliance

ProBeam: A Prac,cal Mul,cell Beamforming System for Small- cell Networks

Guidance on the anonymisation of clinical reports for the purpose of publication

Galvanic Skin Response, Heart Rates, GPS, Art and You (and TomTom)

Using mobile network big data for urban and transporta5on planning in Colombo

Empirical Performance of Self- calibra2ng WiFi Loca2on Systems. Daniel Turner Stefan Savage Alex C. Snoeren UCSD 10/05/2011

ArcGIS Online Content

ArcGIS Runtime: Analysis. Lucas Danzinger Mark Baird Mike Branscomb

ISED Consultation Submission by the Wireless Broadband Alliance

Tactical Cellular Positioning. Tactical Enablers for Operational and Intelligence Units

Sharing Oblique and Oriented Imagery. Cody Benkelman Cristelle D Souza UC2018

Provides the data analysis, management and

Increasing Access to Certain North Carolina Environmental Data -- North Carolina Policy Collaboratory Project Update

Notes on the 2014 ACS 5-Year Estimates

Searching, Exporting, Cleaning, & Graphing US Census Data Kelly Clonts Presentation for UC Berkeley, D-lab March 9, 2015

What to do with 500M Location Requests a Day?

Transcription:

Reverse geocoding and implica1ons for geospa1al privacy Paul Zandbergen Department of Geography University of New Mexico

Outline Geospatial privacy Geocoding / reverse geocoding Experimental design Results and Conclusion

Geospa1al Privacy Under increasing threat from new technologies, high resolu1on data and easy to use tools

How easy is it to hack this map?

Geocoding Widely employed Well understood Substan1al errors

Tool of a hacker: Reverse geocoding Geocoding in reverse Rela1ve easy, rela1vely new Key tool for hacking published maps Not well understood ID Address ZIP 101 123 Main St 12345 102 456 Central Ave 12346

How to protect spa1al confiden1ality? Geographic masking But how to do this most effec1vely? Need berer understanding of reverse geocoding random point within circle random point on circle random distance and direc1on donut masking donut masking with exclusion

Review of the State of the Art Panel of confidentiality issues arising from the integration of remotely sensed and self-identifying data No known technical strategy [.] for managing linked spatial-social data adequately resolves conflicts among the objectives of data linkage, open access, data quality, and confidentiality protection across datasets and across uses. (Conclusion 3) National Research Council, 2007

Research Ques1ons What are the capabili5es of reverse geocoding to iden5fy individuals from published loca5ons? How does this vary with the methods employed for geocoding and reverse geocoding? How does this vary with popula5on density?

Experimental Design Actual building loca1ons with known addresses Travis County, TX (Aus1n) Sample of 2,500 residen1al loca1ons Stra1fied across 5 popula1on density classes Geocode using 5 different geocoders Reverse geocode using 3 different techniques Determine accuracy of reverse geocoding

Address Points

Geocoders TeleAtlas Address Points (commercial) TeleAtlas Streets (commercial) Google Maps (free API) StreetMap USA Pro 2007 in ArcGIS Geoly1cs 2007 (using TIGER 2007data)

Study Area Travis County, TX Popula1on Density Zones Sample of residen1al address points

Reverse Geocoding & Accuracy Original residen1al address points Snap to nearest residen1al building TeleAtlas reverse street geocoding Submit for commercial processing Google Maps reverse geocoding Free API Accuracy of reverse matches 1. Perfect match (street name and number) 2. Close match (number within 10) 3. Same street only

Results Match Rates (%) Geocoding Technique Popula1on density (people/km 2 ) < 50 50 to 250 250 to 1000 1000 to 2500 > 2500 Total TeleAtlas AP 43.6 70.2 91.8 92.8 93.6 78.4 TeleAtlas Street 93.2 92.8 98.0 99.8 96.8 96.1 Google Maps 92.2 95.6 98.2 99.0 96.2 96.2 StreetMap Pro 81.2 83.6 95.8 99.0 95.8 91.1 Geoly1cs 77.0 80.2 92.4 96.2 90.2 87.2 Combined 31.4 59.0 86.0 89.6 86.8 70.6

Results Same Street Matches Reverse Geocoding Technique Aus1n AP Google Maps TeleAtlas Street Aus1n AP 100.0 96.7 90.1 Geocoding Technique TeleAtlas AP 99.5 92.0 89.6 Google Maps 99.2 32.0 90.5 TeleAtlas Street 88.2 92.5 99.5 StreetMap Pro 89.1 76.8 82.5 Geoly1cs 54.5 54.7 56.9

Results Close Reverse Matches Reverse Geocoding Technique Aus1n AP Google Maps TeleAtlas Street Aus1n AP 100.0 95.5 23.1 Geocoding Technique TeleAtlas AP 99.1 91.8 24.3 Google Maps 98.8 28.6 24.3 TeleAtlas Street 69.8 63.2 92.5 StreetMap Pro 60.8 42.9 44.0 Geoly1cs 33.8 27.4 29.3

Results Perfect Reverse Matches Reverse Geocoding Technique Aus1n AP Google Maps TeleAtlas Street Aus1n AP 100.0 94.2 9.1 Geocoding Technique TeleAtlas AP 97.9 91.8 9.0 Google Maps 97.2 27.8 9.1 TeleAtlas Street 18.7 9.4 55.0 StreetMap Pro 17.0 7.3 8.2 Geoly1cs 7.1 2.3 2.6

Effect of Popula1on Density Percent Perfect Matches Geocoding Reverse Popula1on density (people/km 2 ) < 50 50 to 250 250 to 1000 1000 to 2500 > 2500 StreetMap Pro Aus1n AP 22.9 16.9 15.3 16.1 17.3 Geoly1cs Aus1n AP 14.0 7.1 7.0 5.4 6.5 Aus1n AP TeleAtlas Street 2.5 6.8 11.6 11.4 8.3 TeleAtlas Street TeleAtlas Street 51.6 63.7 56.7 53.8 49.8 Google Maps TeleAtlas Street 4.5 6.4 12.3 11.2 7.4 TeleAtlas Street Google Maps 9.6 9.8 8.8 9.6 9.4 Geoly1cs Google Maps 1.9 2.0 2.6 2.5 2.3

Results Summary Accuracy of reverse geocoding varies greatly Building level (reverse) geocoding is typically most accurate Street geocoding is quite noisy Easy to get the right street Very few perfect matches Accuracy is substan1ally improved if knowledge of the original geocoding technique is available! No clear parern with popula1on density

Roopop Geocoding in Google Maps and Virtual Earth

Commercial Address Points - TeleAtlas 51 million address points in the US Source: TeleAtlas, 2009 Licensed to Google, Virtual Earth, ArcGIS Business Analyst, Pitney Bowes / Group 1 / MapInfo

Reverse Geocoding Reverse geocoding now supported in Google Maps and Microsop Virtual Earth Also supported in latest version of ArcGIS requires some customiza1on or ArcWeb services Numerous free easy to use online u1li1es

Conclusions Accuracy of reverse geocoding Varies greatly with geocoder / reverse geocoder combina1on Between 2% and 98% perfect reverse matches Knowledge of original geocoding method is cri1cal noisy results from street geocoding can be reverse coded Trends: Address points are the new standard in geocoding Reverse geocoding is rela1vely easy Techniques to protect privacy may need to assume a worstcase scenario: very high resolu1on address data

Future Research Replicate in other study areas Examine urban/rural gradients more closely Experiment with different masking techniques Develop a framework for spa1al κ anonymity

Acknowledgements Na1onal Science Founda1on UNM Research Alloca1on CommiRee American Civil Liber1es Union