Geocoding DoubleCheck: A Unique Location Accuracy Assessment Tool for Parcel-level Geocoding

Similar documents
CRA Wiz & Fair Lending Wiz Geocoding Basics. August 2017

MAPS & ENHANCED CONTENT

APCAS/10/21 April 2010 ASIA AND PACIFIC COMMISSION ON AGRICULTURAL STATISTICS TWENTY-THIRD SESSION. Siem Reap, Cambodia, April 2010

Visualizing a Pixel. Simulate a Sensor s View from Space. In this activity, you will:

Central Cancer Registry Geocoding Needs

ArcGIS Online Content

Geocoding and Address Matching

Geocoding Techniques and Options for US and International Locations

Spatial Analyst is an extension in ArcGIS specially designed for working with raster data.

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display.

Using Freely Available. Remote Sensing to Create a More Powerful GIS

Raster is faster but vector is corrector

ArcGIS Pro: What s New in Analysis

GIS and Remote Sensing BIO8014. Data acquisition

Our Quality Promise WHITE PAPER

Module 3 Introduction to GIS. Lecture 8 GIS data acquisition

A map says to you, 'Read me carefully, follow me closely, doubt me not.' It says, 'I am the Earth in the palm of your hand. Without me, you are alone

White paper brief IdahoView Imagery Services: LISA 1 Technical Report no. 2 Setup and Use Tutorial

IceTrendr - Polygon. 1 contact: Peder Nelson Anne Nolin Polygon Attribution Instructions

SUGAR_GIS. From a user perspective. Provides spatial distribution of a wide range of sugarcane production data in an easy to use and sensitive way.

Hydraulics and Floodplain Modeling Managing HEC-RAS Cross Sections

A GI Science Perspective on Geocoding:

A Probabilistic Geocoding System based on a National Address File

Managing Imagery and Raster Data. Peter Becker

Geocoding Address Data & Using Geocoded Data

Lecture 8 Geocoding. Dr. Zhang Spring, 2017

Hydraulics and Floodplain Modeling Managing HEC-RAS Cross Sections

Remote Sensing. The following figure is grey scale display of SPOT Panchromatic without stretching.

Lecture 8: GIS Data Error & GPS Technology

earthobservation.wordpress.com

Lecture 13: Remotely Sensed Geospatial Data

Image Registration Issues for Change Detection Studies

Detecting artificial areas inside reference parcels. A technique to assist the evaluation of non-eligibility in agriculture

Sharing Oblique and Oriented Imagery. Cody Benkelman Cristelle D Souza UC2018

A Final Report to. The New Hampshire Estuaries Project. Submitted by

Geocoding: Acquiring Location Intelligence to Make Be er Business Decisions

Satellite Data Used in Land Development

Lesson Plan 1 Introduction to Google Earth for Middle and High School. A Google Earth Introduction to Remote Sensing

ArcGIS Pro: What s New in Analysis. Rob Elkins

Choosing the best path:

Esri UC 2014 Technical Workshop

GIS Data Sources. Thomas Talbot

Open Standard based Visualization Infrastructure for 3D Geospatial Information

Downloading Imagery & LIDAR

March 10, Greenbelt Road, Suite 400, Greenbelt, MD Tel: (301) Fax: (301)

Green/Blue Metrics Meeting June 20, 2017 Summary

Documenting Land Cover and Vegetation Productivity Changes in the NWT using the Landsat Satellite Archive

QGIS document from the previous exercise: worldmap.qgs

Chapter 10. What is geocoding?

Customer Showcase > Defense and Intelligence

The Utility and Limitations of Remote Sensing in Land Use Change Detection and Conservation Planning

Economic and Social Council

ArcGIS Tutorial: Geocoding Addresses

Chapter 2 Outdoor Navigation

Orthoimagery Standards. Chatham County, Georgia. Jason Lee and Noel Perkins

MSB Imagery Program FAQ v1

Business-strength Geocoding

DIGITALGLOBE ATMOSPHERIC COMPENSATION

ENVI Orthorectification Module

BIM FOR INFRASTRUCTURE THE IMPACT OF TODAY S TECHNOLOGY ON BIM

An ESRI White Paper May 2009 ArcGIS 9.3 Geocoding Technology

Using Imagery for Intelligence Analysis. Jim Michel Renee Bernstein

Analysis and Geoprocessing Sessions and Demo Theater Presentations

Separation of crop and vegetation based on Digital Image Processing

Data Preparation. Warren Vick Europa Technologies Ltd.

Public Safety Geocoding Using ArcGIS Online and HERE Data

ASTER GDEM Readme File ASTER GDEM Version 1

Introducing Elsevier Research Intelligence

geocoding crime data in Southern California cities for the project, Crime in Metropolitan

18 th January 2018 Hyderabad. Shishir Verma Sr. Vice President

Pitney Bowes Software Geocoding: Capabilities and Roadmap

Acquisition of Aerial Photographs and/or Satellite Imagery

Lesson 3: Working with Landsat Data

Automated Damage Analysis from Overhead Imagery

Sample Copy. Not For Distribution.

VGIN Geocoding Service

Geocoding An Introduction

Natural Disaster Hotspots Data

Accuracy and Precision of the NAACCR Geocoder. Recinda L Sherman, MPH CTR David J Lee, PhD University of Miami, Florida Cancer Data System

OUR INDUSTRIAL LEGACY WHAT ARE WE LEAVING OUR CHILDREN REAAA Roadshow Taupo, August 2016 Young presenter s competition

Managing and serving large collections of imagery

Land Cover Type Changes Related to. Oil and Natural Gas Drill Sites in a. Selected Area of Williams County, ND

Realigning Historical Census Tract and County Boundaries

Figure 3: Map showing the extension of the six surveyed areas in Indonesia analysed in this study.

ENVI Orthorectification Module

2010 Census Mapping Evolution, Potentialities and Integration to the National Spatial Data Infrastructure

Satellite image classification

3. GPS receiver instruction cards GPS receivers

GIS Lecture 8: Geocoding

Land Cover Analysis to Determine Areas of Clear-cut and Forest Cover in Olney, Montana. Geob 373 Remote Sensing. Dr Andreas Varhola, Kathry De Rego

Digital Image Processing

Designing a WebGIS architecture for aviation impact assessment

Catalogue no X. Geography Catalogue. Census year 2011

GIS Tips and Tricks. Jennifer Harrison President, Inner Corridor Technologies, Inc. ESRI Certified Training Partner (CTP) Let s talk about chocolate

White paper brief IdahoView Imagery Services: LISA 1 Technical Report no. 1 Processing and Evaluation

HARRIS GEOSPATIAL MARKETPLACE. HarrisGeospatial.com

Planning urban areas

The power of virtual globes for valorising cultural heritage and enabling sustainable tourism: NASA World Wind applications

Satellite data processing and analysis: Examples and practical considerations

The effects of uncertainty in forest inventory plot locations. Ronald E. McRoberts, Geoffrey R. Holden, and Greg C. Liknes

Transcription:

Measuring, Modelling and Mapping our Dynamic Home Planet Geocoding DoubleCheck: A Unique Location Accuracy Assessment Tool for Parcel-level Geocoding Page 1 Geocoding is a process of converting an address (typically in the form of street number, street name, city, state and country) into a physical location on the globe (often expressed as latitude and longitude). It is one of the most popular spatial analysis techniques, and almost all major GIS software (e.g. QGIS, ArcGIS and MapInfo) and web mapping portals (e.g. Google Maps, Bing Maps and Apple Maps) provide geocoding tools and API options. But the big question is how reliable is for the geocoding result from each geocoder. Geocoding (location) accuracy is fundamental to the exploration of location-sensitive or locationcentric business intelligence, and important for applications such as asset management and risk analysis. Currently very few assessment tools on this exist. Without the confidence on the location accuracy, subsequent analytics and results would be seriously compromised. We have developed Geocoding DoubleCheck, a unique tool for assessing the location accuracy of the widely used parcel-level geocoding. The current coverage is for 48 contiguous U.S. States.

Parcel-level Geocoding vs. Building-level Geocoding In a basic form, geocoding is realised by interpolating the address range numbers specific to individual street segments (i.e. street-level geocoding). For advanced geocoding, the following two levels can be categorised (Figure 1): Page 2 Parcel-level geocoding: The geometric centroid of a land parcel is used to represent the location of an address. This is a popular and increasingly-applied form for geocoding in some developed countries, e.g. about a half dozen private companies offering land parcel data with varying completeness in the U.S., and the open release of the Geocoded National Address File in Australia. Building-level geocoding: The geocoded location is indicated by the building footprint of a physical building and/or a point within (also known as rooftop-level geocoding). The latest pursuits are certainly geared towards this (e.g. the development of building footprint data by CoreLogic in the U.S. and by PSMA in Australia), but it needs significant investment for a very large geographic coverage. Google produces and holds very comprehensive data sets on digitised building footprints, as seen in Google Maps, but they are proprietary and not open to external analysis. Obviously, for many location-centric applications such as site-level risk analysis, the location accuracy of parcel-level geocoding is still not enough and what is required is the geocoding at the physical building level. Figure 1: Comparison between parcel-level geocoding (left) and building-level geocoding (right). Location: Tompkins County, NY. (Acknowledgements: In this document, the sample land parcel and building footprint data are obtained from the Tompkins County GIS Division; high-resolution aerial imagery is from the USDA NAIP series.)

The difference between the two advanced geocoding levels above can be more easily appreciated if high-resolution aerial or satellite imagery is superimposed (Figure 2). Imagery provides a rich context and clearly shows various land covers. In this demonstration, the limitation of the parcel-level geocoding for location-centric applications such as risk mapping is obvious. For example, a house might be far away from adjacent fire-prone forest but its parcel centroid could be within forest. (Figure 3 shows another example in risk mapping: a house might be located on a high ground 50m away from a nearby river but its parcel centroid could be in a very flood-prone zone.) Page 3 Figure 2: Land parcels and centroids (polygons and dots in orange) and building footprints (white polygons) are overlaid with 1m-resolution aerial imagery. The spatial discrepancy between parcel-level geocoding and building-level geocoding is shown.

The Use of High-Resolution Imagery and Classified Vegetation High-resolution imagery and classified vegetation are essential for the development of Geocoding DoubleCheck in assessing geocoding location accuracy (Figure 3). Ideally, high-resolution land cover maps can be readily sourced but they usually do not exist for a very large territory at this time. Figure 4 compares classified vegetation at 1m resolution and classified land covers from the 2011 National Land Cover Database (NLCD) at 30m resolution, and it is clear that for this type of application, the use of high-resolution imagery and classified land covers is a prerequisite. Page 4 Figure 3: Parcel-level geocoding result superimposed with raw imagery (top) and classified vegetation (in green colour, bottom). The classified vegetation map can be used to determine if the land cover for the geocoded spot (parcel centroid) is vegetation or not.

Page 5 Figure 4: Comparison between classified 1m-resolution vegetation (left, in green) and the classified 30m-resolution land covers from NLCD 2011 (right, colours indicating different classified land covers). Land parcels (white polygons) and their geometric centroids (black dots) are superimposed. Geocoding DoubleCheck takes advantage of the new, comprehensive high-resolution (1m) digital aerial imagery for 48 continuous U.S. States (from the USDA NAIP 2015/2014/2013 series) and the recently classified unique vegetation dataset by BigData Earth (link). We are now able to determine if a geocoded location at the parcel centroid is vegetation or not. If it is vegetation, the geocoded spot is subject to further location scrutiny or improvement. As the imagery we analysed is from agricultural growing seasons or with leaf on conditions, the presence of vegetation is a universal feature and a good indicator for open space. Although not perfect accuracy, like any other image classification tasks, it does serve as a highly relevant and reliable basemap that can be used to determine site land cover conditions for the overwhelming majority of geocoded locations.

Application Example: Advancing Loss Modelling and Risk Mapping We identify and offer three application levels in relation to the Geocoding DoubleCheck product: Level 1: Summary Page 6 If only a very small number of geocoded addresses (for various asset and exposure types) are involved, a user may rely on high-resolution imagery to cross check their true locations and make on-screen adjustment manually. Geocoding DoubleCheck Summary software tool is ideally used to analyse 1,000s or even millions of geocoded addresses in an automated workflow and report the % of addresses that might just hit open space, i.e. vegetation. A typical application for this is catastrophe loss modelling used by the (re)insurance industry, where a large number of geocoded addresses (for exposure or portfolio data) are routinely analysed. Location accuracy is very important as a few meters could mean a different hazard level. Better location accuracy for exposure data has long been demanded by the industry, and we are now able to automatically assess the location accuracy of a large quantity of exposure data used in aggregate loss modelling. The use of geospatial big data analytics and high-resolution digital imagery has now made this large-scale implementation possible. It has been widely recognised by academics and risk management practitioners that exposure location accuracy is a major source of uncertainties in loss modelling. Imagine that every exposure input is filtered through software like the Geocoding DoubleCheck Summary tool, the end user would be much more confident on the exact input that has greater location accuracy. This will certainly reduce uncertainties and make catastrophe loss modelling more transparent and powerful. Software inputs and outputs: Typical input for the software tool is very simple and requires four basic fields: unique ID for each exposure record, geocoded latitude, geocoded longitude, and State name. Typical output is a summary table showing the % of geocoded addresses that hit vegetated areas, and the % of geocoded addresses that have greater location accuracy and been analysed for loss calculation. This can be reported by regions (e.g. states) or by any other input attributes. We offer customised processing services and/or standalone software tools with terabytes of classified vegetation basemaps stored on external hard drives. Web services are not available at this time.

Level 2 Refinement As illustrated in Figure 3, if the distribution of classified vegetation is known, the centroid of each land parcel can be adjusted relative to non-vegetation areas and a new geocoded location in close proximity to a physical building may be determined. Indeed, this type of analysis can be extended by considering any other non-building land covers (e.g. open water and road). We offer advanced image processing services to facilitate such location adjustments and refinement at the land parcel level. In this case, in addition to high-resolution imagery, land parcel data need to be outsourced as they provide predefined control boundaries for detailed land cover classification. Page 7 Level 3 Supplement This examines geocoded locations by considering many terrain and environmental metrics at a site level. It provides an opportunity to double check and diagnose the surroundings of each individual site from multiple perspectives. A typical example is site-level risk mapping. This application overlaps with another data product we are developing (i.e. New Generation Property Location Information, link), which measures and quantifies a range of terrain and environmental attributes, including land covers, terrain, climate change and natural hazards.