Overview. Scotland's Census. Development of methods. What did we do about it? QA panels. Quality assurance and dealing with nonresponse


Scotland's Census
Quality assurance and dealing with nonresponse in the Census
Cecilia Macintyre and Ali Greig
June 25th 2014

Overview
- Quality assurance approach
- Documentation of quality assurance
- The Estimation System in Census and its Accuracy

Development of methods
- Carried out an agreed series of simple univariate checks at early stages.
- A benefit of early sight of the data was that feedback could be provided to the processing team.
- Developed systems and tools to be used throughout the process and for dissemination of quality information.

What did we do about it?
- Carried out more in-depth checks, prioritising key data used in the first release.
- Analysed data for issues which would cause problems in later processes, in particular edit and imputation.
- Recoded some text responses, including ethnic group and language.
- Sometimes nothing, but quality will need to be reported to users.

QA panels
- Metadata available online.
- Met with the internal quality assurance working group to discuss the approach to quality assurance.
- External panel:
  - provided knowledge and comparator data
  - provides a source of local contact
  - provides insights to NRS on final results

Quality Assurance Pack
To accompany the first release of population and household statistics, NRS published the detailed data used in the quality assurance process. The following slides are extracts from the pack.

Quality Topic Report Format
1: Questions & Variables Covered
2: Tracking Missing Data
3: Data Changes through process
4: Internal Analysis
5: External Analysis
6: Known Quality Issues (may only be relevant for some variables)
7: Definitions and references
8: Documentation

Current work and next steps
- Quality assurance of migration and workplace flow data
- Investigation of issues that have arisen following publications
- Impact of approach to dealing with overlapping areas
- Use of microdata to investigate household compositions
- Planning for documentation and quality products: QA papers, enhanced metadata, item level imputation rates and deterministic edit rates

Further information
- All data available at: www.scotlandscensus.gov.uk
- Also sign up there for our e-newsletter
- Media enquiries: 2011Comms@gro-scotland.gsi.gov.uk
- General enquiries: Customer@gro-scotland.gsi.gov.uk

Questions?

The Estimation System in Census and its Accuracy: A Quick Guide

Quick Question
Does anyone know the census estimation methodology?

Fundamentals of Estimation System
- Key goal: estimate census non-response, that is, quantify the number of people who did not complete a census questionnaire.
- This is primarily achieved through a Census Coverage Survey (CCS):
  - 1.5% sample of Scottish postcodes
  - Stratified two-stage cluster sampling

Estimation Modelling Framework
Capture-Recapture Modelling
- Using the CCS and the census, the probability that individuals and households were missed by the census can be estimated for different groups.
- This is used to estimate the true population for CCS areas.
- These estimates are then used to derive weights, which are applied nationally.

Key methodological issues
- Although 1.5% is a relatively big sample, standard sampling issues apply to the CCS.
- Standard issues around questionnaire design apply to both the CCS and the census.

Independence of CCS and Census
- The probability of not responding to the census and the probability of not responding to the CCS need to be independent.
- In an extreme example, if a particular group does not respond to either the census or the CCS, we will be unable to estimate the probability that individuals from that group are missed by the census.
- Some remedial steps are taken.

How accurate is the estimation system?
Theoretically complex to estimate.
- Sample size of CCS / Stratification
- Independence assumption
- Edit & Imputation assumptions
  - Based on nearest neighbour algorithm
- Data processing and other adjustments
- Symmetric sampling distributions
- Note on small area population estimates
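The capture-recapture idea under "Estimation Modelling Framework" can be illustrated with a minimal sketch. The slides do not state the exact estimator used, so the simple Lincoln-Petersen (dual-system) formula is assumed here, and all counts and function names are hypothetical. It assumes the census and CCS responses are independent, as discussed under "Independence of CCS and Census".

```python
def dual_system_estimate(census_count, ccs_count, matched_count):
    """Lincoln-Petersen estimate of the true population of a CCS area.

    census_count:  people counted by the census in the area
    ccs_count:     people counted by the CCS in the same area
    matched_count: people found in both sources
    (Assumed estimator; the slides only say "capture-recapture".)
    """
    if matched_count <= 0:
        raise ValueError("at least one matched record is needed")
    return census_count * ccs_count / matched_count


def coverage_weight(census_count, ccs_count, matched_count):
    """Weight that scales the census count up to the estimated population."""
    estimate = dual_system_estimate(census_count, ccs_count, matched_count)
    return estimate / census_count


# Hypothetical counts for one group in one CCS area.
census_count, ccs_count, matched_count = 900, 800, 720

estimate = dual_system_estimate(census_count, ccs_count, matched_count)
weight = coverage_weight(census_count, ccs_count, matched_count)
# Share of CCS respondents who also appear in the census.
census_response_prob = matched_count / ccs_count

print(f"estimated population: {estimate:.0f}")
print(f"census response probability: {census_response_prob:.2f}")
print(f"coverage weight: {weight:.3f}")
```

With these illustrative counts, 90% of the people found by the CCS also appear in the census, so the census count of 900 is scaled up to an estimated 1,000. If a group is missed by both sources, the matched share overstates coverage and the estimate is too low, which is why the independence assumption matters.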

Different ways to estimate accuracy
- Imputation / Non-response rates
- Bootstrapping to derive confidence intervals
- Theoretical development of confidence intervals

Imputation / Response Rates
- The number of values/people which are synthetic.
- Imputation rates (or response rates) are a very useful indication of data quality and are easy to interpret.
- But the indicator is not an error rate (i.e., variables with higher imputation rates are not necessarily the most inaccurate).

Bootstrap methodology
- Basically, shuffle CCS responses within each stratum across Scotland.
- The estimation system is rerun with the new PU-level data, and new estimates are generated.
- Provides an indicator which can be interpreted as a confidence interval.
- Not really a conventional confidence interval (i.e., it asks "what variance is expected from the estimation system?").

Theoretical Confidence Interval
- Produce a confidence interval using statistical theory.
- We tried this using a Bayesian approach.
- Can investigate the independence assumption, and produce a consistent confidence interval that is less reliant on responses.
- But this depends on the extent of dependence, which is difficult to measure.

Current thoughts
- The best approach would likely be a combination of the bootstrap or theoretical approach and imputation rates, although theoretical approaches are useful in learning about the estimation system.
- General conclusion: demographic subpopulations with relatively larger numbers of responses (in CCS areas) will produce good data.
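The bootstrap methodology above can be sketched roughly as follows. This is an illustration under stated assumptions, not the NRS implementation: "shuffling within each stratum" is interpreted as resampling CCS areas with replacement within a stratum, the "estimation system" is stood in for by a pooled dual-system estimate per stratum, and the strata, counts and function names are all hypothetical.

```python
import random


def pooled_estimate(areas):
    """Pooled dual-system estimate for one stratum.

    Each area is a (census_count, ccs_count, matched_count) tuple.
    This is a stand-in for the real estimation system (assumption).
    """
    census = sum(a[0] for a in areas)
    ccs = sum(a[1] for a in areas)
    matched = sum(a[2] for a in areas)
    return census * ccs / matched


def total_estimate(strata):
    """Sum the stratum-level estimates into one overall figure."""
    return sum(pooled_estimate(areas) for areas in strata.values())


def bootstrap_interval(strata, n_reps=1000, level=0.95, seed=0):
    """Percentile interval from rerunning the estimate on resampled data."""
    rng = random.Random(seed)
    totals = []
    for _ in range(n_reps):
        # Resample areas with replacement, separately within each stratum.
        resampled = {
            name: [rng.choice(areas) for _ in areas]
            for name, areas in strata.items()
        }
        totals.append(total_estimate(resampled))
    totals.sort()
    lo_idx = round((1 - level) / 2 * n_reps)
    hi_idx = round((1 + level) / 2 * n_reps) - 1
    return totals[lo_idx], totals[hi_idx]


# Hypothetical CCS areas grouped into two strata.
strata = {
    "urban": [(900, 800, 720), (450, 400, 350), (620, 560, 500)],
    "rural": [(300, 280, 266), (210, 200, 188), (150, 140, 133)],
}

point = total_estimate(strata)
low, high = bootstrap_interval(strata, seed=1)
print(f"point estimate: {point:.0f}, interval: ({low:.0f}, {high:.0f})")
```

The interval describes how much the output of the estimation procedure varies when the CCS sample changes, which is the sense in which the slides call it "not really a conventional confidence interval". It does not capture bias from a failure of the independence assumption.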

Contact Details
Cecilia.Macintyre@gro-scotland.gsi.gov.uk
Alastair.Greig@gro-scotland.gsi.gov.uk