An Introduction to ACS Statistical Methods and Lessons Learned

Similar documents
Census Data for Transportation Planning

The American Community Survey Motivation, History, and Design. Workshop on the American Community Survey Havana, Cuba November 16, 2010

American Community Survey: Sample Design Issues and Challenges Steven P. Hefter, Andre L. Williams U.S. Census Bureau Washington, D.C.

American Community Survey Accuracy of the Data (2014)

Proceedings of the Annual Meeting of the American Statistical Association, August 5-9, 2001

INTEGRATED COVERAGE MEASUREMENT SAMPLE DESIGN FOR CENSUS 2000 DRESS REHEARSAL

1 NOTE: This paper reports the results of research and analysis

MATRIX SAMPLING DESIGNS FOR THE YEAR2000 CENSUS. Alfredo Navarro and Richard A. Griffin l Alfredo Navarro, Bureau of the Census, Washington DC 20233

Claritas Demographic Update Methodology Summary

Introduction INTRODUCTION TO SURVEY SAMPLING. Why sample instead of taking a census? General information. Probability vs. non-probability.

Estimation Methodology and General Results for the Census 2000 A.C.E. Revision II Richard Griffin U.S. Census Bureau, Washington, DC 20233

Who s in Your Neighborhood? Using the American FactFinder. Salma Abadin and Carrie Koss Vallejo Data You Can Use

In-Office Address Canvassing for the 2020 Census: an Overview of Operations and Initial Findings

Italian Americans by the Numbers: Definitions, Methods & Raw Data

An Overview of the American Community Survey

The American Community Survey. An Esri White Paper August 2017

Understanding and Using the U.S. Census Bureau s American Community Survey

Paper ST03. Variance Estimates for Census 2000 Using SAS/IML Software Peter P. Davis, U.S. Census Bureau, Washington, DC 1

Table 5 Population changes in Enfield, CT from 1950 to Population Estimate Total

Can a Statistician Deliver Coherent Statistics?

2010 Census Data. Get Ready for Changes in Your 2014 AAPs. Ellen Shong & Associates, LLC 9/13/ Past EEO Tabulations

2020 Census Program Update

Variance Estimation in US Census Data from Kathryn M. Coursolle. Lara L. Cleveland. Steven Ruggles. Minnesota Population Center

A PROTOTYPE CONTINUOUS MEASUREMENT SYSTEM FOR THE U.S. CENSUS OF POPULATION AND HOUSING

Section 2: Preparing the Sample Overview

Comparing the Quality of 2010 Census Proxy Responses with Administrative Records

Using Administrative Records and the American Community Survey to Study the Characteristics of Undercounted Young Children in the 2010 Census

How Will the Changing U.S. Census Affect Decision-Making?

Quick Reference Guide

2020 Census Update. Presentation to the Council of Professional Associations on Federal Statistics. December 8, 2017

2012 AMERICAN COMMUNITY SURVEY RESEARCH AND EVALUATION REPORT MEMORANDUM SERIES #ACS12-RER-03

Proposed Information Collection; Comment Request; The American Community Survey

Introduction INTRODUCTION TO SURVEY SAMPLING. General information. Why sample instead of taking a census? Probability vs. non-probability.

The 2020 Census: A New Design for the 21 st Century Deirdre Dalpiaz Bishop Chief Decennial Census Management Division U.S.

The U.S. Decennial Census A Brief History

2010 Census Coverage Measurement - Initial Results of Net Error Empirical Research using Logistic Regression

Sierra Leone - Multiple Indicator Cluster Survey 2017

Finding U.S. Census Data with American FactFinder Tutorial

Census: Gathering information about every individual in a population. Sample: Selection of a small subset of a population.

Using Administrative Records for Imputation in the Decennial Census 1

Sample size, sample weights in household surveys

Statistical Issues of Interpretation of the American Community Survey s One-, Three-, and Five-Year Period Estimates

American Community Survey 5-Year Estimates

American Community Survey 5-Year Estimates

Measuring ICT use by businesses in Brazil: The Project of the Brazilian Institute of Geography and Statistic (IBGE)

Claritas Update Demographics Methodology

Experiences with the Use of Addressed Based Sampling in In-Person National Household Surveys

October 6, Linda Owens. Survey Research Laboratory University of Illinois at Chicago 1 of 22

The 2020 Census A New Design for the 21 st Century

Claritas Demographic Update Methodology

Vincent Thomas Mule, Jr., U.S. Census Bureau, Washington, DC

Session V: Sampling. Juan Muñoz Module 1: Multi-Topic Household Surveys March 7, 2012

SELECTED SOCIAL CHARACTERISTICS IN THE UNITED STATES American Community Survey 5-Year Estimates

Using 2010 Census Coverage Measurement Results to Better Understand Possible Administrative Records Incorporation in the Decennial Census

Design and Methodology

Demystifying Census Data. Legislative Research Librarians September 18, 2013 Boise, Idaho

2020 Census: Researching the Use of Administrative Records During Nonresponse Followup

2020 Census. Bob Colosi Decennial Statistical Studies Division February, 2016

Working Paper n. 79, January 2009

Polls, such as this last example are known as sample surveys.

THE EVALUATION OF THE BE COUNTED PROGRAM IN THE CENSUS 2000 DRESS REHEARSAL

PROBABILITY-BASED SAMPLING USING Split-Frames with Listed Households

Notes on the 2014 ACS 5-Year Estimates

b. Stopping students on their way out of the cafeteria is a good way to sample if we want to know about the quality of the food there.

Botswana - Botswana AIDS Impact Survey III 2008

AP Statistics S A M P L I N G C H A P 11

COMPARISON OF ALTERNATIVE FAMILY WEIGHTING METHODS FOR THE NATIONAL HEALTH INTERVIEW SURVEY

American Community Survey Overview

Participant Statistical Areas Program for the 2010 Census. Vince Osier COPAFS Quarterly Meeting Washington, DC December 8, 2006

American Community Survey Review and Tips for American Fact Finder. Sarah Ehresman Kentucky State Data Center August 7, 2014

Simulated Statistics for the Proposed By-Division Design In the Consumer Price Index October 2014

Working with United States Census Data. K. Mitchell, 7/23/2016 (no affiliation with U.S. Census Bureau)

Stat472/572 Sampling: Theory and Practice Instructor: Yan Lu Albuquerque, UNM

South Dakota State Demographic Conference

Census Pro Documentation

My Tribal Area: Census Data Overview & Access. Eric Coyle Data Dissemination Specialist U.S. Census Bureau

Comparing Generalized Variance Functions to Direct Variance Estimation for the National Crime Victimization Survey

Chapter 4: Sampling Design 1

Environmental Justice Tool Guide

Virginia Employment Commission

Understanding the Census A Hands-On Training Workshop

Geog 3340: Census Basics

Chapter 12: Sampling

Investigation of Variance Estimators for the Survey of Business Owners (SBO)

REVISED - Census Tract Measures for Fragile Families Mothers and Fathers at Baseline. September 16, 2005

The American Community Survey and the 2010 Census

Survey of Massachusetts Congressional District #4 Methodology Report

Grid. Grid. Grid. Some grids. Grid. Grid. A Grid in Lithuania. BNU 2012, Valmiera Seppo 1

National Longitudinal Study of Adolescent Health. Public Use Contextual Database. Waves I and II. John O.G. Billy Audra T. Wenzlow William R.

Other Effective Sampling Methods

Stats: Modeling the World. Chapter 11: Sample Surveys

Handout Packet. QuickFacts o Frequently Asked Questions

Learning to Use the ACS for Transportation Planning Report on NCHRP Project 8-48

Use of administrative sources and registers in the Finnish EU-SILC survey

Guyana - Multiple Indicator Cluster Survey 2014

ERROR PROFILE FOR THE CENSUS 2000 DRESS REHEARSAL

A Guide to Sampling for Community Health Assessments and Other Projects

Chapter 3 Monday, May 17th

Working with NHS and Taxfiler data to measure income and poverty in Toronto neighbourhoods

Nigeria - Multiple Indicator Cluster Survey

Transcription:

An Introduction to ACS Statistical Methods and Lessons Learned Alfredo Navarro US Census Bureau Measuring People in Place Boulder, Colorado October 5, 2012

Outline Motivation Early Decisions Statistical Methodology Reliability of ACS estimates, specifically for small areas and population groups - Sample Size - Mail/CATI Response Rates - Population Controls Successes, Challenges, Lessons Learned 2

Motivating Goals Produce more timely social, economic, and housing data for all geographic areas, particularly for small areas. Simplify decennial census operations to collect only the most basic data. 3

Basic Tenets of the ACS ACS Program serves as the replacement to the Census long form Selected sample is spread over the years throughout the decade Accumulate data over time to generate more reliable estimates 4

Design Origins and Early Proposals Concept of rolling sample design Mid-decade census Proposed Decade Census Program Continuous measurement alternatives to the Census 2000 long form 5

Early Decisions - Data Collection Methodology based on best practices from decennial census and demographic surveys Monthly samples using overlapping multi-mode data collection methods Mail Telephone Personal Visit 6

Early Decisions Data Collection Strategy Calendar Month Sample Panel FebruaryMarchApril May June July August 2005 2005 2005 2005 2005 2005 2005 Feb 2005 Mail Phone Personal Visit March 2005 Mail Phone Personal Visit April 2005 Mail Phone Personal Visit May 2005 Mail Phone Personal Visit June 2005 Mail Phone Personal Visit 7

Early Decisions Residence Rules Should the residence rule in the ACS be based on current residence or should be made to be more consistent with a more usual residence based rule? 8

Residence Rules - Options 1. Current residence at time of interview. 2. Usual residence at time of interview. 3. Usual residence with a constant reference date. 4. Delay decision, conduct experimental research and consult with data users. 9

Residence Rules-Decision Criteria 1. Simplicity of implementation 2. Complete coverage of the population 3. Completeness of data collection 4. Meeting data users needs 10

Sample Design

Sample Design Survey designed to include U.S. Stateside and Puerto Rico Population in both housing units and group quarters (group quarters started in 2006) Survey designed to produce annually updated single-year and multi-year estimates 12

Sample Design Frame Sample cases selected from an updated Master Address File (MAF) MAF updated through the use of Postal Service updates in most areas Special field updating in more rural areas and presence of non-city style addresses 13

Sample Design Un-clustered one-stage systematic sample of housing units selected as initial sample each month Sub-sample of nonrespondents selected after mail and phone attempts for personal visit follow-up 14

ACS Initial Sample Design Governmental Unit Size: Estimate of Occupied Housing Units ACS 1-year Sampling Rates 0-200 10.0% 201-800 ~7.0% 801-1200 ~3.5% Census Tract Size 2000 or less ~2.4% Over 2000 ~1.7% 15

ACS Sample Design Rate Definitions 2005 to 2010 Sampling rates function of base rate (BR) One fixed rate stratum Stratum Block MOS Criteria Sampling Rates 5 0 < GUMOS 200 10% (fixed) 2 200 < GUMOS 800 3 BR 3 800 < GUMOS 1,200 1.5 BR 1 TRACTMOS 2,000 BR 4 2,000 < TRACTMOS 0.735 BR 16

Reallocation of the HU Address Sample - Improvement Increase the number of sampling strata Smaller stratum intervals allows smoother transitions between rates Increase sampling rates for blocks in the very smallest governmental units Increase reliability of the estimates 17

Reallocation of the HU Address Sample 2011 Stratification New Stratification (small GUs) increased number of fixed rate strata increased the rates Stratum Block MOS Criteria Sampling Rates 1 0 < GUMOS 200 15% (fixed) 2 200 < GUMOS 400 10% (fixed) 3 400 < GUMOS 800 7% (fixed) 4 800 < GUMOS 1,200 2.8 BR ~ 5% 18

Reallocation of the HU Address Sample 2011 Stratification Stratum Block MOS Criteria Sampling Rates 5 0 < TRACTMOS 400 3.5 BR 6 0 < TRACTMOS 400 H.R. 0.92 3.5 BR 7 400 < TRACTMOS 1,000 2.8 BR 8 400 < TRACTMOS 1,000 H.R. 0.92 2.8 BR 9 1,000 < TRACTMOS 2,000 1.7 BR 10 1,000 < TRACTMOS 2,000 H.R. 0.92 1.7 BR 11 2,000 < TRACTMOS 4,000 BR 12 2,000 < TRACTMOS 4,000 H.R 0.92 BR 13 4,000 < TRACTMOS 6,000 0.6 BR 14 4,000 < TRACTMOS 6,000 H.R. 0.92 0.6 BR 15 6,000 < TRACTMOS 0.35 BR 16 6,000 < TRACTMOS H.R. 0.92 0.35 BR 19

Sub-sampling Rates Nonresponse Follow-up Address and Tract Characteristics Sub-sampling Rate Unmailable Addresses 2 in - 3 Mailable addresses with the lowest mail/cati rates 1 in 2 Mailable addresses in tracts with average mail/cati rates 2 in 5 Other mailable addresses 1 in 3 20

Weighting and Estimation

Annual Weighting Process 3 Major Components Initial weights to reflect the probability of selection Adjust weights of interviewed households to account for noninterviews Adjust weights to independent housing unit and population estimates (controls) 22

Initial Weight Probabilities of Selection Initial probability of selection is assigned as a function of the sample design Nonresponse follow-up (Personal Visit CAPI) sample design 23

Nonresponse Adjustment The weight of the nonrespondents is transferred to the respondents Nonresponse adjustment is carried out at the census tract level for groups of households with characteristics correlated with nonresponse: Census tract Type of building (single vs. multi-unit) Month of data collection 24

Ratio Adjustments to Housing Unit and Population Controls Post-censal estimates are produced by updating the previous census results using various administrative records data In a multi-stage process, housing unit and population adjustment ratios are applied to the weights Applied at the county (or group of counties) level by sub-county areas and race/ethnicity and age/sex groups. 25

Ratio Adjustments to Controls - Why? Reduce variability of the estimates Reduce bias Undercoverage of housing units Undercoverage of people within housing units 26

Reliability of ACS Estimates Have we learned anything? 27

ACS Sample Design Sample size about 3.54 M addresses on an annual basis Stratification and sample allocation Similar to census long form Unlike census long form, only a sample is selected for personal visit during non-response follow-up 28

Sample Size Effect on Reliability Census 2000 LF Sampling Rate = 1- in 6 Planned ACS Sampling Rate = 1- in - 8 Reliability of ACS estimates relative to LF 1.25 29

Non-response Follow up Sampling Differential Response Rates Mail/CATI response rates continue to decline below the levels assumed during the planning phase of the survey Substantial variation exists in mail response rates by geography Census tracts with high proportions of African American and Hispanic origin populations tend to have lower mail and telephone response rates (Griffin 2005) 30

Mail/CATI Response Rates Census Tracts Source: 2008 and 2009 ACS Size (Occ. HU s) Median (%) Range (Q3 Q1) 0-399 31.8 35.8 400-999 45.8 31.1 1000 1999 57.2 26.3 2000 3999 59.6 21.8 4000 5999 58.3 18.0 Over 6000 61.0 15.4 31

Effect of Non-response Follow up Sampling (CAPI) Estimated from the 2005-2009 ACS preliminary weighted files The range of the median effect is 1.25 1.28 32

Effect of Population Controls Source: Small Area Estimates Research Research on 1999-2005 ACS data showed that standard errors for small areas census tracts and places - were much higher than anticipated. Lack of tract-level controls identified as a leading cause contributing to an increase of between 15 25 percent. (Starsinic 2006) 33

ACS Reliability Source: 2005 2009 ACS The survey was designed to produce estimates with CV about 1.33 the CV of corresponding long form estimates The most current results show that the CV is about 1.75 the CV of corresponding long form estimates 34

Preliminary Tract CV Distribution for % Persons in Poverty CV 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0 0-400 400-1000 1001-2000 2001-4000 4001-6000 6001+ Size of Tract (Occupied Housing Units) Source: 2005 2009 ACS 5-year Data 35

CV 1.1 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0 Preliminary Tract CV Distribution for % Unemployed 0-400 400-1000 1001-2000 2001-4000 4001-6000 6001+ Size of Tract (Occupied Housing Units) Source : 2005 2009 ACS 5-year Data 36

ACS Today Successes and Challenges Developed and implemented a sampling plan that over-samples census tracts with lower than average response by mail and phone. Developed a model-assisted estimation application that relies on auxiliary information to reduce variance for census tracts and medium-sized places. Use of experienced interviewers (including bilingual) facilitate high levels of survey response. 37

Group Quarters Small Area Estimation Research Improving the GQ estimation process, specifically for small areas such as census tracts NAS Panel on measuring GQ population 38

Additional Improvements for FY11 The ACS sample was expanded from 2.9 M to 3.54 M housing unit addresses Improved stratification to produce a more equitable distribution of tract level estimates with respect to reliability Updated population controls based on the 2010 Census 39

Additional Improvements 2 100 Percent Follow up of non mailable addresses in American Indian areas with high concentrations of Native Population Enhanced Variance Estimates will reduce margins of error by 3-5 %. 40

Lessons Learned and Summary Thoughts Do not overpromise Emphasis on Research and Evaluation Secure resources with specific knowledge and skills set to accomplish broad spectrum of objectives 41

Contact Information U.S. DEPARTMENT OF COMMERCE U.S. Census Bureau Washington, DC 20233 Alfredo Navarro Room 4K071 Phone: 301-763-3600 Email: ACS Alfredo.Navarro@census.gov Chief Statistician 42