Working with United States Census Data. K. Mitchell, 7/23/2016 (no affiliation with U.S. Census Bureau)


Outline
- Types of Data Available
- Census Geographies & Timeframes
- Data Access on Census.gov website
- American FactFinder Data Sets
- Data topics in the ACS
- Using the American FactFinder
- Census Bureau APIs
- R packages to work with Census Data
- What's the deal with PUMS data?
- Geography: Public Use Microdata Areas (PUMAs)
- Integrated Public Use Microdata Series: University of Minnesota
- Questions?

Types of Data Available from Census.gov
- Decennial Complete Census
  - Short form, 100% of United States residents
- American Community Survey
  - Long form, sample of residents
  - Many more questions, much smaller sample size
- Topologically Integrated Geographic Encoding and Referencing database (TIGER)
  - https://tigerweb.geo.census.gov/tigerwebmain/tigerweb_main.html
- Other data collected/reported by the Census Bureau

Census Geographies (partial list)
- Whole United States
- Region: Northeast, Midwest, South, West
- Division: New England, Middle Atlantic, East North Central, West North Central, South Atlantic, East South Central, West South Central, Mountain, Pacific
- State
- County
- County Subdivision
- Zip code
- Census Block
- Public Use Microdata Areas (PUMA)
- Congressional District
- School District
- Metropolitan Statistical Area / Micropolitan Statistical Area

Census Timeframes (since 2000)
- Decennial Census
  - Once every 10 years, complete data
- American Community Survey
  - Sample taken every year, mandatory participation
  - Results available as 1-year, 3-year, and 5-year estimates
  - 1-year estimates are available only for larger geographical areas; each represents a sample of roughly 1% of the U.S. population
  - 3-year and 5-year estimates are available for smaller geographical areas
  - A 5-year estimate represents roughly a 5% sample of the U.S. population

Data Access on the Census Bureau Website (census.gov)
- Already Processed Online (easiest to use)
  - Visualizations
  - Quick Facts
  - Easy Stats
  - And more!
- Summary Datasets (medium difficulty)
  - American FactFinder
    - Community Facts
    - Guided and Advanced Search
    - Download Center
- Individual Survey Records (most difficult to use)
  - 5% Public Use Microdata Sample (PUMS)
  - Individual replies to census surveys

American FactFinder: Available Data Sets
- American Community Survey
- American Housing Survey
- Annual Economic Surveys
- Annual Surveys of Governments
- Census of Governments
- Decennial Census
- Economic Census
- Equal Employment Opportunity (EEO) Tabulation
- Population Estimates Program
- Puerto Rico Community Survey

American Community Survey Topics
- Demographic: Age and Sex * Group Quarters Population * Hispanic or Latino Origin * Race * Relationship * Total Population
- Housing: Computer Ownership & Internet Access * House Heating Fuel * Kitchen Facilities * Occupancy/Vacancy Status * Occupants per Room * Owner Monthly Costs * Plumbing Facilities * Rent Statistics * Rooms * Bedrooms * Telephone Service Available * Tenure * Units in Structure * Value of Home * Vehicles Available * Year Householder Moved Into Unit * Year Structure Built
- Economic: Class of Worker * Commuting to Work/Journey to Work * Employment Status * Food Stamps/Supplemental Nutrition Assistance Program (SNAP) * Health Insurance Coverage * Income and Earnings * Industry and Occupation * Poverty * Work Status
- Social: Ancestry * Citizenship Status * Disability Status * Educational Attainment * Fertility * Field of Degree * Grandparents as Caregivers * Language * Marital History * Marital Status * Place of Birth * School Enrollment * Residence 1 Year Ago/Migration * Veterans * Year of Entry

American FactFinder: https://factfinder2.census.gov/

American FactFinder: Guided Search

American FactFinder: Search results

American FactFinder: View and Modify Table

American FactFinder: Download Table

ACS Measures of Statistical Variability
- Coefficient of Variation
  - The coefficient of variation (CV), also known as relative standard deviation (RSD), is a standardized measure of dispersion of a probability distribution or frequency distribution. It is often expressed as a percentage, and is defined as the ratio of the standard deviation σ to the mean µ (or its absolute value, |µ|).
- Margin of Error, % Margin of Error
  - The margin of error is the difference between an estimate and its upper or lower confidence bound. All ACS published margins of error are based on a 90 percent confidence level.
  - Standard Error = Margin of Error / 1.645
  - Lower Confidence Bound = Estimate - Margin of Error
  - Upper Confidence Bound = Estimate + Margin of Error
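The formulas above can be applied directly to a published estimate and its margin of error. The following is a minimal sketch in base R, assuming the 90 percent confidence level stated above. The function name `acs_variability` and the sample numbers are hypothetical, and the CV is computed here as the standard error divided by the estimate, expressed as a percentage.

```r
# Hypothetical helper: derive the standard error, CV, and confidence bounds
# from an ACS estimate and its published 90 percent margin of error.
acs_variability <- function(estimate, moe) {
  se <- moe / 1.645                       # Standard Error = Margin of Error / 1.645
  data.frame(
    estimate = estimate,
    moe      = moe,
    se       = se,
    cv_pct   = 100 * se / abs(estimate),  # CV as a percentage of the estimate
    lower    = estimate - moe,            # lower 90% confidence bound
    upper    = estimate + moe             # upper 90% confidence bound
  )
}

# Made-up example values, not real ACS figures
result <- acs_variability(estimate = c(12000, 850), moe = c(329, 411.25))
print(result)
```

A larger CV indicates a less reliable estimate, which is common for small geographies or small population groups.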

American FactFinder: https://factfinder2.census.gov/

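Download the data profile tables: DP02 (social characteristics), DP03 (economic characteristics), DP04 (housing characteristics), DP05 (demographic and housing estimates)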

Census Data APIs
- The Census Bureau has an ongoing initiative to enable your applications to access data.
- Sign up for their newsletter: https://public.govdelivery.com/accounts/uscensus/subscriber/new?topic_id=uscensus_7480
- New API dataset discovery tool (beta): http://api.census.gov/data.html
- Request an API key at this link: http://api.census.gov/data/key_signup.html
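Once a key has been issued, a request to the API is an ordinary URL. The sketch below only assembles such a URL in base R and does not contact the server. The helper name `census_api_url`, the endpoint layout, the dataset name `acs5`, and the variable ID are all assumptions for illustration; check them against the dataset discovery tool before relying on them.

```r
# Hypothetical helper: assemble a Census Data API query URL.
# The layout (year/dataset, get=, for=, key=) is an assumption to verify
# against the API discovery tool; nothing is sent over the network here.
census_api_url <- function(year, dataset, variables, geography, key) {
  paste0(
    "https://api.census.gov/data/", year, "/", dataset,
    "?get=", paste(variables, collapse = ","),  # comma-separated variable list
    "&for=", geography,                         # geography predicate, passed as-is
    "&key=", key
  )
}

url <- census_api_url(
  year      = 2014,
  dataset   = "acs5",                    # ACS 5-year estimates (assumed dataset name)
  variables = c("NAME", "B01003_001E"),  # total population (assumed variable ID)
  geography = "state:*",                 # all states
  key       = "YOUR_API_KEY"             # placeholder, not a real key
)
print(url)
```

The resulting string can be pasted into a browser or passed to a JSON reader of your choice.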

R Packages to work with census data API
- acs: Download, Manipulate, and Present American Community Survey and Decennial Data from the US Census
  - Provides a general toolkit for downloading, managing, analyzing, and presenting data from the U.S. Census, including SF1 (Decennial short-form), SF3 (Decennial long-form), and the American Community Survey (ACS).
  - Confidence intervals provided with ACS data are converted to standard errors to be bundled with estimates in complex acs objects.
  - Package provides new methods to conduct standard operations on acs objects and present/plot data in statistically appropriate ways.
  - Current version is 2.0 +/- .033.
  - Requires API key
- choroplethr to map census data (internally calls acs package)
- choroplethrZip to map census data by zip code (accessible via github): https://github.com/arilamstein/choroplethrzip

Example: Choropleths of industry participation by zip code

    library(choroplethrZip)  # package name is case-sensitive (capital Z)
    library(mapproj)
    library(ggplot2)

    # DP03 table export, as named in the original slides
    econ_zip <- read.csv("acs_14_5yr_dp03_with_annkm.csv")

    # Characters 7-11 of the geography label hold the 5-digit zip code;
    # column 154 holds the percentage to map.
    z <- data.frame(
      region = substr(as.character(econ_zip$geography), 7, 11),
      value  = as.numeric(as.character(econ_zip[[154]])),
      stringsAsFactors = FALSE
    )
    zip_choropleth(z, state_zoom = "new jersey",
                   title = "% in Transportation and warehousing, and utilities") +
      coord_map()

    # Same map for column 158
    z$value <- as.numeric(as.character(econ_zip[[158]]))
    zip_choropleth(z, state_zoom = "new jersey", title = "% in Information") +
      coord_map()
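The data preparation step in the example above can be tried without the CSV file or the mapping packages. This sketch uses a made-up stand-in for `econ_zip`, assuming geography labels of the form "ZCTA5 08540" (so characters 7 to 11 are the zip code) and a "-" marker for missing values. The column name `pct_info` is hypothetical.

```r
# Toy stand-in for the downloaded table; all values are made up.
econ_zip <- data.frame(
  geography = c("ZCTA5 07001", "ZCTA5 08540", "ZCTA5 08901"),
  pct_info  = c("2.1", "4.7", "-"),  # "-" mimics a missing-value marker (assumption)
  stringsAsFactors = TRUE            # older read.csv() defaults produced factors
)

# Build the two-column region/value layout used in the example above.
# as.character() comes first so a factor yields its labels, not its integer codes.
z <- data.frame(
  region = substr(as.character(econ_zip$geography), 7, 11),
  value  = suppressWarnings(as.numeric(as.character(econ_zip$pct_info))),
  stringsAsFactors = FALSE
)

# Drop rows whose value could not be converted to a number (a precaution)
z <- z[!is.na(z$value), ]
print(z)
```

Keeping `region` as a character column preserves leading zeros, which New Jersey zip codes depend on.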

What's the deal with PUMS data?
- American Community Survey (ACS) Public Use Microdata Sample (PUMS) files
  - The full range of population and housing unit responses collected on individual ACS questionnaires
  - Each record in the file represents a single person, or, in the household-level dataset, a single housing unit.
  - PUMS files for an individual year, such as 2014, contain data on approximately one percent of the United States population.
  - PUMS files covering a five-year period, such as 2010-2014, contain data on approximately five percent of the United States population.
- PUMS datasets on census.gov: https://www.census.gov/programs-surveys/acs/data/pums.html
- Integrated Public Use Microdata Series maintained by the University of Minnesota: https://usa.ipums.org/usa/
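Because each PUMS record stands in for many people, tabulations are normally built by summing a person weight rather than counting rows. The sketch below uses invented records and assumes the ACS PUMS column names `ST` (state code), `AGEP` (age), and `PWGTP` (person weight); confirm the names against the data dictionary for the file you download.

```r
# Made-up person records; with a roughly 1% sample, weights average near 100.
pums <- data.frame(
  ST    = c(34, 34, 34, 36, 36),      # state code (assumed column name)
  AGEP  = c(34, 7, 68, 45, 12),       # age in years (assumed column name)
  PWGTP = c(95, 110, 80, 120, 100)    # person weight (assumed column name)
)

# Estimated population per state: sum the weights, do not count the rows
pop_by_state <- aggregate(PWGTP ~ ST, data = pums, FUN = sum)
names(pop_by_state)[2] <- "est_population"
print(pop_by_state)

# Estimated number of people under 18 across all records
under18 <- sum(pums$PWGTP[pums$AGEP < 18])
print(under18)
```

The same pattern extends to any subgroup: filter the records, then sum the weights.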

Geography: Public Use Microdata Areas (PUMAs)
Public Use Microdata Areas (PUMAs) are statistical geographic areas defined for the dissemination of Public Use Microdata Sample (PUMS) data. They are also used for disseminating American Community Survey (ACS) and Puerto Rico Community Survey period estimates.
2010 PUMAs:
- Nest within states or equivalent entities
- Contain at least 100,000 people
- Cover the entirety of the United States, Puerto Rico, Guam, and the U.S. Virgin Islands
- Are built on census tracts and counties
- Should be geographically contiguous

Integrated Public Use Microdata Series: University of Minnesota
- Easy-to-use interface
  - A series of drop-downs allows the selection of specific variables instead of dealing with the entire dataset at once.
  - Once the data selection is made, an extract may be requested in the desired format.
- Harmonized variables with other data sets, including
  - IPUMS USA: U.S. Census, ACS
  - IPUMS International: official censuses from other countries; at least one sample from 82 countries
  - IPUMS CPS: Current Population Survey; Bureau of Labor Statistics (BLS), primary source of labor force statistics for the United States population

IPUMS USA: Select Variables via dropdown

IPUMS USA: Variables / data sets / codes

IPUMS USA: Select Samples

Examples: PUMS Data Usage
- Visualization of U.S. Household Configurations: "Most Common Family Types in America", Nathan Yau, http://flowingdata.com/2016/07/20/modern-family-structure/

Questions?