The Accuracy and Coverage of Internet based Data collection for Korea Population and Housing Census

Similar documents
2012 UN International Seminar for Global Agenda - The Population and Housing Census. Hyong-Joon Noh Statistics Korea

Article. The Internet: A New Collection Method for the Census. by Anne-Marie Côté, Danielle Laroche

Population Censuses and Migration Statistics. Keiko Osaki Tomita, Ph.D.

The Internet Response Method: Impact on the Canadian Census of Population data

Population and dwellings Number of people counted Total population

Strategies for the 2010 Population Census of Japan

Collection and dissemination of national census data through the United Nations Demographic Yearbook *

The progress in the use of registers and administrative records. Submitted by the Department of Statistics of the Republic of Lithuania

Population and dwellings Number of people counted Total population

Internet Survey Method in the Population Census of Japan. -- Big Challenges for the 2015 Census in Japan -- August 1, 2014

Overview of the Course Population Size

Using registers E-enumeration and CAPI Electronic map. Census process. E-enumeration. Census moment and census period E-enumeration process

9 th World Telecommunication/ICT Indicators Meeting (WTIM-11) Mauritius, 7-9 December 2011

Keynote Speech for the International Seminar on Population and Housing Censuses in a Changing World. Seoul, South Korea November 27 29, 2012

Economic and Social Council

Year Census, Supas, Susenas CPS and DHS pre-2000 DHS Retro DHS 2007 Retro

Economic and Social Council

; ECONOMIC AND SOCIAL COUNCIL

population and housing censuses in Viet Nam: experiences of 1999 census and main ideas for the next census Paper prepared for the 22 nd

Census Response Rate, 1970 to 1990, and Projected Response Rate in 2000

Neighbourhood Profiles Census

Removing Duplication from the 2002 Census of Agriculture

The Demographic situation of the Traveller Community 1 in April 1996

A Special Case of integrating administrative data and collection data in the context of the 2016 Canadian Census

Population of Edinburgh Census Online - Old Edinburgh Club

Maintaining knowledge of the New Zealand Census *

Key Words: age-order, last birthday, full roster, full enumeration, rostering, online survey, within-household selection. 1.

Comparing the Quality of 2010 Census Proxy Responses with Administrative Records

2020 Census Update. Presentation to the Council of Professional Associations on Federal Statistics. December 8, 2017

5 TH MANAGEMENT SEMINARS FOR HEADS OF NATIONAL STATISTICAL OFFICES (NSO) IN ASIA AND THE PACIFIC SEPTEMBER 2006, DAEJEON, REPUBLIC OF KOREA

Decision Making Process for Adoption of Electronic Data Collection. Dr. Amara Satharasinghe Department of Census & Statistics Sri Lanka

2016 Census Profile on the Town of Richmond Hill

Country report Germany

Country Paper : Macao SAR, China

FINANCIAL PROTECTION Not-for-Profit and For-Profit Cemeteries Survey 2000

A gender perspective on the 2005 Census of Korea (R.O.K) Focusing on Economic Activity, and Living Expense of the Aged.

2020 Census: How Communities Can Prepare

Supplementary questionnaire on the 2011 Population and Housing Census FRANCE

Economic and Social Council

Neighbourhood Profiles Census and National Household Survey

2016 Census of Population: Age and sex release

Neighbourhood Profiles Census and National Household Survey

1996 CENSUS: ABORIGINAL DATA 2 HIGHLIGHTS

2010 World Population and Housing Census Programme. United Nations Statistics Division

Italian Americans by the Numbers: Definitions, Methods & Raw Data

NEW COLLECTION METHODOLOGY IN THE 2006 CENSUS OF POPULATION

0-4 years: 8% 7% 5-14 years: 13% 12% years: 6% 6% years: 65% 66% 65+ years: 8% 10%

Supplementary questionnaire on the 2011 Population and Housing Census SLOVAKIA

Methodology Statement: 2011 Australian Census Demographic Variables

1981 CENSUS COVERAGE OF THE NATIVE POPULATION IN MANITOBA AND SASKATCHEWAN

How It Works and What s at Stake for Massachusetts. Wednesday, October 24, :30-10:30 a.m.

1 NOTE: This paper reports the results of research and analysis

SURVEY ON USE OF INFORMATION AND COMMUNICATION TECHNOLOGY (ICT)

FOREWORD. [ ] FAO Home Economic and Social Development Department Statistics Division Home FAOSTAT

Canada Agricultural Census 2011 Explanatory notes

Economic and Social Council

SESSION 3: ESSENTIAL FEATURES, DEFINITION AND METHODOLOGIES OF POPULATION AND HOUSING CENSUSES: MALAYSIA

2011 National Household Survey (NHS): design and quality

Workshop on Census Data Evaluation for English Speaking African countries

Vietnam - Household Living Standards Survey 2004

Supplementary questionnaire on the 2011 Population and Housing Census SWITZERLAND

CENSUS DATA COLLECTION IN MALTA

ECE/ system of. Summary /CES/2012/55. Paris, 6-8 June successfully. an integrated data collection. GE.

THE 2009 VIETNAM POPULATION AND HOUSING CENSUS

Working with NHS and Taxfiler data to measure income and poverty in Toronto neighbourhoods

Prepared by. Deputy Census Manager Zambia

Municipal Census Manual

The main focus of the survey is to measure income, unemployment, and poverty.

census 2016: count yourself in

The 2020 Census A New Design for the 21 st Century

Follow your family using census records

Aboriginal Demographics. Planning, Research and Statistics Branch

POPULATION AND HOUSING CENSUS MALAYSIA 2010 NEW APPROACHES AND TECHNOLOGICAL ADVANCEMENTS

SOURCE: Malaysian Communications and Multimedia Commission, Malaysia TITLE: Primary, Secondary and Administrative Data in Telecommunications

1999 AARP Funeral and Burial Planners Survey. Summary Report

Planning for the 2010 Population and Housing Census in Thailand

1940 QUESTIONNAIRE CENSUS OF VACANT DWELLINGS

Country presentation

Introduction Strategic Objectives of IT Operation for 2008 Census Constraints Conclusion

Trends, Data and Definitions The Household Reference Person. Greg Ball BSPS Council & independent consultant

Housekeeping items. Bathrooms Breaks Evaluations

LOGO GENERAL STATISTICS OFFICE OF VIETNAM

Economic and Social Council

UK Data Service Introduction to Census

Thailand s Planning for the next Census in 2020 (Draft ) Thailand s Team National Statistical Office,Thailand 24 Jan.

Community Radio. National Listener Survey Wave #1 FACT SHEET ACT. July Prepared for:

SESSION 11. QUALITY ASSESSMENT AND ASSURANCE IN THE CIVIL REGISTRATION

American Community Survey 5-Year Estimates

American Community Survey 5-Year Estimates

2018 End-to-End Census Test: Peak Operations. Deborah Stempowski Decennial Census Management Division

Quality assessment in a register-based census administrative versus statistical concepts in the case of households

1) Analysis of spatial differences in patterns of cohabitation from IECM census samples - French and Spanish regions

Lesson Learned from the 2010 Indonesia Population and Housing Census Dudy S. Sulaiman, BPS-Statistics Indonesia

Community Radio. National Listener Survey Wave #1 FACT SHEET NON-METRO QLD. July Prepared for:

Reengineering the 2020 Census

Chapter 1: Economic and Social Indicators Comparison of BRICS Countries Chapter 2: General Chapter 3: Population

Road to the 2020 Census October 13, :15 p.m. 5:15 p.m. WEBINAR Presentation for: South Dakota - State Data Center s 5 th Annual Demography

Census 2000 and its implementation in Thailand: Lessons learnt for 2010 Census *

The 2020 Census: A New Design for the 21 st Century Deirdre Dalpiaz Bishop Chief Decennial Census Management Division U.S.

The Population Estimation Survey (PESS)

Transcription:

24 th Population Census Conference Hong Kong, March 25-27, 2009 The Accuracy and Coverage of Internet based Data collection for Korea Population and Housing Census By Jin-Gyu Kim & Jae-Won Lee Korea National Statistical Office

The Accuracy and Coverage of Internet-based Data Collection for Korea Population and Housing Census 1. Introduction The Population and Housing Census of Korea has been conducted based on a five-year cycle since 1925. The census results have played an important role in national and sub-national policies and plans for socioeconomic development, research, and business purposes. The census results provide valuable insight on the economic, social and demographic conditions and trends of Korean society. Until the 1995 Population and Housing Census, the primary enumeration method was the canvasser (or enumerator) method which was conducted through drop-off and collect questionnaires by enumerators. Only a small portion of questionnaires were returned using the householder method. Enumerators visited every household in the country and gathered information concerning each individual, household and living quarter. The householder method, in which household members enter the information on the paper questionnaire that is dropped-off by an enumerator and return to enumerator in the return envelope supplied within the packet, was offered when respondents desired the householder methods to keep their privacy. The environment for the canvasser method has been deteriorating. The increase of one-person households and dual-income households with busy life styles make it difficult to contact households. The Korea National Statistical Office (KNSO) had a favorable circumstance for the Internet questionnaire method of the 2005 census. Korea had the highest level information technology and high-speed Internet penetration in the world. - 2 -

For the 2005 Population and Housing Census, the experimental Internet questionnaire option was introduced to overcome hard-toenumerate circumstances and ever-increasing census cost, and to protect respondents' privacy. While the traditional enumerator method remained for a majority of the households, the KNSO developed an Internet questionnaire option system which enables about 2% of the households to submit their census questionnaire via the Internet. However, actual Internet penetration rates of the 2005 census was 0.9% because advertising campaigns were focused on the smaller percentage of hard-to-enumerate inhabitants instead of the larger general public. Through the experience of the Internet questionnaire option of the 2005 census and several pre-tests for the 2010 census, the KNSO ascertained the possibility of Internet questionnaire option and discovered some advantages as compared to the enumerator method such as data quality, census-taking cost, and confidentiality. Therefore, the KNSO set a target Internet penetration rate of 30% for the 2010 Population and Housing Census which is a marked advancement from 0.9% of the 2005 census. To expand the Internet questionnaire option of the 2010 census and its pre-test, the pull & push strategy was adapted and applied. For the pull strategy, the KNSO strengthened public campaigns for the Internet questionnaire option and provided incentives such as promotional goods or gift certificates. The incentives were offered to the participants of the Internet questionnaire option by lot. For the push strategy, the questionnaire was replaced with a letter asking respondents to complete their census questionnaire on-line was distributed to the households. On the 3 rd pre-test, which was conducted on October 2008, the KNSO expanded the Internet penetration rate to 22.1% using the pull & push strategy. - 3 -

As the Internet questionnaire option has become one of major data collection methods of the 2010 census, the need to check the mode effect of the Internet questionnaire option is on the increase. This paper aims at analyzing the accuracy and coverage of the Internet questionnaire option for inclusion in the 2010 census. 2. Accuracy of Internet Based Data Collection Internet based data collection is increasingly used for sample surveys as well as the population and housing censuses. The accuracy of data should be measured for the expansion of the Internet option in several areas such as respondent error, processing error, non-response and coverage error. The Internet questionnaire seems to provide more reliable information through interactive control of responses such as online checking of completeness, automated skips, interactive aids and explanations. Some countries which have utilized the Internet questionnaire option on the census reported advantages of Internet option for data quality. Following their 2006 census, Statistics Canada reported that the data quality from the Internet questionnaire option was higher than other data collection methods, and the edit failure rates of the data from the Internet questionnaire option were much lower than those of the paper questionnaire. Moreover, the item non-response rates of the Internet questionnaire option demonstrated lower rates as compared to the paper questionnaire. The Swiss Federal Statistical Office conducted their 2000 census on the Internet and reported Better data quality and more reliable information thanks to interactive control of the survey. - 4 -

The data quality of the Korean census on the Internet will examine the aspects of respondent error, non-response, coverage error, and data processing error. Regarding the processing error, the Internet questionnaire option can skip the data input stage among all data input and editing process. Therefore, the Internet questionnaire option can reduce the data processing error as much as data input error. The data input error rates of the 2005 census totaled 0.19%. Respondent error The mode effect of Internet questionnaire method can affect on the census data quality. This means a respondent s answer could differ between the data from the Internet questionnaire and the paper questionnaire by an enumerator in an interview. To examine the mode effect of the Internet, the data from the 2005 census and the post-enumeration survey was matched and compared to discern the level of correspondence between answers. Table 1 indicates that the correspondence rates were slightly different according to the questions. In age and marital status questions, the correspondence rates of the Internet questionnaire method were higher. Additionally, in the questions of relationship to the head of households, the correspondence rates of enumerator interview method were higher as well. In general, the data quality of the Internet questionnaire method has been satisfactory and respondents tend to provide answers more frankly to questions related to privacy such as age and marital status. Table 1. Correspondence rates between the 2005 census and the postenumeration survey Interview Internet Age 98.7% 99.0% Relationship to the head of households 99.3% 99.1% Marital status 98.9% 99.9% - 5 -

Non-response error The item non-response rates are regarded as one of indicators for data quality. Through the online checking of completeness of answers, item non-response rates can be reduced on the Internet questionnaire method. On the 3 rd pre-test for the 2010 census which was conducted in October 2008, the average item non-response rates were 1.7%. However, item nonresponse rates of the Internet questionnaire method showed much lower rates than those of the interview and mail-returned questionnaire. The overall item non-response rates of the Internet questionnaire method were 0.01%. And it was 2.1 % for the interview method, and 2.2% for mail-returned methods. For most of the questions that were designed to be answered by a checked-box or number, item non-response rates of the Internet questionnaire method were close to 0.0%. However, questions which were to be answered with typed language such as job and occupation showed 3.2% and 1.1% of non-response rates respectively. Table 2. Item non-response rates by data collection methods Item non-response rates Total Interview Mail Internet Total 1.7 % 2.1% 2.2% 0.01% Question for household member 2.3% 2.9% 2.9% 0.02% Question for household 1.0% 1.2% 1.3% 0.00% Question for house 0.1% 0.1% 0.1% 0.00% - 6 -

Coverage error The KNSO conducted an evaluation survey on October 2006 for the 2005 census s Internet data collection method. According to the results of the evaluation survey, coverage error rates of the Internet questionnaire method were lower than those of the 2005 census. This means that there were fewer missing and duplicated answers of the Internet questionnaire method than the enumerator method. The disparity can be explained by the flexibility of the Internet questionnaire system and also by the difference of characteristics of respondents between the Internet and enumerator method. Validation messages and user-friendly explanations using the pictures for the concept about usual residence seem to contribute in preventing omitted or duplicated enumeration. Persons who possess a higher education tend to participate more via the Internet option. In addition, young persons also tend to participate more via the Internet questionnaire option. These characteristics of Internet participants also may affect to the decrease of coverage error. Table 3. Coverage error for household members All respondent Respondent via internet Missing rates 1.49% 0.4% Duplication rates 2.39% 0.8% Net coverage rates 0.90% 0.4% Total Coverage rates 3.88% 1.20% * Foreigners coverage rates are not included - 7 -

Edit failure rates The edit failure rates are regarded as one of indicators for data quality because edit failure rates indicate the amount of inaccurate answers included in the data file. The edit failure rates of the Internet questionnaire method are much lower than those of other data collection methods. From the 1 st to 3 rd pre-test for the 2010 census, the overall edit failure rates per household were 2.0 cases. On the other hand, the edit failure rates for the Internet questionnaire method was 1.2 cases which was lower than the 1.9 cases for interviewed data and 2.4 cases for mail-returned data. Table 4. Edit failure rates per household of the 1~3 pre-test Total Internet Interview Mail 2.0 1.2 1.9 2.4 (Unit : case ) 3. Coverage of Korean Census on the Internet The coverage of the Internet questionnaire method can affect data quality of the census results since data quality is different from data collection methods. The coverage of the data from the Internet questionnaire method among all census data has been increasing from the 2005 census to last year s 3 rd pre-test. In the 2005 census, the Internet questionnaire method covered only 0.9% of all respondents. Moreover, it covered 13.3% for the 1 st pre-test, 3.9% for the 2 nd pre-test, and 22.1% for the 3 rd pre-test. The more respondents that answer via Internet, the higher the data quality that can be achieved since the Internet questionnaire method showed higher data quality than the other data collection methods. - 8 -

Table 5. Coverage of Internet among all census answers (Internet penetration rates) (Unit : %) 2005 Census 1 st pre-test 2 nd pre-test 3 rd pre-test 0.9 13.3 3.9 22.1 The coverage of the Internet questionnaire method varies according to the groups of participants characteristics. The total coverage rates of the Internet questionnaire method from the 1 st pre-test to 3 rd test totaled 15.5%. Furthermore, there was no gap between men and women. However, regarding the age, younger persons tended to respond more via the Internet. This was mainly a result of advertising campaigns in schools and gaps of Internet accessibility between the young and old generation. The younger generation seems to be over-represented in terms of the results of the Internet questionnaire method. Table 6. Coverage of respondents via Internet among all respondents by age group ( Unit : % ) Age Coverage Age Coverage Total 15.5 30-39 17.0 Under 10 20.0 40-49 16.6 10-19 19.0 50-59 12.4 20-29 17.2 60+ 7.3 According to coverage by marital status, the never-married group showed the highest coverage of 15.6% followed by the married group (15.1%), divorced group (8.8%), and widowed group (8.2%). - 9 -

By the type of household, household that consisted of family and non-family members group showed the highest coverage of 25.8% followed by the one-family household (14.5%), one-person household (5.6%), and households of persons who have no blood ties group (4.1%). The oneperson household group and households of persons who have no blood ties group have proved to be hard-to-enumerate groups with both the Internet questionnaire method and enumerator data collection method. Table 7. Coverage of respondents via Internet among all respondents by type of household ( Unit : % ) Type of household Coverage Type of household Coverage Total 12.7 One-person households 5.6 One-family household 14.5 Households with no blood ties group 4.1 Household consisted a family and non-family members 25.8 - - Additionally, there is great difference in coverage rates by type of house. The persons who resided in apartment buildings participated via the Internet three times more than those who resided in ordinary house. By the type of occupancy of house, the persons who owned the house tended to participate via the Internet more than the persons who resided in a rent house. Coverage rates by type of house : apartment (18.0%), Ordinary house (6.1%) - 10 -

Coverage rates by type of occupancy of house : owned the house (14.7%), Rent (11.0%) 4. Conclusion The Internet questionnaire method will become one of the major data collection methods in the future Population and Housing Census. According to this study, mode effect exists between the Internet questionnaire method and enumerator method. In several aspects such as coverage error, item non-response rates, and edit-failure rates, the data accuracy of the Internet questionnaire method was higher than other data collection methods. Therefore, the expansion of the Internet questionnaire method to the Population and Housing Census will contribute to increase overall data quality of the census results. However, the coverage of the Internet questionnaire method was different according to the groups of participants characteristics. These differences may have a negative affect on time series analysis of census results by the characteristics of respondents. Therefore, the accuracy and coverage of the Internet questionnaire method should be further studied and also should be utilized for time series analysis of the census results. Reference Danielle Laroche. Statistics Canada (2005). 2004 Census of Population Test Evaluation of the Internet option. UNECE Work Session, Ottawa Statistics Canada (2006). Statistics Canada - 2006 Census on the Internet. CES Group of Experts on Population and Housing Censuses. Statistics New Zealand (2007). Implications of the Interent Census for the Management of Field Operations. UNECE/Eurostat meeting on Population - 11 -

and Housing Censuses. Wemer Haug (2001). Population Censuses on the Internet. IUSSP General Population Conference 2001. Swiss Federal Office. - 12 -