Master sampling frames for agricultural, rural and agroenvironmental statistics, methodological and practical issues
|
|
- Willa Hart
- 6 years ago
- Views:
Transcription
1 Master sampling frames for agricultural, rural and agroenvironmental statistics, methodological and practical issues Elisabetta, Carfagna Department of Statistical Sciences, University of Bologna Via Belle Arti 41, 40126, Bologna, Italy DOI: /icasVII.2016.f36d ABSTRACT Methodological and practical problems have to be faced when building a master sampling frame for agricultural, rural and agri-environmental statistics. This paper addresses some of them, focusing on quality and coverage issues and on the impact of increasing computational ability to handle massive data sets on the generation and updating of master sampling frames. Advantages, disadvantages and requirements of the combination of different kinds of frames and the main methods for linking frames at the design stage and at the estimation stage are analysed. A proposal for increasing the efficiency of the allocation of the sample units to the different combined frames is also discussed. Keywords: Master sampling frame, Multiple frames, Single and two-stage estimators 1. Introduction In this paper, we present an analysis of methodological and practical problems to be faced when building a master sampling frame for agricultural, rural and agri-environmental statistics. We start from the traditional approach for generating a master sampling frame for agricultural statistics and analyse the effect of incomplete or out of date sampling frames. In section 3, the impact of increasing computational ability to handle massive data sets on the generation and updating of master sampling frames is discussed. Then, other kinds of master sampling frames are taken into consideration (section 4). Section 5, presents a review of the main methods for linking frames at the design stage and at the estimation stage, focusing both on single-stage and two-stage estimators. In 1
2 section 6, we talk about the use of area sampling frames for collecting crop and agri-environmental data, the advantages, disadvantages and requirements when list frames are combined with area frames, with single-stage, as well as with two-stage estimators, and the difficulties in the identification of the farms selected through the area frame, according to the kind of area frame and adopted technological tools. Section 7 focuses on a method for improving the efficiency of the allocation of the sample units to the different combined frames. Finally, some conclusions are drawn. 2. The traditional approach for generating a master sampling frame A master sampling frame is a sampling frame that provides the basis for all data collections through sample surveys and censuses in a certain sector, allowing to select samples for several different surveys or different rounds of the same survey, as opposed to building an ad-hoc sampling frame for each survey. The aims of the development of a master sampling frame are: avoiding duplication of efforts, reducing statistics discrepancies, connecting various aspects of the sector, allowing the analysis of the sampling units from the different viewpoints, and having a better understanding of the sector. The traditional approach for producing agricultural statistics adopted in most developed countries is the following (see Benedetti et al. eds. 2010): a complete enumeration census is carried out every 5-10 years. Data are collected through mail, , personal interviews, computer assisted personal interviews, computer assisted telephone interviews, or the web. The census allows generating the list frame that is updated on the basis of administrative data, in the period between two successive censuses and is used for all kinds of sample surveys of farms; thus, it could be considered as a master sampling frame for agricultural statistics. An assessment of the quality of the data collected allows deciding if and how to use this list as a master sampling frame. For example, at the end of the data collection of the Italian agricultural census, a sample survey for assessing the quality of collected data was designed (Mazziotta, 2013). A stratified random sample of about 50,000 farms was selected and the farmers were interviewed through computer assisted telephone interviews in the period from 20 May 2011 to January This assessment showed that the complete enumeration census systematically underestimates the main structural variables that are generally used for stratification, when annual sample surveys are designed. In addition, the level of the bias varies in the different regions of the country, reducing the efficiency of the stratification. 3. Impact of increasing computational ability on the generation and updating of master sampling frames The unbiasedness of this kind of list frame depends on the level of under-coverage and overcoverage of the list at the census date and on the quality of data and the process used for updating the list after the census date. This updating process has become easier, due to great improvements in data base management, including geographic databases (GIS). Moreover, methodological developments for deterministic as well as for probabilistic record linkage have considerably increased the capacity to identify the same record in different lists. For the Italian agricultural census, a very accurate assessment of the coverage was carried out (Mazziotta, 2013) on the basis of an area sample. Around 1,500 sheets of cadastral maps (areal units in which each municipality is subdivided secondary sampling units) were selected from a sample of municipalities (primary 2
3 sampling units). The owners of the parcels in the selected sheets of cadastral maps were identified, on the basis of the cadastral archive, and interviewed. 21,588 farmers were interviewed ( active farms and temporary inactive farms were identified by the agricultural census). The estimates were computed in the framework of the indirect sampling (Lavallée, 2007), and the weights (Lavallée and Rivest, 2012) were assigned based on the selection probability of each sheet of cadastral map and the number of sheets in which a farm has parcels (derived from the interview). A sophisticated record linkage procedure was implemented in three successive steps: deterministic, probabilistic and manual, involving various kinds of administrative registers % of farms in the area frame were included in the census list; 5.2 % of farms in the area frame were present in the census list with different characteristics, 1.7 % of the farms in the area frame had multiple links with census list, and 11.7 % of the farms in the area frame had no link with the census list. Of course, the percentage of farms in the area and in the census list decreased for small farms: 71% and 78.2 % for farms with utilized agricultural area in the range ( hectares) and in the range ( hectares) respectively. This level of coverage is in line with most developed countries. These results of the quality assessment of the census data stimulates a reflection, if the main aims of the agricultural census are creating the list of all farms (including small ones) to be used as master sampling frame, with accurate structural information for stratification and producing estimates for very small administrative domains, at least once every 5-10 years. Various kinds of administrative registers are generally used for updating the census list. The quality of the result depends on the administrative data that can be used and on the consistency of the identifiers of the units in the different registers. The over and under coverage can be high even if good administrative data, very sophisticated record linkage procedures and geo-location of administrative information are used, as showed by the following experiment. Several kinds of administrative data were taken into consideration for updating the Italian census list in 2008 (8 years after the census). Main registers used were the lists related to farms that apply for subsidies, livestock farms, agrarian income, cadastre, taxes, social security and specific lists created by regional authorities. A sample of 15,682 units was selected out of a subset of 80 municipalities. Enumerators used a web-based data collection system developed on purpose, in order to ensure accurate data collection. The result was that only 39.15% of the farms included in the integrated list were considered existing and active by the test % of the farms in the integrated list were not active and 16.11% of them were not identified through the test (Berntsen and Viviano, 2011). This level of over-coverage implies that, if such a list is used for a sample survey, the enumerators waste much time trying to identify farmers, which then prove to be inactive. Moreover, distinguishing inactive farmers from total non-responses is difficult. Finally, the risk of producing biased estimates is high, unless an accurate estimate of the over-coverage is available. These considerations suggest adopting this approach only where the reliability of administrative data used for updating the census list is very high and the definitions adopted by administrative registers are compatible with the ones of the census. 4. Other kinds of master sampling frame Other approaches have been developed for creating master sampling frames. In several countries, the population census is conducted using an administrative structure in which cartographic or other mapping materials are used to divide the country into enumeration areas. The sampling frame is the list of enumeration areas. In agricultural sample censuses and surveys, a sample of enumeration areas is selected, the list of households in selected enumeration areas is created and a sample is extracted from each of these lists, following a two stages sample design. In 3
4 many countries, a sample agricultural census is conducted: some enumeration areas are randomly selected and screened for farms. The resulting sampling frame consists of the agricultural census enumeration areas. These approaches present coverage problems at least of the entity of the complete enumeration agricultural census described before. A proposal by FAO and UNFPA aims at avoiding to face the cost of the agricultural census: the list of farms or agricultural households is identified on the basis of specific agricultural questions included in the population census questionnaire. This approach is promising for countries where agriculture is not an important economic sector, like small islands. More work is needed for testing the quality of data collected using long questionnaires and the coverage of the list of farms generated from the population census; particularly, the entity of under and over coverage in different categories of countries should be assessed. Finally, the list frame of farms generated through the module on agriculture submitted to the households presents very few auxiliary variables; thus, the efficiency of the sample designs for annual sample surveys is very low, and this may have a strong impact on annual survey costs. For more details and an analysis of advantages, disadvantages and requirements see Keita and Gennari (2013) and Carfagna et at. (2013). In some countries, the list of the farms is based on administrative sources, such as business registrations or tax collections. A big disadvantage of the administrative sources is that they may not include the total population, especially units below a threshold required to be registered or pay taxes. In other words, while they will be inclusive of commercial farms, are not likely to include small-scale farms and subsistence farming units (see Carfagna and Carfagna, 2010). 5. Linking frames at the design stage and at the estimation stage When the coverage and the accuracy of the structural characteristics are not high, alternative approaches can be followed: creating a sampling frame integrating different lists (design level), combining estimates from different lists (estimator level), using an area frame, combining an area frame with one or more list frames. The first option foresees that different lists concerning the same population are used for creating the sampling frame. In such a case, one single frame is created on the basis of two or more lists. In order to get one list combining more than one, records have to be matched. This is not an easy task because farms can appear with different pieces of information in the different lists, and sometimes only partial or wrong information is available. A wide literature has been developed on record linkage, focusing on deterministic and probabilistic rules for matching; moreover, the capacity of storing and managing databases is increased impressively. However, the coverage of the sampling frame is strongly influenced by the quality of the combined lists. Lists with limited coverage or out of date information can create difficulties in the record linkage process, increase the over-coverage and give little contribution to reduce the undercoverage of the sampling frame. Unless the different lists contribute with essential information to complete the frame and the record matching gives extremely reliable results, the frame will be still incomplete and with many duplications (see Carfagna and Ferraz, 2015). Another option is treating the different lists separately and selecting samples from each list. All observations can be treated as though they had been sampled from a single frame, with modified weights for observations in the intersection of the lists (single-stage estimation). The basic idea is that a multiple frame sample can be viewed as a special case of selecting two or more samples independently from the same frame. As stated by Kalton and Anderson (1986), when a sample is drawn from two or more overlapping frames, the chance of an element being selected depends on the number of frames on which it appears. Compensation for the varying inclusion probabilities of different population elements may be made, by means of a weighting adjustment in the analysis, 4
5 such as assigning sample element weights made inversely proportional to their inclusion probabilities. Kalton and Anderson (1986) and Skinner (1991) proposed an unbiased estimator that does not require determining the common units of samples from the different frames. Mecatti (2007) and Mecatti and Singh (2014) also gave a contribution to the development of single-stage estimators proposing their multiplicity estimator. Like the other single-stage estimators developed previously, the Mecatti and Singh estimator has two crucial requirements: the multiplicity of each sample unit is known and the union of the collection of frames covers the target population. Mecatti and Singh (2014) assume that the information on the multiplicity can be given by the interviewed sample units. For agricultural statistics, this assumption implies that each of the selected farmers knows which frames include his farm. The assumption that the union of the collection of frames covers the target population is seldom realistic, even in developed countries. Indeed, if the aim is providing a rough estimate of main agricultural items, the bias introduced by a limited undercoverage tends to be not particularly high, since generally it concerns mainly small farms, whose contribution to the total of main items is limited. However, the bias can be higher and difficult to remove for minor and special agricultural items. Moreover, small farms are important if we want to have an overview of the trends in rural areas. Another way of taking advantage of various frames at the estimator level is adopting an estimator that combines estimates calculated on non-overlapping sample units belonging to the different frames with estimates calculated on overlapping sample units (two-stage estimation). Two-stage estimators do not require the knowledge of the multiplicity for selected units, but assume that the union of the collection of frames covers the target population. Some two-stage estimators need the identification of identical units only in the overlap samples and some others have been developed for cases in which these units cannot be identified (see Fuller and Burmeister 1972). Both single-stage and two-stage estimators do not require record matching of listing units of the different frames (a process that is notoriously error prone when large lists are used). Generally, complex designs are adopted in the different frames to improve the efficiency and this affects the estimators. Lohr and Rao (2006) proposed optimal estimators and pseudo maximum likelihood estimators when two or more frames are used. Ferraz and Coelho (2007) investigated the estimation of population totals incorporating available auxiliary information from one of the frames at the estimation stage, for the case of a stratified dual frame survey; for a review of multiple frame estimators see Carfagna (2001) and Carfagna and Carfagna (2010). 6. Combining lists and area frames, advantages, disadvantages and requirements Combining a list and an area frame is a special case of multiple frame sample surveys in which sample units belonging to the lists and not to the area frame do not exist. This approach is very convenient when the list contains units with large (thus probably more variable) values of some variables of interest and the survey cost of units in the list is much lower than in the area frame. Ground data collection through an area frames is the most reliable way for collecting crop data and some agri-environmental data linked to the land, like the ones included in the field data collection form 2015 of the European land use and cover area frame survey (LUCAS). These data allow computing the following indicators: land cover/land use/change, parcel size, cropping system/land management, irrigation, landscape elements, associated trees and shrubs, soil erosion/soil quality. Ground positioning systems (GPS), aerial images, aerial photos (also photointerpreted and stored on a PDA, Google Earth, Geographic information Systems (GIS) have considerably modified the data collection process and increased the quality of data. 5
6 If economic and rural characteristics and/or agri-environmental indicators related to the farm management are relevant for a country, the ground observation through an area frame is not sufficient and the farmers have to be selected and interviewed. Moreover, when the area frame is combined with one or more list frames, the presence on the lists of the farms selected through the area frame has to be assessed for most estimators. The main typologies of area frames are segments, with or without physical boundaries, and clustered and un-clustered points. When segments are adopted, the fields totally or partially included in the segments can be used for identifying the corresponding farms; then, from the estimation viewpoint, the traditional open, closed and weighted estimators can be taken into consideration. The number of farms indirectly selected through a segment depends on the number of parts of farms included in the segment; thus, it changes from segment to segment and only an expected number of farms can be prefixed by selecting the segment size. If clustered or un-clustered points are selected, the field corresponding to the point identifies the farm. The challenging part is collecting the data of the farm corresponding to the field. This task is difficult when the farmers live in villages far from the land. When un-clustered point sampling is adopted, the identification of the farmer is more cumbersome because the next farmer to be identified is far away. Close farmers are easier to identify, since one of them can give some information on the others. Sometimes, point sampling of farms in a segment is carried out, in order to select only a subset of the farms totally or partially included in the segment. This approach is appropriate where the optimum segment size for collecting area and yield information in the fields is larger than the optimum segment size for farmers interviews. This happens where the farm size is small. Point sampling in the segments also allows prefixing the number of farms selected in each segment, in case point sampling with replacement is adopted (the same farm can be selected by more than one point). This is a big advantage for the sample allocation to the frames. 7. Sample allocation Under a linear cost function, the optimum share of the total sample to be allocated to each frame can be determined, in order to optimize the precision of the total estimate. However, the optimum sample allocation depends on the variances of domains, which are generally unknown before the survey. An adaptive sequential approach could be adopted for determining the allocation during the survey. Consider that adaptive sequential sample designs are very efficient because the sample selection depends on previously selected units and the stopping rule is based on the estimate. Unfortunately, sequential sample designs are biased, for the same reasons. Thompson and Seber (1996, pages ) faced the problem of sample allocation without previous information on the variability inside strata suggesting a stratified random survey in two phases or, more generally, in k phases. In our case, the strata represent the strata in the different sampling frames. At the k-th phase, a complete stratified random sample is selected, with sample sizes depending on data from previous phases. Then the conventional stratified estimator, based on the data from the k- th phase, is unbiased for the population total Y. The key to design unbiasedness of such an estimator is that each of the estimators is design unbiased and that the weights are fixed in advance and do not depend on observations made during the survey, which implies that, at whatever k-th phase, each of the strata needs to be sampled. These elements guarantee unbiased but not very efficient estimates. Carfagna and Marzialetti (2009), proposed the adoption of an adaptive sequential sample selection with permanent random numbers, which allows optimizing the sample allocation to the different strata and the use of optimum weights for estimating the population total. This procedure foresees that one sample unit is selected at each step, the standard deviations of the 6
7 domains are computed and the next sample unit is assigned to the stratum where the sample size is farthest below the size assigned by Neyman s allocation. In the case of the sample allocation to two or more sampling frames, a less cumbersome k-step procedure with permanent random numbers, where k is equal to a small (2 or 3) number of steps is more appropriate. A permanent random number is assigned to all sampling unit in each domain (each stratum of each sampling frame). Then, a first random sample of sampling units is selected. The main aim of this first sample is generating a first estimate of the standard deviations in the domains, which are used for determining the optimum allocation of the second step sample and the optimum weights for combining the estimates from the various lists, then the process can be repeated. 8. Concluding remarks The quality of the data collected by a complete enumeration census of agriculture should be checked before using the list of farms generated by the census as a master sampling frame, since the under-coverage is about 20% in developed countries. The impressive progress in managing big amount of data and the use of georeferenced data have considerably improved the quality of the updated list; however, this kind of update does not eliminate the under-coverage and can increase the over-coverage, creating several data collection problems. Creating a muster sampling frame integrating different kinds of lists, taking advantage of the improvements in record linkage can be an alternative. However, unless the different lists contribute with essential information to complete the frame and the record matching gives extremely reliable results, the frame will be still incomplete and with many duplications. Another option is treating the different lists separately and selecting samples from each list, using a single-stage or a two-stage estimator. The single-stage estimators have crucial requirements which are seldom satisfied, whiles two-stage estimators facilitate the use of different and complex sample designs in the different lists, increasing the efficiency of the estimators. Ground data collection through an area frame is the most reliable way for collecting crop data and some agri-environmental data linked to the land; however, if all economic characteristics and/or agri-environmental indicators related to the farm management have to be estimated, the ground observation through an area frame is not sufficient and the farmers have to be selected and interviewed. When un-clustered point sampling is adopted, the identification of the farmer is cumbersome because the next farmer to be identified is far away. The optimum sample allocation to different strata of the sampling frames depends on their variances, which are generally unknown before the survey. An adaptive sequential approach for determining the allocation during the survey increases the efficiency of the estimates. REFERENCES Benedetti R., Bee M., Espa R., Piersimoni F., eds. (2010) Agricultural Survey Methods. Chichester, UK, Wiley. 434 pp. Berntsen E., Viviano C. (2011) La progettazione dei censimenti generali : la rilevazione di controllo della copertura e qualità del prototipo di registro statistico delle aziende agricole (Clag) e la riconciliazione con la Base integrata delle fonti amministrative (Bifa), Istat working papers, n
8 Carfagna E. (2001), Multiple Frame Sample Surveys: Advantages, Disadvantages and Requirements, in International Statistical Institute, Proceedings, Invited papers, International Association of Survey Statisticians (IASS) Topics, Seoul August22-29, 2001, pp Carfagna, E. and Carfagna, A. (2010) Alternative sampling frames and administrative data; which is the best data source for agricultural statistics? in R. Benedetti, M. Bee, R. Espa & F. Piersimoni (eds.) Agricultural Survey Methods, Chichester, UK, Wiley. 434 pp Carfagna E. and Ferraz C. (2015) Updating sampling frames for agricultural statistics: approaches, challenges and issues, the 60th World Statistical Congress, Proceedings, Specialized Topic Session, Rio De Janeiro, July 2015, International Statistical Institute. Carfagna, E., Pratesi M., Carfagna, A. (2013) Methodological developments for improving the reliability and cost-effectiveness of agricultural statistics in developing countries, the 59th World Statistical Congress, Proceedings, Special Topic Session, Hong Kong, August 2013 Carfagna E. and Marzialetti J. (2009) Sequential Design in Quality Control and Validation of Land Cover Data Bases, Journal of Applied Stochastic Models in Business and Industry, Volume 25, Issue 2, 2009, pp , DOI: /asmb.742, John Wiley & Sons, Ltd. Ferraz C., Coelho H.F.C. (2007), Ratio Type Estimators for Stratified Dual Frame Surveys, in Proceedings of the 56 session of the ISI, 2007, Lisbon. Fuller, W.A., & Burmeister, L.F. (1972). Estimators of samples selected from two overlapping frames, Proceedings of the Social Statistics Sections, American Statistical Association, Kalton G. and Anderson D. W. (1986), Sampling rare populations, Journal of the Royal Statistical Society, Ser. A, 149, pp Keita N., Gennari P. (2013) Building a Master Sampling Frame by Linking the Population and Housing Census with the Agricultural Census, the 59th World Statistical Congress, Proceedings, Special Topic Session, Hong Kong, August Lavallée P. (2007), Indirect Sampling, Springer, New York. Lavallée, P. and Rivest L.P. (2012), Capture-Recapture Sampling and Indirect Sampling, Journal of Official Statistics, 28, n.1, pp Lohr, S., and Rao, J.N.K. (2006), Multiple frame surveys: Point estimation and inference, Journal of American Statistical Association, 101, Mazziotta M. (ed.) (2013) La valutazione della qualità. Atti del 6 Censimento Generale dell Agricoltura, Istituto nazionale di statistica, Roma, Italy. Mecatti, F. (2007) A Single Frame Multiplicity Estimator for Multiple Frame Surveys, Survey Methodology, volume 33, pages Mecatti, F. and Singh, A.C. (2014) Estimation in Multiple Frame Surveys: A Simplified and Unified Review using Multiplicity Approach, Journal de la Societé Francaise de Statistique, 4, volume 155. Skinner C. J. (1991) On the Efficiency of Raking Ratio Estimation for Multiple Frame Surveys, Journal of the American Statistical Association, vol. 86, No. 415, Theory and Methods, pp Thompson S.K., Seber G.A.F. (1996) Adaptive Sampling, Wiley, New York. 8
Register-based National Accounts
Register-based National Accounts Anders Wallgren, Britt Wallgren Statistics Sweden and Örebro University, e-mail: ba.statistik@telia.com Abstract Register-based censuses have been discussed for many years
More informationINTEGRATED COVERAGE MEASUREMENT SAMPLE DESIGN FOR CENSUS 2000 DRESS REHEARSAL
INTEGRATED COVERAGE MEASUREMENT SAMPLE DESIGN FOR CENSUS 2000 DRESS REHEARSAL David McGrath, Robert Sands, U.S. Bureau of the Census David McGrath, Room 2121, Bldg 2, Bureau of the Census, Washington,
More information6 Sampling. 6.2 Target Population and Sample Frame. See ECB (2011, p. 7). Monetary Policy & the Economy Q3/12 addendum 61
6 Sampling 6.1 Introduction The sampling design of the HFCS in Austria was specifically developed by the OeNB in collaboration with the Institut für empirische Sozialforschung GmbH IFES. Sampling means
More informationUse of administrative sources and registers in the Finnish EU-SILC survey
Use of administrative sources and registers in the Finnish EU-SILC survey Workshop on best practices for EU-SILC revision Marie Reijo, Senior Researcher Content Preconditions for good registers utilisation
More informationChapter 3 Monday, May 17th
Chapter 3 Monday, May 17 th Surveys The reason we are doing surveys is because we are curious of what other people believe, or what customs other people p have etc But when we collect the data what are
More informationPUBLIC EXPENDITURE TRACKING SURVEYS. Sampling. Dr Khangelani Zuma, PhD
PUBLIC EXPENDITURE TRACKING SURVEYS Sampling Dr Khangelani Zuma, PhD Human Sciences Research Council Pretoria, South Africa http://www.hsrc.ac.za kzuma@hsrc.ac.za 22 May - 26 May 2006 Chapter 1 Surveys
More informationCrop area estimates in the EU. The use of area frame surveys and remote sensing
INRA Rabat, October 14,. 2011 1 Crop area estimates in the EU. The use of area frame surveys and remote sensing Javier.gallego@jrc.ec.europa.eu Main approaches to agricultural statistics INRA Rabat, October
More informationProceedings of the Annual Meeting of the American Statistical Association, August 5-9, 2001
Proceedings of the Annual Meeting of the American Statistical Association, August 5-9, 2001 COVERAGE MEASUREMENT RESULTS FROM THE CENSUS 2000 ACCURACY AND COVERAGE EVALUATION SURVEY Dawn E. Haines and
More informationIntroduction INTRODUCTION TO SURVEY SAMPLING. Why sample instead of taking a census? General information. Probability vs. non-probability.
Introduction Census: Gathering information about every individual in a population Sample: Selection of a small subset of a population INTRODUCTION TO SURVEY SAMPLING October 28, 2015 Karen Foote Retzer
More informationSaint Lucia Country Presentation
Saint Lucia Country Presentation Workshop on Integrating Population and Housing with Agricultural Censuses 10 th 12 th June, 2013 Edwin St Catherine Director of Statistics Household and Population Census
More informationStat472/572 Sampling: Theory and Practice Instructor: Yan Lu Albuquerque, UNM
Stat472/572 Sampling: Theory and Practice Instructor: Yan Lu Albuquerque, UNM 1 Chapter 1: Introduction Three Elements of Statistical Study: Collecting Data: observational data, experimental data, survey
More information6 Sampling. 6.2 Target population and sampling frame. See ECB (2013a), p. 80f. MONETARY POLICY & THE ECONOMY Q2/16 ADDENDUM 65
6 Sampling 6.1 Introduction The sampling design for the second wave of the HFCS in Austria was specifically developed by the OeNB in collaboration with the survey company IFES (Institut für empirische
More informationBotswana - Botswana AIDS Impact Survey III 2008
Statistics Botswana Data Catalogue Botswana - Botswana AIDS Impact Survey III 2008 Statistics Botswana - Ministry of Finance and Development Planning, National AIDS Coordinating Agency (NACA) Report generated
More informationPacific Training on Sampling Methods for Producing Core Data Items for Agricultural and Rural Statistics
Pacific Training on Sampling Methods for Producing Core Data Items for Agricultural and Rural Statistics 13-17 August, Suva, Fiji Module 2: Review of Basics of Sampling Methods Session 2.1: Terminology,
More informationSampling Designs and Sampling Procedures
Business Research Methods 9e Zikmund Babin Carr Griffin 16 Sampling Designs and Sampling Procedures Chapter 16 Sampling Designs and Sampling Procedures 2013 Cengage Learning. All Rights Reserved. May not
More informationEconomic and Social Council
United Nations Economic and Social Council Distr.: General 21 March 2012 ECE/CES/2012/22 Original: English Economic Commission for Europe Conference of European Statisticians Sixtieth plenary session Paris,
More informationThe Savvy Survey #3: Successful Sampling 1
AEC393 1 Jessica L. O Leary and Glenn D. Israel 2 As part of the Savvy Survey series, this publication provides Extension faculty with an overview of topics to consider when thinking about who should be
More informationCensus: Gathering information about every individual in a population. Sample: Selection of a small subset of a population.
INTRODUCTION TO SURVEY SAMPLING October 18, 2012 Linda Owens University of Illinois at Chicago www.srl.uic.edu Census or sample? Census: Gathering information about every individual in a population Sample:
More informationChapter 12 Summary Sample Surveys
Chapter 12 Summary Sample Surveys What have we learned? A representative sample can offer us important insights about populations. o It s the size of the same, not its fraction of the larger population,
More informationOctober 6, Linda Owens. Survey Research Laboratory University of Illinois at Chicago 1 of 22
INTRODUCTION TO SURVEY SAMPLING October 6, 2010 Linda Owens University of Illinois at Chicago www.srl.uic.edu 1 of 22 Census or sample? Census: Gathering information about every individual in a population
More informationSection 2: Preparing the Sample Overview
Overview Introduction This section covers the principles, methods, and tasks needed to prepare, design, and select the sample for your STEPS survey. Intended audience This section is primarily designed
More informationRemoving Duplication from the 2002 Census of Agriculture
Removing Duplication from the 2002 Census of Agriculture Kara Daniel, Tom Pordugal United States Department of Agriculture, National Agricultural Statistics Service 1400 Independence Ave, SW, Washington,
More informationChapter 4: Sampling Design 1
1 An introduction to sampling terminology for survey managers The following paragraphs provide brief explanations of technical terms used in sampling that a survey manager should be aware of. They can
More informationPolls, such as this last example are known as sample surveys.
Chapter 12 Notes (Sample Surveys) In everything we have done thusfar, the data were given, and the subsequent analysis was exploratory in nature. This type of statistical analysis is known as exploratory
More informationStats: Modeling the World. Chapter 11: Sample Surveys
Stats: Modeling the World Chapter 11: Sample Surveys Sampling Methods: Sample Surveys Sample Surveys: A study that asks questions of a small group of people in the hope of learning something about the
More informationExperiences with the Use of Addressed Based Sampling in In-Person National Household Surveys
Experiences with the Use of Addressed Based Sampling in In-Person National Household Surveys Jennifer Kali, Richard Sigman, Weijia Ren, Michael Jones Westat, 1600 Research Blvd, Rockville, MD 20850 Abstract
More information2020 Population and Housing Census Planning Perspective and challenges for data collection
2020 Population and Housing Census Planning Perspective and challenges for data collection Mexico Contents Background of Censuses in Mexico Planning the 2020 Census Georeferencing Statistical Information
More informationOther Effective Sampling Methods
Other Effective Sampling Methods MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2018 Stratified Sampling Definition A stratified sample is obtained by separating the
More informationSierra Leone - Multiple Indicator Cluster Survey 2017
Microdata Library Sierra Leone - Multiple Indicator Cluster Survey 2017 Statistics Sierra Leone, United Nations Children s Fund Report generated on: September 27, 2018 Visit our data catalog at: http://microdata.worldbank.org
More informationAn Introduction to ACS Statistical Methods and Lessons Learned
An Introduction to ACS Statistical Methods and Lessons Learned Alfredo Navarro US Census Bureau Measuring People in Place Boulder, Colorado October 5, 2012 Outline Motivation Early Decisions Statistical
More informationSample Surveys. Chapter 11
Sample Surveys Chapter 11 Objectives Population Sample Sample survey Bias Randomization Sample size Census Parameter Statistic Simple random sample Sampling frame Stratified random sample Cluster sample
More informationSupplementary questionnaire on the 2011 Population and Housing Census SWITZERLAND
Supplementary questionnaire on the 2011 Population and Housing Census SWITZERLAND Supplementary questionnaire on the 2011 Population and Housing Census Fields marked with are mandatory. INTRODUCTION As
More informationChapter 12: Sampling
Chapter 12: Sampling In all of the discussions so far, the data were given. Little mention was made of how the data were collected. This and the next chapter discuss data collection techniques. These methods
More informationMODERN CENSUS IN POLAND
United Nations International Seminar on Population and Housing Censuses: Beyond the 2010 Round 27-29 November 2012 Seoul, Republic of Korea SESSION 7: Use of modern technologies for censuses MODERN CENSUS
More informationSampling distributions and the Central Limit Theorem
Sampling distributions and the Central Limit Theorem Johan A. Elkink University College Dublin 14 October 2013 Johan A. Elkink (UCD) Central Limit Theorem 14 October 2013 1 / 29 Outline 1 Sampling 2 Statistical
More informationSampling Terminology. all possible entities (known or unknown) of a group being studied. MKT 450. MARKETING TOOLS Buyer Behavior and Market Analysis
Sampling Terminology MARKETING TOOLS Buyer Behavior and Market Analysis Population all possible entities (known or unknown) of a group being studied. Sampling Procedures Census study containing data from
More informationComparative Study of Electoral Systems (CSES) Module 4: Design Report (Sample Design and Data Collection Report) September 10, 2012
Comparative Study of Electoral Systems 1 Comparative Study of Electoral Systems (CSES) (Sample Design and Data Collection Report) September 10, 2012 Country: Poland Date of Election: 09.10.2011 Prepared
More informationCHAPTER 4 Designing Studies
CHAPTER 4 Designing Studies 4.1 Samples and Surveys The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Samples and Surveys Learning Objectives After this
More informationSAMPLING. A collection of items from a population which are taken to be representative of the population.
SAMPLING Sample A collection of items from a population which are taken to be representative of the population. Population Is the entire collection of items which we are interested and wish to make estimates
More informationMAT 1272 STATISTICS LESSON STATISTICS AND TYPES OF STATISTICS
MAT 1272 STATISTICS LESSON 1 1.1 STATISTICS AND TYPES OF STATISTICS WHAT IS STATISTICS? STATISTICS STATISTICS IS THE SCIENCE OF COLLECTING, ANALYZING, PRESENTING, AND INTERPRETING DATA, AS WELL AS OF MAKING
More informationEstimation Methodology and General Results for the Census 2000 A.C.E. Revision II Richard Griffin U.S. Census Bureau, Washington, DC 20233
Estimation Methodology and General Results for the Census 2000 A.C.E. Revision II Richard Griffin U.S. Census Bureau, Washington, DC 20233 1. Introduction 1 The Accuracy and Coverage Evaluation (A.C.E.)
More informationIntroduction INTRODUCTION TO SURVEY SAMPLING. General information. Why sample instead of taking a census? Probability vs. non-probability.
Introduction Census: Gathering information about every individual in a population Sample: Selection of a small subset of a population Census INTRODUCTION TO SURVEY SAMPLING Sample February 14, 2018 Linda
More informationAPCAS/10/21 April 2010 ASIA AND PACIFIC COMMISSION ON AGRICULTURAL STATISTICS TWENTY-THIRD SESSION. Siem Reap, Cambodia, April 2010
APCAS/10/21 April 2010 Agenda Item 8 ASIA AND PACIFIC COMMISSION ON AGRICULTURAL STATISTICS TWENTY-THIRD SESSION Siem Reap, Cambodia, 26-30 April 2010 The Use of Remote Sensing for Area Estimation by Robert
More informationVariance Estimation in US Census Data from Kathryn M. Coursolle. Lara L. Cleveland. Steven Ruggles. Minnesota Population Center
Variance Estimation in US Census Data from 1960-2010 Kathryn M. Coursolle Lara L. Cleveland Steven Ruggles Minnesota Population Center University of Minnesota-Twin Cities September, 2012 This paper was
More informationChapter 1 Introduction
Chapter 1 Introduction Statistics is the science of data. Data are the numerical values containing some information. Statistical tools can be used on a data set to draw statistical inferences. These statistical
More information2010 Census Coverage Measurement - Initial Results of Net Error Empirical Research using Logistic Regression
2010 Census Coverage Measurement - Initial Results of Net Error Empirical Research using Logistic Regression Richard Griffin, Thomas Mule, Douglas Olson 1 U.S. Census Bureau 1. Introduction This paper
More informationUsing Administrative Records for Imputation in the Decennial Census 1
Using Administrative Records for Imputation in the Decennial Census 1 James Farber, Deborah Wagner, and Dean Resnick U.S. Census Bureau James Farber, U.S. Census Bureau, Washington, DC 20233-9200 Keywords:
More informationSampling Techniques. 70% of all women married 5 or more years have sex outside of their marriages.
Sampling Techniques Introduction In Women and Love: A Cultural Revolution in Progress (1987) Shere Hite obtained several impacting results: 84% of women are not satisfied emotionally with their relationships.
More informationAP Statistics S A M P L I N G C H A P 11
AP Statistics 1 S A M P L I N G C H A P 11 The idea that the examination of a relatively small number of randomly selected individuals can furnish dependable information about the characteristics of a
More informationTurkmenistan - Multiple Indicator Cluster Survey
Microdata Library Turkmenistan - Multiple Indicator Cluster Survey 2015-2016 United Nations Children s Fund, State Committee of Statistics of Turkmenistan Report generated on: February 22, 2017 Visit our
More informationGuyana - Multiple Indicator Cluster Survey 2014
Microdata Library Guyana - Multiple Indicator Cluster Survey 2014 United Nations Children s Fund, Guyana Bureau of Statistics, Guyana Ministry of Public Health Report generated on: December 1, 2016 Visit
More informationThe challenges of sampling in Africa
The challenges of sampling in Africa Prepared by: Dr AC Richards Ask Afrika (Pty) Ltd Head Office: +27 12 428 7400 Tele Fax: +27 12 346 5366 Mobile Phone: +27 83 293 4146 Web Portal: www.askafrika.co.za
More informationObjectives. Module 6: Sampling
Module 6: Sampling 2007. The World Bank Group. All rights reserved. Objectives This session will address - why we use sampling - how sampling can create efficiencies for data collection - sampling techniques,
More informationThailand - The Population and Housing Census of Thailand IPUMS Subset
Microdata Library Thailand - The Population and Housing Census of Thailand 2000 - IPUMS Subset National Statistical Office, Minnesota Population Center - University of Minnesota Report generated on: April
More information1) Analysis of spatial differences in patterns of cohabitation from IECM census samples - French and Spanish regions
1 The heterogeneity of family forms in France and Spain using censuses Béatrice Valdes IEDUB (University of Bordeaux) The deep demographic changes experienced by Europe in recent decades have resulted
More informationKey Words: age-order, last birthday, full roster, full enumeration, rostering, online survey, within-household selection. 1.
Comparing Alternative Methods for the Random Selection of a Respondent within a Household for Online Surveys Geneviève Vézina and Pierre Caron Statistics Canada, 100 Tunney s Pasture Driveway, Ottawa,
More informationREPUBLIC OF TOGO. Census of Agriculture 2012 of Togo : Overview and experience in collecting gender data. ABOU Hibana
REPUBLIC OF TOGO 1 Expert Consultation on Collecting Sex Disaggregated Data on Land Ownership and Management in Agricultural Censuses ------------------------ Kampala, Uganda, 13 to 15 May 2014 Census
More information3. Data and sampling. Plan for today
3. Data and sampling Business Statistics Plan for today Reminders and introduction Data: qualitative and quantitative Quantitative data: discrete and continuous Qualitative data discussion Samples and
More information2007 Census of Agriculture Non-Response Methodology
2007 Census of Agriculture Non-Response Methodology Will Cecere National Agricultural Statistics Service Research and Development Division, U.S. Department of Agriculture, 3251 Old Lee Highway, Fairfax,
More informationThe progress in the use of registers and administrative records. Submitted by the Department of Statistics of the Republic of Lithuania
Working Paper No. 24 ENGLISH ONLY STATISTICAL COMMISSION and ECONOMIC COMMISSION FOR EUROPE STATISTICAL OFFICE OF THE EUROPEAN COMMUNITIES (EUROSTAT) CONFERENCE OF EUROPEAN STATISTICIANS Joint ECE/Eurostat
More informationb. Stopping students on their way out of the cafeteria is a good way to sample if we want to know about the quality of the food there.
Chapter 12 Sample Surveys Look at Just Checking on page 273. Various claims are made for surveys. Why is each of the following claims not correct? a. It is always better to take a census than to draw a
More informationSURVEY ON USE OF INFORMATION AND COMMUNICATION TECHNOLOGY (ICT)
1. Contact SURVEY ON USE OF INFORMATION AND COMMUNICATION TECHNOLOGY (ICT) 1.1. Contact organization: Kosovo Agency of Statistics KAS 1.2. Contact organization unit: Social Department Living Standard Sector
More informationCrop Area Estimation with Remote Sensing
Boogta 25-28 November 2008 1 Crop Area Estimation with Remote Sensing Some considerations and experiences for the application to general agricultural statistics Javier.gallego@jrc.it Some history: MARS
More informationSurvey of Massachusetts Congressional District #4 Methodology Report
Survey of Massachusetts Congressional District #4 Methodology Report Prepared by Robyn Rapoport and David Dutwin Social Science Research Solutions 53 West Baltimore Pike Media, PA, 19063 Contents Overview...
More informationStrategies for the 2010 Population Census of Japan
The 12th East Asian Statistical Conference (13-15 November) Topic: Population Census and Household Surveys Strategies for the 2010 Population Census of Japan Masato CHINO Director Population Census Division
More informationClass 10: Sampling and Surveys (Text: Section 3.2)
Class 10: Sampling and Surveys (Text: Section 3.2) Populations and Samples If we talk to everyone in a population, we have taken a census. But this is often impractical, so we take a sample instead. We
More informationEXPERT GROUP MEETING ON CONTEMPORARY PRACTICES IN CENSUS MAPPING AND USE OF GEOGRAPHICAL INFORMATION SYSTEMS New York, 29 May - 1 June 2007
EXPERT GROUP MEETING ON CONTEMPORARY PRACTICES IN CENSUS MAPPING AND USE OF GEOGRAPHICAL INFORMATION SYSTEMS New York, 29 May - 1 June 2007 STATEMENT OF DR. PAUL CHEUNG DIRECTOR OF THE UNITED NATIONS STATISTICS
More informationWelcome to the Presentation on. Bangladesh Agriculture Census 2008
Welcome to the Presentation on Bangladesh Census 2008 Previous Censuses in BD Census 1960 Census 1977 Census, 1983-1984 Census 1996 Background of Census Name of Census Census- 1960 Census-1977 Census-1983-84
More informationSampling Subpopulations in Multi-Stage Surveys
Sampling Subpopulations in Multi-Stage Surveys Robert Clark, Angela Forbes, Robert Templeton This research was funded by the Statistics NZ Official Statistics Research Fund 2007/2008, and builds on the
More informationLao PDR - Multiple Indicator Cluster Survey 2006
Microdata Library Lao PDR - Multiple Indicator Cluster Survey 2006 Department of Statistics - Ministry of Planning and Investment, Hygiene and Prevention Department - Ministry of Health, United Nations
More informationTechnical Report Series GO Literature Review on Reconciling Data from Agricultural Censuses and Surveys
Technical Report Series GO-14-2016 Literature Review on Reconciling Data from Agricultural Censuses and Surveys July 2016 Literature Review on Reconciling Data from Agricultural Censuses and Surveys Table
More informationStatistical and operational complexities of the studies I Sample design: Use of sampling and replicated weights
Statistical and operational complexities of the studies I Sample design: Use of sampling and replicated weights Andrés Sandoval-Hernández IEA DPC Workshop on using PISA, PIAAC, TIMSS & PIRLS, TALIS datasets
More informationElements of the Sampling Problem!
Elements of the Sampling Problem! Professor Ron Fricker! Naval Postgraduate School! Monterey, California! Reading Assignment:! 2/1/13 Scheaffer, Mendenhall, Ott, & Gerow,! Chapter 2.1-2.3! 1 Goals for
More informationSection 6.4. Sampling Distributions and Estimators
Section 6.4 Sampling Distributions and Estimators IDEA Ch 5 and part of Ch 6 worked with population. Now we are going to work with statistics. Sample Statistics to estimate population parameters. To make
More informationRURAL, AGRICULTURAL & FISHERY CENSUS IN VIETNAM
APCAS/16/6.2.5 ASIA AND PACIFIC COMMISSION ON AGRICULTURAL STATISTICS TWENTY-SIXTH SESSION Thimphu, Bhutan, 15-19 February 2016 Agenda Item 6.2 RURAL, AGRICULTURAL & FISHERY CENSUS IN VIETNAM Contributed
More informationEconomic and Social Council
United Nations Economic and Social Council ECE/CES/GE.41/2013/3 Distr.: General 15 August 2013 Original: English Economic Commission for Europe Conference of European Statisticians Group of Experts on
More informationReview of the WCA 2010 implementation experiences
Technical Review Meeting on the World Programme for the Census of Agriculture (WCA) 2020 Rome, Italy 7-8 November 2014 Review of the WCA 2010 implementation experiences Miguel Galmes Consulant, Agricultural
More information2010 Census Mapping Evolution, Potentialities and Integration to the National Spatial Data Infrastructure
2010 Census Mapping Evolution, Potentialities and Integration to the National Spatial Data Infrastructure Miriam Barbuda, MsC LATIN AMERICA GEOSPATIAL FORUM Brazil, Rio de Janeiro, 15-17August 2012 BRAZIL
More informationWorking Paper n. 79, January 2009
Methodology of European labour force surveys: (2) Sample design and implementation Francesca Gagliardi, Vijay Verma, Giulia Ciampalini Working Paper n. 79, January 2009 Methodology of European labour
More informationEconomic and Social Council
United Nations Economic and Social Council Distr.: General 30 April 2012 ECE/CES/2012/32 English only Economic Commission for Europe Conference of European Statisticians Sixtieth plenary session Paris,
More information2011 UK Census Coverage Assessment and Adjustment Methodology
2011 UK Census Coverage Assessment and Adjustment Methodology Owen Abbott Introduction The census provides a once-in-a decade opportunity to get an accurate, comprehensive and consistent picture of the
More informationUnited Nations Statistics Division Programme in Support of the 2020 Round of Population and Housing Censuses
United Nations Statistics Division Programme in Support of the 2020 Round of Population and Housing Censuses Session 2 Srdjan Mrkić United Nations Statistics Division Resolution UN Statistical Commission
More informationUNITED NATIONS - NATIONS UNIES ECONOMIC AND SOCIAL COMMISSION FOR ASIA AND THE PACIFIC STATISTICAL INSTITUTE FOR ASIA AND THE PACIFIC (SIAP)
UNITED NATIONS - NATIONS UNIES ECONOMIC AND SOCIAL COMMISSION FOR ASIA AND THE PACIFIC STATISTICAL INSTITUTE FOR ASIA AND THE PACIFIC (SIAP) Regional Training Course on Computer Assisted Personal Interviewing
More information7.1 Sampling Distribution of X
7.1 Sampling Distribution of X Definition 1 The population distribution is the probability distribution of the population data. Example 1 Suppose there are only five students in an advanced statistics
More information5 TH MANAGEMENT SEMINARS FOR HEADS OF NATIONAL STATISTICAL OFFICES (NSO) IN ASIA AND THE PACIFIC SEPTEMBER 2006, DAEJEON, REPUBLIC OF KOREA
Malaysia 5 TH MANAGEMENT SEMINARS FOR HEADS OF NATIONAL STATISTICAL OFFICES (NSO) IN ASIA AND THE PACIFIC. 18 20 SEPTEMBER 2006, DAEJEON, REPUBLIC OF KOREA 1. Overview of the Population and Housing Census
More informationCanada Agricultural Census 2011 Explanatory notes
Canada Agricultural Census 2011 Explanatory notes 1. Historical outline The British North America Act of 1867 included the requirement for a census to be taken every 10 years starting in 1871. However,
More informationEconomic and Social Council
UNITED NATIONS E Economic and Social Council Distr. GENERAL 5 May 2008 Original: ENGLISH ECONOMIC COMMISSION FOR EUROPE CONFERENCE OF EUROPEAN STATISTICIANS Joint UNECE/Eurostat Meeting on Population and
More informationESSnet on DATA INTEGRATION
ESSnet on DATA INTEGRATION WP5. On-the-job training applications LIST OF CONTENTS On-the-job training courses 2 1. Introduction 2. Ranking the application on record linkage 2 Appendix A - Applications
More informationpopulation and housing censuses in Viet Nam: experiences of 1999 census and main ideas for the next census Paper prepared for the 22 nd
population and housing censuses in Viet Nam: experiences of 1999 census and main ideas for the next census Paper prepared for the 22 nd Population Census Conference Seattle, Washington, USA, 7 9 March
More informationSample size, sample weights in household surveys
Sample size, sample weights in household surveys Outline Background Total quality in surveys Sampling Controversy Sample size, stratification and clustering effects An overview of the quality dimensions
More informationGhana - Ghana Living Standards Survey
Microdata Library Ghana - Ghana Living Standards Survey 5+ 2008 Institute of Statistical, Social and Economic Research - University of Ghana Report generated on: June 11, 2015 Visit our data catalog at:
More informationSample Surveys. Sample Surveys. Al Nosedal. University of Toronto. Summer 2017
Al Nosedal. University of Toronto. Summer 2017 My momma always said: Life was like a box of chocolates. You never know what you re gonna get. Forrest Gump. Population, Sample, Sampling Design The population
More informationSession 12. Quality assessment and assurance in the civil registration and vital statistics system
Session 12. Quality assessment and assurance in the civil registration and vital statistics system Basic framework Adequately funded evaluation activities are essential For improving systems that have
More informationChapter 4: Designing Studies
Chapter 4: Designing Studies Section 4.1 Samples and Surveys The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Chapter 4 Designing Studies 4.1 Samples and Surveys 4.2 Experiments 4.3
More informationStatistical Aspects of a Census
Statistical Aspects of a Census Carol C. House This paper focuses on the statistical aspects of a census. It addresses issues such as the coverage, classification, sampling, non-sampling error, post collection
More informationFull file at
Chapter 2 Data Collection 2.1 Observation single data point. Variable characteristic about an individual. 2.2 Answers will vary. 2.3 a. categorical b. categorical c. discrete numerical d. continuous numerical
More informationNigeria - Multiple Indicator Cluster Survey
Microdata Library Nigeria - Multiple Indicator Cluster Survey 2016-2017 National Bureau of Statistics of Nigeria, United Nations Children s Fund Report generated on: May 1, 2018 Visit our data catalog
More informationMeasuring ICT use by businesses in Brazil: The Project of the Brazilian Institute of Geography and Statistic (IBGE)
Measuring ICT use by businesses in Brazil: The Project of the Brazilian Institute of Geography and Statistic (IBGE) International Seminar on Information and Communication Technology Statistics Roberto
More informationChapter 8. Producing Data: Sampling. BPS - 5th Ed. Chapter 8 1
Chapter 8 Producing Data: Sampling BPS - 5th Ed. Chapter 8 1 Population and Sample Researchers often want to answer questions about some large group of individuals (this group is called the population)
More informationArmenian Experience on Agricultural Census
Armenian Experience on Agricultural Census Name(s) of the Author(s): Mr. Stepan Mnatsakanyan, Ms. Anahit Safyan* Address (es) of the Author(s): Ms. Anahit Safyan Institution, Department: National Statistical
More information