AP Statistics: Sampling (Chapter 11)


"The idea that the examination of a relatively small number of randomly selected individuals can furnish dependable information about the characteristics of a vast unseen universe is an idea so powerful that only familiarity makes it cease to be exciting."
Helen Mary Walker (1891-1983)

What is Sampling?
We want to know something about a population (a mean or a proportion, for example), but it isn't possible to go out and obtain the information from every member of the population (doing so is called a census). So we take a random sample from the population, obtain information about the sample, and then use the sample information to estimate what we would get if we could reach the entire population.

What is Sampling?
The information we want to know about the population is called a parameter. The information we get from the sample is called a statistic. We use the statistic to provide an estimate of the parameter.

Notation
Name                     Statistic   Parameter
Mean                     ȳ           μ (mu)
Standard deviation       s           σ (sigma)
Correlation              r           ρ (rho)
Regression coefficient   b           β (beta)
Proportion               p̂           π (pi, but we will use p)

Surveys
A survey is a common way to get information from a population of people. Follow your favorite news source and nearly every day you will hear the phrase "In a recent survey ..." A survey generally consists of questions asked of a selected group (the sample) in order to obtain an estimate of the opinions of the population (U.S. voters, for example).

Sampling Variability
Different samples will produce different results. Statisticians refer to these differences as sampling variability (or sampling error). We will learn second semester how to take just a single sample and estimate what the sampling variability would be if we could take more than one sample.
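A small simulation can make sampling variability concrete. The sketch below is only an illustration: the population (10,000 voters, 60% in favor), the sample size of 100, and the helper name sample_proportions are all assumptions chosen for the example, not part of the course material.

```python
import random
import statistics

def sample_proportions(population, n, num_samples, rng):
    """Draw num_samples simple random samples of size n and
    return the proportion of 1s found in each sample."""
    return [sum(rng.sample(population, n)) / n for _ in range(num_samples)]

rng = random.Random(11)  # fixed seed so the run is repeatable

# Hypothetical population: 10,000 voters, 60% of whom favor a measure (coded 1).
population = [1] * 6000 + [0] * 4000

props = sample_proportions(population, n=100, num_samples=500, rng=rng)

print("smallest sample proportion:", min(props))
print("largest sample proportion: ", max(props))
print("mean of sample proportions:", round(statistics.mean(props), 3))
print("std. dev. of proportions:  ", round(statistics.stdev(props), 3))
```

Every sample comes from the same population, yet the 500 sample proportions differ from one another. Their spread around the true value of 0.60 is the sampling variability described above.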

Bias
In an ideal world, the sample statistic will be very close to the actual population parameter we are trying to estimate. Unfortunately, the Real World is not ideal ...

Bias
In 1936, incumbent President Franklin Roosevelt was running for re-election. A magazine, The Literary Digest, attempted to predict the outcome of the election by polling 10 million potential voters. They received answers from 2.4 million of those polled and predicted that the challenger, Alf Landon, would win in a landslide (you studied President Alf Landon in your U.S. History class, didn't you?).

The questionnaires sent out by the Literary Digest went to people listed in telephone directories, motor vehicle registries, and country club memberships. Remember, the year is 1936. What problems do you see in the potential sample?

Bias
The sample obtained by the Literary Digest did not represent the population very well. Remember that in 1936, during the Great Depression, telephones [1], automobiles, and country club memberships were luxuries that few could afford. This is a form of selection bias called undercoverage: the sample was selected from a subset of the population that did not allow for a representative sample.

Meanwhile, George Gallup [2] queried a mere 50,000 U.S. voters, obtained in a random manner from across the entire spectrum of voters, and correctly predicted that Roosevelt would win.

[1] It wasn't until 1986 that enough homes had telephones to make this a viable method of surveying! And now, cell phones are making this method less effective.
[2] http://www.gallup.com/home.aspx

Random Selection
Our best defense against selection bias is to obtain a random sample selected from a sampling frame (a list of the entire population). In addition, random sampling will make possible the more formal methods of analysis we will learn next semester.

How Big a Sample?
Bigger is better, right? Yes and no. A large sample provides more information about the population than a small sample. But a large sample costs more to obtain. It turns out that a sample of, say, 1000 residents of Denton will provide as much information about Denton as a sample of 1000 residents of the United States will provide about the United States! It's the size of the sample, not the fraction of the population, that is important. (More on this next semester.)
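One way to check this claim numerically is with the standard error of a sample proportion, including the finite population correction. This formula is a standard textbook result and does not come from these slides. The two population sizes below are round, hypothetical figures for "a city" and "a country", and se_proportion is a name made up for this sketch.

```python
import math

def se_proportion(p, n, N):
    """Standard error of a sample proportion for a simple random sample
    of size n drawn without replacement from a population of size N."""
    fpc = math.sqrt((N - n) / (N - 1))  # finite population correction
    return math.sqrt(p * (1 - p) / n) * fpc

n = 1000
p = 0.5                  # worst case: gives the largest standard error
city = 150_000           # hypothetical city-sized population
country = 330_000_000    # hypothetical country-sized population

se_city = se_proportion(p, n, city)
se_country = se_proportion(p, n, country)
se_country_4x = se_proportion(p, 4 * n, country)

print(f"n = 1000, city:    {se_city:.5f}")
print(f"n = 1000, country: {se_country:.5f}")
print(f"n = 4000, country: {se_country_4x:.5f}")
```

With the same sample size, the two standard errors are nearly identical even though the populations differ by a factor of more than 2000. Quadrupling the sample size, on the other hand, roughly halves the standard error.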

How Big a Sample?
It's like this: if you are making soup, no matter how big the pot, a single spoonful is enough of a taste to know if the seasonings are to your liking.

Sampling Methods
Census
- Measure or observe every member of a population.
- May be too expensive.
- May not be practical.
- The population is always changing (new items manufactured, births, deaths), so your count won't be exact.
- In the U.S. census, undercount is a problem. Sometimes entire groups of people are missed.

Sampling Methods
Simple Random Sample (SRS)
- Requires a sampling frame.
- Every sample of a given size has the same chance of selection.*
- A good mental picture is drawing names from a hat.
*This is important.
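The "names from a hat" picture can be sketched in a few lines. The frame of 200 student IDs and the function name simple_random_sample are hypothetical. Python's random.sample selects without replacement, so every set of 10 students is equally likely.

```python
import random

def simple_random_sample(frame, n, rng):
    """Select n distinct units from the sampling frame so that every
    possible sample of size n has the same chance of being chosen."""
    return rng.sample(frame, n)

rng = random.Random(2024)  # fixed seed so the run is repeatable

# Hypothetical sampling frame: a complete list of 200 student IDs.
frame = [f"student_{i:03d}" for i in range(1, 201)]

srs = simple_random_sample(frame, 10, rng)
print(srs)
```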

Sampling Methods
Stratified Random Sample
- The population is first organized into homogeneous groups called strata.
- A random sample is then taken from each of the strata.
- Improves the representativeness of a sample when members of the population are recognized to belong to particular groups that are related to the variables of interest.
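A minimal sketch of stratified sampling, assuming a hypothetical frame of 200 students with grade level as the stratifying variable. The function name, the data, and the choice of 5 students per stratum are all made up for illustration.

```python
import random
from collections import defaultdict

def stratified_sample(frame, stratum_of, n_per_stratum, rng):
    """Group the frame into strata, then take a simple random sample
    of n_per_stratum units from each stratum."""
    strata = defaultdict(list)
    for unit in frame:
        strata[stratum_of(unit)].append(unit)
    return {name: rng.sample(members, n_per_stratum)
            for name, members in sorted(strata.items())}

rng = random.Random(15)  # fixed seed so the run is repeatable

# Hypothetical frame: 200 students, 50 in each grade level.
grades = ["freshman", "sophomore", "junior", "senior"] * 50
frame = [(f"student_{i:03d}", grade) for i, grade in enumerate(grades, start=1)]

sample = stratified_sample(frame, lambda unit: unit[1], 5, rng)
for grade, students in sample.items():
    print(grade, [name for name, _ in students])
```

Stratifying first is the nonrandom step, and the selection within each stratum is the random step. Every grade level is guaranteed to appear in the sample.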

Sampling Methods
Systematic Random Sample
- Useful when it is not practical to randomly select individuals from a list.
- The population must not be organized in any particular manner with respect to the variable of interest.*
- When appropriate, this method of sampling can be cheap and easy to do.
- Imagine the population is lined up. Estimate the population size N. If you want a sample of size n, randomly select a starting position from 1 to N/n, then take every (N/n)th member of the population from that point on.
*This is important.
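The procedure can be sketched as follows, assuming the population is available as an ordered list and that N/n is rounded down to a whole number when it does not divide evenly. The list of 200 cars and the function name systematic_sample are hypothetical.

```python
import random

def systematic_sample(population, n, rng):
    """Pick a random starting position from 1 to k = N // n, then take
    every k-th member of the lined-up population from that point on."""
    k = len(population) // n      # sampling interval N/n, rounded down
    start = rng.randint(1, k)     # random start, 1 to k inclusive
    positions = range(start, start + k * n, k)
    return [population[pos - 1] for pos in positions]  # positions are 1-based

rng = random.Random(16)  # fixed seed so the run is repeatable

# Hypothetical lined-up population of N = 200 cars.
population = [f"car_{i:03d}" for i in range(1, 201)]

sample = systematic_sample(population, 10, rng)
print(sample)
```

Only the starting position is random. Every later selection is determined by the interval, which is why the ordering of the population must be unrelated to the variable of interest.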

Sampling Methods
Cluster Random Sample
- Useful when it is not practical to select individuals from a list.
- The population is divided into logical clusters (family groups, animal nests, classrooms, apartment buildings, etc.).
- Randomly select clusters and measure or observe every member of each selected cluster.
- Unlike homogeneous strata, clusters are roughly heterogeneous (i.e., each cluster more or less resembles the entire population).
- A cluster sample is a matter of practicality.
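A minimal sketch of cluster sampling, assuming a hypothetical school with 12 classrooms of 25 students each. The names cluster_sample and classroom_N are made up for the example.

```python
import random

def cluster_sample(clusters, num_clusters, rng):
    """Randomly select whole clusters, then include every member of
    each selected cluster in the sample."""
    chosen = rng.sample(sorted(clusters), num_clusters)
    return {name: list(clusters[name]) for name in chosen}

rng = random.Random(17)  # fixed seed so the run is repeatable

# Hypothetical population: 12 classrooms (the clusters), 25 students in each.
clusters = {
    f"classroom_{c}": [f"classroom_{c}_student_{s}" for s in range(1, 26)]
    for c in range(1, 13)
}

chosen = cluster_sample(clusters, 3, rng)
for name, members in chosen.items():
    print(name, len(members), "students")
```

Here the random step is the choice of clusters. No further selection happens inside a chosen cluster, which is the reverse of stratified sampling, where every stratum is used and the random selection happens within each one.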

Sampling Methods
Multi-stage Random Sample
Sampling methods may be repeated (stage 1, stage 2, etc.) or combined with other methods.
Example: The U.S. is divided into approximately 3000 counties that are labeled rural, suburban, and urban. A nationwide sample might start with this stratification of counties to ensure that some of each type are selected. Then within each selected county, individual towns and cities can be selected (cluster sampling). Within each town or city, individual voting precincts can be selected (another round of cluster sampling), and so on.
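The county example can be sketched in simplified form. This version uses only two stages (counties stratified by type, then precincts within each selected county) and skips the town and city stage. The nine counties, their precincts, and the function name multistage_sample are all hypothetical.

```python
import random

def multistage_sample(counties, counties_per_type, precincts_per_county, rng):
    """Stage 1: stratify counties by type and randomly select some of each type.
    Stage 2: randomly select precincts within each selected county."""
    by_type = {}
    for name, info in counties.items():
        by_type.setdefault(info["type"], []).append(name)

    selected = {}
    for ctype in sorted(by_type):
        for county in rng.sample(sorted(by_type[ctype]), counties_per_type):
            selected[county] = rng.sample(counties[county]["precincts"],
                                          precincts_per_county)
    return selected

rng = random.Random(18)  # fixed seed so the run is repeatable

# Hypothetical data: 3 counties of each type, 8 precincts per county.
counties = {
    f"{ctype}_county_{i}": {
        "type": ctype,
        "precincts": [f"{ctype}_{i}_precinct_{j}" for j in range(1, 9)],
    }
    for ctype in ("rural", "suburban", "urban")
    for i in range(1, 4)
}

plan = multistage_sample(counties, counties_per_type=2,
                         precincts_per_county=3, rng=rng)
for county, precincts in plan.items():
    print(county, precincts)
```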

(Really) Bad Sampling
Voluntary Response Sample
- Ask people to participate.
- Call-in radio polls, website polls, etc.
- Tends to draw strong, nonrepresentative opinions.

(Really) Bad Sampling
Convenience Sample
- The sample is obtained from easily observed members of the population.
- The sample is not likely to be representative of the population of interest.
Example: An opinion survey about shopping conducted at the local mall only reaches people who go to that mall.
Example: A school newspaper wants to estimate the percent of seniors who are likely to go to college, so the reporter asks everyone in her AP classes about future plans.

(Really) Bad Sampling
Incomplete Sampling Frame
A sampling frame that does not include all of the population of interest will lead to a nonrepresentative sample. What do you think about the following sampling frames? Who might be left out, and why would it matter?
- Telephone book.
- Voter registration lists.
- Vehicle registration lists.

Sources of Bias
Undercoverage Bias
- Certain groups within the population are underrepresented in the sample.
- For example, a telephone survey conducted during the day will tend to miss people who work.
- This is a problem if the people who work differ in an important way from people who don't work.
- Pay close attention to the sample design to minimize this problem.
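A short simulation shows why undercoverage cannot be fixed by careful random selection from the wrong frame. All numbers are invented for illustration and are not historical data: a population of 10,000 voters in which only 3,000 appear in the sampling frame, with support for a candidate at 40% among those listed and 70% among those missing.

```python
import random

rng = random.Random(1936)  # fixed seed so the run is repeatable

# Hypothetical population of 10,000 voters, coded 1 = supports the candidate.
covered = [1] * 1200 + [0] * 1800       # 3,000 people in the frame, 40% support
not_covered = [1] * 4900 + [0] * 2100   # 7,000 people missing from it, 70% support
population = covered + not_covered

true_p = sum(population) / len(population)

# A simple random sample of 1,000 from the incomplete frame ...
frame_estimate = sum(rng.sample(covered, 1000)) / 1000
# ... versus a simple random sample of 1,000 from a complete frame.
full_estimate = sum(rng.sample(population, 1000)) / 1000

print("true proportion:           ", true_p)
print("estimate, incomplete frame:", frame_estimate)
print("estimate, complete frame:  ", full_estimate)
```

The sample from the incomplete frame is drawn perfectly at random, yet it lands near 0.40 instead of the true 0.61, because the people it can reach differ from the people it cannot.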

Sources of Bias
Nonresponse Bias
- Selected individuals are not available or choose not to respond.
- This is important because the people who do not respond may have a different opinion or answer than the people who do respond.
To encourage a better response:
- Keep the survey short.
- Offer an incentive for participating.

Sources of Bias
Response Bias
The questions being asked tend to lead toward particular answers because of the choice of wording or how the question is asked (if the survey is conducted through live interview). Compare:
- "In light of the problems in the current economy, are you opposed to raising property taxes to pay for new schools?"
- "Do you favor or oppose raising property taxes to pay for new schools?"

Sources of Bias
Response Bias
The order of the questions may influence the response:
- "Do you think a communist country like the Soviet Union should allow U.S. reporters to enter their country and freely report the news to the readers back home?"
- "Do you think the U.S. should allow reporters from a communist country like the Soviet Union to enter our country and freely report the news to the readers back home?"
When these questions were asked in a survey in the 1970s, the percent who answered "Yes" to each question varied depending on the order of the questions.

Example
The police set up a roadblock to check cars for up-to-date registration, insurance, and safety inspections. They stop every 10th car that passes.
Population: All cars in the jurisdiction of the police.
Parameter: Proportion of cars with up-to-date registration, etc.
Sampling frame: The cars on the road when they set up the roadblock.
Sample: Every 10th car that passes the roadblock.
Sampling method: Systematic random sample.
Possible bias? The time of day or location of the roadblock may not lead to a representative sample of cars. Otherwise, this is probably a pretty good method of collecting the data.

Assignment
Read Chapter 11.
Exercises: #1-11 odd, 17, 18, 23-29 odd, 31, 37
www.causeweb.org
John Landers