Stat 1010 - Sampling. Section 1.2: Sampling

Section 1.2: Sampling

Idea 1: Examine a part of the whole.

Population: The entire group of individuals that we want to make a statement about.
Sample: The part of the population we actually examine.

e.g.
Population: My 9am statistics class.
Sample: All students sitting in a seat whose seat number ends in 2.

What about a census?

A census collects information on everyone. Would a census of the population be a better way to go?
- Often difficult to do: time, money, resources, non-responders, etc.
- Populations are often dynamic: they're changing as you're collecting the data.
- Can be complex: who gets missed?

Properties of a Sample

We would like the sample to be representative of the population.

Suppose you want to taste (or sample) your soup. If you leave it sitting for 2 hours and spoon off the top, would that be representative of the soup as a whole? Will you miss some important parts? If you stir it thoroughly and then take a taste, would that be more representative of the soup as a whole?

A representative sample is a sample in which the relevant characteristics of the sample members are generally the same as the characteristics of the population.

Getting a perfectly representative sample may not be possible, but we would at least like a sample that is not biased.

Biased sample: the sample is out of step with the full population. A biased sample differs in a specific way from the population.

Are we introducing bias? How?

Response: Grade Point Average (GPA)
- Population (whole): STAT1010 class
- Sample (subset): All students in the last 3 rows
Is it a representative sample?

Response: Hotel quality
- Population (whole): All users of the hotel
- Sample (subset): Users who took the time to upload a review on the internet
Is it a representative sample?

Response: Defect rate of a product
- Population (whole): All products produced
- Sample (subset): Products produced on Friday from 3-5pm
Is it a representative sample?

Are we introducing bias? How?

A good statistical study MUST have a representative sample. Otherwise the sample is biased and conclusions from the study are not trustworthy.

The Gallup poll was very off in its presidential election prediction in 2012.
- Post-election examination determined that part of the poll's overstatement of Romney support arose from too few phone interviews in the Eastern and Pacific time zones overstating the white vote... (See link to article in USA Today on course website)

Sample Surveys

Idea 2: Choosing randomly
- Selecting items for the sample should be done at random so as to reduce the chance of getting a biased sample.
- We can't always perfectly use random choice, but we do the best we can for the matter at hand.

Simple Random Sample (SRS)

We want a representative sample but will settle for one that is not biased.

SRS of size n=400
- Give each individual in the population a number, then randomly generate 400 numbers as the chosen individuals.
- Each combination of 400 individuals has the same chance of being selected.
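The SRS recipe above (number every individual, then randomly generate 400 numbers) can be sketched in a few lines of Python. This is only an illustration: the population size of 5000, the function name simple_random_sample, and the seed are assumptions made for the example, not values from the course.

```python
import random


def simple_random_sample(population_size, n, seed=None):
    """Draw n distinct labels from 1..population_size.

    random.Random.sample draws without replacement, so every
    combination of n labels has the same chance of being chosen.
    """
    rng = random.Random(seed)
    return sorted(rng.sample(range(1, population_size + 1), n))


# Hypothetical population of 5000 numbered individuals (assumed size).
chosen = simple_random_sample(population_size=5000, n=400, seed=1)
print(len(chosen), "individuals chosen; first few labels:", chosen[:5])
```

Changing the seed gives a different set of 400 labels, which is the sampling variability discussed next.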

Simple Random Sample

If one were to do this more than once:
- Different random numbers will give different samples of 400 students.
- We have introduced variability by sampling.

See the web-based GUI applet on sampling words from the Gettysburg Address and the observed word length: http://www.rossmanchance.com/applets/onesample.html

[Applet screenshots: 268 words in the population (whole), 10 chosen. Panels show which words were chosen, one sample's information, the population information, and cumulative results over 5 different simulations.]

Other Sampling Plans

Systematic Sampling
- Select in a systematic way from the sampling frame. e.g. Every 60th student (arranged alphabetically) on the list from the Registrar for an opinion survey. Use a random start point.
- Caution: the order of the list must be random with respect to the response being measured. Sampling every Friday on an assembly line is not a good idea. Sampling every 15 minutes at a museum entry seems fine.
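As a rough sketch of systematic sampling with a random start point, the snippet below takes every 60th name from an alphabetized list. The roster of 1200 made-up student names and the function name systematic_sample are assumptions for illustration only.

```python
import random


def systematic_sample(frame, step, seed=None):
    """Take every `step`-th entry of the sampling frame from a random start."""
    rng = random.Random(seed)
    start = rng.randrange(step)  # random start position in 0..step-1
    return frame[start::step]


# Hypothetical alphabetized Registrar list of 1200 students (assumed).
roster = [f"student_{i:04d}" for i in range(1, 1201)]
picked = systematic_sample(roster, step=60, seed=7)
print(len(picked), "students picked, starting with", picked[0])
```

Only the start point is random here, so the method is only as good as the ordering of the list, which is the caution given above.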

Other Sampling Plans

Stratified Sampling
- Divide the population into strata (subpopulations) and select an SRS from each stratum. e.g. An SRS from each county in Iowa. Example strata: race, income, age, sex, etc.
- Lets you make sure you're getting a certain amount of input from each stratum or group. All strata will be represented.

Cluster
- Divide the population into clusters, randomly select some of the clusters, and choose all members (not an SRS) from the selected clusters as your sample.
- Might be more practical than an SRS.
- Note that ALL individuals from a chosen cluster are sampled, compared to only some individuals from each stratum in stratified sampling.

Convenience
- Use a sample that is convenient to attain. e.g. Last 3 rows of students to represent the class. e.g. Voluntary responses on an internet hotel survey.
- In general, not a good idea. Often gives biased results. Could be justified in some cases, but try to use a different sampling plan if possible.
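The difference between stratified and cluster sampling may be easier to see side by side. The sketch below uses five made-up groups of 50 residents each (labelled county_A to county_E), first as strata and then as clusters. All names and sizes are assumptions for the example.

```python
import random


def stratified_sample(strata, n_per_stratum, rng):
    """Take an SRS of n_per_stratum members from EVERY stratum."""
    return {name: rng.sample(members, n_per_stratum)
            for name, members in strata.items()}


def cluster_sample(clusters, n_clusters, rng):
    """Randomly pick some clusters, then keep ALL members of each one."""
    chosen = rng.sample(sorted(clusters), n_clusters)
    return {name: list(clusters[name]) for name in chosen}


rng = random.Random(3)
# Hypothetical population: 5 groups with 50 residents each (assumed).
groups = {f"county_{c}": [f"county_{c}_resident_{i}" for i in range(50)]
          for c in "ABCDE"}

strat = stratified_sample(groups, n_per_stratum=5, rng=rng)
clust = cluster_sample(groups, n_clusters=2, rng=rng)

print("stratified:", {name: len(m) for name, m in strat.items()})
print("cluster:   ", {name: len(m) for name, m in clust.items()})
```

The stratified sample contains a few residents from every group, while the cluster sample contains everyone from only two groups.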

Other problems

Question bias/response bias: things that influence the response
- The question could be worded negatively:
  Would you favor or oppose a law that would take away your constitutional right to own guns?
  Would you favor or oppose a law that would reduce gun violence in your neighborhood?
- Respondents don't like the interviewer.
- Respondents are embarrassed to tell the truth and give false information.

Non response
- Is there a reason a group doesn't respond? Critical thinking is useful here.
- If it's a health survey, will unhealthy people be less likely to respond?
- Non response is a BIG issue in sample surveys.

Is there an association between breast cancer and abortion?

Studies include women who have and who have not had breast cancer.
- An observational study found there was an association.
- Which group of women is more likely to be TOTALLY honest about their personal health?

National Cancer Institute (2003)
- Refuted the reliability of the study.

Variability in Samples

Results from a sample provide estimates of the truth about a population.

2 different samples will give 2 different estimates (recall the word length sampling example).
- Why? Because we used random chance to select the sample.
- This allows us to use probability to determine how large of an error we are likely to make (we'll talk more on this later).

Larger samples give more accurate estimates than smaller samples.

Some main topics from Sections 1.1-1.2

Parameter (usually a Greek letter) vs. Statistic
- Population vs. Sample

Choose the sample at random
- Helps avoid getting a biased sample

Sampling methods
- Simple Random Sample (SRS)
- Stratified sampling
- Cluster sampling
- Convenience sampling (proceed with caution)
- Systematic sampling
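The point about variability in samples (different samples give different estimates, and larger samples give more accurate ones) can be checked with a small simulation. The population below is 268 made-up word lengths, not the actual Gettysburg Address data, and the sample sizes of 10 and 100 and the 2000 repeats are arbitrary choices for illustration.

```python
import random
import statistics

rng = random.Random(0)

# Hypothetical population: 268 made-up word lengths (NOT the real data).
population = [rng.randint(1, 11) for _ in range(268)]
mu = statistics.mean(population)  # the parameter: true population mean


def spread_of_sample_means(n, repeats=2000):
    """Standard deviation of the sample mean over many SRSs of size n."""
    means = [statistics.mean(rng.sample(population, n))
             for _ in range(repeats)]
    return statistics.pstdev(means)


small = spread_of_sample_means(10)
large = spread_of_sample_means(100)

print(f"population mean (parameter): {mu:.2f}")
print(f"spread of sample means, n=10:  {small:.3f}")
print(f"spread of sample means, n=100: {large:.3f}")
```

Each sample mean is a statistic that estimates the parameter mu. The spread for n=100 comes out smaller than for n=10, which is what "more accurate" means here.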