Statistics 101: Section L Laboratory 10

Similar documents
Business Statistics. Lecture 2: Descriptive Statistical Graphs and Plots

Name Class Date. Introducing Probability Distributions

Excel Lab 2: Plots of Data Sets

CHM 109 Excel Refresher Exercise adapted from Dr. C. Bender s exercise

A graph is an effective way to show a trend in data or relating two variables in an experiment.

TJP TOP TIPS FOR IGCSE STATS & PROBABILITY

How to Make a Run Chart in Excel

This page intentionally left blank

Office 2016 Excel Basics 24 Video/Class Project #36 Excel Basics 24: Visualize Quantitative Data with Excel Charts. No Chart Junk!!!

Chapter 1. Picturing Distributions with Graphs

Math 58. Rumbos Fall Solutions to Exam Give thorough answers to the following questions:

CHAPTER 15. Cross Section Sheets. None, except batch processing of an input file.

Sections Descriptive Statistics for Numerical Variables

Sensors and Scatterplots Activity Excel Worksheet

BE540 - Introduction to Biostatistics Computer Illustration. Topic 1 Summarizing Data Software: STATA. A Visit to Yellowstone National Park, USA

Page 21 GRAPHING OBJECTIVES:

NCSS Statistical Software

IE 361 Module 36. Process Capability Analysis Part 1 (Normal Plotting) Reading: Section 4.1 Statistical Methods for Quality Assurance

Chapter 2. The Excel functions, Excel Analysis ToolPak Add-ins or Excel PHStat2 Add-ins needed to create frequency distributions are:

Excel Tool: Plots of Data Sets

Environmental Stochasticity: Roc Flu Macro

(2) Do the problem again this time using the normal approximation to the binomial distribution using the continuity correction A(2)_

Numerical: Data with quantity Discrete: whole number answers Example: How many siblings do you have?

What is the expected number of rolls to get a Yahtzee?

Section 6.4. Sampling Distributions and Estimators

This week we will work with your Landsat images and classify them using supervised classification.

Step 1: Set up the variables AB Design. Use the top cells to label the variables that will be displayed on the X and Y axes of the graph

Linear Regression Exercise

Chapter 2. Organizing Data. Slide 2-2. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Univariate Descriptive Statistics

Displaying Distributions with Graphs

Excel Manual X Axis Label Below Chart 2010 >>>CLICK HERE<<<

MATHEMATICAL FUNCTIONS AND GRAPHS

EE EXPERIMENT 3 RESISTIVE NETWORKS AND COMPUTATIONAL ANALYSIS INTRODUCTION

J. La Favre Fusion 360 Lesson 5 April 24, 2017

USE OF BASIC ELECTRONIC MEASURING INSTRUMENTS Part II, & ANALYSIS OF MEASUREMENT ERROR 1

Chapter 4. Displaying and Summarizing Quantitative Data. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Chapter 10. Definition: Categorical Variables. Graphs, Good and Bad. Distribution

This tutorial will lead you through step-by-step to make the plot below using Excel.

Statistics Laboratory 7

PHY 1405 Conceptual Physics I Making a Spring Scale. Leader: Recorder: Skeptic: Encourager:

Graphing with Excel. Data Table

Section 1.5 Graphs and Describing Distributions

CS/NEUR125 Brains, Minds, and Machines. Due: Wednesday, February 8

16 Histograms. Using Histograms to Reveal Distribution

Chapter 1. Statistics. Individuals and Variables. Basic Practice of Statistics - 3rd Edition. Chapter 1 1. Picturing Distributions with Graphs

Assessing Measurement System Variation

Assignment 5 due Monday, May 7

Exploring Data Patterns. Run Charts, Frequency Tables, Histograms, Box Plots

Graphing Guidelines. Controlled variables refers to all the things that remain the same during the entire experiment.

Frequency Distribution and Graphs

Assessing Measurement System Variation

Interval of Head Circumferences (mm) XS 510 < 530 S 530 < 550 M 550 < 570 L 570 < 590 XL 590 < 610 XXL 610 < 630. Hat Sizes.

Chpt 2. Frequency Distributions and Graphs. 2-3 Histograms, Frequency Polygons, Ogives / 35

Math Exam 2 Review. NOTE: For reviews of the other sections on Exam 2, refer to the first page of WIR #4 and #5.

Math Exam 2 Review. NOTE: For reviews of the other sections on Exam 2, refer to the first page of WIR #4 and #5.

Experiment P55: Light Intensity vs. Position (Light Sensor, Motion Sensor)

CREATING (AB) SINGLE- SUBJECT DESIGN GRAPHS IN MICROSOFT EXCEL Lets try to graph this data

Stream Design: From GEOPAK to HEC-Ras

Subdivision Cross Sections and Quantities

Tektronix digital oscilloscope, BK Precision Function Generator, coaxial cables, breadboard, the crystal earpiece from your AM radio kit.

Exploring the Pythagorean Theorem

EE 210 Lab Exercise #3 Introduction to PSPICE

Summary... 1 Sample Data... 2 Data Input... 3 C Chart... 4 C Chart Report... 6 Analysis Summary... 7 Analysis Options... 8 Save Results...

PASS Sample Size Software

Lesson 8 Tic-Tac-Toe (Noughts and Crosses)

UNIT TWO: Data for Simple Calculations. Enter and format a title Modify font style and size Enter column headings Move data Edit data

Physics 253 Fundamental Physics Mechanic, September 9, Lab #2 Plotting with Excel: The Air Slide

Lesson Sampling Distribution of Differences of Two Proportions

Input of Precise Geometric Data

Math 247: Continuous Random Variables: The Uniform Distribution (Section 6.1) and The Normal Distribution (Section 6.2)

Learning Log Title: CHAPTER 2: ARITHMETIC STRATEGIES AND AREA. Date: Lesson: Chapter 2: Arithmetic Strategies and Area

VARVE MEASUREMENT AND ANALYSIS PROGRAMS OPERATION INSTRUCTIONS. USING THE COUPLET MEASUREMENT UTILITY (Varve300.itm)

Chapter 3. Graphical Methods for Describing Data. Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc.

6. Multivariate EDA. ACE 492 SA - Spatial Analysis Fall 2003

3.6 Theoretical and Experimental Coin Tosses

Organizing Data 10/11/2011. Focus Points. Frequency Distributions, Histograms, and Related Topics. Section 2.1

4.4 Slope and Graphs of Linear Equations. Copyright Cengage Learning. All rights reserved.

IE 361 Module 17. Process Capability Analysis: Part 1. Reading: Sections 5.1, 5.2 Statistical Quality Assurance Methods for Engineers

Anchor Block Draft Tutorial

Chapter 11. Sampling Distributions. BPS - 5th Ed. Chapter 11 1

Describe the variable as Categorical or Quantitative. If quantitative, is it discrete or continuous?

Appendix 3 - Using A Spreadsheet for Data Analysis

FlashChart. Symbols and Chart Settings. Main menu navigation. Data compression and time period of the chart. Chart types.

Laboratory Experiment #1 Introduction to Spectral Analysis

Chapter 1: Stats Starts Here Chapter 2: Data

PLC Papers Created For:

XL1F: V0G Create Histogram using HISTOGRAM in Excel 2013

2.2 More on Normal Distributions and Standard Normal Calculations

How to define Graph in HDSME

1. Setup Output mode. 2. Using a Fixed tile size

EKA Laboratory Muon Lifetime Experiment Instructions. October 2006

Statistics 1040 Summer 2009 Exam III

10 Wyner Statistics Fall 2013

What are the chances?

Experiment P11: Newton's Second Law Constant Force (Force Sensor, Motion Sensor)

The Toolbars submenu selects or deselects the following toolbars, below shows you how to display the Measuring Toolbar: Scale X in Y

For question 1 n = 5, we let the random variable (Y) represent the number out of 5 who get a heart attack, p =.3, q =.7 5

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. B) Blood type Frequency

TeleTrader FlashChart

Transcription:

Statistics 101: Section L Laboratory 10 This lab looks at the sampling distribution of the sample proportion pˆ and probabilities associated with sampling from a population with a categorical variable. Proportions are used to summarize information about categorical variables (the proportion of people that belong to a particular category). To look at the distribution for the sample proportion, pˆ, we will sample from a population of 250 Statistics 101 students. The characteristic we are interested in is the proportion of students with blue eyes. By taking many samples (called repeated sampling) from the population and looking at the distribution of the sample statistics generated, the distribution for the sample statistic is obtained. Activity 1: Refer to the table titled Eye Color for Population of 250 Statistics 101 Students. This table contains a listing of the eye colors of the population members. Rather than list the names of the population members, this table numbers them by rows numbered {00, 01,..., 09, 10,..., 24} and columns numbered {0,1,2,..., 9}. For example, student 057 (Row 05 and Column 7) has brown eyes. a) Use the random number table to select a simple random sample of 10 students from this population. Write the student numbers and eye colors on the answer sheet. Calculate the proportion of the students in your sample with blue eyes. Note: You can make more efficient use of the 3 digit random numbers doing the following. For numbers between 000 and 249 go directly to the table. For numbers between 250 and 499 subtract 250 and then go to the table. For numbers between 500 and 749 subtract 500 and then go to the table. For numbers between 750 and 999 subtract 750 and then go to the table. b) Take another random sample but this time of size 25 from the population. For this sample, just keep track of the number of students in the sample with blue eyes. Write the proportion of the students in your sample with blue eyes on the answer sheet. Activity 2: Doing the random sampling by hand is tedious. We can use JMP to do the random sampling for us provided we have a population to sample from in the form of a JMP data. On the course web site is a JMP data table, called eyecolor.jmp, with information about eye color for a population of 1070 individuals. a) Use Analyze Distribution to find the proportion of this population with blue eyes. b) Go back to the course web site and right-click on the file bluesampleprop.jsl. Choose the Save Link As option from the menu and save the file to the computer s desktop. Go to the desktop and double-click to open the JMP script. To run the JMP script click on the Red JMP icon on the menu bar. This script will take 100 samples of size 10, 100 samples of size 25 and 100 samples of size 50 from this population and determine the proportion of individuals in each of the samples with blue eyes. Once the script finishes running (this may take a while so be patient), you will see a data table (named samplesummaries) with three columns. The first column contains the sample proportions from samples of size 10, the second column contains the sample proportions from sample of size 25, and the last column contains the sample proportions of samples of size 50. Use Analyze Distribution to obtain histograms of these columns (put all three columns into Y, Columns in the Analyze Distribution dialog box). Once you have the three histograms go to the pull down menu next to 1

Distributions in the output and select Uniform Scaling. Use this information to answer the following questions. Turn in your JMP output. c) What are the mean values of the sample proportions for the three sample sizes? What values should each of these means be close to? Why? d) What are the standard deviation values of the sample proportions for the three sample sizes? What values should each of these standard deviations be close to? Why? e) What is the shape of the histogram of the sample proportion values for each of the three sample sizes? Are there any differences in the shapes as the sample size increases? f) Add a normal quantile plot to each distribution output. Describe what you see in the normal quantile plot for each sample size. Could the normal distribution be used to model the distribution of the sample proportions for any of the three sample sizes? If so, which ones. Activity 3. In the first activity in lab this week you looked at selecting a random sample of 10 from the population of 250 students and recording the proportion of students in your sample with blue eyes. In this exercise we will look at how to use probability to see how likely it is to get each of the possible values of the sample proportion, pˆ. Our population has 31.2% blue eyed people and 68.8% of people with non-blue eyes. One probability rule is that for independent trials; Prob(A and B) = Prob(A)*Prob(B) Note that this expands to any number of independent trials; Prob(A and B and C and D) = Prob(A)*Prob(B)*Prob(C)*Prob(D) a) In a random sample of 10, in order to get a value of pˆ = 0 you have to see none of the 10 people with blue eyes. That means that all 10 of the people chosen would have to have non-blue eyes. Write a probability expression for the event that none of the 10 people have blue eyes and compute the probability. b) In a random sample of 10, in order to get a value of pˆ = 0.1 you have to see exactly one person with blue eyes. One way to do this is for the first person selected to have blue eyes and the remaining nine people to have non-blue eyes. Write a probability expression for this event and compute the probability. c) Of course the event described in b. is not the only way to have exactly one person in a sample of 10 have blue eyes. Name another way we could get a random sample with pˆ = 0.1. What is the probability associated with this new event. d) How many different ways can you get a random sample with exactly one person with blue eyes? e) Using b), c) and d), what is the probability that pˆ = 0.1 for a random sample of 10 people from the population with 31.2% blue eyed people? f) What you are calculating are binomial probabilities. This is something JMP does very easily. Go to JMP and create a new data table with three columns. Label the first column # Blue, the second column p-hat, and the third column Probability. In 2

the first column put the numbers from 0 to 10 (you will have 11 rows). For the second column use the Cols Formula and enter the formula # Blue divided by 10. For the third column use the Cols Formula Discrete Probability Binomial Probability and enter 0.312 for p, 10 for n, and click on the # Blue column for k. The formula should look like: Binomial Probability(0.312,10,# Blue) g) What is the probability that pˆ = 0? What is the probability that pˆ = 0.1? h) In order to create a distribution for the values of pˆ add another column to your JMP table labeled Frequency. Use Cols Formula Probability*100,000,000 to fill this column. Use Analyze Distribution with p-hat in Y, Columns and Frequency in Freq and Click on OK. For your JMP output, go to Histogram Options (red pull down menu next to p-hat) and de-select Vertical. Also select a Prob axis. Right click on the horizontal axis of the histogram and select Axis Settings. Make the Minimum 0, the Maximum 1, and the Increment 0.1. Use the JMP output to answer the following questions. Turn in the JMP output. Describe the shape of the distribution. Compare the mean to the median. What does this comparison tell you about the shape of the distribution? What is the mean of the distribution? How does this relate to the proportion of people with blue eyes in the population? What is the standard deviation of the distribution? How does this relate to the proportion of people with blue eyes in the population? i) Repeat parts of f), g) and h) to construct the probability distribution for pˆ with n=25 instead of 10. You will have to create a new data table. Be careful to correctly calculate the value of pˆ (remember that it should go from 0 to 1). Describe the shape, center and spread and relate the center and spread to the proportion of people in the population with blue eyes. 3

Eye Color for Population of 250 Statistics 101 Students 0 1 2 3 4 5 6 7 8 9 00 blue brown blue brown green blue brown green green brown 01 hazel green blue hazel brown blue brown brown brown blue 02 blue brown blue brown hazel green brown brown green green 03 green brown brown brown green brown brown green hazel green 04 brown blue other blue blue hazel brown hazel green brown 05 brown brown brown blue blue brown blue brown blue blue 06 green blue hazel brown green green blue blue blue blue 07 green hazel blue hazel brown green green blue brown green 08 brown hazel brown blue blue blue brown brown hazel brown 09 blue green blue green brown other brown blue blue brown 10 blue brown brown hazel blue brown brown blue green brown 11 brown blue blue blue other green blue hazel green brown 12 blue blue hazel blue hazel brown other blue green blue 13 blue brown hazel brown blue hazel brown blue green blue 14 brown hazel blue hazel hazel blue brown blue blue brown 15 brown brown hazel hazel green brown brown brown brown blue 16 green hazel blue green brown brown hazel blue blue blue 17 green green other brown green brown brown green brown brown 18 green green blue blue blue brown green hazel brown green 19 brown hazel blue blue hazel blue brown brown green green 20 green brown green green brown blue other blue hazel blue 21 green blue brown green other blue blue hazel brown hazel 22 green blue blue blue green brown green blue hazel brown 23 brown blue blue brown brown hazel blue brown brown brown 24 blue brown blue brown green green blue hazel blue brown 4

Stat 101 L: Laboratory 10 Answer Sheet Names: Activity 1: a) Student Number Eye Color Student Number Eye Color n 10, pˆ n 25, pˆ b) Activity 2: a) Value of p? b) c) Sample size Mean Close to? d) Sample size Standard deviation Close to? 5

e) Sample size Shape of histogram Changes in shape? f) Sample size Normal quantile plot Normal? Activity 3: a) P(no one with blue eyes in sample of 10) = b) P(first person with blue eyes and 9 other people with non blue eyes in sample of 10) = c) Another way to get one person with blue eyes and probability associated with that event. d) Number of ways to get exactly one person with blue eyes in a sample of 10? 6

e) P( pˆ = 0.1) = f) g) P( pˆ = 0) = P( pˆ = 0.1) = h) Distribution of pˆ when. Describe the shape of the distribution. Compare the mean to the median. What does this comparison tell you about the shape of the distribution? What is the mean of the distribution? How does this relate to the proportion of people with blue eyes in the population? What is the standard deviation of the distribution? How does this relate to the proportion of people with blue eyes in the population? i) P( pˆ = 0) = P( pˆ = 0.1) = Distribution of pˆ when. Describe the shape of the distribution. Compare the mean to the median. What does this comparison tell you about the shape of the distribution? What is the mean of the distribution? How does this relate to the proportion of people with blue eyes in the population? What is the standard deviation of the distribution? How does this relate to the proportion of people with blue eyes in the population? 7