Section 1.5 Graphs and Describing Distributions

Similar documents
Numerical: Data with quantity Discrete: whole number answers Example: How many siblings do you have?

Notes: Displaying Quantitative Data

Sections Descriptive Statistics for Numerical Variables

Chapter 1: Stats Starts Here Chapter 2: Data

STK110. Chapter 2: Tabular and Graphical Methods Lecture 1 of 2. ritakeller.com. mathspig.wordpress.com

Chapter 4. September 08, appstats 4B.notebook. Displaying Quantitative Data. Aug 4 9:13 AM. Aug 4 9:13 AM. Aug 27 10:16 PM.

Chapter 4. Displaying and Summarizing Quantitative Data. Copyright 2012, 2008, 2005 Pearson Education, Inc.

AP Statistics Composition Book Review Chapters 1 2

Chpt 2. Frequency Distributions and Graphs. 2-3 Histograms, Frequency Polygons, Ogives / 35

Chapter 4 Displaying and Describing Quantitative Data

Chapter 3. Graphical Methods for Describing Data. Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc.

Univariate Descriptive Statistics

Descriptive Statistics II. Graphical summary of the distribution of a numerical variable. Boxplot

Frequency Distribution and Graphs

Symmetric (Mean and Standard Deviation)

DESCRIBING DATA. Frequency Tables, Frequency Distributions, and Graphic Presentation

Displaying Distributions with Graphs

Chapter 10. Definition: Categorical Variables. Graphs, Good and Bad. Distribution

Section 1: Data (Major Concept Review)

Describing Data Visually. Describing Data Visually. Describing Data Visually 9/28/12. Applied Statistics in Business & Economics, 4 th edition

Chapter Displaying Graphical Data. Frequency Distribution Example. Graphical Methods for Describing Data. Vision Correction Frequency Relative

Review. In an experiment, there is one variable that is of primary interest. There are several other factors, which may affect the measured result.

Chapter 2. Organizing Data. Slide 2-2. Copyright 2012, 2008, 2005 Pearson Education, Inc.

LESSON 2: FREQUENCY DISTRIBUTION

Outline. Drawing the Graph. 1 Homework Review. 2 Introduction. 3 Histograms. 4 Histograms on the TI Assignment

Elementary Statistics. Graphing Data

Chapter 1. Statistics. Individuals and Variables. Basic Practice of Statistics - 3rd Edition. Chapter 1 1. Picturing Distributions with Graphs

CHAPTER 13A. Normal Distributions

Organizing Data 10/11/2011. Focus Points. Frequency Distributions, Histograms, and Related Topics. Section 2.1

Chapter 1. Picturing Distributions with Graphs

Notes 5C: Statistical Tables and Graphs

10 Wyner Statistics Fall 2013

The numbers are...,..., ...,...,...,...,...,

Chapter 2 Frequency Distributions and Graphs

To describe the centre and spread of a univariate data set by way of a 5-figure summary and visually by a box & whisker plot.

Data About Us Practice Answers

Female Height. Height (inches)

Business Statistics:

Statistics for Managers using Microsoft Excel 3 rd Edition

Exploring Data Patterns. Run Charts, Frequency Tables, Histograms, Box Plots

She concludes that the dice is biased because she expected to get only one 6. Do you agree with June's conclusion? Briefly justify your answer.

Variables. Lecture 13 Sections Wed, Sep 16, Hampden-Sydney College. Displaying Distributions - Quantitative.

Lecture 2: Chapter 2

SAMPLE. This chapter deals with the construction and interpretation of box plots. At the end of this chapter you should be able to:

Probability WS 1 Counting , , , a)625 b)1050c) a)20358,520 b) 1716 c) 55,770

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 2.2- #

0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 2, 2, 2, 2, 3, 3, 3, 3, 4, 5, 8

Lecture 5 Understanding and Comparing Distributions

Chapter 2. The Excel functions, Excel Analysis ToolPak Add-ins or Excel PHStat2 Add-ins needed to create frequency distributions are:

Name: Date: Period: Histogram Worksheet

11 Wyner Statistics Fall 2018

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. B) Blood type Frequency

Chapter 2. Describing Distributions with Numbers. BPS - 5th Ed. Chapter 2 1

(Notice that the mean doesn t have to be a whole number and isn t normally part of the original set of data.)

Chapter 6: Descriptive Statistics

PSY 307 Statistics for the Behavioral Sciences. Chapter 2 Describing Data with Tables and Graphs

MDM4U Some Review Questions

Describing Data: Frequency Tables, Frequency Distributions, and Graphic Presentation. Chapter 2

Sidcot intranet - Firefly. Useful links: Instant classroom. MyMaths. Objectives

Math 247: Continuous Random Variables: The Uniform Distribution (Section 6.1) and The Normal Distribution (Section 6.2)

Going back to the definition of Biostatistics. Organizing and Presenting Data. Learning Objectives. Nominal Data 10/10/2016. Tabulation and Graphs

Describing Data. Presenting Categorical Data Graphically. Describing Data 143

Interval of Head Circumferences (mm) XS 510 < 530 S 530 < 550 M 550 < 570 L 570 < 590 XL 590 < 610 XXL 610 < 630. Hat Sizes.

1.1 Displaying Distributions with Graphs, Continued

This Chapter s Topics

EE EXPERIMENT 3 RESISTIVE NETWORKS AND COMPUTATIONAL ANALYSIS INTRODUCTION

ESSENTIAL MATHEMATICS 1 WEEK 17 NOTES AND EXERCISES. Types of Graphs. Bar Graphs

Left skewed because it is stretched to the left side. Lesson 5: Box Plots. Lesson 5

1.3 Density Curves and Normal Distributions

A Visual Display. A graph is a visual display of information or data. This is a graph that shows a girl walking her dog. Communicating with Graphs

A C E. Answers Investigation 3. Applications. Sample 2: 11 moves. or 0.44; MAD Sample 2: 22. , or 2.44; MAD Sample 3: 0, or 0.

i. Are the shapes of the two distributions fundamentally alike or fundamentally different?

TJP TOP TIPS FOR IGCSE STATS & PROBABILITY

1.3 Density Curves and Normal Distributions

Confidence Intervals. Class 23. November 29, 2011

PASS Sample Size Software

What Is a Histogram? A bar graph that shows the distribution of data A snapshot of data taken from a process HISTOGRAM VIEWGRAPH 1

Introduction. Descriptive Statistics. Problem Solving. Inferential Statistics. Chapter1 Slides. Maurice Geraghty

Principles of Graphical Excellence Best Paper: ALAIR April 5 6, 2001 AIR: June 2-5, 2002, Toronto Focus-IR, February 21, 2003

NCSS Statistical Software

1.3 Density Curves and Normal Distributions. Ulrich Hoensch MAT210 Rocky Mountain College Billings, MT 59102

BE540 - Introduction to Biostatistics Computer Illustration. Topic 1 Summarizing Data Software: STATA. A Visit to Yellowstone National Park, USA

2. The value of the middle term in a ranked data set is called: A) the mean B) the standard deviation C) the mode D) the median

Stat 20: Intro to Probability and Statistics

Business Statistics. Lecture 2: Descriptive Statistical Graphs and Plots

2.2 More on Normal Distributions and Standard Normal Calculations

Statistics. Graphing Statistics & Data. What is Data?. Data is organized information. It can be numbers, words, measurements,

Chapter Test Form A. mean median mode. 187 Holt Algebra 1. Name Date Class. Select the best answer.

Mean for population data: x = the sum of all values. N = the population size n = the sample size, µ = the population mean. x = the sample mean

Learning Log Title: CHAPTER 2: ARITHMETIC STRATEGIES AND AREA. Date: Lesson: Chapter 2: Arithmetic Strategies and Area

Office 2016 Excel Basics 24 Video/Class Project #36 Excel Basics 24: Visualize Quantitative Data with Excel Charts. No Chart Junk!!!

HPS Scope Sequence Last Revised June SUBJECT: Math GRADE: 7. Michigan Standard (GLCE) Code & Language. What this Standard means:

Tables and Figures. Germination rates were significantly higher after 24 h in running water than in controls (Fig. 4).

Chapter 2 Descriptive Statistics: Tabular and Graphical Methods

Data Analysis and Probability

HOMEWORK 3 Due: next class 2/3

Learning Objectives. Describing Data: Displaying and Exploring Data. Dot Plot. Dot Plot 12/9/2015

Purpose. Charts and graphs. create a visual representation of the data. make the spreadsheet information easier to understand.

Describing Data: Displaying and Exploring Data. Chapter 4

MAT Mathematics in Today's World

Transcription:

Section 1.5 Graphs and Describing Distributions Data can be displayed using graphs. Some of the most common graphs used in statistics are: Bar graph Pie Chart Dot plot Histogram Stem and leaf plot Box plot Cumulative Frequency plot The question is, what type of graph would be best for our data? For categorical data, bar graphs and pie charts are the best representations. A bar graph used to compare the sizes of different groups. They are created by listing the categorical data along the x-axis and the frequencies along the y-axis. Bars are drawn above each categorical variable and the height of each bar indicates the size of the respective group. R command: barplot( ) A pie chart is a circular chart, divided into sectors, indicating the proportion of each data value compared to the entire set of values. To create one, percentage values for each category must be calculated first. R command: pie( ) Example 1: Twenty-five students were asked on a survey to give the highest educational degree of their parents. The results are given in the table below. Education Level Number of Students Associate s Degree 2 Bachelor s Degree 14 Master s Degree 7 Doctorate degree 2 Total 25 Create a bar graph. Commands: degree=c(2,14,7,2) barplot(degree, names.arg=c( A, B, M, D )) Section 1.5 Graphs and Describing Distributions 1

Dot Plot, Histogram, Stem and Leaf Plot, and Box Plot Quantitative data can be displayed using any of the other types of graphs. Which one we choose usually depends on how much data we have. Example 2: Height measurements for a group of people were taken. The results are recorded below (in inches): 66, 68, 63, 71, 68, 69, 65, 70, 73, 67, 62, 59, 63, 68, 71, 63, 63, 60, 64, 66, 58 We will organize this data using different graphs: A dot plot is a graph consisting of points plotted on simple scale. To create one, simply place dots above the values listed on a number line. The number of dots above each value is determined by how many times that respective value appears in the data set. R command: stripchart( ) a. Create a dot plot for the data. If doing by hand, order the data first. Commands: heights=c(66,68,63,71,68,69,65,70,73,67,62,59,63,68,71,63,63,60,64,66,58) stripchart(heights,method= stack,pch=16,offset=1) Section 1.5 Graphs and Describing Distributions 2

A stem and leaf plot shows exact values of individual observations. The digits in the largest place are referred to as the stem and the digits in the smallest place are referred to as the leaf (leaves). The leaves are displayed to the right of the stem. R command: stem( ) b. Create a stem and leaf plot for the data. If doing by hand, order the data first. Command: stem(heights) What is the median of this stem leaf plot? Try another: What is the median of stem leaf plot? Histograms show how observations are distributed across groups, but do not show the exact values of individual observations. To create one, first divide the data into classes, or bins, of equal width. Next, count the number of observations in each class. The horizontal axis will represent the variable values and the vertical axis will represent your frequency or your relative frequency. R command: hist( ) c. Create a histogram for the data. Command: hist(heights) Section 1.5 Graphs and Describing Distributions 3

Boxplots or sometimes called Box and Whisker Plots not only help identify features about our data quickly (such as spread and location of center) but can be very helpful when comparing data sets. They display patterns of quantitative data. To create one, find the five number summary. Plot each number and draw a box starting at Q1 to Q3. Then draw a line segment within the box to represent the median. Connect the min and max to the box with line segments. This is the boxplot. If data contains outliers, a box and whiskers plot can be used instead to display the data. In a box and whiskers plot, the outliers are displayed with dots above the value and the segments begin (or end) at the next data value within the outlier interval. R command: boxplot( ) d. Create a boxplot for the data. Command: boxplot(heights,horizontal=true) Section 1.5 Graphs and Describing Distributions 4

Example 3: Use the following boxplots to determine which of the following statements about these data sets CANNOT be justified? a. The interquartile range of data set I is equal to the interquartile range of data set II. b. The range of data set I is greater than the range of data set II. c. Data set I and data set II have the same number of data points. Example 4: The weights of male and female students in a class are summarized in the following boxplots. Which of the following is NOT correct? a. The male students have less variability than the female students. b. About 50% of the male students have weights between 150 and 185 lbs. c. The mean weight of the female students is about 120 because of symmetry. Section 1.5 Graphs and Describing Distributions 5

Cumulative Frequency Plot Sometimes we are interested in the relative position of observation values, i.e. finding the percentile from a graph. A cumulative frequency plot of the percentages (also called an ogive) can be used to view the total number of events that occurred up to a certain value. Percentiles can be easily viewed using this type of graph. Example 5: Heights (in inches) of a set of basketball players are indicated below in the cumulative frequency plot. a. What is the median? b. What is the interquartile range? Example 6: The figure below shows a cumulative relative frequency plot of 40 scores on a test given in a Statistics class. Which of the following conclusions can be made from the graph? a. If the passing score is 70, most students did not pass the test. b. Sixty percent of the students had a test score above 80. Section 1.5 Graphs and Describing Distributions 6

It is very helpful in statistics to understand the summary features of quantitative variables by looking at their graphs. Center, spread and shape are the most important features to note. There are many patterns that our distribution can follow. Patterns and Shapes Uniform all data has the same height on the graph. Symmetric same shape to the left and right of the center. The first graph is a bell shaped curve. The mean and median will be equal in a bell shaped distribution. Skewed Right (or positive skew) longer tail on the right side. The mean will be larger than the median in a skewed right distribution. Skewed Left (or negative skew) longer tail on the left side. The mean will be smaller than the median in a skewed left distribution. Section 1.5 Graphs and Describing Distributions 7

Example 7: Given the following stem and leaf plot 1 0 2 2 3 5 7 8 9 2 1 1 2 3 4 3 2 4 5 9 a. Is the data skewed to the left or to the right? What can be said about its mean? b. Any outliers? We can verify this by modeling Examples 2 from Section 1.4! Section 1.5 Graphs and Describing Distributions 8