PSY 307 Statistics for the Behavioral Sciences. Chapter 2 Describing Data with Tables and Graphs

Similar documents
Describing Data Visually. Describing Data Visually. Describing Data Visually 9/28/12. Applied Statistics in Business & Economics, 4 th edition

Chapter 3. Graphical Methods for Describing Data. Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc.

Chapter 2 Frequency Distributions and Graphs

Chapter 4. September 08, appstats 4B.notebook. Displaying Quantitative Data. Aug 4 9:13 AM. Aug 4 9:13 AM. Aug 27 10:16 PM.

Chapter 10. Definition: Categorical Variables. Graphs, Good and Bad. Distribution

Section 1.5 Graphs and Describing Distributions

DESCRIBING DATA. Frequency Tables, Frequency Distributions, and Graphic Presentation

CRC Association Conference

Numerical: Data with quantity Discrete: whole number answers Example: How many siblings do you have?

Chapter Displaying Graphical Data. Frequency Distribution Example. Graphical Methods for Describing Data. Vision Correction Frequency Relative

Measurement Statistics, Histograms and Trend Plot Analysis Modes

WOODWORKING TECHNOLOGY IN EUROPE: HIGHLIGHTS European Federation of Woodworking Technology Manufacturers

Outline. Drawing the Graph. 1 Homework Review. 2 Introduction. 3 Histograms. 4 Histograms on the TI Assignment

Chpt 2. Frequency Distributions and Graphs. 2-3 Histograms, Frequency Polygons, Ogives / 35

Economic Outlook for 2016

Notes: Displaying Quantitative Data

Public Private Partnerships & Idea selection

Notes 5C: Statistical Tables and Graphs

Who Reads and Who Follows? What analytics tell us about the audience of academic blogging Chris Prosser Politics in

34 th Annual PURC Conference New Technologies and Services in Telecommunications: Effects on Competition and Investment.

Elementary Statistics. Graphing Data

Welcome to the IFR Press Conference 30 August 2012, Taipei

OECD s Innovation Strategy: Key Findings and Policy Messages

Chem & Bio non-proliferation

Frame through-beam sensors

Regulatory status for using RFID in the UHF spectrum 3 May 2006

OECD Science, Technology and Industry Outlook 2008: Highlights

Economic and Social Council

Innovation policy mixes and implications on HEIs - emerging conclusions from the OECD innovation policy reviews

THE ECONOMICS OF DATA-DRIVEN INNOVATION

Chapter 4 Displaying and Describing Quantitative Data

Understanding Knowledge Societies Report of UNDESA/DPADM. Measurement Aspects. Irene Tinagli Tunis, 17 Nov World Summit on Information Society

1.1 Displaying Distributions with Graphs, Continued

Innovation in Europe: Where s it going? How does it happen? Stephen Roper Aston Business School, Birmingham, UK

RECOVERED PAPER DATA

English - Or. English NUCLEAR ENERGY AGENCY COMMITTEE ON THE SAFETY OF NUCLEAR INSTALLATIONS FINAL REPORT AND ANSWERS TO QUESTIONNAIRE

Does exposure to university research matter to high-potential entrepreneurship?

OECD Science, Technology and Industry Outlook 2010 Highlights

Through-beam ring sensors

Remote participation in Question sessions Audio options VoIP

How big is China s Digital Economy

OBN BioTuesday: Sources of Public Non-Dilutable Funding & Export Support to UK R&D Companies

Section 1: Data (Major Concept Review)

Business Statistics:

2018/2019 HCT Transition Period OFFICIAL COMPETITION RULES

CDP-EIF ITAtech Equity Platform

Frequency Distribution and Graphs

LESSON 2: FREQUENCY DISTRIBUTION

Chapter 4. Displaying and Summarizing Quantitative Data. Copyright 2012, 2008, 2005 Pearson Education, Inc.

the Reinsurance Mechanism

Table of Contents Executive Summary 29

STK110. Chapter 2: Tabular and Graphical Methods Lecture 1 of 2. ritakeller.com. mathspig.wordpress.com

Rebuilding for the Community in New Orleans

Getting to Equal, 2016

Stress Calibration for Jitter >1UI A Practical Method

Review. In an experiment, there is one variable that is of primary interest. There are several other factors, which may affect the measured result.

Displaying Distributions with Graphs

stripax The professional stripping tool

ISTEAC. Paper Recycling. LABORATORY OF HEAT TRANSFER and ENVIRONMENTAL ENGINEERING

The compact test- disconnect terminal interface system for protection and secondary technology

Low Capacitance Probes Minimize Impact on Circuit Operation

Story Excitement Social Compete Escapism Challenge Creation Achieve Curiosity Fantasy Hilgard et al. (2013) Rigby et al. (2006) Bartle (1996) Sherry e

Fundamentals of AC Power Measurements

Verifying Power Supply Sequencing with an 8-Channel Oscilloscope APPLICATION NOTE

Statistics for Managers using Microsoft Excel 3 rd Edition

Monthly Summary of Troop Contribution to UN Operations

Statistics. Graphing Statistics & Data. What is Data?. Data is organized information. It can be numbers, words, measurements,

Automated Frequency Response Measurement with AFG31000, MDO3000 and TekBench Instrument Control Software APPLICATION NOTE

Financing SMEs and Entrepreneurs 2012

NFC Forum: The Evolution of a Consortium

OECD/ADBI 7th Round Table on Capital Market Reform in Asia October 2005 ADB Institute, Tokyo, Japan

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 2.2- #

BenchTop Extraction Arms with unbeatable flexibility

OECD Innovation Strategy: Developing an Innovation Policy for the 21st Century

10 Wyner Statistics Fall 2013

Sections Descriptive Statistics for Numerical Variables

Patented Medicine Prices Review Board P M P R B GUIDELINES REFORM. 15 th Annual Market Access Summit. Douglas Clark Executive Director PMPRB

Simplifying DC-DC Converter Characterization using a 2600B System SourceMeter SMU Instrument and MSO/DPO5000 or DPO7000 Series Scope APPLICATION NOTE

EU Ecolabel EMAS Environmental Technology Verification (ETV) State-of-play and evaluations

AP Statistics Composition Book Review Chapters 1 2

Japan s Leading Exhibition for Robotics Technologies Jan. 17[Wed]-19[Fri], 2018 Tokyo Big Sight, Japan

National Census Geography Some lessons learned and future challenges in European countries

Chapter 2. Organizing Data. Slide 2-2. Copyright 2012, 2008, 2005 Pearson Education, Inc.

NI sbrio-9632/9642 Verification Procedure

Economic crisis, European Welfare State Models and Inequality

Describing Data: Frequency Tables, Frequency Distributions, and Graphic Presentation. Chapter 2

CISCO ONS /100-GHZ INTERLEAVER/DE-INTERLEAVER FOR THE CISCO ONS MULTISERVICE TRANSPORT PLATFORM

PO01275C Tabor East Neighborhood Meeting. Monday, April 20, :30 PM 8:30 PM

Munkaanyag

Time and Frequency Measurements for Oscillator Manufacturers

AC Line Rated Disc Capacitors Class X1, 400 VAC/Class Y1, 500 VAC

Central and Eastern Europe Statistics 2005

POWERING AMERICA S AND NEVADA S ADVANCED INDUSTRIES

Maintaining the Argo bibliographies. Megan Scanderbeg

Getting Started with the LabVIEW DSP Module

SR&ED International R&D Tax Credit Strategies

Process-Controller HPP-25

WORLD INTELLECTUAL PROPERTY ORGANIZATION. WIPO PATENT REPORT Statistics on Worldwide Patent Activities

Power Measurement and Analysis Software

Using the Ramp Rate Method for Making Quasistatic C-V Measurements with the 4200A-SCS Parameter Analyzer APPLICATION NOTE

Transcription:

PSY 307 Statistics for the Behavioral Sciences Chapter 2 Describing Data with Tables and Graphs

Class Progress To-Date Math Readiness Descriptives Midterm next Monday

Frequency Distributions One of the simplest forms of measurement is counting How many people show a characteristic, have a given value or are members of a category. Frequency distributions count how many observations exist for each value for a particular variable.

Frequency Table A frequency table is a collection of observations: Sorted into classes Showing the frequency for each class. A class is a group of observations. When each class consists of a single observation, the data is considered to be ungrouped.

Creating a Table List the possible values. Count how many observations exist for each possible value. One way to do this is using hash-marks and crossing off each value. Figure out the corresponding percent for each class by dividing each frequency by the total scores.

Unorganized Data 1, 5, 3, 3, 6, 2, 1, 5, 2, 1, 2, 6, 3, 4, 1, 6, 2, 4, 4, 2 A set of observations like this is difficult to find patterns in or interpret.

Example

When to Create Groups Grouping is a convenience that makes it easier for people to understand the data. Ungrouped data should have <20 possible values or classes (not <20 scores, cases or observations). Identities of individual observations are lost when groups are created.

Guidelines for Grouping See pgs 29-30 in text. Each observation should be included in one and only one class. List all classes, even those with 0 frequency (no observations). All classes with upper & lower boundaries should be equal in width.

Optional Guidelines All classes should have an upper and lower boundary. Open-ended classes do occur. Select an interval (width) that is natural to think about: 5 or 10 are convenient, 13 is not The lower boundary should be a multiple of class width (245-249). Aim for a total of about 10 classes.

Gaps Between Classes With continuous data, there is an implied gap between where one boundary ends and the other starts. The size of the gap equals one unit of measurement the smallest possible difference between scores. That way no observations can ever fall within that gap. Class sizes account for this.

Relative Frequency Relative frequency frequency of each class as a fraction (%) of the total frequency for the distribution. Relative frequency lets you compare two distributions of different sizes. Obtain the fraction by dividing the frequency for each group by the total frequency Total = 1.00 (100%)

Example 4/20 =.20 or 20% 5/20 =.25 or 25% 3/20 =.15 or 15% 3/20 =.15 or 15% 2/20 =.10 or 10% 3/20 =.15 or 15% Total = 20 Total = 1.0 or 100%

Cumulative Frequency Cumulative frequency the total number of observations in a class plus all lower-ranked classes. Used to compare relative standing of individual scores within two distributions. Add the frequency of each class to the frequencies of those below it.

Relative Frequency (Percent) and Cumulative Frequency

Cumulative Proportion (Percent) The cumulative proportion or percent is the relative cumulative frequency. Percent = proportion x 100 It allows comparison of cumulative frequencies across two distributions. To obtain cumulative proportions divide the cumulative frequency by the total frequency for each class. Highest class = 1.00 (100%)

Percentile Ranks Percentile rank percent of observations with the same or lower values than a given observation. Find the score, then use the cumulative percent as the percentile rank: Exact ranks can be found from ungrouped data. Only approximate ranks can be found from grouped data.

Qualitative Data Some categories are ordered (can be placed in a meaningful order): Military ranks, levels of schooling (elementary, high school, college) Frequencies can be converted to relative frequencies. Cumulative frequencies only make sense for ordered categories.

Interpreting Tables First read the title, column headings and any footnotes. Where do the data come from, source? Next, consider whether the table is well-constructed does it follow the grouping guidelines. Finally, look at the data and think about whether it makes sense. Focus on overall trends, not details.

Parts of a Graph

Constructing Graphs Select the type of graph. Place groups on the x-axis. Place frequency on the y-axis. Values for the groups and frequencies depend on the data. Label the axes and give a title to the graph.

Histograms For quantitative data only. Equal units across x axis represent groups. Equal units across y axis represent frequency. Use wiggly line to show breaks in the scale. Bars are adjacent no gaps.

Histogram Applets http://www.stat.sc.edu/~west/javahtml/histogram.html Uses Old Faithful geyser data http://www.shodor.org/interactivate/activities/histogram/?version=1.6.0_11&browser=msie&vendor=sun_microsys tems_inc. Uses math SAT data Notice that bin width refers to class or interval size. SPSS automatically creates classes or intervals.

Frequency Polygons Also called a line graph. A histogram can be converted to a frequency polygon by connecting the midpoints of the bars. Anchor the line to the x axis at beginning and end of distribution. Two frequency polygons can be superimposed for comparison.

Number of Employees Number of Employees Number of Employees Creating a Line Graph from a Histogram 7 6 5 4 3 2 1 0 Histogram 0 2 4 6 8 10 12 Length of Service (years) 8 7 6 5 4 3 2 1 0 Frequency Polygon 0 2 4 6 8 10 12 Years of Service 7 6 5 4 3 2 1 0 Histogram 0 2 4 6 8 10 12 Length of S ervice (years)

Stem-and-Leaf Displays Constructing a display: Notice the highest and lowest 10s Arrange 10s in ascending order. Copy right-hand digits as leaves. The resulting display resembles a frequency histogram. Stems are whatever digits make sense to use.

Sample Stem and leaf display showing the number of passing touchdowns. 3 2337 2 001112223889 1 2244456888899

Purpose of Frequency Graphs In statistics, we are interested in the shapes of distributions because they tell us what statistics to use. They let us identify outliers that might distort the statistics we will be using. They present data so that readers can quickly and easily grasp its meaning.

Shapes of Distributions Normal bell-shaped and symmetrical. Bimodal two peaks. Suggests presence of two different types of observations in the same data. Positively skewed lopsided due to extreme observations in right tail. Negatively skewed extreme observations in left tail.

Shapes of Graphs bimodal normal positive skew negative skew

Heavy vs Light-tailed Distributions Heavy-tailed a distribution with more observations in its tails. Light-tailed a distribution with fewer observations in its tails and more in the center. Kurtosis a statistic that measures the shape of the distribution and the size of the tails.

Other Kinds of Graphs Frequency is not the only measure that can be displayed on the y-axis. We are using a graph to explore the shape of a distribution in this chapter. Usually the y-axis shows the dependent variable while the x-axis shows groups (independent variable). Graphs can be visually interesting!

Graphs Allow Visual Comparisons

The Best Graph Ever Drawn Source: http://strangemaps.wordpress.com/

Details About the Graph The map was the work of Charles Joseph Minard (1781-1870), a French civil engineer who was an inspectorgeneral of bridges and roads, but whose most remembered legacy is in the field of statistical graphics The chart, or statistical graphic, is also a map. And a strange one at that. It depicts the advance into (1812) and retreat from (1813) Russia by Napoleon s Grande Armée, which was decimated by a combination of the Russian winter, the Russian army and its scorched-earth tactics. To my knowledge, this is the origin of the term scorched earth the retreating Russians burnt anything that might feed or shelter the French, thereby severely weakening Napoleon s army. It unites temperature, time, geography and number of soldiers, all in one picture.

A Modern Version

Qualitative Data Bar graphs similar to histograms. Bars do not touch. Categorical groups are on x-axis. Pie charts Where tax money goes.

Misleading Graphs Bars should be equal widths Bars should be two-dimensional, not three-dimensional When the lower bound of the y-axis (frequency) is cut-off (not 0), the differences are exaggerated. Height and width of the graph should be approximately equal.

Graphs are Used to Persuade Reagan Bush Clinton Bush

Gallup s Terry Schiavo Poll

Who Increased the Debt? This chart is misleading because it includes social security as debt. If expressed as a % of public debt, Bush & Obama would be tied around 60-70%. Obama would look 4 times worse than Bush and twice as bad as Reagan if this were expressed as a % of income (GDP).

Misleading Tables Average score, reading literacy, PISA, 2009: Korea 539 Finland 536 Canada 524 New Zealand 521 Japan 520 Australia 515 Netherlands 508 Belgium 506 Norway 503 Estonia 501 Switzerland 501 Poland 500 Iceland 500 United States 500 Sweden 497 Germany 497 Ireland 496 France 496 Denmark 495 United Kingdom 494 Hungary 494 OECD average 493 Portugal 489 Italy 486 Slovenia 483 Greece 483 Spain 481 Czech Republic 478 Slovak Republic 477 Israel 474 Luxembourg 472 Austria 470 Turkey 464 Chile 449 Mexico 425

How Big are Crime Rates? Source: http://www.npr.org/templates/story/story.php? storyid=5480227

How Many Groups (Categories)? This graph is misleading because income above 200k is broken into many sub-categories, making the 100-200k group look larger than higher income groups. How it would look if redrawn.

Comparing Scales (OK)

Misleading Scales The range of the scales for these two variables are too different to be compared visually without being misleading. The crossover point at 2004 disappears when the same range is used on both scales of the graph.

More Misleading Graphs http://www.coolschool.ca/lor/ama1 1/unit1/U01L02.htm