Hierarchical clustering
|
|
- Archibald Holt
- 6 years ago
- Views:
Transcription
1 Hierarchical clustering François Husson Applied Mathematics Department - Rennes Agrocampus husson@agrocampus-ouest.fr 1 / 42
2 Hierarchical clustering 1 Introduction 2 Principles of hierarchical clustering 3 Example 4 K-means : a partitioning algorithm 5 Extras Making more robust partitions Clustering in high dimensions Qualitative variables and clustering Combining with factor analysis - clustering 6 Describing classes of individuals 1 / 42
3 Hierarchical clustering 1 Introduction 2 Principles of hierarchical clustering 3 Example 4 Partitioning algorithm : K-means 5 Extras 6 Characterizing classes of individuals 2 / 42
4 Introduction Definitions : Clustering is : making or building classes Class : set of individuals (or objects) with similar shared characteristics Examples of clustering : animal kingdom, computer hard disk, geographic division of France, etc. of classes : social classes, political classes, etc. Two types of clustering : hierarchical : tree partitioning methods 3 / 42
5 Hierarchical example : the animal kingdom 4 / 42
6 Hierarchical clustering 1 Introduction 2 Principles of hierarchical clustering 3 Example 4 Partitioning algorithm : K-means 5 Extras 6 Characterizing classes of individuals 5 / 42
7 A C B D E F H What data? What goals? Clustering is for data tables : rows of individuals, columns of quantitative variables Goals : build a tree structure that : shows hierarchical links between individuals or groups of individuals detects a natural number of classes in the population G 6 / 42
8 Critères Measuring similarity of individuals : Euclidean distance similarity indices etc. Similarity between groups of individuals : minimum jump or single linkage (smallest distance) complete linkage (largest distance) Ward criterion x x x x x x x x x x x x x x 7 / 42
9 Algorithm 7 th grouping 6 th grouping 5 th grouping 4 th grouping 3 rd grouping ABC DEFGH 4.07 ABC DE DE 4.72 FGH ABC DE FG DE 4.72 FG H ABC D E FG D 4.72 E FG H ABC D E F G D 4.72 E F G H A C B D E F G H {ABCDEFGH} {ABC},{DEFGH} {ABC},{DE},{FGH} {ABC},{DE},{FG},{H} {ABC},{D},{E},{FG},{H} {ABC},{D},{E},{F},{G},{H} {AC},{B},{D},{E},{F},{G},{H} {A},{B},{C},{D},{E},{F},{G},{H} 2 nd grouping AC B D E F G B 0.50 D E F G H A B C D E F G B 0.50 C st grouping D E F G H / 42
10 Trees and partitions Hierarchical Clustering Trees always end up... cut through! Click to cut the tree inertia gain Choosing a height to cut at gives a partition Casarsa Parkhomenko YURKOV Lorenzo NOOL BOURGUIGNON MARTINEAU Karlivans BARRAS Uldal HERNU Turi Karpov Clay Sebrle Schoenbeck Ojaniemi Barras Qi Smirnov Gomez Zsivoczky Macey Smith McMULLEN Bernard ZSIVOCZKY Hernu KARPOV SEBRLE Terek Pogorelov Korkizoglou CLAY BERNARD Nool Warners Drews WARNERS Schwarzl Averyanov Remark : given how it was made, the partition is interesting but not optimal 9 / 42
11 Partition quality When is a partition a good one? If individuals placed in the same class are close to each other If individuals in different classes are far from each other Mathematically speaking? small within-class variability large between-class variability = Two criteria. Which one to use? 10 / 42
12 Partition quality x k the mean of the x k, x qk the mean of the x k in class q K Q k=1 q=1 i=1 I (x iqk x k ) 2 = } {{ } total inertia K Q I K Q (x iqk x qk ) 2 + k=1 q=1 i=1 } {{ } within-class inertia k=1 q=1 i=1 I ( x qk x k ) 2 } {{ } between-class inertia x 2 x x x 1 x 3 = 1 criterion only! 11 / 42
13 Partition quality is measured by : Partition quality 0 between-class inertia total inertia 1 inertia between inertia total = 0 = k, q, x qk = x k by variable, classes have the same means Doesn t allow us to classify inertia between inertia total = 1 = k, q, i, x iqk = x qk individuals in the same class are identical Ideal for classifying Warning : don t just accept this criteria at face value : it depends on the number of individuals and classes 12 / 42
14 Ward s method Initialize : 1 class = 1 individual = Between-class inertia = total inertia At each step : combine classes a and b that minimize the decrease in between-class inertia Inertia(a) + Inertia(b) = Inertia(a b) Group together objects with small weights and avoid chain effects xx x x x x x x Saut minimum Ward m am b m a + m b d 2 (a, b) } {{ } to minimize Group together classes with similar centers of gravity xx x x x x *************************** x x Saut minimum Ward Direct use for clustering / 42
15 Hierarchical clustering 1 Introduction 2 Principles of hierarchical clustering 3 Example 4 Partitioning algorithm : K-means 5 Extras 6 Characterizing classes of individuals 14 / 42
16 Temperature data 23 individuals : European capitals 12 variables : mean monthly temperatures over 30 years Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec Area Amsterdam West Athens South Berlin West Brussels West Budapest East Copenhagen North Dublin North Elsinki North Kiev East Krakow East Lisbon South London North Madrid South Minsk East Moscow East Oslo North Paris West Prague East Reykjavik North Rome South Sarajevo South Sofia East Stockholm North Which cities have similar weather patterns? How to characterize groups of cities? 15 / 42
17 Temperature data : hierarchical tree Hierarchical clustering Cluster Dendrogram Athens Lisbon Madrid Rome Reykjavik Moscow Minsk Elsinki Oslo Stockholm Dublin Paris London Amsterdam Brussels Copenhagen Kiev Krakow inertia gain Budapest Berlin Prague Sarajevo Sofia 16 / 42
18 Temperature data Loss in between-inertia when going from 23 clusters to 22 clusters: clusters to 21 clusters: clusters to 20 clusters: clusters to 8 clusters: clusters to 7 clusters: clusters to 6 clusters: clusters to 5 clusters: clusters to 4 clusters: clusters to 3 clusters: clusters to 2 clusters: clusters to 1 clusters: 6.76 Sum of losses of inertia = inertia gain Important loss when going from2 clusters to a unique cluster thuswepreferto keep 2 custers 17 / 42
19 Using the tree to build a partition Should we make 2 groups? 3? 4? Cut into 2 groups : between-class inertia total inertia What can we compare this percentage with? = = 56% Athens Lisbon Madrid Rome Reykjavik Moscow Minsk Hierarchical clustering Elsinki Oslo Cluster Dendrogram Stockholm Dublin Paris London Amsterdam Brussels Copenhagen Kiev Krakow Budapest Berlin Prague Sarajevo Sofia 18 / 42
20 Using the tree to build a partition 66 % of the information is contained in this 2-class cut What can we compare this percentage with? Dim 2 (15.40%) Moscow Kiev Budapest Minsk Krakow Sofia Elsinki Oslo Prague Sarajevo Stockholm Berlin Copenhagen Paris Brussels London Amsterdam Madrid Rome Lisbon Athens Reykjavik Dublin Dim 1 (82.90%) 19 / 42
21 Using the tree to build a partition Athens Lisbon Madrid Rome Reykjavik Moscow Minsk Hierarchical clustering Elsinki Oslo Cluster Dendrogram Stockholm Dublin Paris London Amsterdam Brussels Copenhagen Kiev Krakow Budapest Berlin Prague Sarajevo Sofia Separate cold cities into 2 groups : between-class inertia total inertia = = 20% 19 / 42
22 Using the tree to build a partition The move from 23 cities to 3 classes : 56 % + 20 % = 76 % of the variability in the data Dim 2 (15.40%) Moscow Kiev Minsk Krakow Elsinki Oslo Sarajevo Berlin Stockholm Copenhagen Brussels London Budapest Prague Sofia Paris Amsterdam Madrid Rome Lisbon Athens Reykjavik Dublin Dim 1 (82.90%) 20 / 42
23 Determining the number of classes Starting from the tree Depends on the use (survey, etc.) Athens Lisbon Madrid Rome Reykjavik Moscow Hierarchical clustering Minsk Elsinki Cluster Dendrogram Oslo Stockholm Dublin Paris London Amsterdam Brussels Copenhagen Kiev Krakow Budapest Berlin Prague Sarajevo Sofia Plot with the bars Ultimate criterion : interpretability of the classes inertia gain 20 / 42
24 Hierarchical clustering 1 Introduction 2 Principles of hierarchical clustering 3 Example 4 Partitioning algorithm : K-means 5 Extras 6 Characterizing classes of individuals 21 / 42
25 Reykjavik Reykjavik Reykjavik Moscow Minsk Elsinki Oslo Stockholm Kiev Krakow Prague Sofia Sarajevo Berlin Budapest Copenhagen Paris Brussels Amsterdam London Dublin Madrid Rome Lisbon Athens Moscow Minsk Elsinki Oslo Stockholm Kiev Krakow Prague Sofia Sarajevo Berlin Dim 1 ( 82.9 %) Budapest Copenhagen Paris Brussels Amsterdam London Dublin Moscow Minsk Elsinki Oslo Stockholm Kiev Krakow Prague Sofia Sarajevo Berlin Dim 1 ( 82.9 %) Copenhagen Paris Brussels Amsterdam London Dublin Budapest Madrid Rome Lisbon Athens Dim 1 ( 82.9 %) Madrid Rome Lisbon Athens Reykjavik Reykjavik Reykjavik Moscow Minsk Elsinki Oslo Stockholm Kiev Krakow Prague Sofia Sarajevo Berlin Budapest Copenhagen Paris Brussels Amsterdam London Dublin Moscow Minsk Elsinki Oslo Stockholm Kiev Krakow Prague Sofia Sarajevo Berlin Dim 1 ( 82.9 %) Budapest Copenhagen Paris Brussels Amsterdam London Dublin Madrid Rome Lisbon Athens Moscow Minsk Elsinki Oslo Stockholm Kiev Krakow Prague Sofia Sarajevo Berlin Dim 1 ( 82.9 %) Copenhagen Paris Brussels Amsterdam London Dublin Budapest Madrid Rome Lisbon Athens Dim 1 ( 82.9 %) Madrid Rome Lisbon Athens Reykjavik Reykjavik Reykjavik Moscow Minsk Elsinki Oslo Stockholm Kiev Krakow Prague Sofia Sarajevo Berlin Budapest Copenhagen Paris Brussels Amsterdam London Dublin Madrid Rome Moscow Minsk Elsinki Oslo Stockholm Kiev Krakow Prague Sofia Sarajevo Berlin Dim 1 ( 82.9 %) Budapest Copenhagen Paris Brussels Amsterdam London Dublin Lisbon Athens Moscow Minsk Elsinki Oslo Stockholm Kiev Krakow Prague Sofia Sarajevo Berlin Dim 1 ( 82.9 %) Copenhagen Paris Brussels Amsterdam London Dublin Budapest Madrid Rome Lisbon Athens Dim 1 ( 82.9 %) Madrid Rome Lisbon Athens Reykjavik Reykjavik Reykjavik Moscow Minsk Elsinki Oslo Stockholm Kiev Krakow Prague Sofia Sarajevo Berlin Budapest Copenhagen Paris Brussels Amsterdam London Dublin Moscow Minsk Elsinki Oslo Stockholm Kiev Krakow Prague Sofia Sarajevo Berlin Dim 1 ( 82.9 %) Budapest Copenhagen Paris Brussels Amsterdam London Dublin Madrid Rome Lisbon Athens Moscow Minsk Elsinki Oslo Stockholm Kiev Krakow Prague Sofia Sarajevo Berlin Dim 1 ( 82.9 %) Copenhagen Paris Brussels Amsterdam London Dublin Budapest Madrid Rome Lisbon Athens Dim 1 ( 82.9 %) Madrid Rome Lisbon Athens Partitioning algorithm : K-means Algorithm for aggregating around moving centers (K-means) Choose randomly Q centers of gravity Assign the points to the closest center Calculate anew the Q centers of gravity Dim 2 ( 15.4 %) Dim 2 ( 15.4 %) Dim 2 ( 15.4 %) Dim 2 ( 15.4 %) Dim 2 ( 15.4 %) Dim 2 ( 15.4 %) Dim 2 ( 15.4 %) Dim 2 ( 15.4 %) Dim 2 ( 15.4 %) Dim 2 ( 15.4 %) Dim 2 ( 15.4 %) Dim 2 ( 15.4 %) / 42
26 Hierarchical clustering 1 Introduction 2 Principles of hierarchical clustering 3 Example 4 Partitioning algorithm : K-means 5 Extras 6 Characterizing classes of individuals 23 / 42
27 Robustifying a partition obtained using hierarchical clustering The partition obtained by hierarchical clustering is not optimal and can be improved or made robust using K-means Algorithm : use the obtained hierarchical partition to initialize K-means run a few iterations of K-means = potentially improved partition Advantage : more robust partition Disadvantage : loss of hierarchical structure 24 / 42
28 Hierarchical clustering in high dimension If many variables : do PCA and keep only first axes = takes us to classical case If many individuals, hierarchical algorithm is too long Use K-means to partition into around 100 classes Build tree using these classes (weighted by the number of individuals in each class) Gives us the top of the tree 25 / 42
29 Hierarchical clustering in high dimension If many variables : do PCA and keep only first axes = takes us to classical case If many individuals, hierarchical algorithm is too long Use K-means to partition into around 100 classes Build tree using these classes (weighted by the number of individuals in each class) Hierarchical clustering Hierarchical Clustering Gives us the top of the tree Cluster Dendrogram Hierarchical Classification Tree from original data Tree using classes 25 / 42
30 Hierarchical clustering on qualitative data Two strategies : Transform them to quantitative data Do MCA and keep only the first dimensions Do hierarchical clustering using the principal axes of the MCA Use measures/indices suitable for qualitative variables : similarity indices, Jaccard index, etc. 26 / 42
31 Doing factor analysis followed by clustering Qualitative data : MCA outputs quantitative principal components Factor analysis eliminates the last components, which are just noise = more stable clustering x.1 x.k x.k F 1 F Q F K Data PCA Structure Noise 27 / 42
32 Doing factor analysis followed by clustering Representation of the tree and classes on two factor axes = FA gives continuous information, the tree gives discontinuous information. The tree hints at information hidden in further axes Hierarchical clustering on the factor map height cluster 1 cluster 2 cluster 3 Moscow Minsk Elsinki Krakow Oslo Berlin Stockholm Copenhagen Reykjavik Dublin London Kiev Prague Sofia Sarajevo Paris Brussels Amsterdam Dim 1 (82.9%) Budapest Madrid Rome Lisbon Athens / 42
33 Hierarchical clustering 1 Introduction 2 Principles of hierarchical clustering 3 Example 4 Partitioning algorithm : K-means 5 Extras 6 Characterizing classes of individuals 29 / 42
34 The class make-up : using model individuals Model individuals : the ones closest to each class center Cluster 1: Oslo Helsinki Stockholm Minsk Moscow Cluster 2: Berlin Sarajevo Brussels Prague Amsterdam Cluster 3: Rome Lisbon Madrid Athens Dim 2 (15.40%) cluster 1 cluster 2 cluster 3 Moscow Kiev Budapest Minsk Krakow cluster 1 Prague Elsinki Oslo Sarajevo Sofia Berlin Stockholm Copenhagen cluster 2 Paris Brussels Amsterdam London Athens Madrid Rome cluster 3 Lisbon Reykjavik Dublin Dim 1 (82.90%) 30 / 42
35 Characterizing/describing classes Goals : Find the variables which are most important for the partition Characterize a class (or group of individuals) in terms of quantitative variables Sort the variables that best describe the classes Questions : Which variables best characterize the partition How can we characterize individuals in the 1st class? Which variables describe them best? 31 / 42
36 Characterizing/describing classes Which variables best represent the partition? For each quantitative variable : build an analysis of variance model between the quantitative variable and the class variable do a Fisher test to detect class effect Sort the variables by increasing p-value Eta2 P-value October e-10 March e-10 November e-09 September e-09 April e-08 February e-08 December e-07 January e-06 August e-06 July e-05 May e-04 June e / 42
37 Characterizing classes using quantitative variables January February March April May June July August September October November December Elsinki Kiev Minsk Moscow Oslo Reykjavik Stockholm Amsterdam Berlin Brussels Budapest Copenhagen Dublin Krakow London Paris Prague Sarajevo Sofia Athens Lisbon Madrid Rome / 42
38 Characterizing classes using quantitative variables 1st idea : if the values of X for class q seem to be randomly drawn from all the values of X, then X doesn t characterize class q. Random January Temperature 2nd idea : the more a random draw appears unlikely, the more X characterizes class q. 34 / 42
39 Characterizing classes using quantitative variables Idea : use as reference a random draw of n q values from N What values can x q take? (i.e., what is the distribution of X q?) ( ) E( X q ) = x V( X q ) = s2 N nq n q N 1 L( X q ) = N because X q is a mean = Test statistic = s 2 n q x q x ( N nq N 1 ) N (0, 1) If test statistic 1.96 then X characterizes class q and the more the test statistic is large, the better X characterizes class q. Idea : rank the variables by decreasing test statistic 35 / 42
40 Characterizing classes using quantitative variables $quanti$ 1 v.test Mean in Overall sd in Overall p.value category mean category sd July June August May September January December November April February October March / 42
41 Characterizing classes using quantitative variables $ 2 NULL $ 3 v.test Mean in Overall sd in Overall p.value category mean category sd September October August November July April March February June December January May / 42
42 Characterizing classes using qualitative variables Which variables best characterize the partition? For each qualitative variable, do a χ 2 test between it and the class variable Sort the variables by increasing p-value $test.chi2 p.value df Area / 42
43 Characterizing classes using qualitative variables Does the South category characterize the 3rd class? Cluster 3 Other cluster Total South n mc = 4 1 n m = 5 Not south Total n c = 4 19 n = 23 n Test : H 0 : mc n c = nm n versus H 1 : m abnormally overrepresented in c Under H 0 : L(N mc ) = H(n c, nm n, n) P H 0 (N mc n mc ) Cluster 3 Cla/Mod Mod/Cla Global p.value v.test Area=South = 80 ; = 100 ; = ; P H(4, 23 5,23) [Nmc 4] = = H 0 rejected, South is overrepresented in the 3rd class Sort the categories in terms of p-values 39 / 42
44 Characterizing classes using factor axes These are also quantitative variables $ 1 v.test Mean in Overall sd in Overall p.value category mean category sd Dim $ 2 v.test Mean in Overall sd in Overall p.value category mean category sd Dim $ 3 v.test Mean in Overall sd in Overall p.value category mean category sd Dim / 42
45 Conclusions Clustering can be done on tables of individuals vs quantitative variables MCA transforms qualitative variables into quantitative ones hierarchical clustering gives a hierarchical tree number of classes K-means can be used to make classes more robust Characterize classes by active and supplementary variables, quantitative or qualitative 41 / 42
46 Introduction Principles of hierarchical clustering Example K-means Extras Describing the classes found More nalysis Series Husson Lê Pagès Exploratory Multivariate Analysis by Example Using R Second Edition Chapman & Hall/CRC Computer Science & Data Analysis Series Exploratory Multivariate Analysis by Example Using R SECOND EDITION Husson F., Lê S. & Pagès J. (2017) Exploratory Multivariate Analysis by Example Using R 2nd edition, 230 p., CRC/Press. François Husson Sébastien Lê Jérôme Pagès The FactoMineR package for performing clustering : Movies on Youtube : a Youtube channel: youtube.com/hussonfrancois a playlist with 11 movies in English a playlist with 17 movies in French 42 / 42
Hierarchical ascendant clustering with FactoMineR (Temperature example) Magalie Houée-Bigot & François Husson
Hierarchical ascendant clustering with FactoMineR (Temperature example) Magalie Houée-Bigot & François Husson Import data setwd("c:/users/houee/downloads") # select the working directory temperature
More informationClassification avec FactoMineR François Husson
Classification avec FactoMineR François Husson Importation des données decathlon
More information5.0% 0.0% -5.0% -10.0% -15.0% 10.0% 5.0% 0.0% -5.0% -10.0% -15.0% 10.0% 5.0% 0.0% -5.0% -10.0% 16.00% 13.00% 10.00% 7.00% 4.
MARKETBEAT The DNA of Real Estate EUROPE A Cushman & Wakefield Research Publication Q4 2014 ICE RENTAL (YEAR-ON-YEAR) Q4 2012 Q4 2013 Q4 2014 All Countries 0.2% 0.7% 0.2% Western Europe -0.5% 0.7% 1.8%
More informationHi! I m Heather. Enjoy the printable pack!
Hi! I m Heather. I am a homeschooling mother of four wild and little kiddos, and I blog over at Only Passionate Curiosity. On my blog, I try to share a little bit of what our homeschool life is like, all
More informationNATIONAL INSTITUTE OF ECONOMIC AND SOCIAL RESEARCH ESTIMATES OF MONTHLY GDP. Embargo until hours on 11 th January 2013
NATIONAL INSTITUTE OF ECONOMIC AND SOCIAL RESEARCH ESTIMATES OF MONTHLY GDP Embargo until 15.00 hours on 11 th January 2013 Our monthly estimates of GDP suggest that output declined by 0.3 per cent in
More informationNational Radio Channels. PPM measurement October 2017
National Radio Channels PPM measurement October 17 Background This overview describes the development of radio broadcasting at national levels with updated figures from October 17. The Norwegian Media
More informationHS in Europe - 2 Characteristics
Description of high-speed systems in operation, under construction or projected in Europe and throughout the world. HS in Europe - 2 Characteristics Dipl. Eng. Ignacio Barrón U I C Passengers & HS Department
More informationPaper 2. Mathematics test. Calculator allowed. satspapers.org. First name. Last name. School KEY STAGE TIER
Ma KEY STAGE 3 TIER 4 6 2005 Mathematics test Paper 2 Calculator allowed Please read this page, but do not open your booklet until your teacher tells you to start. Write your name and the name of your
More informationDigital Excellence Study
Digital Excellence Study Key findings - Slovenia In cooperation with Združenje Manager September 2016 Marko Derča Vice president, Head of Digital Transformation EE A.T. Kearney / Digital Excellence Study
More informationOct-09 Nov-09 Dec-09. Jan-10 Feb-10 Mar-10. Apr-10 May-10 Jun-10
Grade 1 Pacing 2009-2010 Investigations 2009 Investigations How Many of Each? Number Games and Crayon Puzzles S M T W T F S Making Shapes and Designing Quilts Twos, Fives, and Tens 1 2 3 4 5 Solving Story
More informationB262A. MATHEMATICS B (MEI) Paper 2 Section A (Foundation Tier) GENERAL CERTIFICATE OF SECONDARY EDUCATION. Wednesday 14 January 2009 Afternoon WARNING
F GENERAL CERTIFICATE OF SECONDARY EDUCATION MATHEMATICS B (MEI) Paper 2 Section A (Foundation Tier) B262A *CUP/T61563* Candidates answer on the question paper OCR Supplied Materials: None Other Materials
More informationSubtracting Rational Numbers
. How can you use what you know about subtracting integers to subtract rational numbers? ACTIVITY: Work with a partner. Use a number line to find the difference. a. Then move unit left to end at. Subtract.
More informationNational Radio Channels. PPM measurement December 2017
National Radio Channels PPM measurement December 2 Background This overview describes the development of radio broadcasting at national levels with updated figures from December 2. The Norwegian Media
More informationA Business Location at its Best. Saxony Economic Development Corporation
A Business Location at its Best Right in the heart of Europe Reykjavik A German federal state in the center of Europe, a bridge between East and West 3 h Oslo 2 h Helsinki Tallinn Stockholm Riga Moscow
More information3. Data and sampling. Plan for today
3. Data and sampling Business Statistics Plan for today Reminders and introduction Data: qualitative and quantitative Quantitative data: discrete and continuous Qualitative data discussion Samples and
More informationPaper 2. Calculator allowed. Mathematics test. First name. Last name. School. Remember KEY STAGE 3 TIER 3 5
Ma KEY STAGE 3 Mathematics test TIER 3 5 Paper 2 Calculator allowed First name Last name School 2007 Remember The test is 1 hour long. You may use a calculator for any question in this test. You will need:
More informationNumber patterns on a spreadsheet
A1 SS Number patterns on a spreadsheet This sheet will help you to create your own number patterns on a spreadsheet. Do the steps one at a time. You will soon feel more confident with a spreadsheet program.
More informationThe study of human populations involves working not PART 2. Cemetery Investigation: An Exercise in Simple Statistics POPULATIONS
PART 2 POPULATIONS Cemetery Investigation: An Exercise in Simple Statistics 4 When you have completed this exercise, you will be able to: 1. Work effectively with data that must be organized in a useful
More informationBACCARAT: A LONGITUDINAL MICRO-STUDY
BACCARAT: A LONGITUDINAL MICRO-STUDY FIELD RESULTS FROM ONE ATLANTIC CITY CASINO, JANUARY 2004 TO JUNE 2010 CENTER FOR GAMING RESEARCH, JULY 2010 Baccarat is the most important game in the world s biggest
More informationMARKETBEAT THE DNA OF REAL ESTATE EUROPE Q A Cushman & Wakefield Research Publication OFFICE RENTAL GROWTH (YEAR-ON-YEAR)
MARKETBEAT THE DNA OF REAL ESTATE EUROPE A Cushman & Wakefield Research Publication Q2 2015 ICE RENTAL GROWTH (YEAR-ON-YEAR) Q2 2013 Q2 2014 Q2 2015 All Countries 0.4% 0.9% 0.4% Western Europe 0.2% 1.3%
More informationSummer Assignment for AP Environmental Science
Summer Assignment for AP Environmental Science 1. Reading Writing Critically about Environmental Science Issues Read The Ghost Map and write a paper in which you focus on: How the water supply and delivery
More informationLutz Roewer, Sascha Willuweit Dept. Forensic Genetics, Institute of Legal Medicine and Forensic Sciences Charité Universitätsmedizin Berlin, Germany
The new YHRD Lutz Roewer, Sascha Willuweit Dept. Forensic Genetics, Institute of Legal Medicine and Forensic Sciences Charité Universitätsmedizin Berlin, Germany 2000 2004 2008 2014 Aug 99 Jun 00 Jan 03
More informationSimulated Statistics for the Proposed By-Division Design In the Consumer Price Index October 2014
Simulated Statistics for the Proposed By-Division Design In the Consumer Price Index October 2014 John F Schilp U.S. Bureau of Labor Statistics, Office of Prices and Living Conditions 2 Massachusetts Avenue
More informationUsing Graphing Skills
Name Class Date Laboratory Skills 8 Using Graphing Skills Introduction Recorded data can be plotted on a graph. A graph is a pictorial representation of information recorded in a data table. It is used
More informationUnit 6 - Number Games and Crayon Puzzles Unit 8 - Twos, Fives, and Tens Unit 7 - Color, Shape, and Size. Unit 9 - Blocks and Boxes.
GRADE 1 Pacing 2010-2011 Unit 1 - How Many of Each? Unit 2 - Making Shapes and Designing Quilts Unit 3 - Solving Story Problems Unit 4 - What Would You Rather Be? Unit 5 - Fish Lengths and Animal Jumps
More informationManagement Strategy Evaluation Process. used in the. evaluation of. Atlantic Herring Acceptable Biological Catch Control Rules.
Management Strategy Evaluation Process used in the evaluation of Atlantic Herring Acceptable Biological Catch Control Rules February 24, 2017 Prepared by the New England Fishery Management Council and
More informationUsing Graphing Skills
Name Class Date Laboratory Skills 8 Using Graphing Skills Introduction Recorded data can be plotted on a graph. A graph is a pictorial representation of information recorded in a data table. It is used
More informationEfficient Target Detection from Hyperspectral Images Based On Removal of Signal Independent and Signal Dependent Noise
IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735.Volume 9, Issue 6, Ver. III (Nov - Dec. 2014), PP 45-49 Efficient Target Detection from Hyperspectral
More informationNational Radio Channels. PPM measurement November 2017
National Radio Channels PPM measurement November 17 Background This overview describes the development of radio broadcasting at national levels with updated figures from November 17. The Norwegian Media
More informationRecreation Facility Hours
NORMAL HOURS Recreation Facility Hours August 1, 2016 July 31, 2017 Last Revised: July 29, 2016 Effective Dates 2016: AUG 23 31, SEP 1 2, SEP 6 16, SEP 18 23, SEP 25 30, OCT 1 7, OCT 9 18, OCT 24 30, NOV
More informationSSB Debate: Model-based Inference vs. Machine Learning
SSB Debate: Model-based nference vs. Machine Learning June 3, 2018 SSB 2018 June 3, 2018 1 / 20 Machine learning in the biological sciences SSB 2018 June 3, 2018 2 / 20 Machine learning in the biological
More informationProbability and Statistics
Probability and Statistics Activity: Do You Know Your s? (Part 1) TEKS: (4.13) Probability and statistics. The student solves problems by collecting, organizing, displaying, and interpreting sets of data.
More informationANALYSIS OF VARIANCE PROCEDURE FOR ANALYZING UNBALANCED DAIRY SCIENCE DATA USING SAS
ANALYSIS OF VARIANCE PROCEDURE FOR ANALYZING UNBALANCED DAIRY SCIENCE DATA USING SAS Avtar Singh National Dairy Research Institute, Karnal -132001 In statistics, analysis of variance (ANOVA) is a collection
More informationBayport Securitisation (RF) Ltd Investor Report February 2013
Bayport Securitisation (RF) Ltd Investor Report February 2013 Financial statistics for the period ended February 2013 Feb 13 Jan 13 Dec 12 Nov 12 Oct 12 Sep 12 Aug 12 Jul 12 Jun 12 May 12 Apr 12 Mar 12
More informationEurovision Song Contest 2011
Eurovision Song Contest 2011 Annex with detailed illustration of the 15-day spectrum recording Seite 50 von 81 Evaluation of the 1 th day of the DKE spectrum recording: 01.05.2011 Maximum values of the
More informationLecture 3 - Regression
Lecture 3 - Regression Instructor: Prof Ganesh Ramakrishnan July 25, 2016 1 / 30 The Simplest ML Problem: Least Square Regression Curve Fitting: Motivation Error measurement Minimizing Error Method of
More informationUnit 7 Central Tendency and Probability
Name: Block: 7.1 Central Tendency 7.2 Introduction to Probability 7.3 Independent Events 7.4 Dependent Events 7.1 Central Tendency A central tendency is a central or value in a data set. We will look at
More informationAssessing the Feasibility of Wind Power Production for the University of Rhode Island s Bay Campus
University of Rhode Island DigitalCommons@URI Senior Honors Projects Honors Program at the University of Rhode Island 2009 Assessing the Feasibility of Wind Power Production for the University of Rhode
More informationAssessing Measurement System Variation
Example 1 Fuel Injector Nozzle Diameters Problem A manufacturer of fuel injector nozzles has installed a new digital measuring system. Investigators want to determine how well the new system measures the
More informationMassport Capital Programs Enterprise Schedule Management. Primavera Schedule Toolkit Design. January 2018 Version 1.0
Massport Capital Programs Enterprise Schedule Management Primavera Schedule Toolkit Design January 2018 Version 1.0 Contents 1.0 INTRODUCTION 2 1.1 DISCLAIMER 3 2.0 TOOLKIT SCHEDULE COMPONENTS 4 2.1 SCHEDULE
More informationTerritorial dimension of Innovation and Structural Funds - the Swedish perspective
Territorial dimension of Innovation and Structural Funds - the Swedish perspective ESPON workshop Brussels, 5 October 2012 Sverker Lindblad Ministry of Enterprise, Energy and Communications Division for
More informationTotal Marks : 100 READ THE FOLLOWING DIRECTIONS CAREFULLY:
Mathematics Writing Time : 2 Hours Total Marks : 100 READ THE FOLLOWING DIRECTIONS CAREFULLY: 1. Do not write in the first fifteen minutes. This time is to be spent on reading the questions. After having
More informationFrequency Distribution and Graphs
Chapter 2 Frequency Distribution and Graphs 2.1 Organizing Qualitative Data Denition 2.1.1 A categorical frequency distribution lists the number of occurrences for each category of data. Example 2.1.1
More informationOyster Monitoring ( ) Nikki Dix, Pam Marcum, Matt Monroe
Oyster Monitoring (2014-20) Nikki Dix, Pam Marcum, Matt Monroe Oyster Monitoring Objectives 1) Evaluate the status of GTM oysters 2) Estimate ecosystem services Guiding Questions NE FL Oyster Research
More informationOpen To Buy (OTB) User Manual
Open To Buy (OTB) User Manual Open to Buy Purpose... 3 Typical User and Workflow... 4 Open to Buy Definitions and Calculations... 5 Field Definitions... 5 Automatically Calculated OTB Fields... 5 Syntax
More informationStatistical tests. Paired t-test
Statistical tests Gather data to assess some hypothesis (e.g., does this treatment have an effect on this outcome?) Form a test statistic for which large values indicate a departure from the hypothesis.
More informationA Guide to Linked Mortality Data from Hospital Episode Statistics and the Office for National Statistics
A Guide to Linked Mortality Data from Hospital Episode Statistics and the Office for National Statistics June 2015 Version History Version Changes Date Issued Number 1 14/Dec/2010 1.1 Modified Appendix
More information4) If your birthday fell on a Sunday this year, what day will it fall on next year? 5) If it is autumn now, what season will it be in 100 seasons?
Worksheet 1 - Going round in circles Most of these questions were taken from: http://nrich.maths.org/308, http://nrich.maths.org/6651 and http://nrich.maths.org/content/id/6651/going%20round%20in%20circles.pdf.
More informationTHE 2009 VIETNAM POPULATION AND HOUSING CENSUS
THE 2009 VIETNAM POPULATION AND HOUSING CENSUS (Prepared for the 11 th Meeting of the Head of NSOs of East Asian Countries) Dr. Le Manh Hung Director-General General Statistics Office Vietnam This paper
More informationPaper 1. Calculator not allowed. Mathematics test. First name. Last name. School. Remember KEY STAGE 3 TIER 4 6
Ma KEY STAGE 3 Mathematics test TIER 4 6 Paper 1 Calculator not allowed First name Last name School 2008 Remember The test is 1 hour long. You must not use a calculator for any question in this test. You
More informationIntroduction to Information Visualization
Introduction to Information Visualization 1 Source: Jean-Daniel Fekete, Jarke J. van Wijk, John T. Stasko, and Chris North. The Value of Information Visualization (2008) 2 I II III IV x y x y x y x y 10.0
More informationOn the Peculiar Distribution of the U.S. Stock Indeces Digits
On the Peculiar Distribution of the U.S. Stock Indeces Digits Eduardo Ley Resources for the Future, Washington DC Version: November 29, 1994 Abstract. Recent research has focused on studying the patterns
More informationCHAPTER 7 Probability
CHAPTER 7 Probability 7.1. Sets A set is a well-defined collection of distinct objects. Welldefined means that we can determine whether an object is an element of a set or not. Distinct means that we can
More informationGE 113 REMOTE SENSING
GE 113 REMOTE SENSING Topic 8. Image Classification and Accuracy Assessment Lecturer: Engr. Jojene R. Santillan jrsantillan@carsu.edu.ph Division of Geodetic Engineering College of Engineering and Information
More informationI STATISTICAL TOOLS IN SIX SIGMA DMAIC PROCESS WITH MINITAB APPLICATIONS
Six Sigma Quality Concepts & Cases- Volume I STATISTICAL TOOLS IN SIX SIGMA DMAIC PROCESS WITH MINITAB APPLICATIONS Chapter 7 Measurement System Analysis Gage Repeatability & Reproducibility (Gage R&R)
More informationLearning about line graphs
Learning about line graphs Find a brick wall at your school. Use a metre ruler and a piece of chalk to mark 1 metre in height on the wall. Count how many rows of bricks there are in 1 metre. Use this infor
More informationBayport Securitisation (RF) Ltd Investor Report May 2013
Bayport Securitisation (RF) Ltd Investor Report May 2013 Financial statistics for the period ended May 2013 May 13 Apr 13 Mar 13 Feb 13 Jan 13 Dec 12 Nov 12 Oct 12 Sep 12 Aug 12 Jul 12 Jun 12 Performance
More informationUsing Graphing Skills
Name Class Date Laboratory Skills 8 Using Graphing Skills Time required: 30 minutes Introduction Recorded data can be plotted on a graph. A graph is a pictorial representation of information recorded in
More informationFY rd Quarter Financial Results
FY2012 3 rd Quarter Financial Results April 1 December 31, 2011 KONAMI CORPORATION February 2, 2012 Cautionary statement with respect to forward-looking statements Statements made in this presentation
More informationFORESIGHT METHOD HORIZONS. Module. Introduction to Foresight for Canada Beyond 150
HORIZONS FORESIGHT METHOD for Canada Beyond 50 OVERVIEW Where are we in the process? What is Horizons approach to foresight? How do the foresight tools fit together for Canada Beyond 50? 2 A NEW MODEL
More informationStatistical Measures
Statistical Measures Pre-Algebra section 10.1 Statistics is an area of math that deals with gathering information (called data). It is often used to make predictions. Important terms: Population A population
More informationLASER server: ancestry tracing with genotypes or sequence reads
LASER server: ancestry tracing with genotypes or sequence reads The LASER method Supplementary Data For each ancestry reference panel of N individuals, LASER applies principal components analysis (PCA)
More informationSite Surveys for Offshore Windfarms: How to Spend your Money Wisely in an Age of Austerity
Site Surveys for Offshore Windfarms: How to Spend your Money Wisely in an Age of Austerity Richard Orren, Global Technical Head, Director Mark Finch, Geotechnical Director Senergy Survey and GeoEngineering
More informationHEIGHT OF CHRISTMAS TREE DRAWINGS AS A FUNCTION OF TIME
Perceptzrnland illotor Skills, 1963, 17, 335-339. @ Southern Universities Press 1963 HEIGHT OF CHRISTMAS TREE DRAWINGS AS A FUNCTION OF TIME RAY A. CRADDICK New Mexico Stare University' Summary.-Thirty
More informationResting pulse After exercise Resting pulse After exercise. Trial Trial Trial Trial. Subject Subject
EXERCISE 2.3 Data Presentation Objectives After completing this exercise, you should be able to 1. Explain the difference between discrete and continuous variables and give examples. 2. Use one given data
More informationI STATISTICAL TOOLS IN SIX SIGMA DMAIC PROCESS WITH MINITAB APPLICATIONS
Six Sigma Quality Concepts & Cases- Volume I STATISTICAL TOOLS IN SIX SIGMA DMAIC PROCESS WITH MINITAB APPLICATIONS Chapter 7 Measurement System Analysis Gage Repeatability & Reproducibility (Gage R&R)
More informationAGENDA. the giver's guide to greatness. the hidden DNA of the real estate business A MOTIVATIONAL MESSAGE FROM SPECIAL GUEST...
AGENDA BRIAN BUFFINI BUFFINI & COMPANY CHAIRMAN AND FOUNDER @BrianBuffini 1 2 3 4 the giver's guide to greatness the hidden DNA of the real estate business A MOTIVATIONAL MESSAGE FROM SPECIAL GUEST...
More informationRemote Sensing. The following figure is grey scale display of SPOT Panchromatic without stretching.
Remote Sensing Objectives This unit will briefly explain display of remote sensing image, geometric correction, spatial enhancement, spectral enhancement and classification of remote sensing image. At
More informationGraphical Excellence Sandra Murray Jonathan Merrell
These presenters have nothing to disclose Graphical Excellence Sandra Murray Jonathan Merrell April 24, 2013 Why Graphical Displays of Data? 1. Almost all the potential learning from data is available
More informationa) Getting 10 +/- 2 head in 20 tosses is the same probability as getting +/- heads in 320 tosses
Question 1 pertains to tossing a fair coin (8 pts.) Fill in the blanks with the correct numbers to make the 2 scenarios equally likely: a) Getting 10 +/- 2 head in 20 tosses is the same probability as
More informationEstimation of Fault Resistance from Fault Recording Data. Daniel Wong & Michael Tong 2014-November-5
Estimation of Fault Resistance from Fault Recording Data Daniel Wong & Michael Tong 2014-November-5 Agenda Project Background & Introduction Fault Resistance & Effect Estimation Algorithm Estimation Results
More informationEffect of Oyster Stocking Density and Floating Bag Mesh Size on Commercial Oyster Production
Effect of Oyster Stocking Density and Floating Bag Mesh Size on Commercial Oyster Production Year 2015 Project AAF15-008 Prepared by : André Mallet Mallet Research Services 4 Columbo Drive Dartmouth (Nova
More informationWelcome to Introduction to Probability and Statistics Spring
Welcome to 18.05 Introduction to Probability and Statistics Spring 2018 http://xkcd.com/904/ Staff David Vogan dav@math.mit.edu, office hours Sunday 2 4 in 2-355 Nicholas Triantafillou ngtriant@mit.edu,
More informationYorkville Seasoned MLP Investment Team Recognized Experts
Yorkville Seasoned MLP Investment Team Recognized Experts Investing & Researching MLPs Since Early 1990s One of the Longest Audited MLP Strategy Track Records Performance Recognition (Morningstar, Pensions
More informationTJP TOP TIPS FOR IGCSE STATS & PROBABILITY
TJP TOP TIPS FOR IGCSE STATS & PROBABILITY Dr T J Price, 2011 First, some important words; know what they mean (get someone to test you): Mean the sum of the data values divided by the number of items.
More informationTEST (a) Write these numbers in order of increasing size. 12, 7, 15, 4, 1, 10, Circle all the odd numbers.
1 TEST 5 1. Complete the picture so that it has 7 dots. 2. What is the number shown? 0 5 10 3. Fill in the missing numbers. 2 + 3 = 4 1 = (c) 3 + 4 = (d) 4 + = 9 (e) 8 = 3 (f) + 7 = 7 4. Write these numbers
More informationStudy on Implementation of Lean Manufacturing Tools and Techniques
International Journal of Applied Engineering Research ISSN 0973-4562 Volume 11, Number 5 (2016) pp 3289-3293 Study on Implementation of Lean Manufacturing Tools and Techniques M Yogesh, Research Scholar,
More informationDescription cabiplot caprojection Remarks and examples References Also see
Title stata.com ca postestimation plots Postestimation plots for ca and camat cabiplot caprojection Remarks and examples References Also see The following postestimation commands are of special interest
More informationOnline Gaming Category Overview
Online Gaming Category Overview Yandex International Business Development ussales@yandex-team.ru +1 857.288.8762 uksales@yandex-team.ru +44 020 3291 3306 Segment Volumes ü Query dynamics ü User interest
More informationBusinessHaldimand.ca. Haldimand County 2018 Community Profile
BusinessHaldimand.ca Haldimand County 2018 Community Profile BusinessHaldimand.ca Haldimand County 2018 Community Profile BusinessHaldimand.ca Haldimand County 2018 Community Profile Data Sources Please
More informationTHE ALGEBRA III MIDTERM EXAM REVIEW Name
THE ALGEBRA III MIDTERM EXAM REVIEW Name This review MUST be turned in when you take the midterm exam OR you will not be allowed to take the midterm and will receive a ZERO for the exam. ALG III Midterm
More informationFourth Bi-annual EuropeanPWN BoardWomen Monitor 2010 in partnership with Russell Reynolds Associates
European Professional Women s Network Press Release Fourth Bi-annual EuropeanPWN BoardWomen Monitor 2010 in partnership with Russell Reynolds Associates The proportion of women on the boards of the top
More informationPaper 2. Mathematics test. Calculator allowed. First name. Last name. School KEY STAGE TIER
Ma KEY STAGE 3 TIER 3 5 2005 Mathematics test Paper 2 Calculator allowed Please read this page, but do not open your booklet until your teacher tells you to start. Write your name and the name of your
More informationLearning about Biodiversity. Student Handouts
Learning about Biodiversity Student Handouts Presenter: Linda Sigismondi, Ph.D. University of Rio Grande, Rio Grande, OH 45674 lindas@rio.edu, www.rio.edu/lindas Ohio Wildlife History Part 1: Changes 1.
More informationHomogenisation of Maximum and Minimum Air Temperatures in Ireland. Mary Curley * and Seamus Walsh
Homogenisation of Maximum and Minimum Air Temperatures in Ireland Mary Curley * and Seamus Walsh Homogenisation of monthly maximum and minimum temperature series HOMER 2.6 Number of stations homogenised
More informationBusinessHaldimand.ca. Haldimand County 2019 Community Profile
BusinessHaldimand.ca BusinessHaldimand.ca BusinessHaldimand.ca Data Sources Please note that every effort has been made to use the most current data available. The major sources of information for this
More informationSurvey of the adjourned sale rate of the Japanese home video game industry
Survey of the adjourned sale rate of the Japanese home video game industry Yuhsuke KOYAMA Tokyo Institute of Technology Japan koyama@dis.titech.ac.jp ABSTRACT We carried out a survey of the adjourned sales
More informationClassification-based Hybrid Filters for Image Processing
Classification-based Hybrid Filters for Image Processing H. Hu a and G. de Haan a,b a Eindhoven University of Technology, Den Dolech 2, 5600 MB Eindhoven, the Netherlands b Philips Research Laboratories
More informationSite Name. Bird Dimensions Species Hen harrier = data input required length (m) 0.55 = model calculates value wing span (m) 1.
Site Name Loch Urr Bird Dimensions Species Hen harrier = data input required length (m) 0.55 = model calculates value wing span (m) 1.4 speed (m/sec) 8 Turbine Dimensions Height of tower (m) 80 Blade length
More informationLive Hand Gesture Recognition using an Android Device
Live Hand Gesture Recognition using an Android Device Mr. Yogesh B. Dongare Department of Computer Engineering. G.H.Raisoni College of Engineering and Management, Ahmednagar. Email- yogesh.dongare05@gmail.com
More informationLoad-Frequency Control and Reserves Network Code. David Bunney JESG 19 March 2013
Load-Frequency Control and Reserves Network Code David Bunney JESG 19 March 2013 Agenda Overview and Timescales Stakeholder Engagement Overview of the Code More detailed discussion on Frequency Quality
More information80mA Low Noise Ultra Low Dropout Voltage Regulator
80mA Low Noise Ultra Low Dropout Voltage Regulator DESCRIPTION The TS5204 series is an efficient linear voltage regulator with ultra-low noise output, very low dropout voltage (typically 20mV at light
More information2.NBT.1 20) , 200, 300, 400, 500, 600, 700, 800, NBT.2
Saxon Math 2 Class Description: Saxon mathematics is based on the principle of developing math skills incrementally and reviewing past skills daily. It also incorporates regular and cumulative assessments.
More informationNational Radio Channels. PPM Measurement September 2018
National Radio Channels PPM Measurement September 18 Background This overview describes the development of radio broadcasting at national levels with updated figures from September 18. The Norwegian Media
More informationA JOINT MODULATION IDENTIFICATION AND FREQUENCY OFFSET CORRECTION ALGORITHM FOR QAM SYSTEMS
A JOINT MODULATION IDENTIFICATION AND FREQUENCY OFFSET CORRECTION ALGORITHM FOR QAM SYSTEMS Evren Terzi, Hasan B. Celebi, and Huseyin Arslan Department of Electrical Engineering, University of South Florida
More informationCCE Calendar for Session Delhi Region (Split-up Syllabus) Class VI- Mathematics TERM I
CCE Calendar for Session 2016-2017 Delhi Region (Split-up Syllabus) Class VI- Mathematics TERM I MONTHS CHAPTER/TOPIC SUB TOPICS TO BE COVERED NUMB ER OF PERIO DS SUGGESTED ACTIVITIES CH 1. Knowing Our
More information60% 100% 100% Performance Report YTD YTD YTD. June Development Services Division. In this Edition. target: 100% target: 100% target: 100%
City of McKinney Development Services 221 N. Tennessee St. McKinney, TX 7509 McKinney s, including the Building Inspections, Code Compliance, Engineering and Planning Departments, strives to provide excellent
More informationSinusoidal Applications
Sinusoidal Applications A package of 5 activities Problems dealing with graphing and determining the equations of sinusoidal functions for real world situations Fractal image generated by MathWiz Created
More informationHomework Assignment (20 points): MORPHOMETRICS (Bivariate and Multivariate Analyses)
Fossils and Evolution Due: Tuesday, Jan. 31 Spring 2012 Homework Assignment (20 points): MORPHOMETRICS (Bivariate and Multivariate Analyses) Introduction Morphometrics is the use of measurements to assess
More informationTEST 6. 12, 7, 15, 4, 1, 10, Circle all the odd numbers.
TEST 6. Complete the picture so that it has 7 dots. 2. What is the number shown? 0 5 0. Fill in the missing numbers. 2 + = 4 = (c) + 4 = (d) 4 + = 9 (e) 8 = (f) + 7 = 7 4. Write these numbers in order
More information