Introduction to Biosystematics - Zool 575

Size: px
Start display at page:

Download "Introduction to Biosystematics - Zool 575"

Transcription

1 Introduction to Biosystematics Lecture Introduction to maximum likelihood - synopsis of how it works - likelihood of a single sequence - likelihood across a single branch - likelihood as branch length changes - likelihood of 1 site on a tree - likelihood of a tree 2. Mechanistic comparison with Parsimony Derek S. Sikes University of algary Zool 575 Statistical Phylogenetics - rew from mathematical, computer, evolutionary, numerical studies, not as much from systematics 1. Parsimony (avalli-sforza & Edwards, 196) tree with minimum changes preferred 2. (avalli-sforza & Edwards, 1964) tree that maximizes probability of the data is preferred Made available for DN based phylogenetics by Felsenstein in the early 1980s Optimality riteria - iven 2+ trees Maximum Parsimony he tree hypothesizing the fewest number of character state changes is the best he tree maximizing the probability of the observed data is best Parsimony & Parsimony & ladistics - Strict cladists typically use only parsimony methods & justify this choice on philosophical grounds eg it provides the least falsified hypothesis - Parsimony has also been interpreted as a fast approximation to maximum likelihood avalli-sforza and Edwards (1967:555), stated that parsimony s success is probably due to the closeness of the solution it gives to the projection of the maximum likelihood tree and parsimony certainly cannot be justified on the grounds that evolution proceeds according to some minimum principle. (ML) Slow incorporation into systematics 1. omputationally complex - limited early uses to small molecular datasets (<12 OUs) 2. Molecular only (until 2001). Slow development of good software 4. Slow development of objective means to choose process models Now we have more sophisticated software, faster computers, more realistic models, and means to choose among models ML is becoming a widely used method Joseph Felsenstein and David Swofford instrumental in developing software to use ML (ML) statistical model-fitting exercise Which model (process + tree [topology+branch lengths]) is most likely to have generated the observed data? Maximizing the probability of the data given a model L=Pr(D H) Select process model using I etc. select tree & model parameters using ML 1

2 (ML) Note: we could select our process model using ML instead of the I - just pick the model with the highest likelihood Why do we use the I? (or similar criterion like the LR [Likelihood Ratio est] or the BI [Bayesian Information riterion]) Recall the I rewards models for high likelihoods (ML) oal: o estimate the probability that we would observe a particular dataset, given a phylogenetic tree and some notion of how the evolutionary process worked over time ( ) Probability of given - is not the probability that the model or tree (hypothesis) is true - is the likelihood of seeing the data if we assume the model and tree are true hypothesis 1: coin is fair ( P heads = 0.5) hypothesis 2: coin is biased (P heads = 0.21) -lnl (hypo 1) = lnl (hypo 2) = But what about other values for (P heads)? (say 0.?) eg. oin flipping flips and 21 are heads hypothesis 1: coin is fair ( P heads = 0.5) hypothesis 2: coin is biased (P heads = 0.21) -lnl (hypo 1) = lnl (hypo 2) = lnl <- Estimator (MLE) for P heads (0.21) <- P heads ML says nothing about the likelihood of the hypothesis being true it simply allows us to select among many hypotheses based on which makes the data most likely eg. Loud sounds coming from your attic (data) Hypothesis 1: gremlins are bowling upstairs Hypothesis 2: gremlins are sleeping upstairs hypothesis 1 makes the data more likely than hypothesis 2 - but itself is highly improbable! - he data are constant - it is the models (hypotheses) that we are varying - he likelihood is usually very small so we take the natural logarithm (log-likelihood), often abbreviated lnl (or lnl) which is typically negative Eg. lnl = lnl = which is better? he larger value some programs report -lnl so the value is positive, or simply ignore the negative 2

3 Requires explicit process model of evolution Strength: assumptions are spelled out unlike Parsimony - which also requires a model but the model is implicit & only recently understood statistically (next lecture) - he hypotheses (the model) includes: 1. he process model (eg J69, R+) 2. he topology (tree). he branch lengths of (2n-) branches for n taxa - Must calculate MLEs for parameters of the process model (eg the base frequencies) and each of the branch lengths for a given topology - Recall the topologies are supplied by 1) a starting tree that is then 2) subjected to branch swapping (same method to get trees as Parsimony) - For every topology there are huge numbers of possible branch lengths per branch to evaluate - his is a LO of calculations! - single sequence Let s consider a single short sequence We don t need the rate matrix since nothing is changing! We only need a base composition model If we consider a sequence of length 2: Sequence: ga he probability of observing this sequence is the product of the probabilities of observing each character e.g! g = 0.4;! a =0.15 (for instance) Likelihood(D M) = 0.4 x 0.15 = 0.06 (likelihood of the data given the model) - single sequence Why we use log-likelihoods: ake a sequence with 894 sites, (: 272, :297, :95, :20) Using a model that allows an unequal base composition of! = [0.0425, 0.221, , ] he Likelihood would be = x x x Which is a INY number.. - single sequence Why we use log-likelihoods: So we convert to log-likelihoods, which we can then sum rather than multiply: lnl = 272ln(0.0425) + 297ln(0.221) + 95ln( ) + 20ln( ) lnl = Note, if we used the J69 model with its base composition of 25 % for each base, the lnl would be 894ln(0.25) = his is 6 log-likelihood units worse! - single branch Example process model Substitution probabilities (rate matrix) P = note rows sums to 1* Base composition:! = [0.1, 0.4, 0.2, 0.] *he rows of this matrix sum to 1 - meaning that for every nucleotide, we have covered all the possibilities of what might happen to it. he columns do not sum to anything in particular

4 Example model So this model says the probability of an changing to a is nd the probability of it staying the same is nd that makes up 10% of the bases in the data etc. - single branch With this we can calculate the likelihood of going from one sequence to another - single branch simple likelihood calculation lignment ccat ccgt he likelihood going from the first to the second sequence across a short branch would be site = (! c P c"c ) (! c P c"c ) (! a P a"g ) (! t P t"t ) (values are supplied by our model) = (0.4x0.98)(0.4x0.98)(0.1x)(0.x0.979) Likelihood (D M) = variable branch length But this is for a short branch - variable branch length Different branch lengths yield different likelihoods: Probability of change increases as branches get longer We know at least 1 change has happened so the branch length isn t 0.0 (substitutions per site) We also know that all the sites haven t changed so the branch length isn t 1.0 or higher We can use ML to estimate the branch length that makes the data most likely - variable branch length s the branch length increases the probability of a base changing increases (and the probability of it staying the same decreases - of course, they can t both increase!) For a branch times longer than our initial branch: - trees o determine the likelihood score of a tree: - Must have a topology - Find the MLEs for the parameters of the model, minimally this involves finding the MLEs for the 2n- branches of the tree P = he likelihood of each site is calculated taking into account all possible ancestral states (these are unknowns) - sum all possible likelihoods for each site 0.08 his yields a likelihood = (which is better than our short branch likelihood of ) he likelihood of the tree is the sum of all the individual site log-likelihoods (which we ve already done ) 4

5 - branch lengths his is key (1) ML estimates corrected branch lengths (corrected for unobserved changes) nd (2) uses these branch lengths to adjust the probabilities of change Branch lengths are estimated in units of expected number of changes per character he probability of change is a product of mutation rate (µ) and time (t) Branches can be long due to large t, or higher µ, or both (hard to tease apart t from µ) if all branches have same µ then the data fit a molecular clock (ultrametric) 1. he most parsimonious topology is often, but not always, the same as the maximum likelihood topology 2. Parsimony does not correct the data for unobserved changes and so parsimony branch lengths are typically underestimates of actual branch lengths - Parsimony branch lengths = the estimated number of changes that occurred & are mapped onto the tree after it has been found (not used during tree searching - only total tree length is used to select the best tree). Because Parsimony ignores branch length information during searches it is - much faster than ML - unable to use this information to help it find the most probable tree (some strict cladists argue that they do not use parsimony to find the most probable tree, they claim there is no connection between minimal tree length and probability of being correct, truth is unknowable ) - especially unable to use branch lengths (~ rates of change) to detect areas of the tree that are likely to experience higher rates of homoplasy than other regions 4. Parsimony minimizes homoplasy - ML will sometimes prefer a tree with more than the minimum amount of homoplasy (if it makes the data more likely given the model) - gain, it is branch lengths which indicate where homoplasy may be common in the tree - longer branches are more likely to be involved in homoplastic events than shorter ones [Note: Parsimony is sometimes called Maximum Parsimony and abbreviated MP in contrast to ML] ML Parsimony would never prefer the correct, but longer tree on the left whereas ML would (more on this in next lecture) - Parsimony also ignores autapomorphies D B autapomorphies B 1 2 D MP 2 convergence events 1 convergence event hus we expect MP to yield the same tree as ML when branch lengths are similar across the tree (when there is little branch-length heterogeneity) ontrary to some statements to the opposite, it doesn t matter how much change has occurred (some have said that MP will only work if there has been little change - low rates of evolution) his is wrong, MP can do fine with high rates of change as long as there is little branch length heterogeneity - and conversely, even with INY amounts of change MP can fail if branches are of significantly different lengths 5

6 So it is not the total amount of change in the tree that matters - it is how evenly it is distributed (recall that amount of change is related to length of branches) aveat - as you will read (Siddall 1998; Swofford et al 2001) the failure of MP with high branch-length heterogeneity depends on where the longer branches are in the tree - more on this in the next lecture It s all about the branch lengths Recall - Statistical descriptions of MP indicate that this method does not assume a common mechanism, ie does not assume a common set of branch lengths for all characters - this means that MP does not assume a shared probability of change for different states, ie each character can have its own probability of change for every branch - this is a strong assumption - the question is Do the data fit this model? - Instead of one rate matrix to estimate from the dataset MP has to estimate a separate rate matrix for every character by branch combination! - If every unique substitution probability (minus 1) is a parameter, then MP requires the estimation of a HUE number of parameters - his is like assigning a separate J69 model for each character in the dataset - and thus using only a single character to estimate the parameters needed (more on this later) Parsimony Branch Lengths Parsimony will help find the shortest tree but even on this shortest tree there are often many different ways the character evolution can be mapped on the tree 2 common, but arbitrary, ways to map characters using parsimony: RN - accelerated transformation - when alternative mappings are possible this causes characters to change closer to the root and later reverse (prefers reversals) DELRN - delayed transformation - when alternative mappings are possible this causes characters to change further from the root (prefers parallelisms) Parsimony Branch Lengths haracter mapping is directly related to the estimated branch length Note: RN and DELRN have nothing to do with finding the shortest tree - they only relate to how the character evolution is mapped on the tree lso, these are extremes across a continuum of options Unlike ML there is no objective means of choosing which of many character mappings to prefer (since branch lengths are ignored by parsimony) More on this later - in lecture on character evolution erms - from lecture & readings Felsenstein maximum parsimony maximum likelihood L=Pr(D H) Model components Process model MLE log-likelihood branch lengths site log-likelihoods expected number of changes per character mutation rate time branch length heterogeneity RN DELRN 6

7 Study questions Parsimony can be considered a fast approximation to Maximum Likelihood - describe what Minimum Evolution (ME) is considered relative to parsimony and what NJ is considered relative to ME, and finally, how all these methods, NJ, ME, MP, ML are related to one another. Why was ML slow to be incorporated into common use by systematics and what has changed to make it so commonly used today? When we say L=Pr(D H) what does this mean? what are the subcomponents of H? If we use ML to obtain the MLEs of parameters of the model why don!t we just use ML to decide which model to use? Why do we use tools like the I instead of just the log-likelihood score? Study questions When we say given the model, what does given mean? If the likelihood is not the probability of the hypothesis being true, what is it? Why do we work with log-likelihoods instead of likelihoods? Provide the formula (but not the answer) to calculate the loglikelihood of a single sequence of 289 base pairs using the J69 model. When we consider longer branches what happens to the values in the substitution (rate) matrix? ontrast the diagonals with the off-diagonals. In words describe what we need to determine the log-likelihood of a tree. Study questions When would you expect parsimony to prefer a different tree than ML? nd why? What is more important to the performance of parsimony - the total amount of evolutionary change or the degree of branch length heterogeneity? Why? What are the statistical implications of the fact that MP does not assume all characters share a single set of branch lengths (ie all characters do not share a single rate matrix)? [hint: Would the I ever prefer MP over a common mechanism ML model?] 7

Systematics - BIO 615

Systematics - BIO 615 Outline 1. Optimality riteria: Parsimony continued 2. istance vs character methods 3. uilding a tree vs finding a tree - lustering vs Optimality criterion methods 4. Performance of istance and clustering

More information

Phylogenetic Reconstruction Methods

Phylogenetic Reconstruction Methods Phylogenetic Reconstruction Methods Distance-based Methods Character-based Methods non-statistical a. parsimony statistical a. maximum likelihood b. Bayesian inference Parsimony has its roots in Hennig

More information

Frequent Inconsistency of Parsimony Under a Simple Model of Cladogenesis

Frequent Inconsistency of Parsimony Under a Simple Model of Cladogenesis Syst. Biol. 52(5):641 648, 2003 Copyright c Society of Systematic Biologists ISSN: 1063-5157 print / 1076-836X online DOI: 10.1080/10635150390235467 Frequent Inconsistency of Parsimony Under a Simple Model

More information

Coalescent Theory: An Introduction for Phylogenetics

Coalescent Theory: An Introduction for Phylogenetics Coalescent Theory: An Introduction for Phylogenetics Laura Salter Kubatko Departments of Statistics and Evolution, Ecology, and Organismal Biology The Ohio State University lkubatko@stat.ohio-state.edu

More information

Phylogeny and Molecular Evolution

Phylogeny and Molecular Evolution Phylogeny and Molecular Evolution Character Based Phylogeny Large Parsimony 1/50 Credit Ron Shamir s lecture notes Notes by Nir Friedman Dan Geiger, Shlomo Moran, Sagi Snir and Ron Shamir Durbin et al.

More information

Recap: Properties of Trees. Rooting an unrooted tree. Questions trees can address: Data for phylogeny reconstruction. Rooted vs unrooted trees:

Recap: Properties of Trees. Rooting an unrooted tree. Questions trees can address: Data for phylogeny reconstruction. Rooted vs unrooted trees: Pairwise sequence alignment (global and local) Recap: Properties of rees Multiple sequence alignment global local ubstitution matrices atabase ing L equence statistics Leaf nodes contemporary taxa Internal

More information

Coalescence. Outline History. History, Model, and Application. Coalescence. The Model. Application

Coalescence. Outline History. History, Model, and Application. Coalescence. The Model. Application Coalescence History, Model, and Application Outline History Origins of theory/approach Trace the incorporation of other s ideas Coalescence Definition and descriptions The Model Assumptions and Uses Application

More information

The Contest Between Parsimony and Likelihood. Elliott Sober*

The Contest Between Parsimony and Likelihood. Elliott Sober* The Contest Between Parsimony and Likelihood Elliott Sober* Two of the main methods that biologists now use to infer phylogenetic relationships are maximum likelihood and maximum parsimony. The method

More information

Motif finding. GCB 535 / CIS 535 M. T. Lee, 10 Oct 2004

Motif finding. GCB 535 / CIS 535 M. T. Lee, 10 Oct 2004 Motif finding GCB 535 / CIS 535 M. T. Lee, 10 Oct 2004 Our goal is to identify significant patterns of letters (nucleotides, amino acids) contained within long sequences. The pattern is called a motif.

More information

Biology 559R: Introduction to Phylogenetic Comparative Methods Topics for this week (Feb 3 & 5):

Biology 559R: Introduction to Phylogenetic Comparative Methods Topics for this week (Feb 3 & 5): Biology 559R: Introduction to Phylogenetic Comparative Methods Topics for this week (Feb 3 & 5): Chronogram estimation: Penalized Likelihood Approach BEAST Presentations of your projects 1 The Anatomy

More information

The African Origin Hypothesis What do the data tell us?

The African Origin Hypothesis What do the data tell us? The African Origin Hypothesis What do the data tell us? Mitochondrial DNA and Human Evolution Cann, Stoneking and Wilson, Nature 1987. WOS - 1079 citations Mitochondrial DNA and Human Evolution Cann, Stoneking

More information

37 Game Theory. Bebe b1 b2 b3. a Abe a a A Two-Person Zero-Sum Game

37 Game Theory. Bebe b1 b2 b3. a Abe a a A Two-Person Zero-Sum Game 37 Game Theory Game theory is one of the most interesting topics of discrete mathematics. The principal theorem of game theory is sublime and wonderful. We will merely assume this theorem and use it to

More information

Parsimony II Search Algorithms

Parsimony II Search Algorithms Parsimony II Search Algorithms Genome 373 Genomic Informatics Elhanan Borenstein Raw distance correction As two DNA sequences diverge, it is easy to see that their maximum raw distance is ~0.75 (assuming

More information

Project. B) Building the PWM Read the instructions of HO_14. 1) Determine all the 9-mers and list them here:

Project. B) Building the PWM Read the instructions of HO_14. 1) Determine all the 9-mers and list them here: Project Please choose ONE project among the given five projects. The last three projects are programming projects. hoose any programming language you want. Note that you can also write programs for the

More information

Analysis of geographically structured populations: Estimators based on coalescence

Analysis of geographically structured populations: Estimators based on coalescence Analysis of geographically structured populations: Estimators based on coalescence Peter Beerli Department of Genetics, Box 357360, University of Washington, Seattle WA 9895-7360, Email: beerli@genetics.washington.edu

More information

Lecture 2. Tree space and searching tree space

Lecture 2. Tree space and searching tree space Lecture 2. Tree space and searching tree space Joe Felsenstein epartment of Genome Sciences and epartment of iology Lecture 2. Tree space and searching tree space p.1/48 Orang Gorilla himp Human Gibbon

More information

Laboratory 1: Uncertainty Analysis

Laboratory 1: Uncertainty Analysis University of Alabama Department of Physics and Astronomy PH101 / LeClair May 26, 2014 Laboratory 1: Uncertainty Analysis Hypothesis: A statistical analysis including both mean and standard deviation can

More information

Lectures: Feb 27 + Mar 1 + Mar 3, 2017

Lectures: Feb 27 + Mar 1 + Mar 3, 2017 CS420+500: Advanced Algorithm Design and Analysis Lectures: Feb 27 + Mar 1 + Mar 3, 2017 Prof. Will Evans Scribe: Adrian She In this lecture we: Summarized how linear programs can be used to model zero-sum

More information

The study of probability is concerned with the likelihood of events occurring. Many situations can be analyzed using a simplified model of probability

The study of probability is concerned with the likelihood of events occurring. Many situations can be analyzed using a simplified model of probability The study of probability is concerned with the likelihood of events occurring Like combinatorics, the origins of probability theory can be traced back to the study of gambling games Still a popular branch

More information

Hypothesis Tests. w/ proportions. AP Statistics - Chapter 20

Hypothesis Tests. w/ proportions. AP Statistics - Chapter 20 Hypothesis Tests w/ proportions AP Statistics - Chapter 20 let s say we flip a coin... Let s flip a coin! # OF HEADS IN A ROW PROBABILITY 2 3 4 5 6 7 8 (0.5) 2 = 0.2500 (0.5) 3 = 0.1250 (0.5) 4 = 0.0625

More information

JIGSAW ACTIVITY, TASK # Make sure your answer in written in the correct order. Highest powers of x should come first, down to the lowest powers.

JIGSAW ACTIVITY, TASK # Make sure your answer in written in the correct order. Highest powers of x should come first, down to the lowest powers. JIGSAW ACTIVITY, TASK #1 Your job is to multiply and find all the terms in ( 1) Recall that this means ( + 1)( + 1)( + 1)( + 1) Start by multiplying: ( + 1)( + 1) x x x x. x. + 4 x x. Write your answer

More information

Where do evolutionary trees comes from?

Where do evolutionary trees comes from? Probabilistic models of evolutionary trees Joint work with Outline of talk Part 1: History, overview Part 2: Discrete models of tree shape Part 3: Continuous trees Part 4: Applications: phylogenetic diversity,

More information

An Intuitive Approach to Groups

An Intuitive Approach to Groups Chapter An Intuitive Approach to Groups One of the major topics of this course is groups. The area of mathematics that is concerned with groups is called group theory. Loosely speaking, group theory is

More information

YGB #2: Aren t You a Square?

YGB #2: Aren t You a Square? YGB #2: Aren t You a Square? Problem Statement How can one mathematically determine the total number of squares on a chessboard? Counting them is certainly subject to error, so is it possible to know if

More information

Discrete Structures for Computer Science

Discrete Structures for Computer Science Discrete Structures for Computer Science William Garrison bill@cs.pitt.edu 6311 Sennott Square Lecture #23: Discrete Probability Based on materials developed by Dr. Adam Lee The study of probability is

More information

Representing Square Numbers. Use materials to represent square numbers. A. Calculate the number of counters in this square array.

Representing Square Numbers. Use materials to represent square numbers. A. Calculate the number of counters in this square array. 1.1 Student book page 4 Representing Square Numbers You will need counters a calculator Use materials to represent square numbers. A. Calculate the number of counters in this square array. 5 5 25 number

More information

Dominant and Dominated Strategies

Dominant and Dominated Strategies Dominant and Dominated Strategies Carlos Hurtado Department of Economics University of Illinois at Urbana-Champaign hrtdmrt2@illinois.edu Junel 8th, 2016 C. Hurtado (UIUC - Economics) Game Theory On the

More information

CS 787: Advanced Algorithms Homework 1

CS 787: Advanced Algorithms Homework 1 CS 787: Advanced Algorithms Homework 1 Out: 02/08/13 Due: 03/01/13 Guidelines This homework consists of a few exercises followed by some problems. The exercises are meant for your practice only, and do

More information

CS188 Spring 2014 Section 3: Games

CS188 Spring 2014 Section 3: Games CS188 Spring 2014 Section 3: Games 1 Nearly Zero Sum Games The standard Minimax algorithm calculates worst-case values in a zero-sum two player game, i.e. a game in which for all terminal states s, the

More information

Notes for Recitation 3

Notes for Recitation 3 6.042/18.062J Mathematics for Computer Science September 17, 2010 Tom Leighton, Marten van Dijk Notes for Recitation 3 1 State Machines Recall from Lecture 3 (9/16) that an invariant is a property of a

More information

More on games (Ch )

More on games (Ch ) More on games (Ch. 5.4-5.6) Alpha-beta pruning Previously on CSci 4511... We talked about how to modify the minimax algorithm to prune only bad searches (i.e. alpha-beta pruning) This rule of checking

More information

The Problem. Tom Davis December 19, 2016

The Problem. Tom Davis  December 19, 2016 The 1 2 3 4 Problem Tom Davis tomrdavis@earthlink.net http://www.geometer.org/mathcircles December 19, 2016 Abstract The first paragraph in the main part of this article poses a problem that can be approached

More information

Chapter 4: Patterns and Relationships

Chapter 4: Patterns and Relationships Chapter : Patterns and Relationships Getting Started, p. 13 1. a) The factors of 1 are 1,, 3,, 6, and 1. The factors of are 1,,, 7, 1, and. The greatest common factor is. b) The factors of 16 are 1,,,,

More information

Theory of Probability - Brett Bernstein

Theory of Probability - Brett Bernstein Theory of Probability - Brett Bernstein Lecture 3 Finishing Basic Probability Review Exercises 1. Model flipping two fair coins using a sample space and a probability measure. Compute the probability of

More information

A Likelihood Method to Estimate/Detect Gene Flow and A Distance Method to. Estimate Species Trees in the Presence of Gene Flow.

A Likelihood Method to Estimate/Detect Gene Flow and A Distance Method to. Estimate Species Trees in the Presence of Gene Flow. A Likelihood Method to Estimate/Detect Gene Flow and A Distance Method to Estimate Species Trees in the Presence of Gene Flow Thesis Presented in Partial Fulfillment of the Requirements for the Degree

More information

Square Roots and the Pythagorean Theorem

Square Roots and the Pythagorean Theorem UNIT 1 Square Roots and the Pythagorean Theorem Just for Fun What Do You Notice? Follow the steps. An example is given. Example 1. Pick a 4-digit number with different digits. 3078 2. Find the greatest

More information

Logarithms ID1050 Quantitative & Qualitative Reasoning

Logarithms ID1050 Quantitative & Qualitative Reasoning Logarithms ID1050 Quantitative & Qualitative Reasoning History and Uses We noticed that when we multiply two numbers that are the same base raised to different exponents, that the result is the base raised

More information

Human origins and analysis of mitochondrial DNA sequences

Human origins and analysis of mitochondrial DNA sequences Human origins and analysis of mitochondrial DNA sequences Science, February 7, 1992 L. Vigilant et al. [1] recently presented "the strongest support yet for the placement of [their] common mtdna [mitochondrial

More information

CIS 2033 Lecture 6, Spring 2017

CIS 2033 Lecture 6, Spring 2017 CIS 2033 Lecture 6, Spring 2017 Instructor: David Dobor February 2, 2017 In this lecture, we introduce the basic principle of counting, use it to count subsets, permutations, combinations, and partitions,

More information

Bootstraps and testing trees

Bootstraps and testing trees ootstraps and testing trees Joe elsenstein epts. of Genome Sciences and of iology, University of Washington ootstraps and testing trees p.1/20 ln L log-likelihood curve and its confidence interval 2620

More information

I. WHAT IS PROBABILITY?

I. WHAT IS PROBABILITY? C HAPTER 3 PROAILITY Random Experiments I. WHAT IS PROAILITY? The weatherman on 10 o clock news program states that there is a 20% chance that it will snow tomorrow, a 65% chance that it will rain and

More information

CS188 Spring 2010 Section 3: Game Trees

CS188 Spring 2010 Section 3: Game Trees CS188 Spring 2010 Section 3: Game Trees 1 Warm-Up: Column-Row You have a 3x3 matrix of values like the one below. In a somewhat boring game, player A first selects a row, and then player B selects a column.

More information

The topic for the third and final major portion of the course is Probability. We will aim to make sense of statements such as the following:

The topic for the third and final major portion of the course is Probability. We will aim to make sense of statements such as the following: CS 70 Discrete Mathematics for CS Spring 2006 Vazirani Lecture 17 Introduction to Probability The topic for the third and final major portion of the course is Probability. We will aim to make sense of

More information

Algorithms for Genetics: Basics of Wright Fisher Model and Coalescent Theory

Algorithms for Genetics: Basics of Wright Fisher Model and Coalescent Theory Algorithms for Genetics: Basics of Wright Fisher Model and Coalescent Theory Vineet Bafna Harish Nagarajan and Nitin Udpa 1 Disclaimer Please note that a lot of the text and figures here are copied from

More information

Variations on the Two Envelopes Problem

Variations on the Two Envelopes Problem Variations on the Two Envelopes Problem Panagiotis Tsikogiannopoulos pantsik@yahoo.gr Abstract There are many papers written on the Two Envelopes Problem that usually study some of its variations. In this

More information

Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks!

Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks! Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks! Paul has many great tools for teaching phylogenetics at his web site: http://hydrodictyon.eeb.uconn.edu/people/plewis

More information

Developing Algebraic Thinking

Developing Algebraic Thinking Developing Algebraic Thinking DEVELOPING ALGEBRAIC THINKING Algebra is an important branch of mathematics, both historically and presently. algebra has been too often misunderstood and misrepresented as

More information

Handout 6 Enhancement and Human Development David W. Agler, Last Updated: 4/12/2014

Handout 6 Enhancement and Human Development David W. Agler, Last Updated: 4/12/2014 1. Introduction This handout is based on pp.35-52 in chapter 2 ( Enhancement and Human Development ) of Allen Buchanan s 2011 book Beyond Humanity? The Ethics of Biomedical Enhancement. This chapter focuses

More information

What Do You Expect? Concepts

What Do You Expect? Concepts Important Concepts What Do You Expect? Concepts Examples Probability A number from 0 to 1 that describes the likelihood that an event will occur. Theoretical Probability A probability obtained by analyzing

More information

The next several lectures will be concerned with probability theory. We will aim to make sense of statements such as the following:

The next several lectures will be concerned with probability theory. We will aim to make sense of statements such as the following: CS 70 Discrete Mathematics for CS Fall 2004 Rao Lecture 14 Introduction to Probability The next several lectures will be concerned with probability theory. We will aim to make sense of statements such

More information

More on games (Ch )

More on games (Ch ) More on games (Ch. 5.4-5.6) Announcements Midterm next Tuesday: covers weeks 1-4 (Chapters 1-4) Take the full class period Open book/notes (can use ebook) ^^ No programing/code, internet searches or friends

More information

1. The chance of getting a flush in a 5-card poker hand is about 2 in 1000.

1. The chance of getting a flush in a 5-card poker hand is about 2 in 1000. CS 70 Discrete Mathematics for CS Spring 2008 David Wagner Note 15 Introduction to Discrete Probability Probability theory has its origins in gambling analyzing card games, dice, roulette wheels. Today

More information

7.1 Chance Surprises, 7.2 Predicting the Future in an Uncertain World, 7.4 Down for the Count

7.1 Chance Surprises, 7.2 Predicting the Future in an Uncertain World, 7.4 Down for the Count 7.1 Chance Surprises, 7.2 Predicting the Future in an Uncertain World, 7.4 Down for the Count Probability deals with predicting the outcome of future experiments in a quantitative way. The experiments

More information

Lecture 30. Phylogeny methods, part 2 (Searching tree space) p.1/22

Lecture 30. Phylogeny methods, part 2 (Searching tree space) p.1/22 Lecture 30. Phylogeny methods, part 2 (Searching tree space) Joe elsenstein epartment of Genome Sciences and epartment of iology Lecture 30. Phylogeny methods, part 2 (Searching tree space) p.1/22 ll possible

More information

Grade 7/8 Math Circles. Visual Group Theory

Grade 7/8 Math Circles. Visual Group Theory Faculty of Mathematics Waterloo, Ontario N2L 3G1 Centre for Education in Mathematics and Computing Grade 7/8 Math Circles October 25 th /26 th Visual Group Theory Grouping Concepts Together We will start

More information

Outcome 7 Review. *Recall that -1 (-5) means

Outcome 7 Review. *Recall that -1 (-5) means Outcome 7 Review Level 2 Determine the slope of a line that passes through A(3, -5) and B(-2, -1). Step 1: Remember that ordered pairs are in the form (x, y). Label the points so you can substitute into

More information

Comparative method, coalescents, and the future

Comparative method, coalescents, and the future Comparative method, coalescents, and the future Joe Felsenstein Depts. of Genome Sciences and of Biology, University of Washington Comparative method, coalescents, and the future p.1/36 Correlation of

More information

Characteristics of Routes in a Road Traffic Assignment

Characteristics of Routes in a Road Traffic Assignment Characteristics of Routes in a Road Traffic Assignment by David Boyce Northwestern University, Evanston, IL Hillel Bar-Gera Ben-Gurion University of the Negev, Israel at the PTV Vision Users Group Meeting

More information

Maths Is Fun! Activity Pack Year 4

Maths Is Fun! Activity Pack Year 4 Maths Is Fun! Activity Pack Year 4 1. Spot the Difference Draw a horizontal line on a piece of paper. Write a 3 digit number at the left hand end and a higher one at the right hand end. Ask your child

More information

Multitree Decoding and Multitree-Aided LDPC Decoding

Multitree Decoding and Multitree-Aided LDPC Decoding Multitree Decoding and Multitree-Aided LDPC Decoding Maja Ostojic and Hans-Andrea Loeliger Dept. of Information Technology and Electrical Engineering ETH Zurich, Switzerland Email: {ostojic,loeliger}@isi.ee.ethz.ch

More information

Machine and Thought: The Turing Test

Machine and Thought: The Turing Test Machine and Thought: The Turing Test Instructor: Viola Schiaffonati April, 7 th 2016 Machines and thought 2 The dream of intelligent machines The philosophical-scientific tradition The official birth of

More information

The snail and bean conundrum

The snail and bean conundrum Maths B DAY 2008 Friday 21 November The snail and bean conundrum The Maths B Day is sponsored by and Introduction This Maths B day assignment comprises two parts. In the first part, you will study the

More information

Foundations of Probability Worksheet Pascal

Foundations of Probability Worksheet Pascal Foundations of Probability Worksheet Pascal The basis of probability theory can be traced back to a small set of major events that set the stage for the development of the field as a branch of mathematics.

More information

CS1800: More Counting. Professor Kevin Gold

CS1800: More Counting. Professor Kevin Gold CS1800: More Counting Professor Kevin Gold Today Dealing with illegal values Avoiding overcounting Balls-in-bins, or, allocating resources Review problems Dealing with Illegal Values Password systems often

More information

A few chessboards pieces: 2 for each student, to play the role of knights.

A few chessboards pieces: 2 for each student, to play the role of knights. Parity Party Returns, Starting mod 2 games Resources A few sets of dominoes only for the break time! A few chessboards pieces: 2 for each student, to play the role of knights. Small coins, 16 per group

More information

Name. Is the game fair or not? Prove your answer with math. If the game is fair, play it 36 times and record the results.

Name. Is the game fair or not? Prove your answer with math. If the game is fair, play it 36 times and record the results. Homework 5.1C You must complete table. Use math to decide if the game is fair or not. If Period the game is not fair, change the point system to make it fair. Game 1 Circle one: Fair or Not 2 six sided

More information

Ioanna Manolopoulou and Brent C. Emerson. October 7, Abstract

Ioanna Manolopoulou and Brent C. Emerson. October 7, Abstract Phylogeographic Ancestral Inference Using the Coalescent Model on Haplotype Trees Ioanna Manolopoulou and Brent C. Emerson October 7, 2011 Abstract Phylogeographic ancestral inference is a question frequently

More information

For more information on the Common Core State Standards, visit Beast Academy Grade 4 Chapters 1-12:

For more information on the Common Core State Standards, visit   Beast Academy Grade 4 Chapters 1-12: Beast Academy Scope and Sequence for Grade 4 (books 4A through 4D). The content covered in Beast Academy Grade 4 is loosely based on the standards created by the Common Core State Standards Initiative.

More information

Introduction to Algorithms / Algorithms I Lecturer: Michael Dinitz Topic: Algorithms and Game Theory Date: 12/4/14

Introduction to Algorithms / Algorithms I Lecturer: Michael Dinitz Topic: Algorithms and Game Theory Date: 12/4/14 600.363 Introduction to Algorithms / 600.463 Algorithms I Lecturer: Michael Dinitz Topic: Algorithms and Game Theory Date: 12/4/14 25.1 Introduction Today we re going to spend some time discussing game

More information

Database Normalization as a By-product of MML Inference. Minimum Message Length Inference

Database Normalization as a By-product of MML Inference. Minimum Message Length Inference Database Normalization as a By-product of Minimum Message Length Inference David Dowe Nayyar A. Zaidi Clayton School of IT, Monash University, Melbourne VIC 3800, Australia December 8, 2010 Our Research

More information

Branch Current Method

Branch Current Method Script Hello friends. In this series of lectures we have been discussing the various types of circuits, the voltage and current laws and their application to circuits. Today in this lecture we shall be

More information

Nessie is alive! Gerco Onderwater. Role of statistics, bias and reproducibility in scientific research

Nessie is alive! Gerco Onderwater. Role of statistics, bias and reproducibility in scientific research Nessie is alive! Role of statistics, bias and reproducibility in scientific research Gerco Onderwater c.j.g.onderwater@rug.nl 4/23/15 2 Loch Ness, Scotland 4/23/15 3 Legendary monster Saint Adomnán of

More information

Comparative method, coalescents, and the future. Correlation of states in a discrete-state model

Comparative method, coalescents, and the future. Correlation of states in a discrete-state model Comparative method, coalescents, and the future Joe Felsenstein Depts. of Genome Sciences and of Biology, University of Washington Comparative method, coalescents, and the future p.1/28 Correlation of

More information

Design of Parallel Algorithms. Communication Algorithms

Design of Parallel Algorithms. Communication Algorithms + Design of Parallel Algorithms Communication Algorithms + Topic Overview n One-to-All Broadcast and All-to-One Reduction n All-to-All Broadcast and Reduction n All-Reduce and Prefix-Sum Operations n Scatter

More information

Network-building. Introduction. Page 1 of 6

Network-building. Introduction. Page 1 of 6 Page of 6 CS 684: Algorithmic Game Theory Friday, March 2, 2004 Instructor: Eva Tardos Guest Lecturer: Tom Wexler (wexler at cs dot cornell dot edu) Scribe: Richard C. Yeh Network-building This lecture

More information

Lecture 6: Basics of Game Theory

Lecture 6: Basics of Game Theory 0368.4170: Cryptography and Game Theory Ran Canetti and Alon Rosen Lecture 6: Basics of Game Theory 25 November 2009 Fall 2009 Scribes: D. Teshler Lecture Overview 1. What is a Game? 2. Solution Concepts:

More information

8.F The Possibility of Mistakes: Trembling Hand Perfection

8.F The Possibility of Mistakes: Trembling Hand Perfection February 4, 2015 8.F The Possibility of Mistakes: Trembling Hand Perfection back to games of complete information, for the moment refinement: a set of principles that allow one to select among equilibria.

More information

Temporal-Difference Learning in Self-Play Training

Temporal-Difference Learning in Self-Play Training Temporal-Difference Learning in Self-Play Training Clifford Kotnik Jugal Kalita University of Colorado at Colorado Springs, Colorado Springs, Colorado 80918 CLKOTNIK@ATT.NET KALITA@EAS.UCCS.EDU Abstract

More information

Permutation Groups. Definition and Notation

Permutation Groups. Definition and Notation 5 Permutation Groups Wigner s discovery about the electron permutation group was just the beginning. He and others found many similar applications and nowadays group theoretical methods especially those

More information

A natural number is called a perfect cube if it is the cube of some. some natural number.

A natural number is called a perfect cube if it is the cube of some. some natural number. A natural number is called a perfect square if it is the square of some natural number. i.e., if m = n 2, then m is a perfect square where m and n are natural numbers. A natural number is called a perfect

More information

1.5 How Often Do Head and Tail Occur Equally Often?

1.5 How Often Do Head and Tail Occur Equally Often? 4 Problems.3 Mean Waiting Time for vs. 2 Peter and Paula play a simple game of dice, as follows. Peter keeps throwing the (unbiased) die until he obtains the sequence in two successive throws. For Paula,

More information

Statistical Tests: More Complicated Discriminants

Statistical Tests: More Complicated Discriminants 03/07/07 PHY310: Statistical Data Analysis 1 PHY310: Lecture 14 Statistical Tests: More Complicated Discriminants Road Map When the likelihood discriminant will fail The Multi Layer Perceptron discriminant

More information

Game Theory and Randomized Algorithms

Game Theory and Randomized Algorithms Game Theory and Randomized Algorithms Guy Aridor Game theory is a set of tools that allow us to understand how decisionmakers interact with each other. It has practical applications in economics, international

More information

Kodu Module 1: Eating Apples in the Kodu World

Kodu Module 1: Eating Apples in the Kodu World Kodu Module 1: Eating Apples in the Kodu World David S. Touretzky Version of May 29, 2017 Learning Goals How to navigate through a world using the game controller. New idioms: Pursue and Consume, Let Me

More information

Lesson 16: The Computation of the Slope of a Non Vertical Line

Lesson 16: The Computation of the Slope of a Non Vertical Line ++ Lesson 16: The Computation of the Slope of a Non Vertical Line Student Outcomes Students use similar triangles to explain why the slope is the same between any two distinct points on a non vertical

More information

Computing optimal strategy for finite two-player games. Simon Taylor

Computing optimal strategy for finite two-player games. Simon Taylor Simon Taylor Bachelor of Science in Computer Science with Honours The University of Bath April 2009 This dissertation may be made available for consultation within the University Library and may be photocopied

More information

APPENDIX 2.3: RULES OF PROBABILITY

APPENDIX 2.3: RULES OF PROBABILITY The frequentist notion of probability is quite simple and intuitive. Here, we ll describe some rules that govern how probabilities are combined. Not all of these rules will be relevant to the rest of this

More information

On the Peculiar Distribution of the U.S. Stock Indeces Digits

On the Peculiar Distribution of the U.S. Stock Indeces Digits On the Peculiar Distribution of the U.S. Stock Indeces Digits Eduardo Ley Resources for the Future, Washington DC Version: November 29, 1994 Abstract. Recent research has focused on studying the patterns

More information

Chapter 5 Integers. 71 Copyright 2013 Pearson Education, Inc. All rights reserved.

Chapter 5 Integers. 71 Copyright 2013 Pearson Education, Inc. All rights reserved. Chapter 5 Integers In the lower grades, students may have connected negative numbers in appropriate ways to informal knowledge derived from everyday experiences, such as below-zero winter temperatures

More information

10. Phase Cycling and Pulsed Field Gradients Introduction to Phase Cycling - Quadrature images

10. Phase Cycling and Pulsed Field Gradients Introduction to Phase Cycling - Quadrature images 10. Phase Cycling and Pulsed Field Gradients 10.1 Introduction to Phase Cycling - Quadrature images The selection of coherence transfer pathways (CTP) by phase cycling or PFGs is the tool that allows the

More information

Game Theory and Algorithms Lecture 19: Nim & Impartial Combinatorial Games

Game Theory and Algorithms Lecture 19: Nim & Impartial Combinatorial Games Game Theory and Algorithms Lecture 19: Nim & Impartial Combinatorial Games May 17, 2011 Summary: We give a winning strategy for the counter-taking game called Nim; surprisingly, it involves computations

More information

CARD GAMES AND CRYSTALS

CARD GAMES AND CRYSTALS CARD GAMES AND CRYSTALS This is the extended version of a talk I gave at KIDDIE (graduate student colloquium) in April 2011. I wish I could give this version, but there wasn t enough time, so I left out

More information

Reading 14 : Counting

Reading 14 : Counting CS/Math 240: Introduction to Discrete Mathematics Fall 2015 Instructors: Beck Hasti, Gautam Prakriya Reading 14 : Counting In this reading we discuss counting. Often, we are interested in the cardinality

More information

Signal Recovery from Random Measurements

Signal Recovery from Random Measurements Signal Recovery from Random Measurements Joel A. Tropp Anna C. Gilbert {jtropp annacg}@umich.edu Department of Mathematics The University of Michigan 1 The Signal Recovery Problem Let s be an m-sparse

More information

can mathematicians find the woods?

can mathematicians find the woods? Eolutionary trees, coalescents, and gene trees: can mathematicians find the woods? Joe Felsenstein Department of Genome Sciences and Department of Biology Eolutionary trees, coalescents, and gene trees:

More information

Probability - Introduction Chapter 3, part 1

Probability - Introduction Chapter 3, part 1 Probability - Introduction Chapter 3, part 1 Mary Lindstrom (Adapted from notes provided by Professor Bret Larget) January 27, 2004 Statistics 371 Last modified: Jan 28, 2004 Why Learn Probability? Some

More information

Problem 4.R1: Best Range

Problem 4.R1: Best Range CSC 45 Problem Set 4 Due Tuesday, February 7 Problem 4.R1: Best Range Required Problem Points: 50 points Background Consider a list of integers (positive and negative), and you are asked to find the part

More information

18 Completeness and Compactness of First-Order Tableaux

18 Completeness and Compactness of First-Order Tableaux CS 486: Applied Logic Lecture 18, March 27, 2003 18 Completeness and Compactness of First-Order Tableaux 18.1 Completeness Proving the completeness of a first-order calculus gives us Gödel s famous completeness

More information

Games (adversarial search problems)

Games (adversarial search problems) Mustafa Jarrar: Lecture Notes on Games, Birzeit University, Palestine Fall Semester, 204 Artificial Intelligence Chapter 6 Games (adversarial search problems) Dr. Mustafa Jarrar Sina Institute, University

More information

Experiment #6 MOSFET Dynamic circuits

Experiment #6 MOSFET Dynamic circuits Experiment #6 MOSFET Dynamic circuits Jonathan Roderick Introduction: This experiment will build upon the concepts that were presented in the previous lab and introduce dynamic circuits using MOSFETS.

More information