Maximum likelihood pedigree reconstruction using integer programming

Size: px
Start display at page:

Download "Maximum likelihood pedigree reconstruction using integer programming"

Transcription

1 Maximum likelihood pedigree reconstruction using integer programming James Dept of Computer Science & York Centre for Complex Systems Analysis University of York, York, YO10 5DD, UK Abstract Pedigrees are family trees relating groups of individuals which can usefully be seen as Bayesian networks. The problem of finding a maximum likelihood pedigree from genotypic data is encoded as an integer linear programming problem. Two methods of ensuring that pedigrees are acyclic are considered. Results on obtaining maximum likelihood pedigrees relating 20, 46 and 59 individuals are presented. Running times for larger pedigrees depend strongly on the data used but generally compare well with those in the literature. Solving is particularly fast when allele frequency is uniform. 1 Introduction The problem of finding the most probable pedigree ( family tree ) for a group of related individuals, whether human or not, is often needed for paternity and family reunion cases [7]. Correctly specifying relationships is also needed for the proper application of genetic linkage analysis. In the literature the problem is often called pedigree reconstruction and we will also make use of this term. A Bayesian approach is frequently taken where prior knowledge is combined with observed data to define a posterior probability for any given pedigree. Prior knowledge can include information such as known relationships, age and/or sex of some of the individuals and perhaps limits on the number of generations in the pedigree. In addition, probabilistic prior knowledge stating that, for example, very high levels of inbreeding are unlikely, can also be included [7, 13]. Data will be genotypic data for each individual under consideration. This data will be defined via a set of marker loci each specifying a position on a particular chromosome. The DNA sequence at such loci will vary between different individuals and so the marker can be thought of as a variable. The possible values of this variable are known as alleles. Chromosomes come in pairs, one inherited from the father and one from the mother, so there is a pair of allele values, called the genotype, for each locus. See [9] for further information. Data for pedigree reconstruction typically consists of genotypes for a number of marker loci: call this a multi-locus genotype. In the interests of brevity multi-locus genotype will often be abbreviated to just genotype in what follows. As a result of its importance a number of computational techniques have been used for pedigree reconstruction including simple enumeration [7], simulated annealing [2, 11], MCMC [3] and dynamic programming [5]. However, it appears that constraint-based methods have yet to be used for pedigree reconstruction although [12, 10] apply weighted CSP and SAT techniques, respectively, to check the consistency of a given pedigree. Also a weighted MAX-SAT approach has been used for the problem of Bayesian network learning [6]; pedigree reconstruction can be seen as a special case of this (see Section 2). In this paper pedigree reconstruction is cast as a instance of Bayesian network learning and integer linear programming (IP) is used to search for maximum likelihood Bayesian networks. The paper is structured as follows. In Section 2 a method for representing pedigrees as Bayesian networks (BNs) is given and the likelihood function for such BNs is analysed. Section 3 discusses IP encodings for pedigree reconstruction. Section 4 shows results for the most successful encoding found to date and the paper ends with conclusions and pointers to future work in Section 5. 1

2 2 Pedigrees as Bayesian networks A Bayesian network (BN) is an acyclic directed graph whose nodes (V ) represent random variables. (Such graphs are often called, somewhat imprecisely, directed acyclic graphs (DAGs).) If there is an arrow from node u V to node v V in the graph then u is said to be a parent of v. The parameters of a BN are conditional probability distributions for each node given its parents in the graph. Since the graph is acyclic the product of these conditional probability distributions defines a full joint probability distribution over all random variables represented in the graph. There are a number of ways of representing pedigrees as BNs [9], but here, like [5], each node in the BN represents a known individual, or more precisely the multi-locus genotype of that individual. An arrow from u to v is a statement that u is the biological parent of v. It follows that no node may have more than two parents. A node with no parents represents a founder: an individual neither of whose parents are to be found amongst the individuals considered. A node with one parent represents an individual with exactly one known parent and a node with two parents represents an individual both of whose parents are known. Following [5] Hardy-Weinberg equilibrium will be assumed which implies that the multi-locus genotypes for founders will be probabilistically independent. In addition only complete genotypic data (for a given collection of markers) will be considered. Following [5] let α 1 (g v g u ) denote the probability that individual v has genotype g v given that it has one known parent u with genotype g u. Let α 2 (g v g u,g w ) be the probability that individual v has genotype g v given that its has two known parents u and w with genotypes g u and g w. Let α 0 (g v ) be the marginal probability that individual v has genotype g v. Since for any particular pedigree reconstruction problem the observed genotype g v for each individual v is fixed, the following notational abbreviation can be introduced. α(v,{}) α(v,{u}) α(v,{u,w}) def = α 0 (g v ) def = α 1 (g v g u ) def = α 2 (g v g u,g w ) As noted by [5] due to the assumption of a complete sample, the likelihood of any candidate pedigree G decomposes into a product of conditional probabilities. (The likelihood of G is the probability of the observed data conditional on G being the true pedigree.) Letting Pa(v,G) denote the parents that v V has in a pedigree G, this product can be represented as in (1) and so the log-likelihood, which is more convenient to work with, can be represented as in (2). L(G) = α(v,pa(v,g)) (1) v V l(g) = logl(g) = log α(v, Pa(v, G)) (2) v V The problem of maximum likelihood pedigree reconstruction is that of finding G such that l(g) is maximised. 3 An integer programming encoding for ML pedigree reconstruction In this section a method of encoding the maximum likelihood pedigree reconstruction problem as an integer programming problem is presented. A first step towards the encoding is the simple observation 2

3 that a pedigree specifies the parents, if any, of each individual. So given an individual v, a parent set W and a pedigree G it is determined whether v has parents W in G. This is formalised using the functions I(W v) defined in (3), where W is implicitly restricted to be: W V \ {v}, W 2. This restriction on W will be assumed throughout to simplify the presentation. I(W v)(g) = { 1 if v has parents W in G 0 otherwise. The log-likelihood (2) of any pedigree can now be rewritten as in (4). l(g) = log α(v,w)i(w v)(g) (4) v,w Note that in (4), I(W v)(g) only takes the value 1 when W = Pa(v,G). For any other value of W, I(W v)(g) = 0. The key to the integer programming encoding is to view the I(W v) as binary variables. Any particular pedigree G determines a joint instantiation of these binary variables, setting exactly V = n of these binary variables to 1 and all others to 0. However, most joint instantiations of the I(W v) do not correspond to any pedigree. With this in mind the maximum likelihood pedigree reconstruction problem can be reformulated as in (5). (3) Find an instantiation of the I(W v) which maximises: v,w logα(v,w)i(w v) subject to the I(W v) representing a valid pedigree. (5) Because the variables in (5) are integer-valued and the objective function v,w logα(v,w)i(w v) is linear in these variables this is an integer linear programming problem as long as the necessary constraints on the I(W v) can be expressed as linear equations and inequalities. In the following subsections it will be shown that this is indeed the case. ( Integer linear programming will continue to be abbreviated to integer programming throughout.) 3.1 Constraints The most basic constraint on the I(W v) is that each individual v has exactly one parentset. This can be expressed by n linear equations: v : I(W v) = 1 (6) W Any instantiation of the I(W v) satisfying the equations given by (6) will represent a graph, but the graph may contain directed cycles. To rule out cycles auxiliary variables are required. There are many ways of ruling out cycles. The following sections present two possible approaches Ruling out cycles with a total order For each distinct pair of individuals u,v a binary variable I(u < v) is created. I(u < v) = 1 indicates that u is older than v (u s birth was before that of v). Without loss of generality it can be assumed that no two distinct individuals are of exactly the same age, so that exactly one is older than the other which makes the order on the ages of the individuals a total order. This is expressed using the following n(n 1)/2 equations: u,v : I(u < v) + I(v < u) = 1 (7) 3

4 Note that (7) means that half of the I(u < v) variables are effectively redundant. However, it is more convenient to work with the full complement of I(u < v) variables. This does not introduce inefficiency since the IP solver will detect this redundancy and simplify the representation of constraints internally. Note also that in (7) the obvious requirement that u v has not been explicitly stated. This notational convenience will be used throughout: whenever a constraint depends on more than one individual these individuals will be distinct, but this will not be made explicit. The total order must be transitive (if u < v, v < w then u < w). This can be represented by the following n(n 1)(n 2)/3 constraints: u,v,w : 1 I(u < v) + I(v < w) + I(w < u) 2 (8) Finally, the constraint that parents are older than their children needs to be expressed: u,v : I(u < v) W:u W I(W v) (9) To see that (9) expresses this relation, suppose that u is a parent of v. In that case exactly one of the I(W v) on the RHS of (9) is 1 and thus the RHS is 1. To satisfy the inequality I(u < v) must also be Ruling out cycles with generation variables An alternative approach associates a generation number with each individual in a pedigree. For a founder this number is zero. For any other individual the generation number is the length of the longest path from a founder to the individual. Let m denote the maximum generation number which is a value set by the user to reflect any prior knowledge. In the absence of such knowledge m takes its maximal value n 1. Let gen(v) denote the generation number of individual v. It is not difficult to see that if u is a parent of v then gen(v) gen(u) + 1. This leads to the following set of n(n 1) constraints: u,v : gen(v) gen(u) m + (m + 1) W:u W I(W v) (10) To understand (10) observe that if u is not a parent of v then the sum on the RHS is zero and the constraint becomes vacuous. If u is a parent of v then the sum is 1 and so the entire RHS becomes 1, effecting the desired constraint. To see that (10) suffices to rule out cycles note that if w is an ancestor of v then gen(w) < gen(v). If a cycle obtains, at least two individuals are their own ancestors and the obvious inconsistency arises. Thus as long as (10) is respected no cycles are possible. Note that (10) does not fix the values of the gen(v) variables to their correct values. To see this, suppose that n were, say, 10 and m set to 9, reflecting an absence of domain knowledge. Suppose that an optimal pedigree were found with the highest valued generation variable having value 5. It would be possible to increase each generation variable by one without violating (10). If our only concern is to rule out cycles this is not a problem, but if it is necessary to ensure that the gen(v) variables take on their correct values then additional constraints placing upper bounds on gen(v) are required Ensuring sex-consistency Any instantiation of the I(W v) variables satisfying constraints (6 9) or alternatively (10) will specify an acyclic directed graph where each vertex has at most two parents, but not all such graphs represent pedigrees. It is also necessary to ensure that a sex can be assigned to each individual in a consistent manner. An example of an acyclic directed graph where this is not the case can be seen in Fig 1. Note that this example was also given by [5]. Call such pedigrees sex-inconsistent. 4

5 u 1 u 2 u 3 v 1 v 3 v 2 Figure 1: A sex-inconsistent pedigree. It is not possible to consistently assign a sex to each individual. To rule out sex-inconsistent pedigrees another n auxiliary binary variables I f (v) are created. I f (v) = 1 states that individual v is a female. Constraint (11) states that if an individual v has two parents at most one is female and constraint (12) states that at least one is female. Note that in both cases, if I({u,w} v) = 0 then the constraints are vacuously satisfied. u,v,w : I({u,w} v) + I f (u) + I f (w) 2 (11) u,v,w : I({u,w} v) I f (u) I f (w) 0 (12) With all these constraints in place, the maximum likelihood pedigree reconstruction problem can be restated as follows: Maximise: v,w logα(v,w)i(w v) subject either to (6 9,11,12) or (6,10,11,12). (13) 4 Results All results shown here were produced using a 3GHz dual-core Linux machine with the Gurobi IP solver [8]. A number of tests (not reported here) have also been done with SCIP [1] which produced respectable running times which were nonetheless clearly longer than those produced by Gurobi (which automatically parallelises solving on multi-core machines). Only synthetic genotype data sampled from test pedigrees has been used. This sampling process mimics the inheritance of genotypes from parent to offspring, which is probabilistic, and thus different datasets will be sampled from a given pedigree depending on which random seed is being used. Data was created in this way using the the C++ program pedsim used to produce the results in [5]. pedsim was also used to compute the log conditional probabilities logα(v,w) from each of these synthetic datasets, and then to remove logα(v,w) scores where there exists a higher logα(v,w ) score with W W. Such scores and their accompanying I(W v) variables are not needed since in such a case v would never have parents W in the maximum likelihood pedigree. A Python script (available on request from the author) was used to read in the (filtered) log α(v,w) scores from pedsim. Gurobi s Python interface was then used to define and solve the optimisation problem (13). The following sections present results for a number of synthetic pedigree reconstruction problems. 4.1 Pedigree reconstruction for 20 individuals using a total order The data for this experiment was chosen to be the same (modulo sampling variation) as that of one of Cowell s [5]. The same software (pedsim) was used to generate genotypic data from one of Cowell s 5

6 Figure 2: Pedigree of 20 individuals. This pedigree is [5, Fig 3]. test pedigrees (Fig 2). Ten marker loci were used. Although the data is synthetic, the markers correspond to real ones. Allele values and founder allele frequencies were taken from [4]. A thousand datasets were sampled and the time taken to find a maximum likelihood sex-consistent pedigree in each case was recorded. Total order constraints were used in each case. The mean solving time was 0.65 seconds with the slowest run taking 17.3 seconds. Only 8 runs took longer than 4 seconds. 4.2 Pedigree reconstruction for 46 individuals using a total order An experiment identical to that described in Section 4.1 except using a test pedigree of 46 individuals was then carried out. This pedigree was created by editing the source of Cowell s pedsim program and is displayed in Fig 3. Only 10 runs were attempted since solving time is substantially higher than for the 20 individuals case. The solving times for the first 9 runs were, in decreasing order: 20,566s, 7,470s, 4,266s, 1,175s, 722s, 669s, 115s, 59s and 51s. The 10th run was abandoned after failing to identify the maximum likelihood pedigree after 44,135s. The very high variation in solving times is notable. Most datasets produced optimisation problems which could be solved in a reasonable time, but in some cases solving was unacceptably lengthy. Problems of this size are too large for the approach of [5]. Two further experiments were conducted for the 46-individual pedigree. In the first an extra constraint specifying that there must be at least 20 founders in the pedigree was added. 80 runs were done with this extra constraint. The mean solving time was 18 seconds with 75% of runs below 20 seconds and the slowest taking 128 seconds. In the second experiment a more reasonable constraint on founders was used: that the number of founders was between 10 and runs were done with this constraint. The mean solving time was 34s, with 75% within 30s and the slowest taking 613 seconds. 4.3 Pedigree reconstruction for 46 individuals using generation variables Although using a total order to rule out cycles in pedigrees produced acceptable results for small numbers of individuals, it is clear that for bigger problems finding a maximum likelihood pedigree is unacceptably slow. Fortunately, switching to using generation variables to rule out cycles as described in Section results in a significant speed up. In an initial experiment 100 synthetic datasets were generated from the 46-individual pedigree shown in Fig runs completed successfully, taking a mean time of 432 seconds, but a median time of only 6

7 m1 f2 m3 f4 f6 m5 f7 m8 f9 m6 m9 f11 m10 m20 f19 f21 f23 m22 m24 f12 m13 f14 m19 f20 f31 m32 f33 m34 m30 m17 f18 f16 m15 f43 m44 f47 m46 f45 m42 f41 f57 f59 m58 m54 m56 f55 Figure 3: Test pedigree of 46 individuals. 2.2 seconds. The distribution of solving times for these 19 runs was thus highly skewed with the five longest runs taking 7349, 810, 20, 9 and 4 seconds and each of the 6 quickest taking less than a second. However, on the 20th run, Gurobi ran out of memory. This problem could probably be addressed by instructing Gurobi to use the hard disk when (RAM) memory is exhausted, but this would lead to much slower solving. Instead an extra constraint was added in the hope of both speeding up solving and reducing memory consumption. This constraint stated that in each pedigree there is at least one founder. Since this is always true (due to the acyclicity of pedigrees) a maximum likelihood pedigree will still be returned, but hopefully more quickly. This founder constraint 7

8 LB Time in seconds Likelihood e e e e e e e e e e e+02 Table 1: Solving times for different lower bounds on the number of founders. Likelihood is the likelihood of a maximum likelihood pedigree with the given bound. is formally expressed as follows: I({} v) 1 (14) v With the founder constraint added 100 runs were attempted (i.e. 100 datasets were simulated and maximum likelihood pedigrees were found for each) and all completed successfully. The mean solving time was 195 seconds and the median was 3.8 seconds. As usual there was therefore a highly skewed distribution of solving times with the ten slowest runs taking the following number of seconds: 8181, 7361, 1638, 830, 309, 249, 127, 118, 110 and 45. To investigate the effect of increasing the lower bound on the number of founders above one, a particular dataset simulated from the pedigree in Fig 3 was used. This dataset was chosen since it is one of the harder ones resulting in reasonably long solving times. A maximum likelihood pedigree for this data is shown in Fig 4. As Table 1 makes clear, increasing the lower bound on the number of founders makes a big difference in the time it takes to find a maximum likelihood pedigree. Note, from Fig 4, that at least one maximum likelihood pedigree has 8 founders (m1, f2, m3, f4, f9, m6, f41 and m30). Using 8 as a lower bound on the number of founders solving takes only 3 seconds. Higher lower bounds reduce the solving time further but the pedigree returned is no longer of maximum likelihood as shown by the third column in Table 1. Importantly, raising the lower bound from 0 (which amounts to removing the constraint) to 1 reduces the solving time drastically. Also, interestingly, using lower bounds of 2, 3 or 4 actually increases the solving time compared to a lower bound of 1, but all are still quicker than using no lower bound. 4.4 Pedigree reconstruction for 59 individuals using generation variables An experiment was done to provide as direct a comparison as possible with Almudevar s simulated annealing approach [2]. Datasets were simulated from Almudevar s 59 individual pedigree [2, Fig 2] and as in that paper ten marker loci were used each with 8 equally frequent alleles. Generation variables were used to rule out cycles and the (always admissible) constraint that there is at least one founder was used. Maximum likelihood pedigrees were obtained from 1000 simulated datasets. The mean solving time was seconds, the median and 75% of runs completed within seconds. A few much longer runs occurred, with one of length seconds. The distribution of the 1000 solving 8

9 m1 f2 m3 f4 f6 m5 f7 m8 f9 m6 m9 f11 m10 m20 f19 f21 f23 m22 m24 f12 m13 f14 m19 f20 f31 m32 f33 m34 m30 m17 f18 f16 m15 f43 m44 f47 m46 f45 m42 f41 f57 f59 m58 m54 m56 f55 Figure 4: A maximum likelihood pedigree of 46 individuals for a particular dataset simulated from the pedigree in Fig 3 times is shown as a box plot in Fig 5 It is notable that solving times are substantially faster on this 59 individual pedigree than for the 46 individual pedigree previously discussed. This is most probably due to the assumption of equally frequent (i.e. equally probable) alleles for each of the ten marker loci. This means that genotypic data is more informative than is the case where the distribution is (realistically) skewed as is the case with Cowell s ten marker loci data (Markus Riester, personal communication). Comparing running times to Almudevar [2], there it is stated that for the quickest configuration the time taken for the simulated annealing algorithm to converge was approximately 6.6 min using a standard personal computer. Note also that simulated annealing does not guarantee that the pedigree found has maximal likelihood. 9

10 Solving time (seconds) Figure 5: Boxplot representation of the distribution of 1000 maximum likelihood pedigree reconstruction solving times for datasets simulated from Almudevar s 59 individual pedigree. 5 Conclusions and future work Results on finding pedigrees which are guaranteed to have maximal likelihood using IP have been presented in this paper. The results compare favourably with others in the literature as regards scalability, speed and ensuring sex-consistency (however Riester et al [11] report on reconstructing pedigrees from thousands of individuals using simulated annealing). In this paper the focus is on how best to do maximum likelihood pedigree reconstruction, but there is also the entirely distinct question of whether maximising likelihood is the best way to reconstruct pedigrees. With large amounts of data maximum likelihood usually provides a reasonable estimate of the true pedigree. So, for example, comparing the 1000 maximum likelihood pedigrees discussed in Section 4.4 to the true data-generating pedigree [2, Fig 2] we find that 533 of them are exactly equal to the true pedigree. The full distribution of parent assignment errors is shown in Fig 6. Nonetheless the alternative Bayesian approach allows the incorporation of domain knowledge and allows a principled way of quantifying the uncertainty inherent in pedigree reconstruction (model uncertainty). In an IP formulation the prior distribution is represented by incorporating extra terms in the objective function. Model uncertainty is addressed by finding many distinct high probability pedigrees rather than returning a single one. Work is currently underway on such a Bayesian approach. Acknowledgements Thanks to Robert Cowell, Nuala Sheehan and Markus Riester for useful expertise on pedigrees. Many thanks to Robert Cowell for supplying the pedsim software. Thanks also to the anonymous reviewers for their comments and suggestions. References [1] Tobias Achterberg. Constraint Integer Programming. PhD thesis, TU Berlin, July

11 533 Frequency Error Figure 6: Distribution of parent assignment errors for 1000 pedigrees constructed from data generated from the true pedigree [2, Fig 2] [2] Anthony Almudevar. A simulated annealing algorithm for maximum likelihood pedigree reconstruction. Theoretical Population Biology, 63:63 75, [3] Anthony Almudevar. A graphical approach to relatedness inference. Theoretical Population Biology, 71: , [4] J.M. Butler, R. Schoske, P.M. Vallone, J.W. Redman, and M.C. Kline. Allele frequencies for 15 autosomal STR loci on U.S. Caucasian, African American and Hispanic populations. Journal of Forensic Sciences, 48(4), [5] Robert G. Cowell. Efficient maximum likelihood pedigree reconstruction. Theoretical Popluation Biology, 76(4): , December [6] James. Bayesian network learning by compiling to weighted MAX-SAT. In Proceedings of the 24th Conference on Uncertainty in Artificial Intelligence (UAI 2008), pages , Helsinki, AUAI Press. [7] T. Egeland, P. F. Mostad, B. Mevåg, and M. Stenersen. Beyond traditional paternity and identification cases: Selecting the most probable pedigree. Forensic Science International, 110:47 59, [8] Gurobi Optimization Inc. Gurobi Optimizer Reference Manual, Version 3.0. [9] Steffen L. Lauritzen and Nuala A. Sheehan. Graphical models for genetic analyses. Statistical Science, 18(4): , [10] Panagiotis Manolios, Marc Galceran Oms, and Sergi Oliva Valls. Checking pedigree consistency with PCS. In Thirteenth International Conference on Tools and Algorithms for the Construction and Analysis of Systems (TACAS 2007), number 4424 in LNCS, pages Springer, [11] Markus Riester, Peter F. Stadler, and Konstantin Klemm. FRANz: reconstruction of wild multi-generation pedigrees. Bioinformatics, 25(16): , [12] Marti Sanchez, Simon de Givry, and Thomas Schiex. Mendelian error detection in complex pedigrees using weighted constraint satisfaction techniques. Constraints, 13: , [13] N A Sheehan and T Egeland. Structured incorporation of prior information in relationship indentification problems. Annals of Human Genetics, 71: ,

Pedigree Reconstruction using Identity by Descent

Pedigree Reconstruction using Identity by Descent Pedigree Reconstruction using Identity by Descent Bonnie Kirkpatrick Electrical Engineering and Computer Sciences University of California at Berkeley Technical Report No. UCB/EECS-2010-43 http://www.eecs.berkeley.edu/pubs/techrpts/2010/eecs-2010-43.html

More information

Lecture 1: Introduction to pedigree analysis

Lecture 1: Introduction to pedigree analysis Lecture 1: Introduction to pedigree analysis Magnus Dehli Vigeland NORBIS course, 8 th 12 th of January 2018, Oslo Outline Part I: Brief introductions Pedigrees symbols and terminology Some common relationships

More information

Methods of Parentage Analysis in Natural Populations

Methods of Parentage Analysis in Natural Populations Methods of Parentage Analysis in Natural Populations Using molecular markers, estimates of genetic maternity or paternity can be achieved by excluding as parents all adults whose genotypes are incompatible

More information

Gene coancestry in pedigrees and populations

Gene coancestry in pedigrees and populations Gene coancestry in pedigrees and populations Thompson, Elizabeth University of Washington, Department of Statistics Box 354322 Seattle, WA 98115-4322, USA E-mail: eathomp@uw.edu Glazner, Chris University

More information

Two-point linkage analysis using the LINKAGE/FASTLINK programs

Two-point linkage analysis using the LINKAGE/FASTLINK programs 1 Two-point linkage analysis using the LINKAGE/FASTLINK programs Copyrighted 2018 Maria Chahrour and Suzanne M. Leal These exercises will introduce the LINKAGE file format which is the standard format

More information

Coalescence. Outline History. History, Model, and Application. Coalescence. The Model. Application

Coalescence. Outline History. History, Model, and Application. Coalescence. The Model. Application Coalescence History, Model, and Application Outline History Origins of theory/approach Trace the incorporation of other s ideas Coalescence Definition and descriptions The Model Assumptions and Uses Application

More information

ICMP DNA REPORTS GUIDE

ICMP DNA REPORTS GUIDE ICMP DNA REPORTS GUIDE Distribution: General Sarajevo, 16 th December 2010 GUIDE TO ICMP DNA REPORTS 1. Purpose of This Document 1. The International Commission on Missing Persons (ICMP) endeavors to secure

More information

Large scale kinship:familial Searching and DVI. Seoul, ISFG workshop

Large scale kinship:familial Searching and DVI. Seoul, ISFG workshop Large scale kinship:familial Searching and DVI Seoul, ISFG workshop 29 August 2017 Large scale kinship Familial Searching: search for a relative of an unidentified offender whose profile is available in

More information

DNA: Statistical Guidelines

DNA: Statistical Guidelines Frequency calculations for STR analysis When a probative association between an evidence profile and a reference profile is made, a frequency estimate is calculated to give weight to the association. Frequency

More information

SNP variant discovery in pedigrees using Bayesian networks. Amit R. Indap

SNP variant discovery in pedigrees using Bayesian networks. Amit R. Indap SNP variant discovery in pedigrees using Bayesian networks Amit R. Indap 1 1 Background Next generation sequencing technologies have reduced the cost and increased the throughput of DNA sequencing experiments

More information

DNA Basics, Y DNA Marker Tables, Ancestral Trees and Mutation Graphs: Definitions, Concepts, Understanding

DNA Basics, Y DNA Marker Tables, Ancestral Trees and Mutation Graphs: Definitions, Concepts, Understanding DNA Basics, Y DNA Marker Tables, Ancestral Trees and Mutation Graphs: Definitions, Concepts, Understanding by Dr. Ing. Robert L. Baber 2014 July 26 Rights reserved, see the copyright notice at http://gengen.rlbaber.de

More information

Kenneth Nordtvedt. Many genetic genealogists eventually employ a time-tomost-recent-common-ancestor

Kenneth Nordtvedt. Many genetic genealogists eventually employ a time-tomost-recent-common-ancestor Kenneth Nordtvedt Many genetic genealogists eventually employ a time-tomost-recent-common-ancestor (TMRCA) tool to estimate how far back in time the common ancestor existed for two Y-STR haplotypes obtained

More information

An Optimal Algorithm for Automatic Genotype Elimination

An Optimal Algorithm for Automatic Genotype Elimination Am. J. Hum. Genet. 65:1733 1740, 1999 An Optimal Algorithm for Automatic Genotype Elimination Jeffrey R. O Connell 1,2 and Daniel E. Weeks 1 1 Department of Human Genetics, University of Pittsburgh, Pittsburgh,

More information

DAR POLICY STATEMENT AND BACKGROUND Using DNA Evidence for DAR Applications

DAR POLICY STATEMENT AND BACKGROUND Using DNA Evidence for DAR Applications Effective January 1, 2014, DAR will begin accepting Y-DNA evidence in support of new member applications and supplemental applications as one element in a structured analysis. This analysis will use a

More information

AFDAA 2012 WINTER MEETING Population Statistics Refresher Course - Lecture 3: Statistics of Kinship Analysis

AFDAA 2012 WINTER MEETING Population Statistics Refresher Course - Lecture 3: Statistics of Kinship Analysis AFDAA 2012 WINTER MEETING Population Statistics Refresher Course - Lecture 3: Statistics of Kinship Analysis Ranajit Chakraborty, PhD Center for Computational Genomics Institute of Applied Genetics Department

More information

Population Structure and Genealogies

Population Structure and Genealogies Population Structure and Genealogies One of the key properties of Kingman s coalescent is that each pair of lineages is equally likely to coalesce whenever a coalescent event occurs. This condition is

More information

Chapter 2: Genes in Pedigrees

Chapter 2: Genes in Pedigrees Chapter 2: Genes in Pedigrees Chapter 2-0 2.1 Pedigree definitions and terminology 2-1 2.2 Gene identity by descent (ibd) 2-5 2.3 ibd of more than 2 genes 2-14 2.4 Data on relatives 2-21 2.1.1 GRAPHICAL

More information

Inbreeding and self-fertilization

Inbreeding and self-fertilization Inbreeding and self-fertilization Introduction Remember that long list of assumptions associated with derivation of the Hardy-Weinberg principle that we just finished? Well, we re about to begin violating

More information

Midterm Examination. CSCI 561: Artificial Intelligence

Midterm Examination. CSCI 561: Artificial Intelligence Midterm Examination CSCI 561: Artificial Intelligence October 10, 2002 Instructions: 1. Date: 10/10/2002 from 11:00am 12:20 pm 2. Maximum credits/points for this midterm: 100 points (corresponding to 35%

More information

Popstats Parentage Statistics Strength of Genetic Evidence In Parentage Testing

Popstats Parentage Statistics Strength of Genetic Evidence In Parentage Testing Popstats Parentage Statistics Strength of Genetic Evidence In Parentage Testing Arthur J. Eisenberg, Ph.D. Director DNA Identity Laboratory UNT-Health Science Center eisenber@hsc.unt.edu PATERNITY TESTING

More information

Algorithms for Genetics: Basics of Wright Fisher Model and Coalescent Theory

Algorithms for Genetics: Basics of Wright Fisher Model and Coalescent Theory Algorithms for Genetics: Basics of Wright Fisher Model and Coalescent Theory Vineet Bafna Harish Nagarajan and Nitin Udpa 1 Disclaimer Please note that a lot of the text and figures here are copied from

More information

CCO Commun. Comb. Optim.

CCO Commun. Comb. Optim. Communications in Combinatorics and Optimization Vol. 2 No. 2, 2017 pp.149-159 DOI: 10.22049/CCO.2017.25918.1055 CCO Commun. Comb. Optim. Graceful labelings of the generalized Petersen graphs Zehui Shao

More information

Inbreeding and self-fertilization

Inbreeding and self-fertilization Inbreeding and self-fertilization Introduction Remember that long list of assumptions associated with derivation of the Hardy-Weinberg principle that I went over a couple of lectures ago? Well, we re about

More information

A GRAPH THEORETICAL APPROACH TO SOLVING SCRAMBLE SQUARES PUZZLES. 1. Introduction

A GRAPH THEORETICAL APPROACH TO SOLVING SCRAMBLE SQUARES PUZZLES. 1. Introduction GRPH THEORETICL PPROCH TO SOLVING SCRMLE SQURES PUZZLES SRH MSON ND MLI ZHNG bstract. Scramble Squares puzzle is made up of nine square pieces such that each edge of each piece contains half of an image.

More information

Detection of Misspecified Relationships in Inbred and Outbred Pedigrees

Detection of Misspecified Relationships in Inbred and Outbred Pedigrees Detection of Misspecified Relationships in Inbred and Outbred Pedigrees Lei Sun 1, Mark Abney 1,2, Mary Sara McPeek 1,2 1 Department of Statistics, 2 Department of Human Genetics, University of Chicago,

More information

TDT vignette Use of snpstats in family based studies

TDT vignette Use of snpstats in family based studies TDT vignette Use of snpstats in family based studies David Clayton April 30, 2018 Pedigree data The snpstats package contains some tools for analysis of family-based studies. These assume that a subject

More information

UNIVERSITY of PENNSYLVANIA CIS 391/521: Fundamentals of AI Midterm 1, Spring 2010

UNIVERSITY of PENNSYLVANIA CIS 391/521: Fundamentals of AI Midterm 1, Spring 2010 UNIVERSITY of PENNSYLVANIA CIS 391/521: Fundamentals of AI Midterm 1, Spring 2010 Question Points 1 Environments /2 2 Python /18 3 Local and Heuristic Search /35 4 Adversarial Search /20 5 Constraint Satisfaction

More information

Linkage Analysis in Merlin. Meike Bartels Kate Morley Danielle Posthuma

Linkage Analysis in Merlin. Meike Bartels Kate Morley Danielle Posthuma Linkage Analysis in Merlin Meike Bartels Kate Morley Danielle Posthuma Software for linkage analyses Genehunter Mendel Vitesse Allegro Simwalk Loki Merlin. Mx R Lisrel MERLIN software Programs: MERLIN

More information

Automated Discovery of Pedigrees and Their Structures in Collections of STR DNA Specimens Using a Link Discovery Tool

Automated Discovery of Pedigrees and Their Structures in Collections of STR DNA Specimens Using a Link Discovery Tool University of Tennessee, Knoxville Trace: Tennessee Research and Creative Exchange Masters Theses Graduate School 5-2010 Automated Discovery of Pedigrees and Their Structures in Collections of STR DNA

More information

Lecture 6: Inbreeding. September 10, 2012

Lecture 6: Inbreeding. September 10, 2012 Lecture 6: Inbreeding September 0, 202 Announcements Hari s New Office Hours Tues 5-6 pm Wed 3-4 pm Fri 2-3 pm In computer lab 3306 LSB Last Time More Hardy-Weinberg Calculations Merle Patterning in Dogs:

More information

Laboratory 1: Uncertainty Analysis

Laboratory 1: Uncertainty Analysis University of Alabama Department of Physics and Astronomy PH101 / LeClair May 26, 2014 Laboratory 1: Uncertainty Analysis Hypothesis: A statistical analysis including both mean and standard deviation can

More information

Alternation in the repeated Battle of the Sexes

Alternation in the repeated Battle of the Sexes Alternation in the repeated Battle of the Sexes Aaron Andalman & Charles Kemp 9.29, Spring 2004 MIT Abstract Traditional game-theoretic models consider only stage-game strategies. Alternation in the repeated

More information

BIOINFORMATICS ORIGINAL PAPER

BIOINFORMATICS ORIGINAL PAPER BIOINFORMATICS ORIGINAL PAPER Vol. 25 no. 6 29, pages 234 239 doi:.93/bioinformatics/btp64 Genetics and population analysis FRANz: reconstruction of wild multi-generation pedigrees Markus Riester,, Peter

More information

On identification problems requiring linked autosomal markers

On identification problems requiring linked autosomal markers * Title Page (with authors & addresses) On identification problems requiring linked autosomal markers Thore Egeland a Nuala Sheehan b a Department of Medical Genetics, Ulleval University Hospital, 0407

More information

Optimum contribution selection conserves genetic diversity better than random selection in small populations with overlapping generations

Optimum contribution selection conserves genetic diversity better than random selection in small populations with overlapping generations Optimum contribution selection conserves genetic diversity better than random selection in small populations with overlapping generations K. Stachowicz 12*, A. C. Sørensen 23 and P. Berg 3 1 Department

More information

Meta-Heuristic Approach for Supporting Design-for- Disassembly towards Efficient Material Utilization

Meta-Heuristic Approach for Supporting Design-for- Disassembly towards Efficient Material Utilization Meta-Heuristic Approach for Supporting Design-for- Disassembly towards Efficient Material Utilization Yoshiaki Shimizu *, Kyohei Tsuji and Masayuki Nomura Production Systems Engineering Toyohashi University

More information

Past questions from the last 6 years of exams for programming 101 with answers.

Past questions from the last 6 years of exams for programming 101 with answers. 1 Past questions from the last 6 years of exams for programming 101 with answers. 1. Describe bubble sort algorithm. How does it detect when the sequence is sorted and no further work is required? Bubble

More information

Supporting Online Material for

Supporting Online Material for www.sciencemag.org/cgi/content/full/1122655/dc1 Supporting Online Material for Finding Criminals Through DNA of Their Relatives Frederick R. Bieber,* Charles H. Brenner, David Lazer *Author for correspondence.

More information

Objective: Why? 4/6/2014. Outlines:

Objective: Why? 4/6/2014. Outlines: Objective: Develop mathematical models that quantify/model resemblance between relatives for phenotypes of a quantitative trait : - based on pedigree - based on markers Outlines: Causal model for covariances

More information

Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks!

Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks! Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks! Paul has many great tools for teaching phylogenetics at his web site: http://hydrodictyon.eeb.uconn.edu/people/plewis

More information

COMBINATORIAL RECONSTRUCTION OF HALF-SIBLING GROUPS

COMBINATORIAL RECONSTRUCTION OF HALF-SIBLING GROUPS COMBINATORIAL RECONSTRUCTION OF HALF-SIBLING GROUPS Saad I. Sheikh, Tanya Y. Berger-Wolf, Ashfaq A. Khokhar Dept. of Computer Science, University of Illinois at Chicago, 851 S. Morgan St (M/C 152), Chicago,

More information

Game Theory and Randomized Algorithms

Game Theory and Randomized Algorithms Game Theory and Randomized Algorithms Guy Aridor Game theory is a set of tools that allow us to understand how decisionmakers interact with each other. It has practical applications in economics, international

More information

Illumina GenomeStudio Analysis

Illumina GenomeStudio Analysis Illumina GenomeStudio Analysis Paris Veltsos University of St Andrews February 23, 2012 1 Introduction GenomeStudio is software by Illumina used to score SNPs based on the Illumina BeadExpress platform.

More information

Exercise 4 Exploring Population Change without Selection

Exercise 4 Exploring Population Change without Selection Exercise 4 Exploring Population Change without Selection This experiment began with nine Avidian ancestors of identical fitness; the mutation rate is zero percent. Since descendants can never differ in

More information

Scheduling. Radek Mařík. April 28, 2015 FEE CTU, K Radek Mařík Scheduling April 28, / 48

Scheduling. Radek Mařík. April 28, 2015 FEE CTU, K Radek Mařík Scheduling April 28, / 48 Scheduling Radek Mařík FEE CTU, K13132 April 28, 2015 Radek Mařík (marikr@fel.cvut.cz) Scheduling April 28, 2015 1 / 48 Outline 1 Introduction to Scheduling Methodology Overview 2 Classification of Scheduling

More information

Supervisory Control for Cost-Effective Redistribution of Robotic Swarms

Supervisory Control for Cost-Effective Redistribution of Robotic Swarms Supervisory Control for Cost-Effective Redistribution of Robotic Swarms Ruikun Luo Department of Mechaincal Engineering College of Engineering Carnegie Mellon University Pittsburgh, Pennsylvania 11 Email:

More information

4. Kinship Paper Challenge

4. Kinship Paper Challenge 4. António Amorim (aamorim@ipatimup.pt) Nádia Pinto (npinto@ipatimup.pt) 4.1 Approach After a woman dies her child claims for a paternity test of the man who is supposed to be his father. The test is carried

More information

2 person perfect information

2 person perfect information Why Study Games? Games offer: Intellectual Engagement Abstraction Representability Performance Measure Not all games are suitable for AI research. We will restrict ourselves to 2 person perfect information

More information

KINSHIP ANALYSIS AND HUMAN IDENTIFICATION IN MASS DISASTERS: THE USE OF MDKAP FOR THE WORLD TRADE CENTER TRAGEDY

KINSHIP ANALYSIS AND HUMAN IDENTIFICATION IN MASS DISASTERS: THE USE OF MDKAP FOR THE WORLD TRADE CENTER TRAGEDY 1 KINSHIP ANALYSIS AND HUMAN IDENTIFICATION IN MASS DISASTERS: THE USE OF MDKAP FOR THE WORLD TRADE CENTER TRAGEDY Benoît Leclair 1, Steve Niezgoda 2, George R. Carmody 3 and Robert C. Shaler 4 1 Myriad

More information

What are the chances?

What are the chances? What are the chances? Student Worksheet 7 8 9 10 11 12 TI-Nspire Investigation Student 90 min Introduction In probability, we often look at likelihood of events that are influenced by chance. Consider

More information

CONGEN. Inbreeding vocabulary

CONGEN. Inbreeding vocabulary CONGEN Inbreeding vocabulary Inbreeding Mating between relatives. Inbreeding depression Reduction in fitness due to inbreeding. Identical by descent Alleles that are identical by descent are direct descendents

More information

BayesChess: A computer chess program based on Bayesian networks

BayesChess: A computer chess program based on Bayesian networks BayesChess: A computer chess program based on Bayesian networks Antonio Fernández and Antonio Salmerón Department of Statistics and Applied Mathematics University of Almería Abstract In this paper we introduce

More information

Spring 06 Assignment 2: Constraint Satisfaction Problems

Spring 06 Assignment 2: Constraint Satisfaction Problems 15-381 Spring 06 Assignment 2: Constraint Satisfaction Problems Questions to Vaibhav Mehta(vaibhav@cs.cmu.edu) Out: 2/07/06 Due: 2/21/06 Name: Andrew ID: Please turn in your answers on this assignment

More information

DVA325 Formal Languages, Automata and Models of Computation (FABER)

DVA325 Formal Languages, Automata and Models of Computation (FABER) DVA325 Formal Languages, Automata and Models of Computation (FABER) Lecture 1 - Introduction School of Innovation, Design and Engineering Mälardalen University 11 November 2014 Abu Naser Masud FABER November

More information

Forensic Statistics and Graphical Models (1) Richard Gill Spring Semester 2015

Forensic Statistics and Graphical Models (1) Richard Gill Spring Semester 2015 Forensic Statistics and Graphical Models (1) Richard Gill Spring Semester 2015 http://www.math.leidenuniv.nl/~gill/teaching/graphical Forensic Statistics Distinguish criminal investigation and criminal

More information

Multitree Decoding and Multitree-Aided LDPC Decoding

Multitree Decoding and Multitree-Aided LDPC Decoding Multitree Decoding and Multitree-Aided LDPC Decoding Maja Ostojic and Hans-Andrea Loeliger Dept. of Information Technology and Electrical Engineering ETH Zurich, Switzerland Email: {ostojic,loeliger}@isi.ee.ethz.ch

More information

Outline. Communications Engineering 1

Outline. Communications Engineering 1 Outline Introduction Signal, random variable, random process and spectra Analog modulation Analog to digital conversion Digital transmission through baseband channels Signal space representation Optimal

More information

Fast Placement Optimization of Power Supply Pads

Fast Placement Optimization of Power Supply Pads Fast Placement Optimization of Power Supply Pads Yu Zhong Martin D. F. Wong Dept. of Electrical and Computer Engineering Dept. of Electrical and Computer Engineering Univ. of Illinois at Urbana-Champaign

More information

Primer on Human Pedigree Analysis:

Primer on Human Pedigree Analysis: Primer on Human Pedigree Analysis: Criteria for the selection and collection of appropriate Family Reference Samples John V. Planz. Ph.D. UNT Center for Human Identification Successful Missing Person ID

More information

CS4700 Fall 2011: Foundations of Artificial Intelligence. Homework #2

CS4700 Fall 2011: Foundations of Artificial Intelligence. Homework #2 CS4700 Fall 2011: Foundations of Artificial Intelligence Homework #2 Due Date: Monday Oct 3 on CMS (PDF) and in class (hardcopy) Submit paper copies at the beginning of class. Please include your NetID

More information

A tournament problem

A tournament problem Discrete Mathematics 263 (2003) 281 288 www.elsevier.com/locate/disc Note A tournament problem M.H. Eggar Department of Mathematics and Statistics, University of Edinburgh, JCMB, KB, Mayeld Road, Edinburgh

More information

Developing Conclusions About Different Modes of Inheritance

Developing Conclusions About Different Modes of Inheritance Pedigree Analysis Introduction A pedigree is a diagram of family relationships that uses symbols to represent people and lines to represent genetic relationships. These diagrams make it easier to visualize

More information

Comparative method, coalescents, and the future

Comparative method, coalescents, and the future Comparative method, coalescents, and the future Joe Felsenstein Depts. of Genome Sciences and of Biology, University of Washington Comparative method, coalescents, and the future p.1/36 Correlation of

More information

A GRASP HEURISTIC FOR THE COOPERATIVE COMMUNICATION PROBLEM IN AD HOC NETWORKS

A GRASP HEURISTIC FOR THE COOPERATIVE COMMUNICATION PROBLEM IN AD HOC NETWORKS A GRASP HEURISTIC FOR THE COOPERATIVE COMMUNICATION PROBLEM IN AD HOC NETWORKS C. COMMANDER, C.A.S. OLIVEIRA, P.M. PARDALOS, AND M.G.C. RESENDE ABSTRACT. Ad hoc networks are composed of a set of wireless

More information

COMBINATORIAL RECONSTRUCTION OF HALF-SIBLING GROUPS

COMBINATORIAL RECONSTRUCTION OF HALF-SIBLING GROUPS COMBINATORIAL RECONSTRUCTION OF HALF-SIBLING GROUPS Saad I. Sheikh, Tanya Y. Berger-Wolf, Ashfaq A. Khokhar Department of Computer Science, University of Illinois at Chicago, 851 S. Morgan St (M/C 152),

More information

GEDmatch Home Page The upper left corner of your home page has Information about you and links to lots of helpful information. Check them out!

GEDmatch Home Page The upper left corner of your home page has Information about you and links to lots of helpful information. Check them out! USING GEDMATCH Created March 2015 GEDmatch is a free, non-profit site that accepts raw autosomal data files from Ancestry, FTDNA, and 23andme. As such, it provides a large autosomal database that spans

More information

DNA Parentage Test No Summary Report

DNA Parentage Test No Summary Report Collaborative Testing Services, Inc FORENSIC TESTING PROGRAM DNA Parentage Test No. 16-5870 Summary Report This proficiency test was sent to 27 participants. Each participant received a sample pack consisting

More information

Game Mechanics Minesweeper is a game in which the player must correctly deduce the positions of

Game Mechanics Minesweeper is a game in which the player must correctly deduce the positions of Table of Contents Game Mechanics...2 Game Play...3 Game Strategy...4 Truth...4 Contrapositive... 5 Exhaustion...6 Burnout...8 Game Difficulty... 10 Experiment One... 12 Experiment Two...14 Experiment Three...16

More information

SOLITAIRE CLOBBER AS AN OPTIMIZATION PROBLEM ON WORDS

SOLITAIRE CLOBBER AS AN OPTIMIZATION PROBLEM ON WORDS INTEGERS: ELECTRONIC JOURNAL OF COMBINATORIAL NUMBER THEORY 8 (2008), #G04 SOLITAIRE CLOBBER AS AN OPTIMIZATION PROBLEM ON WORDS Vincent D. Blondel Department of Mathematical Engineering, Université catholique

More information

Complex DNA and Good Genes for Snakes

Complex DNA and Good Genes for Snakes 458 Int'l Conf. Artificial Intelligence ICAI'15 Complex DNA and Good Genes for Snakes Md. Shahnawaz Khan 1 and Walter D. Potter 2 1,2 Institute of Artificial Intelligence, University of Georgia, Athens,

More information

Pedigrees How do scientists trace hereditary diseases through a family history?

Pedigrees How do scientists trace hereditary diseases through a family history? Why? Pedigrees How do scientists trace hereditary diseases through a family history? Imagine you want to learn about an inherited genetic trait present in your family. How would you find out the chances

More information

AI Approaches to Ultimate Tic-Tac-Toe

AI Approaches to Ultimate Tic-Tac-Toe AI Approaches to Ultimate Tic-Tac-Toe Eytan Lifshitz CS Department Hebrew University of Jerusalem, Israel David Tsurel CS Department Hebrew University of Jerusalem, Israel I. INTRODUCTION This report is

More information

A general quadratic programming method for the optimisation of genetic contributions using interior point algorithm. R Pong-Wong & JA Woolliams

A general quadratic programming method for the optimisation of genetic contributions using interior point algorithm. R Pong-Wong & JA Woolliams A general quadratic programming method for the optimisation of genetic contributions using interior point algorithm R Pong-Wong & JA Woolliams Introduction Inbreeding is a risk and it needs to be controlled

More information

Spring 06 Assignment 2: Constraint Satisfaction Problems

Spring 06 Assignment 2: Constraint Satisfaction Problems 15-381 Spring 06 Assignment 2: Constraint Satisfaction Problems Questions to Vaibhav Mehta(vaibhav@cs.cmu.edu) Out: 2/07/06 Due: 2/21/06 Name: Andrew ID: Please turn in your answers on this assignment

More information

Constructions of Coverings of the Integers: Exploring an Erdős Problem

Constructions of Coverings of the Integers: Exploring an Erdős Problem Constructions of Coverings of the Integers: Exploring an Erdős Problem Kelly Bickel, Michael Firrisa, Juan Ortiz, and Kristen Pueschel August 20, 2008 Abstract In this paper, we study necessary conditions

More information

Lectures: Feb 27 + Mar 1 + Mar 3, 2017

Lectures: Feb 27 + Mar 1 + Mar 3, 2017 CS420+500: Advanced Algorithm Design and Analysis Lectures: Feb 27 + Mar 1 + Mar 3, 2017 Prof. Will Evans Scribe: Adrian She In this lecture we: Summarized how linear programs can be used to model zero-sum

More information

On the Benefits of Enhancing Optimization Modulo Theories with Sorting Jul 1, Networks 2016 for 1 / MAXS 31

On the Benefits of Enhancing Optimization Modulo Theories with Sorting Jul 1, Networks 2016 for 1 / MAXS 31 On the Benefits of Enhancing Optimization Modulo Theories with Sorting Networks for MAXSMT Roberto Sebastiani, Patrick Trentin roberto.sebastiani@unitn.it trentin@disi.unitn.it DISI, University of Trento

More information

Coalescent Theory: An Introduction for Phylogenetics

Coalescent Theory: An Introduction for Phylogenetics Coalescent Theory: An Introduction for Phylogenetics Laura Salter Kubatko Departments of Statistics and Evolution, Ecology, and Organismal Biology The Ohio State University lkubatko@stat.ohio-state.edu

More information

Optimal Multicast Routing in Ad Hoc Networks

Optimal Multicast Routing in Ad Hoc Networks Mat-2.108 Independent esearch Projects in Applied Mathematics Optimal Multicast outing in Ad Hoc Networks Juha Leino 47032J Juha.Leino@hut.fi 1st December 2002 Contents 1 Introduction 2 2 Optimal Multicasting

More information

Analysis of geographically structured populations: Estimators based on coalescence

Analysis of geographically structured populations: Estimators based on coalescence Analysis of geographically structured populations: Estimators based on coalescence Peter Beerli Department of Genetics, Box 357360, University of Washington, Seattle WA 9895-7360, Email: beerli@genetics.washington.edu

More information

Workshop on anonymization Berlin, March 19, Basic Knowledge Terms, Definitions and general techniques. Murat Sariyar TMF

Workshop on anonymization Berlin, March 19, Basic Knowledge Terms, Definitions and general techniques. Murat Sariyar TMF Workshop on anonymization Berlin, March 19, 2015 Basic Knowledge Terms, Definitions and general techniques Murat Sariyar TMF Workshop Anonymisation, March 19, 2015 Outline Background Aims of Anonymization

More information

Lecture Notes on Game Theory (QTM)

Lecture Notes on Game Theory (QTM) Theory of games: Introduction and basic terminology, pure strategy games (including identification of saddle point and value of the game), Principle of dominance, mixed strategy games (only arithmetic

More information

Kinship and Population Subdivision

Kinship and Population Subdivision Kinship and Population Subdivision Henry Harpending University of Utah The coefficient of kinship between two diploid organisms describes their overall genetic similarity to each other relative to some

More information

37 Game Theory. Bebe b1 b2 b3. a Abe a a A Two-Person Zero-Sum Game

37 Game Theory. Bebe b1 b2 b3. a Abe a a A Two-Person Zero-Sum Game 37 Game Theory Game theory is one of the most interesting topics of discrete mathematics. The principal theorem of game theory is sublime and wonderful. We will merely assume this theorem and use it to

More information

Population Adaptation for Genetic Algorithm-based Cognitive Radios

Population Adaptation for Genetic Algorithm-based Cognitive Radios Population Adaptation for Genetic Algorithm-based Cognitive Radios Timothy R. Newman, Rakesh Rajbanshi, Alexander M. Wyglinski, Joseph B. Evans, and Gary J. Minden Information Technology and Telecommunications

More information

A GRASP heuristic for the Cooperative Communication Problem in Ad Hoc Networks

A GRASP heuristic for the Cooperative Communication Problem in Ad Hoc Networks MIC2005: The Sixth Metaheuristics International Conference??-1 A GRASP heuristic for the Cooperative Communication Problem in Ad Hoc Networks Clayton Commander Carlos A.S. Oliveira Panos M. Pardalos Mauricio

More information

A new mixed integer linear programming formulation for one problem of exploration of online social networks

A new mixed integer linear programming formulation for one problem of exploration of online social networks manuscript No. (will be inserted by the editor) A new mixed integer linear programming formulation for one problem of exploration of online social networks Aleksandra Petrović Received: date / Accepted:

More information

1 This work was partially supported by NSF Grant No. CCR , and by the URI International Engineering Program.

1 This work was partially supported by NSF Grant No. CCR , and by the URI International Engineering Program. Combined Error Correcting and Compressing Codes Extended Summary Thomas Wenisch Peter F. Swaszek Augustus K. Uht 1 University of Rhode Island, Kingston RI Submitted to International Symposium on Information

More information

On uniquely k-determined permutations

On uniquely k-determined permutations On uniquely k-determined permutations Sergey Avgustinovich and Sergey Kitaev 16th March 2007 Abstract Motivated by a new point of view to study occurrences of consecutive patterns in permutations, we introduce

More information

Genealogical trees, coalescent theory, and the analysis of genetic polymorphisms

Genealogical trees, coalescent theory, and the analysis of genetic polymorphisms Genealogical trees, coalescent theory, and the analysis of genetic polymorphisms Magnus Nordborg University of Southern California The importance of history Genetic polymorphism data represent the outcome

More information

10/5/2015. Constraint Satisfaction Problems. Example: Cryptarithmetic. Example: Map-coloring. Example: Map-coloring. Constraint Satisfaction Problems

10/5/2015. Constraint Satisfaction Problems. Example: Cryptarithmetic. Example: Map-coloring. Example: Map-coloring. Constraint Satisfaction Problems 0/5/05 Constraint Satisfaction Problems Constraint Satisfaction Problems AIMA: Chapter 6 A CSP consists of: Finite set of X, X,, X n Nonempty domain of possible values for each variable D, D, D n where

More information

University of Washington, TOPMed DCC July 2018

University of Washington, TOPMed DCC July 2018 Module 12: Comput l Pipeline for WGS Relatedness Inference from Genetic Data Timothy Thornton (tathornt@uw.edu) & Stephanie Gogarten (sdmorris@uw.edu) University of Washington, TOPMed DCC July 2018 1 /

More information

1) Using the sightings data, determine who moved from one area to another and fill this data in on the data sheet.

1) Using the sightings data, determine who moved from one area to another and fill this data in on the data sheet. Parentage and Geography 5. The Life of Lulu the Lioness: A Heroine s Story Name: Objective Using genotypes from many individuals, determine maternity, paternity, and relatedness among a group of lions.

More information

Determining Relatedness from a Pedigree Diagram

Determining Relatedness from a Pedigree Diagram Kin structure & relatedness Francis L. W. Ratnieks Aims & Objectives Aims 1. To show how to determine regression relatedness among individuals using a pedigree diagram. Social Insects: C1139 2. To show

More information

arxiv: v1 [cs.cc] 21 Jun 2017

arxiv: v1 [cs.cc] 21 Jun 2017 Solving the Rubik s Cube Optimally is NP-complete Erik D. Demaine Sarah Eisenstat Mikhail Rudoy arxiv:1706.06708v1 [cs.cc] 21 Jun 2017 Abstract In this paper, we prove that optimally solving an n n n Rubik

More information

Branch-and-cut for a real-life highly constrained soccer tournament scheduling problem

Branch-and-cut for a real-life highly constrained soccer tournament scheduling problem Branch-and-cut for a real-life highly constrained soccer tournament scheduling problem Guillermo Durán 1, Thiago F. Noronha 2, Celso C. Ribeiro 3, Sebastián Souyris 1, and Andrés Weintraub 1 1 Department

More information

KINALYZER, a computer program for reconstructing sibling groups

KINALYZER, a computer program for reconstructing sibling groups Molecular Ecology Resources (2009) 9, 1127 1131 doi: 10.1111/j.1755-0998.2009.02562.x Blackwell Publishing Ltd COMPUTER PROGRAM NOTE KINALYZER, a computer program for reconstructing sibling groups M. V.

More information

Population Genetics using Trees. Peter Beerli Genome Sciences University of Washington Seattle WA

Population Genetics using Trees. Peter Beerli Genome Sciences University of Washington Seattle WA Population Genetics using Trees Peter Beerli Genome Sciences University of Washington Seattle WA Outline 1. Introduction to the basic coalescent Population models The coalescent Likelihood estimation of

More information

Non-Paternity: Implications and Resolution

Non-Paternity: Implications and Resolution Non-Paternity: Implications and Resolution Michelle Beckwith PTC Labs 2006 AABB HITA Meeting October 8, 2006 Considerations when identifying victims using relatives Identification requires knowledge of

More information

Chapter 1. Probability

Chapter 1. Probability Chapter 1. Probability 1.1 Basic Concepts Scientific method a. For a given problem, we define measures that explains the problem well. b. Data is collected with observation and the measures are calculated.

More information