Estimating Ancient Population Sizes using the Coalescent with Recombination
|
|
- Joella Sullivan
- 5 years ago
- Views:
Transcription
1 Estimating Ancient Population Sizes using the Coalescent with Recombination Sara Sheehan joint work with Kelley Harris and Yun S. Song May 26, 2012 Sheehan, Harris, Song May 26,
2 Motivation Introduction Out of Africa bottleneck 1 Humans 2 Neanderthals 3 Early Hominids Sheehan, Harris, Song May 26,
3 Introduction Effects of population size variation Sheehan, Harris, Song May 26,
4 Introduction Effects of population size variation Sheehan, Harris, Song May 26,
5 Introduction How can we infer past population sizes? The population size at time t is inversely proportional to the rate of coalescence at time t Sheehan, Harris, Song May 26,
6 Introduction How can we infer past population sizes? The population size at time t is inversely proportional to the rate of coalescence at time t Coalescence times are unknown, but are correlated with mutation frequencies: Sheehan, Harris, Song May 26,
7 Adding recombination Introduction Model Genealogies of nearby loci are correlated with each other Use mutations to learn about the ancestral recombination graph Sheehan, Harris, Song May 26, 2012 Sheehan, Harris, Song May 26,
8 Introduction TMRCA changes throughout the genome Previous work: Pairwise Sequentially Markovian Coalescent (PSMC), Li and Durbin 2011 uses 2 haplotypes Sheehan, Harris, Song May 26,
9 Model Conditional Sampling Distribution Data: multiple sequence alignment of haplotypes h 1 = CCATACCTGGTCAATTTTTGTATTTGAAGTAGAGACG h 2 = CCAGACCTAGTCAATTTTTGTATTTGAAGTAGAGACG h 3 = CCAGACCTGGTCAATTTCTGTATTTTTAGTAGAGACG h 4 = CCATACCTAGTCAATTTTTGTATTTTTAGTAGCCACG Sheehan, Harris, Song May 26,
10 Model Conditional Sampling Distribution Data: multiple sequence alignment of haplotypes h 1 = CCATACCTGGTCAATTTTTGTATTTGAAGTAGAGACG h 2 = CCAGACCTAGTCAATTTTTGTATTTGAAGTAGAGACG h 3 = CCAGACCTGGTCAATTTCTGTATTTTTAGTAGAGACG h 4 = CCATACCTAGTCAATTTTTGTATTTTTAGTAGCCACG Want to compute: likelihood of our data given past population size function N(t) P(h 1, h 2, h 3,, h n, h n+1 N(t)) Sheehan, Harris, Song May 26,
11 Model Conditional Sampling Distribution Data: multiple sequence alignment of haplotypes h 1 = CCATACCTGGTCAATTTTTGTATTTGAAGTAGAGACG h 2 = CCAGACCTAGTCAATTTTTGTATTTGAAGTAGAGACG h 3 = CCAGACCTGGTCAATTTCTGTATTTTTAGTAGAGACG h 4 = CCATACCTAGTCAATTTTTGTATTTTTAGTAGCCACG Want to compute: likelihood of our data given past population size function N(t) P(h 1, h 2, h 3,, h n, h n+1 N(t)) Can obtain via the CSD: P(h n+1 h 1, h 2,, h n, N(t)) Sheehan, Harris, Song May 26,
12 roduction Background Sequential Model interpretation Discretization Resu Conditional Sampling Conditional Distribution genealogy C G n Suppose conditional we observe genealogy the C(hidden) genealogy G n for h n. Construct a conditional (hidden) genealogy genealogy G C for a single haplotype: n for h 1,, h n Coalescence, mutation, and recombination as usual. Paul, Steinrücken, and Song (2011) Absorption of lineage in C into a lineage in G n at rate 1. Sheehan, Harris, Song May 26,
13 equential interpretation Discretization Model Results Introduction Background Sequential interpretation Discretization Model Conditional Sampling Distribution ional genealogy A realization of ˆπ PS G n G n (h n ) den) Simplification genealogy An example I: Greplace n for hrealization the n. Construct distributionaof coalescent genealogies with a a single trunk haplotype: Assume genealogy G n = Gn (h n). and recombination Construct as usual. conditional Paul, Steinrücken, genealogy and Song (2011) C, with absorption at rate 1/2 C into a lineage Sheehan, Harris, in GSong n at rate 1. May 26,
14 Model Introduction Background Sequential interpretation Discretization Results Conditional Sampling Distribution Marginal conditional genealogies t (a) h (a) t (b) h (b) C G n(h n ) Simplification II: (Sequentially Markov assumption) If t 1,..., t l is the sequence of times at which segments of h n+1 coalesce with the trunk genealogy, then The marginal conditional genealogy S l at locus l is described by: 1 the absorption P(t l time t l 1 T, t l 2 l,,..., t 1 ) = P(t l t l 1 ) 2 the absorption haplotype H l. Paul, Steinrücken, and Song (2011) Sheehan, Harris, Song May 26,
15 Model Introduction Background Sequential interpretation Discretization Results Conditional Sampling Distribution Marginal conditional genealogies t (a) h (a) S 1 t (b) h (b) C G n(h n ) C G n(h n ) Marginal conditional S 1 =(T 1, H 1 )= ( genealogy t (1), h (1)) at locus 1 (red lines): absorption time = t (a) The absorption marginal haplotype conditional = h genealogy (a) S l at locus l is described by: 1 the absorption time T l, Paul, Steinrücken, and Song (2011) 2 the absorption haplotype H l. Sheehan, Harris, Song May 26,
16 Model Introduction Background Sequential interpretation Discretization Results Conditional Sampling Distribution Marginal conditional genealogies t (a) h (a) S 1 t (b) h (b) S 2 C G n(h n ) C G n(h n ) Marginal conditional S 1 =(T 1, H 1 )= ( genealogy t (1), h (1)) at locus 2 (red lines): S 2 =(T 2, H 2 )= ( t (2), h (2)) absorption time = t (b) The absorption marginal haplotype conditional = h genealogy (b) S l at locus l is described by: 1 the absorption time T l, Paul, Steinrücken, and Song (2011) 2 the absorption haplotype H l. Sheehan, Harris, Song May 26,
17 Model Introduction Background Sequential interpretation Discretization Results Conditional Sampling Distribution Marginal conditional genealogies t (a) h (a) S 1 S 3 t (b) h (b) S 2 C G n(h n ) C G n(h n ) Marginal conditional S 1 =(T 1, H 1 )= ( genealogy t (1), h (1)) at locus 3 (red lines): S 2 =(T 2, H 2 )= ( t (2), h (2)) S 3 = S 1 absorption time = t (a) The absorption marginal haplotype conditional = h genealogy (a) S l at locus l is described by: 1 the absorption time T l, Paul, Steinrücken, and Song (2011) 2 the absorption haplotype H l. Sheehan, Harris, Song May 26,
18 Model Hidden Markov Model framework 2D State space: S l = (t l, h l ) t l = absorption time h l = absorption haplotype Sheehan, Harris, Song May 26,
19 Model Hidden Markov Model framework 2D State space: S l = (t l, h l ) t l = absorption time h l = absorption haplotype ( ) Initial absorption time distribution: ζ(t) = n N(t) exp n t 0 N(τ)dτ where N(t) is the population size change function Sheehan, Harris, Song May 26,
20 Model Hidden Markov Model framework 2D State space: S l = (t l, h l ) t l = absorption time h l = absorption haplotype ( ) Initial absorption time distribution: ζ(t) = n N(t) exp n t 0 N(τ)dτ where N(t) is the population size change function Transitions: Given (t l, h l ), { (t (t l+1, h l+1 ) = l, h l ), if no recombination new time and haplotype, if recombination Sheehan, Harris, Song May 26,
21 Model Hidden Markov Model framework 2D State space: S l = (t l, h l ) t l = absorption time h l = absorption haplotype ( ) Initial absorption time distribution: ζ(t) = n N(t) exp n t 0 N(τ)dτ where N(t) is the population size change function Transitions: Given (t l, h l ), { (t (t l+1, h l+1 ) = l, h l ), if no recombination new time and haplotype, if recombination Emissions: mutations occur on the lineage h n+1 only to create the emitted allele (Poisson process with rate θ) Sheehan, Harris, Song May 26,
22 Model Improved accuracy: the wedding cake genealogy Too many lineages in the past under estimation of population sizes Replace the trunk genealogy with wedding cake genealogy where n(t) = the expected number of remaining lineages at time t Sheehan, Harris, Song May 26,
23 EM framework Inferring past population sizes Discretization: Discretize time into d intervals 0 = t 0 < t 2 < < t d = Constant population size N i during interval [t i 1, t i ) Sheehan, Harris, Song May 26,
24 EM procedure EM framework Fix all sizes N i = 1. E-step: Given population size estimates, use a leave-one-out likelihood approach. P(h 1,, h n N(t)) n P(h k h 1,, h k 1, h k+1,, h n, N(t)) k=1 M-step: Minimize the difference between E-step transitions and expected transitions to obtain approximate maximum likelihood estimates for population sizes. Terminate when likelihood has plateaued. Sheehan, Harris, Song May 26,
25 Results and Future Work Inference of TMRCA Red line = psmc meantmrca, Blue line = our meantmrca, Green line = our mediantmrca TMRCA 3 4 Black line = True TMRCA, 0e+00 2e+04 4e+04 6e+04 8e+04 1e+05 Position Sheehan, Harris, Song May 26,
26 Results and Future Work Drosophila melanogaster data haplotypes from the Drosophila Population Genomics Project 37 from North America (Raleigh, USA) 22 from Africa (Gikongoro, Rwanda) used trimmed intergenic regions longer than 70 kb and with < 10% missing data 9 such regions: two from chromosome 2 six from chromosome 3 one from chromosome X 650 kb in total Sheehan, Harris, Song May 26,
27 Data analysis goal Results and Future Work [Figure: Karasov, Messer, and Petrov (2010)] Sheehan, Harris, Song May 26,
28 Results and Future Work Simulated Drosophila history # Population size results, dros orig, hist1 True 3.0 population size (scaling factor) time (2N generations) Sheehan, Harris, Song May 26,
29 Results and Future Work Simulated Drosophila history # 1 1.3x Population size results, dros orig, hist1 PSMC True population size (scaling factor) time (2N generations) Sheehan, Harris, Song May 26,
30 Results and Future Work Simulated Drosophila history # 1 1.3x Population size results, dros orig, hist1 PSMC n=4 True population size (scaling factor) time (2N generations) Sheehan, Harris, Song May 26,
31 Results and Future Work Simulated Drosophila history # 1 1.3x Population size results, dros orig, hist1 PSMC n=6 True population size (scaling factor) time (2N generations) Sheehan, Harris, Song May 26,
32 Results and Future Work Simulated Drosophila history # 1 1.3x Population size results, dros orig, hist1 PSMC n=8 True population size (scaling factor) time (2N generations) Sheehan, Harris, Song May 26,
33 Results and Future Work Simulated Drosophila history # 1 1.3x Population size results, dros orig, hist1 PSMC n=10 True population size (scaling factor) time (2N generations) Sheehan, Harris, Song May 26,
34 Results and Future Work Simulated Drosophila history # Population size results, dros orig, hist2 True population size (scaling factor) time (2N generations) Sheehan, Harris, Song May 26,
35 Results and Future Work Simulated Drosophila history # Population size results, dros orig, hist2 PSMC True population size (scaling factor) time (2N generations) Sheehan, Harris, Song May 26,
36 Results and Future Work Simulated Drosophila history # Population size results, dros orig, hist2 PSMC n=4 True population size (scaling factor) time (2N generations) Sheehan, Harris, Song May 26,
37 Results and Future Work Simulated Drosophila history # Population size results, dros orig, hist2 PSMC n=6 True population size (scaling factor) time (2N generations) Sheehan, Harris, Song May 26,
38 Results and Future Work Simulated Drosophila history # Population size results, dros orig, hist2 PSMC n=8 True population size (scaling factor) time (2N generations) Sheehan, Harris, Song May 26,
39 Results and Future Work Simulated Drosophila history # Population size results, dros orig, hist2 PSMC n=10 True population size (scaling factor) time (2N generations) Sheehan, Harris, Song May 26,
40 Results and Future Work Simulated constant population size results, N(t) = Population size results, null, L500kb PSMC n=10 True population size (scaling factor) time (2N generations) Sheehan, Harris, Song May 26,
41 Results and Future Work Simulated constant population size results, N(t) = 0.1 Population size results, null, size0.1 population size (scaling factor) 1.1x PSMC n=10 True time (2N generations) Sheehan, Harris, Song May 26,
42 Results and Future Work Impact of using wedding cake genealogy Population size results, n vs. n, macs data 1.0 population size (scaling factor) n=6 n=6, no n True time (2N generations) Sheehan, Harris, Song May 26,
43 Results and Future Work Real Drosophila data: Raleigh, USA 5.9e e+06 Population size results, real data, drosophila, RAL PSMC n=10 effective population size 3.0e e e e e e e time (years) Sheehan, Harris, Song May 26,
44 Results and Future Work Real Drosophila data: Gikongoro, Rwanda 5.2e e+06 Population size results, real data, drosophila, RG PSMC n=10 effective population size 3.0e e e e e e e time (years) Sheehan, Harris, Song May 26,
45 Future work Results and Future Work Accounting for selection in the Drosophila genome Incorporating variable recombination rate Incorporating migration Use our model for human data What is the right data for certain time periods Sheehan, Harris, Song May 26,
46 Thank you! Results and Future Work Kelley Harris Yun S. Song The Song Group Sheehan, Harris, Song May 26,
47 Results and Future Work Parameter values for Drosophila data θ = 4N 0 µ = ( ) = ρ = (rescaled based on θ) mutation matrix (order ACGT): P = Sheehan, Harris, Song May 26,
Ancestral Recombination Graphs
Ancestral Recombination Graphs Ancestral relationships among a sample of recombining sequences usually cannot be accurately described by just a single genealogy. Linked sites will have similar, but not
More informationGenealogical trees, coalescent theory, and the analysis of genetic polymorphisms
Genealogical trees, coalescent theory, and the analysis of genetic polymorphisms Magnus Nordborg University of Southern California The importance of history Genetic polymorphism data represent the outcome
More informationPopulation Structure and Genealogies
Population Structure and Genealogies One of the key properties of Kingman s coalescent is that each pair of lineages is equally likely to coalesce whenever a coalescent event occurs. This condition is
More informationThe genealogical history of a population The coalescent process. Identity by descent Distribution of pairwise coalescence times
The coalescent The genealogical history of a population The coalescent process Identity by descent Distribution of pairwise coalescence times Adding mutations Expected pairwise differences Evolutionary
More informationAlgorithms for Genetics: Basics of Wright Fisher Model and Coalescent Theory
Algorithms for Genetics: Basics of Wright Fisher Model and Coalescent Theory Vineet Bafna Harish Nagarajan and Nitin Udpa 1 Disclaimer Please note that a lot of the text and figures here are copied from
More informationPopulation Genetics using Trees. Peter Beerli Genome Sciences University of Washington Seattle WA
Population Genetics using Trees Peter Beerli Genome Sciences University of Washington Seattle WA Outline 1. Introduction to the basic coalescent Population models The coalescent Likelihood estimation of
More informationViral epidemiology and the Coalescent
Viral epidemiology and the Coalescent Philippe Lemey and Marc A. Suchard Department of Microbiology and Immunology K.U. Leuven, and Departments of Biomathematics and Human Genetics David Geffen School
More informationTheoretical Population Biology. An approximate likelihood for genetic data under a model with recombination and population splitting
Theoretical Population Biology 75 (2009) 33 345 Contents lists available at ScienceDirect Theoretical Population Biology journal homepage: www.elsevier.com/locate/tpb An approximate likelihood for genetic
More informationPopulation genetics: Coalescence theory II
Population genetics: Coalescence theory II Peter Beerli August 27, 2009 1 The variance of the coalescence process The coalescent is an accumulation of waiting times. We can think of it as standard queuing
More informationCoalescence. Outline History. History, Model, and Application. Coalescence. The Model. Application
Coalescence History, Model, and Application Outline History Origins of theory/approach Trace the incorporation of other s ideas Coalescence Definition and descriptions The Model Assumptions and Uses Application
More informationBioinformatics I, WS 14/15, D. Huson, December 15,
Bioinformatics I, WS 4/5, D. Huson, December 5, 204 07 7 Introduction to Population Genetics This chapter is closely based on a tutorial given by Stephan Schiffels (currently Sanger Institute) at the Australian
More informationSimulated gene genealogy of a sample of size 50 from a population of constant size. The History of Population Size from Whole Genomes.
Simulated gene genealogy of a sample of size 50 from a population of constant size The History of Population Size from Whole Genomes Alan R Rogers October 1, 2018 Short terminal branches; long basal ones
More informationSTAT 536: The Coalescent
STAT 536: The Coalescent Karin S. Dorman Department of Statistics Iowa State University November 7, 2006 Wright-Fisher Model Our old friend the Wright-Fisher model envisions populations moving forward
More informationApproximating the coalescent with recombination
Approximating the coalescent with recombination Gilean A. T. McVean* and Niall J. Cardin 360, 1387 1393 doi:10.1098/rstb.2005.1673 Published online 7 July 2005 Department of Statistics, 1 South Parks Road,
More informationBIOL Evolution. Lecture 8
BIOL 432 - Evolution Lecture 8 Expected Genotype Frequencies in the Absence of Evolution are Determined by the Hardy-Weinberg Equation. Assumptions: 1) No mutation 2) Random mating 3) Infinite population
More informationForward thinking: the predictive approach
Coalescent Theory 1 Forward thinking: the predictive approach Random variation in reproduction causes random fluctuation in allele frequencies. Can describe this process as diffusion: (Wright 1931) showed
More informationCoalescent Theory: An Introduction for Phylogenetics
Coalescent Theory: An Introduction for Phylogenetics Laura Salter Kubatko Departments of Statistics and Evolution, Ecology, and Organismal Biology The Ohio State University lkubatko@stat.ohio-state.edu
More informationComparative method, coalescents, and the future
Comparative method, coalescents, and the future Joe Felsenstein Depts. of Genome Sciences and of Biology, University of Washington Comparative method, coalescents, and the future p.1/36 Correlation of
More informationCoalescents. Joe Felsenstein. GENOME 453, Autumn Coalescents p.1/48
Coalescents p.1/48 Coalescents Joe Felsenstein GENOME 453, Autumn 2015 Coalescents p.2/48 Cann, Stoneking, and Wilson Becky Cann Mark Stoneking the late Allan Wilson Cann, R. L., M. Stoneking, and A. C.
More information2 The Wright-Fisher model and the neutral theory
0 THE WRIGHT-FISHER MODEL AND THE NEUTRAL THEORY The Wright-Fisher model and the neutral theory Although the main interest of population genetics is conceivably in natural selection, we will first assume
More informationThe Two Phases of the Coalescent and Fixation Processes
The Two Phases of the Coalescent and Fixation Processes Introduction The coalescent process which traces back the current population to a common ancestor and the fixation process which follows an individual
More informationComparative method, coalescents, and the future. Correlation of states in a discrete-state model
Comparative method, coalescents, and the future Joe Felsenstein Depts. of Genome Sciences and of Biology, University of Washington Comparative method, coalescents, and the future p.1/28 Correlation of
More informationCoalescents. Joe Felsenstein. GENOME 453, Winter Coalescents p.1/39
Coalescents Joe Felsenstein GENOME 453, Winter 2007 Coalescents p.1/39 Cann, Stoneking, and Wilson Becky Cann Mark Stoneking the late Allan Wilson Cann, R. L., M. Stoneking, and A. C. Wilson. 1987. Mitochondrial
More informationTREES OF GENES IN POPULATIONS
1 TREES OF GENES IN POPULATIONS Joseph Felsenstein Abstract Trees of ancestry of copies of genes form in populations, as a result of the randomness of birth, death, and Mendelian reproduction. Considering
More informationAncestral population genomics: the coalescent hidden Markov. model approach. Julien Y Dutheil 1, Ganeshkumar Ganapathy 2, Asger Hobolth 1,
Ancestral population genomics: the coalescent hidden Markov model approach Julien Y Dutheil 1, Ganeshkumar Ganapathy 2, Asger Hobolth 1, Thomas Mailund 1, Marcy K Uyenoyama 3, Mikkel H Schierup 1,4 1 Bioinformatics
More informationAdvanced data analysis in population genetics Likelihood-based demographic inference using the coalescent
Advanced data analysis in population genetics Likelihood-based demographic inference using the coalescent Raphael Leblois Centre de Biologie pour la Gestion des Populations (CBGP), INRA, Montpellier master
More informationCoalescence time distributions for hypothesis testing -Kapil Rajaraman 498BIN, HW# 2
Coalescence time distributions for hypothesis testing -Kapil Rajaraman (rajaramn@uiuc.edu) 498BIN, HW# 2 This essay will be an overview of Maryellen Ruvolo s work on studying modern human origins using
More informationKinship/relatedness. David Balding Professor of Statistical Genetics University of Melbourne, and University College London.
Kinship/relatedness David Balding Professor of Statistical Genetics University of Melbourne, and University College London 2 Feb 2016 1 Ways to measure relatedness 2 Pedigree-based kinship coefficients
More informationInference of population structure using dense haplotype data Daniel John Lawson 1, Garrett Hellenthal 2, Simon Myers,3 and Daniel Falush,4,
1 Inference of population structure using dense haplotype data Daniel John Lawson 1, Garrett Hellenthal 2, Simon Myers,3 and Daniel Falush,4, 1 Department of Mathematics, University of Bristol, Bristol,
More informationSome of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks!
Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks! Paul has many great tools for teaching phylogenetics at his web site: http://hydrodictyon.eeb.uconn.edu/people/plewis
More informationInference of Population Structure using Dense Haplotype Data
using Dense Haplotype Data Daniel John Lawson 1, Garrett Hellenthal 2, Simon Myers 3., Daniel Falush 4,5. * 1 Department of Mathematics, University of Bristol, Bristol, United Kingdom, 2 Wellcome Trust
More informationGENEALOGICAL TREES, COALESCENT THEORY AND THE ANALYSIS OF GENETIC POLYMORPHISMS
GENEALOGICAL TREES, COALESCENT THEORY AND THE ANALYSIS OF GENETIC POLYMORPHISMS Noah A. Rosenberg and Magnus Nordborg Improvements in genotyping technologies have led to the increased use of genetic polymorphism
More informationMOLECULAR POPULATION GENETICS: COALESCENT METHODS BASED ON SUMMARY STATISTICS
MOLECULAR POPULATION GENETICS: COALESCENT METHODS BASED ON SUMMARY STATISTICS Daniel A. Vasco*, Keith A. Crandall* and Yun-Xin Fu *Department of Zoology, Brigham Young University, Provo, UT 8460, USA Human
More informationAnalysis of geographically structured populations: Estimators based on coalescence
Analysis of geographically structured populations: Estimators based on coalescence Peter Beerli Department of Genetics, Box 357360, University of Washington, Seattle WA 9895-7360, Email: beerli@genetics.washington.edu
More informationSupplementary Note: Analysis of Latino populations from GALA and MEC reveals genomic loci with biased local ancestry estimation
Supplementary Note: Analysis of Latino populations from GALA and MEC reveals genomic loci with biased local ancestry estimation Bogdan Pasaniuc, Sriram Sankararaman, et al. 1 Relation between Error Rate
More informationCoalescent Likelihood Methods. Mary K. Kuhner Genome Sciences University of Washington Seattle WA
Coalescent Likelihood Methods Mary K. Kuhner Genome Sciences University of Washington Seattle WA Outline 1. Introduction to coalescent theory 2. Practical example 3. Genealogy samplers 4. Break 5. Survey
More informationA hidden Markov model to estimate inbreeding from whole genome sequence data
A hidden Markov model to estimate inbreeding from whole genome sequence data Tom Druet & Mathieu Gautier Unit of Animal Genomics, GIGA-R, University of Liège, Belgium Centre de Biologie pour la Gestion
More informationPedigree Reconstruction using Identity by Descent
Pedigree Reconstruction using Identity by Descent Bonnie Kirkpatrick Electrical Engineering and Computer Sciences University of California at Berkeley Technical Report No. UCB/EECS-2010-43 http://www.eecs.berkeley.edu/pubs/techrpts/2010/eecs-2010-43.html
More informationObjective: Why? 4/6/2014. Outlines:
Objective: Develop mathematical models that quantify/model resemblance between relatives for phenotypes of a quantitative trait : - based on pedigree - based on markers Outlines: Causal model for covariances
More informationKenneth Nordtvedt. Many genetic genealogists eventually employ a time-tomost-recent-common-ancestor
Kenneth Nordtvedt Many genetic genealogists eventually employ a time-tomost-recent-common-ancestor (TMRCA) tool to estimate how far back in time the common ancestor existed for two Y-STR haplotypes obtained
More informationGene coancestry in pedigrees and populations
Gene coancestry in pedigrees and populations Thompson, Elizabeth University of Washington, Department of Statistics Box 354322 Seattle, WA 98115-4322, USA E-mail: eathomp@uw.edu Glazner, Chris University
More informationSensitive Detection of Chromosomal Segments of Distinct Ancestry in Admixed Populations
Sensitive Detection of Chromosomal Segments of Distinct Ancestry in Admixed Populations Alkes L. Price 1,2,3, Arti Tandon 3,4, Nick Patterson 3, Kathleen C. Barnes 5, Nicholas Rafaels 5, Ingo Ruczinski
More informationSINGLE nucleotide polymorphisms (SNPs) are single cases the SNPs have originally been identified by sequencing.
Copyright 2000 by the Genetics Society of America Estimation of Population Parameters and Recombination Rates From Single Nucleotide Polymorphisms Rasmus Nielsen Department of Organismic and Evolutionary
More informationResearch Article The Ancestry of Genetic Segments
International Scholarly Research Network ISRN Biomathematics Volume 2012, Article ID 384275, 8 pages doi:105402/2012/384275 Research Article The Ancestry of Genetic Segments R B Campbell Department of
More information5 Inferring Population
5 Inferring Population History and Demography While population genetics was a very theoretical discipline originally, the modern abundance of population genetic data has forced the field to become more
More informationLecture 1: Introduction to pedigree analysis
Lecture 1: Introduction to pedigree analysis Magnus Dehli Vigeland NORBIS course, 8 th 12 th of January 2018, Oslo Outline Part I: Brief introductions Pedigrees symbols and terminology Some common relationships
More informationEvery human cell (except red blood cells and sperm and eggs) has an. identical set of 23 pairs of chromosomes which carry all the hereditary
Introduction to Genetic Genealogy Every human cell (except red blood cells and sperm and eggs) has an identical set of 23 pairs of chromosomes which carry all the hereditary information that is passed
More informationGrowing the Family Tree: The Power of DNA in Reconstructing Family Relationships
Growing the Family Tree: The Power of DNA in Reconstructing Family Relationships Luke A. D. Hutchison Natalie M. Myres Scott R. Woodward Sorenson Molecular Genealogy Foundation (www.smgf.org) 2511 South
More informationTópicos Depto. Ciencias Biológicas, UniAndes Profesor Andrew J. Crawford Semestre II
Tópicos Depto. Ciencias Biológicas, UniAndes Profesor Andrew J. Crawford Semestre 29 -II Lab Coalescent simulation using SIMCOAL 17 septiembre 29 Coalescent theory provides a powerful model
More informationIoanna Manolopoulou and Brent C. Emerson. October 7, Abstract
Phylogeographic Ancestral Inference Using the Coalescent Model on Haplotype Trees Ioanna Manolopoulou and Brent C. Emerson October 7, 2011 Abstract Phylogeographic ancestral inference is a question frequently
More informationPart I. Concepts and Methods in Bacterial Population Genetics COPYRIGHTED MATERIAL
Part I Concepts and Methods in Bacterial Population Genetics COPYRIGHTED MATERIAL Chapter 1 The Coalescent of Bacterial Populations Mikkel H. Schierup and Carsten Wiuf 1.1 BACKGROUND AND MOTIVATION Recent
More informationPOPULATION GENETICS: WRIGHT FISHER MODEL AND COALESCENT PROCESS. Hailong Cui and Wangshu Zhang. Superviser: Prof. Quentin Berger
POPULATIO GEETICS: WRIGHT FISHER MODEL AD COALESCET PROCESS by Hailong Cui and Wangshu Zhang Superviser: Prof. Quentin Berger A Final Project Report Presented In Partial Fulfillment of the Requirements
More informationChapter 4 Neutral Mutations and Genetic Polymorphisms
Chapter 4 Neutral Mutations and Genetic Polymorphisms The relationship between genetic data and the underlying genealogy was introduced in Chapter. Here we will combine the intuitions of Chapter with the
More informationOn the nonidentifiability of migration time estimates in isolation with migration models
Molecular Ecology (2011) 20, 3956 3962 doi: 10.1111/j.1365-294X.2011.05247.x NEWS AND VIEWS COMMENT On the nonidentifiability of migration time estimates in isolation with migration models VITOR C. SOUSA,
More informationChapter 12 Gene Genealogies
Chapter 12 Gene Genealogies Noah A. Rosenberg Program in Molecular and Computational Biology. University of Southern California, Los Angeles, California 90089-1113 USA. E-mail: noahr@usc.edu. Phone: 213-740-2416.
More informationGenetics: Early Online, published on June 29, 2016 as /genetics A Genealogical Look at Shared Ancestry on the X Chromosome
Genetics: Early Online, published on June 29, 2016 as 10.1534/genetics.116.190041 GENETICS INVESTIGATION A Genealogical Look at Shared Ancestry on the X Chromosome Vince Buffalo,,1, Stephen M. Mount and
More informationThe Structure of Genealogies and the Distribution of Fixed Differences Between DNA Sequence Samples From Natural Populations
Copyright 0 1991 by the Genetics Society of America The Structure of Genealogies the Distribution of Fixed Differences Between DNA Sequence Samples From Natural Populations Department of Biological Sciences,
More informationModelling of Real Network Traffic by Phase-Type distribution
Modelling of Real Network Traffic by Phase-Type distribution Andriy Panchenko Dresden University of Technology 27-28.Juli.2004 4. Würzburger Workshop "IP Netzmanagement, IP Netzplanung und Optimierung"
More informationCoalescent Theory. Magnus Nordborg. Department of Genetics, Lund University. March 24, 2000
Coalescent Theory Magnus Nordborg Department of Genetics, Lund University March 24, 2000 Abstract The coalescent process is a powerful modeling tool for population genetics. The allelic states of all homologous
More informationMitochondrial Eve and Y-chromosome Adam: Who do your genes come from?
Mitochondrial Eve and Y-chromosome Adam: Who do your genes come from? 28 July 2010. Joe Felsenstein Evening At The Genome Mitochondrial Eve and Y-chromosome Adam: Who do your genes come from? p.1/39 Evolutionary
More information[CLIENT] SmithDNA1701 DE January 2017
[CLIENT] SmithDNA1701 DE1704205 11 January 2017 DNA Discovery Plan GOAL Create a research plan to determine how the client s DNA results relate to his family tree as currently constructed. The client s
More informationCoalescent genealogy samplers: windows into population history
Review Coalescent genealogy samplers: windows into population history Mary K. Kuhner Department of Genome Sciences, University of Washington, Box 355065, Seattle, WA 98195-5065, USA Coalescent genealogy
More informationDISCUSSION: RECENT COMMON ANCESTORS OF ALL PRESENT-DAY INDIVIDUALS
Adv. Appl. Prob. 31, 1027 1035 (1999) Printed in Northern Ireland Applied Probability Trust 1999 DISCUSSION: RECENT COMMON ANCESTORS OF ALL PRESENT-DAY INDIVIDUALS It is a pleasure to be able to comment
More informationEstimating effective population size and mutation rate from sequence data using Metropolis-Hastings sampling
Estimating effective population size and mutation rate from sequence data using Metropolis-Hastings sampling Mary K. Kuhner, Jon Yamato, and Joseph Felsenstein Department of Genetics, University of Washington
More informationFactors affecting phasing quality in a commercial layer population
Factors affecting phasing quality in a commercial layer population N. Frioni 1, D. Cavero 2, H. Simianer 1 & M. Erbe 3 1 University of Goettingen, Department of nimal Sciences, Center for Integrated Breeding
More informationReport on the VAN_TUYL Surname Project Y-STR Results 3/11/2013 Rory Van Tuyl
Report on the VAN_TUYL Surname Project Y-STR Results 3/11/2013 Rory Van Tuyl Abstract: Recent data for two descendants of Ott van Tuyl has been added to the project, bringing the total number of Gameren
More information6.047/6.878 Lecture 21: Phylogenomics II
Guest Lecture by Matt Rasmussen Orit Giguzinsky and Ethan Sherbondy December 13, 2012 1 Contents 1 Introduction 3 2 Inferring Orthologs/Paralogs, Gene Duplication and Loss 3 2.1 Species Tree..............................................
More informationA Likelihood Method to Estimate/Detect Gene Flow and A Distance Method to. Estimate Species Trees in the Presence of Gene Flow.
A Likelihood Method to Estimate/Detect Gene Flow and A Distance Method to Estimate Species Trees in the Presence of Gene Flow Thesis Presented in Partial Fulfillment of the Requirements for the Degree
More informationPackage EILA. February 19, Index 6. The CEU-CHD-YRI admixed simulation data
Type Package Title Efficient Inference of Local Ancestry Version 0.1-2 Date 2013-09-09 Package EILA February 19, 2015 Author James J. Yang, Jia Li, Anne Buu, and L. Keoki Williams Maintainer James J. Yang
More informationInbreeding and self-fertilization
Inbreeding and self-fertilization Introduction Remember that long list of assumptions associated with derivation of the Hardy-Weinberg principle that I went over a couple of lectures ago? Well, we re about
More informationcan mathematicians find the woods?
Eolutionary trees, coalescents, and gene trees: can mathematicians find the woods? Joe Felsenstein Department of Genome Sciences and Department of Biology Eolutionary trees, coalescents, and gene trees:
More informationDNA Basics, Y DNA Marker Tables, Ancestral Trees and Mutation Graphs: Definitions, Concepts, Understanding
DNA Basics, Y DNA Marker Tables, Ancestral Trees and Mutation Graphs: Definitions, Concepts, Understanding by Dr. Ing. Robert L. Baber 2014 July 26 Rights reserved, see the copyright notice at http://gengen.rlbaber.de
More informationDNA: Statistical Guidelines
Frequency calculations for STR analysis When a probative association between an evidence profile and a reference profile is made, a frequency estimate is calculated to give weight to the association. Frequency
More informationAutosomal-DNA. How does the nature of Jewish genealogy make autosomal DNA research more challenging?
Autosomal-DNA How does the nature of Jewish genealogy make autosomal DNA research more challenging? Using Family Finder results for genealogy is more challenging for individuals of Jewish ancestry because
More informationDetecting Heterogeneity in Population Structure Across the Genome in Admixed Populations
Genetics: Early Online, published on July 20, 2016 as 10.1534/genetics.115.184184 GENETICS INVESTIGATION Detecting Heterogeneity in Population Structure Across the Genome in Admixed Populations Caitlin
More informationUniversity of Washington, TOPMed DCC July 2018
Module 12: Comput l Pipeline for WGS Relatedness Inference from Genetic Data Timothy Thornton (tathornt@uw.edu) & Stephanie Gogarten (sdmorris@uw.edu) University of Washington, TOPMed DCC July 2018 1 /
More informationChapter 2: Genes in Pedigrees
Chapter 2: Genes in Pedigrees Chapter 2-0 2.1 Pedigree definitions and terminology 2-1 2.2 Gene identity by descent (ibd) 2-5 2.3 ibd of more than 2 genes 2-14 2.4 Data on relatives 2-21 2.1.1 GRAPHICAL
More informationThe program Bayesian Analysis of Trees With Internal Node Generation (BATWING)
Supplementary methods Estimation of TMRCA using BATWING The program Bayesian Analysis of Trees With Internal Node Generation (BATWING) (Wilson et al. 2003) was run using a model of a single population
More informationPopulation Genetics. Joe Felsenstein. GENOME 453, Autumn Population Genetics p.1/70
Population Genetics Joe Felsenstein GENOME 453, Autumn 2013 Population Genetics p.1/70 Godfrey Harold Hardy (1877-1947) Wilhelm Weinberg (1862-1937) Population Genetics p.2/70 A Hardy-Weinberg calculation
More informationAdvanced Techniques for Mobile Robotics Location-Based Activity Recognition
Advanced Techniques for Mobile Robotics Location-Based Activity Recognition Wolfram Burgard, Cyrill Stachniss, Kai Arras, Maren Bennewitz Activity Recognition Based on L. Liao, D. J. Patterson, D. Fox,
More informationWhat can evolution tell us about the feasibility of artificial intelligence? Carl Shulman Singularity Institute for Artificial Intelligence
What can evolution tell us about the feasibility of artificial intelligence? Carl Shulman Singularity Institute for Artificial Intelligence Artificial intelligence Systems that can learn to perform almost
More informationInbreeding and self-fertilization
Inbreeding and self-fertilization Introduction Remember that long list of assumptions associated with derivation of the Hardy-Weinberg principle that we just finished? Well, we re about to begin violating
More informationAFDAA 2012 WINTER MEETING Population Statistics Refresher Course - Lecture 3: Statistics of Kinship Analysis
AFDAA 2012 WINTER MEETING Population Statistics Refresher Course - Lecture 3: Statistics of Kinship Analysis Ranajit Chakraborty, PhD Center for Computational Genomics Institute of Applied Genetics Department
More informationWarning: software often displays unrooted trees like this:
Warning: software often displays unrooted trees like this: /------------------------------ Chara /-------------------------- Chlorella /---------16 \---------------------------- Volvox +-------------------17
More informationAncient Admixture in Human History
Genetics: Published Articles Ahead of Print, published on September 7, 2012 as 10.1534/genetics.112.145037 Ancient Admixture in Human History Nick Patterson 1, Priya Moorjani 2, Yontao Luo 3, Swapan Mallick
More informationPopulation Genetics. Joe Felsenstein. GENOME 453, Autumn Population Genetics p.1/74
Population Genetics Joe Felsenstein GENOME 453, Autumn 2011 Population Genetics p.1/74 Godfrey Harold Hardy (1877-1947) Wilhelm Weinberg (1862-1937) Population Genetics p.2/74 A Hardy-Weinberg calculation
More informationESTIMATION OF THE NUMBER OF INDIVIDUALS FOUNDING COLONIZED POPULATIONS
ORIGINAL ARTICLE doi:1.1111/j.1558-5646.7.8.x ESTIMATION OF THE NUMBER OF INDIVIDUALS FOUNDING COLONIZED POPULATIONS Eric C. Anderson 1, and Montgomery Slatkin 3,4 1 Fisheries Ecology Division, Southwest
More informationY-Chromosome Haplotype Origins via Biogeographical Multilateration
Y-Chromosome Haplotype Origins via Biogeographical Multilateration Michael R. Maglio Abstract Current Y-chromosome migration maps only cover the broadest-brush strokes of the highest-level haplogroups.
More informationSNP variant discovery in pedigrees using Bayesian networks. Amit R. Indap
SNP variant discovery in pedigrees using Bayesian networks Amit R. Indap 1 1 Background Next generation sequencing technologies have reduced the cost and increased the throughput of DNA sequencing experiments
More informationChapter 20. Inference about a Population Proportion. BPS - 5th Ed. Chapter 19 1
Chapter 20 Inference about a Population Proportion BPS - 5th Ed. Chapter 19 1 Proportions The proportion of a population that has some outcome ( success ) is p. The proportion of successes in a sample
More informationThe Coalescent Model. Florian Weber
The Coalescent Model Florian Weber 23. 7. 2016 The Coalescent Model coalescent = zusammenwachsend Outline Population Genetics and the Wright-Fisher-model The Coalescent on-constant population-sizes Further
More informationUNDERSTANDING the genealogical relationship finite for any sample size. But, even positions sharing
Copyright 1999 by the Genetics Society of America The Ancestry of a Sample of Sequences Subject to Recombination Carsten Wiuf and Jotun Hein Institute of Biological Sciences, University of Aarhus, DK-8000
More informationMATCHED FIELD PROCESSING: ENVIRONMENTAL FOCUSING AND SOURCE TRACKING WITH APPLICATION TO THE NORTH ELBA DATA SET
MATCHED FIELD PROCESSING: ENVIRONMENTAL FOCUSING AND SOURCE TRACKING WITH APPLICATION TO THE NORTH ELBA DATA SET Cristiano Soares 1, Andreas Waldhorst 2 and S. M. Jesus 1 1 UCEH - Universidade do Algarve,
More informationGenetic Diversity and the Structure of Genealogies in Rapidly Adapting Populations
Genetic Diversity and the Structure of Genealogies in Rapidly Adapting Populations The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters
More informationPATTERNS of heritable genetic variation in contem- relationships, but does not provide a basis for assessing
Copyright 1998 by the Genetics Society of America Genealogical Inference From Microsatellite Data Ian J. Wilson*, and David J. Balding *School of Biological Sciences, Queen Mary and Westfield College,
More informationA Novel approach for Optimizing Cross Layer among Physical Layer and MAC Layer of Infrastructure Based Wireless Network using Genetic Algorithm
A Novel approach for Optimizing Cross Layer among Physical Layer and MAC Layer of Infrastructure Based Wireless Network using Genetic Algorithm Vinay Verma, Savita Shiwani Abstract Cross-layer awareness
More informationDetection of Compound Structures in Very High Spatial Resolution Images
Detection of Compound Structures in Very High Spatial Resolution Images Selim Aksoy Department of Computer Engineering Bilkent University Bilkent, 06800, Ankara, Turkey saksoy@cs.bilkent.edu.tr Joint work
More informationProject. B) Building the PWM Read the instructions of HO_14. 1) Determine all the 9-mers and list them here:
Project Please choose ONE project among the given five projects. The last three projects are programming projects. hoose any programming language you want. Note that you can also write programs for the
More informationEvaluating the performance of likelihood methods for. detecting population structure and migration
Molecular Ecology (2004) 13, 837 851 doi: 10.1111/j.1365-294X.2004.02132.x Evaluating the performance of likelihood methods for Blackwell Publishing, Ltd. detecting population structure and migration ZAID
More informationMODERN population genetics is data driven and
Copyright Ó 2009 by the Genetics Society of America DOI: 10.1534/genetics.108.092460 Note Extensions of the Coalescent Effective Population Size John Wakeley 1 and Ori Sargsyan Department of Organismic
More information