Lecture 1: Introduction to pedigree analysis

Similar documents
Pedigrees How do scientists trace hereditary diseases through a family history?

Using Pedigrees to interpret Mode of Inheritance

Developing Conclusions About Different Modes of Inheritance

AFDAA 2012 WINTER MEETING Population Statistics Refresher Course - Lecture 3: Statistics of Kinship Analysis

Methods of Parentage Analysis in Natural Populations

Puzzling Pedigrees. Essential Question: How can pedigrees be used to study the inheritance of human traits?

Two-point linkage analysis using the LINKAGE/FASTLINK programs

Population Genetics 3: Inbreeding

Spring 2013 Assignment Set #3 Pedigree Analysis. Set 3 Problems sorted by analytical and/or content type

Genetics. 7 th Grade Mrs. Boguslaw

Chapter 2: Genes in Pedigrees

BIOL 502 Population Genetics Spring 2017

Decrease of Heterozygosity Under Inbreeding

TDT vignette Use of snpstats in family based studies

DNA Basics, Y DNA Marker Tables, Ancestral Trees and Mutation Graphs: Definitions, Concepts, Understanding

CONGEN. Inbreeding vocabulary

Pedigree Charts. The family tree of genetics

Statistical methods in genetic relatedness and pedigree analysis

Eastern Regional High School. 1 2 Aa Aa Aa Aa

Lecture 6: Inbreeding. September 10, 2012

ICMP DNA REPORTS GUIDE

Bottlenecks reduce genetic variation Genetic Drift

Population Genetics. Joe Felsenstein. GENOME 453, Autumn Population Genetics p.1/70

Investigations from last time. Inbreeding and neutral evolution Genes, alleles and heterozygosity

Kinship/relatedness. David Balding Professor of Statistical Genetics University of Melbourne, and University College London.

Population Genetics. Joe Felsenstein. GENOME 453, Autumn Population Genetics p.1/74

BIOL Evolution. Lecture 8

1.4.1(Question should be rather: Another sibling of these two brothers) 25% % % (population risk of heterozygot*2/3*1/4)

Growing the Family Tree: The Power of DNA in Reconstructing Family Relationships

Popstats Parentage Statistics Strength of Genetic Evidence In Parentage Testing

The Pedigree. NOTE: there are no definite conclusions that can be made from a pedigree. However, there are more likely and less likely explanations

Population Structure. Population Structure

Pedigree Worksheet Name Period Date Interpreting a Human Pedigree Use the pedigree below to answer 1-5

Populations. Arindam RoyChoudhury. Department of Biostatistics, Columbia University, New York NY 10032, U.S.A.,

Linkage Analysis in Merlin. Meike Bartels Kate Morley Danielle Posthuma

Development Team. Importance and Implications of Pedigree and Genealogy. Anthropology. Principal Investigator. Paper Coordinator.

Contributed by "Kathy Hallett"

DNA: Statistical Guidelines

PopGen3: Inbreeding in a finite population

Genome-Wide Association Exercise - Data Quality Control

Pedigree Reconstruction using Identity by Descent

The Coalescent Model. Florian Weber

NON-RANDOM MATING AND INBREEDING

SNP variant discovery in pedigrees using Bayesian networks. Amit R. Indap

Inbreeding depression in corn. Inbreeding. Inbreeding depression in humans. Genotype frequencies without random mating. Example.

fbat August 21, 2010 Basic data quality checks for markers

Population Genetics using Trees. Peter Beerli Genome Sciences University of Washington Seattle WA

4. Kinship Paper Challenge

TRACK 1: BEGINNING DNA RESEARCH presented by Andy Hochreiter

Ancestral Recombination Graphs

An Optimal Algorithm for Automatic Genotype Elimination

Determining Relatedness from a Pedigree Diagram

DNA Testing. February 16, 2018

Inbreeding and self-fertilization

Every human cell (except red blood cells and sperm and eggs) has an. identical set of 23 pairs of chromosomes which carry all the hereditary

Genealogical trees, coalescent theory, and the analysis of genetic polymorphisms

On identification problems requiring linked autosomal markers

Maximum likelihood pedigree reconstruction using integer programming

Walter Steets Houston Genealogical Forum DNA Interest Group January 6, 2018

Halley Family. Mystery? Mystery? Can you solve a. Can you help solve a

Kenneth Nordtvedt. Many genetic genealogists eventually employ a time-tomost-recent-common-ancestor

DNA Testing What you need to know first

Inbreeding and self-fertilization

Pizza and Who do you think you are?

Supporting Online Material for

Genetic Effects of Consanguineous Marriage: Facts and Artifacts

Biology Partnership (A Teacher Quality Grant) Lesson Plan Construction Form

9Consanguineous marriage and recessive

BIOINFORMATICS. Efficient Genome Ancestry Inference in Complex Pedigrees with Inbreeding

Your Family 101 Beginning Genealogical Research

Using Meiosis to make a Mini-Manc

Large scale kinship:familial Searching and DVI. Seoul, ISFG workshop

Ewing Surname Y-DNA Project Article 8

University of Washington, TOPMed DCC July 2018

COMMUNITY UNIT SCHOOL DISTRICT 200 Science Curriculum Philosophy

Using Y-DNA for Genealogy Debbie Parker Wayne, CG, CGL SM

Automated Discovery of Pedigrees and Their Structures in Collections of STR DNA Specimens Using a Link Discovery Tool

Gene coancestry in pedigrees and populations

Mehdi Sargolzaei L Alliance Boviteq, St-Hyacinthe, QC, Canada and CGIL, University of Guelph, Guelph, ON, Canada. Summary

Genetics Practice Problems Pedigree Tables Answer Key

Primer on Human Pedigree Analysis:

THE BASICS OF DNA TESTING. By Jill Garrison, Genealogy Coordinator Frankfort Community Public Library

Illumina GenomeStudio Analysis

Assessment of alternative genotyping strategies to maximize imputation accuracy at minimal cost

Detection of Misspecified Relationships in Inbred and Outbred Pedigrees

NIH Public Access Author Manuscript Genet Res (Camb). Author manuscript; available in PMC 2011 April 4.

ville, VA Associate Editor: XXXXXXX Received on XXXXX; revised on XXXXX; accepted on XXXXX

and g2. The second genotype, however, has a doubled opportunity of transmitting the gene X to any

Genomic Variation of Inbreeding and Ancestry in the Remaining Two Isle Royale Wolves

The Structure of DNA Let s take a closer look at how this looks under a microscope.

1) Using the sightings data, determine who moved from one area to another and fill this data in on the data sheet.

What Can I Learn From DNA Testing?

Chromosome X haplotyping in deficiency paternity testing principles and case report

Pedigree- The Genetic Family Tree

genetics paper pets By the end of the eighth grade, students are Learning with Introduction to inheritance by Valerie Raunig Finnerty

Optimum contribution selection conserves genetic diversity better than random selection in small populations with overlapping generations

DNA Basics. OLLI: Genealogy 101 October 1, ~ Monique E. Rivera ~

A hidden Markov model to estimate inbreeding from whole genome sequence data

! FTDNA! Ancestry. ! 23andMe. ! Medical Considera,ons. ! Iden,fying family medical history. ! Communica,ng with the medical community

Need a little help with the lab?

Transcription:

Lecture 1: Introduction to pedigree analysis Magnus Dehli Vigeland NORBIS course, 8 th 12 th of January 2018, Oslo

Outline Part I: Brief introductions Pedigrees symbols and terminology Some common relationships Genetics Locus, allele, genotype, marker Mendelian inheritance Autosomal X, Y Part II: Pedigree likelihoods Motivation: Real-life problems Ingredients: Hardy-Weinberg equilibrium Mendelian transition probabilities Likelihoods by hand Computer algorithms

Outline Part I: Brief introductions Pedigrees symbols and terminology Some common relationships Genetics Locus, allele, genotype, marker Mendelian inheritance Autosomal X, Y Part II: Pedigree likelihoods Motivation: Real-life problems Ingredients: Hardy-Weinberg equilibrium Mendelian transition probabilities Likelihoods by hand Computer algorithms

Pedigrees: Symbols and terminology Founders: No parents included in the pedigree = male = female Nonfounders

Pedigrees: Symbols and terminology = male = female Consanguineous marriage

Pedigrees: Symbols and terminology Medical pedigrees: = affected = unaffected = carrier of disease allele

Alternative ways of drawing pedigrees 1 3 5 Standard Simplified Directed acyclic graph

Outline Part I: Brief introductions Pedigrees symbols and terminology Some common relationships Genetics Locus, allele, genotype, marker Mendelian inheritance Autosomal X, Y Part II: Pedigree likelihoods Motivation: Real-life problems Ingredients: Hardy-Weinberg equilibrium Mendelian transition probabilities Likelihoods by hand Computer algorithms

Cousin relationships Full siblings First cousins Second cousins

Cousin relationships First cousins once removed

Half cousin relationships Half siblings (paternal) Half first cousins Half second cousins

Half cousin relationships

More complicated relationships Other: Double first cousins Quadruple half first cousins 3/4 siblings

Outline Part I: Brief introductions Pedigrees symbols and terminology Some common relationships Genetics Locus, allele, genotype, marker Mendelian inheritance Autosomal X, Y Part II: Pedigree likelihoods Motivation: Real-life problems Ingredients: Hardy-Weinberg equilibrium Mendelian transition probabilities Likelihoods by hand Computer algorithms

Genetics Human genome: Diploid 22 pairs of autosomes Sex chroms: X and Y Some important terms Locus Allele Genotype Genetic markers SNPs microsatellites

Locus, allele, genotype M F alleles A B locus genotype: A/B Homologous chromosomes LOCUS = a specific place in the genome, e.g. a base pair, a gene or a region ALLELE = any of the alternative forms of a locus GENOTYPE = the set of alleles carried by an individual at a given locus

Genetic markers Small parts of the genome which... have known position vary in the population are easy to genotype SNPs (single nucleotide polymorphisms) two alleles usual requirement: MAF > 1% = minor allele frequency very common in the genome (millions!) used in medical genetics +++...CCGTTATATGGGC......CCGTTAGATGGGC......CCGTTATATGGGC......CCGTTATATGGGC......CCGTTAGATGGGC... STRs (short tandem repeats) = microsatellites consecutive repeats of 2-5 bases multiallelic: 5-50 alleles allele names: # repeats used in forensics...acg TTAG TTAG TTAG TTAG AAC.....ACG TTAG TTAG AAC.....ACG TTAG TTAG TTAG TTAG TTAG AAC..

Outline Part I: Brief introductions Pedigrees symbols and terminology Some common relationships Genetics Locus, allele, genotype, marker Mendelian inheritance Autosomal X, Y Part II: Pedigree likelihoods Motivation: Real-life problems Ingredients: Hardy-Weinberg equilibrium Mendelian transition probabilities Likelihoods by hand Computer algorithms

Mendelian inheritance: Autosomal (chromosomes 1-22) Example: autosomal marker with 3 alleles: A, B, C homozygous A/A B/C heterozygous A/B A/C A/B Probability of transmitting either allele: always 50% B/C

Mendelian inheritance: X-linked Example: X-linked marker with 3 alleles: A, B, C males are hemizygous A B/C A/C A/B C forced transmission from father to daughter A no transmisison from father to son

Mendelian inheritance: Y-linked Example: Y-linked marker with 2 alleles: A, B A B no transmission involving females B father-son forced

Assumptions throughout (most of) this course Diploid species No cytogenetic abnormalitites No de novo mutations COFFEE BREAK!

Outline Part I: Brief introductions Pedigrees symbols and terminology Some common relationships Genetics Locus, allele, genotype, marker Mendelian inheritance Autosomal X, Y Part II: Pedigree likelihoods Motivation: Real-life problems Ingredients: Hardy-Weinberg equilibrium Mendelian transition probabilities Likelihoods by hand Computer algorithms

Questions related to pedigrees with genotypes Will my child have the disease? Is NN the true father? Brothers or half brothers? Is NN related to this family? How? Predict the missing genotype?

Questions related to pedigrees with genotypes D/N? D Disease locus: alleles D and N Will my child have the disease?

Questions related to pedigrees with genotypes 1 2 11/13 -/- 13/18 11/18 Suppose: 11 is common 18 is rare Who is the true father?

Questions related to pedigrees with genotypes Brothers or half brothers?

Questions related to pedigrees with genotypes 12/14 32/40 7/11 6/21 11/14 32/40 13/13 6/25 12/16 34/40 7/7 12/21 11/16 32/41 7/13 6/25 Is this woman related to the family?

Questions related to pedigrees with genotypes A/B A/A A/A?/? A/B A/B Can we predict the missing genotype?

Common to all of these: The need to calculate probabilities P( genotypes pedigree, marker info, allele freqs,.. ) Called the likelihood of the pedigree.

Ingredients for likelihood computations founder probabilites A/B A/A transition probabilities A/A -/- A/B A/A untyped individuals

Ingredient 1: Founder probabilities Suppose the allele frequencies are: P A = p P B = q What are the frequencies of the genotypes AA, AB, BB? Under certain assumptions, the alleles can be treated as independent: P(AA) = P A P A = p 2 P BB = P B P B = q 2 P AB = P AB or BA = pq + qp = 2pq two possible orderings!

The Hardy-Weinberg principle Assumptions: A p B q infinite population random mating no selection AA AB BB no mutations no migration A p B q Hardy (1908): Shows «... using a little mathemathics of the multiplication table kind»: p 2 2pq q 2 AA AB BB allele freqs are unchanged from generation to generation after 1 generation the genotype freqs stay unchanged A p B q P AA = p 2 P AB = 2pq P(BB) = q 2 HW equilibrium p 2 AA 2pq AB q 2 BB

p AA = p 2 p AB = 2pq p BB = q 2 assuming HWE Allele frequencies Genotype frequencies always p = p AA + 0.5 p AB q = p BB + 0.5 p AB

Ingredient 2: Transition probabilities P(g child g parents ) Easy - follows directly from Mendel's laws! A/A -/- A/B child parents A/A AB BB AA AA 1 0 0 AA AB 0.5 0.5 0 AA BB 0 1 0 AB AA 0.5 0.5 0 AB AB 0.25 0.5 0.25 AB BB 0 0.5 0.5 BB AA 0 1 0 BB AB 0 0.5 0.5 BB BB 0 0 1

Example 1 2 A/A A/B 3 A/B L = P g 1, g 2, g 3 = P g 1 ) P(g 2 ) P(g 3 g 1, g 2 = P AA P AB P AB parents = AA AB) = p 2 2pq 0.5 = p 3 q assuming HWE!

Example on X A 1 2 A/B 3 4 5 B/B A/B B 6 B/B L = P genotypes pedigree, p, q 1 2 3 4 5 6 contribution from each individual = p 2pq 0.5 0.5 q 2 1 = 0.5 p 2 q 3

Ingredient 3: How to deal with untyped individuals Solution: Sum of all possible genotypes for the untyped 1 2 A/A -/- 3 A/B P g 1, g 3 = P(g 1, g 2, g 3 ) = P g 1 ) P(g 2 ) P(g 3 g 1, g 2 g 2 g 2 = P(AA) P(AA) P AB AA AA) + P(AA) P(AB) P AB AA AB) + P(AA) P(BB) P AB AA BB) = p 2 p 2 0 + p 2 2pq 0.5 + p 2 q 2 1 = p 3 q + p 2 q 2 = p 2 q p + q = p 2 q

Pedigree likelihood: General formula Given: pedigree with n individuals k members are genotyped: g 1, g 2,, g k Then: founders non-founders P g 1,..., g k = G 1 G 2 G n P g 1 P g j P g j+1 par P g n par If everyone is typed: Only one term easy G i = set of possible genotypes for individual i Number of terms grows exponentially in #(untyped) but clever algorithms exist!

Computer algorithms for pedigree likelihoods Elston-Stewart algorithm a peeling algorithm linear in pedigree size! A/B A/A A/A -/- A/A Lander-Green based on inheritance vectors hidden Markov model best choice with many linked markers small/medium pedigrees only A/B

Software R/paramlink R environment Elston-Stewart general likelihoods, inbreeding, simulation ++ Familias GUI for forensic applications Elston-Stewart handles mutations, HW deviations, ++ MERLIN command line program Lander-Green medical applications: multipoint linkage