Data mining in the Dutch Civil Registration from 1811-present

Similar documents
population onomastic databases

Overview of Civil Registration and Vital Statistics systems

SURVEY OF HISTORICAL DATABASES WITH LONGITUDINAL MICRO-DATA

Naming and subclutures in The Netherlands Gerrit Bloothooft Proceedings Int. Conference of Onomastic Sciences, Uppsala, 19-24/8/2002 1

How Do I Start My Family History?

CIVILIAN PERSONAL HISTORY FORM

Make payable to MGCC for genealogy ONLY

SELECTED SOCIAL CHARACTERISTICS IN THE UNITED STATES American Community Survey 5-Year Estimates

The progress in the use of registers and administrative records. Submitted by the Department of Statistics of the Republic of Lithuania

JACKSON COUNTY PIONEER CERTIFICATE PROJECT

Guidelines for Completion of a Youth Application

American Community Survey 5-Year Estimates

American Community Survey 5-Year Estimates

Victor Pootman & Maria Davidts

Métis Federation of Canada Membership Application Form

Métis Genealogical Centre of Canada Central Processing Office for Canadian Métis Council-IT

Examples of Record Linkage Studies from Norway and Bosnia

San Joaquin County First Families Certificate Program

Victor Pootman & Marie Davids

Follow your family using census records

Personal Information. Single Common Law Married Separated Divorced Widowed. Number Street Apartment City Province/Territory Postal Code

Introduction to the course, lecturers, participants and the European Census 2021

I will read certain parts of this presentation, but since there is limited time, I am hoping to read each part in its entirety at a later time.

CHANGE OF SEX DESIGNATION - 16 YEARS OF AGE OR OLDER Instructions to complete application to Vital Statistics, Service Nova Scotia

CHANGE OF SEX DESIGNATION - 16 YEARS OF AGE OR OLDER Instructions to complete application to Vital Statistics, Service Nova Scotia

Civil Registry System National Population Register

This Workbook has been developed to help aid in organizing notes and references while working on the Genealogy Merit Badge Requirements.

Maiden Names: Unlocking the mystery of the Mrs. Jim Lawson Professional Genealogist

SURVEY OF HISTORICAL DATABASES WITH LONGITUDINAL MICRO-DATA. The second questionnaire

Strategies for the 2010 Population Census of Japan

For research to begin please forward the following information:

Quebec population resources: towards an integrated infrastructure of historical microdata ( )

Overview. Tips for Getting Started Principal Records of Genealogical Interest Culture Specific Records Website Demo

Order of the Founders of North America Lineage Documentation Guidelines 09/18/2012 A. General Application requirements. 1. Application completeness

Using Birth, Marriage and Death Certificates from the General Register Office (GRO) for England and Wales

USING CENSUS RECORDS IN GENEALOGICAL RESEARCH AN ONLINE COURSE

Multi-Source Family Reconstruction

Workshop on the Improvement of Civil Registration and Vital Statistics in SADC Region Blantyre, Malawi 1 5 December 2008

Methodology Statement: 2011 Australian Census Demographic Variables

Las Villas del Norte

Curriculum Vitae. 1. Personal data. Title Address Meeuwenlaan 72. Place of residence Country of residence Telephone number Mobile number

MÉTIS NATION BRITISH COLUMBIA CITIZENSHIP APPLICATION PACKAGE 15 YRS & OLDER Please read carefully, items listed below are mandatory.

The Finnish Social Statistics System and its Potential

OVERVIEW. Ancestors in the 19th Century Class 3 Lindsay Fulton, Director of Research Services. Meet today s presenter 4/4/2017

Grandparenting in Europe: Main study Preliminary Findings Briefing

Registry Publication 62

Palatine Families Of New York (2 Volume Set) By Henry Z. Jones READ ONLINE

SESSION 11. QUALITY ASSESSMENT AND ASSURANCE IN THE CIVIL REGISTRATION

Ch ange of name fo r adul ts

THE CANADIAN HERALDIC AUTHORITY

The main focus of the survey is to measure income, unemployment, and poverty.

BEGINNING GENEALOGY Ellen Miller Reference Assistant Midwest Genealogy Center Copyright 12 March Welcome. Thank You For Your Time Today.

The Art of Searching on FamilySearch: Finding Elusive Records on FamilySearch

MÉTIS NATION BRITISH COLUMBIA CITIZENSHIP APPLICATION PACKAGE 14 YRS & YOUNGER

Migration statistics and 2021 Population Census in Spain. Why exchanging microdata? Antonio Argüeso National Statistics Institute (INE) Spain

WRITING ABOUT THE DATA

NIKKEI-JIN VISA (JAPANESE DESCENDANT)

ENGLAND FOR BEGINNERS

VITAL STATISTICS ACT REGULATIONS

The Family Name as Socio-Cultural Feature and Genetic Metaphor: From Concepts to Methods

Get Your Census Worth: Using the Census as a Research Tool

What s New at FamilySearch.org

SETTLERS OF LORAIN COUNTY, OHIO Application Deadline is June 1 of any given year

Application to record an overseas birth in the register of births (section 36 of the Civil Status Act)

FAMILY HISTORY GROUP RESEARCHING YOUR ANCESTORS IN IRELAND

IrishGenealogy.ie. Friends of Irish Research Richard Reid 08/03/2015

front cover Index of Jews Resident in New Brunswick, Nova Scotia and Prince Edward Island According to the 1861 to 1901 Censuses of Canada approximate

0-4 years: 8% 7% 5-14 years: 13% 12% years: 6% 6% years: 65% 66% 65+ years: 8% 10%

Family History: Genealogy Made Easy with Lisa Louise Cooke

South Africa - South African Census Community Profiles 2011

Discovering an Immigrant s Place of Origin

What s in a Name? HANDOUT Andrea Patterson, RVGS Volunteer.

Using the FamilySearch Family Tree (23 March 2012)

A short how-to guide on searching ancestors in the Netherlands

2. Please use maiden names where applicable, and all given names of ancestors.

LIVINGSTON COUNTY GENEALOGICAL SOCIETY Howell, Michigan. Ancestral Certificate Program

Perry County Pioneers Lineage Society. Rules and Application Procedures

Presentation for BCG Webinar, April 2016

population and housing censuses in Viet Nam: experiences of 1999 census and main ideas for the next census Paper prepared for the 22 nd

One of the most popular paper filling systems was developed by Mary E. Vassel Hill. This is the filling system we are going to talk about today.

Use U.S. Census Information to Resolve Family History Research Problems

FUNERAL DIRECTOR INSTRUCTIONS

VICTORIAN PANEL STUDY

Y-DNA Genetic Testing

Neighbourhood Profiles Census

Guidelines for Completion of Application

ABOUT MORTALITY DATA FOR THE NETHERLANDS By Domantas Jasilionis Last Revised: 09 May 2006

Schedule A Application to be Enrolled as a Beneficiary of the Labrador Inuit Land Claims Agreement

Neighbourhood Profiles Census and National Household Survey

Descendants of William Wonnacott

Williams County Genealogical Society. Lineage Society Rules and Application Procedures

Founders and Survivors Linkage Strategy

THE BASICS OF DNA TESTING. By Jill Garrison, Genealogy Coordinator Frankfort Community Public Library

A Web-Based Genealogy System

CENTENARY PIONEER RECOGNITION PROGRAM

Typical mistakes were made when spelling peoples names, or noting their occupations, or even when recording their ages.

ESSnet on DATA INTEGRATION

TURKISH STATISTICAL INSTITUTE

Hamilton County Genealogical Society

Basic Information: What do you know?

Transcription:

Data mining in the Dutch Civil Registration from 1811-present Gerrit Bloothooft 1,2,3, Kees Mandemakers 2, Leendert Brouwer 3, Matthijs Brouwer 3 1 Universiteit Utrecht / 2 IISG KNAW / 3 Meertens Instituut KNAW The Netherlands Paris workshop 9-10/12/2010 1

names in family trees Paris workshop 9-10/12/2010 2

intergenerational (family names) Hoeksema I:1 1895-1959 I:2 1902-1987 II:6 II:5 1935 Hoeksema III:9 III:8 1961 III:10 Hoeksema (Hoeksema) IV:4 1986 IV:5 1993 Paris workshop 9-10/12/2010 3

family names Patrilinear» daughters keep family name of father Start ~17 th century, compulsory 1811 Limited geographic spread Linguistic properties» language (dialect)» suffixes (-s(e)ma, -stra, -ing -ink,..)» length» patronyms, occupation, provenance, Paris workshop 9-10/12/2010 4

intergenerational (first names) Johannes I:1 1895-1959 I:2 1902-1987 Cornelia Maria II:6 II:5 1935 Willem Dirk III:9 III:8 1961 Corrie III:10 Jan Priscilla IV:4 1986 IV:5 1993 Semantha Paris workshop 9-10/12/2010 5

first names Traditional naming Modern naming» little effect of social class (in The Netherlands)» after grandparents in prescribed order» dominant until 1960» fashion during one generation» correlation with education and income, lifestyle geographic spread socio-onomastic features (modern) Paris workshop 9-10/12/2010 6

ideal source: Civil Registration Names Dates and places of birth, marriage, death Family relations (parents, partners) Occupation» not in modern CR Paris workshop 9-10/12/2010 7

restrictions CR Privacy restrictions modern: since 2000 available for research (in NL)» no identification of individuals allowed» but names are intended to identify Digital availability modern: since 1994 historical: certificates digitized by volonteers» from 1811-1909 (birth), 1811-1934 (marriages), 1811-1959 (death) [50% completed]» LINKS project to reconstruct families Paris workshop 9-10/12/2010 8

modern data (full population) first names (selection 2006: 16+6 million) all first names date, place and country of birth current residence (postal code) id id of parents (and their names, date, place and country of birth) family names (selection 2007: 16 million) prefix and family name date, place and country of birth current residence Paris workshop 9-10/12/2010 9

different names first names 500.000 300.000 in first position 5.000.000 as full name family names 314.000 Paris workshop 9-10/12/2010 10

online name corpora first names www.meertens.knaw.nl/nvb June 3, 2010 family names www.meertens.knaw.nl/nfb December 3, 2009 Paris workshop 9-10/12/2010 11

on show all presentations absolute & relative first names name & gender # as first name, # as following name (totals 2006) births per year (since 1880) places of birth (for 2006 population) 468 municipalities in 2006 explanations (20.000, ~4000 extensive) family names name # in 2007, # in 1947 places of residence in 2006, provinces in 1947 explanations (90.000, ~ 4000 extensive) relational network of names Paris workshop 9-10/12/2010 12

privacy issues shown: all names not shown: any figure < 5» Norway: <3, France: <5, Belgium: all if identifiable:» not on map for small municipalities (by rules)» unless in telephone directory Paris workshop 9-10/12/2010 13

name search exact begins with pick one from list ends with contains pick one from list pick one from list advanced regular expression aggregated data matthijs matt ijs th ^ma.*(ij y)s$ Paris workshop 9-10/12/2010 14

family names Paris workshop 9-10/12/2010 15

main page external links to among others the national bureau for genealogy network of related names Paris workshop 9-10/12/2010 16

surname maps Janse Janssen Jansen 100 km relative figures Paris workshop 9-10/12/2010 17

properties of sets of surnames regular expression: stra$ results in 483 surnames on -stra Paris workshop 9-10/12/2010 18

surnames on -stra Protestant Industry and mines 100 km Catholic Paris workshop 9-10/12/2010 19

first names Paris workshop 9-10/12/2010 20

popularity of Gerrit (numbers) Paris workshop 9-10/12/2010 21

popularity of Gerrit (relative) Paris workshop 9-10/12/2010 22

from tradition to fashion Traditional naming Jan Kevin Nelly Jayden Ingrid Femke Paris workshop 9-10/12/2010 23

map of Gerrit (relative) Paris workshop 9-10/12/2010 24

co-variation: from many to few assumption parents chose names for their children that fit their social environment traditional names (traditional Dutch latinized) Frisian, Arabic, Turkish names English, French, Scandinavian, Spanish, Italian names names from the Old Testament names from history and culture names from nature.. analysis of names of children found in the same family Paris workshop 9-10/12/2010 25

first names and religion Traditional Dutch names Traditional Protestants Dutch bible belt Paris workshop 9-10/12/2010 26

ap of urrent rst names Typical name group per postal code area Traditional Latinized Traditional Dutch Old Testament Frisian PreModern Dutch Elite French Nordic French Modern Dutch Modern Italian Spanish English Arabic & Turkish Paris workshop 9-10/12/2010 27

socio-economics of first names Socio-economic data at family level Names of children in the family known for 281.751 households (2000-2005) with children (questionnaire)» name group and» income» highest education» lifestyle profile Paris workshop 9-10/12/2010 28

two dimensions 2 traditional trendy 1 0-1 Italian-Spanish Arabic2 Arabic1 Turkish English Modern French Mixed(Nordic) Dutch-preModern Dutch-Modern Hebrew Frisian Elite Traditional -2-2 -1 0 1 low income high income 2 Paris workshop 9-10/12/2010 29

migration we know family relations (in the database of first names) placesof birth descendants, where were they born ancestors, where were they born Paris workshop 9-10/12/2010 30

descendants of people from Sneek places of birth of grandchildren from males born in Sneek between 1880-1900 This example concerns the town of Sneek, but in the interactive application any municipality can be chosen Paris workshop 9-10/12/2010 31

ancestors of people from Sneek places of birth of great-grandparents from current male inhabitants of Sneek who are between 30 and 50 years of age Paris workshop 9-10/12/2010 32

extension to 1811 automatic family reconstruction in progress for full population» 50% of CR certifcates available; completed in 2020» many record linkage issues to be solved» ~15 million persons 1811-1930 Historical Sample of The Netherlands» 78.000 life courses manually reconstructed» unbiased sample 1812-1922 Paris workshop 9-10/12/2010 33

in conclusion names are exciting challenges for: pattern recognition space-time representations historical, linguistic and (socio)onomastic studies (inter)national data sharing (& privacy issues) Paris workshop 9-10/12/2010 34