February 24, [Click for Most Updated Paper] [Click for Most Updated Online Appendices]

Size: px
Start display at page:

Download "February 24, [Click for Most Updated Paper] [Click for Most Updated Online Appendices]"

Transcription

1 ONLINE APPENDICES for How Well Do Automated Linking Methods Perform in Historical Samples? Evidence from New Ground Truth Martha Bailey, 1,2 Connor Cole, 1 Morgan Henderson, 1 Catherine Massey 1 1 University of Michigan 2 National Bureau of Economic Research February 24, 2018 [Click for Most Updated Paper] [Click for Most Updated Online Appendices] Online Appendix for How Well Do Automated Linking Methods Perform? 1

2 A. Details Regarding the Implementation of Algorithms and Effects of Race-Blocking 1. Implementation of s Method We generate links for the (2016) two ways. The Iowa coefficient results use the parameters that estimated with his training data that links the 1915 Iowa Census to the 1940 Census, which we directly apply to the relevant data ground truth sample. Because the performance of this algorithm hinges critically on the quality of the training data and its similarities with records we would like to link, we also examine performance based on coefficients that we estimate using a simple, random sample of the ground truth data used in this paper. For the Early Indicators data, we select a random sample of 500 observations. For all the other datasets, we select a random sample of 2,000 observations. (Our simulations show the size of the random sample has little effect on the outcomes above the sample size of 2,000). In each of these samples, we estimate a probit model on the training dataset using s covariates, with the outcome being the binary variable denoting a true match. Then, we apply the estimated parameters to the ground truth sample (less the training data) to classify links. Appendix Table A1 reports the estimation results using each ground truth sample. Column 1 presents the covariates reported by (2016) and the remaining columns present the coefficients for each ground truth sample. The parameter estimates vary somewhat, but the signs and magnitudes are similar across ground truth datasets. The missing variables in the probit reflect the fact that the relevant ground truth data either does not have variation in the variable or that the excluded variables perfectly predict matches or non-matches in the ground truth. Appendix Table 2 lists the other parameters that are relevant for s algorithm. Additionally, since the parameters estimated above are estimated with only a single random subset of the ground-truth, we report the average results from 200 repetitions of the algorithm. The b1 and b2 values are similar across datasets, with the exception of the IPUMS data. For IPUMS, the comparatively high value of b2 reflects the fact that IPUMS matches are likely more conservative. Across samples, the Type I and Type II error rates are similar across 200 repetitions in the ground truth. Online Appendix for How Well Do Automated Linking Methods Perform? 2

3 Appendix Table A1. Probit Coefficients Estimated Using Different Ground Truth Samples (1) (2) (3) (4) (5) Variables LIFE-M Simulated Early IPUMS-LRS (2016) Data Indicators First and Last Names Match 0.632*** 0.663*** * (0.086) (0.127) (0.0812) (0.172) (0.152) First Name Jaro-Winkler Score *** *** *** ** (0.525) (1.662) (1.058) (1.736) (2.059) Last Name Jaro-Winkler Score *** *** *** *** *** (0.487) (1.270) (0.826) (1.406) (1.735) Absolute Value of Difference in Year of Birth is *** *** *** *** (0.044) (0.0713) (0.0688) (0.114) (0.0970) Absolute Value of Difference in Year of Birth is *** *** *** *** *** (0.065) (0.113) (0.0700) (0.144) (0.526) Absolute Value of Difference in Year of Birth is *** *** *** - (0.102) (0.121) - (0.167) - First Name Soundex Match 0.153*** *** 0.715*** (0.054) (0.229) (0.137) (0.270) (0.273) Last Name Soundex Match 0.698*** 1.139*** 0.268*** 0.495*** 1.618*** (0.069) (0.145) (0.0938) (0.164) (0.370) Number of Potential Matches *** *** *** *** *** (0.002) ( ) ( ) ( ) ( ) Number of Potential Matches Squared **** 2.01e-05*** 5.34e-05*** 1.23e-05*** 7.51e-05*** ( ) (4.92e-06) (7.96e-06) (2.43e-05) (9.64e-06) More than One Exact Match on First and Last *** *** *** *** *** Name (0.093) (0.0754) (0.0576) (0.120) (0.153) First Letter of First Name Matches 0.871*** 1.004** 0.430*** (0.130) (0.434) (0.146) - (0.428) First Letter of Last Name Matches 0.886*** ** - (0.148) - (0.112) (0.273) - Last Letter of First Name Matches 0.147*** *** (0.053) (0.202) (0.140) (0.240) (0.234) Last Letter of Last Name Matches 0.649*** 0.382*** 0.942*** 0.542*** (0.070) (0.147) (0.118) (0.157) (0.205) Middle Initial Matches (0 if Middle Initial Does Not Exist or Does Not Match) 0.537*** 0.867*** 2.161*** 0.887*** 1.508*** (0.097) (0.0930) (0.0521) (0.154) (0.132) State is Ohio N/A 0.361*** 0.224*** N/A N/A (0.0656) (0.0505) Constant *** *** *** ** *** (0.225) (0.519) (0.289) (0.455) (0.669) Observations 38,091 42,973 50,255 6,582 43,960 Log-Likelihood Akaike Inf. Crit Notes: Column 2 drops the estimate for First Letter on Last Name, because LIFE-M blocked on this variable to create the list of potential links. Column 3 drops Absolute Value of Difference in Year of Birth is 3, because there is no variation in this. Column 5 drops Absolute Value of Difference in Year of Birth is 3 because IPUMS selected matches with 2 years gap or less; column 5 drops First letter on Last Name because there is no variation in this variable for IPUMS. Online Appendix for How Well Do Automated Linking Methods Perform? 3

4 Appendix Table A2. Features of the Classifier using the Estimated Coefficients (1) (2) (3) (4) (5) (2016) LIFE-M Simulated Data Early Indicators IPUMS- LRS B1 Value B2 Value Match Rate Type I Error Type II Error Average B1 Value Average B2 Value Average Match Rate Average Type I Error Average Type II Error Implementation of the Iterative Method The authors of the Iterative Method generously shared the most updated version of code used in et al. (2014), which was updated and posted here: We implement all algorithms using the code provided by the authors (labeled et al 2014 in Appendix Table A3). To implement et al. (2014), one must decide which dataset is the destfile and the sourcefile (the terms in Abramitsky et al. (2014) code shared with us are childfile and adultfile, respectively). The authors indicated in correspondence that the their 2014 paper uses the full Census data as the destfile and the sample file as the sourcefile, so we report these estimates using the same ordering in the main text of this paper. For the LIFE-M and synthetic data, the 1940 Census is the destfile and the birth certificate sample is the sourcefile. For the Early Indicators data, we make the UA Army Data the sourcefile and the 1900 Census data the destfile. For the IPUMS data, we make the non-1880 data the sourcefile and the 1880 full Census the destfile. This appendix presents two alternate specifications, including 1. et al. s (2014) robustness check that restricts both datasets to individuals with unique within a two-year age radius (-2, -1, 0, 1, 2; described as a five-year band in the paper). This check is a restriction that creates an un name-age sample, which is similar to the Ferrie (1996) Online Appendix for How Well Do Automated Linking Methods Perform? 4

5 method. 2. An alternative ordering that treats the Census file as the destfile and the non-census year as the sourcefile. This second specification is not published elsewhere, but the authors are aware of it. We present it to emphasize the importance of order for the Iterative Method with the existing publiclyavailable STATA code.since the code pre-emptively drops all duplicates name-age matches in the sourcefile, making a Census file the sourcefile may incorrectly link individuals from the destfile to the second-best link in the sourcefile. Consider the following example. Suppose that a smaller sample has been made the destfile and a Census the sourcefile, and there is one John Smith born in Ohio who would have been age 25 in the later year (the destfile ) but in the sourcefile there are three John Smiths born in Ohio age 25 (record 1-3 in the example below) and one John Smith born in Ohio age 27 (record 4) (and no other John Smith observations in the age range of 23-27). Since the Census is the sourcefile, the Iterative Method pre-emptively drops all John Smiths who are age 25 in the sourcefile and would link in error the sample John Smith age 25 to the Census John Smith age 27 (record 4), even though presumably better matches existed that were ruled out with the first step. On the other hand, if the Census is the destfile and the smaller sample the sourcefile, the code would not match the John Smith observation in the sourcefile to any of the observations in the Census since multiple potential exact links exist. Example Linking Problem where Order Matters destfile sourcefile destfile Birthplace Age Record sourcefile Birthplace Name number Name Age 1 John Smith OH 25 John Smith OH 25 2 John Smith OH 25 3 John Smith OH 25 4 John Smith 27 Choices about which file is the sourcefile and destfile are, therefore, consequential for error rates and may be more consequential in settings where the sample files are generally much smaller than a population Online Appendix for How Well Do Automated Linking Methods Perform? 5

6 enumeration. It would be less consequential in settings where researchers are linking full populations to one another, because multiple John Smiths would likely appear in both files and, therefore, be eliminated as matches. Note that the robustness check with the two-year radius eliminates this problem: in our example, it would drop all of the John Smith observations in the sourcefile. The table A3.A and A3.B demonstrate how match and error rates change as with the choice of which file is the sourcefile. Online Appendix for How Well Do Automated Linking Methods Perform? 6

7 Appendix Table A3.A. Summary of Match and Error Rates for Linking with Census as destfile, by Ground Truth et al Baseline A. Match Rates B. Type I Error Rate (False Links) C. Type II Error Rate (Missed links) LIFE-M Synthetic EI IPUMS- LRS LIFE-M Synthetic EI IPUMS- LRS LIFE-M Synthetic EI Name NYSIIS SDX et al 2014 Robustness Check Name NYSIIS SDX IPUMS- LRS Appendix Table A3.B. Summary of Match and Error Rates for Linking with Census as sourcefile, by Ground Truth et al Baseline A. Match Rates B. Type I Error Rate (False Links) C. Type II Error Rate (Missed links) LIFE-M Synthetic EI IPUMS- LRS LIFE-M Synthetic EI IPUMS- LRS LIFE-M Synthetic EI Name NYSIIS SDX et al 2014 Robustness Check Name NYSIIS SDX IPUMS- LRS Notes: In Table A3.A, we treat the Census file as the childfile (or destfile ) in the code. In Table A3.B, we treat the Census file as the adultfile (or sourcefile ). EI stands for the Early Indicators data. Baseline refers to the primary matching algorithm used in. Robustness refers to specifications where both the original dataset and the data being linked to are limited to unique name combinations within a five-year age band prior to linkage. Online Appendix for How Well Do Automated Linking Methods Perform? 7

8 B. Description of LIFE-M Data The LIFE-M samples used in this paper are based on the first two years of hand-linking in this project. This section supplements the description in the paper. 1. Determining Sex Because it was the norm for girls to change their name at marriage and around 90 percent of surviving girls married in the middle of the 20 th century, we only attempt to link male infants to their records in the 1940 Census. In some cases, sex is missing or apparently incorrect. To determine which of these records were very likely to be boys, we generated an empirical distribution of sex and first using all vital records in the LIFE-M collection. We use this distribution to classify as male if there were at least 50 records with the name and at least 99 percent of the records with that name were female. If there was ambiguity about whether the infant was a boy, we did not attempt to link the record. This is why fewer than 50 percent of the records in Ohio (and especially North Carolina) were male. 2. Constructing Samples of Male Birth Certificates To construct this data for Ohio, we drew a random sample of 13,270 birth certificates for individuals born in Ohio from 1909 to 1920: a 2 percent sample of 1909 and a 1-percent sample each year between 1910 and Next, sets of parents for these infants were linked to parents of other infants using the first and birth name of both fathers and mothers (four distinct fields), thus recovering siblings for each sampled birth certificate. The final sample of reconstructed families consists of 53,721 children, 19,090 of which were determined to be boys (see discussion above). Appendix Figure C1 plots the distribution of age in 1940 for the 19,090 Ohio boys. To construct the North Carolina data, we drew a random sample of 23,073 birth certificates for individuals born in North Carolina from 1915 to (These sampling years differ from Ohio due to the incompleteness of the North Carolina birth data before 1915). We used the same process as in Ohio to link parents to identify siblings. This resulted in a total sample of 86,209 infants, 26,352 of whom we determined to be boys (see discussion above). Appendix Figure C1 shows the distribution of age in 1940 Online Appendix for How Well Do Automated Linking Methods Perform? 8

9 for these 26,352 boys. 3. Linking Birth Certificate Samples to the 1940 Census A final step linked the Ohio and North Carolina samples to the 1940 Census using the process described in the paper. During the hand-linkage process, each potential link was initially reviewed by two trainers in a double-blind review process. If these trainers agreed on whether to link or not to link a case, we take their decision as the truth. If they disagreed, an additional three additional trainers reviewed the record. We take records for which 4/5 of the trainers agreed as a match, the presumption being that one of the original trainers made an error. The first two data trainers disagreed 10 percent of the time for male infants in Ohio and 8.3 percent of the time for male infants in North Carolina. Ohio Births Appendix Figure B1. Distribution of Boys Age in 1940 North Carolina Births Age Age Online Appendix for How Well Do Automated Linking Methods Perform? 9

10 C. Representativeness Regression Results Appendix Table C1. Representativeness Results for LIFE-M LIFE-M Feigenbau m 2016 (Iowa Feigenbau m 2016 (estimated Day of Birth *** *** *** *** *** *** *** *** *** *** *** * *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Number of Siblings *** *** *** *** *** *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Length of Child's Name *** *** *** *** *** *** *** *** ** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Length of Father's Name *** *** *** *** ** *** *** ** *** *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Length of Mother's Name *** *** *** *** ** *** *** ** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Share of Observations with * ** * *** * ** Misspelled Mother's Name ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Share of Observations with *** ** *** *** ** *** *** ** *** *** * *** * *** ** Misspelled Father's Name ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Link in North Carolina *** *** *** *** *** *** *** *** *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Constant *** *** *** *** *** *** *** *** *** *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Observations 45,417 45,417 45,417 45,417 45,417 45,417 45,417 45,417 45,417 45,417 45,417 45,417 45,417 45,417 43,417 R-squared Wald Statistic Prob > W Note: Robust standard errors are in parentheses. Regressions are a test of representativeness in the LIFE-M data by regressing for all observations in the master data a dummy variable indicating whether or not an observation was matched on the variables included in this table. The Wald statistic at the end offers a test of whether or not matched status systematically relates to covariates. Online Appendix for How Well Do Automated Linking Methods Perform? 10

11 Appendix Table C2. Representativeness Results for Synthetic Data LIFE-M Feigenbau m 2016 (Iowa Feigenbau m 2016 (estimated Day of Birth ** * *** *** * *** *** ** ** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Number of Siblings *** *** *** *** *** *** *** *** *** ** * *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Length of Child's Name ** *** ** *** *** *** *** *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Length of Father's Name *** *** *** *** *** *** *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Length of Mother's Name *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Share of Observations with ** ** ** Misspelled Mother's Name ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Share of Observations with *** *** *** ** *** *** ** *** *** *** *** *** *** Misspelled Father's Name ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Link is in North Carolina *** *** * *** *** *** *** *** *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Constant *** *** *** *** *** *** *** *** *** *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Observations 45,417 45,417 45,417 45,417 45,417 45,417 45,417 45,417 45,417 45,417 45,417 45,417 45,417 45,417 43,417 R-squared Wald Statistic Prob > W Note: Robust standard errors in parentheses. Regressions are a test of representativeness in the synthetic data by regressing for all observations in the master data a dummy variable indicating whether or not an observation was matched on the variables included in this table. The Wald statistic at the end offers a test of whether or not matched status systematically relates to covariates. Online Appendix for How Well Do Automated Linking Methods Perform? 11

12 Appendix Table C3. Representativeness Results for Early Indicators Ferrie 1996 Ferrie Qian 2015 Qian (Iowa 2016 (estimated Age 0.01* * *** 0.020*** 0.019*** 0.013** 0.011* (0.005) (0.005) (0.005) (0.005) (0.005) (0.005) (0.005) (0.006) (0.006) (0.005) (0.007) (0.006) (0.006) (0.007) Is Currently Married 0.08** * (0.037) (0.037) (0.037) (0.038) (0.038) (0.038) (0.037) (0.038) (0.038) (0.038) (0.037) (0.036) (0.037) (0.043) Foreign Born ** *** ** (0.053) (0.052) (0.048) (0.053) (0.054) (0.054) (0.052) (0.053) (0.054) (0.053) (0.052) (0.051) (0.053) (0.060) Day of Birth * (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) Literate 0.15** 0.129** 0.100* *** 0.241*** 0.249*** Length of First Name Length of Second Name (0.062) (0.062) (0.060) (0.067) (0.067) (0.067) (0.065) (0.067) (0.067) (0.066) (0.067) (0.069) (0.064) (0.076) 0.02* 0.023*** 0.032*** * 0.023*** *** (0.008) (0.008) (0.008) (0.009) (0.009) (0.009) (0.008) (0.009) (0.009) (0.009) (0.009) (0.008) (0.008) (0.010) -0.03*** 0.020*** 0.023*** *** *** *** *** *** (0.007) (0.007) (0.007) (0.007) (0.007) (0.007) (0.007) (0.007) (0.007) (0.007) (0.007) (0.007) (0.007) (0.008) Mother is Foreign Born -0.12** ** ** *** *** (0.053) (0.053) (0.052) (0.053) (0.053) (0.055) (0.054) (0.053) (0.055) (0.052) (0.049) (0.050) (0.051) (0.059) Father is Foreign Born ** (0.051) (0.050) (0.050) (0.051) (0.050) (0.052) (0.052) (0.051) (0.052) (0.050) (0.047) (0.047) (0.048) (0.055) Year of Birth *** 0.019*** 0.018*** (0.005) (0.005) (0.005) (0.006) (0.006) (0.006) (0.006) (0.006) (0.006) (0.005) (0.007) (0.006) (0.007) (0.007) Constant ** *** *** (9.951) (10.399) (10.107) (10.558) (10.839) (10.935) (10.872) (11.465) (11.410) (10.400) (13.631) (12.049) (12.459) (13.009) Observations 1,774 1,774 1,774 1,774 1,774 1,774 1,774 1,774 1,774 1,774 1,774 1,774 1,774 1,343 R-squared Wald Statistic Prob > W Note: Robust standard errors in parentheses. Regressions are a test of representativeness in the Early Indicators data by regressing for all observations in the master data a dummy variable indicating whether or not an observation was matched on the variables included in this table. The Wald statistic at the end offers a test of whether or not matched status systematically relates to covariates. Note that the representativeness of the data are measured against the universe of high quality links stated by the Early Indicators project for the Oldest Old sample. Online Appendix for How Well Do Automated Linking Methods Perform? 12

13 Appendix Table C4. Representativeness Results for IPUMS-LRS IPUMS (Iowa 2016 (estimated Number of *** *** *** *** *** ** *** ** *** *** *** *** *** ** Siblings ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Age *** *** *** *** *** *** *** *** *** *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Lives with *** *** *** *** *** *** *** *** *** *** *** *** *** *** *** Mother ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Lives with *** *** *** *** *** *** *** *** *** *** *** *** *** *** *** Father ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Lives with *** *** *** *** *** *** *** *** *** *** *** *** *** *** *** Both Parents ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Father: Born ** *** *** *** *** *** *** *** *** *** *** *** Abroad ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Mother: Born *** *** *** *** *** *** *** *** *** *** *** *** *** *** *** Abroad ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Lives in *** *** *** *** *** *** *** *** *** ** *** *** Northeast ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Lives in *** *** *** *** *** *** *** *** *** *** *** *** *** *** Midwest ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Lives in West *** *** *** *** *** *** *** *** *** *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Farm Status *** *** *** *** *** *** *** *** *** *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Married *** *** *** *** *** *** ** *** *** *** ** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) White Collar *** *** *** *** *** *** *** *** *** *** *** *** *** *** *** Occupation ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Farming *** *** *** *** *** *** *** *** *** *** *** *** *** *** *** Occupation ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Semi-Skilled *** *** *** *** *** *** *** *** *** *** *** *** *** *** *** Occupation ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Online Appendix for How Well Do Automated Linking Methods Perform? 13

14 Appendix Table C4. Representativeness Results for IPUMS-LRS - Continued IPUMS (Iowa 2016 (estimated Unskilled *** *** *** *** *** *** *** *** *** *** *** *** *** *** *** Occupation ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) White *** *** *** *** *** *** *** *** *** *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Urban Status *** *** *** *** *** *** *** *** *** *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Born Abroad *** *** *** *** *** *** *** *** *** *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Constant *** *** *** ** *** ** ** *** *** *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Observations 32,959,605 32,959,605 32,959,605 32,959,605 32,959,605 32,959,605 32,959,605 32,959,605 32,959,605 32,959,605 32,959,605 32,959,605 32,959,605 32,959,605 32,959,605 R-squared Wald Statistic Prob > W Note: Robust standard errors in parentheses. Regressions are a test of representativeness in the IPUMS data by regressing for all observations in the final year data (e.g for matches from 1880 to 1900) a dummy variable indicating whether or not an observation was matched on the variables included in this table from that final year. The Wald statistic at the end offers a test of whether or not matched status systematically relates to covariates. The IPUMS sample sizes are different than other regressions because, in order to use the weights provided by IPUMS, we use the one percent sample for all matches involving 1880 as this sample was what was used to initially construct weights. Other than the 'IPUMS - Weighted' results, all other tests are unweighted and use the complete 1880 Census for matches from earlier years to the 1880 Census. Since we only match individuals who were plausibly 0-15 in the base year, we restrict attention here to men who would plausibly be that age in the final year (e.g. in matching from 1850 to 1880, we restrict attention to men aged in 1880). For concision, we stack all years for a single test. Additional results that break down test results by year are available by request. Online Appendix for How Well Do Automated Linking Methods Perform? 14

15 D. Correlation of Incorrect Links with Baseline Sample Characteristics Appendix Table D1. Correlation of Incorrect Links with Baseline Sample Characteristics in LIFE-M Data LIFE-M (Iowa 2016 (estimated Day of Birth *** *** * *** *** *** *** *** *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Number of Siblings * *** *** *** *** *** *** *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Length of Child's Name ** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Length of Father's Name *** *** *** *** * *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Length of Mother's Name *** *** *** *** *** *** ** *** *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Share of Observations with ** Misspelled Mother's Name ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Share of Observations with *** *** *** *** *** ** Misspelled Father's Name ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Link in North Carolina *** *** *** *** *** *** *** *** *** *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Constant *** *** *** *** *** *** *** *** *** *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Observations 19,391 13,865 11,762 8,494 19,703 19,572 17,656 17,646 17,894 16,722 30,520 34,553 37,145 21,310 19,323 R-squared Wald Statistic Prob > W Note: Robust standard errors in parentheses. Regressions are a test of correct matches in the LIFE-M data by regressing for matches by each matching strategy a dummy variable indicating whether or not an observation was correctly matched (as determined by the LIFE-M 'police line-up' review process described in the paper) on the variables included in this table. The Wald statistic at the end offers a test of whether or not correct matches compared to incorrect matches systematically relate to covariates. Online Appendix for How Well Do Automated Linking Methods Perform? 15

16 Appendix Table D2. Correlation of Incorrect Links with Baseline Sample Characteristics in Simulated Data (Iowa 2016 (estimated Day of Birth ** ** ** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Number of Siblings * *** ** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Length of Child's Name * * ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Length of Father's Name ** ** * ** *** * ** *** *** *** *** *** ** ** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Length of Mother's Name ** *** *** *** ** *** *** *** *** ** *** Share of Observations with Misspelled Mother's Name Share of Observations with Misspelled Father's Name Link is in North Carolina Constant ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) * ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) * ** *** *** * *** *** * *** *** *** ** ** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) *** ** *** *** *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) *** *** *** *** *** *** *** *** *** *** *** *** *** *** ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) Observations 14,107 12,254 9,287 19,383 19,818 18,722 17,316 18,196 17,849 30,852 35,720 39,368 25,020 26,146 R-squared Wald Statistic Prob > W Note: Robust standard errors in parentheses. Regressions are a test of correct matches in the synthetic data by regressing for matches by matching strategy a dummy variable indicating whether or not an observation was correctly matched (as determined by the process used to create the synthetic data initially) on the variables included in this table. The Wald statistic at the end offers a test of whether or not correct matches compared to incorrect matches systematically relate to covariates. Online Appendix for How Well Do Automated Linking Methods Perform? 16

17 Appendix Table D3. Correlation of Incorrect Links with Baseline Sample Characteristics in Early Indicators Ferrie 1996 Ferrie 1996 Ferrie Qian 2015 Qian (Iowa 2016 (estimated Age -0.02** ** *** *** *** *** *** *** ** * * *** ** (0.010) (0.010) (0.010) (0.008) (0.008) (0.008) (0.009) (0.009) (0.010) (0.010) (0.007) (0.009) (0.008) (0.008) Is Currently Married (0.044) (0.045) (0.051) (0.046) (0.045) (0.049) (0.046) (0.046) (0.049) (0.045) (0.044) (0.046) (0.037) (0.045) Foreign Born (0.078) (0.078) (0.091) (0.075) (0.074) (0.078) (0.077) (0.075) (0.078) (0.073) (0.068) (0.065) (0.059) (0.071) Day of Birth ** * ** ** * ** ** ** (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) Literate ** *** * *** ** * (0.106) (0.105) (0.115) (0.095) (0.088) (0.091) (0.101) (0.097) (0.096) (0.105) (0.101) (0.092) (0.076) (0.091) Length of First Name -0.02* * ** ** *** (0.012) (0.012) (0.013) (0.012) (0.012) (0.012) (0.012) (0.012) (0.011) (0.011) (0.011) (0.011) (0.010) (0.012) Length of Second Name ** ** *** ** (0.009) (0.010) (0.010) (0.009) (0.009) (0.009) (0.009) (0.009) (0.010) (0.009) (0.009) (0.009) (0.007) (0.008) Mother is Foreign Born * * (0.070) (0.073) (0.086) (0.068) (0.072) (0.073) (0.068) (0.075) (0.076) (0.064) (0.064) (0.065) (0.055) (0.066) Father is Foreign Born (0.055) (0.059) (0.072) (0.056) (0.060) (0.065) (0.055) (0.063) (0.068) (0.059) (0.059) (0.060) (0.048) (0.060) Year of Birth *** ** ** *** *** ** ** (0.010) (0.010) (0.010) (0.008) (0.008) (0.009) (0.009) (0.009) (0.010) (0.010) (0.007) (0.009) (0.008) (0.008) Constant *** ** ** *** *** ** ** * (19.173) (18.999) (19.516) (14.834) (15.231) (16.257) (17.144) (17.324) (19.130) (18.618) (14.057) (17.912) (15.552) (15.846) Observations ,124 1,198 1, R-squared Wald Statistic Prob > W Note: Robust standard errors in parentheses. Regressions are a test of correct matches in the Early Indicators data by regressing for matches a dummy variable indicating whether or not an observation was correctly matched (as determined by the matches from the Early Indicators data) on the variables included in this table. The Wald statistic at the end offers a test of whether or not correct matches compared to incorrect matches systematically relate to covariates. Online Appendix for How Well Do Automated Linking Methods Perform? 17

18 Appendix Table D4. Correlation of Incorrect Links with Baseline Sample Characteristics in IPUMS-LRS IP UMS (Iowa 2016 (estimated Number of *** *** *** *** *** *** *** *** *** *** *** *** *** *** Siblings (0.0007) (0.0010) (0.0010) (0.0011) (0.0009) (0.0008) (0.0009) (0.0009) (0.0009) (0.0009) (0.0008) (0.0007) (0.0007) (0.0010) (0.0013) Age * *** *** *** *** *** *** *** *** *** Lives with Mother Lives with Father Lives with Both Parents Father: Born Abroad Mother: Born Abroad Lives in Northeast Lives in Midwest Lives in West (0.0002) (0.0003) (0.0003) (0.0003) (0.0002) (0.0002) (0.0002) (0.0002) (0.0002) (0.0002) (0.0002) (0.0002) (0.0002) (0.0003) (0.0004) *** *** *** *** *** *** *** *** *** *** *** *** *** *** *** (0.0047) (0.0062) (0.0062) (0.0068) (0.0053) (0.0050) (0.0052) (0.0058) (0.0054) (0.0054) (0.0048) (0.0043) (0.0041) (0.0067) (0.0096) *** *** *** *** *** *** *** *** *** *** *** *** *** *** *** (0.0068) (0.0097) (0.0094) (0.0106) (0.0083) (0.0078) (0.0081) (0.0090) (0.0082) (0.0085) (0.0077) (0.0068) (0.0066) (0.0098) (0.0134) ** * *** *** * *** ** ** *** (0.0077) (0.0110) (0.0107) (0.0120) (0.0095) (0.0089) (0.0093) (0.0103) (0.0094) (0.0097) (0.0088) (0.0077) (0.0075) (0.0111) (0.0152) *** *** *** * *** *** *** *** (0.0041) (0.0064) (0.0064) (0.0074) (0.0059) (0.0056) (0.0059) (0.0063) (0.0059) (0.0062) (0.0054) (0.0048) (0.0046) (0.0081) (0.0115) *** *** *** *** *** *** *** *** *** *** *** *** *** *** (0.0045) (0.0067) (0.0067) (0.0078) (0.0062) (0.0058) (0.0062) (0.0066) (0.0061) (0.0065) (0.0057) (0.0050) (0.0047) (0.0084) (0.0119) *** *** *** *** *** *** *** *** *** *** *** *** *** *** (0.0032) (0.0041) (0.0041) (0.0045) (0.0035) (0.0034) (0.0035) (0.0039) (0.0036) (0.0036) (0.0033) (0.0030) (0.0029) (0.0048) (0.0065) *** *** *** *** *** *** *** *** *** *** *** *** *** *** *** (0.0033) (0.0041) (0.0041) (0.0045) (0.0036) (0.0034) (0.0035) (0.0040) (0.0036) (0.0036) (0.0033) (0.0029) (0.0028) (0.0046) (0.0063) ** *** *** *** *** *** *** *** *** *** *** *** *** *** *** (0.0064) (0.0079) (0.0077) (0.0080) (0.0069) (0.0064) (0.0063) (0.0077) (0.0068) (0.0066) (0.0064) (0.0056) (0.0052) (0.0099) (0.0132) Farm Status ** * ** ** ** *** * *** ** (0.0036) (0.0045) (0.0045) (0.0050) (0.0039) (0.0037) (0.0038) (0.0043) (0.0039) (0.0040) (0.0036) (0.0032) (0.0031) (0.0049) (0.0068) Married *** *** *** *** *** *** *** *** *** *** *** *** *** *** White Collar Occupation Farming Occupation Semi-Skilled Occupation (0.0040) (0.0046) (0.0046) (0.0049) (0.0039) (0.0036) (0.0036) (0.0043) (0.0039) (0.0038) (0.0034) (0.0030) (0.0028) (0.0058) (0.0084) *** *** *** *** *** *** *** *** *** *** *** *** *** * (0.0047) (0.0063) (0.0063) (0.0070) (0.0055) (0.0052) (0.0054) (0.0059) (0.0055) (0.0056) (0.0051) (0.0046) (0.0044) (0.0071) (0.0097) *** *** *** ** ** (0.0052) (0.0067) (0.0065) (0.0073) (0.0058) (0.0055) (0.0057) (0.0063) (0.0058) (0.0059) (0.0054) (0.0048) (0.0046) (0.0071) (0.0097) *** *** ** *** ** * *** ** ** ** ** (0.0046) (0.0058) (0.0058) (0.0064) (0.0050) (0.0047) (0.0049) (0.0054) (0.0050) (0.0051) (0.0046) (0.0041) (0.0039) (0.0064) (0.0089) Online Appendix for How Well Do Automated Linking Methods Perform? 18

LIFE-M. Longitudinal, Intergenerational Family Electronic Microdata

LIFE-M. Longitudinal, Intergenerational Family Electronic Microdata LIFE-M Longitudinal, Intergenerational Family Electronic Microdata Martha J. Bailey Professor of Economics and Research Professor, Population Studies Center University of Michigan What is LIFE-M? A large

More information

Best Practices for Automated Linking Using Historical Data: A Progress Report

Best Practices for Automated Linking Using Historical Data: A Progress Report Best Practices for Automated Linking Using Historical Data: A Progress Report Preliminary; Comments are welcome Ran Abramitzky 1 Leah Boustan 2 Katherine Eriksson 3 James Feigenbaum 4 Santiago Perez 5

More information

ONLINE APPENDIX: SUPPLEMENTARY ANALYSES AND ADDITIONAL ESTIMATES FOR. by Martha J. Bailey, Olga Malkova, and Zoë M. McLaren.

ONLINE APPENDIX: SUPPLEMENTARY ANALYSES AND ADDITIONAL ESTIMATES FOR. by Martha J. Bailey, Olga Malkova, and Zoë M. McLaren. ONLINE APPENDIX: SUPPLEMENTARY ANALYSES AND ADDITIONAL ESTIMATES FOR DOES ACCESS TO FAMILY PLANNING INCREASE CHILDREN S OPPORTUNITIES? EVIDENCE FROM THE WAR ON POVERTY AND THE EARLY YEARS OF TITLE X by

More information

Appendix 1: Sample Analogs of Average Direct and Indirect Effects

Appendix 1: Sample Analogs of Average Direct and Indirect Effects Online Appendix Stacey H. Chen, Yen-Chien Chen and Jin-Tan Liu Appendix 1: Sample Analogs of Average Direct and Indirect Effects Under the assumption of randomized sibling gender, we find two useful properties

More information

For Online Publication APPENDIX VII. UP FROM SLAVERY? AFRICAN AMERICAN INTERGENERATIONAL MOBILITY SINCE 1880

For Online Publication APPENDIX VII. UP FROM SLAVERY? AFRICAN AMERICAN INTERGENERATIONAL MOBILITY SINCE 1880 For Online Publication APPENDIX TO UP FROM SLAVERY? AFRICAN AMERICAN INTERGENERATIONAL MOBILITY SINCE 1880 APPENDIX I. APPENDIX II. DATA APPENDIX a. Construction of Samples i. Linked Sample Construction

More information

Measuring Multiple-Race Births in the United States

Measuring Multiple-Race Births in the United States Measuring Multiple-Race Births in the United States By Jennifer M. Ortman 1 Frederick W. Hollmann 2 Christine E. Guarneri 1 Presented at the Annual Meetings of the Population Association of America, San

More information

Follow your family using census records

Follow your family using census records Census records are one of the best ways to discover details about your family and how that family changed every 10 years. You ll discover names, addresses, what people did for a living, even which ancestor

More information

Socio-Economic Status and Names: Relationships in 1880 Male Census Data

Socio-Economic Status and Names: Relationships in 1880 Male Census Data 1 Socio-Economic Status and Names: Relationships in 1880 Male Census Data Rebecca Vick, University of Minnesota Record linkage is the process of connecting records for the same individual from two or more

More information

The Impact of the Great Migration on Mortality of African Americans: Evidence from the Deep South

The Impact of the Great Migration on Mortality of African Americans: Evidence from the Deep South The Impact of the Great Migration on Mortality of African Americans: Evidence from the Deep South Dan A. Black Seth G. Sanders Evan J. Taylor Lowell J. Taylor Online Appendix A. Selection of States Our

More information

Health, gender and mobility: Intergenerational correlations in longevity over time

Health, gender and mobility: Intergenerational correlations in longevity over time Health, gender and mobility: Intergenerational correlations in longevity over time John Parman September 17, 2017 Abstract Changes in intergenerational mobility over time have been the focus of extensive

More information

Completeness of Birth Registration

Completeness of Birth Registration Vol. 33 A,S Completeness of Birth Registration in the United States in 1940 ROBERT F. LENHART, M.S.P.A. Chief, Vital Statistics Consulting Service, Division of Vital Statistics, Bureau of the Census, Suitland,

More information

Schooling Opportunities and Intergenerational Educational Mobility in Turkey. An IV Estimation Using Census Data

Schooling Opportunities and Intergenerational Educational Mobility in Turkey. An IV Estimation Using Census Data Schooling Opportunities and Intergenerational Educational Mobility in Turkey. An IV Estimation Using Census Data AYÇA AKARÇAY-GÜRBÜZ & SEZGIN POLAT Department of Economics, Galatasaray University Economic

More information

Registry Publication 62

Registry Publication 62 Births, Deaths, Missing Persons Background The Civil Aviation (Births, Deaths and Missing Persons) Regulations 1948 1 place requirements on the pilot in command and owner of aircraft to report births deaths

More information

Labour Economics 16 (2009) Contents lists available at ScienceDirect. Labour Economics. journal homepage:

Labour Economics 16 (2009) Contents lists available at ScienceDirect. Labour Economics. journal homepage: Labour Economics 16 (2009) 451 460 Contents lists available at ScienceDirect Labour Economics journal homepage: www.elsevier.com/locate/labeco Can the one-drop rule tell us anything about racial discrimination?

More information

TED NAT! ONS. LIMITED ST/ECLA/Conf.43/ July 1972 ORIGINAL: ENGLISH. e n

TED NAT! ONS. LIMITED ST/ECLA/Conf.43/ July 1972 ORIGINAL: ENGLISH. e n BIBLIOTECA NACIONES UNIDAS MEXIGO TED NAT! ONS LIMITED ST/ECLA/Conf.43/1.4 11 July 1972 e n ORIGINAL: ENGLISH (»»«tiiitmiimmiimitmtiitmtmihhimtfimiiitiinihmihmiimhfiiim i infittititi m m ECONOMIC COMMISSION

More information

Census Records. P. J. Smith

Census Records. P. J. Smith Census Records P. J. Smith What is a census? Regularly occurring and official count of a particular population Apportioning Congressional representatives Apportioning taxes Provides statistics for planning

More information

A Metric-Based Machine Learning Approach to Genealogical Record Linkage

A Metric-Based Machine Learning Approach to Genealogical Record Linkage A Metric-Based Machine Learning Approach to Genealogical Record Linkage S. Ivie, G. Henry, H. Gatrell and C. Giraud-Carrier Department of Computer Science, Brigham Young University Abstract Genealogical

More information

Census Response Rate, 1970 to 1990, and Projected Response Rate in 2000

Census Response Rate, 1970 to 1990, and Projected Response Rate in 2000 Figure 1.1 Census Response Rate, 1970 to 1990, and Projected Response Rate in 2000 80% 78 75% 75 Response Rate 70% 65% 65 2000 Projected 60% 61 0% 1970 1980 Census Year 1990 2000 Source: U.S. Census Bureau

More information

Population and dwellings Number of people counted Total population

Population and dwellings Number of people counted Total population Henderson-Massey Local Board Area Population and dwellings Number of people counted Total population 107,685 people usually live in Henderson-Massey Local Board Area. This is an increase of 8,895 people,

More information

Supplementary Data for

Supplementary Data for Supplementary Data for Gender differences in obtaining and maintaining patent rights Kyle L. Jensen, Balázs Kovács, and Olav Sorenson This file includes: Materials and Methods Public Pair Patent application

More information

Using Administrative Records for Imputation in the Decennial Census 1

Using Administrative Records for Imputation in the Decennial Census 1 Using Administrative Records for Imputation in the Decennial Census 1 James Farber, Deborah Wagner, and Dean Resnick U.S. Census Bureau James Farber, U.S. Census Bureau, Washington, DC 20233-9200 Keywords:

More information

United Nations Demographic Yearbook Data Collection System

United Nations Demographic Yearbook Data Collection System United Nations Demographic Yearbook Data Collection System Adriana Skenderi United Nations Statistics Division United Nations Demographic Yearbook Mandated by ECOSOC in 1947 a publication of demographic

More information

THE SCOTTISH LONGITUDINAL STUDY Tracing rates and sample quality for the 1991 Census SLS sample

THE SCOTTISH LONGITUDINAL STUDY Tracing rates and sample quality for the 1991 Census SLS sample THE SCOTTISH LONGITUDINAL STUDY Tracing s and quality for the 1991 Census SLS LSCS Working Paper 2.0 October 2007 Lin Hattersley LSCS & General Register Office for Scotland Gillian Raab LSCS & University

More information

Year Census, Supas, Susenas CPS and DHS pre-2000 DHS Retro DHS 2007 Retro

Year Census, Supas, Susenas CPS and DHS pre-2000 DHS Retro DHS 2007 Retro levels and trends in Indonesia Over the last four decades Indonesia, like most countries in Asia, has undergone a major transition from high to low fertility. Where up to the 1970s had long born an average

More information

Scenario 5: Family Structure

Scenario 5: Family Structure Scenario 5: Family Structure Because human infants require the long term care and nurturing of adults before they can fend for themselves in often hostile environments, the family in some identifiable

More information

Spring 2017 Math 54 Test #2 Name:

Spring 2017 Math 54 Test #2 Name: Spring 2017 Math 54 Test #2 Name: You may use a TI calculator and formula sheets from the textbook. Show your work neatly and systematically for full credit. Total points: 101 1. (6) Suppose P(E) = 0.37

More information

Online Appendix. Intergenerational Mobility and the Informational Content of Surnames. José V. Rodríguez Mora. University of Edinburgh and CEPR

Online Appendix. Intergenerational Mobility and the Informational Content of Surnames. José V. Rodríguez Mora. University of Edinburgh and CEPR Online Appendix Intergenerational Mobility and the Informational Content of Surnames Maia Güell University of Edinburgh, CEP (LSE), CEPR & IZA José V. Rodríguez Mora University of Edinburgh and CEPR November

More information

Chapter 1: Economic and Social Indicators Comparison of BRICS Countries Chapter 2: General Chapter 3: Population

Chapter 1: Economic and Social Indicators Comparison of BRICS Countries Chapter 2: General Chapter 3: Population 1: Economic and Social Indicators Comparison of BRICS Countries 2: General 3: Population 3: Population 4: Economically Active Population 5: National Accounts 6: Price Indices 7: Population living standard

More information

USING CENSUS RECORDS IN GENEALOGICAL RESEARCH AN ONLINE COURSE

USING CENSUS RECORDS IN GENEALOGICAL RESEARCH AN ONLINE COURSE IN GENEALOGICAL RESEARCH AN ONLINE COURSE Syllabus An NGS Online Course IN GENEALOGICAL RESEARCH SYLLABUS Copyright 2009 National Genealogical Society 3108 Columbia Pike, Suite 300 Arlington, Virginia

More information

Estimates and Implications of the U.S. Census Undercount of the Native-Born Population. Janna E. Johnson PRELIMINARY.

Estimates and Implications of the U.S. Census Undercount of the Native-Born Population. Janna E. Johnson PRELIMINARY. Estimates and Implications of the U.S. Census Undercount of the Native-Born Population Janna E. Johnson Harris School of Public Policy University of Chicago jannaj@uchicago.edu PRELIMINARY August 24, 2012

More information

LINKING HISTORICAL CENSUSES: A NEW APPROACH STEVEN RUGGLES

LINKING HISTORICAL CENSUSES: A NEW APPROACH STEVEN RUGGLES LINKING HISTORICAL CENSUSES: A NEW APPROACH STEVEN RUGGLES This article describes a new initiative at the Minnesota Population Center (MPC) to create linked representative samples of individuals and family

More information

The main focus of the survey is to measure income, unemployment, and poverty.

The main focus of the survey is to measure income, unemployment, and poverty. HUNGARY 1991 - Documentation Table of Contents A. GENERAL INFORMATION B. POPULATION AND SAMPLE SIZE, SAMPLING METHODS C. MEASURES OF DATA QUALITY D. DATA COLLECTION AND ACQUISITION E. WEIGHTING PROCEDURES

More information

Use U.S. Census Information to Resolve Family History Research Problems

Use U.S. Census Information to Resolve Family History Research Problems Use U.S. Census Information to Resolve Family History Research Problems Using 1860-1900 migration patterns to find records 1 Using 1860-1900 migration patterns to find records Between 1860 and 1900 the

More information

Public Use Microdata Sample Files Data Note 1

Public Use Microdata Sample Files Data Note 1 Data Note 1 TECHNICAL NOTE ON SAME-SEX UNMARRIED PARTNER DATA FROM THE 1990 AND 2000 CENSUSES The release of data from the 2000 census has brought with it a number of analyses documenting change that has

More information

Date Range Propagation in Genealogical Databases

Date Range Propagation in Genealogical Databases Date Range Propagation in Genealogical Databases Randy Wilson FamilySearch.org Abstract.Genealogical data is rarely complete on a given individual in a particular source. A birth certificate, for example,

More information

Southern Africa Labour and Development Research Unit

Southern Africa Labour and Development Research Unit Southern Africa Labour and Development Research Unit Sampling methodology and field work changes in the october household surveys and labour force surveys by Andrew Kerr and Martin Wittenberg Working Paper

More information

2001 Population and Housing Census

2001 Population and Housing Census ISBN: 0-86976-625-2 REPUBLIC OF NAMIBIA 2001 Population and Housing Census Karas Region Basic Analysis with Highlights Central Bureau of Statistics National Planning Commission Private Bag 13356, Windhoek

More information

Divided Landscapes of Economic Opportunity: The Canadian Geography of Intergenerational Income Mobility

Divided Landscapes of Economic Opportunity: The Canadian Geography of Intergenerational Income Mobility Divided Landscapes of Economic Opportunity: The Canadian Geography of Intergenerational Income Mobility DATA APPENDIX REPLICATING THE RESULTS USING ONLY THE AGE COHORTS 16 TO 19 YEARS OF AGE IN 1986 Miles

More information

WRITING ABOUT THE DATA

WRITING ABOUT THE DATA WRITING ABOUT THE DATA 2nd TRAINING WORKSHOP Project to strengthen national capacity in producing and disseminating vital statistics from civil registration records in Asia and the Pacific Bangkok, Thailand,

More information

A gender perspective on the 2005 Census of Korea (R.O.K) Focusing on Economic Activity, and Living Expense of the Aged.

A gender perspective on the 2005 Census of Korea (R.O.K) Focusing on Economic Activity, and Living Expense of the Aged. GLOBAL FORUM ON GENDER STATISTICS ESA/STAT/AC.168/28 26-28 January 29 English only Accra, Ghana A gender perspective on the 25 Census of Korea (R.O.K) Focusing on Economic Activity, and Living Expense

More information

2010 Census Coverage Measurement - Initial Results of Net Error Empirical Research using Logistic Regression

2010 Census Coverage Measurement - Initial Results of Net Error Empirical Research using Logistic Regression 2010 Census Coverage Measurement - Initial Results of Net Error Empirical Research using Logistic Regression Richard Griffin, Thomas Mule, Douglas Olson 1 U.S. Census Bureau 1. Introduction This paper

More information

CONSTRUCTION OF SOCIAL CLASS VARIABLES

CONSTRUCTION OF SOCIAL CLASS VARIABLES ESRC Research Project on Education and Youth Transitions in England, Wales and Scotland, 1984-2002 Working Paper 4 CONSTRUCTION OF SOCIAL CLASS VARIABLES Linda Croxford Centre for Educational Sociology,

More information

Vincent Thomas Mule, Jr., U.S. Census Bureau, Washington, DC

Vincent Thomas Mule, Jr., U.S. Census Bureau, Washington, DC Paper SDA-06 Vincent Thomas Mule, Jr., U.S. Census Bureau, Washington, DC ABSTRACT As part of the evaluation of the 2010 Census, the U.S. Census Bureau conducts the Census Coverage Measurement (CCM) Survey.

More information

Digit preference in Iranian age data

Digit preference in Iranian age data Digit preference in Iranian age data Aida Yazdanparast 1, Mohamad Amin Pourhoseingholi 2, Aliraza Abadi 3 BACKGROUND: Data on age in developing countries are subject to errors, particularly in circumstances

More information

Student's height (in)

Student's height (in) Psych 315, Winter 2018, Homework 4 Answer Key Due Wednesday, January 31 either in section or in your TA s mailbox by 4pm. Name ID Section [AA Kit] [AB Kit] [AC Kelly] [AD Kelly] The scatterplot below plots

More information

NATIONAL SOCIAL ECONOMIC SURVEY (SUSENAS) 2002

NATIONAL SOCIAL ECONOMIC SURVEY (SUSENAS) 2002 Manual II.B NATIONAL SOCIAL ECONOMIC SURVEY (SUSENAS) 2002 MANUAL CORE SUPERVISOR Statistics-Indonesia (BPS), Jakarta- Indonesia CONTENTS CONTENTS I. PREFACE II. SUPERVISING PROCEDURES III. METHODOLOGY

More information

POWELL RIVER REGIONAL DISTRICT. And UNINCORPORATED AREAS AGGREGATED POPULATION PROJECTIONS to 2041

POWELL RIVER REGIONAL DISTRICT. And UNINCORPORATED AREAS AGGREGATED POPULATION PROJECTIONS to 2041 POWELL RIVER REGIONAL DISTRICT And UNINCORPORATED AREAS AGGREGATED POPULATION PROJECTIONS 2011 to 2041 By W. W. Munroe June 20, 2012 Page 1 of 17 POWELL RIVER REGIONAL DISTRICT And UNINCORPORATED AREAS

More information

IM M IG RAN TS AN D TH E IR CHILDREN, ^

IM M IG RAN TS AN D TH E IR CHILDREN, ^ 232 The Milbank Memorial Fund Quarterly proportion of the time, sampling fluctuations will yield samples in which the relationships between a control factor and the independent and dependent variables

More information

Zambia - Demographic and Health Survey 2007

Zambia - Demographic and Health Survey 2007 Microdata Library Zambia - Demographic and Health Survey 2007 Central Statistical Office (CSO) Report generated on: June 16, 2017 Visit our data catalog at: http://microdata.worldbank.org 1 2 Sampling

More information

Namibia - Demographic and Health Survey

Namibia - Demographic and Health Survey Microdata Library Namibia - Demographic and Health Survey 2006-2007 Ministry of Health and Social Services (MoHSS) Report generated on: June 16, 2017 Visit our data catalog at: http://microdata.worldbank.org

More information

Understanding and Using the U.S. Census Bureau s American Community Survey

Understanding and Using the U.S. Census Bureau s American Community Survey Understanding and Using the US Census Bureau s American Community Survey The American Community Survey (ACS) is a nationwide continuous survey that is designed to provide communities with reliable and

More information

Methods of Parentage Analysis in Natural Populations

Methods of Parentage Analysis in Natural Populations Methods of Parentage Analysis in Natural Populations Using molecular markers, estimates of genetic maternity or paternity can be achieved by excluding as parents all adults whose genotypes are incompatible

More information

The 1999 Population Census in the Republic of Kazakhstan CENSUS QUESTIONNAIRE 3C

The 1999 Population Census in the Republic of Kazakhstan CENSUS QUESTIONNAIRE 3C 1111111111 samples of letters and numbers 1111111111111 Approved by the Committee Of Statistics and Analysis No 20 of 29.06.98 The 1999 Population Census in the Republic of Kazakhstan Enumerators and other

More information

DATA APPENDIX TO UNDERSTANDING THE IMPACT OF IMMIGRATION ON CRIME

DATA APPENDIX TO UNDERSTANDING THE IMPACT OF IMMIGRATION ON CRIME DATA APPENDIX TO UNDERSTANDING THE IMPACT OF IMMIGRATION ON CRIME A. Crime Data All measures of crime are based on agency level data on the number of crimes reported to the police, as compiled by the Federal

More information

Population and dwellings Number of people counted Total population

Population and dwellings Number of people counted Total population Whakatane District Population and dwellings Number of people counted Total population 32,691 people usually live in Whakatane District. This is a decrease of 606 people, or 1.8 percent, since the 2006

More information

An Automated Record Linkage System - Linking 1871 Canadian census to 1881 Canadian Census

An Automated Record Linkage System - Linking 1871 Canadian census to 1881 Canadian Census An Automated Record Linkage System - Linking 1871 Canadian census to 1881 Canadian Census Luiza Antonie Peter Baskerville Kris Inwood Andrew Ross Abstract This paper describes a recently developed linkage

More information

Research Training Guide

Research Training Guide Research Training Guide Objective: To help library patrons and staff get a quick start with researching information on their ancestor in the library. You will be guided through a process of searching for

More information

Evaluation of the Completeness of Birth Registration in China Using Analytical Methods and Multiple Sources of Data (Preliminary draft)

Evaluation of the Completeness of Birth Registration in China Using Analytical Methods and Multiple Sources of Data (Preliminary draft) United Nations Expert Group Meeting on "Methodology and lessons learned to evaluate the completeness and quality of vital statistics data from civil registration" New York, 3-4 November 2016 Evaluation

More information

NATIONAL SOCIO- ECONOMIC SURVEY (SUSENAS) 2001 MANUAL HEAD OF PROVINCIAL, REGENCY/ MUNICIPALITY AND CORE SUPERVISOR/ EDITOR

NATIONAL SOCIO- ECONOMIC SURVEY (SUSENAS) 2001 MANUAL HEAD OF PROVINCIAL, REGENCY/ MUNICIPALITY AND CORE SUPERVISOR/ EDITOR Manual 1.A NATIONAL SOCIO- ECONOMIC SURVEY (SUSENAS) 2001 MANUAL HEAD OF PROVINCIAL, REGENCY/ MUNICIPALITY AND CORE SUPERVISOR/ EDITOR Statistics Indonesia (BPS), Jakarta- Indonesia CONTENTS CONTENTS I.

More information

East -West Population Institute. Accuracy of Age Data

East -West Population Institute. Accuracy of Age Data ON ESTIMATING ANNUAL BIRTH RATES FROM CENSUS DATA ON CHILDREN Lee -Jay Cho, East -West Population Institute and University of Hawaii I. INTRODUCTION For the majority of the world's population, the registration

More information

Chapter 12: Sampling

Chapter 12: Sampling Chapter 12: Sampling In all of the discussions so far, the data were given. Little mention was made of how the data were collected. This and the next chapter discuss data collection techniques. These methods

More information

CENSUS DATA. No. Rolls Jun 1840 M ,069, Jun 1850 M432 1,009 23,191, Jun 1860 M653 1,438 31,433,321

CENSUS DATA. No. Rolls Jun 1840 M ,069, Jun 1850 M432 1,009 23,191, Jun 1860 M653 1,438 31,433,321 CENSUS DATA No. Year Census Day NARA Series No. Rolls U.S. Population 1 1790 2 Aug 1790 T498 3 3,929,326 2 1800 4 Aug 1800 M32 52 5,308,483 3 1810 6 Aug 1810 M252 71 7,239,881 4 1820 7 Aug 1820 M33 142

More information

Distinguishing Identical Twins by Face Recognition

Distinguishing Identical Twins by Face Recognition Distinguishing Identical Twins by Face Recognition P. Jonathon Phillips, Patrick J. Flynn, Kevin W. Bowyer, Richard W. Vorder Bruegge, Patrick J. Grother, George W. Quinn, and Matthew Pruitt Abstract The

More information

The progress in the use of registers and administrative records. Submitted by the Department of Statistics of the Republic of Lithuania

The progress in the use of registers and administrative records. Submitted by the Department of Statistics of the Republic of Lithuania Working Paper No. 24 ENGLISH ONLY STATISTICAL COMMISSION and ECONOMIC COMMISSION FOR EUROPE STATISTICAL OFFICE OF THE EUROPEAN COMMUNITIES (EUROSTAT) CONFERENCE OF EUROPEAN STATISTICIANS Joint ECE/Eurostat

More information

Record Linkage between the 2006 Census of the Population and the Canadian Mortality Database

Record Linkage between the 2006 Census of the Population and the Canadian Mortality Database Proceedings of Statistics Canada Symposium 2016 Growth in Statistical Information: Challenges and Benefits Record Linkage between the 2006 Census of the Population and the Canadian Mortality Database Mohan

More information

Neighbourhood Profiles Census

Neighbourhood Profiles Census Neighbourhood Profiles - 2011 Census 35 Queen s This neighbourhood profile is based on custom area tabulations generated by Statistics Canada and contains data from the 2011 Census only. The 2011 National

More information

Health Record Linkage at Statistics Canada

Health Record Linkage at Statistics Canada Health Record Linkage at Statistics Canada www.statcan.gc.ca Telling Canada s story in numbers Nicole Aitken, Philippe Finès Statistics Canada Thursday, November 16 th 2017 Why use linked data? Harnessing

More information

Lao PDR - Multiple Indicator Cluster Survey 2006

Lao PDR - Multiple Indicator Cluster Survey 2006 Microdata Library Lao PDR - Multiple Indicator Cluster Survey 2006 Department of Statistics - Ministry of Planning and Investment, Hygiene and Prevention Department - Ministry of Health, United Nations

More information

Predicting Success, Preventing Failure: An Investigation of the California High School Exit Exam Technical Appendix

Predicting Success, Preventing Failure: An Investigation of the California High School Exit Exam Technical Appendix Predicting Success, Preventing Failure: An Investigation of the California High School Exit Exam Technical Appendix Andrew C. Zau Julian R. Betts Description This appendix contains tables that show the

More information

FamilySearch. When you sign into FamilySearch, your own personalized home page will appear. This page will consistently change.

FamilySearch. When you sign into FamilySearch, your own personalized home page will appear. This page will consistently change. 1 FamilySearch When you sign into FamilySearch, your own personalized home page will appear. This page will consistently change. 1. On the left, some may see the latest things that FamilySearch has created

More information

Order of the Founders of North America Lineage Documentation Guidelines 09/18/2012 A. General Application requirements. 1. Application completeness

Order of the Founders of North America Lineage Documentation Guidelines 09/18/2012 A. General Application requirements. 1. Application completeness Order of the Founders of North America Lineage Documentation Guidelines 09/18/2012 A. General Application requirements 1. Application completeness Documentation of applicant s biological bloodline ascent

More information

Founders and Survivors Linkage Strategy

Founders and Survivors Linkage Strategy Founders and Survivors Linkage Strategy John Bass, University of Tasmania Sandra Silcot, University of Melbourne Len Smith, Australian National University Founders and Survivors Prosopography Database

More information

The SCOTTISH LONGITUDINAL STUDY (SLS)

The SCOTTISH LONGITUDINAL STUDY (SLS) The SCOTTISH LONGITUDINAL STUDY (SLS) What is the SLS? The SLS is a large-scale, anonymised linkage study designed to capture 5.5% of the Scottish population Sample based on 20 semi-random birthdates It

More information

Family of Lou Vicie Pierce Wright Custer Smith

Family of Lou Vicie Pierce Wright Custer Smith Family of Lou Vicie Pierce Wright Custer Smith Lou Vicie PIERCE, b. 14 Oct 1854 in Georgia, i,ii,iii,iv,v,vi,vii,viii (daughter of Andrew Jackson PIERCE and Nancy ABERCROMBIE), census 1920 in Ahsahka District

More information

Task Specific Human Capital

Task Specific Human Capital Task Specific Human Capital Christopher Taber Department of Economics University of Wisconsin-Madison March 10, 2014 Outline Poletaev and Robinson Gathmann and Schoenberg Poletaev and Robinson Human Capital

More information

Family Size and Educational Attainment in England and Wales

Family Size and Educational Attainment in England and Wales Family Size and Educational Attainment in England and Wales Tak Wing Chan 1 Morag Henderson 2 Rachel Stuchbury 3 1 Department of Sociology, University of Warwick 2 Institute of Education, University College

More information

Get Your Census Worth: Using the Census as a Research Tool

Get Your Census Worth: Using the Census as a Research Tool Get Your Census Worth: Using the Census as a Research Tool INTRODUCTION Noted genealogist and author Val D. Greenwood said that, there is probably no other single group of records in existence which contain

More information

Statistical Methods in Computer Science

Statistical Methods in Computer Science Statistical Methods in Computer Science Experiment Design Gal A. Kaminka galk@cs.biu.ac.il Experimental Lifecycle Vague idea groping around experiences Initial observations Model/Theory Data, analysis,

More information

Sampling Techniques. 70% of all women married 5 or more years have sex outside of their marriages.

Sampling Techniques. 70% of all women married 5 or more years have sex outside of their marriages. Sampling Techniques Introduction In Women and Love: A Cultural Revolution in Progress (1987) Shere Hite obtained several impacting results: 84% of women are not satisfied emotionally with their relationships.

More information

Guyana - Multiple Indicator Cluster Survey 2014

Guyana - Multiple Indicator Cluster Survey 2014 Microdata Library Guyana - Multiple Indicator Cluster Survey 2014 United Nations Children s Fund, Guyana Bureau of Statistics, Guyana Ministry of Public Health Report generated on: December 1, 2016 Visit

More information

Thomas Turner Presley Descendant Information. First Generation

Thomas Turner Presley Descendant Information. First Generation Thomas Turner Presley Descendant Information First Generation 1. Thomas Turner 1 Presley was a son of Barney Harrison Presley and Malinda Morgan. He was born about 1833 in Chester County, South Carolina.

More information

Why Randomize? Jim Berry Cornell University

Why Randomize? Jim Berry Cornell University Why Randomize? Jim Berry Cornell University Session Overview I. Basic vocabulary for impact evaluation II. III. IV. Randomized evaluation Other methods of impact evaluation Conclusions J-PAL WHY RANDOMIZE

More information

Overview of Civil Registration and Vital Statistics systems

Overview of Civil Registration and Vital Statistics systems Overview of Civil Registration and Vital Statistics systems Training Workshop on CRVS ESCAP, Bangkok 9-13 January 2016 Helge Brunborg Statistics Norway Helge.Brunborg@gmail.com Outline Civil Registration

More information

Mathematics (Project Maths)

Mathematics (Project Maths) 2010. M128 S Coimisiún na Scrúduithe Stáit State Examinations Commission Leaving Certificate Examination Sample Paper Mathematics (Project Maths) Paper 2 Ordinary Level Time: 2 hours, 30 minutes 300 marks

More information

TURKISH STATISTICAL INSTITUTE

TURKISH STATISTICAL INSTITUTE VITAL STATISTICS Birth statistics Death statistics Marriage statistics Divorce statistics Vital Statistics Coverage Country wide Data Collection System Administrative registers Data sources MERNIS (The

More information

Review Questions on Ch4 and Ch5

Review Questions on Ch4 and Ch5 Review Questions on Ch4 and Ch5 1. Find the mean of the distribution shown. x 1 2 P(x) 0.40 0.60 A) 1.60 B) 0.87 C) 1.33 D) 1.09 2. A married couple has three children, find the probability they are all

More information

The Impact of Technological Change within the Home

The Impact of Technological Change within the Home Dissertation Summaries 539 American Economic Review American Economic Review 96, no. 2 (2006): 1 21. Goldin, Claudia D., and Robert A. Margo. The Great Compression: The Wage Structure in the United States

More information

Panel Study of Income Dynamics: Mortality File Documentation. Release 1. Survey Research Center

Panel Study of Income Dynamics: Mortality File Documentation. Release 1. Survey Research Center Panel Study of Income Dynamics: 1968-2015 Mortality File Documentation Release 1 Survey Research Center Institute for Social Research The University of Michigan Ann Arbor, Michigan December, 2016 The 1968-2015

More information

MATRIX SAMPLING DESIGNS FOR THE YEAR2000 CENSUS. Alfredo Navarro and Richard A. Griffin l Alfredo Navarro, Bureau of the Census, Washington DC 20233

MATRIX SAMPLING DESIGNS FOR THE YEAR2000 CENSUS. Alfredo Navarro and Richard A. Griffin l Alfredo Navarro, Bureau of the Census, Washington DC 20233 MATRIX SAMPLING DESIGNS FOR THE YEAR2000 CENSUS Alfredo Navarro and Richard A. Griffin l Alfredo Navarro, Bureau of the Census, Washington DC 20233 I. Introduction and Background Over the past fifty years,

More information

Evidence and Enquiry: Using the 1901 and 1911 census forms in the History classroom, Examining the 1911 census record of Edward Bailey

Evidence and Enquiry: Using the 1901 and 1911 census forms in the History classroom, Examining the 1911 census record of Edward Bailey Evidence and Enquiry: Using the 1901 and 1911 census forms in the History classroom, 2016 Examining the 1911 census record of Edward Bailey Section One: Locating the census record. To find the census form

More information

Census - General info

Census - General info By Clint Williams Quitta family Census - General info Censuses are available from 1790-1940 in ten year increments (except for 1890 and a few other burned or lost records). Note that the most useful censuses

More information

1990 Census Measures. Fast Track Project Technical Report Patrick S. Malone ( ; 9-May-00

1990 Census Measures. Fast Track Project Technical Report Patrick S. Malone ( ; 9-May-00 1990 Census Measures Fast Track Project Technical Report Patrick S. Malone (919-668-6910; malone@alumni.duke.edu) 9-May-00 Table of Contents I. Scale Description II. Report Sample III. Scaling IV. Differences

More information

Using the FamilySearch Family Tree (23 March 2012)

Using the FamilySearch Family Tree (23 March 2012) Using the FamilySearch Family Tree (23 March 2012) 2012 by Intellectual Reserve, Inc. All rights reserved Printed in the United States of America Published by FamilySearch, International Salt Lake City,

More information

DATA VALIDATION-I Evaluation of editing and imputation

DATA VALIDATION-I Evaluation of editing and imputation DATA VALIDATION-I Evaluation of editing and imputation Census processing overview Steps of data processing depend on the technology used in general, the process covers the following steps: Preparati on

More information

WORLD HEALTH ORGANIZATION - Questionnaire on mortality data

WORLD HEALTH ORGANIZATION - Questionnaire on mortality data WORLD HEALTH ORGANIZATION - Questionnaire on mortality data This questionnaire consists of two sections: the first section deals with overall mortality regardless of causes of death while the second section

More information

Economic and Social Council

Economic and Social Council United Nations Economic and Social Council Distr.: General 18 December 2017 Original: English Statistical Commission Forty-ninth session 6 9 March 2018 Item 4 (a) of the provisional agenda* Items for information:

More information

Do Grandparents and Great-Grandparents Matter? Multigenerational Mobility in the US,

Do Grandparents and Great-Grandparents Matter? Multigenerational Mobility in the US, Do Grandparents and Great-Grandparents Matter? Multigenerational Mobility in the US, 1910-2013 By JOSEPH FERRIE, CATHERINE MASSEY, AND JONATHAN ROTHBAUM* Abstract Studies of US intergenerational mobility

More information

Adjusting for linkage errors to analyse coverage of the Integrated Data Infrastructure (IDI) and the administrative population (IDI-ERP)

Adjusting for linkage errors to analyse coverage of the Integrated Data Infrastructure (IDI) and the administrative population (IDI-ERP) Adjusting for linkage errors to analyse coverage of the Integrated Data Infrastructure (IDI) and the administrative population (IDI-ERP) Hochang Choi, Statistical Analyst, Stats NZ Paper prepared for the

More information

Indonesia - Demographic and Health Survey 2007

Indonesia - Demographic and Health Survey 2007 Microdata Library Indonesia - Demographic and Health Survey 2007 Central Bureau of Statistics (Badan Pusat Statistik (BPS)) Report generated on: June 16, 2017 Visit our data catalog at: http://microdata.worldbank.org

More information

Guide to the Genealogy Search

Guide to the Genealogy Search Guide to the Genealogy Search Introduction This guide provides an overview on how to look up records on the Genealogy Search. It covers the three different search options (General, Browse and Keyword searches),

More information