Challenges and methods of international census harmonization

Albert Esteve, Matt Sobek

Research output: Contribution to journalArticle

22 Citations (Scopus)

Abstract

The development of IPUMS-International involves harmonizing data from different national statistical offices created over several decades. The original samples vary in quality and have different data formats and variable coding schemes. The authors describe the methods developed to deal with the challenges posed by such diversity and unevenness. The first stage of harmonization involves standardizing the data formats and correcting errors. Diagnostic routines analyze each data set, and custom computer programs modify the different data structures into a single standard format. The second stage of the work centers on harmonizing the codes for all variables shared across data sets, including the compilation and integration of all the relevant documentation.

Original languageEnglish (US)
Pages (from-to)66-79
Number of pages14
JournalHistorical Methods
Volume36
Issue number2
DOIs
StatePublished - Jan 1 2003

Fingerprint

Harmonization
Census
Documentation
Diagnostics
Compilation
Coding Scheme

Keywords

  • Census
  • Demography
  • IPUMS
  • International
  • Microdata
  • Population

Cite this

Challenges and methods of international census harmonization. / Esteve, Albert; Sobek, Matt.

In: Historical Methods, Vol. 36, No. 2, 01.01.2003, p. 66-79.

Research output: Contribution to journalArticle

@article{2ab21d2885ff4dd9a5518c654171693d,
title = "Challenges and methods of international census harmonization",
abstract = "The development of IPUMS-International involves harmonizing data from different national statistical offices created over several decades. The original samples vary in quality and have different data formats and variable coding schemes. The authors describe the methods developed to deal with the challenges posed by such diversity and unevenness. The first stage of harmonization involves standardizing the data formats and correcting errors. Diagnostic routines analyze each data set, and custom computer programs modify the different data structures into a single standard format. The second stage of the work centers on harmonizing the codes for all variables shared across data sets, including the compilation and integration of all the relevant documentation.",
keywords = "Census, Demography, IPUMS, International, Microdata, Population",
author = "Albert Esteve and Matt Sobek",
year = "2003",
month = "1",
day = "1",
doi = "10.1080/01615440309601216",
language = "English (US)",
volume = "36",
pages = "66--79",
journal = "Historical Methods",
issn = "0161-5440",
publisher = "Routledge",
number = "2",

}

TY - JOUR

T1 - Challenges and methods of international census harmonization

AU - Esteve, Albert

AU - Sobek, Matt

PY - 2003/1/1

Y1 - 2003/1/1

N2 - The development of IPUMS-International involves harmonizing data from different national statistical offices created over several decades. The original samples vary in quality and have different data formats and variable coding schemes. The authors describe the methods developed to deal with the challenges posed by such diversity and unevenness. The first stage of harmonization involves standardizing the data formats and correcting errors. Diagnostic routines analyze each data set, and custom computer programs modify the different data structures into a single standard format. The second stage of the work centers on harmonizing the codes for all variables shared across data sets, including the compilation and integration of all the relevant documentation.

AB - The development of IPUMS-International involves harmonizing data from different national statistical offices created over several decades. The original samples vary in quality and have different data formats and variable coding schemes. The authors describe the methods developed to deal with the challenges posed by such diversity and unevenness. The first stage of harmonization involves standardizing the data formats and correcting errors. Diagnostic routines analyze each data set, and custom computer programs modify the different data structures into a single standard format. The second stage of the work centers on harmonizing the codes for all variables shared across data sets, including the compilation and integration of all the relevant documentation.

KW - Census

KW - Demography

KW - IPUMS

KW - International

KW - Microdata

KW - Population

UR - http://www.scopus.com/inward/record.url?scp=0043232785&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0043232785&partnerID=8YFLogxK

U2 - 10.1080/01615440309601216

DO - 10.1080/01615440309601216

M3 - Article

VL - 36

SP - 66

EP - 79

JO - Historical Methods

JF - Historical Methods

SN - 0161-5440

IS - 2

ER -