Imputation of Area-Level Covariates by Registry Linking

J. Sunil Rao, Jie Fan

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

Social epidemiological research has long studied the impact of social determinants of place on health outcomes. Geocoding is a well-known technique for extracting such information by mapping geographical location to census tract and then extracting relevant information from tract-level databases. However, sometimes location information is unknown. This is often the case when using many of today's public databases (e.g., genomic data repositories). For some diseases such as cancer, statewide registries exist which provide a strategy for building a linking model between analysis observations and a reference sample drawn from the registry using variables in common to both. We detail this methodology and then show how to use this linking model together with classified mixed model prediction to impute area-level covariates for analysis observations. We study empirical performance via a series of simulations, and then perform predictive geocoding on colon cancer patients drawn (both analysis and reference samples) from the Florida Cancer Data Systems registry.

Original languageEnglish (US)
Title of host publicationHandbook of Statistics
PublisherElsevier B.V.
Pages3-21
Number of pages19
DOIs
StatePublished - 2017
Externally publishedYes

Publication series

NameHandbook of Statistics
Volume37
ISSN (Print)0169-7161

Bibliographical note

Funding Information:
J.S.R. is partially supported by NSF grant SES-1122399 and NIH grants R01GM085205A1 and UL1TR000460. J.F. is partially supported by a graduate research fellowship from the Sylvester Comprehensive Cancer Center (SCCC) at the University of Miami Miller School of Medicine. Dr. J.S.R. would like to thank Dr. Erin Kobetz at the University of Miami for making him aware of the role of geocoding in social epidemiology research, and for providing us access to the FCDS data. The Florida cancer incidence data used in this report were collected by the Florida Cancer Data system under contract with the Department of Health (DOH). The views expressed herein are solely those of the author(s), and do not necessarily reflect those of the contractor of DOH.

Publisher Copyright:
© 2017 Elsevier B.V.

Keywords

  • Area-level covariates
  • Cancer registries
  • Census tracts
  • Geocoding
  • Imputation
  • Mixed models
  • Prediction
  • Spatial data

Fingerprint

Dive into the research topics of 'Imputation of Area-Level Covariates by Registry Linking'. Together they form a unique fingerprint.

Cite this