Quantifying the effect of data quality on the validity of an eMeasure

Research output: Contribution to journal, Article

2 Citations (Scopus)

Abstract

Objective: The objective of this study was to demonstrate the utility of a healthcare data quality framework by using it to measure the impact of synthetic data quality issues on the validity of an eMeasure (CMS178, urinary catheter removal after surgery).

Methods: Data quality issues were artificially created by systematically degrading the underlying quality of EHR data using two methods: independent and correlated degradation. A linear model that describes the change in the events included in the eMeasure quantifies the impact of each data quality issue.

Results: Catheter duration had the most impact on the CMS178 eMeasure, with every 1% reduction in data quality causing a 1.21% increase in the number of missing events. For birth date and admission type, every 1% reduction in data quality resulted in a 1% increase in missing events.

Conclusion: This research demonstrated that the impact of data quality issues can be quantified using a generalized process and that the CMS178 eMeasure, as currently defined, may not measure how well an organization is meeting the intended best practice goal. Secondary use of EHR data is warranted only if the data are of sufficient quality. The assessment approach described in this study demonstrates how the impact of data quality issues on an eMeasure can be quantified, and the approach can be generalized for other data analysis tasks. Healthcare organizations can prioritize data quality improvement efforts to focus on the areas that will have the most impact on validity and assess whether the values that are reported should be trusted.
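The degrade-and-fit procedure described in the Methods can be illustrated with a minimal Python sketch. It is not the authors' implementation: the record layout, the field names (`birth_date`, `admission_type`, `catheter_duration`), the choice to model a data quality issue as a blanked value, and the helper names `degrade`, `count_events` and `fit_slope` are all assumptions made for illustration. Only independent degradation of a single field is shown, and the sketch makes no attempt to reproduce the 1.21% figure reported for catheter duration.

```python
import random


def degrade(records, field, fraction, rng):
    """Return a copy of `records` with `field` blanked in a random `fraction` of rows.

    Blanking a value is an assumed stand-in for a data quality issue.
    """
    n_bad = round(len(records) * fraction)
    bad_idx = set(rng.sample(range(len(records)), n_bad))
    return [
        {**rec, field: None} if i in bad_idx else rec
        for i, rec in enumerate(records)
    ]


def count_events(records, required_fields):
    """Count records that still carry every field the (hypothetical) measure needs."""
    return sum(
        all(rec[f] is not None for f in required_fields) for rec in records
    )


def fit_slope(xs, ys):
    """Ordinary least-squares slope of ys regressed on xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


def missing_event_slope(records, field, required_fields, levels, seed=0):
    """Percent of events lost per percent of `field` values degraded."""
    rng = random.Random(seed)
    baseline = count_events(records, required_fields)
    pct_missing = []
    for level in levels:
        degraded = degrade(records, field, level, rng)
        kept = count_events(degraded, required_fields)
        pct_missing.append(100.0 * (baseline - kept) / baseline)
    return fit_slope([100.0 * level for level in levels], pct_missing)


# Synthetic, fully populated records (invented values, not study data).
FIELDS = ["birth_date", "admission_type", "catheter_duration"]
records = [
    {"birth_date": "1960-01-01", "admission_type": "elective", "catheter_duration": 1}
    for _ in range(1000)
]

slope = missing_event_slope(
    records, "birth_date", FIELDS, levels=[0.0, 0.05, 0.10, 0.20]
)
# Each blanked value removes exactly one event here, so the slope is about 1.
print(f"missing events per 1% degradation: {slope:.2f}%")
```

In this toy setup every record starts complete and each blanked value excludes exactly one event, so the fitted slope is about 1. A slope above 1, such as the one the abstract reports for catheter duration, would require a measure in which one degraded value can affect more than one event, which this sketch does not model.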

Original language: English (US)
Pages (from-to): 1012-1021
Number of pages: 10
Journal: Applied Clinical Informatics
Volume: 8
Issue number: 4
DOI: 10.4338/ACI-2017-03-RA-0042
State: Published - Jan 1 2017

Fingerprint

Catheters
Surgery
Degradation
Data Accuracy
Delivery of Health Care
Quality Improvement
Practice Guidelines
Linear Models
Parturition

Keywords

  • Data quality
  • Data quality assessment
  • Electronic health record
  • Ontology
  • Quality

Cite this

Quantifying the effect of data quality on the validity of an eMeasure. / Johnson, Steve; Speedie, Stuart M; Simon, Gyorgy J; Kumar, Vipin; Westra, Bonnie L.

In: Applied clinical informatics, Vol. 8, No. 4, 01.01.2017, p. 1012-1021.

@article{ba371e97961b42ccaf588cc9e79c35a6,
title = "Quantifying the effect of data quality on the validity of an eMeasure",
abstract = "Objective The objective of this study was to demonstrate the utility of a healthcare data quality framework by using it to measure the impact of synthetic data quality issues on the validity of an eMeasure (CMS178—urinary catheter removal after surgery). Methods Data quality issues were artificially created by systematically degrading the underlying quality of EHR data using two methods: independent and correlated degradation. A linear model that describes the change in the events included in the eMeasure quantifies the impact of each data quality issue. Results Catheter duration had the most impact on the CMS178 eMeasure with every 1{\%} reduction in data quality causing a 1.21{\%} increase in the number of missing events. For birth date and admission type, every 1{\%} reduction in data quality resulted in a 1{\%} increase in missing events. Conclusion This research demonstrated that the impact of data quality issues can be quantified using a generalized process and that the CMS178 eMeasure, as currently defined, may not measure how well an organization is meeting the intended best practice goal. Secondary use of EHR data is warranted only if the data are of sufficient quality. The assessment approach described in this study demonstrates how the impact of data quality issues on an eMeasure can be quantified and the approach can be generalized for other data analysis tasks. Healthcare organizations can prioritize data quality improvement efforts to focus on the areas that will have the most impact on validity and assess whether the values that are reported should be trusted.",
keywords = "Data quality, Data quality assessment, Electronic health record, Ontology, Quality",
author = "Johnson, Steve and Speedie, {Stuart M} and Simon, {Gyorgy J} and Kumar, Vipin and Westra, {Bonnie L}",
year = "2017",
month = "1",
day = "1",
doi = "10.4338/ACI-2017-03-RA-0042",
language = "English (US)",
volume = "8",
pages = "1012--1021",
journal = "Applied Clinical Informatics",
issn = "1869-0327",
publisher = "Schattauer GmbH",
number = "4",
}

TY - JOUR

T1 - Quantifying the effect of data quality on the validity of an eMeasure

AU - Johnson, Steve

AU - Speedie, Stuart M

AU - Simon, Gyorgy J

AU - Kumar, Vipin

AU - Westra, Bonnie L

PY - 2017/1/1

Y1 - 2017/1/1

AB - Objective The objective of this study was to demonstrate the utility of a healthcare data quality framework by using it to measure the impact of synthetic data quality issues on the validity of an eMeasure (CMS178—urinary catheter removal after surgery). Methods Data quality issues were artificially created by systematically degrading the underlying quality of EHR data using two methods: independent and correlated degradation. A linear model that describes the change in the events included in the eMeasure quantifies the impact of each data quality issue. Results Catheter duration had the most impact on the CMS178 eMeasure with every 1% reduction in data quality causing a 1.21% increase in the number of missing events. For birth date and admission type, every 1% reduction in data quality resulted in a 1% increase in missing events. Conclusion This research demonstrated that the impact of data quality issues can be quantified using a generalized process and that the CMS178 eMeasure, as currently defined, may not measure how well an organization is meeting the intended best practice goal. Secondary use of EHR data is warranted only if the data are of sufficient quality. The assessment approach described in this study demonstrates how the impact of data quality issues on an eMeasure can be quantified and the approach can be generalized for other data analysis tasks. Healthcare organizations can prioritize data quality improvement efforts to focus on the areas that will have the most impact on validity and assess whether the values that are reported should be trusted.

KW - Data quality

KW - Data quality assessment

KW - Electronic health record

KW - Ontology

KW - Quality

UR - http://www.scopus.com/inward/record.url?scp=85032806544&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85032806544&partnerID=8YFLogxK

U2 - 10.4338/ACI-2017-03-RA-0042

DO - 10.4338/ACI-2017-03-RA-0042

M3 - Article

C2 - 29241241

AN - SCOPUS:85032806544

VL - 8

SP - 1012

EP - 1021

JO - Applied Clinical Informatics

JF - Applied Clinical Informatics

SN - 1869-0327

IS - 4

ER -