A system for efficient cleaning and transformation of geospatial data attributes

Yao Yi Chiang, Bo Wu, Akshay Anand, Ketan Akade, Craig A. Knoblock

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

4 Scopus citations

Abstract

A significant challenge in handling geographic datasets is that the datasets can come from heterogeneous sources with various data qualities and formats. Before these datasets can be used in a Geographic Information System (GIS) for spatial analysis or to create maps, a typical task is to clean the attribute data and transform the data into a uniform format. However, conventional GIS products focus on manipulating the spatial component of geographic features and only offer basic tools for editing the attribute data (e.g., one row at a time). This limits the capability for handling large datasets in a GIS since manually editing and transforming attribute data between different formats is not practical for thousands of geographic features. In this demo, we present ArcKarma, which is built on our previous work on data transformation, to efficiently clean and transform data attributes in a GIS. ArcKarma generates transformation programs from a few user-provided examples and applies these programs to transform individual attribute columns into the desired formats. We show that ArcKarma produces accurate results and eliminates the need for laborious manual data cleaning and scripting tasks.

Original language: English (US)
Title of host publication: 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2014
Editors: Markus Schneider, Michael Gertz, Yan Huang, Jagan Sankaranarayanan, John Krumm
Publisher: Association for Computing Machinery
Pages: 577-580
Number of pages: 4
ISBN (Electronic): 9781450331319
State: Published - Nov 4 2014
Externally published: Yes
Event: 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2014 - Dallas, United States
Duration: Nov 4 2014 – Nov 7 2014

Publication series

Name: GIS: Proceedings of the ACM International Symposium on Advances in Geographic Information Systems
Volume: 04-07-November-2014

Other

Other: 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2014
Country/Territory: United States
City: Dallas
Period: 11/4/14 – 11/7/14

Bibliographical note

Publisher Copyright:
© Copyright 2014 ACM.

Keywords

  • Data Cleaning
  • Data Transformation
  • Geographic Information System
