Abstract
A significant challenge in handling geographic datasets is that the datasets can come from heterogeneous sources with various data qualities and formats. Before these datasets can be used in a Geographic Information System (GIS) for spa-tial analysis or to create maps, a typical task is to clean the attribute data and transform the data into a uniform format. However, conventional GIS products focus on manipulating the spatial component of geographic features and only offer basic tools for editing the attribute data (e.g., one row at a time). This limits the capability for handling large datasets in a GIS since manually editing and transforming attribute data between different formats is not practical for thousands of geographic features. In this demo, we present ArcKarma, which is built on our previous work on data transforma-tion, to efficiently clean and transform data attributes in a GIS. ArcKarma generates transformation programs from a few user-provided examples and applies these programs to transform individual attribute columns into the desired for-mats. We show that ArcKarma produces accurate results and eliminates the need for laborious manual data cleaning and scripting tasks.
Original language | English (US) |
---|---|
Title of host publication | 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2014 |
Editors | Markus Schneider, Michael Gertz, Yan Huang, Jagan Sankaranarayanan, John Krumm |
Publisher | Association for Computing Machinery |
Pages | 577-580 |
Number of pages | 4 |
ISBN (Electronic) | 9781450331319 |
DOIs | |
State | Published - Nov 4 2014 |
Externally published | Yes |
Event | 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2014 - Dallas, United States Duration: Nov 4 2014 → Nov 7 2014 |
Publication series
Name | GIS: Proceedings of the ACM International Symposium on Advances in Geographic Information Systems |
---|---|
Volume | 04-07-November-2014 |
Other
Other | 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2014 |
---|---|
Country/Territory | United States |
City | Dallas |
Period | 11/4/14 → 11/7/14 |
Bibliographical note
Publisher Copyright:© Copyright 2014 ACM.
Keywords
- Data Cleaning
- Data Trans-Formation
- Geographic Information System