Terra Populus’ architecture for integrated big geospatial services

David Haynes, Steve Manson, Eric Shook

Research output: Contribution to journal (Article)

8 Citations (Scopus)

Abstract

Big geospatial data is an emerging sub-area of geographic information science, big data, and cyberinfrastructure. Big geospatial data poses two unique challenges. First, raster and vector data structures and analyses have developed on largely separate paths for the last 20 years. This is creating an impediment to geospatial researchers seeking to utilize big data platforms that do not promote heterogeneous data types. Second, big spatial data repositories have yet to be integrated with big data computation platforms in ways that allow researchers to spatio-temporally analyze big geospatial datasets. IPUMS-Terra, a National Science Foundation cyberInfrastructure project, addresses these challenges by providing a unified framework of integrated geospatial services which access, analyze, and transform big heterogeneous spatio-temporal data. As IPUMS-Terra's data volume grows, we seek to integrate geospatial platforms that will scale geospatial analyses and address current bottlenecks within our system. However, our work shows that there are still unresolved challenges for big geospatial analysis. The most pertinent is that there is a lack of a unified framework for conducting scalable integrated vector and raster data analysis. We conducted a comparative analysis between PostgreSQL with PostGIS and SciDB and concluded that SciDB is the superior platform for scalable raster zonal analyses.

Original language: English (US)
Pages (from-to): 546-559
Number of pages: 14
Journal: Transactions in GIS
Volume: 21
Issue number: 3
DOI: 10.1111/tgis.12286
State: Published - Jun 2017

Fingerprint

raster
services
repository
spatial data
transform
analysis
science

Cite this

Terra Populus’ architecture for integrated big geospatial services. / Haynes, David; Manson, Steve; Shook, Eric.

In: Transactions in GIS, Vol. 21, No. 3, 06.2017, p. 546-559.

@article{c2c7a9187ce443c2a1462f629b7c8548,
title = "Terra Populus’ architecture for integrated big geospatial services",
abstract = "Big geospatial data is an emerging sub-area of geographic information science, big data, and cyberinfrastructure. Big geospatial data poses two unique challenges. First, raster and vector data structures and analyses have developed on largely separate paths for the last 20 years. This is creating an impediment to geospatial researchers seeking to utilize big data platforms that do not promote heterogeneous data types. Second, big spatial data repositories have yet to be integrated with big data computation platforms in ways that allow researchers to spatio-temporally analyze big geospatial datasets. IPUMS-Terra, a National Science Foundation cyberInfrastructure project, addresses these challenges by providing a unified framework of integrated geospatial services which access, analyze, and transform big heterogeneous spatio-temporal data. As IPUMS-Terra's data volume grows, we seek to integrate geospatial platforms that will scale geospatial analyses and address current bottlenecks within our system. However, our work shows that there are still unresolved challenges for big geospatial analysis. The most pertinent is that there is a lack of a unified framework for conducting scalable integrated vector and raster data analysis. We conducted a comparative analysis between PostgreSQL with PostGIS and SciDB and concluded that SciDB is the superior platform for scalable raster zonal analyses.",
author = "David Haynes and Steve Manson and Eric Shook",
year = "2017",
month = "6",
doi = "10.1111/tgis.12286",
language = "English (US)",
volume = "21",
pages = "546--559",
journal = "Transactions in GIS",
issn = "1361-1682",
publisher = "Wiley-Blackwell",
number = "3",

}

TY - JOUR

T1 - Terra Populus’ architecture for integrated big geospatial services

AU - Haynes, David

AU - Manson, Steve

AU - Shook, Eric

PY - 2017/6

Y1 - 2017/6

AB - Big geospatial data is an emerging sub-area of geographic information science, big data, and cyberinfrastructure. Big geospatial data poses two unique challenges. First, raster and vector data structures and analyses have developed on largely separate paths for the last 20 years. This is creating an impediment to geospatial researchers seeking to utilize big data platforms that do not promote heterogeneous data types. Second, big spatial data repositories have yet to be integrated with big data computation platforms in ways that allow researchers to spatio-temporally analyze big geospatial datasets. IPUMS-Terra, a National Science Foundation cyberInfrastructure project, addresses these challenges by providing a unified framework of integrated geospatial services which access, analyze, and transform big heterogeneous spatio-temporal data. As IPUMS-Terra's data volume grows, we seek to integrate geospatial platforms that will scale geospatial analyses and address current bottlenecks within our system. However, our work shows that there are still unresolved challenges for big geospatial analysis. The most pertinent is that there is a lack of a unified framework for conducting scalable integrated vector and raster data analysis. We conducted a comparative analysis between PostgreSQL with PostGIS and SciDB and concluded that SciDB is the superior platform for scalable raster zonal analyses.

UR - http://www.scopus.com/inward/record.url?scp=85021111617&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85021111617&partnerID=8YFLogxK

U2 - 10.1111/tgis.12286

DO - 10.1111/tgis.12286

M3 - Article

AN - SCOPUS:85021111617

VL - 21

SP - 546

EP - 559

JO - Transactions in GIS

JF - Transactions in GIS

SN - 1361-1682

IS - 3

ER -