Sphinx: Distributed execution of interactive SQL queries on big spatial data

Ahmed Eldawy, Mostafa Elganainy, Ammar Bakeer, Ahmed Abdelmotaleb, Mohamed F Mokbel

Research output: Chapter in Book/Report/Conference proceedingConference contribution

13 Scopus citations

Abstract

This paper presents Sphinx, a full-fledged distributed system which uses a standard SQL interface to process big spatial data. Sphinx adds spatial data types, indexes and query processing, inside the code-base of Cloudera Impala for efficient processing of spatial data. In particular, Sphinx is composed of four main components, namely, query parser, indexer, query planner, and query executor. The query parser injects spatial data types and functions in the SQL interface of Sphinx. The indexer creates spatial indexes in Sphinx by adopting a two-layered index design. The query planner utilizes these indexes to construct efficient query plans for range query and spatial join operations. Finally, the query executor carries out these plans on big spatial datasets in a distributed cluster. A system prototype of Sphinx running on real datasets shows up-to three orders of magnitude performance improvement over traditional Impala.

Original languageEnglish (US)
Title of host publication23rd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2015
EditorsYan Huang, Mohamed Ali, Jagan Sankaranarayanan, Matthias Renz, Michael Gertz
PublisherAssociation for Computing Machinery
ISBN (Electronic)9781450339674
DOIs
StatePublished - Nov 3 2015
Event23rd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2015 - Seattle, United States
Duration: Nov 3 2015Nov 6 2015

Publication series

NameGIS: Proceedings of the ACM International Symposium on Advances in Geographic Information Systems
Volume03-06-November-2015

Other

Other23rd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2015
Country/TerritoryUnited States
CitySeattle
Period11/3/1511/6/15

Bibliographical note

Publisher Copyright:
© 2015 ACM.

Keywords

  • Impala
  • Range query
  • SQL
  • Spatial
  • Spatial join
  • Sphinx

Fingerprint

Dive into the research topics of 'Sphinx: Distributed execution of interactive SQL queries on big spatial data'. Together they form a unique fingerprint.

Cite this