Sphinx: Empowering impala for efficient execution of SQL queries on big spatial data

Ahmed Eldawy, Ibrahim Sabek, Mostafa Elganainy, Ammar Bakeer, Ahmed Abdelmotaleb, Mohamed F Mokbel

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Scopus citations

Abstract

This paper presents Sphinx, a full-fledged open-source system for big spatial data which overcomes the limitations of existing systems by adopting a standard SQL interface, and by providing a high efficient core built inside the core of the Apache Impala system. Sphinx is composed of four main layers, namely, query parser, indexer, query planner, and query executor. The query parser injects spatial data types and functions in the SQL interface of Sphinx. The indexer creates spatial indexes in Sphinx by adopting a two-layered index design. The query planner utilizes these indexes to construct efficient query plans for range query and spatial join operations. Finally, the query executor carries out these plans on big spatial datasets in a distributed cluster. A system prototype of Sphinx running on real datasets shows up-to three orders of magnitude performance improvement over plain-vanilla Impala, SpatialHadoop, and PostGIS.

Original languageEnglish (US)
Title of host publicationAdvances in Spatial and Temporal Databases - 15th International Symposium, SSTD 2017, Proceedings
EditorsWei-Shinn Ku, Agnes Voisard, Haiquan Chen, Chang-Tien Lu, Siva Ravada, Matthias Renz, Yan Huang, Michael Gertz, Liang Tang, Chengyang Zhang, Erik Hoel, Xiaofang Zhou
PublisherSpringer Verlag
Pages65-83
Number of pages19
ISBN (Print)9783319643663
DOIs
StatePublished - Jan 1 2017
Event15th International Symposium on Spatial and Temporal Databases, SSTD 2017 - Arlington, United States
Duration: Aug 21 2017Aug 23 2017

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10411 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other15th International Symposium on Spatial and Temporal Databases, SSTD 2017
CountryUnited States
CityArlington
Period8/21/178/23/17

Fingerprint Dive into the research topics of 'Sphinx: Empowering impala for efficient execution of SQL queries on big spatial data'. Together they form a unique fingerprint.

Cite this