TY - GEN
T1 - Demonstration of kite
T2 - 33rd IEEE International Conference on Data Engineering, ICDE 2017
AU - Magdy, Amr
AU - Mokbel, Mohamed F
PY - 2017/5/16
Y1 - 2017/5/16
N2 - Motivated by its wide availability and richness, there have been a plethora of recent work in querying, analyzing, and visualizing microblogs (see [3] for a brief survey). Examples of microblogs include tweets, online reviews, and comments on news websites. Unfortunately, existing work in microblog lacks data management tools that provide the necessary infrastructure to support efficient storage, indexing, and retrieval of microblogs. Hence, researchers, developers, and practitioners who need to process microblogs for their own purposes would need to either build their own ad-hoc techniques [5] or use any of existing general purpose big data engines, e.g., Spark, as their backbone [4]. Relying on ad-hoc techniques does not scale for large data sizes. Meanwhile, existing general purpose big data engines are built in a generic way to support various query workloads. Thus, they are not equipped to support the characteristics of microblogs [2], and so they are missing necessary infrastructure like supporting the real-Time indexing and promoting temporal, spatial, and ranking queries. This results in sub par performance when supporting microblogs.
AB - Motivated by its wide availability and richness, there have been a plethora of recent work in querying, analyzing, and visualizing microblogs (see [3] for a brief survey). Examples of microblogs include tweets, online reviews, and comments on news websites. Unfortunately, existing work in microblog lacks data management tools that provide the necessary infrastructure to support efficient storage, indexing, and retrieval of microblogs. Hence, researchers, developers, and practitioners who need to process microblogs for their own purposes would need to either build their own ad-hoc techniques [5] or use any of existing general purpose big data engines, e.g., Spark, as their backbone [4]. Relying on ad-hoc techniques does not scale for large data sizes. Meanwhile, existing general purpose big data engines are built in a generic way to support various query workloads. Thus, they are not equipped to support the characteristics of microblogs [2], and so they are missing necessary infrastructure like supporting the real-Time indexing and promoting temporal, spatial, and ranking queries. This results in sub par performance when supporting microblogs.
UR - http://www.scopus.com/inward/record.url?scp=85021186705&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85021186705&partnerID=8YFLogxK
U2 - 10.1109/ICDE.2017.187
DO - 10.1109/ICDE.2017.187
M3 - Conference contribution
AN - SCOPUS:85021186705
T3 - Proceedings - International Conference on Data Engineering
SP - 1383
EP - 1384
BT - Proceedings - 2017 IEEE 33rd International Conference on Data Engineering, ICDE 2017
PB - IEEE Computer Society
Y2 - 19 April 2017 through 22 April 2017
ER -