Abstract
This paper advocates for the need to build a Microblogs Data Management System (MDMS) as an end-to-end data management system to support indexing, querying, and analyzing microblogs, e.g., Tweets, comments, or check-in's. We identify a set of characteristics for microblogging environments that are distinguishing from any other data management environment. Then, we propose a system architecture for the first Microblogs Data Management System, which includes indexing, querying, and recovery components. The indexing component is responsible for indexing recent data in memory, indexing older data in disk, and synchronizing the flow of data from memory to disk without affecting the query response time. The querying component is responsible for retrieving the query answer from both memory and disk storage as well as employing online selectivity estimation techniques tuned to the behavior of microblogs data. The recovery module allows for efficiently storing and processing incoming microblogs in memory without worrying about data loss.
Original language | English (US) |
---|---|
Title of host publication | Proceedings - 2015 IEEE 16th International Conference on Mobile Data Management, MDM 2015 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 271-278 |
Number of pages | 8 |
ISBN (Electronic) | 9781479999729 |
DOIs | |
State | Published - Sep 11 2015 |
Event | 16th IEEE International Conference on Mobile Data Management, MDM 2015 - Pittsburgh, United States Duration: Jun 15 2015 → Jun 18 2015 |
Publication series
Name | Proceedings - IEEE International Conference on Mobile Data Management |
---|---|
Volume | 1 |
ISSN (Print) | 1551-6245 |
Other
Other | 16th IEEE International Conference on Mobile Data Management, MDM 2015 |
---|---|
Country/Territory | United States |
City | Pittsburgh |
Period | 6/15/15 → 6/18/15 |
Bibliographical note
Publisher Copyright:© 2015 IEEE.
Keywords
- Big Data
- Data Management Systems
- Data Stream
- Indexing
- Keyword
- Microblogs
- Query Processing
- Spatial
- Temporal
- Top-k