Abstract
Compared with a hash table, a Bloom Filter (BF) is more space-efficient for supporting fast matching though resulting in a controllable and acceptable false positive probability. The space size of the basic BF is predetermined based on the expected number of elements to be stored. However, we cannot predict the scale of a BF space for dynamic sets. The two existing solutions for supporting dynamic sets, Scalable BF (SBF) and Dynamic BF (DBF), still face some challenges on system performance and memory overhead.This paper presents a new BF for dynamic data sets, called Partitioned BF (Par-BF). Compared with DBF and SBF, the size and the range of the false positive probability can be calculated by a group of formulas to leverage a sweet spot between high-performance and low-overhead. Moreover, Par-BF supports parallel fast matching which can improve the overall throughput. From our trace-driven experimental results, the IOPS of Par-BF outperforms that of DBF and SBF from 6X to 10X, and from 2X to 4X, respectively. Meanwhile, through our proposed garbage collection policy, the memory overhead of Par-BF is less than half of the memory usage of SBF.
Original language | English (US) |
---|---|
Title of host publication | Proceedings of DISCS 2014 |
Subtitle of host publication | The 2014 International Workshop on Data-Intensive Scalable Computing Systems - Held in Conjuction with SC 2014: The International Conference for High Performance Computing, Networking, Storage and Analysis |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 1-8 |
Number of pages | 8 |
ISBN (Electronic) | 9781479970384 |
DOIs | |
State | Published - Apr 2 2014 |
Event | 2014 International Workshop on Data-Intensive Scalable Computing Systems, DISCS 2014 - Held in Conjuction with the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2014 - New Orleans, United States Duration: Nov 16 2014 → … |
Publication series
Name | Proceedings of DISCS 2014: The 2014 International Workshop on Data-Intensive Scalable Computing Systems - Held in Conjuction with SC 2014: The International Conference for High Performance Computing, Networking, Storage and Analysis |
---|
Other
Other | 2014 International Workshop on Data-Intensive Scalable Computing Systems, DISCS 2014 - Held in Conjuction with the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2014 |
---|---|
Country/Territory | United States |
City | New Orleans |
Period | 11/16/14 → … |
Bibliographical note
Publisher Copyright:© 2014 IEEE.
Keywords
- Bloom Filter
- Par-BF
- Parallel
- dynamic sets