Raven: Belady-Guided, Predictive (Deep) Learning for In-Memory and Content Caching

Xinyue Hu, Eman Ramadan, Wei Ye, Feng Tian, Zhi Li Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

8 Scopus citations

Abstract

Performance of caching algorithms not only determines the quality of experience for users, but also affects the operating and capital expenditures for cloud service providers. Today's production systems rely on heuristics such as LRU (least recently used) and its variants, which work well for certain types of workloads, and cannot effectively cope with diverse and time-varying workload characteristics. While learning-based caching algorithms have been proposed to deal with these challenges, they still impose assumptions about workload characteristics and often suffer poor generalizability. In this paper, we propose Raven, a general learning-based caching framework that leverages the insights from the offline optimal Belady algorithm for both in-memory and content caching. Raven learns the distributions of objects' next-request arrival times without any prior assumptions by employing Mixture Density Network (MDN)-based universal distribution estimation. It utilizes the estimated distributions to compute the probability of an object that arrives farthest than any other objects in the cache and evicts the one with the largest such probability, regulated by the sizes of objects if appropriate. Raven (probabilistically) approximates Belady by explicitly accounting for the stochastic, time-varying, and non-stationary nature of object arrival processes. Evaluation results on production workloads demonstrate that, compared with the best existing caching algorithms, Raven improves the object hit ratio and byte hit ratio by up to 7.3% and 7.1%, respectively, reduces the average access latency by up to 17.9% and the traffic to the origin servers by up to 18.8%.

Original languageEnglish (US)
Title of host publicationCoNEXT 2022 - Proceedings of the 18th International Conference on emerging Networking EXperiments and Technologies
PublisherAssociation for Computing Machinery, Inc
Pages72-90
Number of pages19
ISBN (Electronic)9781450395083
DOIs
StatePublished - Nov 30 2022
Event18th ACM Conference on Emerging Networking Experiment and Technologies, CoNEXT 2022 - Rome, Italy
Duration: Dec 6 2022Dec 9 2022

Publication series

NameCoNEXT 2022 - Proceedings of the 18th International Conference on emerging Networking EXperiments and Technologies

Conference

Conference18th ACM Conference on Emerging Networking Experiment and Technologies, CoNEXT 2022
Country/TerritoryItaly
CityRome
Period12/6/2212/9/22

Bibliographical note

Funding Information:
We would like to thank our anonymous reviewers for their valuable feedback and our shepherd Nikolaos Laoutaris for his insightful comments and guidance. This research was in part supported by NSF under Grants CNS-1814322, CNS-1836772, CNS-1901103, CCF-2123987 and CNS-2106771.

Publisher Copyright:
© 2022 ACM.

Fingerprint

Dive into the research topics of 'Raven: Belady-Guided, Predictive (Deep) Learning for In-Memory and Content Caching'. Together they form a unique fingerprint.

Cite this