Open access

Dictionary Based Cache Line Compression

Authors:

Daniel Waddington,

Moshik Hershcovitch

HotStorage '24: Proceedings of the 16th ACM Workshop on Hot Topics in Storage and File Systems

Pages 8 - 14

https://doi.org/10.1145/3655038.3665941

Published: 08 July 2024

Abstract

Active-standby mechanisms for VM high availability demand frequent synchronization of memory and CPU state, which involves identifying "dirty" memory pages and transferring them to a standby target. Building on the finer granularity offered by CXL-enabled memory devices, as discussed by Waddington et al. [21], this paper proposes a dictionary-based compression method that operates on 64-byte cache lines to minimize snapshot volume and synchronization latency. The method aims to transmit only the information required to reconstruct the memory state at the standby machine, and is augmented by byte grouping and cache-line partitioning techniques. We assess the compression benefits on memory access patterns across 20 benchmark snapshots and compare our approach to standard off-the-shelf compression methods. Our findings reveal significant improvements across nearly all benchmarks: some show more than a twofold improvement over standard compression, while others show more moderate gains. We conduct an in-depth experimental analysis of the contribution of each technique and examine the nature of the benchmarks. We find that the repetition of cache lines across snapshots (caused by transient memory changes) and their concise representation contribute most to the size reduction, accounting for 92% of the gains. Our work paves the way for further reduction in the data transferred to standby machines, thereby enhancing VM high availability and reducing synchronization latency.
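The abstract does not give the encoding itself, so the following is only a minimal sketch of the general idea, under stated assumptions: both sides keep a dictionary of 64-byte lines that grows in the same order, a line seen in an earlier snapshot is sent as a short index, and a new line is sent raw. The names `encode`, `decode`, and `group_bytes`, the token format, and the 8-byte word width are all hypothetical and are not taken from the paper.

```python
LINE = 64  # cache-line size in bytes, as stated in the abstract


def encode(lines, dictionary):
    """Encode dirty cache lines against the sender's dictionary.

    Emits ("ref", index) for a line already in the dictionary and
    ("raw", line) for a new line, which is then added to the dictionary.
    The token format is an assumption made for this sketch.
    """
    tokens = []
    for line in lines:
        assert len(line) == LINE
        idx = dictionary.get(line)
        if idx is not None:
            tokens.append(("ref", idx))
        else:
            dictionary[line] = len(dictionary)
            tokens.append(("raw", line))
    return tokens


def decode(tokens, table):
    """Rebuild the lines on the standby side.

    `table` is the standby's dictionary in list form; raw lines are
    appended in arrival order so indices match the sender's.
    """
    out = []
    for kind, value in tokens:
        if kind == "raw":
            table.append(value)
            out.append(value)
        else:
            out.append(table[value])
    return out


def group_bytes(line, width=8):
    """One possible byte grouping: gather byte k of every `width`-byte word.

    This places bytes of equal significance next to each other, which
    often helps a general-purpose compressor applied afterwards.
    """
    return b"".join(line[k::width] for k in range(width))
```

In this sketch a line that reappears in a later snapshot costs only an index rather than 64 bytes, which mirrors the abstract's observation that repeated cache lines across snapshots account for most of the gains. A real implementation would also need to bound the dictionary size and choose an index width.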

References

[1]

2019. Zerto: Architecture Guide for the IT Resilience Platform.

[2]

Alexandru Agache, Marc Brooker, Alexandra Iordache, Anthony Liguori, Rolf Neugebauer, Phil Piwonka, and Diana-Maria Popa. 2020. Firecracker: Lightweight virtualization for serverless applications. In 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI 20). 419--434.

[3]

Raja Wasim Ahmad, Abdullah Gani, Siti Hafizah Ab. Hamid, Muhammad Shiraz, Feng Xia, and Sajjad A. Madani. 2015. Virtual Machine Migration in Cloud Data Centers: A Review, Taxonomy, and Open Research Issues. J. Supercomput. 71, 7 (jul 2015), 2473--2515. https://doi.org/10.1007/s11227-015-1400-5


[4]

Alaa Alameldeen and David Wood. 2004. Frequent pattern compression: A significance-based compression scheme for L2 caches. Technical Report. University of Wisconsin-Madison Department of Computer Sciences.

[5]

Angelos Arelakis and Per Stenstrom. 2014. SC2: A statistical compression cache scheme. ACM SIGARCH Computer Architecture News 42, 3 (2014), 145--156.


[6]

Luca Benini, Davide Bruni, Alberto Macii, and Enrico Macii. 2004. Memory energy minimization by data compression: algorithms, architectures and implementation. IEEE Transactions on Very Large Scale Integration (VLSI) Systems 12, 3 (2004), 255--268.


[7]

Daniel Cohen, Sarel Cohen, Dalit Naor, Daniel Waddington, and Moshik Hershcovitch. 2023. Cache Line Deltas Compression. In Proceedings of the 16th ACM International Conference on Systems and Storage. 145--145.


[8]

Brendan Cully, Geoffrey Lefebvre, Dutch Meyer, Mike Feeley, Norm Hutchinson, and Andrew Warfield. 2008. Remus: High Availability via Asynchronous Virtual Machine Replication. In Proceedings of the 5th USENIX Symposium on Networked Systems Design and Implementation (San Francisco, California) (NSDI'08). USENIX Association, USA, 161--174.


[9]

Stuart Hacking and Benoît Hudzia. 2009. Improving the Live Migration Process of Large Enterprise Applications. In Proceedings of the 3rd International Workshop on Virtualization Technologies in Distributed Computing (Barcelona, Spain) (VTDC '09). Association for Computing Machinery, New York, NY, USA, 51--58. https://doi.org/10.1145/1555336.1555346


[10]

Muyang He, Shaoning Pang, Denis Lavrov, Ding Lu, Yuan Zhang, and Abdolhossein Sarrafzadeh. 2016. Reverse Replication of Virtual Machines (rRVM) for Low Latency and High Availability Services. In Proceedings of the 9th International Conference on Utility and Cloud Computing (Shanghai, China) (UCC '16). Association for Computing Machinery, New York, NY, USA, 118--127. https://doi.org/10.1145/2996890.2996894


[11]

Moshik Hershcovitch, Leshem Choshen, Andrew Wood, Ilias Enmouri, Peter Chin, Swaminathan Sundararaman, and Danny Harnik. 2024. Lossless and Near-Lossless Compression for Foundation Models. arXiv preprint arXiv:2404.15198 (2024).

[12]

Hai Jin, Li Deng, Song Wu, Xuanhua Shi, and Xiaodong Pan. 2009. Live virtual machine migration with adaptive, memory compression. In 2009 IEEE International Conference on Cluster Computing and Workshops. 1--10. https://doi.org/10.1109/CLUSTR.2009.5289170

[13]

Morten Kjelso, Mark Gooch, and Simon Jones. 1996. Design and performance of a main memory hardware data compressor. In Proceedings of EUROMICRO 96. 22nd Euromicro Conference. Beyond 2000: Hardware and Software Design Strategies. IEEE, 423--430.

[14]

Sparsh Mittal and Jeffrey S Vetter. 2015. A survey of architectural approaches for data compression in cache and main memory systems. IEEE Transactions on Parallel and Distributed Systems 27, 5 (2015), 1524--1536.


[15]

Fereydoun Farrahi Moghaddam and Mohamed Cheriet. 2010. Decreasing live virtual machine migration down-time using a memory page selection based on memory change PDF. In 2010 International Conference on Networking, Sensing and Control (ICNSC). 355--359. https://doi.org/10.1109/ICNSC.2010.5461517

[16]

Guangyong Piao, Youngsup Oh, Baegjae Sung, and Chanik Park. 2014. Efficient Pre-copy Live Migration with Memory Compaction and Adaptive VM Downtime Control. In 2014 IEEE Fourth International Conference on Big Data and Cloud Computing. 85--90. https://doi.org/10.1109/BDCloud.2014.57


[17]

Janis Schoetterl-Glausch. 2016. Intel page modification logging for lightweight continuous checkpointing. Bachelor thesis, Operating Systems Group, Karlsruhe Institute of Technology (KIT), Germany, October 31 (2016).

[18]

Yoshiaki Tamura, Koji Sato, Seiji Kihara, and Satoshi Moriai. 2008. Kemari: virtual machine synchronization for fault tolerance. (01 2008).

[19]

Yingying Tian, Samira M Khan, Daniel A Jiménez, and Gabriel H Loh. 2014. Last-level cache deduplication. In Proceedings of the 28th ACM international conference on Supercomputing. 53--62.


[20]

Franco Travostino, Paul Daspit, Leon Gommans, Chetan Jog, Cees de Laat, Joe Mambretti, Inder Monga, Bas van Oudenaarde, Satish Raghunath, and Phil Yonghui Wang. 2006. Seamless live migration of virtual machines over the MAN/WAN. Future Generation Computer Systems 22, 8 (2006), 901--907. https://doi.org/10.1016/j.future.2006.03.007


[21]

Daniel Waddington, Moshik Hershcovitch, Swaminathan Sundararaman, and Clem Dickey. 2022. A case for using cache line deltas for high frequency VM snapshotting. In Proceedings of the 13th Symposium on Cloud Computing (SoCC '22). Association for Computing Machinery, 526--539. https://doi.org/10.1145/3542929.3563481


[22]

Fei Zhang, Guangming Liu, Xiaoming Fu, and Ramin Yahyapour. 2018. A Survey on Virtual Machine Migration: Challenges, Techniques, and Open Issues. IEEE Communications Surveys Tutorials 20, 2 (2018), 1206--1243. https://doi.org/10.1109/COMST.2018.2794881

[23]

Youtao Zhang, Jun Yang, and Rajiv Gupta. 2000. Frequent value locality and value-centric data cache design. ACM SIGARCH Computer Architecture News 28, 5 (2000), 150--159.



Published In

HotStorage '24: Proceedings of the 16th ACM Workshop on Hot Topics in Storage and File Systems

July 2024

141 pages

ISBN:9798400706301

DOI:10.1145/3655038

Copyright © 2024 Owner/Author.

This work is licensed under a Creative Commons Attribution International 4.0 License.

Sponsors

SIGOPS: ACM Special Interest Group on Operating Systems

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article
Research
Refereed limited

Conference

HOTSTORAGE '24

Sponsor:

SIGOPS

HOTSTORAGE '24: 16th ACM Workshop on Hot Topics in Storage and File Systems

July 8 - 9, 2024

Santa Clara, CA, USA

Acceptance Rates

Overall Acceptance Rate 34 of 87 submissions, 39%
