skip to main content
10.1145/3655038.3665941acmconferencesArticle/Chapter ViewAbstractPublication PageshotstorageConference Proceedingsconference-collections
research-article
Open access

Dictionary Based Cache Line Compression

Published: 08 July 2024 Publication History

Abstract

Active-standby mechanisms for VM high-availability demand frequent synchronization of memory and CPU state, involving the identification and transfer of "dirty" memory pages to a standby target. Building upon the granularity offered by CXL-enabled memory devices, as discussed by Waddington et al. [21], this paper proposes a dictionary-based compression method operating on 64-byte cache lines to minimize snapshot volume and synchronization latency. The method aims to transmit only necessary information required to reconstruct the memory state at the standby machine, augmented by byte grouping and cache-line partitioning techniques. We assess the compression benefits on memory access patterns across 20 benchmarks snapshots and compare our approach to standard off-the-shelf compression methods. Our findings reveal significant improvements across nearly all benchmarks, with some experiencing over a twofold enhancement compared to standard compression, while others show more moderate gains. We conduct an in-depth experimental analysis on the contribution of each method and examine the nature of the benchmarks. We ascertain that the repeating nature of cache lines across snapshots (caused by transient memory changes) and their concise representation contributes most to the size reduction, accounting for 92% of the gains. Our work paves the way for further reduction in the data transferred to standby machines, thereby enhancing VM high-availability and reducing synchronization latency.

References

[1]
2019. Zert0: Architecture Guide for the IT Resilience Platform.
[2]
Alexandru Agache, Marc Brooker, Alexandra Iordache, Anthony Liguori, Rolf Neugebauer, Phil Piwonka, and Diana-Maria Popa. 2020. Firecracker: Lightweight virtualization for serverless applications. In 17th {usenix} symposium on networked systems design and implementation ({nsdi} 20). 419--434.
[3]
Raja Wasim Ahmad, Abdullah Gani, Siti Hafizah Ab. Hamid, Muhammad Shiraz, Feng Xia, and Sajjad A. Madani. 2015. Virtual Machine Migration in Cloud Data Centers: A Review, Taxonomy, and Open Research Issues. J. Supercomput. 71, 7 (jul 2015), 2473--2515. https://doi.org/10.1007/s11227-015-1400-5
[4]
Alaa Alameldeen and David Wood. 2004. Frequent pattern compression: A significance-based compression scheme for L2 caches. Technical Report. University of Wisconsin-Madison Department of Computer Sciences.
[5]
Angelos Arelakis and Per Stenstrom. 2014. SC2: A statistical compression cache scheme. ACM SIGARCH Computer Architecture News 42, 3 (2014), 145--156.
[6]
Luca Benini, Davide Bruni, Alberto Macii, and Enrico Macii. 2004. Memory energy minimization by data compression: algorithms, architectures and implementation. IEEE Transactions on Very Large Scale Integration (VLSI) Systems 12, 3 (2004), 255--268.
[7]
Daniel Cohen, Sarel Cohen, Dalit Naor, Daniel Waddington, and Moshik Hershcovitch. 2023. Cache Line Deltas Compression. In Proceedings of the 16th ACM International Conference on Systems and Storage. 145--145.
[8]
Brendan Cully, Geoffrey Lefebvre, Dutch Meyer, Mike Feeley, Norm Hutchinson, and Andrew Warfield. 2008. Remus: High Availability via Asynchronous Virtual Machine Replication. In Proceedings of the 5th USENIX Symposium on Networked Systems Design and Implementation (San Francisco, California) (NSDI'08). USENIX Association, USA, 161--174.
[9]
Stuart Hacking and Benoît Hudzia. 2009. Improving the Live Migration Process of Large Enterprise Applications. In Proceedings of the 3rd International Workshop on Virtualization Technologies in Distributed Computing (Barcelona, Spain) (VTDC '09). Association for Computing Machinery, New York, NY, USA, 51--58. https://doi.org/10.1145/1555336.1555346
[10]
Muyang He, Shaoning Pang, Denis Lavrov, Ding Lu, Yuan Zhang, and Abdolhossein Sarrafzadeh. 2016. Reverse Replication of Virtual Machines (rRVM) for Low Latency and High Availability Services. In Proceedings of the 9th International Conference on Utility and Cloud Computing (Shanghai, China) (UCC '16). Association for Computing Machinery, New York, NY, USA, 118--127. https://doi.org/10.1145/2996890.2996894
[11]
Moshik Hershcovitch, Leshem Choshen, Andrew Wood, Ilias Enmouri, Peter Chin, Swaminathan Sundararaman, and Danny Harnik. 2024. Lossless and Near-Lossless Compression for Foundation Models. arXiv preprint arXiv:2404.15198 (2024).
[12]
Hai Jin, Li Deng, Song Wu, Xuanhua Shi, and Xiaodong Pan. 2009. Live virtual machine migration with adaptive, memory compression. In 2009 IEEE International Conference on Cluster Computing and Workshops. 1--10. https://doi.org/10.1109/CLUSTR.2009.5289170
[13]
Morten Kjelso, Mark Gooch, and Simon Jones. 1996. Design and performance of a main memory hardware data compressor. In Proceedings of EUROMICRO 96. 22nd Euromicro Conference. Beyond 2000: Hardware and Software Design Strategies. IEEE, 423--430.
[14]
Sparsh Mittal and Jeffrey S Vetter. 2015. A survey of architectural approaches for data compression in cache and main memory systems. IEEE Transactions on Parallel and Distributed Systems 27, 5 (2015), 1524--1536.
[15]
Fereydoun Farrahi Moghaddam and Mohamed Cheriet. 2010. Decreasing live virtual machine migration down-time using a memory page selection based on memory change PDF. In 2010 International Conference on Networking, Sensing and Control (ICNSC). 355--359. https://doi.org/10.1109/ICNSC.2010.5461517
[16]
Guangyong Piao, Youngsup Oh, Baegjae Sung, and Chanik Park. 2014. Efficient Pre-copy Live Migration with Memory Compaction and Adaptive VM Downtime Control. In 2014 IEEE Fourth International Conference on Big Data and Cloud Computing. 85--90. https://doi.org/10.1109/BDCloud.2014.57
[17]
Janis Schoetterl-Glausch. 2016. Intel page modification logging for lightweight continuous checkpointing. Bachelor thesis, Operating Systems Group, Karlsruhe Institute of Technology (KIT), Germany, October31 (2016).
[18]
Yoshiaki Tamura, Koji Sato, Seiji Kihara, and Satoshi Moriai. 2008. Kemari: virtual machine synchronization for fault tolerance. (01 2008).
[19]
Yingying Tian, Samira M Khan, Daniel A Jiménez, and Gabriel H Loh. 2014. Last-level cache deduplication. In Proceedings of the 28th ACM international conference on Supercomputing. 53--62.
[20]
Franco Travostino, Paul Daspit, Leon Gommans, Chetan Jog, Cees de Laat, Joe Mambretti, Inder Monga, Bas van Oudenaarde, Satish Raghunath, and Phil Yonghui Wang. 2006. Seamless live migration of virtual machines over the MAN/WAN. Future Generation Computer Systems 22, 8 (2006), 901--907. https://doi.org/10.1016/j.future.2006.03.007
[21]
Daniel Waddington, Moshik Hershcovitch, Swaminathan Sundararaman, and Clem Dickey. 2022. A case for using cache line deltas for high frequency VM snapshotting. In Proceedings of the 13th Symposium on Cloud Computing (SoCC '22). Association for Computing Machinery, 526--539. https://doi.org/10.1145/3542929.3563481
[22]
Fei Zhang, Guangming Liu, Xiaoming Fu, and Ramin Yahyapour. 2018. A Survey on Virtual Machine Migration: Challenges, Techniques, and Open Issues. IEEE Communications Surveys Tutorials 20, 2 (2018), 1206--1243. https://doi.org/10.1109/COMST.2018.2794881
[23]
Youtao Zhang, Jun Yang, and Rajiv Gupta. 2000. Frequent value locality and value-centric data cache design. ACM SIGARCH Computer Architecture News 28, 5 (2000), 150--159.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
HotStorage '24: Proceedings of the 16th ACM Workshop on Hot Topics in Storage and File Systems
July 2024
141 pages
ISBN:9798400706301
DOI:10.1145/3655038
This work is licensed under a Creative Commons Attribution International 4.0 License.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 July 2024

Check for updates

Author Tags

  1. CXL
  2. Compression
  3. VM snapshots
  4. performance

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

HOTSTORAGE '24
Sponsor:

Acceptance Rates

Overall Acceptance Rate 34 of 87 submissions, 39%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 246
    Total Downloads
  • Downloads (Last 12 months)246
  • Downloads (Last 6 weeks)44
Reflects downloads up to 06 Jan 2025

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media