skip to main content
10.1145/3663338.3663681acmconferencesArticle/Chapter ViewAbstractPublication PagespodcConference Proceedingsconference-collections
research-article
Open access

Invited Paper: DeepSLOs for the Computing Continuum

Published: 20 June 2024 Publication History

Abstract

The advent of the computing continuum, i.e., the blending of all existing computational tiers, calls for novel techniques and methods that consider its complex dynamics. This work presents the DeepSLO as a novel design paradigm to define and structure Service Level Objectives (SLOs) for distributed computing continuum systems. Hence, when multiple stakeholders are involved, the DeepSLO allows them to plan the overarching behaviors of the system. Further, the techniques employed (Bayesian networks, Markov blanket, Active inference) provide autonomy and decentralization to each SLO while the DeepSLO hierarchy remains to account for objectives dependencies. Finally, DeepSLOs are represented graphically, as well as individual SLOs enabling a human interpretation of the system performance.

References

[1]
Hussain AlJahdali, Abdulaziz Albatli, Peter Garraghan, Paul Townend, Lydia Lau, and Jie Xu. 2014. Multi-tenancy in Cloud Computing. In 2014 IEEE 8th International Symposium on Service Oriented System Engineering. 344--351.
[2]
Ahmad Alzu'bi, Ala'a Alomar, Shahed Alkhaza'leh, Abdelrahman Abuarqoub, and Mohammad Hammoudeh. 2024. A Review of Privacy and Security of Edge Computing in Smart Healthcare Systems: Issues, Challenges, and Research Directions. Tsinghua Science and Technology 29, 4 (Aug. 2024), 1152--1180.
[3]
Himani Bajaj, Anjali Sharma, Deepshi Arora, Mayank Yadav, Devkant Sharma, and Prabhjot Singh Bajwa. 2024. Challenges in E-Waste Management. In Sustainable Management of Electronic Waste. John Wiley & Sons, Ltd, 201--220.
[4]
Pengfei Chen, Yong Qi, and Di Hou. 2019. CauseInfer: Automated End-to-End Performance Diagnosis with Hierarchical Causality Graph in Cloud Environment. IEEE Transactions on Services Computing 12, 2 (March 2019), 214--230.
[5]
Can Cui, Yunsheng Ma, Xu Cao, Wenqian Ye, and Ziran Wang. 2024. Drive As You Speak: Enabling Human-Like Interaction With Large Language Models in Autonomous Vehicles. 902--909.
[6]
Schahram Dustdar, Victor Casamayor Pujol, and Praveen Kumar Donta. 2023. On Distributed Computing Continuum Systems. IEEE Transactions on Knowledge and Data Engineering 35, 4 (April 2023), 4092--4105.
[7]
Karl Friston, Thomas FitzGerald, Francesco Rigoli, Philipp Schwartenbeck, John O'Doherty, and Giovanni Pezzulo. 2016. Active inference and learning. Neuro-science & Biobehavioral Reviews 68 (Sept. 2016), 862--879.
[8]
Karl Friston, James Kilner, and Lee Harrison. 2006. A free energy principle for the brain. Journal of Physiology Paris 100, 1-3 (July 2006), 70--87.
[9]
K. J. Friston, L. Harrison, and W. Penny. 2003. Dynamic causal modelling. NeuroImage 19, 4 (Aug. 2003), 1273--1302.
[10]
Shunkai Fu and Michel C. Desmarais. 2008. Fast Markov Blanket Discovery Algorithm Via Local Learning within Single Pass. In Advances in Artificial Intelligence. Springer, Berlin, Heidelberg, 96--107.
[11]
Noor Ul Huda, Ijaz Ahmed, Muhammad Adnan, Mansoor Ali, and Faisal Naeem. 2024. Experts and intelligent systems for smart homes' Transformation to Sustainable Smart Cities: A comprehensive review. Expert Systems with Applications 238 (March 2024), 122380.
[12]
Byeonghui Jeong, Seungyeon Baek, Sihyun Park, Jueun Jeon, and Young-Sik Jeong. 2023. Stable and efficient resource management using deep neural network on cloud computing. Neurocomputing 521 (Feb. 2023), 99--112.
[13]
Faria Kalim. 2020. Satisfying service level objectives in stream processing systems. Ph. D. Dissertation.
[14]
Qianlin Liang, Walid A. Hanafy, Ahmed Ali-Eldin, and Prashant Shenoy. 2023. Model-driven Cluster Resource Management for AI Workloads in Edge Clouds. ACM Transactions on Autonomous and Adaptive Systems 18, 1 (March 2023), 2:1--2:26.
[15]
Andrea Morichetta, Victor Casamayor Pujol, Stefan Nastic, Schahram Dustdar, Deepak Vij, Ying Xiong, and Zhaobo Zhang. 2023. PolarisProfiler: A Novel Metadata-Based Profiling Approach for Optimizing Resource Management in the Edge-Cloud Continnum. In 2023 IEEE International Conference on Service-Oriented System Engineering (SOSE). 27--36.
[16]
Andrea Morichetta, Thomas Pusztai, Deepak Vij, Víctor Casamayor Pujol, Philipp Raith, Ying Xiong, Stefan Nastic, Schahram Dustdar, and Zhaobo Zhang. 2023. Demystifying deep learning in predictive monitoring for cloud-native SLOs. In 2023 IEEE 16th International Conference on Cloud Computing (CLOUD). 1--11.
[17]
Michael R. Nelson. 2009. Building an Open Cloud. Science 324, 5935 (June 2009), 1656--1657.
[18]
Judea Pearl. 1988. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA.
[19]
Judea Pearl and Dana Mackenzie. 2018. The Book of Why: The New Science of Cause and Effect. Basic Books, Inc., USA.
[20]
Victor Casamayor Pujol, Andrea Morichetta, and Stefan Nastic. 2023. Intelligent Sampling: A Novel Approach to Optimize Workload Scheduling in Large-Scale Heterogeneous Computing Continuum. In 2023 IEEE International Conference on Service-Oriented System Engineering (SOSE). 140--149.
[21]
Haoran Qiu, Subho S. Banerjee, Saurabh Jha, Zbigniew T. Kalbarczyk, and Ravishankar K. Iyer. 2020. FIRM: an intelligent fine-grained resource management framework for SLO-oriented microservices. In Proceedings of the 14th USENIX Conference on Operating Systems Design and Implementation (OSDI'20). USENIX Association, USA, 805--825.
[22]
Noor Sajid, Philip J. Ball, Thomas Parr, and Karl J. Friston. 2021. Active Inference: Demystified and Compared. Neural Computation 33, 3 (March 2021), 674--712.
[23]
Mauro Scanagatta, Antonio Salmerón, and Fabio Stella. 2019. A survey on Bayesian network structure learning from data. Progress in Artificial Intelligence 8, 4 (Dec. 2019), 425--439.
[24]
Marco Scutari, Catharina Elisabeth Graafland, and José Manuel Gutiérrez. 2019. Who learns better Bayesian network structures: Accuracy and speed of structure learning algorithms. International Journal of Approximate Reasoning 115 (Dec. 2019), 235--253.
[25]
Boris Sedlak, Victor Casamayor Pujol, Praveen Kumar Donta, and Schahram Dustdar. 2023. Controlling Data Gravity and Data Friction: From Metrics to Multidimensional Elasticity Strategies. In 2023 IEEE International Conference on Software Services Engineering (SSE). 43--49.
[26]
Boris Sedlak, Victor Casamayor Pujol, Praveen Kumar Donta, and Schahram Dustdar. 2023. Designing Reconfigurable Intelligent Systems with Markov Blankets. In Service-Oriented Computing (Lecture Notes in Computer Science), Flavia Monti, Stefanie Rinderle-Ma, Antonio Ruiz Cortés, Zibin Zheng, and Massimo Mecella (Eds.). Springer Nature Switzerland, Cham, 42--50.
[27]
Boris Sedlak, Victor Casamayor Pujol, Praveen Kumar Donta, and Schahram Dustdar. 2023. Equilibrium in the Computing Continuum through Active Inference.
[28]
Hyowoon Seo, Jihong Park, Mehdi Bennis, and Mérouane Debbah. 2021. Semantics-Native Communication with Contextual Reasoning. (Aug. 2021).
[29]
Weisong Shi, Jie Cao, Quan Zhang, Youhuizi Li, and Lanyu Xu. 2016. Edge Computing: Vision and Challenges. IEEE Internet of Things Journal 3, 5 (Oct. 2016), 637--646.
[30]
Ryan Smith, Karl J Friston, and Christopher J Whyte. 2022. A step-by-step tutorial on active inference and its application to empirical data. Journal of Mathematical Psychology 107 (2022), 102632.
[31]
Ion Stoica and Scott Shenker. 2021. From cloud computing to sky computing. In Proceedings of the Workshop on Hot Topics in Operating Systems (HotOS '21). Association for Computing Machinery, New York, NY, USA, 26--32.
[32]
Ajaya K Swain and Valeria R Garza. 2023. Key factors in achieving Service Level Agreements (SLA) for Information Technology (IT) incident resolution. Information Systems Frontiers 25, 2 (2023), 819--834.
[33]
Ming Tang and Vincent W.S. Wong. 2022. Deep Reinforcement Learning for Task Offloading in Mobile Edge Computing Systems. IEEE Transactions on Mobile Computing 21, 6 (June 2022), 1985--1997.
[34]
László Toka, Gergely Dobreff, Balázs Fodor, and Balázs Sonkoly. 2021. Machine Learning-Based Scaling Management for Kubernetes Edge Clusters. IEEE Transactions on Network and Service Management 18, 1 (March 2021), 958--972.
[35]
Elif Uysal, Onur Kaya, Anthony Ephremides, James Gross, Marian Codreanu, Petar Popovski, Mohamad Assaad, Gianluigi Liva, Andrea Munari, Touraj Soleymani, Beatriz Soret, and Karl Henrik Johansson. 2021. Semantic Communications in Networked Systems: A Data Significance Perspective. arXiv (March 2021).
[36]
Matthew J. Vowels, Necati Cihan Camgoz, and Richard Bowden. 2021. D'ya like DAGs? A Survey on Structure Learning and Causal Discovery. (March 2021). https://arxiv.org/abs/2103.02582v2
[37]
Jing Wu, Lin Wang, Qirui Jin, and Fangming Liu. 2024. GRAFT: Efficient inference serving for hybrid deep learning with SLO guarantees via DNN re-alignment. IEEE Transactions on Parallel and Distributed Systems 35, 2 (2024), 280--296.
[38]
Xiong Xiong, Kan Zheng, Lei Lei, and Lu Hou. 2020. Resource Allocation Based on Deep Reinforcement Learning in IoT Edge Computing. IEEE Journal on Selected Areas in Communications 38, 6 (June 2020), 1133--1146.
[39]
Fan Zhang, Xuxin Tang, Xiu Li, Samee U Khan, and Zhijiang Li. 2019. Quantifying cloud elasticity with container-based autoscaling. Future Generation Computer Systems 98 (2019), 672--681.
[40]
Zhizhou Zhang, Murali Krishna Ramanathan, Prithvi Raj, Abhishek Parwal, Timothy Sherwood, and Milind Chabbi. 2022. {CRISP}: Critical Path Analysis of {Large-Scale} Microservice Architectures. 655--672. https://www.usenix.org/conference/atc22/presentation/zhang-zhizhou

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ApPLIED'24: Proceedings of the 2024 Workshop on Advanced Tools, Programming Languages, and PLatforms for Implementing and Evaluating algorithms for Distributed systems
June 2024
95 pages
ISBN:9798400706707
DOI:10.1145/3663338
This work is licensed under a Creative Commons Attribution International 4.0 License.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 June 2024

Check for updates

Qualifiers

  • Research-article

Funding Sources

  • European Union

Conference

ApPLIED'24
Sponsor:

Acceptance Rates

Overall Acceptance Rate 3 of 4 submissions, 75%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 189
    Total Downloads
  • Downloads (Last 12 months)189
  • Downloads (Last 6 weeks)36
Reflects downloads up to 12 Jan 2025

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media