skip to main content
10.1145/2534248.2534256acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
research-article
Open access

A framework for dynamically generating predictive models of workflow execution

Published: 17 November 2013 Publication History

Abstract

The ability to accurately predict the performance of software components executing within a Cloud environment is an area of intense interest to many researchers. The availability of an accurate prediction of the time taken for a piece of code to execute would be beneficial for both planning and cost optimisation purposes. To that end, this paper proposes a performance data capture and modelling architecture that can be used to generate models of code execution time that are dynamically updated as additional performance data is collected. To demonstrate the utility of this approach, the workflow engine within the e-Science Central Cloud platform has been instrumented to capture execution data with a view to generating predictive models of workflow performance. Models have been generated for both simple and more complex workflow components operating on local hardware and within a virtualised Cloud environment and the ability to generate accurate performance predictions given a number of caveats is demonstrated.

References

[1]
J. Cala, H. Hiden, S. Woodman, and P. Watson. Cloud computing for fast prediction of chemical activity. Future Generation Computer Systems, 29(7): 1860--1869, 2013.
[2]
R. Cushing, S. Koulouzis, A. S. Z. Belloum, and M. Bubak. Prediction-based auto-scaling of scientific workflows. In Proceedings of the 9th International Workshop on Middleware for Grids, Clouds and e-Science, MGC '11, pages 1:1--1:6, New York, NY, USA, 2011. ACM.
[3]
M. Dobber, R. van der Mei, and G. Koole. Effective prediction of job processing times in a large-scale grid environment. In High Performance Distributed Computing, 2006 15th IEEE International Symposium on, pages 359--360, 2006.
[4]
R. Duan, F. Nadeem, J. Wang, Y. Zhang, R. Prodan, and T. Fahringer. A hybrid intelligent method for performance modeling and prediction of workflow activities in grids. In Cluster Computing and the Grid, 2009. CCGRID '09. 9th IEEE/ACM International Symposium on, pages 339--347, 2009.
[5]
H. Hiden, S. Woodman, P. Watson, and J. Cala. Developing cloud applications using the e-science central platform. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 371(1983), 2013.
[6]
X. Liu, Z. Ni, D. Yuan, Y. Jiang, Z. Wu, J. Chen, and Y. Yang. A novel statistical time-series pattern based interval forecasting strategy for activity durations in workflow systems. Journal of Systems and Software, 84(3): 354--376, 2011.
[7]
T. Miu and P. Missier. Predicting the Execution Time of Workflow Activities Based on Their Input Features. In I. Taylor and J. Montagnat, editors, Procs. WORKS 2012, Salt Lake City, US, 2012. ACM.
[8]
F. Nadeem and T. Fahringer. Predicting the execution time of grid workflow applications through local learning. In Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, SC '09, pages 33:1--33:12, New York, NY, USA, 2009. ACM.
[9]
F. Nadeem and T. Fahringer. Using templates to predict execution time of scientific workflow applications in the grid. In Cluster Computing and the Grid, 2009. CCGRID '09. 9th IEEE/ACM International Symposium on, pages 316--323, 2009.
[10]
N. Roy, A. Dubey, and A. Gokhale. Efficient autoscaling in the cloud using predictive models for workload forecasting. In Cloud Computing (CLOUD), 2011 IEEE International Conference on, pages 500--507, 2011.
[11]
S. Sadjadi, S. Shimizu, J. Figueroa, R. Rangaswami, J. Delgado, H. Duran, and X. Collazo-Mojica. A modeling approach for estimating execution time of long-running scientific applications. In Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on, pages 1--8, 2008.
[12]
V. Taylor, X. Wu, J. Geisler, and R. Stevens. Using kernel couplings to predict parallel application performance. In High Performance Distributed Computing, 2002. HPDC-11 2002. Proceedings. 11th IEEE International Symposium on, pages 125--134, 2002.
[13]
V. Taylor, X. Wu, and R. Stevens. Prophesy: an infrastructure for performance analysis and modeling of parallel and grid applications. SIGMETRICS Perform. Eval. Rev., 30(4): 13--18, Mar. 2003.
[14]
V. T. van Hees, L. Gorzelniak, E. C. Dean Leon, M. Eder, M. Pias, S. Taherian, U. Ekelund, F. Renstrom, P. W. Franks, A. Horsch, and S. Brage. Separating movement and gravity components in an acceleration signal and implications for the assessment of human daily physical activity. PLoS ONE, 8(4): e61691, 04 2013.
[15]
P. Watson, H. G. Hiden, S. J. Woodman, D. E. Leahy, J. Cala, and P. Missier. The panel of experts cloud pattern. In Proceedings of the third international workshop on Cloud data management, CloudDB '11, pages 23--24, New York, NY, USA, 2011. ACM.
[16]
S. Woodman, H. Hiden, P. Watson, and P. Missier. Achieving reproducibility by combining provenance with service and workflow versioning. In Proceedings of the 6th workshop on Workflows in support of large-scale science, WORKS '11, pages 127--136, New York, NY, USA, 2011. ACM.
[17]
Q. Wu and V. V. Datla. On performance modeling and prediction in support of scientific workflow optimization. In Proceedings of the 2011 IEEE World Congress on Services, SERVICES '11, pages 161--168, Washington, DC, USA, 2011. IEEE Computer Society.
[18]
S. Zhang, A. Rowlands, P. Murray, and T. Hurst. Physical activity classification using the genea wrist-worn accelerometer. Medicine and Science in Sports and Exercise, 44(4): 742--748, April 2012.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
WORKS '13: Proceedings of the 8th Workshop on Workflows in Support of Large-Scale Science
November 2013
133 pages
ISBN:9781450325028
DOI:10.1145/2534248
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 November 2013

Check for updates

Author Tags

  1. cloud computing
  2. performance analysis
  3. predictive modelling

Qualifiers

  • Research-article

Funding Sources

  • SiDE project, the RCUK Digital Economy Research Hub on Social Inclusion through the Digital Economy
  • Newcastle University

Conference

SC13

Acceptance Rates

WORKS '13 Paper Acceptance Rate 13 of 16 submissions, 81%;
Overall Acceptance Rate 30 of 54 submissions, 56%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)32
  • Downloads (Last 6 weeks)5
Reflects downloads up to 06 Nov 2024

Other Metrics

Citations

Cited By

View all

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media