Szczegóły publikacji

Opis bibliograficzny

Evaluation of machine learning techniques for predicting run times of scientific workflow jobs / Bartosz BALIŚ, Michał Grabowski // W: Parallel Processing and Applied Mathematics : 14th international Conference, PPAM 2022 : Gdansk, Poland, September 11–14, 2022 : revised selected papers, Pt. 1 / eds. Roman Wyrzykowski [et al.]. — Cham : Springer Nature Switzerland, cop. 2023. — (Lecture Notes in Computer Science ; ISSN 0302-9743 ; LNCS 13826). — ISBN: 978-3-031-30441-5; e-ISBN: 978-3-031-30442-2. — S. 197–208. — Bibliogr., Abstr. — Publikacja dostępna online od: 2023-04-28


Autorzy (2)


Słowa kluczowe

scientific workflowsperformance predictionmachine learningsymbolic regression

Dane bibliometryczne

ID BaDAP147817
Data dodania do BaDAP2023-09-06
DOI10.1007/978-3-031-30442-2_15
Rok publikacji2023
Typ publikacjimateriały konferencyjne (aut.)
Otwarty dostęptak
WydawcaSpringer
KonferencjaParallel Processing and Applied Mathematics : 14th International Conference
Czasopismo/seriaLecture Notes in Computer Science

Abstract

Predicting execution time of computational jobs helps improve resource management, reduce execution cost, and optimize energy consumption. In this paper, we evaluate machine learning techniques for the purpose of predicting execution times of scientific workflow jobs. Various aspects of applying these techniques are evaluated in terms of their impact on prediction performance. These include (1) Comparison of performance of different regressors; (2) using a single-stage prediction pipeline vs. two-stage one; (3) impact of categorization granularity in the first stage of the two-stage pipeline; (4) training one global model for all jobs vs. using separate models for individual job types. We also propose a novel prediction model based on symbolic regression and evaluate its performance. Interpretability of prediction models and usage of proper performance metrics are also discussed. Experimental evaluation has led to a number of interesting findings that provide valuable insight on how to apply machine learning techniques to prediction of execution time of computational jobs.

Publikacje, które mogą Cię zainteresować

fragment książki
Cloud infrastructure automation for scientific workflows / Bartosz BALIŚ, Michał ORZECHOWSKI, Krystian Pawlik, Maciej PAWLIK, Maciej MALAWSKI // W: Parallel Processing and Applied Mathematics : 13th international conference, PPAM 2019 : Białystok, Poland, September 8–11, 2019 : revised selected papers, Pt. 1 / eds. Roman Wyrzykowski [et al.]. — Cham : Springer Nature Switzerland, cop. 2020. — (Lecture Notes in Computer Science ; ISSN 0302-9743 ; LNCS 12043. Theoretical Computer Science and General Issues ; ISSN 0302-9743). — ISBN: 978-3-030-43228-7; e-ISBN:  978-3-030-43229-4. — S. 287–297. — Bibliogr. s. 296–297, Abstr. — Publikacja dostępna online od: 2020-03-19
fragment książki
Cost optimization of execution of multi-level deadline-constrained scientific workflows on clouds / Maciej MALAWSKI, Kamil FIGIELA, Marian BUBAK, Ewa Deelman, Jarek Nabrzyski // W: Parallel Processing and Applied Mathematics : 10th international conference, PPAM 2013 : Warsaw, Poland, September 8–11, 2013 : revised selected papers, Pt. 1 / eds. Roman Wyrzykowski, [et al.]. — Berlin ; Heidelberg : Springer-Verlag, cop. 2014. — (Lecture Notes in Computer Science ; ISSN 0302-9743 ; 8384). — ISBN: 978-3-642-55223-6; e-ISBN: 978-3-642-55224-3. — S. 251–260. — Bibliogr. s. 259–260, Abstr.