Szczegóły publikacji
Opis bibliograficzny
Evaluation of machine learning techniques for predicting run times of scientific workflow jobs / Bartosz BALIŚ, Michał Grabowski // W: Parallel Processing and Applied Mathematics : 14th international Conference, PPAM 2022 : Gdansk, Poland, September 11–14, 2022 : revised selected papers, Pt. 1 / eds. Roman Wyrzykowski [et al.]. — Cham : Springer Nature Switzerland, cop. 2023. — (Lecture Notes in Computer Science ; ISSN 0302-9743 ; LNCS 13826). — ISBN: 978-3-031-30441-5; e-ISBN: 978-3-031-30442-2. — S. 197–208. — Bibliogr., Abstr. — Publikacja dostępna online od: 2023-04-28
Autorzy (2)
Słowa kluczowe
Dane bibliometryczne
ID BaDAP | 147817 |
---|---|
Data dodania do BaDAP | 2023-09-06 |
DOI | 10.1007/978-3-031-30442-2_15 |
Rok publikacji | 2023 |
Typ publikacji | materiały konferencyjne (aut.) |
Otwarty dostęp | |
Wydawca | Springer |
Konferencja | Parallel Processing and Applied Mathematics : 14th International Conference |
Czasopismo/seria | Lecture Notes in Computer Science |
Abstract
Predicting execution time of computational jobs helps improve resource management, reduce execution cost, and optimize energy consumption. In this paper, we evaluate machine learning techniques for the purpose of predicting execution times of scientific workflow jobs. Various aspects of applying these techniques are evaluated in terms of their impact on prediction performance. These include (1) Comparison of performance of different regressors; (2) using a single-stage prediction pipeline vs. two-stage one; (3) impact of categorization granularity in the first stage of the two-stage pipeline; (4) training one global model for all jobs vs. using separate models for individual job types. We also propose a novel prediction model based on symbolic regression and evaluate its performance. Interpretability of prediction models and usage of proper performance metrics are also discussed. Experimental evaluation has led to a number of interesting findings that provide valuable insight on how to apply machine learning techniques to prediction of execution time of computational jobs.