Publication details
Bibliographic description
Reproducibility of computational experiments on Kubernetes-managed container clouds with HyperFlow / Michał ORZECHOWSKI, Bartosz BALIŚ, Renata G. SŁOTA, Jacek KITOWSKI // In: Computational Science - ICCS 2020 : 20th International Conference : Amsterdam, The Netherlands, June 3–5, 2020 : proceedings, Pt. 1 / eds. Valeria V. Krzhizhanovskaya, [et al.]. — Cham : Springer Nature Switzerland, cop. 2020. — (Lecture Notes in Computer Science ; ISSN 0302-9743 ; LNCS 12137. Theoretical Computer Science and General Issues ; ISSN 0302-9743). — ISBN: 978-3-030-50370-3; e-ISBN: 978-3-030-50371-0. — pp. 220–233. — Bibliography pp. 232–233, abstract. — Available online since: 2020-06-15. — J. Kitowski, additional affiliation: ACK Cyfronet AGH
Authors (4)
Keywords
Bibliometric data
ID BaDAP | 129156 |
---|---|
Date added to BaDAP | 2020-06-25 |
Source text | URL |
DOI | 10.1007/978-3-030-50371-0_16 |
Publication year | 2020 |
Publication type | conference proceedings (author) |
Open access | |
Publisher | Springer |
Conference | 20th International Conference on Computational Science |
Journals/series | Theoretical Computer Science and General Issues, Lecture Notes in Computer Science |
Abstract
We propose a comprehensive solution for reproducibility of scientific workflows. We focus particularly on Kubernetes-managed container clouds, which are increasingly important in scientific computing. Our solution addresses conservation of the scientific procedure, scientific data, execution environment and experiment deployment, while using standard tools in order to avoid maintainability issues that can obstruct reproducibility. We introduce an Experiment Digital Object (EDO), a record published in an open science repository that contains the artifacts required to reproduce an experiment. We demonstrate a variety of reproducibility scenarios, including experiment repetition (same experiment and conditions) and replication (same experiment, different conditions), and propose a smart reuse scenario in which a previous experiment is partially replayed and partially re-executed. The approach is implemented in the HyperFlow workflow management system and experimentally evaluated using a genomic scientific workflow. The experiment is published as an EDO record on the Zenodo platform.