Performance portable ice-sheet modeling with MALI
High resolution simulations of polar ice-sheets play a crucial role in the ongoing effort to develop more accurate and reliable Earth-system models for probabilistic sea-level projections. These simulations often require a massive amount of memory and computation from large supercomputing clusters t...
Main Authors: | , , , , , , , , |
---|---|
Format: | Text |
Language: | unknown |
Published: |
2022
|
Subjects: | |
Online Access: | http://arxiv.org/abs/2204.04321 |
id |
ftarxivpreprints:oai:arXiv.org:2204.04321 |
---|---|
record_format |
openpolar |
spelling |
ftarxivpreprints:oai:arXiv.org:2204.04321 2023-09-05T13:20:16+02:00 Performance portable ice-sheet modeling with MALI Watkins, Jerry Carlson, Max Shan, Kyle Tezaur, Irina Perego, Mauro Bertagna, Luca Kao, Carolyn Hoffman, Matthew J. Price, Stephen F. 2022-04-08 http://arxiv.org/abs/2204.04321 unknown http://arxiv.org/abs/2204.04321 Computer Science - Computational Engineering Finance and Science Computer Science - Performance Physics - Computational Physics text 2022 ftarxivpreprints 2023-08-16T17:01:39Z High resolution simulations of polar ice-sheets play a crucial role in the ongoing effort to develop more accurate and reliable Earth-system models for probabilistic sea-level projections. These simulations often require a massive amount of memory and computation from large supercomputing clusters to provide sufficient accuracy and resolution. The latest exascale machines poised to come online contain a diverse set of computing architectures. In an effort to avoid architecture specific programming and maintain productivity across platforms, the ice-sheet modeling code known as MALI uses high level abstractions to integrate Trilinos libraries and the Kokkos programming model for performance portable code across a variety of different architectures. In this paper, we analyze the performance portable features of MALI via a performance analysis on current CPU-based and GPU-based supercomputers. The analysis highlights performance portable improvements made in finite element assembly and multigrid preconditioning within MALI with speedups between 1.26-1.82x across CPU and GPU architectures but also identifies the need to further improve performance in software coupling and preconditioning on GPUs. We also perform a weak scalability study and show that simulations on GPU-based machines perform 1.24-1.92x faster when utilizing the GPUs. The best performance is found in finite element assembly which achieved a speedup of up to 8.65x and a weak scaling efficiency of 82.9% with GPUs. We additionally describe an automated performance testing framework developed for this code base using a changepoint detection method. The framework is used to make actionable decisions about performance within MALI. We provide several concrete examples of scenarios in which the framework has identified performance regressions, improvements, and algorithm differences over the course of two years of development. Text Ice Sheet ArXiv.org (Cornell University Library) |
institution |
Open Polar |
collection |
ArXiv.org (Cornell University Library) |
op_collection_id |
ftarxivpreprints |
language |
unknown |
topic |
Computer Science - Computational Engineering Finance and Science Computer Science - Performance Physics - Computational Physics |
spellingShingle |
Computer Science - Computational Engineering Finance and Science Computer Science - Performance Physics - Computational Physics Watkins, Jerry Carlson, Max Shan, Kyle Tezaur, Irina Perego, Mauro Bertagna, Luca Kao, Carolyn Hoffman, Matthew J. Price, Stephen F. Performance portable ice-sheet modeling with MALI |
topic_facet |
Computer Science - Computational Engineering Finance and Science Computer Science - Performance Physics - Computational Physics |
description |
High resolution simulations of polar ice-sheets play a crucial role in the ongoing effort to develop more accurate and reliable Earth-system models for probabilistic sea-level projections. These simulations often require a massive amount of memory and computation from large supercomputing clusters to provide sufficient accuracy and resolution. The latest exascale machines poised to come online contain a diverse set of computing architectures. In an effort to avoid architecture specific programming and maintain productivity across platforms, the ice-sheet modeling code known as MALI uses high level abstractions to integrate Trilinos libraries and the Kokkos programming model for performance portable code across a variety of different architectures. In this paper, we analyze the performance portable features of MALI via a performance analysis on current CPU-based and GPU-based supercomputers. The analysis highlights performance portable improvements made in finite element assembly and multigrid preconditioning within MALI with speedups between 1.26-1.82x across CPU and GPU architectures but also identifies the need to further improve performance in software coupling and preconditioning on GPUs. We also perform a weak scalability study and show that simulations on GPU-based machines perform 1.24-1.92x faster when utilizing the GPUs. The best performance is found in finite element assembly which achieved a speedup of up to 8.65x and a weak scaling efficiency of 82.9% with GPUs. We additionally describe an automated performance testing framework developed for this code base using a changepoint detection method. The framework is used to make actionable decisions about performance within MALI. We provide several concrete examples of scenarios in which the framework has identified performance regressions, improvements, and algorithm differences over the course of two years of development. |
format |
Text |
author |
Watkins, Jerry Carlson, Max Shan, Kyle Tezaur, Irina Perego, Mauro Bertagna, Luca Kao, Carolyn Hoffman, Matthew J. Price, Stephen F. |
author_facet |
Watkins, Jerry Carlson, Max Shan, Kyle Tezaur, Irina Perego, Mauro Bertagna, Luca Kao, Carolyn Hoffman, Matthew J. Price, Stephen F. |
author_sort |
Watkins, Jerry |
title |
Performance portable ice-sheet modeling with MALI |
title_short |
Performance portable ice-sheet modeling with MALI |
title_full |
Performance portable ice-sheet modeling with MALI |
title_fullStr |
Performance portable ice-sheet modeling with MALI |
title_full_unstemmed |
Performance portable ice-sheet modeling with MALI |
title_sort |
performance portable ice-sheet modeling with mali |
publishDate |
2022 |
url |
http://arxiv.org/abs/2204.04321 |
genre |
Ice Sheet |
genre_facet |
Ice Sheet |
op_relation |
http://arxiv.org/abs/2204.04321 |
_version_ |
1776200978777767936 |