Multi-Target Vectorization With MTPS C++ Generic Library
International audience. This article introduces MTPS, a C++ template library dedicated to vectorizing algorithms for different target architectures. Algorithms written with MTPS benefit from optimized memory access patterns and show performance close to hardware limits, both on multicore CPU and on...
Main Authors: | Kirschenmann, Wilfried; Plagne, Laurent; Vialle, Stéphane |
---|---|
Other Authors: | , , , , , , |
Format: | Conference Object |
Language: | English |
Published: | HAL CCSD 2010 |
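Subjects: | GPU; SSE; Vectorization; c++; Template Metaprogramming; Performances; [INFO.INFO-DC]Computer Science [cs]/Distributed Parallel and Cluster Computing [cs.DC] |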
Online Access: | https://centralesupelec.hal.science/hal-00491980 https://centralesupelec.hal.science/hal-00491980/document https://centralesupelec.hal.science/hal-00491980/file/Supelec621.pdf |
id |
ftunilorrainehal:oai:HAL:hal-00491980v1 |
---|---|
record_format |
openpolar |
spelling |
ftunilorrainehal:oai:HAL:hal-00491980v1 2024-09-15T18:13:21+00:00 Multi-Target Vectorization With MTPS C++ Generic Library Kirschenmann, Wilfried Plagne, Laurent Vialle, Stéphane EDF (EDF) Algorithms for the Grid (ALGORILLE) INRIA Lorraine Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA) Institut National de Recherche en Informatique et en Automatique (Inria)-Université Henri Poincaré - Nancy 1 (UHP)-Université Nancy 2-Institut National Polytechnique de Lorraine (INPL)-Centre National de la Recherche Scientifique (CNRS)-Université Henri Poincaré - Nancy 1 (UHP)-Université Nancy 2-Institut National Polytechnique de Lorraine (INPL)-Centre National de la Recherche Scientifique (CNRS) SUPELEC-Campus Metz Ecole Supérieure d'Electricité - SUPELEC (FRANCE) Reykjavik, Iceland 2010-06-06 https://centralesupelec.hal.science/hal-00491980 https://centralesupelec.hal.science/hal-00491980/document https://centralesupelec.hal.science/hal-00491980/file/Supelec621.pdf en eng HAL CCSD hal-00491980 https://centralesupelec.hal.science/hal-00491980 https://centralesupelec.hal.science/hal-00491980/document https://centralesupelec.hal.science/hal-00491980/file/Supelec621.pdf info:eu-repo/semantics/OpenAccess Proceedings of PARA 2010 conference: State of the Art in Scientific and Parallel Computing PARA 2010 : State of the Art in Scientific and Parallel Computing https://centralesupelec.hal.science/hal-00491980 PARA 2010 : State of the Art in Scientific and Parallel Computing, Jun 2010, Reykjavik, Iceland. 4 p GPU SSE Vectorization c++ Template Metaprogramming Performances [INFO.INFO-DC]Computer Science [cs]/Distributed Parallel and Cluster Computing [cs.DC] info:eu-repo/semantics/conferenceObject Conference papers 2010 ftunilorrainehal 2024-08-22T07:42:44Z International audience This article introduces MTPS, a C++ template library dedicated to vectorizing algorithms for different target architectures. Algorithms written with MTPS benefit from optimized memory access patterns and show performance close to hardware limits, both on multicore CPU and on GPU. Conference Object Iceland Université de Lorraine: HAL |
institution |
Open Polar |
collection |
Université de Lorraine: HAL |
op_collection_id |
ftunilorrainehal |
language |
English |
topic |
GPU SSE Vectorization c++ Template Metaprogramming Performances [INFO.INFO-DC]Computer Science [cs]/Distributed Parallel and Cluster Computing [cs.DC] |
spellingShingle |
GPU SSE Vectorization c++ Template Metaprogramming Performances [INFO.INFO-DC]Computer Science [cs]/Distributed Parallel and Cluster Computing [cs.DC] Kirschenmann, Wilfried Plagne, Laurent Vialle, Stéphane Multi-Target Vectorization With MTPS C++ Generic Library |
description |
International audience. This article introduces MTPS, a C++ template library dedicated to vectorizing algorithms for different target architectures. Algorithms written with MTPS benefit from optimized memory access patterns and show performance close to hardware limits, both on multicore CPU and on GPU. |
author2 |
EDF (EDF) Algorithms for the Grid (ALGORILLE) INRIA Lorraine Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA) Institut National de Recherche en Informatique et en Automatique (Inria)-Université Henri Poincaré - Nancy 1 (UHP)-Université Nancy 2-Institut National Polytechnique de Lorraine (INPL)-Centre National de la Recherche Scientifique (CNRS)-Université Henri Poincaré - Nancy 1 (UHP)-Université Nancy 2-Institut National Polytechnique de Lorraine (INPL)-Centre National de la Recherche Scientifique (CNRS) SUPELEC-Campus Metz Ecole Supérieure d'Electricité - SUPELEC (FRANCE) |
format |
Conference Object |
author |
Kirschenmann, Wilfried Plagne, Laurent Vialle, Stéphane |
author_sort |
Kirschenmann, Wilfried |
title |
Multi-Target Vectorization With MTPS C++ Generic Library |
title_sort |
multi-target vectorization with mtps c++ generic library |
publisher |
HAL CCSD |
publishDate |
2010 |
url |
https://centralesupelec.hal.science/hal-00491980 https://centralesupelec.hal.science/hal-00491980/document https://centralesupelec.hal.science/hal-00491980/file/Supelec621.pdf |
op_coverage |
Reykjavik, Iceland |
genre |
Iceland |
op_source |
Proceedings of PARA 2010 conference: State of the Art in Scientific and Parallel Computing PARA 2010 : State of the Art in Scientific and Parallel Computing https://centralesupelec.hal.science/hal-00491980 PARA 2010 : State of the Art in Scientific and Parallel Computing, Jun 2010, Reykjavik, Iceland. 4 p |
op_relation |
hal-00491980 https://centralesupelec.hal.science/hal-00491980 https://centralesupelec.hal.science/hal-00491980/document https://centralesupelec.hal.science/hal-00491980/file/Supelec621.pdf |
op_rights |
info:eu-repo/semantics/OpenAccess |
_version_ |
1810451113964470272 |