What will the future of cloud-based astronomical data processing look like?

Abstract Astronomy is rapidly approaching an impasse: very large datasets require remote or cloud-based parallel processing, yet many astronomers still try to download the data and develop serial code locally. Astronomers understand the need for change, but the hurdles remain high. We are developing...

Full description

Bibliographic Details
Published in:Proceedings of the International Astronomical Union
Main Authors: Green, Andrew W., Mannering, Elizabeth, Harischandra, Lloyd, Vuong, Minh, O’Toole, Simon, Sealey, Katrina, Hopkins, Andrew M.
Format: Article in Journal/Newspaper
Language:English
Published: Cambridge University Press (CUP) 2016
Subjects:
Online Access:http://dx.doi.org/10.1017/s1743921317001363
https://www.cambridge.org/core/services/aop-cambridge-core/content/view/S1743921317001363
Description
Summary:Abstract Astronomy is rapidly approaching an impasse: very large datasets require remote or cloud-based parallel processing, yet many astronomers still try to download the data and develop serial code locally. Astronomers understand the need for change, but the hurdles remain high. We are developing a data archive designed from the ground up to simplify and encourage cloud-based parallel processing. While the volume of data we host remains modest by some standards, it is still large enough that download and processing times are measured in days and even weeks. We plan to implement a python based, notebook-like interface that automatically parallelises execution. Our goal is to provide an interface sufficiently familiar and user-friendly that it encourages the astronomer to run their analysis on our system in the cloud—astroinformatics as a service. We describe how our system addresses the approaching impasse in astronomy using the SAMI Galaxy Survey as an example.