A conceptual framework for managing very diverse data for complex, interdisciplinary science

Much attention has been given to the challenges of handling massive data volumes in modern data-intensive science. This paper examines an equally daunting challenge – the diversity of interdisciplinary data, notably research data, and the need to interrelate these data to understand complex systemic...

Full description

Bibliographic Details
Published in:Journal of Information Science
Main Authors: Parsons, Mark A., Godøy, Øystein, LeDrew, Ellsworth, de Bruin, Taco F., Danis, Bruno, Tomlinson, Scott, Carlson, David
Format: Article in Journal/Newspaper
Language:English
Published: SAGE Publications 2011
Subjects:
IPY
Online Access:http://dx.doi.org/10.1177/0165551511412705
http://journals.sagepub.com/doi/pdf/10.1177/0165551511412705
Description
Summary:Much attention has been given to the challenges of handling massive data volumes in modern data-intensive science. This paper examines an equally daunting challenge – the diversity of interdisciplinary data, notably research data, and the need to interrelate these data to understand complex systemic problems such as environmental change and its impact. We use the experience of the International Polar Year 2007–8 (IPY) as a case study to examine data management approaches seeking to address issues around complex interdisciplinary science. We find that, while technology is a critical factor in addressing the interdisciplinary dimension of the data intensive science, the technologies developing for exa-scale data volumes differ from those that are needed for extremely distributed and heterogeneous data. Research data will continue to be highly heterogeneous and distributed and will require technologies to be much simpler and more flexible. More importantly, there is a need for both technical and cultural adaptation. We describe a vision of discoverable, open, linked, useful, and safe collections of data, organized and curated using the best principles and practices of information and library science. This vision provides a framework for our discussion and leads us to suggest several short- and long-term strategies to facilitate a socio-technical evolution in the overall science data ecosystem.