Remaining gaps in open source software for Big Spatial Data

The volume and coverage of spatial data has increased dramatically in recent years, with Earth observation programmes producing dozens of GB of data on a daily basis. The term Big Spatial Data is now applied to data sets that impose real challenges to researchers and practitioners alike. The difficu...

Full description

Bibliographic Details
Main Author: de Sousa, Luís Moreira
Format: Other/Unknown Material
Language:unknown
Published: PeerJ 2018
Subjects:
Online Access:http://dx.doi.org/10.7287/peerj.preprints.27215v1
https://peerj.com/preprints/27215v1.pdf
https://peerj.com/preprints/27215v1.xml
https://peerj.com/preprints/27215v1.html
Description
Summary:The volume and coverage of spatial data has increased dramatically in recent years, with Earth observation programmes producing dozens of GB of data on a daily basis. The term Big Spatial Data is now applied to data sets that impose real challenges to researchers and practitioners alike. The difficulties are partly related to a lack of tools supporting appropriate Coordinate Reference Systems (CRS). As rule, these data are provided in highly irregular geodesic grids, defined along equal intervals of latitude and longitude. Compounding the problem, users of such data end up taking geodesic coordinates in these grids as a Cartesian system, implicitly applying Marinus of Tyre's projection. A first approach towards the compactness of global geo-spatial data is to work in a Cartesian system produced by an equal-area projection. There are a good number to choose from, but those commonly supported by GIS software invariably relate to the sinusoidal or pseudo-cylindrical families, that impose important distortions of shape and distance. The land masses of Antarctica, Alaska, Canada, Greenland and Russia are particularly distorted with such projections. A more effective approach is to store and work with data in modern cartographic projections, in particular those defined with the Platonic and Archimedean solids. In spite of various attempts at open source software supporting these projections, in practice they remain today largely out of reach to GIS practitioners. This communication reviews persisting difficulties in working with worldwide big spatial data, current strategies to address such difficulties, the compromises they impose and the remaining gaps in open source software.