Globe230k: A Benchmark Dense-Pixel Annotation Dataset for Global Land Cover Mapping


Bibliographic Details
Main Authors: Shi, Qian; He, Da; Liu, Zhengyu; Liu, Xiaoping; Xue, Jingqian
Format: Other/Unknown Material
Language: unknown
Published: Zenodo 2023
Subjects:
Online Access: https://doi.org/10.5281/zenodo.8429200
Description
Summary: We (Intelligent Mining and Analysis of Remote Sensing big data, IMARS) created Globe230k, a large-scale annotated dataset for land use/land cover (LULC) mapping, annotated on Google Earth imagery at 1 m spatial resolution. Globe230k was annotated by numerous experts and by students majoring in surveying and mapping who received the necessary training, through visual interpretation of very high-resolution images as well as in-situ field surveys, following an organized annotation pipeline. Globe230k has three advantages: 1) Large scale: Globe230k includes 232,819 annotated images of size 512x512 at 1 m spatial resolution, with more than 3×10^10 annotated pixels, and it covers 10 first-level categories. 2) Rich diversity: the annotated images are sampled from regions worldwide, covering an area of over 60,000 km², which indicates high variability and diversity. In addition, to ensure category balance, rare categories such as wetland and ice/snow were intentionally given a higher chance of being sampled. 3) Multi-modal: Globe230k contains not only RGB bands but also other features important for Earth system research, such as the normalized difference vegetation index (NDVI), a digital elevation model (DEM), and vertical-vertical (VV) and vertical-horizontal (VH) polarization bands, which can facilitate research on multi-modal data fusion. (This part will be updated soon.) The image patches and their corresponding annotated patches are stored in "patch_image.rar" and "patch_label.rar", respectively. The RGB images are in ".jpg" format, with a size of 512x512 and pixel values ranging from 0 to 255.
The annotated patches are in ".png" format, also with a size of 512x512; pixel values range from 1 to 10, representing 1#cropland, 2#forest, 3#grass, 4#shrubland, 5#wetland, 6#water, 7#tundra, 8#impervious, 9#bareland, 10#ice/snow. The total of 232,819 pairs is officially divided into a training set, a validation set, and a test set at a ratio of 7:1:2, which can be ...
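For readers working with the label patches, the class encoding and the size figures in the summary can be illustrated with a short Python sketch. It is not part of the dataset release: the names `CLASS_NAMES`, `class_histogram`, and `dataset_totals` are hypothetical, and the label patch is assumed to have already been decoded into rows of integer pixel values by an image library of the reader's choice (decoding is not shown).

```python
from collections import Counter

# Class IDs as listed in the description (label pixel values 1-10).
CLASS_NAMES = {
    1: "cropland",
    2: "forest",
    3: "grass",
    4: "shrubland",
    5: "wetland",
    6: "water",
    7: "tundra",
    8: "impervious",
    9: "bareland",
    10: "ice/snow",
}

NUM_PATCHES = 232_819  # patch count stated in the description
PATCH_SIZE = 512       # pixels per side; 1 pixel corresponds to 1 m


def class_histogram(label_rows):
    """Count pixels per class name in one label patch.

    `label_rows` is an assumed input format: a 2-D sequence (rows of ints)
    holding the decoded pixel values of a label PNG.
    """
    counts = Counter(value for row in label_rows for value in row)
    unknown = set(counts) - set(CLASS_NAMES)
    if unknown:
        raise ValueError(f"unexpected label values: {sorted(unknown)}")
    return {CLASS_NAMES[v]: n for v, n in sorted(counts.items())}


def dataset_totals():
    """Total pixel count and ground area implied by the stated figures."""
    total_pixels = NUM_PATCHES * PATCH_SIZE * PATCH_SIZE
    area_km2 = total_pixels / 1_000_000  # 1 pixel = 1 m^2; 1 km^2 = 1e6 m^2
    return total_pixels, area_km2
```

Under these assumptions, `dataset_totals()` gives 232,819 × 512 × 512 = 61,032,103,936 pixels, or about 61,000 km² at 1 m resolution, which is consistent with the "more than 3×10^10 annotated pixels" and "over 60,000 km²" figures quoted in the summary.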