5 research outputs found

    A crowdsourced global data set for validating built-up surface layers

    Get PDF
    Several global high-resolution built-up surface products have emerged over the last five years, taking full advantage of open sources of satellite data such as Landsat and Sentinel. However, these data sets require validation that is independent of the producers of these products. To fill this gap, we designed a validation sample set of 50 K locations using a stratified sampling approach independent of any existing global built-up surface products. We launched a crowdsourcing campaign using Geo-Wiki (https://www.geo-wiki.org/) to visually interpret this sample set for built-up surfaces using very high-resolution satellite images as a source of reference data for labelling the samples, with a minimum of five validations per sample location. Data were collected for 10 m sub-pixels in an 80 × 80 m grid to allow for geo-registration errors as well as the application of different validation modes including exact pixel matching to majority or percentage agreement. The data set presented in this paper is suitable for the validation and inter-comparison of multiple products of built-up areas

    Methodology for generating a global forest management layer

    Get PDF
    The first ever global map of forest management was generated based on remote sensing data. To collect training data, we launched a series of Geo-Wiki (https://www.geo-wiki.org/) campaigns involving forest experts from different world regions, to explore which information related to forest management could be collected by visual interpretation of very high-resolution images from Google Maps and Microsoft Bing, Sentinel time series and normalized difference vegetation index (NDVI) profiles derived from Google Earth Engine. A machine learning technique was then used with the visually interpreted sample (280K locations) as a training dataset to classify PROBA-V satellite imagery. Finally, we obtained a global wall-to-wall map of forest management at a 100m resolution for the year 2015. The map includes classes such as intact forests; forests with signs of management, including logging; planted forests; woody plantations with a rotation period up to 15 years; oil palm plantations; and agroforestry. The map can be used to deliver further information about forest ecosystems, protected and observed forest status changes, biodiversity assessments, and other ecosystem-related aspects

    A Crowdsourced Global Data Set for Validating Built-up Surface Layers V.2

    No full text
    This collection contains data that were collected during a crowdsourcing campaign using Geo-Wiki (https://www.geo-wiki.org/). The campaign involved visual interpretation of a sample that is designed for validating any existing global built-up surface product. A zipped shapefile (ValidationGrids.zip) contains the random stratified sample of 50K locations, which consist of 80x80m grids further sub-divided into 10m cells so there are 64 cells per grid. These locations were provided to the crowd, who used very high-resolution satellite images to label the grids as built-up (i.e., containing a building), non-built-up or unsure. The file (Geo-WikiBuilt-upCentroidsAll.csv) contains the data collected in the campaign summarized by the centroid (or central point of each 80m grid location). It also contains fields for quality control, one that indicates if the change information matches the control points (see below) or the majority answer from the crowd, and another that indicates whether the presence/absence of built-up matches the control points (see below) or the majority answer from the crowd. The data collected for all 64 cells per grid can be found in Geo-WikiBuilt-upCellsAll.csv. The Geo-Wiki campaign uses visually interpreted grid locations called control points as part of the scoring mechanism of Geo-Wiki for quality control. These control points are provided by centroid (Geo-WikiBuilt-upCentroidsControls.csv) and for all cells in the 80m grid (Geo-WikiBuilt-upCellsControls.csv). In addition to the raw data, two additional quality-controlled files have been produced. The first file (Geo-WikiBuilt-upCentroidsChangeQualityControlled.csv) provides a single record for each location on change in built-up (if built-up is present) that lists either the control point answer or the majority answer from the crowd. The second file (Geo-WikiBuilt-upCellsQualityControlled.csv) contains a single record for each of the 64 cells in each grid, listing either the control point answer or the majority answer from the crowd. Finally, the file Strata.csv contains the mapping between the grid location and the sampling stratum used in the design of the sample

    A Crowdsourced Global Data Set for Validating Built-up Surface Layers

    Get PDF
    This collection contains data that were collected during a crowdsourcing campaign using Geo-Wiki (https://www.geo-wiki.org/). The campaign involved visual interpretation of a sample that is designed for validating any existing global built-up surface product. A zipped shapefile (ValidationGrids.zip) contains the random stratified sample of 50K locations, which consist of 80x80m grids further sub-divided into 10m cells so there are 64 cells per grid. These locations were provided to the crowd, who used very high-resolution satellite images to label the grids as built-up (i.e., containing a building), non-built-up or unsure. The file (Geo-WikiBuilt-upCentroidsAll.csv) contains the data collected in the campaign summarized by the centroid (or central point of each 80m grid location). The data collected for all 64 cells per grid can be found in Geo-WikiBuilt-upCellsAll.csv. The Geo-Wiki campaign uses visually interpreted grid locations called control points as part of the scoring mechanism of Geo-Wiki for quality control. These control points are provided by centroid (Geo-WikiBuilt-upCentroidsControls.csv) and for all cells in the 80m grid (Geo-WikiBuilt-upCellsControls.csv). Finally, the file Strata.csv contains the mapping between the grid location and the sampling stratum used in the design of the sample

    Global forest management data at a 100m resolution for the year 2015

    No full text
    We provide four data records: 1.The reference data set as a comma-separated file ("reference_data_set.csv") with the following attributes: “ID” is a unique location identifier “Latitude, Longitude” are centroid coordinates of a 100m x 100m pixel. “Land_use_ID “is a land use class: 11 - Naturally regenerating forest without any signs of human activities, e.g., primary forests. 20 - Naturally regenerating forest with signs of human activities, e.g., logging, clear cuts etc. 31 - Planted forest. 32 - Short rotation plantations for timber. 40 - Oil palm plantations. 53 - Agroforestry. “Flag” identifies a data origin: 1- the crowdsourced locations, 2- the control data set, 0 – the additional experts' classifications following the opportunistic approach. 2. The 100 m forest management map in a geoTiff format with the classes presented - "FML_v3.2.tif ". 3. The predicted class probability from the Random Forest classification in a geoTiff format - "ProbaV_LC100_epoch2015_global_v2.0.3_forest-management--layer-proba_EPSG-4326.tif" 4. Validation data set as a comma-separated file ("validation_data_set.csv) with the following attributes: “ID” is a unique location identifier “pixel_center_x” , “pixel_center_y ” are centroid coordinates of a 100m x 100m pixel in lat/lon projection “first_landuse_class “is a land use class, as in (1). “second_landuse_class “is a second possible land use class, as in (1), identified in case it was difficult to assign one class with high confidence
    corecore