22 research outputs found
Recommended from our members
Extracting Semantics of Individual Places from Movement Data by Analyzing Temporal Patterns of Visits
Data reflecting movements of people, such as GPS or GSM tracks, can be a source of information about mobility behaviors and activities of people. Such information is required for various kinds of spatial planning in the public and business sectors. Movement data by themselves are semantically poor. Meaningful information can be derived by means of interactive visual analysis performed by a human expert; however, this is only possible for data about a small number of people. We suggest an approach that allows scaling to large datasets reflecting movements of numerous people. It includes extracting stops, clustering them for identifying personal places of interest (POIs), and creating temporal signatures of the POIs characterizing the temporal distribution of the stops with respect to the daily and weekly time cycles and the time line. The analyst can give meanings to selected POIs based on their temporal signatures (i.e., classify them as home, work, etc.), and then POIs with similar signatures can be classified automatically. We demonstrate the possibilities for interactive visual semantic analysis by example of GSM, GPS, and Twitter data. GPS data allow inferring richer semantic information, but temporal signatures alone may be insufficient for interpreting short stops. Twitter data are similar to GSM data but additionally contain message texts, which can help in place interpretation. We plan to develop an intelligent system that learns how to classify personal places and trips while a human analyst visually analyzes and semantically annotates selected subsets of movement data
Towards a Protocol for the Collection of VGI Vector Data
A protocol for the collection of vector data in Volunteered Geographic Information (VGI) projects is proposed. VGI is a source of crowdsourced geographic data and information which is comparable, and in some cases better, than equivalent data from National Mapping Agencies (NMAs) and Commercial Surveying Companies (CSC). However, there are many differences in how NMAs and CSC collect, analyse, manage and distribute geographic information to that of VGI projects. NMAs and CSC make use of robust and standardised data collection protocols whilst VGI projects often provide guidelines rather than rigorous data collection specifications. The proposed protocol addresses formalising the collection and creation of vector data in VGI projects in three principal ways: by manual vectorisation; field survey; and reuse of existing data sources. This protocol is intended to be generic rather than being linked to any specific VGI project. We believe that this is the first protocol for VGI vector data collection that has been formally described in the literature. Consequently, this paper shall serve as a starting point for on-going development and refinement of the protocol
Mapping and the Citizen Sensor
Maps are a fundamental resource in a diverse array of applications ranging from everyday activities, such as route planning through the legal demarcation of space to scientific studies, such as those seeking to understand biodiversity and inform the design of nature reserves for species conservation. For a map to have value, it should provide an accurate and timely representation of the phenomenon depicted and this can be a challenge in a dynamic world. Fortunately, mapping activities have benefitted greatly from recent advances in geoinformation technologies. Satellite remote sensing, for example, now offers unparalleled data acquisition and authoritative mapping agencies have developed systems for the routine production of maps in accordance with strict standards. Until recently, much mapping activity was in the exclusive realm of authoritative agencies but technological development has also allowed the rise of the amateur mapping community. The proliferation of inexpensive and highly mobile and location aware devices together with Web 2.0 technology have fostered the emergence of the citizen as a source of data. Mapping presently benefits from vast amounts of spatial data as well as people able to provide observations of geographic phenomena, which can inform map production, revision and evaluation. The great potential of these developments is, however, often limited by concerns. The latter span issues from the nature of the citizens through the way data are collected and shared to the quality and trustworthiness of the data. This book reports on some of the key issues connected with the use of citizen sensors in mapping. It arises from a European Co-operation in Science and Technology (COST) Action, which explored issues linked to topics ranging from citizen motivation, data acquisition, data quality and the use of citizen derived data in the production of maps that rival, and sometimes surpass, maps arising from authoritative agencies
Geographically weighted evidence combination approaches for combining discordant and inconsistent volunteered geographical information
There is much interest in being able to combine crowdsourced data. One of the critical issues in information sciences is how to combine data or information that are discordant or inconsistent in some way. Many previous approaches have taken a majority rules approach under the assumption that most people are correct most of the time. This paper analyses crowdsourced land cover data generated by the Geo-Wiki initiative in order to infer the land cover present at locations on a 50 km grid. It compares four evidence combination approaches (Dempster Shafer, Bayes, Fuzzy Sets and Possibility) applied under a geographically weighted kernel with the geographically weighted average approach applied in many current Geo-Wiki analyses. A geographically weighted approach uses a moving kernel under which local analyses are undertaken. The contribution (or salience) of each data point to the analysis is weighted by its distance to the kernel centre, reflecting Tobler’s 1st law of geography. A series of analyses were undertaken using different kernel sizes (or bandwidths). Each of the geographically weighted evidence combination methods generated spatially distributed measures of belief in hypotheses associated with the presence of individual land cover classes at each location on the grid. These were compared with GlobCover, a global land cover product. The results from the geographically weighted average approach in general had higher correspondence with the reference data and this increased with bandwidth. However, for some classes other evidence combination approaches had higher correspondences possibly because of greater ambiguity over class conceptualisations and / or lower densities of crowdsourced data. The outputs also allowed the beliefs in each class to be mapped. The differences in the soft and the crisp maps are clearly associated with the logics of each evidence combination approach and of course the different questions that they ask of the data. The results show that discordant data can be combined (rather than being removed from analysis) and that data integrated in this way can be parameterised by different measures of belief uncertainty. The discussion highlights a number of critical areas for future research
A CLASSIFICATION MODEL FOR THE INFERENCE OF SPATIAL PRECISION OF OPENSTREETMAP BUILDINGS WITH INTRINSIC INDICATORS
To evaluate the quality of OSM data, similarities between OSM features and their homologous features represented in a reference database are relevant metrics. However, reference databases do not exist everywhere or are not freely available. Thus, having data quality assessment methods that rely only on intrinsic indicators (i.e. based on data itself without considering external information) would be useful in these cases. This article specifically uses the radial distance as a target quality metric to measure the quality of shapes. Its aim is to build a random-forest based classification method that reconstructs whether this distance is higher or lower than a specified threshold, using only intrinsic indicators as inputs. The classification algorithm is evaluated on a first dataset by computing the ROC (Receiver Operating Characteristic) curve and using the AUC (Area Under Curve) as an evaluation metric. The transferability of the resulting algorithm is then evaluated by measuring its performance on a second, distinct dataset. The experiments show that the algorithm performs reasonably well on both the initial and the second dataset, and that intrinsic indicators give relevant information to infer comparison-based shape quality (i.e. the radial distance)
Use of automated change detection and VGI sources for identifying and validating urban land use change
© 2020, by the authors. Land use and land cover (LULC) mapping is often undertaken by national mapping agencies, where these LULC products are used for different types of monitoring and reporting applications. Updating of LULC databases is often done on a multi-year cycle due to the high costs involved, so changes are only detected when mapping exercises are repeated. Consequently, the information on LULC can quickly become outdated and hence may be incorrect in some areas. In the current era of big data and Earth observation, change detection algorithms can be used to identify changes in urban areas, which can then be used to automatically update LULC databases on a more continuous basis. However, the change detection algorithm must be validated before the changes can be committed to authoritative databases such as those produced by national mapping agencies. This paper outlines a change detection algorithm for identifying construction sites, which represent ongoing changes in LU, developed in the framework of the LandSense project. We then use volunteered geographic information (VGI) captured through the use of mapathons from a range of different groups of contributors to validate these changes. In total, 105 contributors were involved in the mapathons, producing a total of 2778 observations. The 105 contributors were grouped according to six different user-profiles and were analyzed to understand the impact of the experience of the users on the accuracy assessment. Overall, the results show that the change detection algorithm is able to identify changes in residential land use to an adequate level of accuracy (85%) but changes in infrastructure and industrial sites had lower accuracies (57% and 75 %, respectively), requiring further improvements. In terms of user profiles, the experts in LULC from local authorities, researchers in LULC at the French national mapping agency (IGN), and first-year students with a basic knowledge of geographic information systems had the highest overall accuracies (86.2%, 93.2%, and 85.2%, respectively). Differences in how the users approach the task also emerged, e.g., local authorities used knowledge and context to try to identify types of change while those with no knowledge of LULC (i.e., normal citizens) were quicker to choose 'Unknown' when the visual interpretation of a class was more difficult
Crowdsourcing, Citizen Science or Volunteered Geographic Information? The Current State of Crowdsourced Geographic Information
Citizens are increasingly becoming an important source of geographic information, sometimes entering domains that had until recently been the exclusive realm of authoritative agencies. This activity has a very diverse character as it can, amongst other things, be active or passive, involve spatial or aspatial data and the data provided can be variable in terms of key attributes such as format, description and quality. Unsurprisingly, therefore, there are a variety of terms used to describe data arising from citizens. In this article, the expressions used to describe citizen sensing of geographic information are reviewed and their use over time explored, prior to categorizing them and highlighting key issues in the current state of the subject. The latter involved a review of ~100 Internet sites with particular focus on their thematic topic, the nature of the data and issues such as incentives for contributors. This review suggests that most sites involve active rather than passive contribution, with citizens typically motivated by the desire to aid a worthy cause, often receiving little training. As such, this article provides a snapshot of the role of citizens in crowdsourcing geographic information and a guide to the current status of this rapidly emerging and evolving subject
LandSense: Coupling citizen science and earth observation data to promote environmental monitoring
The Horizon 2020 project, LandSense, is a modern citizen observatory for Land Use & Land Cover (LULC) monitoring, that connects citizens with Earth Observation (EO) data to transform current approaches to environmental decision making. Citizen Observatories are community-driven mechanisms to complement existing environmental monitoring systems and can be fostered through EO-based mobile and web applications, allowing citizens to not only play a key role in LULC monitoring, but also to be directly involved in the co-creation of such solutions. Within LandSense, citizens can participate in ongoing demonstration pilots using their own devices (e.g. mobile phones and tablets), through interactive reporting, gaming applications and mapathons. Campaigns in Vienna, Toulouse, Amsterdam, Serbia, Spain and Indonesia address topics such as urban greenspaces, agricultural management and biodiversity/habitat threat monitoring. For example, in the case of Toulouse and Indonesia, hotspots of change in LULC are identified through Sentinel 2 time series analysis. These hotspots are then validated by citizens and interested stakeholders either directly on-site via customized mobile applications, providing geotagged photos, or remotely via the online LandSense Engagement platform. The presentation will not only showcase the tools and results from these campaigns, but also highlight how citizen-driven observations can contribute to sustainable development. Such initiatives present clear opportunities to integrate citizen-driven observations with established authoritative data sources to further extend GEOSS and Copernicus capacities, and support comprehensive environmental monitoring systems. In addition, these applications have considerable potential in lowering expenditure costs on in-situ data collection and current calibration/validation approaches within the processing chain of environmental monitoring activities both within and beyond Europe
An Experimental Framework for Integrating Citizen and Community Science into Land Cover, Land Use, and Land Change Detection Processes in a National Mapping Agency
Accurate and up-to-date information on land use and land cover (LULC) is needed to develop policies on reducing soil sealing through increased urbanization as well as to meet climate targets. More detailed information about building function is also required but is currently lacking. To improve these datasets, the national mapping agency of France, Institut de l’Information Géographique et Foréstière (IGN France), has developed a strategy for updating their LULC database on a update cycle every three years and building information on a continuous cycle using web, mobile, and wiki applications. Developed as part of the LandSense project and eventually tapping into the LandSense federated authentication system, this paper outlines the data collection campaigns, the key concepts that have driven the system architecture, and a description of the technologies developed for this solution. The campaigns have only just begun, so there are only preliminary results to date. Thus far, feedback on the web and mobile applications has been positive, but still requires a further demonstration of feasibility
Assessing the transferability of a multi-source land use classification workflow across two heterogeneous urban and rural areas
Mapping Land Use (LU) is crucial for monitoring and managing the dynamic evolution of the human activities of a given area and their consequential environmental impacts. In this study, a multimodal machine learning framework, using the XGBoost classifier applied to attributes constructed from heterogeneous spatial data sources, is defined and used to automatically classify LU in the two French departments of Gers and Rhône. It reaches a mean F1 score of 83 and 86% respectively. This research work also assesses the robustness and transferability of the machine learning model between these two diverse study areas and highlights the challenges encountered, arising mainly from the differences of distribution of the attributes and classes between the study areas. Adding a few samples from the test study area allows the model to learn some specificities of the test study area, and thus improves the results. Moreover, the study evaluates the individual contributions of each data source to the accuracy of predictions of the LU classes, providing insights concerning the relevance of each data source in enhancing the overall precision of the Land Use classification. The findings contribute to a validated LU classification workflow, identify valuable data sources, and enhance understanding of model transferability challenges